llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Sanjay Patel	77f74c5e5e	[AMDGPU] fix tests to be independent of FP undef llvm-svn: 327211	2018-03-10 16:39:59 +00:00
Sanjay Patel	ad67224a59	[PowerPC] fix tests to be independent of FP undef llvm-svn: 327210	2018-03-10 16:14:05 +00:00
Matt Arsenault	99343a3814	AMDGPU: Fix crash when constant folding with physreg operand llvm-svn: 327209	2018-03-10 16:05:35 +00:00
Sanjay Patel	2052a093cc	[ConstantFold] fp_binop AnyConstant, undef --> NaN With the updated LangRef ( D44216 / rL327138 ) in place, we can proceed with more constant folding. I'm intentionally taking the conservative path here: no matter what the constant or the FMF, we can always fold to NaN. This is because the undef operand can be chosen as NaN, and in our simplified default FP env, nothing else happens - NaN just propagates to the result. If we find some way/need to propagate undef instead, that can be added subsequently. The tests show that we always choose the same quiet NaN constant (0x7FF8000000000000 in IR text). There were suggestions to improve that with a 'NaN' string token or not always print a 64-bit hex value, but those are independent changes. We might also consider setting/propagating the payload of NaN constants as an enhancement. Differential Revision: https://reviews.llvm.org/D44308 llvm-svn: 327208	2018-03-10 15:56:25 +00:00
Florian Hahn	3a66267822	[PartialInlining] Use isInlineViable to detect constructs preventing inlining. Use isInlineViable to prevent inlining of functions with non-inlinable constructs, in case cost analysis is skipped. Reviewers: efriedma, sfertile, davide, davidxl Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D42846 llvm-svn: 327207	2018-03-10 14:53:44 +00:00
Craig Topper	3667827157	[X86] Add a missing EVEX instruction to EmitAnyX86InstComments. The equivalent SSE and VEX instruction are already there. llvm-svn: 327205	2018-03-10 06:05:13 +00:00
Sam Clegg	047b2e537b	[WebAssembly] Object: Add accessor for wasm symbols llvm-svn: 327203	2018-03-10 05:34:04 +00:00
Craig Topper	d7116b6014	[X86] Move the AC_EVEX_2_VEX AsmComments enum to X86InstrInfo.h from X86InstComments.h. X86InstComments.h is used by tools that only have the MC layer. We shouldn't be importing a file from CodeGen into this. X86InstrInfo.h isn't a great place, but I couldn't find a better one. llvm-svn: 327202	2018-03-10 05:15:22 +00:00
George Burgess IV	a84d40c19d	[AliasAnalysis] Shrink AliasResults; NFC We're persisting AliasResults in some places in MemorySSA, so the size of these now matters a little bit (well, 8 regular-sized bits, to be precise). Do the same for ModRefInfo for consistency. llvm-svn: 327201	2018-03-10 03:34:43 +00:00
Craig Topper	5514b7c51d	[X86] Rewrite printMasking code in X86InstComments to use TSFlags to determine whether the instruction is masked. This should have been NFC, but it looks like we were missing PUNPCKLHQDQ/PUNPCKLQDQ instructions in there. llvm-svn: 327200	2018-03-10 03:12:00 +00:00
George Burgess IV	c5340f63b0	Clean up a temp file on the buildbots. r327100 made us stop producing vecreduce-propagate-sd-flags.s, but it's still sticking around on some bots. This makes the bots unhappy. I'll revert this tomorrow. llvm-svn: 327199	2018-03-10 02:51:10 +00:00
Rafael Espindola	63fb26477c	Go back to sometimes assuming intristics are local. This fixes pr36674. While it is valid for shouldAssumeDSOLocal to return false anytime, always returning false for intrinsics is not optimal on i386 and also hits a bug in the backend. To use a plt, the caller must first setup ebx to handle the case of that file being linked into a PIE executable or shared library. In those cases the generated PLT uses ebx. Currently we can produce "calll expf@plt" without setting ebx. We could fix that by correctly setting ebx, but this would produce worse code for the case where the runtime library is statically linked. It would also required other tools to handle R_386_PLT32. llvm-svn: 327198	2018-03-10 02:42:14 +00:00
Nirav Dave	fc86632712	Revert: r327172 "Correct load-op-store cycle detection analysis" r327171 "Improve Dependency analysis when doing multi-node Instruction Selection" r328170 "[DAG] Enforce stricter NodeId invariant during Instruction selection" Reverting patch as NodeId invariant change is causing pathological increases in compile time on PPC llvm-svn: 327197	2018-03-10 02:16:15 +00:00
Craig Topper	fd7eb669d2	[TwoAddressInstructionPass] Improve tryInstructionCommute of X86 FMA and vpternlog instructions These instructions have 3 operands that can be commuted. The first commute we find may not be the best. So we should keep searching if we performed an aggressive commute. There may still be an operand that is killed or a physical register constraint that might be better. Differential Revision: https://reviews.llvm.org/D44324 llvm-svn: 327188	2018-03-09 23:36:58 +00:00
Heejin Ahn	10a149a017	[WebAssembly] Add EVT::getEVTString() for except_ref type Summary: This was missing in D43706 (rL326985). Reviewers: Subscribers: jfb, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D44325 llvm-svn: 327187	2018-03-09 23:29:06 +00:00
Peter Collingbourne	759fab8799	ADT: Make MapVector::value_type and MapVector::size_type public. NFC. llvm-svn: 327180	2018-03-09 22:52:22 +00:00
Craig Topper	9b2fe7d931	[TargetLowering] Remove redundant term in two ifs in SimplifySetCC. NFC If we get into the right hand side of the OR, we know that isOpaque is true since the left hand size is just a check that it's false. llvm-svn: 327179	2018-03-09 22:24:12 +00:00
Ulrich Weigand	734a11275c	Revert "[Debug] Retain both sets of debug intrinsics in HoistThenElseCodeToIf" This reverts commit r327175 as problems in debug info generation were shown. llvm-svn: 327176	2018-03-09 22:00:10 +00:00
Ulrich Weigand	270a9824ed	[Debug] Retain both sets of debug intrinsics in HoistThenElseCodeToIf When hoisting common code from the "then" and "else" branches of a condition to before the "if", there is no need to require that debug intrinsics match before moving them (and merging them). Instead, we can simply always keep all debug intrinsics from both sides of the "if". This fixes PR36410, which describes a problem where as a result of the attempt to merge debug locations for two debug intrinsics we end up with an invalid intrinsic, where the scope indicated in the !dbg location no longer matches the scope of the variable tracked by the intrinsic. In addition, this has the benefit that we no longer throw away information that is actually still valid, helping to generate better debug data. Reviewed By: vsk Differential Revision: https://reviews.llvm.org/D44312 llvm-svn: 327175	2018-03-09 21:37:07 +00:00
Stefan Pintilie	5df876097c	[Power9] Code Cleaup and adding Comments for Power 9 Scheduler Did some code cleanup up removing ItinRW that are not needed and resource types that are no longer used. Also added more comments to the td files related to the Power 9 sheduler model. llvm-svn: 327174	2018-03-09 21:08:35 +00:00
Renato Golin	0bfc33f33e	[NFC] Consolidate six getPointerOperand() utility functions into one place There are six separate instances of getPointerOperand() utility. LoopVectorize.cpp has one of them, and I don't want to create a 7th one while I'm trying to move LoopVectorizationLegality into a separate file (eventual objective is to move it to Analysis tree). See http://lists.llvm.org/pipermail/llvm-dev/2018-February/120999.html for llvm-dev discussions Closes D43323. Patch by Hideki Saito <hideki.saito@intel.com>. llvm-svn: 327173	2018-03-09 21:05:58 +00:00
Nirav Dave	6d554c47b0	Correct load-op-store cycle detection analysis Add missing cycle dependency checks in load-op-store fusion. Fixes PR36274. Reviewers: craig.topper, bogner Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D43154 llvm-svn: 327172	2018-03-09 20:58:07 +00:00
Nirav Dave	f402fe9644	Improve Dependency analysis when doing multi-node Instruction Selection Relanding after fixing NodeId Invariant. Cleanup cycle/validity checks in ISel (IsLegalToFold, HandleMergeInputChains) and X86 (isFusableLoadOpStore). Now do a full search for cycles / dependencies pruning the search when topological property of NodeId allows. As part of this propogate the NodeId-based cutoffs to narrow hasPreprocessorHelper searches. Reviewers: craig.topper, bogner Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D41293 llvm-svn: 327171	2018-03-09 20:57:42 +00:00
Nirav Dave	645ddf853a	[DAG] Enforce stricter NodeId invariant during Instruction selection Instruction Selection makes use of the topological ordering of nodes by node id (a node's operands have smaller node id than it) when doing cycle detection. During selection we may violate this property as a selection of multiple nodes may induce a use dependence (and thus a node id restriction) between two unrelated nodes. If a selected node has an unselected successor this may allow us to miss a cycle in detection an invalid selection. This patch fixes this by marking all unselected successors of a selected node have negated node id. We avoid pruning on such negative ids but still can reconstruct the original id for pruning. In-tree targets have been updated to replace DAG-level replacements with ISel-level ones which enforce this property. This preemptively fixes PR36312 before triggering commit r324359 relands Reviewers: craig.topper, bogner, jyknight Subscribers: arsenm, nhaehnle, javed.absar, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D43198 llvm-svn: 327170	2018-03-09 20:57:15 +00:00
Nirav Dave	e452019162	Make early exit hasPredecessorHelper return true. NFCI. All uses conservatively assume in early exit case that it will be a predecessor. Changing default removes checking code in all uses. llvm-svn: 327169	2018-03-09 20:56:51 +00:00
Rafael Auler	4c9b85b364	[llvm-objdump] Support disassembling by symbol name Summary: Add a new option -df to llvm-objdump that takes function names as arguments and instructs the disassembler to only dump those function contents. Based on code originally written by Bill Nell. Reviewers: espindola, JDevlieghere Differential Revision: https://reviews.llvm.org/D44224 llvm-svn: 327164	2018-03-09 19:13:44 +00:00
Peter Collingbourne	1834ad0e4a	Use branch funnels for virtual calls when retpoline mitigation is enabled. The retpoline mitigation for variant 2 of CVE-2017-5715 inhibits the branch predictor, and as a result it can lead to a measurable loss of performance. We can reduce the performance impact of retpolined virtual calls by replacing them with a special construct known as a branch funnel, which is an instruction sequence that implements virtual calls to a set of known targets using a binary tree of direct branches. This allows the processor to speculately execute valid implementations of the virtual function without allowing for speculative execution of of calls to arbitrary addresses. This patch extends the whole-program devirtualization pass to replace certain virtual calls with calls to branch funnels, which are represented using a new llvm.icall.jumptable intrinsic. It also extends the LowerTypeTests pass to recognize the new intrinsic, generate code for the branch funnels (x86_64 only for now) and lay out virtual tables as required for each branch funnel. The implementation supports full LTO as well as ThinLTO, and extends the ThinLTO summary format used for whole-program devirtualization to support branch funnels. For more details see RFC: http://lists.llvm.org/pipermail/llvm-dev/2018-January/120672.html Differential Revision: https://reviews.llvm.org/D42453 llvm-svn: 327163	2018-03-09 19:11:44 +00:00
Alina Sbirlea	fe81859a6a	Avoid creating a Constant for each value in a ConstantDataSequential. Summary: We create a ConstantDataSequential (ConstantDataArray or ConstantDataVector) to avoid creating a Constant for each element in an array of constants. But them in AsmPrinter, we do create a ConstantFP for each element in the ConstantDataSequential. This triggers excessive memory use when generating large global FP constants. Reviewers: bogner, lhames, t.p.northover Subscribers: jlebar, sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D44277 llvm-svn: 327161	2018-03-09 18:48:20 +00:00
Rafael Espindola	623d19e403	Delay creating an alias for @@@. With this we only create an alias for @@@ once we know if it should use @ or @@. This avoids last minutes renames and hacks to handle MS names. This only handles the ELF writer. LTO still has issues with @@@ aliases. llvm-svn: 327160	2018-03-09 18:42:25 +00:00
Simon Pilgrim	b899b52c84	[X86][AVX] createVariablePermute - fix v2i64/v2f64 VPERMILPD index creation. The input indices vector will put the index in bit0, but VPERMILPD actually selects off bit1 - so we need to scale accordingly. llvm-svn: 327159	2018-03-09 18:37:56 +00:00
Nicolai Haehnle	450382f892	TableGen: Remove space at EOL in TGLexer.{h,cpp} Change-Id: Ica5f39470174e85f173d3b6db95789033f75ce17 llvm-svn: 327158	2018-03-09 18:32:04 +00:00
Simon Pilgrim	bd3a277bb8	[X86][SSE] createVariablePermute - move source vector canonicalization to top of function. NFCI. This is to make it easier to return early from the switch statement with custom lowering. llvm-svn: 327157	2018-03-09 18:08:08 +00:00
Renato Golin	d1d0f87c24	[LV] Adding test for r327109 llvm-svn: 327155	2018-03-09 18:02:36 +00:00
Farhana Aleen	c0ae8f8658	[AMDGPU] Supported ds_read_b128 generation; Widened vector length for local address-space. Summary: Starting from GCN 2nd generation, ISA supports ds_read_b128 on top of ds_read_b64. This patch supports ds_read_b128 instruction pattern and generation of this instruction. In the vectorizer, this patch also widen the vector length so that vectorizer generates 128 bit loads for local address-space which gets translated to ds_read_b128. Since the performance benefit is not clear; compiler generates ds_read_b128 under -amdgpu-ds128. Author: FarhanaAleen Reviewed By: rampitec, arsenm Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D44210 llvm-svn: 327153	2018-03-09 17:41:39 +00:00
Aditya Nandakumar	fec67248ae	[GISel]: Add helpers for easy building G_FCONSTANT along with matchers Added helpers to build G_FCONSTANT, along with matching ConstantFP and unit tests for the same. Sample usage. auto MIB = Builder.buildFConstant(s32, 0.5); // Build IEEESingle For Matching the above const ConstantFP* Tmp; mi_match(DstReg, MRI, m_GFCst(Tmp)); https://reviews.llvm.org/D44128 reviewed by: volkan llvm-svn: 327152	2018-03-09 17:31:51 +00:00
Chad Rosier	8e25cd02db	[JumpThreading] Don't restrict cast-traversal to i1 In r263618, JumpThreading learned to look trough simple cast instructions, but only if the source of those cast instructions was a phi/cmp i1 (in an effort to limit compile time effects). I think this condition is too restrictive. For switches with limited value range, InstCombine will readily introduce an extra trunc instruction to a smaller integer type (e.g. from i8 to i2), leaving us in the somewhat perverse situation that jump-threading would work before running instcombine, but not after. Since instcombine produces this pattern, I think we need to consider it canonical and support it in JumpThreading. In general, for limiting recursion, I think the existing restriction to phi and cmp nodes should be sufficient to avoid looking through unprofitable chains of instructions. Patch by Keno Fischer! Differential Revision: https://reviews.llvm.org/D42262 llvm-svn: 327150	2018-03-09 16:43:46 +00:00
Adrian Prantl	3056a87e37	Move generic test to the Generic directory llvm-svn: 327148	2018-03-09 16:42:05 +00:00
Sanjay Patel	5271e91b8e	[AMDGPU] fix test to be independent of FP undef llvm-svn: 327147	2018-03-09 16:33:34 +00:00
Nicholas Wilson	61acc6835c	[WebAssembly] Disallow weak undefined globals in the object format This implements https://github.com/WebAssembly/tool-conventions/pull/47 Differential Revision: https://reviews.llvm.org/D44201 llvm-svn: 327146	2018-03-09 16:30:44 +00:00
Sanjay Patel	6f17f3d76f	[InstSimplify] fix FP infinite hex constant values in tests; NFC Really should improve this... llvm-svn: 327144	2018-03-09 16:14:02 +00:00
Stefan Pintilie	1651fe32fd	Revert "[PowerPC] LSR tunings for PowerPC" Revert the rest of the LST tune commit. It seems that the LSR tune commit breaks internal tests. Reverting the commit. llvm-svn: 327143	2018-03-09 16:08:55 +00:00
Stefan Pintilie	0ced7d7015	Revert "[PowerPC] Move test to correct location." Revert part of the LSR tune commit. llvm-svn: 327142	2018-03-09 16:08:48 +00:00
Simon Pilgrim	ef856f262a	Tidyup comment that was destroyed by clang-format. NFCI. llvm-svn: 327141	2018-03-09 15:50:09 +00:00
Simon Pilgrim	9e78cd8573	[X86][SSE] createVariablePermute - move index vector canonicalization to top of function. NFCI. This is to make it easier to return early from the switch statement with custom lowering. llvm-svn: 327140	2018-03-09 15:48:56 +00:00
Tim Northover	47d2b317bb	Try to fix Windows bot by forcing "rm". llvm-svn: 327139	2018-03-09 15:44:59 +00:00
Sanjay Patel	a5102714e6	[LangRef] make it clear that FP instructions do not have side effects Also, fix the undef vs. UB example to use 'sdiv' because that can trigger div-by-zero UB. The existing text for the constrained intrinsics says: "By default, LLVM optimization passes assume that the rounding mode is round-to-nearest and that floating point exceptions will not be monitored. Constrained FP intrinsics are used to support non-default rounding modes and accurately preserve exception behavior without compromising LLVM’s ability to optimize FP code when the default behavior is used." ...so the additional text with the normal FP opcodes should make the different modes clear. Differential Revision: https://reviews.llvm.org/D44216 llvm-svn: 327138	2018-03-09 15:27:48 +00:00
Jonas Devlieghere	18e69dc8f4	[dsymutil] Unify error handling and add color We improved the handling of errors and warnings in dwarfdump's verifier in rL314498. This patch does the same thing for dsymutil. Differential revision: https://reviews.llvm.org/D44052 llvm-svn: 327137	2018-03-09 15:22:42 +00:00
Hans Wennborg	28e4160e7d	CMake: Make libxml2 show up in --system-libs (PR36660) lib/WindowsManifest/CMakeLists.txt adds it to LLVM_SYSTEM_LIBS on that target, but it was never getting picked up in tools/llvm-config/CMakeLists.txt. Differential Revision: https://reviews.llvm.org/D44302 llvm-svn: 327135	2018-03-09 14:46:44 +00:00
Sebastian Pop	5cb4d1ecb7	[x86][aarch64] ask the backend whether it has a vector blend instruction The code to match and produce more x86 vector blends was enabled for all architectures even though the transform may pessimize the code for other architectures that do not provide a vector blend instruction. Added an aarch64 testcase to check that a VZIP instruction is generated instead of byte movs. Differential Revision: https://reviews.llvm.org/D44118 llvm-svn: 327132	2018-03-09 14:29:21 +00:00
Andrea Di Biagio	0a931ef145	[llvm-mca] Move the logic that prints the summary into its own view. NFCI llvm-svn: 327128	2018-03-09 13:52:03 +00:00

... 4 5 6 7 8 ...

161436 Commits