llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Nico Weber	c79c511d2f	Add REQUIRES: arm-registered-target to test using an arm-apple-ios triple. Else, the test fails in LLVM_TARGETS_TO_BUILD=X86 builds like so: bin/llvm-mc: : error: unable to get target for 'arm64-apple-ios7.0.0' llvm-svn: 327233	2018-03-11 03:17:34 +00:00
Andrea Di Biagio	286527ebc6	[TargetSchedule] Minor refactor in computeInstrLatency. NFC The intent of revision r300311 was to add a check for invalid scheduling class descriptors. However, it ended up adding a redundant call in a basic block that should not be reachable. llvm-svn: 327231	2018-03-11 00:51:33 +00:00
George Burgess IV	be98554fc6	Revert r327199: "Clean up a temp file on the buildbots" "I'll revert this tomorrow," I said yesterday. This should've reached all the bots it can by now. llvm-svn: 327230	2018-03-10 23:22:46 +00:00
Craig Topper	77d430c110	[X86] Add comments to the end of FMA3 instructions to make the operation clear Summary: There are 3 different operand orders for FMA instructions so figuring out the exact operation being performed requires a lot of thought. This patch adds a comment to the end of the assembly line to print the exact operation. I think I've got all the instructions in here except the ones with builtin rounding. I didn't update all tests, but I assume we can get them as we regenerate tests in the future. Reviewers: spatel, v_klochkov, RKSimon Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44345 llvm-svn: 327225	2018-03-10 21:30:46 +00:00
Andrea Di Biagio	63ab384f89	[llvm-mca] Fix use-of-uninitialized-value error reported by the MemorySanitizer. This should make the buildbots green again. llvm-svn: 327223	2018-03-10 20:52:59 +00:00
Simon Pilgrim	0b5d96e408	[X86][XOP] createVariablePermute - use VPERMIL2 for v8i32/v4i64 variable permutes llvm-svn: 327222	2018-03-10 19:49:59 +00:00
Martin Storsjo	6ebf60a0f6	[AArch64] Implement native TLS for Windows Differential Revision: https://reviews.llvm.org/D43971 llvm-svn: 327220	2018-03-10 19:05:21 +00:00
Mandeep Singh Grang	1c09e88fe4	[ADT] Shuffle containers before sorting to uncover non-deterministic behavior Summary: std::sort and array_pod_sort both use non-stable sorting algorithms. This means that the relative order of elements with the same key is undefined. This patch is an attempt to uncover such scenarios by randomly shuffling all containers before sorting, if EXPENSIVE_CHECKS is enabled. Here's the bugzilla for this: https://bugs.llvm.org/show_bug.cgi?id=35135 Reviewers: dblaikie, dexonsmith, chandlerc, efriedma, RKSimon Reviewed By: RKSimon Subscribers: fhahn, davide, RKSimon, vsk, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D39245 llvm-svn: 327219	2018-03-10 18:59:14 +00:00
Simon Pilgrim	e3afc5b47d	[X86][XOP] createVariablePermute - use VPPERM for v16i16 variable permutes llvm-svn: 327218	2018-03-10 18:33:29 +00:00
Simon Pilgrim	3cf8fd83cc	[X86][SSE] createVariablePermute - create index scaling helper. NFCI. This will help in some future changes for custom lowering. llvm-svn: 327217	2018-03-10 18:12:35 +00:00
Andrea Di Biagio	17857e9071	[llvm-mca] BackendStatistics: early exit from method printSchedulerUsage if the no scheduler resources were consumed. llvm-svn: 327215	2018-03-10 17:40:25 +00:00
Andrea Di Biagio	13307dcaad	[llvm-mca] Views are now independent from resource masks. NFCI This change removes method Backend::getProcResourceMasks() and simplifies some logic in the Views. This effectively removes yet another dependency between the views and the Backend. No functional change intended. llvm-svn: 327214	2018-03-10 16:55:07 +00:00
Simon Pilgrim	504e94710f	[X86][XOP] createVariablePermute - use VPPERM for v32i8 variable permutes llvm-svn: 327213	2018-03-10 16:51:45 +00:00
Sanjay Patel	3b1dee4eab	[InstSimplify] fp_binop X, undef --> NaN The variable operand could be NaN, so it's always safe to propagate NaN. llvm-svn: 327212	2018-03-10 16:51:28 +00:00
Sanjay Patel	77f74c5e5e	[AMDGPU] fix tests to be independent of FP undef llvm-svn: 327211	2018-03-10 16:39:59 +00:00
Sanjay Patel	ad67224a59	[PowerPC] fix tests to be independent of FP undef llvm-svn: 327210	2018-03-10 16:14:05 +00:00
Matt Arsenault	99343a3814	AMDGPU: Fix crash when constant folding with physreg operand llvm-svn: 327209	2018-03-10 16:05:35 +00:00
Sanjay Patel	2052a093cc	[ConstantFold] fp_binop AnyConstant, undef --> NaN With the updated LangRef ( D44216 / rL327138 ) in place, we can proceed with more constant folding. I'm intentionally taking the conservative path here: no matter what the constant or the FMF, we can always fold to NaN. This is because the undef operand can be chosen as NaN, and in our simplified default FP env, nothing else happens - NaN just propagates to the result. If we find some way/need to propagate undef instead, that can be added subsequently. The tests show that we always choose the same quiet NaN constant (0x7FF8000000000000 in IR text). There were suggestions to improve that with a 'NaN' string token or not always print a 64-bit hex value, but those are independent changes. We might also consider setting/propagating the payload of NaN constants as an enhancement. Differential Revision: https://reviews.llvm.org/D44308 llvm-svn: 327208	2018-03-10 15:56:25 +00:00
Florian Hahn	3a66267822	[PartialInlining] Use isInlineViable to detect constructs preventing inlining. Use isInlineViable to prevent inlining of functions with non-inlinable constructs, in case cost analysis is skipped. Reviewers: efriedma, sfertile, davide, davidxl Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D42846 llvm-svn: 327207	2018-03-10 14:53:44 +00:00
Craig Topper	3667827157	[X86] Add a missing EVEX instruction to EmitAnyX86InstComments. The equivalent SSE and VEX instruction are already there. llvm-svn: 327205	2018-03-10 06:05:13 +00:00
Sam Clegg	047b2e537b	[WebAssembly] Object: Add accessor for wasm symbols llvm-svn: 327203	2018-03-10 05:34:04 +00:00
Craig Topper	d7116b6014	[X86] Move the AC_EVEX_2_VEX AsmComments enum to X86InstrInfo.h from X86InstComments.h. X86InstComments.h is used by tools that only have the MC layer. We shouldn't be importing a file from CodeGen into this. X86InstrInfo.h isn't a great place, but I couldn't find a better one. llvm-svn: 327202	2018-03-10 05:15:22 +00:00
George Burgess IV	a84d40c19d	[AliasAnalysis] Shrink AliasResults; NFC We're persisting AliasResults in some places in MemorySSA, so the size of these now matters a little bit (well, 8 regular-sized bits, to be precise). Do the same for ModRefInfo for consistency. llvm-svn: 327201	2018-03-10 03:34:43 +00:00
Craig Topper	5514b7c51d	[X86] Rewrite printMasking code in X86InstComments to use TSFlags to determine whether the instruction is masked. This should have been NFC, but it looks like we were missing PUNPCKLHQDQ/PUNPCKLQDQ instructions in there. llvm-svn: 327200	2018-03-10 03:12:00 +00:00
George Burgess IV	c5340f63b0	Clean up a temp file on the buildbots. r327100 made us stop producing vecreduce-propagate-sd-flags.s, but it's still sticking around on some bots. This makes the bots unhappy. I'll revert this tomorrow. llvm-svn: 327199	2018-03-10 02:51:10 +00:00
Rafael Espindola	63fb26477c	Go back to sometimes assuming intristics are local. This fixes pr36674. While it is valid for shouldAssumeDSOLocal to return false anytime, always returning false for intrinsics is not optimal on i386 and also hits a bug in the backend. To use a plt, the caller must first setup ebx to handle the case of that file being linked into a PIE executable or shared library. In those cases the generated PLT uses ebx. Currently we can produce "calll expf@plt" without setting ebx. We could fix that by correctly setting ebx, but this would produce worse code for the case where the runtime library is statically linked. It would also required other tools to handle R_386_PLT32. llvm-svn: 327198	2018-03-10 02:42:14 +00:00
Nirav Dave	fc86632712	Revert: r327172 "Correct load-op-store cycle detection analysis" r327171 "Improve Dependency analysis when doing multi-node Instruction Selection" r328170 "[DAG] Enforce stricter NodeId invariant during Instruction selection" Reverting patch as NodeId invariant change is causing pathological increases in compile time on PPC llvm-svn: 327197	2018-03-10 02:16:15 +00:00
Craig Topper	fd7eb669d2	[TwoAddressInstructionPass] Improve tryInstructionCommute of X86 FMA and vpternlog instructions These instructions have 3 operands that can be commuted. The first commute we find may not be the best. So we should keep searching if we performed an aggressive commute. There may still be an operand that is killed or a physical register constraint that might be better. Differential Revision: https://reviews.llvm.org/D44324 llvm-svn: 327188	2018-03-09 23:36:58 +00:00
Heejin Ahn	10a149a017	[WebAssembly] Add EVT::getEVTString() for except_ref type Summary: This was missing in D43706 (rL326985). Reviewers: Subscribers: jfb, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D44325 llvm-svn: 327187	2018-03-09 23:29:06 +00:00
Peter Collingbourne	759fab8799	ADT: Make MapVector::value_type and MapVector::size_type public. NFC. llvm-svn: 327180	2018-03-09 22:52:22 +00:00
Craig Topper	9b2fe7d931	[TargetLowering] Remove redundant term in two ifs in SimplifySetCC. NFC If we get into the right hand side of the OR, we know that isOpaque is true since the left hand size is just a check that it's false. llvm-svn: 327179	2018-03-09 22:24:12 +00:00
Ulrich Weigand	734a11275c	Revert "[Debug] Retain both sets of debug intrinsics in HoistThenElseCodeToIf" This reverts commit r327175 as problems in debug info generation were shown. llvm-svn: 327176	2018-03-09 22:00:10 +00:00
Ulrich Weigand	270a9824ed	[Debug] Retain both sets of debug intrinsics in HoistThenElseCodeToIf When hoisting common code from the "then" and "else" branches of a condition to before the "if", there is no need to require that debug intrinsics match before moving them (and merging them). Instead, we can simply always keep all debug intrinsics from both sides of the "if". This fixes PR36410, which describes a problem where as a result of the attempt to merge debug locations for two debug intrinsics we end up with an invalid intrinsic, where the scope indicated in the !dbg location no longer matches the scope of the variable tracked by the intrinsic. In addition, this has the benefit that we no longer throw away information that is actually still valid, helping to generate better debug data. Reviewed By: vsk Differential Revision: https://reviews.llvm.org/D44312 llvm-svn: 327175	2018-03-09 21:37:07 +00:00
Stefan Pintilie	5df876097c	[Power9] Code Cleaup and adding Comments for Power 9 Scheduler Did some code cleanup up removing ItinRW that are not needed and resource types that are no longer used. Also added more comments to the td files related to the Power 9 sheduler model. llvm-svn: 327174	2018-03-09 21:08:35 +00:00
Renato Golin	0bfc33f33e	[NFC] Consolidate six getPointerOperand() utility functions into one place There are six separate instances of getPointerOperand() utility. LoopVectorize.cpp has one of them, and I don't want to create a 7th one while I'm trying to move LoopVectorizationLegality into a separate file (eventual objective is to move it to Analysis tree). See http://lists.llvm.org/pipermail/llvm-dev/2018-February/120999.html for llvm-dev discussions Closes D43323. Patch by Hideki Saito <hideki.saito@intel.com>. llvm-svn: 327173	2018-03-09 21:05:58 +00:00
Nirav Dave	6d554c47b0	Correct load-op-store cycle detection analysis Add missing cycle dependency checks in load-op-store fusion. Fixes PR36274. Reviewers: craig.topper, bogner Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D43154 llvm-svn: 327172	2018-03-09 20:58:07 +00:00
Nirav Dave	f402fe9644	Improve Dependency analysis when doing multi-node Instruction Selection Relanding after fixing NodeId Invariant. Cleanup cycle/validity checks in ISel (IsLegalToFold, HandleMergeInputChains) and X86 (isFusableLoadOpStore). Now do a full search for cycles / dependencies pruning the search when topological property of NodeId allows. As part of this propogate the NodeId-based cutoffs to narrow hasPreprocessorHelper searches. Reviewers: craig.topper, bogner Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D41293 llvm-svn: 327171	2018-03-09 20:57:42 +00:00
Nirav Dave	645ddf853a	[DAG] Enforce stricter NodeId invariant during Instruction selection Instruction Selection makes use of the topological ordering of nodes by node id (a node's operands have smaller node id than it) when doing cycle detection. During selection we may violate this property as a selection of multiple nodes may induce a use dependence (and thus a node id restriction) between two unrelated nodes. If a selected node has an unselected successor this may allow us to miss a cycle in detection an invalid selection. This patch fixes this by marking all unselected successors of a selected node have negated node id. We avoid pruning on such negative ids but still can reconstruct the original id for pruning. In-tree targets have been updated to replace DAG-level replacements with ISel-level ones which enforce this property. This preemptively fixes PR36312 before triggering commit r324359 relands Reviewers: craig.topper, bogner, jyknight Subscribers: arsenm, nhaehnle, javed.absar, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D43198 llvm-svn: 327170	2018-03-09 20:57:15 +00:00
Nirav Dave	e452019162	Make early exit hasPredecessorHelper return true. NFCI. All uses conservatively assume in early exit case that it will be a predecessor. Changing default removes checking code in all uses. llvm-svn: 327169	2018-03-09 20:56:51 +00:00
Rafael Auler	4c9b85b364	[llvm-objdump] Support disassembling by symbol name Summary: Add a new option -df to llvm-objdump that takes function names as arguments and instructs the disassembler to only dump those function contents. Based on code originally written by Bill Nell. Reviewers: espindola, JDevlieghere Differential Revision: https://reviews.llvm.org/D44224 llvm-svn: 327164	2018-03-09 19:13:44 +00:00
Peter Collingbourne	1834ad0e4a	Use branch funnels for virtual calls when retpoline mitigation is enabled. The retpoline mitigation for variant 2 of CVE-2017-5715 inhibits the branch predictor, and as a result it can lead to a measurable loss of performance. We can reduce the performance impact of retpolined virtual calls by replacing them with a special construct known as a branch funnel, which is an instruction sequence that implements virtual calls to a set of known targets using a binary tree of direct branches. This allows the processor to speculately execute valid implementations of the virtual function without allowing for speculative execution of of calls to arbitrary addresses. This patch extends the whole-program devirtualization pass to replace certain virtual calls with calls to branch funnels, which are represented using a new llvm.icall.jumptable intrinsic. It also extends the LowerTypeTests pass to recognize the new intrinsic, generate code for the branch funnels (x86_64 only for now) and lay out virtual tables as required for each branch funnel. The implementation supports full LTO as well as ThinLTO, and extends the ThinLTO summary format used for whole-program devirtualization to support branch funnels. For more details see RFC: http://lists.llvm.org/pipermail/llvm-dev/2018-January/120672.html Differential Revision: https://reviews.llvm.org/D42453 llvm-svn: 327163	2018-03-09 19:11:44 +00:00
Alina Sbirlea	fe81859a6a	Avoid creating a Constant for each value in a ConstantDataSequential. Summary: We create a ConstantDataSequential (ConstantDataArray or ConstantDataVector) to avoid creating a Constant for each element in an array of constants. But them in AsmPrinter, we do create a ConstantFP for each element in the ConstantDataSequential. This triggers excessive memory use when generating large global FP constants. Reviewers: bogner, lhames, t.p.northover Subscribers: jlebar, sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D44277 llvm-svn: 327161	2018-03-09 18:48:20 +00:00
Rafael Espindola	623d19e403	Delay creating an alias for @@@. With this we only create an alias for @@@ once we know if it should use @ or @@. This avoids last minutes renames and hacks to handle MS names. This only handles the ELF writer. LTO still has issues with @@@ aliases. llvm-svn: 327160	2018-03-09 18:42:25 +00:00
Simon Pilgrim	b899b52c84	[X86][AVX] createVariablePermute - fix v2i64/v2f64 VPERMILPD index creation. The input indices vector will put the index in bit0, but VPERMILPD actually selects off bit1 - so we need to scale accordingly. llvm-svn: 327159	2018-03-09 18:37:56 +00:00
Nicolai Haehnle	450382f892	TableGen: Remove space at EOL in TGLexer.{h,cpp} Change-Id: Ica5f39470174e85f173d3b6db95789033f75ce17 llvm-svn: 327158	2018-03-09 18:32:04 +00:00
Simon Pilgrim	bd3a277bb8	[X86][SSE] createVariablePermute - move source vector canonicalization to top of function. NFCI. This is to make it easier to return early from the switch statement with custom lowering. llvm-svn: 327157	2018-03-09 18:08:08 +00:00
Renato Golin	d1d0f87c24	[LV] Adding test for r327109 llvm-svn: 327155	2018-03-09 18:02:36 +00:00
Farhana Aleen	c0ae8f8658	[AMDGPU] Supported ds_read_b128 generation; Widened vector length for local address-space. Summary: Starting from GCN 2nd generation, ISA supports ds_read_b128 on top of ds_read_b64. This patch supports ds_read_b128 instruction pattern and generation of this instruction. In the vectorizer, this patch also widen the vector length so that vectorizer generates 128 bit loads for local address-space which gets translated to ds_read_b128. Since the performance benefit is not clear; compiler generates ds_read_b128 under -amdgpu-ds128. Author: FarhanaAleen Reviewed By: rampitec, arsenm Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D44210 llvm-svn: 327153	2018-03-09 17:41:39 +00:00
Aditya Nandakumar	fec67248ae	[GISel]: Add helpers for easy building G_FCONSTANT along with matchers Added helpers to build G_FCONSTANT, along with matching ConstantFP and unit tests for the same. Sample usage. auto MIB = Builder.buildFConstant(s32, 0.5); // Build IEEESingle For Matching the above const ConstantFP* Tmp; mi_match(DstReg, MRI, m_GFCst(Tmp)); https://reviews.llvm.org/D44128 reviewed by: volkan llvm-svn: 327152	2018-03-09 17:31:51 +00:00
Chad Rosier	8e25cd02db	[JumpThreading] Don't restrict cast-traversal to i1 In r263618, JumpThreading learned to look trough simple cast instructions, but only if the source of those cast instructions was a phi/cmp i1 (in an effort to limit compile time effects). I think this condition is too restrictive. For switches with limited value range, InstCombine will readily introduce an extra trunc instruction to a smaller integer type (e.g. from i8 to i2), leaving us in the somewhat perverse situation that jump-threading would work before running instcombine, but not after. Since instcombine produces this pattern, I think we need to consider it canonical and support it in JumpThreading. In general, for limiting recursion, I think the existing restriction to phi and cmp nodes should be sufficient to avoid looking through unprofitable chains of instructions. Patch by Keno Fischer! Differential Revision: https://reviews.llvm.org/D42262 llvm-svn: 327150	2018-03-09 16:43:46 +00:00

1 2 3 4 5 ...

161200 Commits