llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Daniel Sanders	c868861fe8	[globalisel][docs] Add a section about debugging with the block extractor Summary: Depends on D69644 Reviewers: rovka, volkan, arsenm Subscribers: wdng, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69645	2019-11-05 14:48:27 -08:00
Stanislav Mekhanoshin	43c3e520f5	[AMDGPU] Add missing flags to DS_Real Differential Revision: https://reviews.llvm.org/D69867	2019-11-05 14:24:48 -08:00
Sanjay Patel	292dd62299	[SLP] add tests for 2-wide reductions; NFC	2019-11-05 17:18:37 -05:00
Volodymyr Sapsai	2d616bdac4	Revert "[analyzer] Add test directory for scan-build." This reverts commit 0aba69eb1a01c44185009f50cc633e3c648e9950 with subsequent changes to test files. It caused test failures on GreenDragon, e.g., http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/	2019-11-05 14:03:36 -08:00
Teresa Johnson	6768a397d3	[IRMover] Use GlobalValue::getAddressSpace instead of directly from its type [NFC] Summary: Change the old form of G->getType()->getAddressSpace() to the new G->getAddressSpace() (underneath does the same). Patch by Ehud Katz <ehudkatz@gmail.com> Reviewers: tejohnson, chandlerc Reviewed By: tejohnson Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69550	2019-11-05 13:54:41 -08:00
Simon Atanasyan	fa3a3af046	[mips] Fix `getRegForInlineAsmConstraint` to do not crash on empty Constraint	2019-11-06 00:50:39 +03:00
Alina Sbirlea	b44d371b7c	[LoopRotationUtils] Check values are newly inserted into maps. This is a cleanup that came up in D63680. All values added to the ValueMaps should be newly added.	2019-11-05 13:40:10 -08:00
Simon Pilgrim	5c72f1396f	[Hexagon] getCompoundCandidateGroup - fix 'false' value is implicitly cast to unsigned warning. NFCI. Consistently return HexagonII::HCG_None.	2019-11-05 21:37:53 +00:00
Philip Reames	17c2029d62	[X86/Atomics] Correct a few transforms for new atomic lowering This is a partial fix for the issues described in commit message of 027aa27 (the revert of G24609). Unfortunately, I can't provide test coverage for it on it's own as the only (known) wrong example is still wrong, but due to a separate issue. These fixes are cases where when performing unrelated DAG combines, we were dropping the atomicity flags entirely.	2019-11-05 13:20:08 -08:00
Amy Huang	4bc07fee4b	[MIR] Add MIR parsing for heap alloc site instruction markers Summary: This patch adds MIR parsing and printing for heap alloc markers, which were added in D69136. They are printed as an operand similar to pre-/post-instr symbols, with a heap-alloc-marker token and a metadata node. Reviewers: rnk Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69864	2019-11-05 12:57:45 -08:00
Benjamin Kramer	7fd0a8ebef	[X86] Gate select->fmin/fmax transform on NoSignedZeros instead of UnsafeFPMath	2019-11-05 21:28:41 +01:00
Julian Lettner	0752aef3ee	Revert "[lit] Better/earlier errors when no tests are executed" This reverts commit d8f2bff75126c6dde694ad245f9807fa12ad5630.	2019-11-05 12:10:43 -08:00
Stanislav Mekhanoshin	f1ac1c199b	[AMDGPU] Removed dead code from R600ISelLowering.cpp This was added to inhibit a warning from gcc 7.3 according to the comment. However, it triggers warning from PVS. In addition I cannot reproduce it with gcc 7.4 and I also cannot reproduce it with gcc 7.3 using compiler explorer. Differential Revision: https://reviews.llvm.org/D69863	2019-11-05 12:02:48 -08:00
Philip Reames	594dcaeec7	[X86/Atomics] (Semantically) revert G246098, switch back to the old atomic example When writing an email for a follow up proposal, I realized one of the diffs in the committed change was incorrect. Digging into it revealed that the fix is complicated enough to require some thought, so reverting in the meantime. The problem is visible in this diff (from the revert): ; X64-SSE-LABEL: store_fp128: ; X64-SSE: # %bb.0: -; X64-SSE-NEXT: movaps %xmm0, (%rdi) +; X64-SSE-NEXT: subq $24, %rsp +; X64-SSE-NEXT: .cfi_def_cfa_offset 32 +; X64-SSE-NEXT: movaps %xmm0, (%rsp) +; X64-SSE-NEXT: movq (%rsp), %rsi +; X64-SSE-NEXT: movq {{[0-9]+}}(%rsp), %rdx +; X64-SSE-NEXT: callq __sync_lock_test_and_set_16 +; X64-SSE-NEXT: addq $24, %rsp +; X64-SSE-NEXT: .cfi_def_cfa_offset 8 ; X64-SSE-NEXT: retq store atomic fp128 %v, fp128* %fptr unordered, align 16 ret void The problem here is three fold: 1) x86-64 doesn't guarantee atomicity of anything larger than 8 bytes. Some platforms observably break this guarantee, others don't, but the codegen isn't considering this, so it's wrong on at least some platforms. 2) When I started to track down the problem, I discovered that DAGCombiner had stripped the atomicity off the store entirely. This comes down to idiomatic usage of DAG.getStore passing all MMO components separately as opposed to just passing the MMO. 3) On x86 (not -64), there are cases where 8 byte atomiciy is supported, but only for floating point operations. This would seem to imply that operation typing matters for correctness, and DAGCombine happily folds away bitcasts. I'm not 100% sure there's a problem here, but I'm not entirely sure there isn't either. I plan on returning to each issue in turn; sorry for the churn here.	2019-11-05 11:24:27 -08:00
Sid Manning	c105d26f97	[llvm-objdump] Fix spurious "The end of the file was unexpectedly encountered" if a SHT_NOBITS sh_offset is larger than the file size llvm-objdump -D this file: int a[100000]; int main() { return 0; } Will produce an error: "The end of the file was unexpectedly encountered". This happens because of a check in Binary.h checkOffset. (Addr + Size > M.getBufferEnd()). The sh_offset and sh_size fields can be ignored for SHT_NOBITS sections. Fix the error by changing ELFObjectFile<ELFT>::getSectionContents to use the file base for SHT_NOBITS sections. Reviewed By: grimar, MaskRay Differential Revision: https://reviews.llvm.org/D69192	2019-11-05 11:14:12 -08:00
Joel E. Denny	729f23e751	[lit] Fix `not` calling internal commands Without this patch, when using lit's internal shell, if `not` on a lit RUN line calls `env`, `diff`, or any of the other in-process shell builtins that lit implements, lit accidentally searches for the latter as an external executable. What's worse is that works fine when a developer is testing on a platform where those executables are available and behave as expected, but it then breaks on other platforms. `not` seems useful for some builtins, such as `diff`, so this patch supports such uses. `not --crash` does not seem useful for builtins, so this patch diagnoses such uses. In all cases, this patch ensures shell builtins are found behind any sequence of `env` and `not` commands. `not` calling `env` calling an external command appears useful when the `env` and external command are part of a lit substitution, as in D65156. This patch supports that by looking through any sequence of `env` and `not` commands, building the environment from the `env`s, and storing the `not`s. The `not`s are then added back to the command line without the `env`s to execute externally. This avoids the need to replicate the `not` implementation, in particular the `--crash` option, in lit. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D66531	2019-11-05 14:09:21 -05:00
Stanislav Mekhanoshin	53c349c0d4	[AMDGPU] Removed dead code handling M0CopyReg Static analyzer complains about always false condition. See https://bugs.llvm.org/show_bug.cgi?id=43886 Differential Revision: https://reviews.llvm.org/D69860	2019-11-05 11:05:13 -08:00
Benjamin Kramer	4240cc4eb1	[X86] Specifically limit fmin/fmax commutativity to NoNaNs + NoSignedZeros The backend UnsafeFPMath flag is not a superset of all the others, so limit it to the exact bits needed.	2019-11-05 19:34:06 +01:00
Daniel Sanders	7a5b72e3a3	[globalisel] Rename G_GEP to G_PTR_ADD Summary: G_GEP is rather poorly named. It's a simple pointer+scalar addition and doesn't support any of the complexities of getelementptr. I therefore propose that we rename it. There's a G_PTR_MASK so let's follow that convention and go with G_PTR_ADD Reviewers: volkan, aditya_nandakumar, bogner, rovka, arsenm Subscribers: sdardis, jvesely, wdng, nhaehnle, hiraditya, jrtc27, atanasyan, arphaman, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69734	2019-11-05 10:31:17 -08:00
Stanislav Mekhanoshin	771ecdbd33	[AMDGPU] return Fail instead of SolfFail from addOperand() addOperand() method of AMDGPU disassembler returns SoftFail on error. All instances which may lead to that place are an impossible encdoing, not something which is possible to encode, but semantically incorrect as described for SoftFail. Then tablegen generates a check of the following form: if (Decode...(..) == MCDisassembler::Fail) { return MCDisassembler::Fail; } Since we can only return Success and SoftFail that is dead code as detected by the static code analyzer. Solution: return Fail as it should be. See https://bugs.llvm.org/show_bug.cgi?id=43886 Differential Revision: https://reviews.llvm.org/D69819	2019-11-05 10:25:27 -08:00
Sergey Dmitriev	19956457be	[SLP] - Add couple safety checks to TreeEntry::dump(). NFC Summary: Check for MainOp and AltOp for NULL before dereferencing or issue NULL. Reviewers: Vasilis, dtemirbulatov, RKSimon, ABataev Reviewed By: ABataev Subscribers: mehdi_amini, hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69812	2019-11-05 09:57:30 -08:00
Daniel Sanders	76c9170daf	[globalisel][docs] Add KnownBits Analysis documentation Summary: This is largely based off of the slides from the keynote Depends on D69545 Reviewers: volkan, rovka, arsenm Subscribers: wdng, arphaman, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69644	2019-11-05 09:55:33 -08:00
Kazu Hirata	0aa34f6ffe	[JumpThreading] Factor out code to merge basic blocks (NFC) Summary: This patch factors out code to merge a basic block with its sole successor -- partly for readability and partly to facilitate an upcoming patch of my own. Reviewers: wmi Subscribers: hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69852	2019-11-05 09:46:57 -08:00
Steven Wu	dce4269ca7	Revert "[Object][MachO] Rewrite macho-invalid-fat-arch-size into YAML" The invalid binary trying to construct triggers an assertion.	2019-11-05 09:34:26 -08:00
Simon Pilgrim	e1dd12ca74	Remove redundant assignment. NFCI. Fixes cppcheck warning.	2019-11-05 17:08:08 +00:00
Simon Pilgrim	b36df7e694	Use iterator prefix increment. NFCI.	2019-11-05 17:08:08 +00:00
Simon Pilgrim	9827ef622e	[MachineOutliner] Reduce scope of variable and stop duplicate getMF() calls. NFCI.	2019-11-05 17:08:08 +00:00
Steven Wu	4275ea1a33	[Object][MachO] Rewrite macho-invalid-fat-arch-size into YAML Rewrite one of the invalid macho test input file with YAML file. The original invalid macho is breaking our internal test infrastusture because it is too broken to be copy around. rdar://problem/56879982	2019-11-05 09:07:06 -08:00
Fangrui Song	dd2cb2d821	[llvm-objcopy][ELF] Implement --only-keep-debug --only-keep-debug produces a debug file as the output that only preserves contents of sections useful for debugging purposes (the binutils implementation preserves SHT_NOTE and non-SHF_ALLOC sections), by changing their section types to SHT_NOBITS and rewritting file offsets. See https://sourceware.org/gdb/onlinedocs/gdb/Separate-Debug-Files.html The intended use case is: ``` llvm-objcopy --only-keep-debug a a.dbg llvm-objcopy --strip-debug a b llvm-objcopy --add-gnu-debuglink=a.dbg b ``` The current layout algorithm is incapable of deleting contents and shrinking segments, so it is not suitable for implementing the functionality. This patch adds a new algorithm which assigns sh_offset to sections first, then modifies p_offset/p_filesz of program headers. It bears a resemblance to lld/ELF/Writer.cpp. Reviewed By: jhenderson, jakehehrlich Differential Revision: https://reviews.llvm.org/D67137	2019-11-05 08:56:15 -08:00
Fangrui Song	c05a8dafbb	[llvm-objcopy][ELF] Add OriginalType & OriginalFlags `llvm::objcopy:🧝:*Section::classof` matches Type and Flags, yet Type and Flags are mutable (by setSectionFlagsAndTypes and upcoming --only-keep-debug feature). Add OriginalType & OriginalFlags to be used in classof, to prevent classof results from changing. Reviewed By: jakehehrlich, jhenderson, alexshap Differential Revision: https://reviews.llvm.org/D69739	2019-11-05 08:40:39 -08:00
David Green	f8855f1e47	[ARM] Multi-vector MVE spill test This is a test from D67169, that can now be added after the vld2 intrinsics were committed upstream.	2019-11-05 16:17:25 +00:00
jmolloy	548ef37194	[DFAPacketizer] Allow up to 64 functional units Summary: To drive the automaton we used a uint64_t as an action type. This contained the transition's resource requirements as a conjunction: (a OR b) AND (b OR c) We encoded this conjunction as a sequence of four 16-bit bitmasks. This limited the number of addressable functional units to 16, which is quite low and has bitten many people in the past. Instead, the DFAEmitter now generates a lookup table from InstrItinerary class (index of the ItinData inside the ProcItineraries) to an internal action index which is essentially a dense embedding of the conjunctive form. Because we never materialize the conjunctive form, we no longer have the 16 FU restriction. In this patch we limit to 64 functional units due to using a uint64_t bitmask in the DFAEmitter. Now that we've decoupled these representations we can increase this in future. Reviewers: ThomasRaoux, kparzysz, majnemer Reviewed By: ThomasRaoux Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69110	2019-11-05 15:41:42 +00:00
Gil Rapaport	829090e5cd	[LV] Apply sink-after & interleave-groups as VPlan transformations (NFC) This recommits 2be17087f8c38934b7fc9208ae6cf4e9b4d44f4b (reverted in d3ec06d219788801380af1948c7f7ef9d3c6100b for heap-use-after-free) with a fix in IAI's reset() which was not clearing the set of interleave groups after deleting them.	2019-11-05 17:29:13 +02:00
Simon Pilgrim	0070b851b0	Fix uninitialized variable warning. NFCI.	2019-11-05 15:15:14 +00:00
Simon Pilgrim	e2779a12db	[MCObjectFileInfo] Fix uninitialized variable warnings. NFCI.	2019-11-05 15:15:14 +00:00
Simon Pilgrim	6ff5dccbdc	[MachineOutliner] Fix uninitialized variable warnings. NFCI.	2019-11-05 15:15:14 +00:00
Francis Visoiu Mistrih	11150876b9	[ObjC][ARC] Ignore lifetime markers between *ReturnValue calls When eliminating a pair of `llvm.objc.autoreleaseReturnValue` followed by `llvm.objc.retainAutoreleasedReturnValue` we need to make sure that the instructions in between are safe to ignore. Other than bitcasts and useless GEPs, it's also safe to ignore lifetime markers for both static allocas (lifetime.start/lifetime.end) and dynamic allocas (stacksave/stackrestore). These get added by the inliner as part of the return sequence and can prevent the transformation from happening in practice. Differential Revision: https://reviews.llvm.org/D69833	2019-11-05 06:39:22 -08:00
Francis Visoiu Mistrih	a5fc4b0861	[NFC][ObjC][ARC] Add tests for OptimizeRetainRVCall Add tests for bitcasts + zero GEPs, and pre-commit tests for lifetime markers.	2019-11-05 06:39:22 -08:00
Kazu Hirata	c844551e4a	[JumpThreading] Factor out common code to update the SSA form (NFC) Summary: This patch factors out common code to update the SSA form in JumpThreading.cpp -- partly for readability and partly to facilitate an coming patch of my own. Reviewers: wmi Subscribers: hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69811	2019-11-05 06:15:44 -08:00
Simon Pilgrim	d3f2640a6d	[GVN] Fix uninitialized variable warnings. NFCI.	2019-11-05 14:10:32 +00:00
Simon Pilgrim	9ea0ccd812	Add missing GVN =operator. NFCI. Fixes PVS Studio warning that the 'ValueTable' class implements a copy constructor, but lacks the '=' operator.	2019-11-05 13:41:50 +00:00
Sanjay Patel	d72f53cb0e	[InstCombine] add tests for shift-logic-shift; NFC This is based on existing CodeGen test files for x86 and AArch64. The corresponding potential transform is shown in: rL370617	2019-11-05 08:18:50 -05:00
Dávid Bolvanský	239b9756e4	[AtomicExpandPass] Silence static analyzer warnings about operator priority. NFCI.	2019-11-05 13:55:46 +01:00
David Green	73b826a198	[MachineScheduler] Enable AA in PostRA Machine scheduler This adds AA to Post-RA Machine Scheduling, allowing the pass more freedom when handling memory operations. My understanding is that this was just never done, not that it is inherently incorrect to do so. The older PostRA List scheduler already makes use of AA, it's just that the MI PostRA Scheduler was never taught to use it. Differential Revision: https://reviews.llvm.org/D69814	2019-11-05 11:58:50 +00:00
Nuno Lopes	105e48e377	[Docs] Add LangRef documentation for freeze instruction Summary: - Describe the new freeze instruction - Make it explicit that branch on undef/poison is UB Reviewers: chandlerc, majnemer, efriedma, nikic, reames, jdoerfert, lebedev.ri, regehr Subscribers: fhahn, bollu, lebedev.ri, delcypher, spatel, filcab, llvm-commits, aqjune Differential Revision: https://reviews.llvm.org/D29121	2019-11-05 11:35:55 +00:00
Thomas Preud'homme	857bdbdda2	Fix PR40644: miscompile indexed FP constant store Summary: Functions replaceStoreOfFPConstant() and OptimizeFloatStore() both replace store of float by a store of an integer unconditionally. However this generates wrong code when the store that is replaced is an indexed or truncating store. This commit solves this issue by adding an early return in these functions when the store being considered is not a normal store. Bug was only observed on out of tree targets, hence the lack of testcase in this commit. Reviewers: efriedma Subscribers: hiraditya, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68420	2019-11-05 11:07:52 +00:00
David Green	eebf5e9394	[ARM] Always enable UseAA in the arm backend This feature controls whether AA is used into the backend, and was previously turned on for certain subtargets to help create less constrained scheduling graphs. This patch turns it on for all subtargets, so that they can all make use of the extra information to produce better code. Differential Revision: https://reviews.llvm.org/D69796	2019-11-05 10:46:56 +00:00
David Green	1643bee451	[Scheduling][ARM] Consistently enable PostRA Machine scheduling In the ARM backend, for historical reasons we have only some targets using Machine Scheduling. The rest use the old list scheduler as they are using itinaries and the list scheduler seems to produce better code (and not crash running out of register on v6m codes). So whether to use the MIScheduler or not is checked at runtime from the subtarget features. This is fine, except for post-ra scheduling. Whether to use the old post-ra list scheduler or the post-ra machine schedule is decided as the pass manager is set up, in arms case from a newly constructed subtarget. Under some situations, like LTO, this won't include the correct cpu so can pick the wrong option. This can have a surprising effect on performance. To fix that, this patch overrides targetSchedulesPostRAScheduling and addPreSched2 in the ARM backend, adding _both_ post-ra schedulers and picking at runtime which to execute. To pick between the two I've had to add a enablePostRAMachineScheduler() method that normally returns enableMachineScheduler() && enablePostRAScheduler(), which can be overridden to enable just one of PostRAMachineScheduler vs PostRAScheduler. Thanks to David Penry for the identifying this problem. Differential Revision: https://reviews.llvm.org/D69775	2019-11-05 10:44:55 +00:00
Roman Lebedev	75ed81b55e	[LoopUnroll] peel-loop-conditions.ll: add some 'is even/odd' peeling tests	2019-11-05 13:02:57 +03:00
Roman Lebedev	1802097fe9	[InstCombine] dropRedundantMaskingOfLeftShiftInput(): truncation (PR42563) Summary: That fold keeps growing and growing :( I think this may be one of the last pieces for it. Since D67677/D67725, the fold knowns the general form of the pattern - where some masking is needed: https://rise4fun.com/Alive/F5R https://rise4fun.com/Alive/gslRa But there is one more huge piece missing - if you are extracting some bits, it is not impossible that the origin is wider than the extraction, i.e. there may be a truncation. And we don't deal with that yet. But we can, and the generalization remains fully identical: https://rise4fun.com/Alive/Uar https://rise4fun.com/Alive/5SW After a preparatory cleanup i think the diff looks rather clean. One missing piece is that in some patterns (especially pat. b), `-1` only needs to be `-1` in final type, but that is for later.. https://bugs.llvm.org/show_bug.cgi?id=42563 Reviewers: spatel, nikic Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69125	2019-11-05 12:41:26 +03:00

... 2 3 4 5 6 ...

187526 Commits