llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Nirav Dave	0855b10b8c	[AMDGPU] Prevent too large store merges in AMDGPU Subtargets. NFCI. Various address spaces on the SI and R600 subtargets have stricter limits on memory access size that other address spaces. Use canMergeStoresTo predicate to prevent the DAGCombiner from creating these stores as they will be split up during legalization. llvm-svn: 303767	2017-05-24 15:59:09 +00:00
Matthew Simpson	fb2142e95d	[LV] Update type in cost model for scalarization For non-uniform instructions marked for scalarization, we should update `VectorTy` when computing instruction costs to reflect the scalar type. In addition to determining instruction costs, this type is also used to signal that all instructions in the loop will be scalarized. This currently affects memory instructions and non-pointer induction variables and their updates. (We also mark GEPs scalar after vectorization, but their cost is computed together with memory instructions.) For scalarized induction updates, this patch also scales the scalar cost by the vectorization factor, corresponding to each induction step. llvm-svn: 303763	2017-05-24 15:26:15 +00:00
Vadzim Dambrouski	a6f62bac86	[MSP430] Fix PR33050: Don't use ADD16ri to lower FrameIndex. Use ADDframe pseudo instruction instead. This will fix machine verifier error, and will help to fix PR32146. Differential Revision: https://reviews.llvm.org/D33452 llvm-svn: 303758	2017-05-24 15:08:30 +00:00
Sanjay Patel	49b8ffb260	[InstCombine] add tests to show potential missing folds; NFC As noted in https://bugs.llvm.org/show_bug.cgi?id=33138 and the comments, there are multiple ways to view this. If we choose not to solve this in InstCombine, these tests will serve as documentation of that choice. llvm-svn: 303755	2017-05-24 14:56:51 +00:00
Marek Olsak	2830ae4bc5	Revert "AMDGPU: Fold CI-specific complex SMRD patterns into existing complex patterns" This reverts commit e065977c4b5f68ab845400b256f6a3822b1325fa. It doesn't work. S_LOAD_DWORD_IMM_ci and friends aren't selected by any of the patterns, so it was putting 32-bit literals into the 8-bit field. llvm-svn: 303754	2017-05-24 14:53:50 +00:00
Sanjay Patel	733d481cb3	[InstCombine] add tests to document bitcast + bitwise-logic behavior; NFC The solution for PR26702 ( https://bugs.llvm.org/show_bug.cgi?id=26702 ) added a canonicalization rule, but the minimal regression tests don't demonstrate how that rule interacts with other folds. llvm-svn: 303750	2017-05-24 14:21:31 +00:00
Diana Picus	da6888ed6b	Revert "[SCEV] Do not fold dominated SCEVUnknown into AddRecExpr start" This reverts commit r303730 because it broke all the buildbots. llvm-svn: 303747	2017-05-24 14:16:04 +00:00
Krzysztof Parzyszek	d1acb6767a	[Hexagon] Fix comment in HexagonPacketizer::runOnMachineFunction Patch by Wei-Ren Chen. Differential Revision: https://reviews.llvm.org/D33439 llvm-svn: 303745	2017-05-24 13:43:42 +00:00
Jonas Paulsson	0437145b47	[LoopVectorizer] Let target prefer scalar addressing computations. The loop vectorizer usually vectorizes any instruction it can and then extracts the elements for a scalarized use. On SystemZ, all elements containing addresses must be extracted into address registers (GRs). Since this extraction is not free, it is better to have the address in a suitable register to begin with. By forcing address arithmetic instructions and loads of addresses to be scalar after vectorization, two benefits result: * No need to extract the register * LSR optimizations trigger (LSR isn't handling vector addresses currently) Benchmarking show improvements on SystemZ with this new behaviour. Any other target could try this by returning false in the new hook prefersVectorizedAddressing(). Review: Renato Golin, Elena Demikhovsky, Ulrich Weigand https://reviews.llvm.org/D32422 llvm-svn: 303744	2017-05-24 13:42:56 +00:00
Jonas Paulsson	ff729ee8c2	[SystemZ] Fix register modelling in expandLoadStackGuard() EXPENSIVE_CHECKS found this bug (https://bugs.llvm.org/show_bug.cgi?id=33047), which this patch fixes. The EAR instruction defines a GR32, not a GR64. Review: Ulrich Weigand llvm-svn: 303743	2017-05-24 13:15:48 +00:00
Tamas Berghammer	d324b78280	Demangler: Fix constructor cv qualifier handling Previously if we parsed a constructor then we set parsed_ctor_dtor_cv to true and never reseted it. This causes issue when a template argument references a constructor (e.g. type of lambda defined inside a constructor) as we will have the parsed_ctor_dtor_cv flag set what will cause issues when parsing later arguments. Differential Revision: https://reviews.llvm.org/D33385 libcxxabi change: https://reviews.llvm.org/rL303737 llvm-svn: 303738	2017-05-24 11:29:02 +00:00
Simon Pilgrim	04720b73cb	Strip trailing whitespace. NFCI. llvm-svn: 303736	2017-05-24 11:02:27 +00:00
Florian Hahn	f11b453050	[ARM] Remove ThumbTargetMachines. (NFC) Summary: Thumb code generation is controlled by ARMSubtarget and the concrete ThumbLETargetMachine and ThumbBETargetMachine are not needed. Eric Christopher suggested removing the unneeded target machines in https://reviews.llvm.org/D33287. I think it still makes sense to keep separate TargetMachines for big and little endian as we probably do not want to have different endianess for difference functions in a single compilation unit. The MIPS backend has two separate TargetMachines for big and little endian as well. Reviewers: echristo, rengolin, kristof.beyls, t.p.northover Reviewed By: echristo Subscribers: aemerson, javed.absar, arichardson, llvm-commits Differential Revision: https://reviews.llvm.org/D33318 llvm-svn: 303733	2017-05-24 10:18:57 +00:00
Mikael Holmen	505686fe36	MachineCSE: Respect interblock physreg liveness Summary: This is a fix for PR32538. MachineCSE first looks at MO.isDead(), but if it is not marked dead, MachineCSE still wants to do its own check to see if it is trivially dead. This check for the trivial case assumed that physical registers cannot be live out of a block. Patch by Mattias Eriksson. Reviewers: qcolombet, jbhateja Reviewed By: qcolombet, jbhateja Subscribers: jbhateja, llvm-commits Differential Revision: https://reviews.llvm.org/D33408 llvm-svn: 303731	2017-05-24 09:35:23 +00:00
Max Kazantsev	b982667438	[SCEV] Do not fold dominated SCEVUnknown into AddRecExpr start When folding arguments of AddExpr or MulExpr with recurrences, we rely on the fact that the loop of our base recurrency is the bottom-lost in terms of domination. This assumption may be broken by an expression which is treated as invariant, and which depends on a complex Phi for which SCEVUnknown was created. If such Phi is a loop Phi, and this loop is lower than the chosen AddRecExpr's loop, it is invalid to fold our expression with the recurrence. Another reason why it might be invalid to fold SCEVUnknown into Phi start value is that unlike other SCEVs, SCEVUnknown are sometimes position-bound. For example, here: for (...) { // loop phi = {A,+,B} } X = load ... Folding phi + X into {A+X,+,B}<loop> actually makes no sense, because X does not exist and cannot exist while we are iterating in loop (this memory can be even not allocated and not filled by this moment). It is only valid to make such folding if X is defined before the loop. In this case the recurrence {A+X,+,B}<loop> may be existant. This patch prohibits folding of SCEVUnknown (and those who use them) into the start value of an AddRecExpr, if this instruction is dominated by the loop. Merging the dominating unknown values is still valid. Some tests that relied on the fact that some SCEVUnknown should be folded into AddRec's are changed so that they no longer expect such behavior. llvm-svn: 303730	2017-05-24 08:52:18 +00:00
Daniel Sanders	8cba4db078	Explicitly set CPU and -slow-incdec to try to fix r303678's test on llvm-clang-x86_64-expensive-checks-win. llvm-svn: 303727	2017-05-24 07:02:37 +00:00
Craig Topper	bc96c01cc7	[APInt] Use std::end to avoid mentioning the size of a local buffer repeatedly. llvm-svn: 303726	2017-05-24 07:00:55 +00:00
Daniel Sanders	7a023e79e3	Revert r303720: Tweak r303678's test to try to fix llvm-clang-x86_64-expensive-checks-win. It doesn't fix that builder. llvm-svn: 303721	2017-05-24 06:44:55 +00:00
Daniel Sanders	2372dc999f	Tweak r303678's test to try to fix llvm-clang-x86_64-expensive-checks-win. I suspect this buildbot has slow-incdec set by default, most likely due to the default CPU having this set. This feature bit can prevent optsize from having an effect on this IR. llvm-svn: 303720	2017-05-24 06:05:14 +00:00
Javed Absar	dba9bd3ffb	[ARM] Add VLDx/VSTx sched defs for machine-schedulers. NFCI This patch adds missing scheds for Neon VLDx/VSTx instructions. This will help one write schedulers easier/faster in the future for ARM sub-targets. Existing models will not affected by this patch. Reviewed by: Renato Golin, Diana Picus Differential Revision: https://reviews.llvm.org/D33120 llvm-svn: 303717	2017-05-24 05:32:48 +00:00
Davide Italiano	b88f0473eb	[NewGVN] Update additionalUsers when we simplify to a value. Otherwise we don't revisit an instruction that could be simplified, and when we verify, we discover there's something that changed, i.e. what we had wasn't a maximal fixpoint. Fixes PR32836. llvm-svn: 303715	2017-05-24 02:30:24 +00:00
Zachary Turner	e08c0bfd5f	Fix broken build. llvm-svn: 303711	2017-05-24 00:35:32 +00:00
George Karpenkov	71109af4b2	Revert "Disable coverage opt-out for strong postdominator blocks." This reverts commit 2ed06f05fc10869dd1239cff96fcdea2ee8bf4ef. Buildbots do not like this on Linux. llvm-svn: 303710	2017-05-24 00:29:12 +00:00
George Karpenkov	279ecf005a	Revert "Fixes for tests for r303698" This reverts commit 69bfaf72e7502eb08bbca88a57925fa31c6295c6. llvm-svn: 303709	2017-05-24 00:29:08 +00:00
Zachary Turner	b94565d13f	git-llvm script should add .exe on Windows. llvm-svn: 303708	2017-05-24 00:28:46 +00:00
Zachary Turner	f26f4698dc	Don't do a full scan of the type stream before processing records. LazyRandomTypeCollection is designed for random access, and in order to provide this it lazily indexes ranges of types. In the case of types from an object file, there is no partial index to build off of, so it has to index the full stream up front. However, merging types only requires sequential access, and when that is needed, this extra work is simply wasted. Changing the algorithm to work on sequential arrays of types rather than random access type collections eliminates this up front scan. llvm-svn: 303707	2017-05-24 00:26:27 +00:00
Davide Italiano	66cbd9414c	[SCCP] Use the `hasAddressTaken()` version defined in `Function`. Instead of using the SCCP homegrown one. We should eventually make the private SCCP version disappear, but that wont' be today. PR33143 tracks this issue. Add braces for consistency while here. No functional change intended. llvm-svn: 303706	2017-05-23 23:59:23 +00:00
Davide Italiano	c64bec5149	[LIR] Use the newly `getRecurrenceVar()` helper. NFCI. llvm-svn: 303704	2017-05-23 23:51:54 +00:00
George Karpenkov	591fd73a90	Fixes for tests for r303698 llvm-svn: 303701	2017-05-23 22:42:34 +00:00
Davide Italiano	8ac428956c	[LIR] Strengthen the check for recurrence variable in popcnt/CTLZ. Fixes PR33114. Differential Revision: https://reviews.llvm.org/D33420 llvm-svn: 303700	2017-05-23 22:32:56 +00:00
George Karpenkov	81ffe26dc0	Disable coverage opt-out for strong postdominator blocks. Coverage instrumentation has an optimization not to instrument extra blocks, if the pass is already "accounted for" by a successor/predecessor basic block. However (https://github.com/google/sanitizers/issues/783) this reasoning may become circular, which stops valid paths from having coverage. In the worst case this can cause fuzzing to stop working entirely. This change simplifies logic to something which trivially can not have such circular reasoning, as losing valid paths does not seem like a good trade-off for a ~15% decrease in the # of instrumented basic blocks. llvm-svn: 303698	2017-05-23 21:58:54 +00:00
Tim Northover	c81dd677e6	Revert LLVM changes for "Sema: allow imaginary constants via GNU extension if UDL overloads not present." The changes accidentally crept into a Clang commit I was making. llvm-svn: 303697	2017-05-23 21:53:11 +00:00
Rui Ueyama	647132cc23	[git-llvm] Check if svn is installed. The error message that git-llvm script prints out when svn is missing is very cryptic. I spent a fair amount of time to find what was wrong with my environment. It looks like many newcomers also exprienced a hard time to submit their first patches due to this error. This patch adds a more user-friendly error message. Differential Revision: https://reviews.llvm.org/D33458 llvm-svn: 303696	2017-05-23 21:50:40 +00:00
Vadzim Dambrouski	a328f25e7f	[MSP430] Add subtarget features for hardware multiplier. Also add more processors to make -mcpu option behave similar to gcc. Differential Revision: https://reviews.llvm.org/D33335 llvm-svn: 303695	2017-05-23 21:49:42 +00:00
Tim Northover	03923e811e	Sema: allow imaginary constants via GNU extension if UDL overloads not present. C++14 added user-defined literal support for complex numbers so that you can write something like "complex<double> val = 2i". However, there is an existing GNU extension supporting this syntax and interpreting the result as a _Complex type. This changes parsing so that such literals are interpreted in terms of C++14's operators if an overload is present but otherwise falls back to the original GNU extension. llvm-svn: 303694	2017-05-23 21:41:49 +00:00
Reid Kleckner	db1d4c56a6	Silence MSVC warning about unsigned integer overflow, which has defined behavior llvm-svn: 303693	2017-05-23 21:35:32 +00:00
Francis Visoiu Mistrih	9515ddb7fd	abtest: remove duplicate script This is fixing a mistake from r303690. Differential Revision: https://reviews.llvm.org/D33303 llvm-svn: 303692	2017-05-23 21:28:41 +00:00
Simon Pilgrim	04596189c2	[AMDGPU] Add INDIRECT_BASE_ADDR to R600_Reg32 class (PR33045) This fixes 17 of the 41 -verify-machineinstrs test failures identified in PR33045 Differential Revision: https://reviews.llvm.org/D33451 llvm-svn: 303691	2017-05-23 21:27:15 +00:00
Francis Visoiu Mistrih	4aa7483a63	AsmPrinter: mark the beginning and the end of a function in verbose mode llvm-svn: 303690	2017-05-23 21:22:16 +00:00
Tom Stellard	082356cda2	merge-request.sh: Use https url for bugzilla With the http url, the script fails with: Connection lost/failed: 411 Client Error: Length Required llvm-svn: 303685	2017-05-23 20:35:38 +00:00
Changpeng Fang	a2949e55f0	AMDGPU/SI: Move the local memory usage related checking after calling convention checking in PromoteAlloca Summary: Promoting Alloca to Vector and Promoting Alloca to LDS are two independent handling of Alloca and should not affect each other. As a result, we should not give up promoting to vector if there is not enough LDS. This patch factors out the local memory usage related checking out and replace it after the calling convention checking. Reviewer: arsenm Differential Revision: http://reviews.llvm.org/D33139 llvm-svn: 303684	2017-05-23 20:25:41 +00:00
Daniel Sanders	00072f58f8	Fix unused variable warnings after r303678 This should fix lld-x86_64-darwin13 llvm-svn: 303683	2017-05-23 20:02:48 +00:00
Geoff Berry	30786023f2	[AArch64][Falkor] Refine sched details for LSLfast/ASRfast. llvm-svn: 303682	2017-05-23 19:57:45 +00:00
Stanislav Mekhanoshin	6d04b87725	[AMDGPU] Combine and (srl) into shl (bfe) Perform DAG combine: and (srl x, c), mask => shl (bfe x, nb + c, mask >> nb), nb Where nb is a number of trailing zeroes in mask. It replaces two instructions with two and BFE is generally a more expensive one. However this is only done if we are selecting a byte or word at an aligned boundary which results in a proper SDWA operand pattern. It is only done if SDWA is supported. TODO: improve SDWA pass to actually convert this pattern. It is not done now because we have an immediate in the instruction, which has be moved into a VGPR. Differential Revision: https://reviews.llvm.org/D33455 llvm-svn: 303681	2017-05-23 19:54:48 +00:00
Geoff Berry	267ecfb00c	[AArch64][Falkor] Fix sched details for FMOV of WZR/XZR. llvm-svn: 303680	2017-05-23 19:54:28 +00:00
Oleg Ranevskyy	12688b77bb	[ARM] Temporarily disable globals promotion to constant pools to prevent miscompilation Summary: A temporary workaround for PR32780 - rematerialized instructions accessing the same promoted global through different constant pool entries. The patch turns off the globals promotion optimization leaving all its code in place, so that it can be easily turned on once PR32780 is fixed. Since this is a miscompilation issue causing generation of misbehaving code, and the problem is very subtle, the patch might be valuable enough to get into 4.0.1. Reviewers: efriedma, jmolloy Reviewed By: efriedma Subscribers: aemerson, javed.absar, llvm-commits, rengolin, asl, tstellar Differential Revision: https://reviews.llvm.org/D33446 llvm-svn: 303679	2017-05-23 19:38:37 +00:00
Daniel Sanders	fa3e6b4d4e	[globalisel][tablegen] Add support for (set $dst, 1) and test X86's OptForSize predicate. Summary: It's rare but a small number of patterns use IntInit's at the root of the match. On X86, one such rule is enabled by the OptForSize predicate and causes the compiler to use the smaller: %0 = MOV32r1 instead of the usual: %0 = MOV32ri 1 This patch adds support for matching IntInit's at the root and uses this as a test case for the optsize attribute that was implemented in r301750 Reviewers: qcolombet, ab, t.p.northover, rovka, kristof.beyls, aditya_nandakumar Reviewed By: qcolombet Subscribers: igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D32791 llvm-svn: 303678	2017-05-23 19:33:16 +00:00
Zachary Turner	8d54831bad	[CodeView] Eliminate redundant hashes and allocations. When writing field list records, we would construct a temporary type serializer that shared a bump ptr allocator with the rest of the application, so anything allocated from here would live forever. Furthermore, this temporary serializer had all the properties of a full blown serializer including record hashing and de-duplication. These features are required when you're merging multiple type streams into each other, because different streams may contain identical records, but records from the same type stream will never collide with each other. So all of this hashing was unnecessary. To solve this, two fixes are made: 1) The temporary serializer keeps its own bump ptr allocator instead of sharing a global one. When it's finished, all of its memory is freed. 2) Instead of using the same temporary serializer for the life of an entire type stream, we use it only for the life of a single field list record and delete it when the field list record is completed. This way the hash table will not grow as other records from the same type stream get inserted. Further improvements could eliminate hashing entirely from this codepath. This reduces the link time by 85% in my test, from 1 minute to 9 seconds. llvm-svn: 303676	2017-05-23 18:56:23 +00:00
Nirav Dave	904f5d5652	[DAG] Add AddressSpace parameter to canMergeStoresTo. NFC. llvm-svn: 303673	2017-05-23 18:53:02 +00:00
Craig Topper	feeefe7bdd	[InstSimplify] Add more tests for undef inputs and multiplying by 0 for the add/sub/mul with overflow intrinsics. NFC llvm-svn: 303671	2017-05-23 18:42:58 +00:00

1 2 3 4 5 ...

149361 Commits