llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 12:02:58 +02:00

Author	SHA1	Message	Date
Krzysztof Parzyszek	bc87fb9cdd	[Hexagon] Allow setting register in BitVal without storing into map In the bit tracker, references to other bit values in which the register is 0 are prohibited. This means that generating self-referential register cells like { w:32 [0-15]:s[0-15] [16-31]:s[15] } is impossible. In order to get a self-referential cell, it had to be stored into a map and then reloaded from it. To avoid this step, add a function that will set the register to a given value without going through the map. llvm-svn: 296025	2017-02-23 22:08:50 +00:00
Stanislav Mekhanoshin	b57e8718cd	[AMDGPU] Shut the warning "getRegUnitWeight hides overload...". NFC. Clang issues warning about hidden overload. That was intended, so add "using AMDGPUGenRegisterInfo::getRegUnitWeight;" to mute it. llvm-svn: 296021	2017-02-23 21:51:28 +00:00
Adam Nemet	2de8602432	[ORE] Remove ORE.emit{{.+}} functions Last use was killed in my previous patch. The preferred way is now to construct the remark, pipe things to it and pass it to ORE.emit. llvm-svn: 296019	2017-02-23 21:32:53 +00:00
Kyle Butt	9b607f87d4	CodeGen: MachineBlockPlacement: Rename member to more general name. NFC. Rename ComputedTrellisEdges to ComputedEdges to allow for other methods of pre-computing edges. Differential Revision: https://reviews.llvm.org/D30308 llvm-svn: 296018	2017-02-23 21:22:24 +00:00
Adam Nemet	9021b155b4	[LAA] Remove unused LoopAccessReport The need for this removed when I converted everything to use the opt-remark classes directly with the streaming interface. llvm-svn: 296017	2017-02-23 21:17:36 +00:00
Adam Nemet	aa371b1205	[LV] Remove unused VectorizationReport The need for this removed when I converted everything to use the opt-remark classes directly with the streaming interface. llvm-svn: 296016	2017-02-23 21:17:31 +00:00
Evgeniy Stepanov	d639ffd862	Disable TLS for stack protector on Android API<17. The TLS slot did not exist back then. llvm-svn: 296014	2017-02-23 21:06:35 +00:00
Ahmed Bougacha	b3d7aab659	[GlobalISel] Emit opt remarks on isel fallbacks. Having more fine-grained information on the specific construct that caused us to fallback is valuable for large-scale data collection. We still have the fallback warning, that's also used for FastISel. We still need to remove the fallback warning, and teach FastISel to also emit remarks (it currently has a combination of the warning, stats, and debug prints: the remarks could unify all three). The abort-on-fallback path could also be better handled using remarks: one could imagine a "-Rpass-error", analoguous to "-Werror", which would promote missed/failed remarks to errors. It's not clear whether that would be useful for other remarks though, so we're not there yet. llvm-svn: 296013	2017-02-23 21:05:42 +00:00
Ahmed Bougacha	c860642291	[CodeGen] Teach opt remarks how to print MI instructions. This will be used with GISel opt remarks. llvm-svn: 296012	2017-02-23 21:05:33 +00:00
Ahmed Bougacha	66d4daa6cf	[CodeGen] Print MI without a newline when skipping debugloc. NFC. This matches the behavior for skip-operands. While there, document it. This is a follow-up to r296007. llvm-svn: 296011	2017-02-23 21:05:29 +00:00
Ahmed Bougacha	8fb389c4c1	[CodeGen] Use const MBBs in the opt remark diagnostics. NFC. llvm-svn: 296010	2017-02-23 21:05:23 +00:00
Stanislav Mekhanoshin	cfe4b7cb52	Correct register pressure calculation in presence of subregs If a subreg is used in an instruction it counts as a whole superreg for the purpose of register pressure calculation. This patch corrects improper register pressure calculation by examining operand's lane mask. Differential Revision: https://reviews.llvm.org/D29835 llvm-svn: 296009	2017-02-23 20:19:44 +00:00
Ahmed Bougacha	c14f2beb64	[ORE] Use const CodeRegions in the remark diagnostics. NFC. llvm-svn: 296008	2017-02-23 19:17:34 +00:00
Ahmed Bougacha	2ddd846178	[CodeGen] Add a way to SkipDebugLoc in MachineInstr::print(). NFC. llvm-svn: 296007	2017-02-23 19:17:31 +00:00
Ahmed Bougacha	786dda69b3	[GlobalISel] Simplify Select type cleanup using a ScopeExit. NFC. This lets us use more natural early-returns when selection fails. llvm-svn: 296006	2017-02-23 19:17:24 +00:00
Adrian Prantl	67ed215753	Revert "Teach the IR verifier to reject conflicting debug info for function arguments." This reverts commit r295749 while investigating PR32042. It looks like this check uncovered a problem in the frontend that needs to be fixed before the check can be enabled again. llvm-svn: 296005	2017-02-23 19:13:48 +00:00
Sanjay Patel	f14bbf566d	[DAG] add convenience function to get -1 constant; NFCI llvm-svn: 296004	2017-02-23 19:02:33 +00:00
Chad Rosier	2e3937b806	[Reassociate] Add negated value of negative constant to the Duplicates list. In OptimizeAdd, we scan the operand list to see if there are any common factors between operands that can be factored out to reduce the number of multiplies (e.g., 'AA+ABC+D' -> 'A(A+BC)+D'). For each operand of the operand list, we only consider unique factors (which is tracked by the Duplicate set). Now if we find a factor that is a negative constant, we add the negated value as a factor as well, because we can percolate the negate out. However, we mistakenly don't add this negated constant to the Duplicates set. Consider the expression A2-2 + B. Obviously, nothing to factor. For the added value A2*-2 we over count 2 as a factor without this change, which causes the assert reported in PR30256. The problem is that this code is assuming that all the multiply operands of the add are already reassociated. This change avoids the issue by making OptimizeAdd tolerate multiplies which haven't been completely optimized; this sort of works, but we're doing wasted work: we'll end up revisiting the add later anyway. Another possible approach would be to enforce RPO iteration order more strongly. If we have RedoInsts, we process them immediately in RPO order, rather than waiting until we've finished processing the whole function. Intuitively, it seems like the natural approach: reassociation works on expression trees, so the optimization only works in one direction. That said, I'm not sure how practical that is given the current Reassociate; the "optimal" form for an expression depends on its use list (see all the uses of "user_back()"), so Reassociate is really an iterative optimization of sorts, so any changes here would probably get messy. PR30256 Differential Revision: https://reviews.llvm.org/D30228 llvm-svn: 296003	2017-02-23 18:49:03 +00:00
Dehao Chen	3afae04d61	Use base discriminator in sample pgo profile matching. Summary: The discriminator has been encoded, and only the base discriminator should be used during profile matching. Reviewers: dblaikie, davidxl Reviewed By: dblaikie, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30218 llvm-svn: 295999	2017-02-23 18:27:45 +00:00
Krzysztof Parzyszek	89aba7c42c	[Hexagon] Avoid IMPLICIT_DEFs as new-value producers llvm-svn: 295997	2017-02-23 17:47:34 +00:00
Adam Nemet	d52db24563	[LazyMachineBFI] Reimplement with getAnalysisIfAvailable Since LoopInfo is not available in machine passes as universally as in IR passes, using the same approach for OptimizationRemarkEmitter as we did for IR will run LoopInfo and DominatorTree unnecessarily. (LoopInfo is not used lazily by ORE.) To fix this, I am modifying the approach I took in D29836. LazyMachineBFI now uses its client passes including MachineBFI itself that are available or otherwise compute them on the fly. So for example GreedyRegAlloc, since it's already using MBFI, will reuse that instance. On the other hand, AsmPrinter in Justin's patch will generate DT, LI and finally BFI on the fly. (I am of course wondering now if the simplicity of this approach is even preferable in IR. I will do some experiments.) Testing is provided by an updated version of D29837 which requires Justin's patch to bring ORE to the AsmPrinter. Differential Revision: https://reviews.llvm.org/D30128 llvm-svn: 295996	2017-02-23 17:30:01 +00:00
Filipe Cabecinhas	816647a966	[AddressSanitizer] Add PS4 offset llvm-svn: 295994	2017-02-23 17:10:28 +00:00
Sanjay Patel	7abc64b43b	[InstCombine] use loop instead of recursion to peek through FPExt; NFCI llvm-svn: 295992	2017-02-23 16:39:51 +00:00
Sanjay Patel	1456ca20c4	[InstCombine] use 'match' to reduce code; NFCI llvm-svn: 295991	2017-02-23 16:26:03 +00:00
Jan Vesely	4c879ee690	AMDGPU/SI: Fix trunc i16 pattern Hit on ASICs that support 16bit instructions. Differential Revision: https://reviews.llvm.org/D30281 llvm-svn: 295990	2017-02-23 16:12:21 +00:00
Simon Pilgrim	885bd5a0f3	Strip trailing whitespace. llvm-svn: 295989	2017-02-23 16:07:04 +00:00
Krzysztof Parzyszek	bde21eb695	[Hexagon] Patterns for CTPOP, BSWAP and BITREVERSE llvm-svn: 295981	2017-02-23 15:02:09 +00:00
Tobias Grosser	dbd459915a	[docs] Add information about how to checkout polly to getting started page llvm-svn: 295974	2017-02-23 14:27:07 +00:00
Diana Picus	cc63e86ff2	[ARM] GlobalISel: Lower call returns Introduce a common ValueHandler for call returns and formal arguments, and inherit two different versions for handling the differences (at the moment the only difference is the way physical registers are marked as used). llvm-svn: 295973	2017-02-23 14:18:41 +00:00
Alexey Bataev	437d10997b	[SLP] Fix for PR32036: Vectorized horizontal reduction returning wrong result Summary: If the same value is used several times as an extra value, SLP vectorizer takes it into account only once instead of actual number of using. For example: ``` int val = 1; for (int y = 0; y < 8; y++) { for (int x = 0; x < 8; x++) { val = val + input[y * 8 + x] + 3; } } ``` We have 2 extra rguments: `1` - initial value of horizontal reduction and `3`, which is added 8*8 times to the reduction. Before the patch we added `1` to the reduction value and added once `3`, though it must be added 64 times. Reviewers: mkuper, mzolotukhin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30262 llvm-svn: 295972	2017-02-23 13:37:09 +00:00
Diana Picus	587106e326	[ARM] GlobalISel: Lower call parameters in regs Add support for lowering calls with parameters than can fit into regs. Use the same ValueHandler that we used for function returns, but rename it to match its new, extended purpose. llvm-svn: 295971	2017-02-23 13:25:43 +00:00
Ayman Musa	b06a1c0393	[X86][AVX] Disable VCVTSS2SD & VCVTSD2SS memory folding and fix the register class of their first input when creating node in fast-isel. (Quick fix to buildbot failure after rL295940 commit). llvm-svn: 295970	2017-02-23 13:15:44 +00:00
Simon Dardis	0a7f7f003a	[mips][ias] Further relax operands of certain assembly instructions This patch adjusts the most relaxed predicate of immediate operands to accept immediate forms such as ~(0xf0000000\|0x000f00000). Previously these forms would be accepted by GAS and rejected by IAS. This partially resolves PR/30383. Thanks to Sean Bruno for reporting the issue! Reviewers: slthakur, seanbruno Differential Revision: https://reviews.llvm.org/D29218 llvm-svn: 295965	2017-02-23 12:40:58 +00:00
Kristof Beyls	9ca2feaac5	Fix assertion failure in ARMConstantIslandPass. The ARMConstantIslandPass didn't have support for handling accesses to constant island objects through ARM::t2LDRBpci instructions. This adds support for that. This fixes PR31997. llvm-svn: 295964	2017-02-23 12:24:55 +00:00
Simon Pilgrim	53c463a54d	Fix signed/unsigned comparison warning on MSVC llvm-svn: 295962	2017-02-23 12:00:34 +00:00
Alexey Bataev	04f7dbe68e	Revert "[SLP] Fix for PR32036: Vectorized horizontal reduction returning wrong" This reverts commit 7c5141e577d9efd1c8e3087566a38ce6b3a41a84. llvm-svn: 295957	2017-02-23 11:09:35 +00:00
Alexey Bataev	f86b9bd592	[SLP] Fix for PR32036: Vectorized horizontal reduction returning wrong result Summary: If the same value is used several times as an extra value, SLP vectorizer takes it into account only once instead of actual number of using. For example: ``` int val = 1; for (int y = 0; y < 8; y++) { for (int x = 0; x < 8; x++) { val = val + input[y * 8 + x] + 3; } } ``` We have 2 extra rguments: `1` - initial value of horizontal reduction and `3`, which is added 8*8 times to the reduction. Before the patch we added `1` to the reduction value and added once `3`, though it must be added 64 times. Reviewers: mkuper, mzolotukhin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30262 llvm-svn: 295956	2017-02-23 10:57:15 +00:00
Alexey Bataev	165908b766	Revert "[SLP] Fix for PR32036: Vectorized horizontal reduction returning wrong" This reverts commit d83c81ee6a8dea662808ac22b396d1bb0595c89d. llvm-svn: 295951	2017-02-23 09:59:29 +00:00
Alexey Bataev	06a8650a4b	[SLP] Fix for PR32036: Vectorized horizontal reduction returning wrong result Summary: If the same value is used several times as an extra value, SLP vectorizer takes it into account only once instead of actual number of using. For example: ``` int val = 1; for (int y = 0; y < 8; y++) { for (int x = 0; x < 8; x++) { val = val + input[y * 8 + x] + 3; } } ``` We have 2 extra rguments: `1` - initial value of horizontal reduction and `3`, which is added 8*8 times to the reduction. Before the patch we added `1` to the reduction value and added once `3`, though it must be added 64 times. Reviewers: mkuper, mzolotukhin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30262 llvm-svn: 295949	2017-02-23 09:40:38 +00:00
Ayman Musa	bc55907e4f	[X86][AVX512] Remove VCVTSS2SDZ & VCVTSD2SSZ from memory folding tables as they introduce new read dependency when folding. (Quick fix to buildbot fail). llvm-svn: 295946	2017-02-23 08:13:36 +00:00
Ayman Musa	b129f93b10	[X86][AVX512] Change VCVTSS2SD and VCVTSD2SS node types to keep consistency between VEX/EVEX versions. AVX versions of the converts work on f32/f64 types, while AVX512 version work on vectors. Differential Revision: https://reviews.llvm.org/D29988 llvm-svn: 295940	2017-02-23 07:24:21 +00:00
Matt Arsenault	6b427f77ee	LoadStoreVectorizer: Split even sized illegal chains properly Implement isLegalToVectorizeLoadChain for AMDGPU to avoid producing private address spaces accesses that will need to be split up later. This was doing the wrong thing in the case where the queried chain was an even number of elements. A possible <4 x i32> store was being split into store <2 x i32> store i32 store i32 rather than store <2 x i32> store <2 x i32> when legal. llvm-svn: 295933	2017-02-23 03:58:53 +00:00
Craig Topper	bf80823aa6	[X86][IR] In AutoUpgrade, check explicitly for xop.vpcmov and xop.vpcmov.256 instead of anything starting with xop.vpcmov There were some older intrinsics that only existed for less than a month in 2012 that still exist in some out of tree test files that start with this string, but aren't able to be handled by the current upgrade code and fire an assert. Now we'll go back to treating them as not intrinsics at all and just passing them through to output. Fixes PR32041, sort of. llvm-svn: 295930	2017-02-23 03:22:14 +00:00
Matt Arsenault	58420e6d19	TargetOptions: Fix not accounting for NoSignedZerosFPMath in == llvm-svn: 295928	2017-02-23 03:16:44 +00:00
Matthias Braun	87464acf61	Test if we can use raw strings on all platforms compiling LLVM. llvm-svn: 295917	2017-02-23 01:09:01 +00:00
Eli Friedman	cd77ac5bfa	Explicitly state the behavior of inbounds with a null pointer. See https://llvm.org/bugs/show_bug.cgi?id=31439; this reflects LLVM's behavior in practice, and should be compatible with C/C++ rules. Differential Revision: https://reviews.llvm.org/D28026 llvm-svn: 295916	2017-02-23 00:48:18 +00:00
Matt Arsenault	f74b618f95	AMDGPU: Replace disabled exp inputs with undef llvm-svn: 295914	2017-02-23 00:44:03 +00:00
Matt Arsenault	2effbfd8b3	AMDGPU: Add another BFE pattern This is the pattern that falls out of the instruction's definition if offset == 0. llvm-svn: 295912	2017-02-23 00:23:43 +00:00
Matt Arsenault	0e188d5eaa	AMDGPU: Use clamp with f64 llvm-svn: 295908	2017-02-22 23:53:37 +00:00
Michael Kuperstein	bb42bf14f7	Revert r295868 because it breaks a different SLP lit test. llvm-svn: 295906	2017-02-22 23:35:13 +00:00

... 3 4 5 6 7 ...

145509 Commits