llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 21:42:54 +02:00

Author	SHA1	Message	Date
Gadi Haber	51f2170fda	This is a large patch for X86 AVX-512 of an optimization for reducing code size by encoding EVEX AVX-512 instructions using the shorter VEX encoding when possible. There are cases of AVX-512 instructions that have two possible encodings. This is the case with instructions that use vector registers with low indexes of 0 - 15 and do not use the zmm registers or the mask k registers. The EVEX encoding prefix requires 4 bytes whereas the VEX prefix can take only up to 3 bytes. Consequently, using the VEX encoding for these instructions results in a code size reduction of ~2 bytes even though it is compiled with the AVX-512 features enabled. Reviewers: Craig Topper, Zvi Rackoover, Elena Demikhovsky Differential Revision: https://reviews.llvm.org/D27901 llvm-svn: 290663	2016-12-28 10:12:48 +00:00
Chandler Carruth	69a65a83e5	[PM] Teach the inliner's call graph update to handle inserting new edges when they are call edges at the leaf but may (transitively) be reached via ref edges. It turns out there is a simple rule: insert everything as a ref edge which is a safe conservative default. Then we let the existing update logic handle promoting some of those to call edges. Note that it would be fairly cheap to make these call edges right away if that is desirable by testing whether there is some existing call path from the source to the target. It just seemed like slightly more complexity in this code path that isn't strictly necessary. If anyone feels strongly about handling this differently I'm happy to change it. llvm-svn: 290649	2016-12-28 03:13:12 +00:00
Craig Topper	3c08d598fe	[InstCombine] Remove a piece of a comment that said that InstCombiner contains pass infrastructure. That hasn't been true since r226618. NFC llvm-svn: 290648	2016-12-28 03:12:42 +00:00
Chandler Carruth	870a2669f0	[PM] Actually commit the test update that was supposed to accompany r290644. Sorry for this. llvm-svn: 290646	2016-12-28 02:31:24 +00:00
Chandler Carruth	ff1b18d787	[LCG] Teach the ref edge removal to handle a ref edge that is trivial due to a call cycle. This actually crashed the ref removal before. I've added a unittest that covers this kind of interesting graph structure and mutation. llvm-svn: 290645	2016-12-28 02:24:58 +00:00
Chandler Carruth	34272b757e	[PM] Disable the loop vectorizer from the new PM's pipeline as it currently relies on the old PM's dependency system forming LCSSA. The new PM will require a different design for this, and for now this is causing most of the issues I'm currently seeing in testing. I'd like to get to a testable baseline and then work on re-enabling things one at a time. llvm-svn: 290644	2016-12-28 02:24:55 +00:00
Michael Kuperstein	04dff9bc7a	[InstCombine] Canonicalize insert splat sequences into an insert + shuffle This adds a combine that canonicalizes a chain of inserts which broadcasts a value into a single insert + a splat shufflevector. This fixes PR31286. Differential Revision: https://reviews.llvm.org/D27992 llvm-svn: 290641	2016-12-28 00:18:08 +00:00
Kostya Serebryany	d6593db5e1	[libFuzzer] add an experimental flag -experimental_len_control=1 that sets max_len to 1M and tries to increase the actual max sizes of mutations very gradually (second attempt) llvm-svn: 290637	2016-12-27 23:24:55 +00:00
Eric Fiselier	03477e9665	Mark comparator call operator as const llvm-svn: 290636	2016-12-27 23:15:58 +00:00
Kostya Serebryany	b6d58e94d4	[libFuzzer] don't create large random mutations when given an empty seed llvm-svn: 290634	2016-12-27 22:15:04 +00:00
Kostya Serebryany	aece9ad2f5	[sanitizer-coverage] sort the switch cases llvm-svn: 290628	2016-12-27 21:20:06 +00:00
Hemant Kulkarni	2cd4d50d15	llvm-readobj: ELF: Make DT tags machine aware llvm-svn: 290623	2016-12-27 19:59:29 +00:00
Kostya Serebryany	647bec73f9	[libFuzzer] fix UB and simplify the computation of the RNG seed (https://llvm.org/bugs/show_bug.cgi?id=31456 ) llvm-svn: 290622	2016-12-27 19:51:34 +00:00
Chandler Carruth	888ae91182	[PM] Teach MemDep to invalidate its result object when its cached analysis handles become invalid. Add a test case for its invalidation logic. llvm-svn: 290620	2016-12-27 19:33:04 +00:00
Saleem Abdulrasool	4c681cb7de	DebugInfo: add explicit casts for -Wcast-qual Fix a warning detected by gcc 6: warning: cast from type 'const void' to type 'uint8_t {aka unsigned char*}' casts away qualifiers [-Wcast-qual] llvm-svn: 290618	2016-12-27 18:35:24 +00:00
Saleem Abdulrasool	322826fdb1	ASMParser: use range-based for loops (NFC) Convert the verify method to use a few more range based for loops, converting to const iterators in the process. llvm-svn: 290617	2016-12-27 18:35:22 +00:00
Saleem Abdulrasool	33b04261ab	test: modernise ARM CodeGen tests Replace the use of grep with FileCheck. Tidy up some of the tests. A few of the tests have been left as weak as previously, though some have been made more stringent. llvm-svn: 290616	2016-12-27 18:35:19 +00:00
Davide Italiano	bb3467c0ce	[NewGVN] Simplify a bit removing else after return. NFCI. llvm-svn: 290615	2016-12-27 18:15:39 +00:00
Chandler Carruth	57ab1e6b3b	[PM] Remove a pointless optimization. There is no need to do this within an analysis. That method shouldn't even be reached if this predicate holds as the actual useful optimization is in the analysis manager itself. llvm-svn: 290614	2016-12-27 18:04:11 +00:00
Chad Rosier	93ef7f9c97	Attempt to make the Windows bots green after r290609. llvm-svn: 290613	2016-12-27 18:02:27 +00:00
Chandler Carruth	9a136f0e41	[PM] Add more dedicated testing to cover the invalidation logic added to BasicAA in r290603. I've kept the basic testing in the new PM test file as that also covers the AAManager invalidation logic. If/when there is a good place for broader AA testing it could move there. This test is somewhat unsatisfying as I can't get it to fail even with ASan outside of explicit checks of the invalidation. Apparently we don't yet have any test coverage of the BasicAA code paths using either the domtree or loopinfo -- I made both of them always be null and check-llvm passed. llvm-svn: 290612	2016-12-27 17:59:22 +00:00
Bryant Wong	aa5f57bf44	[MemCpyOpt] Don't sink LoadInst below possible clobber. Differential Revision: https://reviews.llvm.org/D26811 llvm-svn: 290611	2016-12-27 17:58:12 +00:00
Teresa Johnson	b787681e80	[ThinLTO] Fix "\|\|" vs "\|" mixup. The effect of the bug was that we would incorrectly create summaries for global and weak values defined in module asm (since we were essentially testing for bit 1 which is SF_Undefined, and the RecordStreamer ignores local undefined references). This would have resulted in conservatively disabling importing of anything referencing globals and weaks defined in module asm. Added these cases to the test which now fails without this bug fix. Fixes PR31459. llvm-svn: 290610	2016-12-27 17:45:09 +00:00
Chad Rosier	74fe45fa61	[AArch64][AsmParser] Add support for parsing shift/extend operands with symbols. Differential Revision: https://reviews.llvm.org/D27953 llvm-svn: 290609	2016-12-27 16:58:09 +00:00
Artem Tamazov	37997ebae5	[AMDGPU][llvm-mc] Predefined symbols to access register counts (.kernel.{v\|s}gpr_count) The feature allows for conditional assembly, filling the entries of .amd_kernel_code_t etc. Symbols are defined with value 0 at the beginning of each kernel scope. After each register usage, the respective symbol is set to: value = max( value, ( register index + 1 ) ) Thus, at the end of scope the value represents a count of used registers. Kernel scopes begin at .amdgpu_hsa_kernel directive, end at the next .amdgpu_hsa_kernel (or EOF, whichever comes first). There is also dummy scope that lies from the beginning of source file til the first .amdgpu_hsa_kernel. Test added. Differential Revision: https://reviews.llvm.org/D27859 llvm-svn: 290608	2016-12-27 16:00:11 +00:00
Piotr Padlewski	0d28db774c	[MemDep] Operand visited twice bugfix Because the operand was not marked as seen, it was visited twice. This doesn't change the behavior of the optimization, it just saves a redundant visit, so no test changes. llvm-svn: 290607	2016-12-27 15:06:07 +00:00
Eugene Leviant	4135139bc6	RuntimeDyldELF: refactor AArch64 relocations. NFC. llvm-svn: 290606	2016-12-27 13:33:32 +00:00
Eugene Leviant	1c3305dfe1	Fix unit test in NDEBUG build llvm-svn: 290604	2016-12-27 11:07:53 +00:00
Chandler Carruth	b6ef94fb75	[PM] Teach BasicAA how to invalidate its result object. This requires custom handling because BasicAA caches handles to other analyses and so it needs to trigger indirect invalidation. This fixes one of the common crashes when using the new PM in real pipelines. I've also tweaked a regression test to check that we are at least handling the most immediate case. I'm going to work at re-structuring this test some to both scale better (rather than all being in one file) and check more invalidation paths in a follow-up commit, but I wanted to get the basic bug fix in place. llvm-svn: 290603	2016-12-27 10:30:45 +00:00
Eugene Leviant	85fe87b505	Attempt to fix build bot after r290597 llvm-svn: 290602	2016-12-27 10:24:58 +00:00
Chandler Carruth	f3883975e2	[PM] Disable more of the loop passes -- LCSSA and LoopSimplify are also not really wired into the loop pass manager in a way that will let us productively use these passes yet. This lets the new PM get farther in basic testing which is useful for establishing a good baseline of "doesn't explode". There are still plenty of crashers in basic testing though, this just gets rid of some noise that is well understood and not representing a specific or narrow bug. llvm-svn: 290601	2016-12-27 10:16:46 +00:00
Sam Kolton	db7d918144	[AMDGPU] Assembler: support SDWA and DPP for VOP2b instructions Reviewers: nhaustov, artem.tamazov, vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28051 llvm-svn: 290599	2016-12-27 10:06:42 +00:00
Eugene Leviant	647a39b28b	RuntimeDyldELF: add R_AARCH64_ADD_ABS_LO12_NC reloc Differential revision: https://reviews.llvm.org/D28115 llvm-svn: 290598	2016-12-27 09:51:38 +00:00
Eugene Leviant	57321df98a	Allow setting multiple debug types Differential revision: https://reviews.llvm.org/D28109 llvm-svn: 290597	2016-12-27 09:31:20 +00:00
Daniel Berlin	51fae41e7f	Change a std::vector to SmallVector in NewGVN llvm-svn: 290596	2016-12-27 09:20:36 +00:00
Chandler Carruth	8724378ec0	[PM] Teach the AAManager and AAResults layer (the worst offender for inter-analysis dependencies) to use the new invalidation infrastructure. This teaches it to invalidate itself when any of the peer function AA results that it uses become invalid. We do this by just tracking the originating IDs. I've kept it in a somewhat clunky API since some users of AAResults are outside the new PM right now. We can clean this API up if/when those users go away. Secondly, it uses the registration on the outer analysis manager proxy to trigger deferred invalidation when a module analysis result becomes invalid. I've included test cases that specifically try to trigger use-after-free in both of these cases and they would crash or hang pretty horribly for me even without ASan. Now they work nicely. The `InvalidateAnalysis` utility pass required some tweaking to be useful in this context and it still is pretty garbage. I'd like to switch it back to the previous implementation and teach the explicit invalidate method on the AnalysisManager to take care of correctly triggering indirect invalidation, but I wanted to go ahead and send this out so folks could see how all of this stuff works together in practice. And, you know, that it does actually work. =] Differential Revision: https://reviews.llvm.org/D27205 llvm-svn: 290595	2016-12-27 08:44:39 +00:00
Chandler Carruth	99ebb6e7dc	[PM] Introduce the facilities for registering cross-IR-unit dependencies that require deferred invalidation. This handles the other real-world invalidation scenario that we have cases of: a function analysis which caches references to a module analysis. We currently do this in the AA aggregation layer and might well do this in other places as well. Since this is relatively rare, the technique is somewhat more cumbersome. Analyses need to register themselves when accessing the outer analysis manager's proxy. This proxy is already necessarily present to allow access to the outer IR unit's analyses. By registering here we can track and trigger invalidation when that outer analysis goes away. To make this work we need to enhance the PreservedAnalyses infrastructure to support a (slightly) more explicit model for "sets" of analyses, and allow abandoning a single specific analysis even when a set covering that analysis is preserved. That allows us to describe the scenario of preserving all Function analyses except for the one where deferred invalidation has triggered. We also need to teach the invalidator API to support direct ID calls instead of always going through a template to dispatch so that we can just record the ID mapping. I've introduced testing of all of this both for simple module<->function cases as well as for more complex cases involving a CGSCC layer. Much like the previous patch I've not tried to fully update the loop pass management layer because that layer is due to be heavily reworked to use similar techniques to the CGSCC to handle updates. As that happens, we'll have a better testing basis for adding support like this. Many thanks to both Justin and Sean for the extensive reviews on this to help bring the API design and documentation into a better state. Differential Revision: https://reviews.llvm.org/D27198 llvm-svn: 290594	2016-12-27 08:40:39 +00:00
Chandler Carruth	3a1d9fe91a	[PM] Turn on the new PM's inliner in addition to the current one for most of the inliner test cases. The inliner involves a bunch of interesting code and tends to be where most of the issues I've seen experimenting with the new PM lie. All of these test cases pass, but I'd like to keep some more thorough coverage here so doing a fairly blanket enabling. There are a handful of interesting tests I've not enabled yet because they're focused on the always inliner, or on functionality that doesn't (yet) exist in the inliner. llvm-svn: 290592	2016-12-27 07:18:43 +00:00
Craig Topper	a8167acce2	[AVX-512] Add all forms of VPALIGNR, VALIGND, and VALIGNQ to the load folding tables. llvm-svn: 290591	2016-12-27 06:51:09 +00:00
Chandler Carruth	fba79f0538	[PM] Add one of the features left out of the initial inliner patch: skipping indirectly recursive inline chains. To do this, we implicitly build an inline stack for each callsite and check prior to inlining that doing so would not form a cycle. This uses the exact same technique and even shares some code with the legacy PM inliner. This solution remains deeply unsatisfying to me because it means we cannot actually iterate the inliner externally. Doing so would not be able to easily detect and avoid such cycles. Some day I would very much like to have a solution that works without this internal state to detect cycles, but this is not that day. llvm-svn: 290590	2016-12-27 06:46:20 +00:00
Chandler Carruth	44c5a10c53	[PM] Wire up another test to the new pass manager. Nothing really interesting here, but I had to improve the test to use variables rather than hard coding value names as we happen to end up with different value names in the new PM. llvm-svn: 290589	2016-12-27 06:46:16 +00:00
George Burgess IV	1ca7f8d821	[Analysis] Ignore `nobuiltin` on `allocsize` function calls. We currently ignore the `allocsize` attribute on functions calls with the `nobuiltin` attribute when trying to lower `@llvm.objectsize`. We shouldn't care about `nobuiltin` here: `allocsize` is explicitly added by the user, not inferred based on a function's symbol. llvm-svn: 290588	2016-12-27 06:32:14 +00:00
George Burgess IV	fc289b9de6	[Analysis] Refactor as promised in r290397. This also makes us no longer check for `allocsize` on intrinsic calls. This shouldn't matter, since intrinsics should provide the information we get from `allocsize` on their own. llvm-svn: 290585	2016-12-27 06:10:50 +00:00
Craig Topper	17201bf9c4	[AVX-512] Remove masked pmuldq and pmuludq intrinsics and autoupgrade them to unmasked intrinsics plus a select. llvm-svn: 290583	2016-12-27 05:30:14 +00:00
Craig Topper	404ccdcfc2	[InstCombine][X86] Add DemandedElts support for 512-bit PMULDQ/PMULUDQ instructions PMULDQ/PMULUDQ vXi64 instructions only use the even numbered v2Xi32 input elements which SimplifyDemandedVectorElts should try and use. This builds on r290554 which added support for 128 and 256-bit. llvm-svn: 290582	2016-12-27 05:30:09 +00:00
Chandler Carruth	c51940f6af	[LCG] Teach the LazyCallGraph to handle visiting the blockaddress constant expression and to correctly form function reference edges through them without crashing because one of the operands (the `BasicBlock` isn't actually a constant despite being an operand of a constant). llvm-svn: 290581	2016-12-27 05:00:45 +00:00
Craig Topper	8da01565db	[AVX-512] Add 512-bit unmasked intrinsics for pmuldq and pmuludq so we can add them to InstCombine with the 128 and 256 bit versions. The 128 and 256 bit masked intrinsics are currently unused by clang. The sse and avx2 unmasked intrinsics are used instead. The new 512-bit intrinsic will be used to do the same. Then all masked versions will be removed and autoupgraded. llvm-svn: 290573	2016-12-27 03:46:05 +00:00
Chandler Carruth	56ddbe594d	[PM] Teach the inliner in the new PM to merge attributes after inlining. Also enable the new PM in the attributes test case which caught this issue. llvm-svn: 290572	2016-12-27 03:39:54 +00:00
Chandler Carruth	a375a1b982	[Inliner] Modernize all of the inliner tests that were using grep. This mostly involved converting from grep to FileCheck and tidying up the IR used. In one case (invoke_test-3.ll) the test had become completely pointless as we use 'resume' rather than 'unwind' now, and even then it did not occur at the end of the line. llvm-svn: 290570	2016-12-27 02:47:37 +00:00
Craig Topper	9b6bc7d9d3	[AVX-512][InstCombine] Teach InstCombine to turn masked scalar add/sub/mul/div with rounding intrinsics into normal IR operations if the rounding mode is CUR_DIRECTION. An earlier commit added support for unmasked scalar operations. At that time isel wouldn't generate an optimal sequence for masked operations, but that has now been fixed. llvm-svn: 290566	2016-12-27 01:56:30 +00:00
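
The eligibility rule described in commit 51f2170fda (EVEX to VEX compression) can be sketched roughly as follows. This is only an illustration of the conditions named in that commit message (register indexes 0 - 15, no zmm registers, no mask k registers); the `Reg` type and the `can_use_vex` function are hypothetical names and not part of LLVM, and the actual pass presumably also has to check that a VEX-encoded form of the instruction exists.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Reg:
    """Hypothetical register operand: a class name plus an index."""
    cls: str    # assumed to be one of "xmm", "ymm", "zmm", "k"
    index: int


def can_use_vex(operands):
    """Return True if every operand satisfies the conditions from the commit message.

    Assumption: only the three conditions named in the message are modelled here.
    """
    for reg in operands:
        # zmm registers and mask k registers rule out the shorter encoding.
        if reg.cls in ("zmm", "k"):
            return False
        # Only the low vector registers (indexes 0 - 15) qualify.
        if reg.index > 15:
            return False
    return True


if __name__ == "__main__":
    print(can_use_vex([Reg("xmm", 0), Reg("xmm", 15)]))   # low xmm registers only
    print(can_use_vex([Reg("ymm", 3), Reg("ymm", 16)]))   # ymm16 is out of range
    print(can_use_vex([Reg("xmm", 1), Reg("k", 1)]))      # uses a mask register
```

Modelling the check as a pure predicate over operands keeps the sketch independent of any particular instruction representation.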


142499 Commits
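
The register counting rule from commit 37997ebae5 (value = max( value, ( register index + 1 ) ), reset at each .amdgpu_hsa_kernel) can be sketched as below. This is a simplified model, not the llvm-mc implementation: the `count_registers` function, the regular expression, and the sample assembly lines are assumptions made for illustration, and register ranges such as v[0:3] are deliberately not handled.

```python
import re

# Assumption: a register reference looks like v<N> or s<N>; ranges are ignored.
REGISTER = re.compile(r"\b([vs])(\d+)\b")


def new_scope():
    """Symbols start at 0 at the beginning of each kernel scope."""
    return {".kernel.vgpr_count": 0, ".kernel.sgpr_count": 0}


def count_registers(lines):
    """Return one symbol table per scope, in source order.

    The first entry is the dummy scope that runs from the start of the
    file to the first .amdgpu_hsa_kernel directive.
    """
    scopes = [new_scope()]
    for line in lines:
        if line.strip().startswith(".amdgpu_hsa_kernel"):
            # A new directive ends the previous scope and begins the next.
            scopes.append(new_scope())
            continue
        for kind, index in REGISTER.findall(line):
            name = f".kernel.{kind}gpr_count"
            # value = max(value, register index + 1)
            scopes[-1][name] = max(scopes[-1][name], int(index) + 1)
    return scopes


if __name__ == "__main__":
    source = [
        "s_mov_b32 s2, 0",
        ".amdgpu_hsa_kernel foo",
        "v_add_f32 v5, v1, v0",
        "s_mov_b32 s7, s3",
        ".amdgpu_hsa_kernel bar",
        "v_mov_b32 v0, v0",
    ]
    for scope in count_registers(source):
        print(scope)
```

Because each symbol only ever grows within a scope, its value at the end of the scope is the count of used registers, which is what the commit message describes.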