llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Scott Linder	2f3fe9ca94	[DebugInfo] Unify ChecksumKind and Checksum value in DIFile Rather than encode the absence of a checksum with a Kind variant, instead put both the kind and value in a struct and wrap it in an Optional. Differential Revision: http://reviews.llvm.org/D43043 llvm-svn: 324928	2018-02-12 19:45:54 +00:00
Sanjay Patel	65fb2ea1a9	[InstCombine] X / (X * Y) --> 1.0 / Y This is similar to the instsimplify fold added with D42385 ( rL323716 ) ...but this can't be in instsimplify because we're creating/morphing a different instruction. llvm-svn: 324927	2018-02-12 19:39:21 +00:00
Sanjay Patel	860ba4b209	[InstCombine] add tests for missing fdiv fold; NFC llvm-svn: 324926	2018-02-12 19:23:39 +00:00
Sanjay Patel	493cd4b305	[InstCombine] regenerate checks; NFC llvm-svn: 324924	2018-02-12 19:14:01 +00:00
Sanjay Patel	067d5132f3	[InstCombine] various clean-ups for div transforms; NFC llvm-svn: 324922	2018-02-12 18:38:35 +00:00
Jun Bum Lim	1f9742c135	[LICM] update BlockColors after splitting predecessors Update BlockColors after splitting predecessors. Do not allow splitting EHPad for sinking when the BlockColors is not empty, so we can simply assign predecessor's color to the new block. Fixes PR36184 llvm-svn: 324916	2018-02-12 17:56:55 +00:00
Abderrazek Zaafrani	788c698674	[AArch64] Fixes for ARMv8.2-A FP16 scalar intrinsic - llvm portion https://reviews.llvm.org/D42993 llvm-svn: 324912	2018-02-12 17:35:42 +00:00
Simon Pilgrim	c7c90124a2	[X86] Add missing scheduling class tag for i64 absolute address moves Expand existing SchedRW to encompass these like it did for the other memory offset movs - added comments to closing braces to keep track of def scopes. We only tagged it with the itinerary class, so completeness checks were erroneously passed (PR35639). llvm-svn: 324910	2018-02-12 17:21:28 +00:00
Oliver Stannard	1a1cf03fd4	[AArch64] Improve v8.1-A code-gen for atomic load-and Armv8.1-A added an atomic load-clear instruction (which performs bitwise and with the complement of it's operand), but not a load-and instruction. Our current code-generation for atomic load-and always inserts an MVN instruction to invert its argument, even if it could be folded into a constant or another instruction. This adds lowering early in selection DAG to convert a load-and operation into an xor with -1 and a load-clear, allowing the normal DAG optimisations to work on it. To do this, I've had to add a new ISD opcode, ATOMIC_LOAD_CLR. I don't see any easy way to do this with an AArch64-specific ISD node, because the code-generation for atomic operations assumes the SDNodes are of type AtomicSDNode. I've left the old tablegen patterns in because they are still needed for global isel. Differential revision: https://reviews.llvm.org/D42478 llvm-svn: 324908	2018-02-12 17:03:11 +00:00
Simon Pilgrim	00452705e4	[X86][AVX512] Add missing scheduling class tag for KMOVB/KMOVW/KMOVD/KMOVQ moves/loads/stores. We only tagged it with the itinerary class, so completeness checks were erroneously passed (PR35639). llvm-svn: 324905	2018-02-12 16:59:04 +00:00
Evandro Menezes	b3ce05d85c	[AArch64] Refactor identification of SIMD immediates Get rid of icky goto loops and make the code easier to maintain (NFC). Differential revision: https://reviews.llvm.org/D42723 llvm-svn: 324903	2018-02-12 16:41:41 +00:00
Simon Pilgrim	2c978151de	[X86][AVX512] Add missing scheduling class tag for VMOVQ/VMOVHLPS/VMOVLHPS/VMOVHPD/VMOVHPS/VMOVLPD/VMOVLPS Tag AVX512 variants to match SSE/AVX originals. We only tagged it with the itinerary class, so completeness checks were erroneously passed (PR35639). llvm-svn: 324901	2018-02-12 16:18:36 +00:00
Momchil Velikov	7bd8b6b601	Re-commit r324489: [DebugInfo] Improvements to representation of enumeration types (PR36168) Differential Revision: https://reviews.llvm.org/D42734 llvm-svn: 324899	2018-02-12 16:10:09 +00:00
Simon Pilgrim	09469e72e3	[X86] Tag CET-IBT instruction scheduler classes llvm-svn: 324898	2018-02-12 15:57:00 +00:00
Simon Pilgrim	c527dc7f40	[X86][MMX] Add missing scheduling class tag for EMMS/FEMMS We only tagged it with the itinerary class, so completeness checks were erroneously passed (PR35639). AMD targets can perform these a lot quicker than WriteMicrocoded so will need an override in the models. llvm-svn: 324897	2018-02-12 15:52:59 +00:00
Krzysztof Parzyszek	8b0ac822d5	[NFC] Fix comment of class InstrStage Patch by Wei-Ren Chen. Differential Revision: https://reviews.llvm.org/D42905 llvm-svn: 324894	2018-02-12 15:02:49 +00:00
Alexey Bataev	0610dd4c30	[SLP] Take user instructions cost into consideration in insertelement vectorization. Summary: For better vectorization result we should take into consideration the cost of the user insertelement instructions when we try to vectorize sequences that build the whole vector. I.e. if we have the following scalar code: ``` <Scalar code> insertelement <ScalarCode>, ... ``` we should consider the cost of the last `insertelement ` instructions as the cost of the scalar code. Reviewers: RKSimon, spatel, hfinkel, mkuper Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D42657 llvm-svn: 324893	2018-02-12 14:54:48 +00:00
Oliver Stannard	8dae8033b4	[AArch64] Improve v8.1-A code-gen for atomic load-subtract Armv8.1-A added an atomic load-add instruction, but not a load-subtract instruction. Our current code-generation for atomic load-subtract always inserts a NEG instruction to negate it's argument, even if it could be folded into a constant or another instruction. This adds lowering early in selection DAG to convert a load-subtract operation into a subtract and a load-add, allowing the normal DAG optimisations to work on it. I've left the old tablegen patterns in because they are still needed for global isel. Some of the tests in this patch are copied from D35375 by Chad Rosier (which was abandoned). Differential revision: https://reviews.llvm.org/D42477 llvm-svn: 324892	2018-02-12 14:22:03 +00:00
Sanjay Patel	ff0846d7c8	[InstCombine] various clean-ups for commonIDivTransforms; NFC llvm-svn: 324891	2018-02-12 14:14:56 +00:00
Nicholas Wilson	a736760f0a	Test commit: reformat comment llvm-svn: 324889	2018-02-12 13:17:09 +00:00
Hans Wennborg	0264d64e8e	Revert r324835 "[X86] Reduce Store Forward Block issues in HW" It asserts building Chromium; see PR36346. (This also reverts the follow-up r324836.) > If a load follows a store and reloads data that the store has written to memory, Intel microarchitectures can in many cases forward the data directly from the store to the load, This "store forwarding" saves cycles by enabling the load to directly obtain the data instead of accessing the data from cache or memory. > A "store forward block" occurs in cases that a store cannot be forwarded to the load. The most typical case of store forward block on Intel Core microarchiticutre that a small store cannot be forwarded to a large load. > The estimated penalty for a store forward block is ~13 cycles. > > This pass tries to recognize and handle cases where "store forward block" is created by the compiler when lowering memcpy calls to a sequence > of a load and a store. > > The pass currently only handles cases where memcpy is lowered to XMM/YMM registers, it tries to break the memcpy into smaller copies. > breaking the memcpy should be possible since there is no atomicity guarantee for loads and stores to XMM/YMM. llvm-svn: 324887	2018-02-12 12:43:39 +00:00
Simon Atanasyan	8f7b436546	[mips] Fix 'l' constraint handling for types smaller than 32 bits In case of correct using of the 'l' constraint llvm now generates valid code; otherwise it shows an error message. Initially these triggers an assertion. This commit is the same as r324869 with fixed the test's file name. llvm-svn: 324885	2018-02-12 12:21:55 +00:00
Simon Atanasyan	21131cb47f	[mips] Revert rL324869 This commit adds inlineasm-cnstrnt-bad-l.ll which is clashing with inlineasm-cnstrnt-bad-L.ll on case insensitive file systems. llvm-svn: 324882	2018-02-12 11:15:37 +00:00
Florian Hahn	e6f9d324f3	[LoopInterchange] Simplify splitInnerLoopHeader logic (NFC). We can use SplitBlock for both cases, which makes the code slightly simpler and updates both LoopInfo and the dominator tree. llvm-svn: 324881	2018-02-12 11:10:58 +00:00
David Green	7df65b2048	[CodeGen] Add a -trap-unreachable option for debugging Add a common -trap-unreachable option, similar to the target specific hexagon equivalent, which has been replaced. This turns unreachable instructions into traps, which is useful for debugging. Differential Revision: https://reviews.llvm.org/D42965 llvm-svn: 324880	2018-02-12 11:06:27 +00:00
Sam McCall	f68b2fe1f1	[gtest] Support raw_ostream printing functions more comprehensively. Summary: These are functions like operator<<(raw_ostream&, Foo). Previously these were only supported for messages. In the assertion EXPECT_EQ(A, B) << C; the local modifications would explicitly try to use raw_ostream printing for C. However A and B would look for a std::ostream printing function, and often fall back to gtest's default "168 byte object <00 01 FE 42 ...>". This patch pulls out the raw_ostream support into a new header under `custom/`. I changed the mechanism: instead of a convertible stream, we wrap the printed value in a proxy object to allow it to be sent to a std::ostream. I think the new way is clearer. I also changed the policy: we prefer raw_ostream printers over std::ostream ones. This is because the fallback printers are defined using std::ostream, while all the raw_ostream printers should be "good". Reviewers: ilya-biryukov, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43091 llvm-svn: 324876	2018-02-12 10:20:09 +00:00
Simon Atanasyan	cf9cb29548	[mips] Fix 'l' constraint handling for types smaller than 32 bits In case of correct using of the 'l' constraint llvm now generates valid code; otherwise it shows an error message. Initially these triggers an assertion. llvm-svn: 324869	2018-02-12 07:51:21 +00:00
Gerolf Hoflehner	91a20825ce	[MC] Issue error message when data region is not terminated llvm-svn: 324868	2018-02-12 07:19:05 +00:00
Max Kazantsev	64c8bc6792	[NFC] Fix typos llvm-svn: 324867	2018-02-12 05:16:28 +00:00
Max Kazantsev	f028dfc8f5	[SCEV] Make getPostIncExpr guaranteed to return AddRec The current implementation of `getPostIncExpr` invokes `getAddExpr` for two recurrencies and expects that it always returns it a recurrency. But this is not guaranteed to happen if we have reached max recursion depth or refused to make SCEV simplification for other reasons. This patch changes its implementation so that now it always returns SCEVAddRec without relying on `getAddExpr`. Differential Revision: https://reviews.llvm.org/D42953 llvm-svn: 324866	2018-02-12 05:09:38 +00:00
Craig Topper	8259f5e976	[X86] Don't look for TEST instruction shrinking opportunities when the root node is a X86ISD::SUB. I don't believe we ever create an X86ISD::SUB with a 0 constant which is what the TEST handling needs. The ternary operator at the end of this code shows up as only going one way in the llvm-cov report from the bots. llvm-svn: 324865	2018-02-12 03:02:02 +00:00
Craig Topper	544fea01ab	[X86] Remove check for X86ISD::AND with no flag users from the TEST instruction immediate shrinking code. We turn X86ISD::AND with no flag users back to ISD::AND in PreprocessISelDAG. llvm-svn: 324864	2018-02-12 03:02:01 +00:00
Craig Topper	08505b384a	[X86] Change some compare patterns to use loadi8/loadi16/loadi32/loadi64 helper fragments. This enables CMP8mi to fold zextloadi8i1 which in all tests allows us to avoid creating a TEST8rr that peephole can't fold. llvm-svn: 324863	2018-02-12 02:48:42 +00:00
Craig Topper	480211ca64	[X86] Autogenerate complete checks. NFC llvm-svn: 324862	2018-02-12 02:03:36 +00:00
Craig Topper	e53e2913c0	[X86] Add KADD X86ISD opcode instead of reusing ISD::ADD. ISD::ADD implies individual vector element addition with no carries between elements. But for a vXi1 type that would be the same as XOR. And we already turn ISD::ADD into ISD::XOR for all vXi1 types during lowering. So the ISD::ADD pattern would never be able to match anyway. KADD is different, it adds the elements but also propagates a carry between them. This just a way of doing an add in k-register without bitcasting to the scalar domain. There's still no way to match the pattern, but at least its not obviously wrong. llvm-svn: 324861	2018-02-12 01:33:38 +00:00
Craig Topper	f31386f462	[X86] Allow zextload/extload i1->i8 to be folded into instructions during isel Previously we just emitted this as a MOV8rm which would likely get folded during the peephole pass anyway. This just makes it explicit earlier. The gpr-to-mask.ll test changed because the kaddb instruction has no memory form. llvm-svn: 324860	2018-02-12 01:33:36 +00:00
Charles Saternos	246be55b14	Follow on to rL324854 (Added tests) llvm-svn: 324859	2018-02-12 00:20:16 +00:00
Craig Topper	bf1d141721	[X86] Remove MASK_BINOP intrinsic type. NFC llvm-svn: 324858	2018-02-11 22:32:30 +00:00
Craig Topper	00e4bc826d	[X86] Remove dead code from getMaskNode that looked for a i64 mask with a maskVT that wasn't v64i1. NFC llvm-svn: 324857	2018-02-11 22:32:29 +00:00
Craig Topper	da1bc8da2f	[X86] Remove LowerBoolVSETCC_AVX512, we get this with a target independent DAG combine now. NFC llvm-svn: 324856	2018-02-11 22:32:27 +00:00
Charles Saternos	2f97999973	[ThinLTO] Add GraphTraits for FunctionSummaries Add GraphTraits definitions to the FunctionSummary and ModuleSummaryIndex classes. These GraphTraits will be used to construct find SCC's in ThinLTO analysis passes. llvm-svn: 324854	2018-02-11 22:06:20 +00:00
Brock Wyma	5b36a8af8d	[CodeView] Allow variable names to be as long as the codeview format supports Instead of reserving 0xF00 bytes for the fixed length portion of the CodeView symbol name, calculate the actual length of the fixed length portion. Differential Revision: https://reviews.llvm.org/D42125 llvm-svn: 324850	2018-02-11 21:26:46 +00:00
Craig Topper	7b1cb86a56	[X86] Update some required-vector-width.ll test cases to not pass 512-bit vectors in arguments or return. ABI for these would require 512 bits support so we don't want to test that. llvm-svn: 324845	2018-02-11 18:52:16 +00:00
Simon Pilgrim	b203875de8	[X86][SSE] Use SplitBinaryOpsAndApply to recognise PSUBUS patterns before they're split on AVX1 This needs to be generalised further to support AVX512BW cases but I want to add non-uniform constants first. llvm-svn: 324844	2018-02-11 17:29:42 +00:00
Sanjay Patel	d3c3b122ba	[InstCombine] X / (X * Y) -> 1 / Y if the multiplication does not overflow The related cases for (X * Y) / X were handled in rL124487. https://rise4fun.com/Alive/6k9 The division in these tests is subsequently eliminated by existing instcombines for 1/X. llvm-svn: 324843	2018-02-11 17:20:32 +00:00
Craig Topper	598a86fcc7	[X86] Use min/max for vector ult/ugt compares if avoids a sign flip. Summary: Currently we only use min/max to help with ule/uge compares because it removes an invert of the result that would otherwise be needed. But we can also use it for ult/ugt compares if it will prevent the need for a sign bit flip needed to use pcmpgt at the cost of requiring an invert after the compare. I also refactored the code so that the max/min code is self contained and does its own return instead of setting up a flag to manipulate the rest of the function's behavior. Most of the test cases look ok with this. I did notice that we added instructions when one of the operands being sign flipped is a constant vector that we were able to constant fold the flip into. I also noticed that sometimes the SSE min/max clobbers a register that is needed after the compare. This resulted in an extra move being inserted before the min/max to preserve the register. We could try to detect this and switch from min to max and change the compare operands to use the operand that gets reused in the compare. Reviewers: spatel, RKSimon Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42935 llvm-svn: 324842	2018-02-11 17:11:40 +00:00
Simon Pilgrim	4f6d54b20e	[X86][SSE] Moved SplitBinaryOpsAndApply earlier so more methods can use it. NFCI. llvm-svn: 324841	2018-02-11 17:01:43 +00:00
Sanjay Patel	4cfbc64275	[InstCombine] add tests for div-mul folds; NFC The related cases for (X * Y) / X were handled in rL124487. llvm-svn: 324840	2018-02-11 16:52:44 +00:00
Sanjay Patel	fb62a27499	[TargetLowering] try to create -1 constant operand for math ops via demanded bits This reverses instcombine's demanded bits' transform which always tries to clear bits in constants. As noted in PR35792 and shown in the test diffs: https://bugs.llvm.org/show_bug.cgi?id=35792 ...we can do better in codegen by trying to form -1. The x86 sub test shows a missed opportunity. I did investigate changing instcombine's behavior, but it would be more work to change canonicalization in IR. Clearing bits / shrinking constants can allow killing instructions, so we'd have to figure out how to not regress those cases. Differential Revision: https://reviews.llvm.org/D42986 llvm-svn: 324839	2018-02-11 14:38:23 +00:00
Simon Pilgrim	772b66be55	[X86] Add PR33747 test case llvm-svn: 324838	2018-02-11 13:12:50 +00:00

1 2 3 4 5 ...

160087 Commits