llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-18 18:42:46 +02:00

Author	SHA1	Message	Date
Alex Bradbury	047632c571	[RISCV] Quick fix for PR40333 Avoid the infinite loop caused by the target DAG combine converting ANYEXT to SIGNEXT and the target-independent DAG combine logic converting back to ANYEXT. Do this by not adding the new node to the worklist. Committing directly as this definitely doesn't make the problem any worse, and I intend to follow-up with a patch that avoids this custom combiner logic altogether and just lowers the i32 operations to a target-specific SelectionDAG node. This should be easier to reason about and improve codegen quality in some cases (though may miss out on some later DAG combines). llvm-svn: 351806	2019-01-22 12:11:53 +00:00
Max Kazantsev	daef624965	[LoopPredication] Support guards expressed as branches by widenable condition This patch adds support of guards expressed as branches by widenable conditions in Loop Predication. Differential Revision: https://reviews.llvm.org/D56081 Reviewed By: reames llvm-svn: 351805	2019-01-22 11:49:06 +00:00
Simon Pilgrim	a25521a255	[X86] Add test for matchAddressRecursively's MUL handling Noticed in code coverage tests that this isn't tested. llvm-svn: 351804	2019-01-22 11:39:21 +00:00
Max Kazantsev	03355e4336	[NFC] Add function to parse widenable conditional branches llvm-svn: 351803	2019-01-22 11:21:32 +00:00
Martin Storsjo	ac45f0ce9d	[llvm-objcopy] [COFF] Implement --add-gnu-debuglink Differential Revision: https://reviews.llvm.org/D57007 llvm-svn: 351801	2019-01-22 10:58:18 +00:00
Martin Storsjo	2151d56b45	[llvm-objcopy] [COFF] Update symbol indices in weak externals Differential Revision: https://reviews.llvm.org/D57006 llvm-svn: 351800	2019-01-22 10:58:09 +00:00
Martin Storsjo	70751a99e9	[llvm-objcopy] Consistently use createStringError instead of make_error<StringError> This was requested in the review of D57006. Also add missing quotes around symbol names in error messages. Differential Revision: https://reviews.llvm.org/D57014 llvm-svn: 351799	2019-01-22 10:57:59 +00:00
James Henderson	7a925cd03f	[NFC][llvm-readobj]Normalise --/- inconsistency in test options llvm-svn: 351798	2019-01-22 10:57:21 +00:00
Simon Pilgrim	01208cb3d3	[X86] HADDPS/HADDPD scalar lowering was added at rL350421 llvm-svn: 351797	2019-01-22 10:49:41 +00:00
Chandler Carruth	3764443332	Revert r351778: IR: Add fp operations to atomicrmw This broke the RISCV build, and even with that fixed, one of the RISCV tests behaves surprisingly differently with asserts than without, leaving there no clear test pattern to use. Generally it seems bad for hte IR to differ substantially due to asserts (as in, an alloca is used with asserts that isn't needed without!) and nothing I did simply would fix it so I'm reverting back to green. This also required reverting the RISCV build fix in r351782. llvm-svn: 351796	2019-01-22 10:29:58 +00:00
James Henderson	23166c2b13	[llvm-symbolizer] Add support for --basenames/-s This fixes https://bugs.llvm.org/show_bug.cgi?id=40068. --basenames is a GNU addr2line switch which strips the directory names from the file path in the output. Reviewed by: ruiu Differential Revision: https://reviews.llvm.org/D56919 llvm-svn: 351795	2019-01-22 10:24:32 +00:00
Max Kazantsev	88e4dee44f	[NFC] Factor out some reusable logic llvm-svn: 351794	2019-01-22 10:13:36 +00:00
Max Kazantsev	75db383edd	[NFC] Add detector for guards expressed as branch by widenable conditions This patch adds a function to detect guards expressed in explicit control flow form as branch by `and` with widenable condition intrinsic call: %wc = call i1 @llvm.experimental.widenable.condition() %guard_cond = and i1, %some_cond, %wc br i1 %guard_cond, label %guarded, label %deopt deopt: <maybe some non-side-effecting instructions> deoptimize() This form can be used as alternative to implicit control flow guard representation expressed by `experimental_guard` intrinsic. Differential Revision: https://reviews.llvm.org/D56074 Reviewed By: reames llvm-svn: 351791	2019-01-22 09:36:22 +00:00
James Henderson	ac2839a57d	[llvm-readelf]Revert --dyn-symbols behaviour to make it GNU compatible, and add new --hash-symbols switch for old behaviour In r287786, the behaviour of --dyn-symbols in llvm-readelf (but not llvm-readobj) was changed to print the dynamic symbols as derived from the hash table, rather than to print the dynamic symbol table contents directly. The original change was initially submitted without review, and some comments were made on the commit mailing list implying that the new behavious is GNU compatible. I argue that it is not: 1) It does not include a null symbol. 2) It prints the symbols based on an order derived from the hash table. 3) It prints an extra column indicating which bucket it came from. This could break parsers that expect a fixed number of columns, with the first column being the symbol index. 4) If the input happens to have both .hash and .gnu.hash section, it prints interpretations of them both, resulting in most symbols being printed twice. 5) There is no way of just printing the raw dynamic symbol table, because --symbols also prints the static symbol table. This patch reverts the --dyn-symbols behaviour back to its old behaviour of just printing the contents of the dynamic symbol table, similar to what is printed by --symbols. As the hashed interpretation is still desirable to validate the hash table, it puts it under a new switch "--hash-symbols". This is a no-op on all output forms except for GNU output style for ELF. If there is no hash table, it does nothing, unlike the previous behaviour which printed the raw dynamic symbol table, since the raw dynsym is available under --dyn-symbols. The yaml input for the test is based on that in test/tools/llvm-readobj/demangle.test, but stripped down to the bare minimum to provide a valid dynamic symbol. Note: some LLD tests needed updating. I will commit a separate patch for those. Reviewed by: grimar, rupprecht Differential Revision: https://reviews.llvm.org/D56910 llvm-svn: 351789	2019-01-22 09:35:35 +00:00
Vitaly Buka	316c1b9dc6	Revert "Remove static_assert(value == std::is_trivially_copyable<T>::value)" Upgraded the bot as workaround. This reverts commit r351784. llvm-svn: 351786	2019-01-22 07:22:45 +00:00
Alex Bradbury	b6f0a606cb	[RISCV][NFC] Add break to case statement in RISCVDAGToDAGISel::Select The break isn't strictly needed yet as there is no subsequent entry in the case. But adding to prevent mistakes further down the road. llvm-svn: 351785	2019-01-22 07:22:00 +00:00
Vitaly Buka	c9bbd091e7	Remove static_assert(value == std::is_trivially_copyable<T>::value) This fails to compile with clang ang libstdc++ 4.6 llvm-svn: 351784	2019-01-22 06:26:50 +00:00
Alex Bradbury	757193855d	[RISCV] Fix build after r351778 Also add a comment to explain the expansion strategy for atomicrmw {fadd,fsub}. llvm-svn: 351782	2019-01-22 05:06:57 +00:00
Matt Arsenault	44582e29c8	IR: Add fp operations to atomicrmw Add just fadd/fsub for now. llvm-svn: 351778	2019-01-22 03:32:36 +00:00
Eli Friedman	d2ca493c0b	[ARM] Combine ands+lsls to lsls+lsrs for Thumb1. This patch may seem familiar... but my previous patch handled the equivalent lsls+and, not this case. Usually instcombine puts the "and" after the shift, so this case doesn't come up. However, if the shift comes out of a GEP, it won't get canonicalized by instcombine, and DAGCombine doesn't have an equivalent transform. This also modifies isDesirableToCommuteWithShift to suppress DAGCombine transforms which would make the overall code worse. I'm not really happy adding a bunch of code to handle this, but it would probably be tricky to substantially improve the behavior of DAGCombine here. Differential Revision: https://reviews.llvm.org/D56032 llvm-svn: 351776	2019-01-22 01:51:37 +00:00
Philip Reames	22e59f8759	[CVP] Use LVI to constant fold deopt operands Deopt operands are generally intended to record information about a site in code with minimal perturbation of the surrounding code. Idiomatically, they also tend to appear down rare paths. Putting these together, we have an obvious case for extending CVP w/deopt operand constant folding. Arguably, we should be doing this for all operands on all instructions, but that's definitely a much larger and risky change. Differential Revision: https://reviews.llvm.org/D55678 llvm-svn: 351774	2019-01-22 01:34:33 +00:00
Eli Friedman	9361d42b5e	[LangRef] Clarify semantics of volatile operations. Specifically, clarify the following: 1. Volatile load and store may access addresses that are not memory. 2. Volatile load and store do not modify arbitrary memory. 3. Volatile load and store do not trap. Prompted by recent volatile discussion on llvmdev. Currently, there's sort of a split in the source code about whether volatile operations are allowed to trap; this resolves that dispute in favor of not allowing them to trap. Differential Revision: https://reviews.llvm.org/D53184 llvm-svn: 351772	2019-01-22 00:42:20 +00:00
Matt Arsenault	57bff27c5d	GlobalISel: Fix out of bounds crashes in verifier llvm-svn: 351769	2019-01-22 00:29:37 +00:00
Eli Friedman	1d6f130191	[AArch64] Add patterns for zext/sext of shift amount. Not sure this is the best fix, but it saves an instruction for certain constructs involving variable shifts. Differential Revision: https://reviews.llvm.org/D55572 llvm-svn: 351768	2019-01-22 00:21:35 +00:00
Matt Arsenault	7bd184dc17	AMDGPU/GlobalISel: Legalize more fp<->int conversions llvm-svn: 351767	2019-01-22 00:20:17 +00:00
JF Bastien	dec3ae0936	Document toolchain update policy Summary: Capture the current agreed-upon toolchain update policy based on the following discussions: - LLVM dev meeting 2018 BoF "Migrating to C++14, and beyond!" llvm.org/devmtg/2018-10/talk-abstracts.html#bof3 - A Short Policy Proposal Regarding Host Compilers lists.llvm.org/pipermail/llvm-dev/2018-May/123238.html - Using C++14 code in LLVM (2018) lists.llvm.org/pipermail/llvm-dev/2018-May/123182.html - Using C++14 code in LLVM (2017) lists.llvm.org/pipermail/llvm-dev/2017-October/118673.html - Using C++14 code in LLVM (2016) lists.llvm.org/pipermail/llvm-dev/2016-October/105483.html - Document and Enforce new Host Compiler Policy llvm.org/D47073 - Require GCC 5.1 and LLVM 3.5 at a minimum llvm.org/D46723 Subscribers: jkorous, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D56819 llvm-svn: 351765	2019-01-21 23:53:52 +00:00
Sanjay Patel	6499823747	[x86] add another test for xor with undefs; NFC llvm-svn: 351764	2019-01-21 22:12:35 +00:00
Sanjay Patel	d0b6d8449c	[x86] add tests for vector ops with undef lanes; NFC llvm-svn: 351763	2019-01-21 21:52:27 +00:00
Craig Topper	b7edb6ed38	[X86] Use X86ISD::VFPROUND instead of ISD::FP_ROUND for 256 and 512 bit cvtpd2ps intrinsics. Summary: Use X86ISD::VFPROUND in the instruction isel patterns. Add new patterns for ISD::FP_ROUND to maintain support for fptrunc in IR. In the process I found a couple duplicate isel patterns which I also deleted in this patch. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56991 llvm-svn: 351762	2019-01-21 20:14:09 +00:00
Craig Topper	4ffb8a05bb	[X86] Change avx512 COMPRESS and EXPAND lowering to use a single masked node instead of expand/compress+select. Summary: For compress, a select node doesn't semantically reflect the behavior of the instruction. The mask would have holes in it, but the resulting write is to contiguous elements at the bottom of the vector. Furthermore, as far as the compressing and expanding is concerned the behavior is depended on the mask. You can't just have an expand/compress node that only reads the input vector. That node would have no meaning by itself. This all only works because we pattern match the compress/expand+select back to the instruction. But conceivably an optimization of the select could break the pattern and leave something meaningless. This patch modifies the expand and compress node to take the mask and passthru as additional inputs and gets rid of the select all together. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D57002 llvm-svn: 351761	2019-01-21 20:02:28 +00:00
Stanislav Mekhanoshin	8d285e9031	[AMDGPU] Fixed hazard recognizer to walk predecessors Fixes two problems with GCNHazardRecognizer: 1. It only scans up to 5 instructions emitted earlier. 2. It does not take control flow into account. An earlier instruction from the previous basic block is not necessarily a predecessor. At the same time a real predecessor block is not scanned. The patch provides a way to distinguish between scheduler and hazard recognizer mode. It is OK to work with emitted instructions in the scheduler because we do not really know what will be emitted later and its order. However, when pass works as a hazard recognizer the schedule is already finalized, and we have full access to the instructions for the whole function, so we can properly traverse predecessors and their instructions. Differential Revision: https://reviews.llvm.org/D56923 llvm-svn: 351759	2019-01-21 19:11:26 +00:00
Nico Weber	5321e96990	gn build: Stop passing -DLLVM_LIBXML2_ENABLED to some targets This is a remnant from before the gn build had a working config.h. Defining LLVM_LIBXML2_ENABLED only for targets that depend on build/libs/xml is nice in that only some of the codebase needs to be rebuilt when llvm_enable_libxml2 changes -- but config.h already defines it and defining it there and then redundantly a second time for some targets is worse than having it just in config.h. No behavior change. Differential Revision: https://reviews.llvm.org/D56908 llvm-svn: 351758	2019-01-21 18:59:11 +00:00
Nico Weber	6295f7103b	gn build: Merge r351627, r351548, r351701 llvm-svn: 351757	2019-01-21 18:56:39 +00:00
Pavel Labath	38ec165096	Fix compilation error with gcc 4.8 This version of gcc seems to be having issues with raw literals inside macro arguments. I change the string to use regular string literals instead. llvm-svn: 351756	2019-01-21 18:21:03 +00:00
Simon Pilgrim	b99a324a28	[X86][BtVer2] Update latency of mmx horizontal operations D56777 added +1cy local forwarding penalty for horizontal operations, but this penalty only affects sse2/xmm variants, the mmx variants don't suffer the penalty. Confirmed with @andreadb llvm-svn: 351755	2019-01-21 18:04:25 +00:00
Sanjay Patel	965b5f410f	[AArch64] add more tests for buildvec to shuffle transform; NFC These are copied from the sibling x86 file. I'm not sure which of the current outputs (if any) is considered optimal, but someone more familiar with AArch may want to take a look. llvm-svn: 351754	2019-01-21 17:46:35 +00:00
Sanjay Patel	0962f23352	[DAGCombiner] fix crash when converting build vector to shuffle The regression test is reduced from the example shown in D56281. This does raise a question as noted in the test file: do we want to handle this pattern? I don't have a motivating example for that on x86 yet, but it seems like we could have that pattern there too, so we could avoid the back-and-forth using a shuffle. llvm-svn: 351753	2019-01-21 17:30:14 +00:00
Andrea Di Biagio	3e3ec46699	[X86][BtVer2] Update the WriteLoad latency. r327630 introduced new write definitions for float/vector loads. Before that revision, WriteLoad was used by both integer/float (scalar/vector) load. So, WriteLoad had to conservatively declare a latency to 5cy. That is because the load-to-use latency for float/vector load is 5cy. Now that we have dedicated writes for float/vector loads, there is no reason why we should keep the latency of WriteLoad to 5cy. At the moment, WriteLoad is only used by scalar integer loads only; we can assume an optimstic 3cy latency for them. This patch changes that latency from 5cy to 3cy, and regenerates the affected scheduling/mca tests. Differential Revision: https://reviews.llvm.org/D56922 llvm-svn: 351742	2019-01-21 12:04:10 +00:00
Simon Pilgrim	4aacb0da3a	[CostModel][X86] Add XOP icmp cost tests (PR40376) llvm-svn: 351741	2019-01-21 11:33:52 +00:00
Dmitry Venikov	df9b821340	[llvm-symbolizer] Add -no-demangle as alias for -demangle=false Summary: Provides -no-demangle as alias for -demangle=false. Motivation: https://bugs.llvm.org/show_bug.cgi?id=40075 Reviewers: jhenderson, ruiu Reviewed By: jhenderson Subscribers: erik.pilkington, rupprecht, llvm-commits Differential Revision: https://reviews.llvm.org/D56773 llvm-svn: 351735	2019-01-21 10:00:57 +00:00
Chandler Carruth	d4f3796eeb	Fix typos throughout the license files that somehow I and my reviewers all missed! Thanks to Alex Bradbury for pointing this out, and the fact that I never added the intended `legacy` anchor to the developer policy. Add that anchor too. With hope, this will cause the links to all resolve successfully. llvm-svn: 351731	2019-01-21 09:52:34 +00:00
Craig Topper	d3ab842eb8	[X86] Remove and autoupgrade vpmovqd/vpmovwb intrinsics using trunc+select. llvm-svn: 351729	2019-01-21 08:16:59 +00:00
Max Kazantsev	589ead7620	[NFC] Make getExpressionSize unsigned short llvm-svn: 351727	2019-01-21 07:36:55 +00:00
Max Kazantsev	8a48aae360	[NFC] Fix warnings in unit test of r351725 llvm-svn: 351726	2019-01-21 07:27:47 +00:00
Max Kazantsev	f0c38d90c7	[SCEV][NFC] Introduces expression sizes estimation This patch introduces the field `ExpressionSize` in SCEV. This field is calculated only once on SCEV creation, and it represents the complexity of this SCEV from arithmetical point of view (not from the point of the number of actual different SCEV nodes that are used in the expression). Roughly saying, it is the number of operands and operations symbols when we print this SCEV. A formal definition is following: if SCEV `X` has operands `Op1`, `Op2`, ..., `OpN`, then Size(X) = 1 + Size(Op1) + Size(Op2) + ... + Size(OpN). Size of SCEVConstant and SCEVUnknown is one. Expression size may be used as a universal way to limit SCEV transformations for huge SCEVs. Currently, we have a bunch of options that represents various limits (such as recursion depth limit) that may not make any sense from the point of view of a LLVM users who is not familiar with SCEV internals, and all these different options pursue one goal. A more general rule that may potentially allow us to get rid of this redundancy in options is "do not make transformations with SCEVs of huge size". It can apply to all SCEV traversals and transformations that may need to visit a SCEV node more than once, hence they are prone to combinatorial explosions. This patch only introduces SCEV sizes calculation as NFC, its utilization will be introduced in follow-up patches. Differential Revision: https://reviews.llvm.org/D35989 Reviewed By: reames llvm-svn: 351725	2019-01-21 06:19:50 +00:00
Kito Cheng	28868a45d5	[RISCV] Add R_RISCV_RELAX relocation to all possible relax candidates. Summary: Add R_RISCV_RELAX relocation to all possible relax candidates and update corresponding testcase. Reviewers: asb, apazos Differential Revision: https://reviews.llvm.org/D46677 llvm-svn: 351723	2019-01-21 05:27:09 +00:00
Dylan McKay	e3d6fdcccf	[AVR] Insert unconditional branch when inserting MBBs between blocks with fallthrough This updates the AVR Select8/Select16 expansion code so that, when inserting the two basic blocks for true and false conditions, any existing fallthrough on the previous block is preserved. Prior to this patch, if the block before the Select pseudo fell through to the subsequent block, two new basic blocks would be inserted at the prior fallthrough point, changing the fallthrough destination. The predecessor or successor lists were not updated, causing the BranchFolding pass at -O1 and above the rearrange basic blocks, causing an infinite loop. Not to mention the unconditional fallthrough to the true block is incorrect in of itself. This patch modifies the Select8/16 expansion so that, if inserting true and false basic blocks at a fallthrough point, the implicit branch is preserved by means of an explicit, unconditional branch to the previous fallthrough destination. Thanks to Carl Peto for reporting this bug. This fixes avr-rust bug https://github.com/avr-rust/rust/issues/123. llvm-svn: 351721	2019-01-21 04:32:02 +00:00
Dylan McKay	b12b974df1	[AVR] Enable emission of debug information Prior to this, the code was missing AVR-specific relocation logic in RelocVisitor.h. This patch teaches RelocVisitor about R_AVR_16 and R_AVR_32. Debug information is emitted in the final object file, and understood by 'avr-readelf --debug-dump' from AVR-GCC. llvm-dwarfdump is yet to understand how to dump AVR DWARF symbols. llvm-svn: 351720	2019-01-21 04:27:08 +00:00
Dylan McKay	762baddef7	Revert "[AVR] Insert unconditional branch when inserting MBBs between blocks with fallthrough" This reverts commit r351718. Carl pointed out that the unit test could be improved. This patch will be recommitted once the test is made more resilient. llvm-svn: 351719	2019-01-21 02:46:13 +00:00
Dylan McKay	3d00c7399b	[AVR] Insert unconditional branch when inserting MBBs between blocks with fallthrough This updates the AVR Select8/Select16 expansion code so that, when inserting the two basic blocks for true and false conditions, any existing fallthrough on the previous block is preserved. Prior to this patch, if the block before the Select pseudo fell through to the subsequent block, two new basic blocks would be inserted at the prior fallthrough point, changing the fallthrough destination. The predecessor or successor lists were not updated, causing the BranchFolding pass at -O1 and above the rearrange basic blocks, causing an infinite loop. Not to mention the unconditional fallthrough to the true block is incorrect in of itself. This patch modifies the Select8/16 expansion so that, if inserting true and false basic blocks at a fallthrough point, the implicit branch is preserved by means of an explicit, unconditional branch to the previous fallthrough destination. Thanks to Carl Peto for reporting this bug. This fixes avr-rust bug https://github.com/avr-rust/rust/issues/123. llvm-svn: 351718	2019-01-21 02:44:09 +00:00

... 3 4 5 6 7 ...

174214 Commits