llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Matt Arsenault	bc91f61db9	AMDGPU: Add combine for short vector extract_vector_elts Try to access pieces 4 bytes at a time. This helps various hasOneUse extract_vector_elt combines, such as load width reductions. Avoids test regressions in a future commit. llvm-svn: 334836	2018-06-15 15:31:36 +00:00
Matt Arsenault	30c299fdc4	AMDGPU: Make v4i16/v4f16 legal Some image loads return these, and it's awkward working around them not being legal. llvm-svn: 334835	2018-06-15 15:15:46 +00:00
Paul Semel	bc4f5a08fb	[llvm-readobj] Add -string-dump (-p) option This option prints the section content as a string. Differential Revision: https://reviews.llvm.org/D47989 llvm-svn: 334834	2018-06-15 14:15:02 +00:00
Roman Lebedev	4ec9cfed2e	[MCA] Add -summary-view option Summary: While that is indeed a quite interesting summary stat, there are cases where it does not really add anything other than consuming extra lines. Declutters the output of D48190. Reviewers: RKSimon, andreadb, courbet, craig.topper Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48209 llvm-svn: 334833	2018-06-15 14:01:43 +00:00
Roman Lebedev	6b29fcb05c	[MCA][x86][NFC] Add tests for -register-file-stats, -scheduler-stats Summary: There does not seem to be any other tests for this. Split off from D47676. Reviewers: RKSimon, craig.topper, courbet, andreadb Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48190 llvm-svn: 334832	2018-06-15 14:01:35 +00:00
Sander de Smalen	dda2100a0f	[AArch64][SVE] Asm: Support for FADD, FMUL and FMAX immediate instructions. Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: javed.absar Differential Revision: https://reviews.llvm.org/D47712 llvm-svn: 334831	2018-06-15 13:57:51 +00:00
Bjorn Pettersson	c3a419fcf3	Re-apply "[DebugInfo] Check size of variable in ConvertDebugDeclareToDebugValue" This is r334704 (which was reverted in r334732) with a fix for types like x86_fp80. We need to use getTypeAllocSizeInBits and not getTypeStoreSizeInBits to avoid dropping debug info for such types. Original commit msg: > Summary: > Do not convert a DbgDeclare to DbgValue if the store > instruction only refer to a fragment of the variable > described by the DbgDeclare. > > Problem was seen when for example having an alloca for an > array or struct, and there were stores to individual elements. > In the past we inserted a DbgValue intrinsics for each store, > just as if the store wrote the whole variable. > > When handling store instructions we insert a DbgValue that > indicates that the variable is "undefined", as we do not know > which part of the variable that is updated by the store. > > When ConvertDebugDeclareToDebugValue is used with a load/phi > instruction we assert that the referenced value is large enough > to cover the whole variable. Afaict this should be true for all > scenarios where those methods are used on trunk. If the assert > blows in the future I guess we could simply skip to insert a > dbg.value instruction. > > In the future I think we should examine which part of the variable > that is accessed, and add a DbgValue instrinsic with an appropriate > DW_OP_LLVM_fragment expression. > > Reviewers: dblaikie, aprantl, rnk > > Reviewed By: aprantl > > Subscribers: JDevlieghere, llvm-commits > > Tags: #debug-info > > Differential Revision: https://reviews.llvm.org/D48024 llvm-svn: 334830	2018-06-15 13:48:55 +00:00
Simon Dardis	1331e402c6	[mips] Add licensing information of the microMIPS tablegen files. (NFC) llvm-svn: 334827	2018-06-15 13:29:35 +00:00
Sander de Smalen	00d2fd13c5	[AArch64][SVE] Asm: Add parsing/printing support for exact FP immediates. Some instructions require of a limited set of FP immediates as operands, for example '#0.5 or #1.0' for SVE's FADD instruction. This patch adds support for parsing and printing such FP immediates as exact values (e.g. #0.499999 is not accepted for #0.5). Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47711 llvm-svn: 334826	2018-06-15 13:11:49 +00:00
Roman Lebedev	f74c82875a	[NFC] chmod +x utils/update_analyze_test_checks.py Looks like a simple oversight. llvm-svn: 334825	2018-06-15 12:41:50 +00:00
Matt Arsenault	a61bbd4443	DAG: Fix creating concat_vectors with illegal type Test passes as is, but fails with future patch to make v4i16/v4f16 legal. llvm-svn: 334823	2018-06-15 12:09:15 +00:00
Simon Pilgrim	1b9a8986fb	[SLP][X86] Add AVX2 run to POW2 SDIV Tests Non-uniform pow2 tests are only make sense on targets with fast (low cost) non-uniform shifts llvm-svn: 334821	2018-06-15 10:29:37 +00:00
Simon Pilgrim	292418fe45	[SLP][X86] Regenerate POW2 SDIV Tests Added non-uniform pow2 test as well llvm-svn: 334819	2018-06-15 10:07:03 +00:00
Roman Lebedev	58147abfa7	[InstCombine] Recommit: Fold (x << y) >> y -> x & (-1 >> y) Summary: We already do it for splat constants, but not just values. Also, undef cases are mostly non-functional. The original commit was reverted because it broke tests for amdgpu backend, which i didn't check. Now, the backed was updated to recognize these new patterns, so we are good. https://bugs.llvm.org/show_bug.cgi?id=37603 https://rise4fun.com/Alive/cplX Reviewers: spatel, craig.topper, mareko, bogner, rampitec, nhaehnle, arsenm Reviewed By: spatel, rampitec, nhaehnle Subscribers: wdng, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D47980 llvm-svn: 334818	2018-06-15 09:56:52 +00:00
Roman Lebedev	a0dac64487	[AMDGPU] Recognize x & ~(-1 << y) pattern. Summary: The same pattern as D48010, but this one is IR-canonical as of D47428. Reviewers: nhaehnle, bogner, tstellar, arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #amdgpu Differential Revision: https://reviews.llvm.org/D48012 llvm-svn: 334817	2018-06-15 09:56:45 +00:00
Roman Lebedev	733f1f7fbd	[AMDGPU] Recognize x & ((1 << y) - 1) pattern. Summary: As a followup for D48007. Since we already handle `x << (bitwidth - y) >> (bitwidth - y)` pattern, which does not have ub for both the edge cases (`y == 0`, `y == bitwidth`), i think also handling a pattern that is ub for `y == bitwidth` should be fine. Reviewers: nhaehnle, bogner, tstellar, arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #amdgpu Differential Revision: https://reviews.llvm.org/D48010 llvm-svn: 334816	2018-06-15 09:56:39 +00:00
Roman Lebedev	ab8ea027b7	[AMDGPU] Recognize x & (-1 >> (32 - y)) pattern. Summary: D47980 will canonicalize the `x << (32 - y) >> (32 - y)`, which is the pattern the AMDGPU expects to `x & (-1 >> (32 - y))`, which is not recognized by AMDGPU. Thus, it needs to be recognized, too. Reviewers: nhaehnle, bogner, tstellar, arsenm Reviewed By: arsenm Subscribers: arsenm, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #amdgpu Differential Revision: https://reviews.llvm.org/D48007 llvm-svn: 334815	2018-06-15 09:56:31 +00:00
Peter Smith	d93c0f957b	[MC] Move bundling and MCSubtargetInfo to MCEncodedFragment [NFC] Instruction bundling is only supported on descendants of the MCEncodedFragment type. By moving the bundling functionality and MCSubtargetInfo to this class it makes it easier to set and extract the MCSubtargetInfo when it is necessary. This is a refactoring change that will make it easier to pass the MCSubtargetInfo through to writeNops when nop padding is required. Differential Revision: https://reviews.llvm.org/D45959 llvm-svn: 334814	2018-06-15 09:48:18 +00:00
Clement Courbet	d3ce5d4b09	[llvm-exegesis][NFC] Remove dead variable. llvm-svn: 334813	2018-06-15 09:46:57 +00:00
Clement Courbet	aefb860998	[llvm-exegesis][NFC] Add more comments. llvm-svn: 334811	2018-06-15 09:27:12 +00:00
QingShan Zhang	aec50ce443	add myself to the CREDITS.TXT llvm-svn: 334808	2018-06-15 08:34:41 +00:00
Mikhail Dvoretckii	9ec0967837	NFC: Regenerating x86-sse41.ll test for InstCombine Test regenerated to reduce noise in further patches. llvm-svn: 334806	2018-06-15 07:59:29 +00:00
Clement Courbet	356357e6bf	[llvm-exegesis] Print the whole snippet in analysis. Summary: On hover, the whole asm snippet is displayed, including operands. This requires the actual assembly output instead of just the MCInsts: This is because some pseudo-instructions get lowered to actual target instructions during codegen (e.g. ABS_Fp32 -> SSE or X87). Reviewers: gchatelet Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D48164 llvm-svn: 334805	2018-06-15 07:30:45 +00:00
Craig Topper	ddac8b162b	Revert r334802 "[X86] Prevent folding stack reloads with instructions that have an undefined register update." There's a typo causing the build to fail. llvm-svn: 334803	2018-06-15 06:15:26 +00:00
Craig Topper	21c8de8539	[X86] Prevent folding stack reloads with instructions that have an undefined register update. We want to keep the load unfolded so we can use the same register for both sources to avoid a false dependency. llvm-svn: 334802	2018-06-15 06:11:36 +00:00
Craig Topper	5526c28acc	[X86] Add more instructions to the memory folding tables using the autogenerated table as a guide. I think this covers most of the unmasked vector instructions. We're still missing a lot of the masked instructions. There are some test changes here because of the new folding support. I don't think these particular cases should be folded because it creates an undef register dependency. I think the changes introduced in r334175 are not handling stack folding. They're only blocking the peephole pass. llvm-svn: 334800	2018-06-15 05:49:19 +00:00
Hiroshi Inoue	9e40efa572	[NFC] fix trivial typos in documents llvm-svn: 334799	2018-06-15 05:10:09 +00:00
Craig Topper	74e3a64c30	[X86] Fix some checks to use X86 instead of X32. These tests were recently updated so it looks like gone wrong. llvm-svn: 334786	2018-06-15 04:42:55 +00:00
Craig Topper	5584c8b9f5	[X86] Add 'Z' to the internal names of various EVEX instructions for overall consistency. llvm-svn: 334785	2018-06-15 04:42:54 +00:00
Andrew Kaylor	2c05bb7844	Add debug info for OProfile profiling support Patch by Gaetano Priori Differential Revision: https://reviews.llvm.org/D47925 llvm-svn: 334782	2018-06-15 00:07:28 +00:00
Shoaib Meenai	00e5a5e3bd	[cmake] Change ON/OFF to YES/NO. NFC compnerd pointed out that the latter reads better over here. llvm-svn: 334781	2018-06-14 23:40:04 +00:00
Shoaib Meenai	4ae4788876	[cmake] Add linker detection for Apple platforms LLVM currently assumes that Apple platforms will always use ld64. In the future, LLD Mach-O might also be supported, so add the beginnings of linker detection support. ld64 is currently the only detected linker, since `ld64.lld -v` doesn't yield any useful version output, but we can add that detection later, and in the meantime it's still useful to have the ld64 identification. Switch clang's order file check to use this new detection rather than just checking for the presence of an ld64 executable. Differential Revision: https://reviews.llvm.org/D48201 llvm-svn: 334780	2018-06-14 23:26:33 +00:00
Eli Friedman	7ed3f1b26c	Make uitofp and sitofp defined on overflow. IEEE 754 defines the expected result on overflow. As far as I know, hardware implementations (of f16), and compiler-rt (__floatuntisf) correctly return +-Inf on overflow. And I can't think of any useful transform that would take advantage of overflow being undefined here. Differential Revision: https://reviews.llvm.org/D47807 llvm-svn: 334777	2018-06-14 22:58:48 +00:00
Lang Hames	a86b02ba7f	[ORC] Strip weak flags from a symbol once it is selected for materialization. Once a symbol has been selected for materialization it can no longer be overridden. Stripping the weak flag guarantees this (override attempts will then be treated as duplicate definitions and result in a DuplicateDefinition error). llvm-svn: 334771	2018-06-14 21:16:29 +00:00
Matt Davis	e567892d10	[llvm-mca] Clean up the header comment. NFC. This change removes a few dashes to make room for the header syntax string. llvm-svn: 334770	2018-06-14 20:58:54 +00:00
Michael Berg	10e6427e7a	easing the constraint for isNegatibleForFree and GetNegatedExpression Summary: Here we relax the old constraint which utilized unsafe with the TargetOption flag HonorSignDependentRoundingFPMathOption, with the assertion that unsafe is no longer needed or never was required for correctness on FDIV/FMUL. Reviewers: spatel, hfinkel, wristow, arsenm, javed.absar Reviewed By: spatel Subscribers: efriedma, wdng, tpr Differential Revision: https://reviews.llvm.org/D48057 llvm-svn: 334769	2018-06-14 20:54:13 +00:00
Florian Hahn	88222e23c3	Revert r334764, as it breaks some bots llvm-svn: 334767	2018-06-14 20:32:58 +00:00
Florian Hahn	e1f114367e	[TableGen] Make TreePatternNode::getChild return a reference (NFC) The return value of TreePatternNode::getChild is never null. This patch also updates various places that use return values of getChild to also use references. Those changes were suggested post-commit for D47463. llvm-svn: 334764	2018-06-14 20:23:48 +00:00
George Burgess IV	6d5d477e04	[MSSA] Print more optimization information In particular, when asked to print a MemoryAccess, we'll now print where defs are optimized to, and we'll print optimized access types. This patch also introduces an operator<< to make printing AliasResults easier. Patch by Juneyoung Lee! Differential Revision: https://reviews.llvm.org/D47860 llvm-svn: 334760	2018-06-14 19:55:53 +00:00
Sanjay Patel	41b45b2b8c	[x86] be more selective about converting 'and' to shuffle (PR37749) isVectorClearMaskLegal() is the TLI hook used by the generic DAGCombiner::XformToShuffleWithZero(). We've grown to accomodate/expect this transform to shuffle (disabling it more generally results in many regressions). So I'm narrowly excluding the 256-bit types that clearly are not worthwhile for AVX1. I think in most cases we are able to recover by converting the shuffle back into 'and' ops, but the cases in: https://bugs.llvm.org/show_bug.cgi?id=37749 ...show that there are cracks. llvm-svn: 334759	2018-06-14 19:55:02 +00:00
Craig Topper	bb91ae466c	[X86] Fix stale comment in folding tables. llvm-svn: 334758	2018-06-14 19:28:31 +00:00
Tom Stellard	5608dfed6a	AMDGPU/GlobalISel: Implement select() for @llvm.amdgcn.cvt.pkrtz Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45907 llvm-svn: 334757	2018-06-14 19:26:37 +00:00
Justin Bogner	b10d38fe26	Re-apply "[VirtRegRewriter] Avoid clobbering registers when expanding copy bundles" This is r334750 (which was reverted in r334754) with a fix for an uninitialized variable that was caught by msan. Original commit message: > If a copy bundle happens to involve overlapping registers, we can end > up with emitting the copies in an order that ends up clobbering some > of the subregisters. Since instructions in the copy bundle > semantically happen at the same time, this is incorrect and we need to > make sure we order the copies such that this doesn't happen. llvm-svn: 334756	2018-06-14 19:24:03 +00:00
Justin Bogner	0705216f21	Revert "[VirtRegRewriter] Avoid clobbering registers when expanding copy bundles" There's an msan failure: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/19549 This reverts r334750. llvm-svn: 334754	2018-06-14 19:10:57 +00:00
Michael Berg	c2f9b2d826	updating isNegatibleForFree and GetNegatedExpression with fmf for fadd Summary: A FMF constraint is added to FADD with unsafe still available as the fallback Reviewers: spatel, wristow, arsenm, hfinkel Reviewed By: spatel Subscribers: wdng Differential Revision: https://reviews.llvm.org/D48180 llvm-svn: 334753	2018-06-14 18:48:31 +00:00
Sam Clegg	219c563769	[WebAssembly] Ignore explicit section names for functions WebAssembly doesn't support more than one function per section and we rely on function sections being unique. This change ignores the section provided by the function to avoid two functions being in the same section. Without this change the object writer produces the following error for this test: LLVM ERROR: section already has a defining function: baz Differential Revision: https://reviews.llvm.org/D48178 llvm-svn: 334752	2018-06-14 18:48:19 +00:00
Justin Bogner	e2267458e1	[VirtRegRewriter] Avoid clobbering registers when expanding copy bundles If a copy bundle happens to involve overlapping registers, we can end up with emitting the copies in an order that ends up clobbering some of the subregisters. Since instructions in the copy bundle semantically happen at the same time, this is incorrect and we need to make sure we order the copies such that this doesn't happen. Differential Revision: https://reviews.llvm.org/D48154 llvm-svn: 334750	2018-06-14 18:32:55 +00:00
Bruno Cardoso Lopes	11b4351a5f	[CMAKE] Honor CMAKE_OSX_SYSROOT to compute include dir for libxml2 On MacOS, if CMAKE_OSX_SYSROOT is used and the user has command line tools installed, we currently get the include path for libxml2 as /usr/include/libxml2, instead of ${CMAKE_OSX_SYSROOT}/usr/include/libxml2. Make it consistent on MacOS by prefixing ${CMAKE_OSX_SYSROOT} when possible. rdar://problem/41103601 llvm-svn: 334746	2018-06-14 18:19:54 +00:00
Sanjay Patel	8641980e81	[x86] add tests for AVX1 FP logic op abuse (PR37749); NFC Also, add a RUN for AVX2 to make sure that's good. llvm-svn: 334744	2018-06-14 18:08:06 +00:00
Andrea Di Biagio	4f0c283682	[llvm-mca] Add tests for instructions that implicitly clear the upper portion of a super-register. On x86-64, a write to register EAX implicitly clears the upper half or RAX. 128-bit AVX instructions clear the upper 128-bit of the YMM register that aliases the XMM definition register. llvm-mca doesn't know about register writes that implicitly clear the upper portion of an aliasing super-register. This issue will be fixed in a future patch. llvm-svn: 334742	2018-06-14 17:48:42 +00:00

... 3 4 5 6 7 ...

165575 Commits