llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Tim Northover	c802973c73	AArch64: support TLS on Darwin platforms in GlobalISel. All TLS access on Darwin is in the "general dynamic" form where we call a function to resolve the address, so implementation is pretty simple. llvm-svn: 368418	2019-08-09 09:32:38 +00:00
Tim Northover	ce0a779ded	GlobalISel: pack various parameters for lowerCall into a struct. I've now needed to add an extra parameter to this call twice recently. Not only is the signature getting extremely unwieldy, but just updating all of the callsites and implementations is a pain. Putting the parameters in a struct sidesteps both issues. llvm-svn: 368408	2019-08-09 08:26:38 +00:00
Sam Parker	5f411782e9	[ARM][ParallelDSP] Replace SExt uses As loads are combined and widened, we replaced their sext users operands whereas we should have been replacing the uses of the sext. I've added a load of tests, with only a few of them originally causing assertion failures, the rest improve pattern coverage. Differential Revision: https://reviews.llvm.org/D65740 llvm-svn: 368404	2019-08-09 07:48:50 +00:00
Craig Topper	d45a7192a6	[X86] Remove code that expands truncating stores from combineStore. We shouldn't form trunc stores that need to be expanded now that we are using widening legalization. llvm-svn: 368400	2019-08-09 06:59:53 +00:00
Craig Topper	f8e4f25a48	[X86] Remove stale FIXME from combineMaskedStore. NFC I believe PR34584 was tracking that FIXME, but its since been closed and a test case was added. llvm-svn: 368397	2019-08-09 05:55:41 +00:00
Craig Topper	b66418c34c	[X86] Remove DAG combine expansion of extending masked load and truncating masked store. The only way to generate these was through promoting legalization of narrow vectors, but we widen those types now. So we shouldn't produce these nodes. llvm-svn: 368396	2019-08-09 05:53:37 +00:00
Craig Topper	781172a09d	[X86] Remove handler for (U/S)(ADD/SUB)SAT from ReplaceNodeResults. Remove TypeWidenVector check from code that handles X86ISD::VPMADDWD and X86ISD::AVG. More unneeded code since we now legalize narrow vectors by widening. llvm-svn: 368395	2019-08-09 05:17:52 +00:00
Craig Topper	9d60b39a40	[X86] Remove ISD::SETCC handling from ReplaceNodeResults. This is no longer needed since we widen v2i32 instead of promoting. llvm-svn: 368394	2019-08-09 05:17:48 +00:00
Craig Topper	2247ce854f	[X86] Simplify ISD::LOAD handling in ReplaceNodeResults and ISD::STORE handling in LowerStore now that v2i32 is widened to v4i32. llvm-svn: 368390	2019-08-09 03:09:43 +00:00
Craig Topper	4dfff08aa4	[X86] Merge v2f32 and v2i32 gather/scatter handling in ReplaceNodeResults/LowerMSCATTER now that v2i32 is also widened like v2f32. llvm-svn: 368389	2019-08-09 03:09:28 +00:00
Craig Topper	92b2dadd78	[X86] Now unreachable handling for f64->v2i32/v4i16/v8i8 bitcasts from ReplaceNodeResults. We rely on the generic type legalizer for this now. llvm-svn: 368388	2019-08-09 03:09:19 +00:00
Craig Topper	91478a9914	[X86] Simplify ReplaceNodeResults handling for FP_TO_SINT/UINT for vectors to only handle widening. llvm-svn: 368387	2019-08-09 03:09:10 +00:00
Craig Topper	ec46fe9e92	[X86] Simplify ReplaceNodeResults handling for SIGN_EXTEND/ZERO_EXTEND/TRUNCATE for vectors to only handle widening. llvm-svn: 368386	2019-08-09 03:08:54 +00:00
Craig Topper	01bccc836d	[X86] Simplify ReplaceNodeResults handling for UDIV/UREM/SDIV/SREM for vectors to only handle widening. llvm-svn: 368385	2019-08-09 03:08:45 +00:00
Craig Topper	03c1f63231	[X86] Remove vector promotion handling from the ReplaceNodeResults ISD::MUL handling code. We now widen illegal vector types so we don't need this anymore. llvm-svn: 368384	2019-08-09 03:08:28 +00:00
Craig Topper	90f7ce4fe5	[X86] Improve codegen of v8i64->v8i16 and v16i32->v16i8 truncate with avx512vl, avx512bw, min-legal-vector-width<=256 and prefer-vector-width=256 Under this configuration we'll want to split the v8i64 or v16i32 into two vectors. The default legalization will try to truncate each of those 256-bit pieces one step to 128-bit, concatenate those, then truncate one more time from the new 256 to 128 bits. With this patch we now truncate the two splits to 64-bits then concatenate those. We have to do this two different ways depending on whether have widening legalization enabled. Without widening legalization we have to manually construct X86ISD::VTRUNC to prevent the ISD::TRUNCATE with a narrow result being promoted to 128 bits with a larger element type than what we want followed by something like a pshufb to grab the lower half of each element to finish the job. With widening legalization we just get the right thing. When we switch to widening by default we can just delete the other code path. Differential Revision: https://reviews.llvm.org/D65626 llvm-svn: 368349	2019-08-08 21:36:47 +00:00
Brian Cain	243d6b5ca3	[llvm-mc] Add reportWarning() to MCContext Adding reportWarning() to MCContext, so that it can be used from the Hexagon assembler backend. llvm-svn: 368327	2019-08-08 19:13:23 +00:00
Craig Topper	647e2e70ab	[X86] Make CMPXCHG16B feature imply CMPXCHG8B feature. This fixes znver1 so that it properly enables CMPXHG8B. We can probably remove explicit CMPXCHG8B from CPUs that also have CMPXCHG16B, but keeping this simple to allow cherry pick to 9.0. Fixes PR42935. llvm-svn: 368324	2019-08-08 18:11:17 +00:00
Pirama Arumuga Nainar	e1c7e683da	[AArch64] Do not emit '#' before immediates in inline asm Summary: The A64 assembly language does not require the '#' character to introduce constant immediate operands. Avoid the '#' since the AArch64 asm parser does not accept '#' before the lane specifier and rejects the following: __asm__ ("fmla v2.4s, v0.4s, v1.s[%0]" :: "I"(0x1)) Fix a test to not expect the '#' and add a new test case with the above asm. Fixes: https://github.com/android-ndk/ndk/issues/1036 Reviewers: peter.smith, kristof.beyls Subscribers: javed.absar, hiraditya, llvm-commits, srhines Tags: #llvm Differential Revision: https://reviews.llvm.org/D65550 llvm-svn: 368320	2019-08-08 17:50:39 +00:00
Simon Pilgrim	eb76021ab0	[X86] XFormVExtractWithShuffleIntoLoad - handle shuffle mask scaling If the target shuffle mask is from a wider type, attempt to scale the mask so that the extraction can attempt to peek through. Fixes the regression mentioned in rL368307 llvm-svn: 368308	2019-08-08 16:05:23 +00:00
Simon Pilgrim	d43cdb1e90	[X86] SimplifyDemandedVectorElts - attempt to recombine target shuffle using DemandedElts mask If we don't demand all elements, then attempt to combine to a simpler shuffle. At the moment we can only do this if Depth == 0 as combineX86ShufflesRecursively uses Depth to track whether the shuffle has really changed or not - we'll need to change this before we can properly start merging combineX86ShufflesRecursively into SimplifyDemandedVectorElts. The insertps-combine.ll regression is because XFormVExtractWithShuffleIntoLoad can't see through shuffles of different widths - this will be fixed in a follow-up commit. llvm-svn: 368307	2019-08-08 15:54:20 +00:00
David Tenty	fb66d80ccb	Enable assembly output of local commons for AIX Summary: This patch enable assembly output of local commons for AIX using .lcomm directives. Adds a EmitXCOFFLocalCommonSymbol to MCStreamer so we can emit the AIX version of .lcomm assembly directives which include a csect name. Handle the case of BSS locals in PPCAIXAsmPrinter by using EmitXCOFFLocalCommonSymbol. Adds a test for generating .lcomm on AIX Targets. Reviewers: cebowleratibm, hubert.reinterpretcast, Xiangling_L, jasonliu, sfertile Reviewed By: sfertile Subscribers: wuzish, nemanjai, hiraditya, kbarton, MaskRay, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64825 llvm-svn: 368306	2019-08-08 15:40:35 +00:00
David Green	b937fefeae	[ARM] Add support for MVE pre and post inc loads and stores This adds pre- and post- increment and decrements for MVE loads and stores. It uses the builtin pre and post load/store detection, unlike Neon. Loads are selected with the code in tryT2IndexedLoad, stores are selected with tablegen patterns. The immediates have a +/-7bit range, multiplied by the size of the element. Differential Revision: https://reviews.llvm.org/D63840 llvm-svn: 368305	2019-08-08 15:27:58 +00:00
David Green	3265a2671b	[ARM] MVE big endian loads/stores This adds some missing patterns for big endian loads/stores, allowing unaligned loads/stores to also be selected with an extra VREV, which produces better code than aligning through a stack. Also moves VLDR_P0 to not be LE only, and adjusts some of the tests to show all that working. Differential Revision: https://reviews.llvm.org/D65583 llvm-svn: 368304	2019-08-08 15:15:19 +00:00
Sam Elliott	db4202796c	[RISCV] Allow ABI Names in Inline Assembly Constraints Summary: Clang will replace references to registers using ABI names in inline assembly constraints with references to architecture names, but other frontends do not. LLVM uses the regular assembly parser to parse inline asm, so inline assembly strings can contain references to registers using their ABI names. This patch adds support for parsing constraints using either the ABI name or the architectural register name. This means we do not need to implement the ABI name replacement code in every single frontend, especially those like Rust which are a very thin shim on top of LLVM IR's inline asm, and that constraints can more closely match the assembly strings they refer to. Reviewers: asb, simoncook Reviewed By: simoncook Subscribers: hiraditya, rbar, johnrusso, JDevlieghere, apazos, sabuasal, niosHD, kito-cheng, shiva0217, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, rkruppe, PkmX, jocewei, psnobl, benna, Jim, s.egerton, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65947 llvm-svn: 368303	2019-08-08 14:59:16 +00:00
Sam Elliott	3d793f3d94	[RISCV] Minimal stack realignment support Summary: Currently the RISC-V backend does not realign the stack. This can be an issue even for the RV32I/RV64I ABIs (where the stack is 16-byte aligned), though is rare. It will be much more comment with RV32E (though the alignment requirements for common data types remain under-documented...). This patch adds minimal support for stack realignment. It should cope with large realignments. It will error out if the stack needs realignment and variable sized objects are present. It feels like a lot of the code like getFrameIndexReference and determineFrameLayout could be refactored somehow, as right now it feels fiddly and brittle. We also seem to allocate a lot more memory than GCC does for equivalent C code. Reviewers: asb Reviewed By: asb Subscribers: wwei, jrtc27, s.egerton, MaskRay, Jim, lenary, hiraditya, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, kito-cheng, shiva0217, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, rkruppe, PkmX, jocewei, psnobl, benna, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62007 llvm-svn: 368300	2019-08-08 14:40:54 +00:00
Simon Pilgrim	aafea54fa1	[X86][SSE] matchBinaryPermuteShuffle - split INSERTPS combines We need to prefer INSERTPS with zeros over SHUFPS, but fallback to INSERTPS if that fails. llvm-svn: 368292	2019-08-08 13:23:53 +00:00
Petar Avramovic	d8b60f7f6e	[MIPS GlobalISel] Select jump_table and brjt G_JUMP_TABLE and G_BRJT appear from translation of switch statement. Select these two instructions for MIPS32, both pic and non-pic. Differential Revision: https://reviews.llvm.org/D65861 llvm-svn: 368274	2019-08-08 10:21:12 +00:00
Sam Tebbs	63fc526cc7	[ARM] Select VFMA llvm-svn: 368264	2019-08-08 08:21:01 +00:00
Craig Topper	47054375f9	[X86] Remove -x86-experimental-vector-widening-legalization command line option and all its uses. This option is now defaulted to true and we don't want to support turning it off so remove the option. llvm-svn: 368258	2019-08-08 06:48:22 +00:00
David Green	58df371db5	[ARM] Tighten up VLDRH.32 with low alignments VLDRH needs to have an alignment of at least 2, including the widening/narrowing versions. This tightens up the ISel patterns for it and alters allowsMisalignedMemoryAccesses so that unaligned accesses are expanded through the stack. It also fixed some incorrect shift amounts, which seemed to be passing a multiple not a shift. Differential Revision: https://reviews.llvm.org/D65580 llvm-svn: 368256	2019-08-08 06:22:03 +00:00
Craig Topper	0358274bb2	[X86] Add CMOV_FR32X and CMOV_FR64X to the isCMOVPseudo function. llvm-svn: 368250	2019-08-08 04:40:59 +00:00
Amy Huang	87f32fc007	Recommit "[MS] Emit S_HEAPALLOCSITE debug info in Selection DAG" with a fix to clear the SDNode map when SelectionDAG is cleared. llvm-svn: 368230	2019-08-07 22:49:40 +00:00
Craig Topper	46df59487a	[X86] Allow pack instructions to be used for 512->256 truncates when -mprefer-vector-width=256 is causing 512-bit vectors to be split If we're splitting the 512-bit vector anyway and we have zero/sign bits, then we might as well use pack instructions to concat and truncate at once. Differential Revision: https://reviews.llvm.org/D65904 llvm-svn: 368210	2019-08-07 21:16:10 +00:00
Craig Topper	0663a19f9d	Recommit r367901 "[X86] Enable -x86-experimental-vector-widening-legalization by default." The assert that caused this to be reverted should be fixed now. Original commit message: This patch changes our defualt legalization behavior for 16, 32, and 64 bit vectors with i8/i16/i32/i64 scalar types from promotion to widening. For example, v8i8 will now be widened to v16i8 instead of promoted to v8i16. This keeps the elements widths the same and pads with undef elements. We believe this is a better legalization strategy. But it carries some issues due to the fragmented vector ISA. For example, i8 shifts and multiplies get widened and then later have to be promoted/split into vXi16 vectors. This has the potential to cause regressions so we wanted to get it in early in the 10.0 cycle so we have plenty of time to address them. Next steps will be to merge tests that explicitly test the command line option. And then we can remove the option and its associated code. llvm-svn: 368183	2019-08-07 16:24:26 +00:00
Oliver Cruickshank	2c592de5df	[ARM] Expand CTPOP intrinsic for MVE llvm-svn: 368180	2019-08-07 15:47:45 +00:00
Simon Pilgrim	d39c9b85f3	[X86] EltsFromConsecutiveLoads - early out for non-byte sized memory (PR42909) Don't attempt to merge loads for types that aren't modulo 8-bits. llvm-svn: 368165	2019-08-07 12:41:59 +00:00
Sander de Smalen	315ff30388	[AArch64][WinCFI] Do not pair callee-save instructions in LoadStoreOptimizer Prevent the LoadStoreOptimizer from pairing any load/store instructions with instructions from the prologue/epilogue if the CFI information has encoded the operations as separate instructions. This would otherwise lead to a mismatch of the actual prologue size from the size as recorded in the Windows CFI. Reviewers: efriedma, mstorsjo, ssijaric Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D65817 llvm-svn: 368164	2019-08-07 12:41:38 +00:00
Simon Atanasyan	8011fc63d9	[mips] Make a couple of class methods plain static functions. NFC llvm-svn: 368162	2019-08-07 12:21:41 +00:00
Simon Atanasyan	8225186151	[mips] Use isMicroMips() function to check enabled feature flag. NFC llvm-svn: 368161	2019-08-07 12:21:32 +00:00
Simon Atanasyan	9a3474505e	[Mips] Instruction `sc` now accepts symbol as an argument Function MipsAsmParser::expandMemInst() did not properly handle instruction `sc` with a symbol as an argument because first argument would be counted twice. We add additional checks and handle this case separately. Patch by Mirko Brkusanin. Differential Revision: https://reviews.llvm.org/D64252 llvm-svn: 368160	2019-08-07 12:21:26 +00:00
Benjamin Kramer	b3f29d6f74	Replace llvm::MutexGuard/UniqueLock with their standard equivalents All supported platforms have <mutex> now, so we don't need our own copies any longer. No functionality change intended. llvm-svn: 368149	2019-08-07 10:57:25 +00:00
Oliver Cruickshank	3db897a47d	[ARM] Generate MVE VHADDs/VHSUBs llvm-svn: 368146	2019-08-07 10:26:57 +00:00
Sam Parker	c289d4459d	[ARM][LowOverheadLoops] Revert after read/write Currently we check whether LR is stored/loaded to/from inbetween the loop decrement and loop end pseudo instructions. There's two problems here: - It relies on all load/store instructions being labelled as such in tablegen. - Actually any use of loop decrement is troublesome because the value doesn't exist! So we need to check for any read/write of LR that occurs between the two instructions and revert if we find anything. Differential Revision: https://reviews.llvm.org/D65792 llvm-svn: 368130	2019-08-07 07:39:19 +00:00
Craig Topper	bc8edb7abd	[X86] Allow any 8-bit immediate to be used with bt/btc/btr/bts memory aliases. We have aliases that disambiguate memory forms of bt/btc/btr/bts without suffixes to the 32-bit form. These aliases should have been updated when the instructions were updated in r356413. llvm-svn: 368127	2019-08-07 06:17:58 +00:00
Craig Topper	a38d4eed10	[X86] Use isInt<8> to simplify some code. NFC llvm-svn: 368126	2019-08-07 06:17:55 +00:00
Craig Topper	7b7ab0396a	[X86] Limit vpermil2pd/vpermil2ps immediates to 4 bits in the assembly parser. The upper 4 bits of the immediate byte are used to encode a register. We need to limit the explicit immediate to fit in the remaining 4 bits. Fixes PR42899. llvm-svn: 368123	2019-08-07 05:34:27 +00:00
Mitch Phillips	a7e8bd24bd	Revert "[X86] Enable -x86-experimental-vector-widening-legalization by default." This reverts commit 3de33245d2c992c9e0af60372043540b60f3a810. This commit broke the MSan buildbots. See https://reviews.llvm.org/rL367901 for more information. llvm-svn: 368107	2019-08-06 23:00:43 +00:00
Craig Topper	bcf0742f7f	[X86] Don't allow combineSIntToFP to create v2i32 vectors after type legalization. If we're after type legalization we should only be trying to turn v2i64 into v2i32. So bitcast to v4i32, shuffle the even elements together. Then use X86ISD::CVTSI2P. The alternative is to leave the v2i64 type alone and let it scalarized. Hopefully keeping it packed is better. Fixes PR42905. llvm-svn: 368091	2019-08-06 21:43:15 +00:00
Aditya Nandakumar	c0251c505e	[GISel]: Add GISelKnownBits analysis https://reviews.llvm.org/D65698 This adds a KnownBits analysis pass for GISel. This was done as a pass (compared to static functions) so that we can add other features such as caching queries(within a pass and across passes) in the future. This patch only adds the basic pass boiler plate, and implements a lazy non caching knownbits implementation (ported from SelectionDAG). I've also hooked up the AArch64PreLegalizerCombiner pass to use this - there should be no compile time regression as the analysis is lazy. llvm-svn: 368065	2019-08-06 17:18:29 +00:00

1 2 3 4 5 ...

53346 Commits