llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Diana Picus	83ac5c0a7d	[BPF] Remove exit-on-error flag in test (PR27767) The exit-on-error flag is needed to avoid an assert where llvm::SelectionDAGISel::LowerArguments doesn't create enough arguments. Fill up with zeroes to reach the right number of args. Fixes PR27767. Differential Revision: http://reviews.llvm.org/D20571 llvm-svn: 270855	2016-05-26 15:23:50 +00:00
Chad Rosier	be99101f8a	[AArch64] Generate a BFI/BFXIL from 'or (and X, MaskImm), OrImm'. If and only if the value being inserted sets only known zero bits. This combine transforms things like and w8, w0, #0xfffffff0 movz w9, #5 orr w0, w8, w9 into movz w8, #5 bfxil w0, w8, #0, #4 The combine is tuned to make sure we always reduce the number of instructions. We avoid churning code for what is expected to be performance neutral changes (e.g., converted AND+OR to OR+BFI). Differential Revision: http://reviews.llvm.org/D20387 llvm-svn: 270846	2016-05-26 13:27:56 +00:00
Rafael Espindola	e388c6f6a2	Use shouldAssumeDSOLocal on AArch64. This reduces code duplication and now AArch64 also handles PIE. llvm-svn: 270844	2016-05-26 12:42:55 +00:00
Igor Breger	d6da40dfa4	[AVX512] Fix intrinsic cmp{sd\|ss} lowering. Differential Revision: http://reviews.llvm.org/D20615 llvm-svn: 270843	2016-05-26 12:42:25 +00:00
Chris Dewhurst	716ef61879	[Sparc] Extend the assembler printing support for Sparc back-end. Allows display of floating-point registers and display of assembler meta-data output. llvm-svn: 270829	2016-05-26 07:28:31 +00:00
Justin Lebar	db58249ac7	[NVPTX] Don't (incorrectly) say that the NVVMReflect pass preserves all analyses. Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D20585 llvm-svn: 270790	2016-05-25 23:12:38 +00:00
Rafael Espindola	c606f7449a	Don't repeat name in comment and git-clang-format. llvm-svn: 270785	2016-05-25 22:44:06 +00:00
Rafael Espindola	c2bb75e7bf	Sort includes. llvm-svn: 270769	2016-05-25 21:37:29 +00:00
Simon Pilgrim	2acf33db0b	Simplify std::all_of/any_of predicates by using llvm::all_of/any_of. NFCI. llvm-svn: 270753	2016-05-25 20:41:11 +00:00
Rafael Espindola	2224908028	Fix shouldAssumeDSOLocal for private linkage. llvm-svn: 270746	2016-05-25 19:55:16 +00:00
Matt Arsenault	1f6cee6a4f	AMDGPU: Fix v2i64/v2f64 bitcasts These operations tend to get promoted away to v4i32 so this doesn't happen often. llvm-svn: 270740	2016-05-25 18:07:36 +00:00
Matt Arsenault	e21e61958d	AMDGPU: Fix inconsistent lowering of select of vectors f32 vectors would use a sequence of BFI instructions instead of unrolled cmp + select. This was better in the case of a VALU select with SGPR inputs, but we don't have a way of dealing with that in the DAG. llvm-svn: 270731	2016-05-25 17:34:58 +00:00
Sanjay Patel	e582594538	[x86] avoid code explosion from LoopVectorizer for gather loop (PR27826) By making pointer extraction from a vector more expensive in the cost model, we avoid the vectorization of a loop that is very likely to be memory-bound: https://llvm.org/bugs/show_bug.cgi?id=27826 There are still bugs related to this, so we may need a more general solution to avoid vectorizing obviously memory-bound loops when we don't have HW gather support. Differential Revision: http://reviews.llvm.org/D20601 llvm-svn: 270729	2016-05-25 17:27:54 +00:00
Sanjay Patel	289425eb9f	[x86, AVX] allow explicit calls to VZERO* to modify state in VZeroUpperInserter pass (PR27823) As noted in the review, there are still problems, so this doesn't the bug completely. Differential Revision: http://reviews.llvm.org/D20529 llvm-svn: 270718	2016-05-25 16:39:47 +00:00
Simon Pilgrim	1a1ddc32da	[X86][SSE] Replace (V)CVTDQ2PD(Y) and (V)CVTPS2PD(Y) lossless conversion intrinsics with generic IR Followup to D20528 clang patch, this removes the (V)CVTDQ2PD(Y) and (V)CVTPS2PD(Y) llvm intrinsics and auto-upgrades to sitofp/fpext instead. Differential Revision: http://reviews.llvm.org/D20568 llvm-svn: 270678	2016-05-25 08:59:18 +00:00
Craig Topper	4710ab1424	[X86] Remove the llvm.x86.sse2.storel.dq intrinsic. It hasn't been used in a long time. llvm-svn: 270677	2016-05-25 06:56:32 +00:00
Nirav Dave	9f0b74dd18	Soften assertion in AMDGPU emitPrologue. [AMDGPU] emitPrologue looks for an unused unallocated SGPR that is not the scratch descriptor. Continue search if unused register found fails other requirements. Reviewers: arsenm, tstellarAMD, nhaehnle Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D20526 llvm-svn: 270646	2016-05-25 01:45:42 +00:00
Dan Gohman	e46ddfaa34	[WebAssembly] Put __stack_pointer in the offset field of loads and stores. Instead of this: i32.const $push10=, __stack_pointer i32.load $push11=, 0($pop10) Emit this: i32.const $push10=, 0 i32.load $push11=, __stack_pointer($pop10) It's not currently clear which is better, though there's a chance the second form may be better at overall compression. We can revisit this when we have more data; for now it makes sense to make PEI consistent with isel. Differential Revision: http://reviews.llvm.org/D20411 llvm-svn: 270635	2016-05-24 23:47:41 +00:00
Konstantin Zhuravlyov	480521dd43	[AMDGPU][NFC] Rename ReserveTrapVGPRs -> ReserveRegs Differential Revision: http://reviews.llvm.org/D20081 llvm-svn: 270594	2016-05-24 18:37:18 +00:00
Sam Kolton	5c1a0d6afe	[AMDGPU] Assembler: rework parsing of optional operands. Summary: Change process of parsing of optional operands. All optional operands use same parsing method - parseOptionalOperand(). No default values are added to OperandsVector. Get rid of WORKAROUND_USE_DUMMY_OPERANDS_INSTEAD_MUTIPLE_DEFAULT_OPERANDS. Reviewers: tstellarAMD, vpykhtin, artem.tamazov, nhaustov Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D20527 llvm-svn: 270556	2016-05-24 12:38:33 +00:00
Artem Tamazov	068739d10c	[AMDGPU][llvm-mc] Disassembler: support for TTMP/TBA/TMA registers. Differential Revision: http://reviews.llvm.org/D20476 llvm-svn: 270552	2016-05-24 12:05:16 +00:00
Igor Breger	29f643cd8e	[llvm][AVX512][intrinsics] Fix vperm{b\|w\|d\|q\|ps\|pd} intrinsics. Index is second argument to buildin function but it is first instruction operand. Differential Revision: http://reviews.llvm.org/D20515 llvm-svn: 270548	2016-05-24 11:06:22 +00:00
Sagar Thakur	5eb36c5e30	[MIPS][LLVM-MC] Fix Disassemble of Negative Offset Patch by Nitesh Jain. Summary: The type of Imm in MipsDisassembler.cpp was incorrect since SignExtend64 return int64_t type.As per the MIPSr6 doc ,the offset is added to the address of the instruction following the branch (not the branch itself), to form a PC-relative effective target address hence “4” is added to the offset. The offset of some test case are update to reflect the changes due to “ + 4 ” offset and new test case for negative offset are added. Reviewers: dsanders, vkalintiris Differential Revision: http://reviews.llvm.org/D17540 llvm-svn: 270542	2016-05-24 09:57:10 +00:00
Simon Pilgrim	de7240c8f6	[CostModel][X86][XOP] Added XOP costmodel for BITREVERSE Now that we have a nice fast VPPERM solution. Added framework for future intrinsic costs as well. llvm-svn: 270537	2016-05-24 08:17:50 +00:00
Dan Gohman	c3ac030898	[WebAssembly] Basic TargetTransformInfo support for SIMD128. llvm-svn: 270508	2016-05-23 22:47:07 +00:00
James Y Knight	f767eef5cb	[SPARC] Fix 8 and 16-bit atomic load and store. They were accidentally using the 32-bit load/store instruction for 8/16-bit operations, due to incorrect patterns (8/16-bit cmpxchg and atomicrmw will be fixed in subsequent changes) llvm-svn: 270486	2016-05-23 20:33:00 +00:00
Sanjay Patel	62f9635295	fix typo; NFC llvm-svn: 270469	2016-05-23 18:01:20 +00:00
Sanjay Patel	7d5178a58b	use range-loop; NFCI llvm-svn: 270467	2016-05-23 18:00:50 +00:00
Dan Gohman	b74c5addf0	[WebAssembly] Speed up LiveIntervals updating. Use the more specific LiveInterval::removeSegment instead of LiveInterval::shrinkToUses when we know the specific range that's being removed. llvm-svn: 270463	2016-05-23 17:42:57 +00:00
Krzysztof Parzyszek	3964c84399	[Hexagon] Move some debug-only variable declarations into DEBUG llvm-svn: 270459	2016-05-23 17:31:30 +00:00
Aaron Ballman	aea6907e5f	Removing a switch statement that contains only a default label; NFC. llvm-svn: 270444	2016-05-23 15:52:59 +00:00
Diana Picus	f989b6c3b6	[BPF] Remove exit-on-error flag in test (PR27766) The exit-on-error flag on the many_args1.ll test is needed to avoid an unreachable in BPFTargetLowering::LowerCall. We can also avoid it by ignoring any superfluous arguments to the call (i.e. any arguments after the first 5). Fixes PR27766. Differential Revision: http://reviews.llvm.org/D20471 v2 of r270419 llvm-svn: 270440	2016-05-23 14:57:19 +00:00
Renato Golin	317bd564ab	Reverts "[BPF] Remove exit-on-error flag in test (PR27766)" This patch reverts r270419 because it broke a lot of buildbots, mostly Windows. We'd like help in investigating the issues, but for now, it should stay out. llvm-svn: 270433	2016-05-23 13:02:11 +00:00
Diana Picus	0a143f0f02	[BPF] Remove exit-on-error flag in test (PR27766) The exit-on-error flag on the many_args1.ll test is needed to avoid an unreachable in BPFTargetLowering::LowerCall. We can also avoid it by ignoring any superfluous arguments to the call (i.e. any arguments after the first 5). Fixes PR27766 llvm-svn: 270419	2016-05-23 12:33:34 +00:00
Chris Dewhurst	14488dfcf0	[Sparc] LEON erratum fix - Delay Slot Filler modification. This code should have been with the previous check-in (r270417) and prevents the DelaySlotFiller pass being utilized in functions where the erratum fix has been applied as this will break the run-time code. llvm-svn: 270418	2016-05-23 11:52:28 +00:00
Chris Dewhurst	11fab31200	[Sparc][LEON] LEON Erratum fix. Insert NOP after LD or LDF instruction. Due to an erratum in some versions of LEON, we must insert a NOP after any LD or LDF instruction to ensure the processor has time to load the value correctly before using it. This pass will implement that erratum fix. The code will have no effect for other Sparc, but non-LEON processors. Differential Review: http://reviews.llvm.org/D20353 llvm-svn: 270417	2016-05-23 10:56:36 +00:00
Sam Kolton	59aa17c27c	[AMDGPU] Assembler: refactor parsing of modifiers and immediates. Allow modifiers for imms. Reviewers: nhaustov, tstellarAMD Subscribers: kzhuravl, arsenm Differential Revision: http://reviews.llvm.org/D20166 llvm-svn: 270415	2016-05-23 09:59:02 +00:00
Jacob Baungard Hansen	1036b51c05	Test commit llvm-svn: 270414	2016-05-23 09:41:44 +00:00
Craig Topper	62ac946928	[X86] Use instruction aliases to replace custom asm parser code for optimizing moves to use 2 byte VEX prefix. llvm-svn: 270394	2016-05-23 04:02:27 +00:00
Craig Topper	ae0615175a	[AVX512] Add patterns to implement stores of extracts of least signficant subvectors using XMM or YMM stores instead of the vector extract instructions. Similar is already done for AVX and we had lost it going to AVX512VL. llvm-svn: 270383	2016-05-22 23:44:33 +00:00
Sanjay Patel	b2763427c2	[x86, AVX] don't add a vzeroupper if that's what the code is already doing (PR27823) This isn't the complete fix, but it handles the trivial examples of duplicate vzero* ops in PR27823: https://llvm.org/bugs/show_bug.cgi?id=27823 ...and amusingly, the bogus cases already exist as regression tests, so let's take this baby step. We'll need to do more in the general case where there's legitimate AVX usage in the function + there's already a vzero in the code. Differential Revision: http://reviews.llvm.org/D20477 llvm-svn: 270378	2016-05-22 20:22:47 +00:00
Igor Breger	7450d33b3c	[AVX512] Implement missing patterns for any_extend load lowering. Differential Revision: http://reviews.llvm.org/D20513 llvm-svn: 270357	2016-05-22 10:21:04 +00:00
Craig Topper	40ab0cb8bb	[AVX512] The AVX512 file only need subtract_subvector index 0 patterns where the source is 512-bits. The 256-bit source patterns were redundant with AVX. llvm-svn: 270356	2016-05-22 07:40:58 +00:00
Craig Topper	ec1c660b77	[AVX512] Add an AddedComplexity line to the 512-bit insert_subvector undef index 0 patterns. This gives them higher priority than the memory patterns. This matches AVX1/2. llvm-svn: 270355	2016-05-22 07:40:40 +00:00
Craig Topper	b6b424b279	[AVX512] Change the AddedComplexity on some patterns to match their AVX/SSE equivalents. This helps group them close together in the isel tables and enable table compression. llvm-svn: 270354	2016-05-22 06:09:34 +00:00
Craig Topper	a6980162a3	[AVX512] Add a couple patterns to fix some cases where two vector mask inversions could appear in a row. llvm-svn: 270344	2016-05-22 00:39:30 +00:00
Craig Topper	0754f22fa5	[AVX512] Remove seemingly unnecessary AddedComplexity adjustment. llvm-svn: 270343	2016-05-22 00:39:27 +00:00
Craig Topper	c1c9bee262	[X86] Remove unnecessary alignment check on patterns that use VEXTRACTF128 for integer types when only AVX1 is supported. llvm-svn: 270335	2016-05-21 22:50:18 +00:00
Craig Topper	ff8ed4829f	[AVX512] Add patterns for extracting subvectors and storing to memory. llvm-svn: 270334	2016-05-21 22:50:14 +00:00
Craig Topper	e6ae74c913	[AVX512] Capitalize the Z in VEXTRACTPSzmr. Lowercase z has been primarily used to indicating the zero masking behavior which is not the case here. NFC llvm-svn: 270333	2016-05-21 22:50:11 +00:00
Craig Topper	29ae9d7e02	[AVX512] Rename vector extract instructions so 'mr' intead of 'rm' to reflect the fact that memory is the destination. llvm-svn: 270332	2016-05-21 22:50:09 +00:00
Craig Topper	6bf6b5c2f4	[AVX512] Fix copy/paste mistake a I made in a comment. llvm-svn: 270331	2016-05-21 22:50:04 +00:00
Michael Zuckerman	306643b672	[Clang][AVX512][intrinsics] Fix rcp and sqrt intrinsics. Differential Revision: http://reviews.llvm.org/D20438 llvm-svn: 270322	2016-05-21 14:44:18 +00:00
Michael Zuckerman	3d906b3ef7	[Clang][AVX512][intrinsics] Fix vscalef intrinsics. Differential Revision: http://reviews.llvm.org/D20324 llvm-svn: 270321	2016-05-21 11:09:53 +00:00
Craig Topper	79cd5a6b41	[AVX512] Add patterns for VEXTRACT v16i16->v8i16 and v32i8->v16i8. Disable AVX2 versions of vector extract when AVX512VL is enabled. llvm-svn: 270318	2016-05-21 07:08:56 +00:00
Craig Topper	f3e023e70e	[AVX512] Disable AVX2 VPERMD, VPERMQ, VPERMPS, and VPERMPD patterns when AVX512VL is enabled. Also add shuffle comment printing for AVX512VL VPERMPD/VPERMQ to keep some tests that now use these instructions instead of the AVX2 ones. llvm-svn: 270317	2016-05-21 06:07:18 +00:00
Craig Topper	30a8fe51db	[AVX512] Disable AVX/AVX2 VBROADCASTSS/VBROADCASTSD patterns when AVX512VL is enabled. llvm-svn: 270316	2016-05-21 05:47:25 +00:00
Matt Arsenault	c4ee204f5c	AMDGPU: Define priorities for register classes Allocating larger register classes first should give better allocation results (and more importantly for myself, make the lit tests more stable with respect to scheduler changes). Patch by Matthias Braun llvm-svn: 270312	2016-05-21 03:55:07 +00:00
Craig Topper	2b54c30436	[AVX512] Disable AVX/AVX2 patterns for VPSADBW and VPMULUDQ when the AVX512VL/AVX512BWI equivalents are available. llvm-svn: 270311	2016-05-21 03:52:32 +00:00
Craig Topper	8ca6c23ba5	[X86] Convert some SSE2/AVX2 intrinsics to ISD opcodes during lowering instead of pattern matching the intrinsics. This unifies handling with AVX512 and allows these intrinsics to select EVEX encoded instructions to increase available registers. llvm-svn: 270310	2016-05-21 03:52:28 +00:00
Matt Arsenault	1eaf7c8b10	AMDGPU: Cleanup lowering actions These are kind of a mess and hard to follow, particularly for loads and stores. Fix various redundant, unnecessary and dead settings. llvm-svn: 270307	2016-05-21 02:27:49 +00:00
Matt Arsenault	44230570f6	AMDGPU: Fix high bits after division optimization This is essentially doing a 24-bit signed division with FP. We need to truncate to the N bit result. llvm-svn: 270305	2016-05-21 01:53:33 +00:00
Dylan McKay	d7d0f71629	[AVR] Add AVRMCAsmInfo llvm-svn: 270302	2016-05-21 01:06:37 +00:00
Matt Arsenault	06895e862f	AMDGPU: Fix verifier error when spilling SGPRs The current SGPR spilling test does not stress this because it is using s_buffer_load instructions to increase SGPR pressure and spill, but their output operands have the same SReg_32_XM0 constraint. This fixes an error when the SReg_32 output from most instructions is spilled. llvm-svn: 270301	2016-05-21 00:53:42 +00:00
Matt Arsenault	4805dff2a5	AMDGPU: Fix relationship between SReg_32 and SReg_32_XM0 llvm-svn: 270300	2016-05-21 00:53:28 +00:00
Dylan McKay	17a442f81e	[AVR] Fix header files in MCTargetDesc Everything now compiles successfully, but there are still undefined references. llvm-svn: 270298	2016-05-21 00:35:07 +00:00
Matt Arsenault	c34a7d2258	AMDGPU: Handle cbranch vccz/vccnz llvm-svn: 270297	2016-05-21 00:29:40 +00:00
Matt Arsenault	5438a4669d	AMDGPU: Implement ReverseBranchCondition llvm-svn: 270296	2016-05-21 00:29:34 +00:00
Matt Arsenault	a197a65904	AMDGPU: Implement AnalyzeBranch Original patch by Tom Stellard llvm-svn: 270295	2016-05-21 00:29:27 +00:00
Dan Gohman	920c7d7490	[WebAssembly] Optimize away return instructions using fallthroughs. This saves a small amount of code size, and is a first small step toward passing values on the stack across block boundaries. Differential Review: http://reviews.llvm.org/D20450 llvm-svn: 270294	2016-05-21 00:21:56 +00:00
Dylan McKay	1126d23533	[AVR] Fix signuature of AVRTargetMachine constructor llvm-svn: 270292	2016-05-20 23:39:04 +00:00
Justin Bogner	20645f2085	SDAG: Implement Select instead of SelectImpl in PPCDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 270283	2016-05-20 21:43:23 +00:00
Jacques Pienaar	4813cd5255	[lanai] Change reloc to use PIC_ by default and cleanup. * Change reloc to PIC_; * Cleanup (clang-format & modify test); llvm-svn: 270282	2016-05-20 21:41:53 +00:00
David Majnemer	08b442df36	Address post-review for r270246 This gets rid of some unnecessary SmallStrings in X86TargetMachine::getSubtargetImpl. No functionality change is intended. llvm-svn: 270270	2016-05-20 20:41:24 +00:00
Jun Bum Lim	3a259859b7	[AArch64] Disable narrow load merge by default Summary: As this optimization converts two loads into one load with two shift instructions, it could potentially hurt performance if a loop is arithmetic operation intensive. Reviewers: t.p.northover, mcrosier, jmolloy Subscribers: evandro, jmolloy, aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D20172 llvm-svn: 270251	2016-05-20 18:45:49 +00:00
David Majnemer	7c0e6f025f	[X86] Reduce memory allocations in X86TargetMachine::getSubtargetImpl We performed a number of memory allocations each time getTTI was called, remove them by using SmallString. No functionality change intended. llvm-svn: 270246	2016-05-20 18:16:06 +00:00
Sanjay Patel	d0547f1204	fix comments; NFC llvm-svn: 270237	2016-05-20 17:07:19 +00:00
Sanjay Patel	2fd53c6d6b	use range-loops; NFCI llvm-svn: 270236	2016-05-20 17:00:10 +00:00
Sanjay Patel	56256e385a	fix documentation comments; NFC llvm-svn: 270234	2016-05-20 16:46:01 +00:00
Simon Pilgrim	95ba516d50	[X86][AVX] Generalized matching for target shuffle combines This patch is a first step towards a more extendible method of matching combined target shuffle masks. Initially this just pulls out the existing basic mask matches and adds support for some 256/512 bit equivalents. Future patterns will require a number of features to be added but I wanted to keep this patch simple. I hope we can avoid duplication between shuffle lowering and combining and share more complex pattern match functions in future commits. Differential Revision: http://reviews.llvm.org/D19198 llvm-svn: 270230	2016-05-20 16:19:30 +00:00
Rafael Espindola	3acc1df4cd	Refactor X86 symbol access classification. This refactors the logic in X86 to avoid code duplication. It also splits it in two steps: it first decides if a symbol is local to the DSO and then uses that information to decide how to access it. The first part is implemented by shouldAssumeDSOLocal. It is not in any way specific to X86. In a followup patch I intend to move it to somewhere common and reused it in other backends. llvm-svn: 270209	2016-05-20 12:20:10 +00:00
Rafael Espindola	8b4b8109e9	Simplify handling of hidden stubs on PowerPC. We now handle them just like non hidden ones. This was already the case on x86 (r207518) and arm (r207517). llvm-svn: 270205	2016-05-20 12:00:52 +00:00
NAKAMURA Takumi	60e2685884	SparcISelLowering.cpp: Add missing StringSwitch.h llvm-svn: 270200	2016-05-20 10:53:56 +00:00
Chris Dewhurst	1afaa64b44	[Sparc] Implement getRegisterByName. Allows Sparc registers to be specifically referred to in inline assembly. llvm-svn: 270198	2016-05-20 10:21:01 +00:00
Chris Dewhurst	08a87b67ef	[Sparc] Enable more inline assembly constraints. Note: This is specifically to allow GCC's test pr44707 to pass. Trivial change, not put for differential revision. Test included. llvm-svn: 270192	2016-05-20 09:03:01 +00:00
Craig Topper	1780b40874	[X86] Fix another AVX pattern to only be disable if VLX and BWI are supported. llvm-svn: 270182	2016-05-20 05:10:27 +00:00
Jacques Pienaar	b966c4f1a6	[lanai] Use Optional<Reloc> in LanaiTargetMachine. Follow r269988 and use Optional<Reloc>. llvm-svn: 270176	2016-05-20 03:21:37 +00:00
Craig Topper	195c9b10ae	[X86] Fix some AVX patterns to only be disabled if VLX and BWI are supported. Without this we get isel failures on the avx-intrinsics-x86.ll test in AVX512VL. llvm-svn: 270174	2016-05-20 02:00:08 +00:00
Dylan McKay	e6c6905f3e	Add AVRTargetStreamers Reviewed by Matt Arsenault in http://reviews.llvm.org/D16311 llvm-svn: 270171	2016-05-20 01:17:38 +00:00
Rafael Espindola	62b7ba5ca2	Record a TargetMachine instead of a Reloc::Model. Addresses r270095's code review. llvm-svn: 270147	2016-05-19 22:07:57 +00:00
Matt Arsenault	e86632bcd4	AMDGPU: Remove pointless conversions llvm-svn: 270139	2016-05-19 21:09:58 +00:00
Dan Gohman	0ad1ee502e	[WebAssembly] Simplify code that never has to handle physical registers. NFC. llvm-svn: 270137	2016-05-19 21:07:20 +00:00
David Blaikie	16fe2be1c3	Fix -Wunused-variable in non-Asserts build llvm-svn: 270118	2016-05-19 20:44:22 +00:00
David Blaikie	5ded374dc4	Simplify conditional unreachable into an assertion llvm-svn: 270111	2016-05-19 20:28:40 +00:00
Hans Wennborg	ff73dabfca	X86: Don't reset the stack after calls that don't return (PR27117) Since the calls don't return, the instruction afterwards will never run, and is just taking up unnecessary space in the binary. Differential Revision: http://reviews.llvm.org/D20406 llvm-svn: 270109	2016-05-19 20:15:33 +00:00
Rafael Espindola	4ed3f6e3d4	Remember the relocation model. NFC. This avoids passing a TargetMachine in a few places. llvm-svn: 270095	2016-05-19 18:49:29 +00:00
Rafael Espindola	084f91d955	Style fixes. NFC. llvm-svn: 270093	2016-05-19 18:34:20 +00:00
Zhan Jun Liau	f045dd3779	[SystemZ] Test commit - remove idea from README Remove a comment about not supporting LRVH/STRVH from the README LRVH/STRVH are being generated as of r269688 llvm-svn: 270092	2016-05-19 18:30:17 +00:00
Matt Arsenault	31cc93c0d6	AMDGPU: Also look for s_cbranch_vccz llvm-svn: 270091	2016-05-19 18:20:25 +00:00
Ron Lieberman	79c4da8069	Fix a covnersion from string to bool issue used in an assert Problem Was exposed by -Wstring-conversion llvm-svn: 270087	2016-05-19 18:05:56 +00:00
Chad Rosier	d705b5562f	[AArch64 ] Generate a BFXIL from 'or (and X, Mask0Imm),(and Y, Mask1Imm)'. Mask0Imm and ~Mask1Imm must be equivalent and one of the MaskImms is a shifted mask (e.g., 0x000ffff0). Both 'and's must have a single use. This changes code like: and w8, w0, #0xffff000f and w9, w1, #0x0000fff0 orr w0, w9, w8 into lsr w8, w1, #4 bfi w0, w8, #4, #12 llvm-svn: 270063	2016-05-19 14:19:47 +00:00
Ranjeet Singh	7f495daec3	Test commit. llvm-svn: 270056	2016-05-19 12:44:39 +00:00
Artem Tamazov	eea90f5cc7	[AMDGPU][llvm-mc] Fixes to support buffer atomics. Fixes for MUBUF_Atomic instructions to make operand list valid: - For RTN insns, make a copy of $vdata_in operand as $vdata. - Do not add operand for GLC, it is hardcoded and comes as a token. Workaround to avoid adding multiple default optional operands. Tests added. Differential Revision: http://reviews.llvm.org/D20257 llvm-svn: 270049	2016-05-19 12:22:39 +00:00
Zoran Jovanovic	07314a2bff	ps][microMIPS] Add R_MICROMIPS_PC21_S1 relocation Differential Revision: http://reviews.llvm.org/D15526 llvm-svn: 270048	2016-05-19 12:20:40 +00:00
Daniel Sanders	7b472cd465	[mips][mips16] Fix ZERO is not a CPU16Regs register error from the machine verifier. Summary: Partially fixes PR27458 Reviewers: sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D20330 llvm-svn: 270037	2016-05-19 10:42:14 +00:00
Andrey Turetskiy	ccc62bdbb9	[X86] Enable RRL part of the LEA optimization pass for -O2. Enable "Remove Redundant LEAs" part of the LEA optimization pass for -O2. This gives 6.4% performance improve on Broadwell on nnet benchmark from Coremark-pro. There is no significant effect on other benchmarks (Geekbench, Spec2000, Spec2006). Differential Revision: http://reviews.llvm.org/D19659 llvm-svn: 270036	2016-05-19 10:18:29 +00:00
Zlatko Buljan	3d28e4b8d5	[mips][microMIPS] Implement BC1EQZC, BC1NEZC, BC2EQZC and BC2NEZC instructions Differential Revision: http://reviews.llvm.org/D18352 llvm-svn: 270030	2016-05-19 07:31:28 +00:00
Craig Topper	1aff453749	[X86] Generalize and combine some similar type constraints and node types. No changes to the isel table size so the separation wasn't buying us anything. llvm-svn: 270026	2016-05-19 06:13:58 +00:00
Craig Topper	c298b9dab5	[X86] Simplify some type constraints by removing parts that were already implied. llvm-svn: 270025	2016-05-19 06:13:48 +00:00
Dan Gohman	bb2c8e5fad	[WebAssembly] Update WebAssembly target for r269988. llvm-svn: 270017	2016-05-19 03:00:05 +00:00
Craig Topper	4a761c4c76	[X86] Remove some type constraint classes and use already existing stricter classes. llvm-svn: 270013	2016-05-19 02:05:58 +00:00
Craig Topper	f553944514	[AVX512] Strengthen type constraints for VFIXUPIMM patterns and combine the type constraints for vector and scalar. llvm-svn: 270012	2016-05-19 02:05:55 +00:00
Chad Rosier	889772f9e3	[AArch64] Push comment into function. NFC. llvm-svn: 270003	2016-05-18 23:51:17 +00:00
Matt Arsenault	6728e29c35	AMDGPU: Fix verifier error when spilling undef subreg llvm-svn: 270002	2016-05-18 23:35:53 +00:00
Matt Arsenault	efc0ca4c19	AMDGPU: Fix promote alloca for pointer loads If the load has a pointer type, we don't want to change its type. llvm-svn: 270000	2016-05-18 23:20:24 +00:00
Rafael Espindola	22e87bbb08	Delete Reloc::Default. Having an enum member named Default is quite confusing: Is it distinct from the others? This patch removes that member and instead uses Optional<Reloc> in places where we have a user input that still hasn't been maped to the default value, which is now clear has no be one of the remaining 3 options. llvm-svn: 269988	2016-05-18 22:04:49 +00:00
Jacques Pienaar	30194757f7	[lanai] Change the way flag setting instructions are checked. isReturn() was returning different values with and without -g which led to different code being generated. Change isFlagSettingInstruction to query an instruction's effect on SR instead. llvm-svn: 269986	2016-05-18 21:31:37 +00:00
Dan Gohman	2ff0f2e766	[WebAssembly] Disable the MachineScheduler. llvm-svn: 269976	2016-05-18 20:19:02 +00:00
Jan Vesely	0f6b39e33f	AMDGPU: Fix incorrect simm check Use signed division otherwise all back jumps fail the check Fixes regression introduced in r269951 Differential Revision: http://reviews.llvm.org/D20380 llvm-svn: 269972	2016-05-18 19:07:58 +00:00
Chad Rosier	4ef3f73429	[AArch64] Minor refactoring. NFC. llvm-svn: 269963	2016-05-18 17:43:11 +00:00
Sanjay Patel	f4b59acf0d	clean up; NFCI llvm-svn: 269962	2016-05-18 17:23:38 +00:00
Matt Arsenault	326d5c727c	AMDGPU: Error if branch distance exceeds limit llvm-svn: 269951	2016-05-18 16:10:24 +00:00
Matt Arsenault	41311e20a0	AMDGPU: Other sizes of popcnt are fast We can chain bcnt instructions together, so any width popcnt is pretty fast. llvm-svn: 269950	2016-05-18 16:10:19 +00:00
Hans Wennborg	5b89989aa5	Re-commit r269828 "X86: Avoid using _chkstk when lowering WIN_ALLOCA instructions" with an additional fix to make RegAllocFast ignore undef physreg uses. It would previously get confused about the "push %eax" instruction's use of eax. That method for adjusting the stack pointer is used in X86FrameLowering::emitSPUpdate as well, but since that runs after register-allocation, we didn't run into the RegAllocFast issue before. llvm-svn: 269949	2016-05-18 16:10:17 +00:00
Matt Arsenault	174af82fd1	AMDGPU: Fix assert when erroring on a call For some reason an assert is now hit when a valid chain is not returned, so return the entry chain. llvm-svn: 269948	2016-05-18 16:10:11 +00:00
Rafael Espindola	6a904043b3	Trivial cleanups. This just clang formats and cleans comments in an area I am about to post a patch for review. llvm-svn: 269946	2016-05-18 16:00:24 +00:00
Matt Arsenault	c1825f766d	AMDGPU: Handle alloca promoting with null operands If the second pointer in a multi-pointer instruction is a constant, we can replace the type. llvm-svn: 269945	2016-05-18 15:57:21 +00:00
Matt Arsenault	c3d4584fcb	AMDGPU: Don't run passes that aren't useful llvm-svn: 269943	2016-05-18 15:41:07 +00:00
Matt Arsenault	eb22b9d92c	AMDGPU: Fix assert on ttmp registers Use register class that does not include them when looking for unallocated registers. This is hit by the udiv v8i64 test in the opencl integer conformance test, and takes a few seconds to compile in a debug build so no test included. llvm-svn: 269938	2016-05-18 15:19:50 +00:00
Krzysztof Parzyszek	0275bfc11c	[Hexagon] Recognize "q" and "v" in inline-asm as register constraints llvm-svn: 269933	2016-05-18 14:34:51 +00:00
Dan Gohman	40e6be6120	[WebAssembly] Don't expand divisions by constants. Don't expand divisions by constants if it would require multiple instructions. The current assumption is that engines will perform the desired optimizations. llvm-svn: 269930	2016-05-18 14:29:42 +00:00
Bryan Chan	930062186e	[SystemZ] Fix register ordering for BinaryRRF instructions Summary: The ordering of registers in BinaryRRF instructions are wrong, and affects the copysign instruction (CPSDR). This results in the wrong magnitude and sign being set. Author: zhanjunl Reviewers: kbarton, uweigand Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20308 llvm-svn: 269922	2016-05-18 13:24:57 +00:00
Ashutosh Nema	0cfbe42fbc	Add new flag and intrinsic support for MWAITX and MONITORX instructions Summary: MONITORX/MWAITX instructions provide similar capability to the MONITOR/MWAIT pair while adding a timer function, such that another termination of the MWAITX instruction occurs when the timer expires. The presence of the MONITORX and MWAITX instructions is indicated by CPUID 8000_0001, ECX, bit 29. The MONITORX and MWAITX instructions are intercepted by the same bits that intercept MONITOR and MWAIT. MONITORX instruction establishes a range to be monitored. MWAITX instruction causes the processor to stop instruction execution and enter an implementation-dependent optimized state until occurrence of a class of events. Opcode of MONITORX instruction is "0F 01 FA". Opcode of MWAITX instruction is "0F 01 FB". These opcode information is used in adding tests for the disassembler. These instructions are enabled for AMD's bdver4 architecture. Patch by Ganesh Gopalasubramanian! Reviewers: echristo, craig.topper, RKSimon Subscribers: RKSimon, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19795 llvm-svn: 269911	2016-05-18 11:59:12 +00:00
Rafael Espindola	6da3617e7f	Don't pass a Reloc::Model to MC. MC only needs to know if the output is PIC or not. It never has to decide about creating GOTs and PLTs for example. The only thing that MC itself uses this information for is expanding "macros" in sparc and mips. The rest I am pretty sure could be moved to CodeGen. This is a cleanup and isolates the code from future changes to Reloc::Model. llvm-svn: 269909	2016-05-18 11:58:50 +00:00
Dylan McKay	a60d032b35	[AVR] Remove the 'AVRConfig.h' header It defined the LLVM_AVR_GCC_COMPAT constant, which would enable/disable certain GCC-specific behaviours. There is no point conditionally turning it on/off, as it will always be turned on, and we have to maintain both code paths anyway. llvm-svn: 269904	2016-05-18 11:20:48 +00:00
Dylan McKay	ec3d224f78	[AVR] Add missing CMake dependencies llvm-svn: 269901	2016-05-18 11:11:51 +00:00
Dylan McKay	df26c79f7c	[AVR] Fix a few compile errors llvm-svn: 269900	2016-05-18 11:11:38 +00:00
Simon Dardis	3d2697b0c7	[PATCH] [mips] Restrict the creation of compact branches Restrict the creation of compact branches so that they do meet the ISA requirements. Notably do not permit $zero to be used as a operand for compact branches and ensure that some other branches fulfil the requirement that rs != rt. Fixup cases where $rs > $rt for bnec and beqc. Recommit of rL269893 with reviewers comments. Reviewers: dsanders, vkalintiris Differential Review: http://reviews.llvm.org/D20284 llvm-svn: 269899	2016-05-18 10:38:01 +00:00
Simon Dardis	b7ca91e9a3	Revert "[mips] Restrict the creation of compact branches" This reverts commit rL269893. Incorrect patch applied. llvm-svn: 269897	2016-05-18 09:51:37 +00:00
Dylan McKay	5376067456	[AVR] Convert C style comments to C++ llvm-svn: 269895	2016-05-18 09:43:01 +00:00
Simon Dardis	db0ade7404	[mips] Restrict the creation of compact branches Restrict the creation of compact branches so that they meet the ISA encoding requirements. Notably do not permit $zero to be used as a operand for compact branches and ensure that some other branches fulfil the requirement that rs != rt. Fixup cases where $rs > $rt for bnec and beqc. Reviewers: dsanders, vkalintiris Differential Review: http://reviews.llvm.org/D20284 llvm-svn: 269893	2016-05-18 09:21:44 +00:00
Chris Dewhurst	345da957e2	[Sparc] Add Soft Float support This change adds support for software floating point operations for Sparc targets. This is the first in a set of patches to enable software floating point on Sparc. The next patch will enable the option to be used with Clang. Differential Revision: http://reviews.llvm.org/D19265 llvm-svn: 269892	2016-05-18 09:14:13 +00:00
Craig Topper	a0fa384ff9	[AVX512] Strengthen type constraints on my rounding mode inputs and some immediate inputs. llvm-svn: 269886	2016-05-18 06:56:01 +00:00
Craig Topper	e9d3c962f1	[AVX512] Strengthen type checks on the X86ISD::SELECT node. Saves over 800 bytes in the DAG isel table by removing type checks for the condition operand which is always a vector or scalar of i1 matching the the number of elements in the other operands. llvm-svn: 269885	2016-05-18 06:55:59 +00:00
Zlatko Buljan	af377b7a57	[mips][microMIPS] Implement LH, LHE, LHU and LHUE instructions and add CodeGen support Differential Revision: http://reviews.llvm.org/D15418 llvm-svn: 269883	2016-05-18 06:54:59 +00:00
Dan Gohman	889b2555f5	[WebAssembly] Rename $discard to $drop in the assembly output. llvm-svn: 269862	2016-05-17 23:19:03 +00:00
Dan Gohman	df1ae0caf4	[WebAssembly] Model the stack evaluation order more precisely. We currently don't represent get_local and set_local explicitly; they are just implied by virtual register use and def. This avoids a lot of clutter, but it does complicate stackifying: get_locals read their operands at their position in the stack evaluation order, rather than at their parent instruction. This patch adds code to walk the stack to determine the precise ordering, when needed. llvm-svn: 269854	2016-05-17 22:24:18 +00:00
Dan Gohman	3b351cbbb5	[WebAssembly] Don't stackify calls past stack pointer modifications. llvm-svn: 269843	2016-05-17 21:14:26 +00:00
Hans Wennborg	90018c04c2	Revert r269828 "X86: Avoid using _chkstk when lowering WIN_ALLOCA instructions" Seems to have broken the Windows ASan bot. Reverting while investigating. llvm-svn: 269833	2016-05-17 20:38:56 +00:00
Dan Gohman	3ea810b777	[WebAssembly] Stackify induction variable increment instructions. This handles instructions where the defined register is also used, as in "x = x + 1". llvm-svn: 269830	2016-05-17 20:19:47 +00:00
Hans Wennborg	09bf3bedad	X86: Avoid using _chkstk when lowering WIN_ALLOCA instructions This patch moves the expansion of WIN_ALLOCA pseudo-instructions into a separate pass that walks the CFG and lowers the instructions based on a conservative estimate of the offset between the stack pointer and the lowest accessed stack address. The goal is to reduce binary size and run-time costs by removing calls to _chkstk. While it doesn't fix all the code quality problems with inalloca calls, it's an incremental improvement for PR27076. Differential Revision: http://reviews.llvm.org/D20263 llvm-svn: 269828	2016-05-17 20:13:29 +00:00
Rafael Espindola	3a373d5446	Simplify handling of hidden stub. Since r207518 they are printed exactly like non-hidden stubs on x86 and since r207517 on ARM. This means we can use a single set for all stubs in those platforms. llvm-svn: 269776	2016-05-17 16:01:32 +00:00
Renato Golin	5e4f70ea56	[ARM] ARM mov InstAlias for MOVW lacks HasV6T2 The movw instruction is only available in ARM state for V6T2 and above. The MOVi16 instruction has requirement HasV6T2 but the InstAlias for mov rd, imm where the operand is imm0_65535_expr:$imm does not. This means that movw can incorrectly be used in ARMv4 and ARMv5 by writing mov rd, 0x1234. The simple fix is to the requirement HasV6T2 to the InstAlias. Tests added to not-armv4.s. Patch by Peter Smith. llvm-svn: 269761	2016-05-17 13:05:28 +00:00
David L Kreitzer	874333eb48	Fix for PR27750. Correctly handle the case where the fallthrough block and target block are the same in getFallThroughMBB. Differential Revision: http://reviews.llvm.org/D20288 llvm-svn: 269760	2016-05-17 12:47:46 +00:00
Derek Schuff	a9b7d0355e	[WebAssembly] Remove our copy of PrologEpilogInserter It's no longer needed after r269750 llvm-svn: 269756	2016-05-17 11:18:35 +00:00
Zoran Jovanovic	c3850b81b8	[mips][microMIPS] Implement BEQZC and BNEZC instructions Differential Revision: http://reviews.llvm.org/D15417 llvm-svn: 269755	2016-05-17 11:10:15 +00:00
Simon Dardis	a99b8023bb	[mips] Compact branch policy control for MIPSR6 This patch adds the commandline option -mips-compact-branches={never,optimal,always), which controls how LLVM generates compact branches for MIPS targets. By default, the compact branch policy is 'optimal' where LLVM will (hopefully) pick the optimal branch for any situation. The 'never' policy will disable the generation of compact branches and 'always' will generate compact branches wherever possible. Reviewers: dsanders Differential Review: http://reviews.llvm.org/D20167 llvm-svn: 269753	2016-05-17 10:21:43 +00:00
Zlatko Buljan	0fef23e430	[mips][microMIPS][DSP] Implement BALIGN, BITREV, BPOSGE32, CMP, CMPGDU, CMPGU* and CMPU* instructions Differential Revision: http://reviews.llvm.org/D16182 llvm-svn: 269752	2016-05-17 09:32:58 +00:00
Derek Schuff	6435e10dbd	Factor PrologEpilogInserter around spilling, frame finalization, and scavenging PrologEpilogInserter has these 3 phases, which are related, but not all of them are needed by all targets. This patch reorganizes PEI's varous functions around those phases for more clear separation. It also introduces a new TargetMachine hook, usesPhysRegsForPEI, which is true for non-virtual targets. When it is true, all the phases operate as before, and PEI requires the AllVRegsAllocated property on MachineFunctions. Otherwise, CSR spilling and scavenging are skipped and only prolog/epilog insertion/frame finalization is done. Differential Revision: http://reviews.llvm.org/D18366 llvm-svn: 269750	2016-05-17 08:49:59 +00:00
Dan Gohman	78374076c0	[WebAssembly] Improve the precision of memory and side effect dependence tracking. MachineInstr::isSafeToMove is more conservative than is needed here; use a more explicit check, and incorporate knowledge of some WebAssembly-specific opcodes. llvm-svn: 269736	2016-05-17 04:05:31 +00:00
Jan Vesely	628b422d21	AMDGPU/R600: Use correct number of vector elements when lowering private loads Reviewer: tstellardAMD, arsenm Subscribers: arsenm, kzhuravl, llvm-commits Differential Revision: http://reviews.llvm.org/D20032 llvm-svn: 269725	2016-05-16 23:56:32 +00:00
Matt Arsenault	774adca4ab	AMDGPU: Fix promote alloca pass creating huge arrays This was assuming it could use all memory before, which is a bad decision because it restricts occupancy. By default, only try to use enough space that could reduce occupancy to 7, an arbitrarily chosen limit. Based on the exist LDS usage, try to round up to the limit in the current tier instead of further hurting occupancy. This isn't ideal, because it doesn't accurately know how much space is going to be used for alignment padding. llvm-svn: 269708	2016-05-16 21:19:59 +00:00
Geoff Berry	7dd4166698	[AArch64] Fix bug in large stack spill slot handling (PR27717) Summary: Fix bug in MachO path where a frame index offset would not be reserved for handling large frames when an extra non-used callee-save register was saved. In the case where the extra register is reserved or not a GPR (e.g. %FP in the MachO case), this would lead to the register scavenger later failing when called from PrologEpilogInserter. Reviewers: t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D20185 llvm-svn: 269697	2016-05-16 20:52:28 +00:00
Bryan Chan	49b7f76310	[SystemZ] Support LRVH and STRVH opcodes Summary: On Linux, /usr/include/bits/byteswap-16.h defines __byteswap_16(x) as an inlined LRVH (Load Reversed Half-word) instruction. The SystemZ back-end did not support this opcode and the inlined assembly would cause a fatal error. Reviewers: bryanpkc, uweigand Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18732 llvm-svn: 269688	2016-05-16 20:32:22 +00:00
Dan Gohman	f9dd86b5c9	[WebAssembly] Mark COPY_LOCAL and TEE_LOCAL instructions has having no side effects. llvm-svn: 269683	2016-05-16 19:16:32 +00:00
Dan Gohman	f4275cef7d	[WebAssembly] Use eqz to negate a branch conditions. llvm-svn: 269681	2016-05-16 18:59:34 +00:00
Dan Gohman	b5c920f753	[WebAssembly] Add a few optimization ideas to README.txt. llvm-svn: 269677	2016-05-16 18:51:03 +00:00
Michael Kuperstein	77252d7f51	[X86] Remove transformVSELECTtoBlendVECTOR_SHUFFLE The new X86 shuffle lowering can do just fine without transforming vselects into vector_shuffles. It looks like the only thing this code does right now is cause trouble - in particular, it can lead to combine/legalization infinite loops. Note that it's not completely NFC, since some of the shuffle masks get inverted, which may cause slight differences further down the line. We may want to find a way to invert those masks, but that's orthogonal to this commit. This fixes the hang in PR27689. llvm-svn: 269676	2016-05-16 18:27:00 +00:00
Krzysztof Parzyszek	0efdf6d032	[Hexagon] Make getCallerSavedRegs specific to a register class llvm-svn: 269674	2016-05-16 18:02:28 +00:00
Krzysztof Parzyszek	6566530702	[Hexagon] Simplify HexagonInstrInfo::isPredicable Remove all the checks for constant extenders from isPredicable. The users of it should be the ones checking cost/profitability. llvm-svn: 269664	2016-05-16 16:56:10 +00:00
Chad Rosier	5730f6ff68	Use proper capitalization and punctuation per coding standards. NFC. llvm-svn: 269652	2016-05-16 12:55:01 +00:00
Simon Pilgrim	0ed64737ef	Fixed unused variable warning llvm-svn: 269650	2016-05-16 11:48:54 +00:00
Simon Pilgrim	30771e251f	[X86][SSSE3] Lower vector CTLZ with PSHUFB lookups This patch uses PSHUFB to lower vector CTLZ and avoid (slower) scalarizations. The leading zero count of each 4-bit nibble of the vector is determined by using a PSHUFB lookup. Pairs of results are then repeatedly combined up to the original element width. Differential Revision: http://reviews.llvm.org/D20016 llvm-svn: 269646	2016-05-16 11:19:11 +00:00
Chris Dewhurst	6ea8ac82b1	[Sparc][LEON] Add LEON-specific CASA instruction. Differental Revision: http://reviews.llvm.org/D20098 llvm-svn: 269644	2016-05-16 11:02:00 +00:00
Daniel Sanders	7ac931ce16	[mips][ias] Fix R_MICROMIPS_GOT16 evaluation and eliminate symbol for R_MICROMIPS_(GOT\|HI\|LO)16 Summary: The failure r269410 worked around turned out to be caused by an incorrect evaluation of R_MICROMIPS_GOT16 which then caused the GOT entries to be incorrect. This patch fixes the evaluation and reverts r269410. Reviewers: sdardis, vkalintiris, rafael Subscribers: rafael, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D20242 llvm-svn: 269641	2016-05-16 09:33:59 +00:00
Daniel Sanders	0ed85c1ebe	[mips][ias] EF_MIPS_MICROMIPS should iff microMIPS code was emitted. Summary: This fixes PR27682. Additionally, '.set micromips' by itself is not sufficient to raise the EF_MIPS_MICROMIPS flag. It is also necessary to emit a microMIPS instruction. This has also been fixed. Reviewers: sdardis, vkalintiris, rafael Subscribers: rafael, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D20214 llvm-svn: 269639	2016-05-16 09:10:13 +00:00
Zoran Jovanovic	e9d6f29fb1	[mips] Addition of a third operand to the instructions [d]div, [d]divu Author: obucina Reviewers: dsanders Adds support for third operand for [D]DIV[U] instructions. Additional test for case when destination reg is zero register Differential Revision: http://reviews.llvm.org/D16888 llvm-svn: 269636	2016-05-16 08:57:59 +00:00
Simon Pilgrim	10b744e393	[X86][SSE] Simplify zero'th index extract element matching llvm-svn: 269615	2016-05-15 20:22:50 +00:00
Simon Pilgrim	a4565275be	[X86][SSE] Removed duplicate variables. NFCI. Removed duplicate getOperand / getSimpleValueType calls. llvm-svn: 269614	2016-05-15 20:11:10 +00:00
Benjamin Kramer	309b60e723	Move helper classes into anonymous namespaces. NFC. llvm-svn: 269591	2016-05-15 15:18:11 +00:00
Craig Topper	d86227613c	[AVX512] Make the permd intrinsics take a 32-bit immediate to match the software spec. llvm-svn: 269579	2016-05-14 21:13:20 +00:00
Saleem Abdulrasool	8090c443ba	ARM: support export directives for Windows It seems that cl will emit the export directives for Windows ARM targets. The fact that it did this had originally been missed and this functionality was never implemented. This makes it possible to rely solely on the source code for indicating what the exported interfaces are and brings us more compatibility with cl. llvm-svn: 269574	2016-05-14 18:58:34 +00:00
Chad Rosier	13fe560e86	[AArch64] Update local variable names to conform to coding standard. NFC. llvm-svn: 269573	2016-05-14 18:56:28 +00:00
Elena Demikhovsky	9eb843ca76	Fixed lowering of _comi_ intrinsics from all sets - SSE/SSE2/AVX/AVX-512 Differential revision http://reviews.llvm.org/D19261 llvm-svn: 269569	2016-05-14 15:06:09 +00:00
Daniel Sanders	04fad6dc3c	[mips] Enable IAS by default for 32-bit MIPS targets (O32). Summary: The MIPS IAS can now pass 'ninja check-all', recurse, build a bootable linux kernel, and pass a variety of LNT testing. Unfortunately we can't enable it by default for 64-bit targets yet since the N32 ABI is still very buggy and this also means we can't enable it for N64 either because we can't distinguish between N32 and N64 in the relevant code. Reviewers: vkalintiris Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18759 Differential Revision: http://reviews.llvm.org/D18761 llvm-svn: 269560	2016-05-14 12:43:08 +00:00
Dan Gohman	295b5775ee	[WebAssembly] Fix legalization of i128 shifts. compiler-rt/libgcc shift routines expect the shift count to be an i32, so use i32 as the shift count for shifts that are legalized to libcalls. This also reverts r268991, now that the signatures are correct. llvm-svn: 269531	2016-05-14 02:15:47 +00:00
Craig Topper	fe0638b35b	[AVX512] Fix types for pshufd intrinsics. The immediate is the second argument and the mask is the 4th argument. Also move the 128/256 tests to the right test file. Prior to this the immediate was a strange 16-bits and the 512-bit intrinsic couldn't receive the full 16 mask bits it needs. llvm-svn: 269526	2016-05-14 00:47:18 +00:00
Derek Schuff	014b2d8b98	[WebAssembly] Update expected torture test failures NFC; the waterfall just changed the way they are built. llvm-svn: 269523	2016-05-14 00:22:17 +00:00
Justin Bogner	47974e81c0	SDAG: Implement Select instead of SelectImpl in MipsDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269519	2016-05-13 23:55:59 +00:00
Justin Bogner	763c27607a	SDAG: Clean up a dead node I missed earlier in X86 H.J. Lu pointed out that I missed this in r269236. Thanks! llvm-svn: 269516	2016-05-13 23:26:28 +00:00
Chad Rosier	586f3e3b74	[AArch64] Simplify logic to reduce vertical space. NFC. llvm-svn: 269512	2016-05-13 22:53:13 +00:00
Justin Bogner	8710635f7d	SDAG: Implement Select instead of SelectImpl in XCoreDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269509	2016-05-13 22:49:18 +00:00
Justin Bogner	ac92bf33a9	SDAG: Implement Select instead of SelectImpl in WebAssemblyDAGToDAGISel This backend doesn't do anything custom here yet, so we just modernize the boilerplate. Part of llvm.org/pr26808. llvm-svn: 269506	2016-05-13 22:44:57 +00:00
Justin Bogner	5108441a75	SDAG: Implement Select instead of SelectImpl in SystemZDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. Part of llvm.org/pr26808. llvm-svn: 269505	2016-05-13 22:42:08 +00:00
Justin Bogner	f84347da06	SDAG: Implement Select instead of SelectImpl in SparcDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269490	2016-05-13 21:46:22 +00:00
Justin Bogner	fe65f9093f	SDAG: Implement Select instead of SelectImpl in NVPTXDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. Part of llvm.org/pr26808. llvm-svn: 269483	2016-05-13 21:12:53 +00:00
Jan Vesely	380b97a542	AMDGPU: Unify LowerGlobalAddress Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19794 llvm-svn: 269481	2016-05-13 20:39:34 +00:00
Jan Vesely	2161d1ce12	AMDGPU/R600: Fold global address operand Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19793 llvm-svn: 269480	2016-05-13 20:39:31 +00:00
Jan Vesely	8960f3da2c	AMDGPU/R600: Implement memory loads from constant AS Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19792 llvm-svn: 269479	2016-05-13 20:39:29 +00:00
Jan Vesely	430253f70f	AMDGPU/R600: Add support for emitting MCExpr Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19791 llvm-svn: 269478	2016-05-13 20:39:26 +00:00
Jan Vesely	c66ba87211	AMDGPU: Add support for MCExpr to instruction printer Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19790 llvm-svn: 269477	2016-05-13 20:39:24 +00:00
Jan Vesely	df38884a55	AMDGPU/R600: Use machine operands instead of ints to track literals This will be used for global addresses Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19789 llvm-svn: 269476	2016-05-13 20:39:22 +00:00
Jan Vesely	525e03cb9e	AMDGPU/R600: There are other uses for ALU_LITERAL besides Imm This will be used for GV Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19788 llvm-svn: 269475	2016-05-13 20:39:20 +00:00
Jan Vesely	76784f3903	AMDGPU: Make CONST_DATA_PTR available to R600 Rename to AMDGPUconstdata_ptr Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19786 llvm-svn: 269474	2016-05-13 20:39:18 +00:00
Jan Vesely	e05114ade0	AMDGPU/EG,CM: Add instruction to read from constant AS (VTX2) Reviewers: tstellard Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D19785 llvm-svn: 269473	2016-05-13 20:39:16 +00:00
Tim Northover	72e0542f2c	ARM: use callee-saved list in the order they're actually saved. When setting the frame pointer, the offset from SP is calculated based on the stack slot it gets allocated, but this slot is in turn based on the order of the CSR list so that list should match the order we actually save the registers in. Mostly it did, but in the edge-case of MachO AAPCS targets it was wrong. llvm-svn: 269459	2016-05-13 19:16:14 +00:00
Krzysztof Parzyszek	bcb9eb3047	[Hexagon] Remove dead nodes from SelectionDAG to avoid cycles Recent changes to the instruction selection code exposed a problem where a dead node was not removed on time. This node had both input and output chains, which lead to an apparent cycle. llvm-svn: 269458	2016-05-13 18:48:15 +00:00
Konstantin Zhuravlyov	25e9604ac4	[AMDGPU] Update nop insertion for debugger usage - Insert one nop for each high level statement instead of two - Do not insert nop before prologue Differential Revision: http://reviews.llvm.org/D20215 llvm-svn: 269452	2016-05-13 18:21:28 +00:00
Paul Osmialowski	0fa09433f0	add support for -print-imm-hex for AArch64 Most immediates are printed in Aarch64InstPrinter using 'formatImm' macro, but not all of them. Implementation contains following rules: - floating point immediates are always printed as decimal - signed integer immediates are printed depends on flag settings (for negative values 'formatImm' macro prints the value as i.e -0x01 which may be convenient when imm is an address or offset) - logical immediates are always printed as hex - the 64-bit immediate for advSIMD, encoded in "a🅱️c:d:e:f:g:h" is always printed as hex - the 64-bit immedaite in exception generation instructions like: brk, dcps1, dcps2, dcps3, hlt, hvc, smc, svc is always printed as hex - the rest of immediates is printed depends on availability of -print-imm-hex Signed-off-by: Maciej Gabka <maciej.gabka@arm.com> Signed-off-by: Paul Osmialowski <pawel.osmialowski@arm.com> Differential Revision: http://reviews.llvm.org/D16929 llvm-svn: 269446	2016-05-13 18:00:09 +00:00
Krzysztof Parzyszek	bbc158d4f4	[scan-build] fix dead store warnings emitted on LLVM Hexagon code base Patch by Apelete Seketeli. Differential Revision: http://reviews.llvm.org/D19900 llvm-svn: 269415	2016-05-13 13:13:59 +00:00
Krzysztof Parzyszek	16219f4217	[MIB] Create a helper function getRegState to extract all register flags llvm-svn: 269414	2016-05-13 13:01:19 +00:00
Amjad Aboud	6ff87595f1	Assure calling "cld" instruction in prologue of X86 interrupt handler function. Differential Revision: http://reviews.llvm.org/D18725 llvm-svn: 269413	2016-05-13 12:46:57 +00:00
Daniel Sanders	80eaa377a6	[mips][ias] Work around yet another incorrect microMIPS relocation evaluation exposed by r268900. It's not entirely clear why R_MICROMIPS_(GOT\|HI16\|LO16) are evaluated incorrectly in a small number of the LNT tests at this point. However, it's not related to the STO_MIPS_MICROMIPS issue. At this point all the microMIPS-related changes of r268900 have been reverted. llvm-svn: 269410	2016-05-13 12:07:14 +00:00
Hrvoje Varga	9dc958973e	[mips][microMIPS] Implement APPEND, BPOSGE32C, MODSUB, MULSA.W.PH and MULSAQ_S.W.PH instructions Differential Revision: http://reviews.llvm.org/D14117 llvm-svn: 269408	2016-05-13 11:32:53 +00:00
Justin Bogner	7e8c12d210	SDAG: Clean up a dangling node in SparcISelDAGToDAG::SelectImpl When we convert to the void Select interface, leaving unreferenced nodes around won't be allowed anymore. Part of llvm.org/pr26808. llvm-svn: 269396	2016-05-13 06:37:53 +00:00
Justin Bogner	7e9112c0f0	SDAG: Clean up a dangling node in MipsISelDAGToDAG::SelectImpl When we convert to the void Select interface, leaving unreferenced nodes around won't be allowed anymore. Part of llvm.org/pr26808. llvm-svn: 269394	2016-05-13 06:30:15 +00:00
Justin Bogner	7247ef1510	SDAG: Implement Select instead of SelectImpl in MSP430DAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269393	2016-05-13 06:10:50 +00:00
Matt Arsenault	4449ad7408	AMDGPU: Remove verifier check for scc live ins We only really need this to be true for SIFixSGPRCopies. I'm not sure there's any way this could happen before that point. Fixes a case where MachineCSE could introduce a cross block scc use. llvm-svn: 269391	2016-05-13 04:15:48 +00:00
Justin Bogner	f4e44712e3	SDAG: Implement Select instead of SelectImpl in AArch64DAGToDAGISel This one has a lot of code churn, but it's all mechanical and straightforward. - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269379	2016-05-12 23:10:30 +00:00
Justin Bogner	80bd946ad9	SDAG: Implement Select instead of SelectImpl in LanaiDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269364	2016-05-12 21:56:18 +00:00
Justin Bogner	9eb8000baa	SDAG: Implement Select instead of SelectImpl in HexagonDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we had already replaced all uses and we returned a node, just remove the dead node instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. Part of llvm.org/pr26808. llvm-svn: 269358	2016-05-12 21:46:18 +00:00
Justin Bogner	44f3c13019	SDAG: Clean up a dangling node in HexagonISelDAGToDAG::SelectImpl When we convert to the void Select interface, leaving unreferenced nodes around won't be allowed anymore. Part of llvm.org/pr26808. llvm-svn: 269355	2016-05-12 21:24:23 +00:00
Renato Golin	fa6e1c461b	[ARM] Support and tests for transform of LDR rt, = to MOV This change implements the transformation in processInstruction() for the LDR rt, =expression to MOV rt, expression when the expression can be evaluated and can fit into the immediate field of the MOV or a MVN. Across the ARM and Thumb instruction sets there are several cases to consider, each with a different range of representatble constants. In ARM we have: * Modified immediate (All ARM architectures) * MOVW (v6t2 and above) In Thumb we have: * Modified immediate (v6t2, v7m and v8m.mainline) * MOVW (v6t2, v7m, v8.mainline and v8m.baseline) * Narrow Thumb MOV that can be used in an IT block (non flag-setting) If the immediate fits any of the available alternatives then we make the transformation. Fixes 25722. Patch by Peter Smith. llvm-svn: 269354	2016-05-12 21:22:42 +00:00
Renato Golin	c3136a73d4	[ARM] Delay ARM constant pool creation. NFC. This change adds a new constant pool kind to ARMOperand. When parsing the operand for =immediate we create an instance of this operand rather than creating a constant pool entry and rewriting the operand. As the new operand kind is only created for ldr rt,= we can make ldr rt,= an explicit pseudo instruction in ARM, Thumb and Thumb2 The pseudo instruction is expanded in processInstruction(). This creates the constant pool and transforms the pseudo instruction into a pc-relative ldr to the constant pool. There are no functional changes and no modifications needed to existing tests. Required by the patch that fixes PR25722. Patch by Peter Smith. llvm-svn: 269352	2016-05-12 21:22:31 +00:00
Justin Bogner	65f29a04e5	SDAG: Implement Select instead of SelectImpl in BPFDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269350	2016-05-12 21:14:47 +00:00
Justin Bogner	e0b750ea0f	SDAG: Implement Select instead of SelectImpl in AMDGPUDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269349	2016-05-12 21:03:32 +00:00
Justin Bogner	73741e2745	SDAG: Clean up dangling nodes in AArch64ISelDAGToDAG::SelectImpl When we convert to the void Select interface, leaving unreferenced nodes around won't be allowed anymore. Part of llvm.org/pr26808. llvm-svn: 269345	2016-05-12 20:54:27 +00:00
Amjad Aboud	b8f1084253	Fixed the callee saved registers list for X86 AllRegs calling convention. 32-bit AllRegs: SSE: xmm0-xmm7 AVX: ymm0-ymm7 AVX512: zmm0-zmm7 + k0-k7 64-bit AllRegs: SSE: xmm0-xmm15 AVX: ymm0-ymm15 AVX512: zmm0-zmm31 + k0-k7 Differential Revision: http://reviews.llvm.org/D20142 llvm-svn: 269337	2016-05-12 19:58:32 +00:00
Chad Rosier	3f19f1ad66	[AArch64] Give function a more appropriate name. llvm-svn: 269335	2016-05-12 19:51:58 +00:00
Amjad Aboud	8cfe3168db	Fixed dwarf X86-32 register mapping for k0-k7 registers. llvm-svn: 269333	2016-05-12 19:49:24 +00:00
Chad Rosier	6705e7f48f	[AArch64] Minor refactoring to simplify future patch. NFC. llvm-svn: 269329	2016-05-12 19:38:18 +00:00
Krzysztof Parzyszek	f3aefcd439	[Hexagon] Expand VSelect pseudo instructions llvm-svn: 269328	2016-05-12 19:16:02 +00:00
Krzysztof Parzyszek	f351ddf6e0	[Hexagon] Properly handle instruction selection of vsplat intrinsics llvm-svn: 269312	2016-05-12 17:21:40 +00:00
Daniel Sanders	723ca9cb5e	[mips][ias] Fix O32 .cprestore directive when inside .set noat region and offset is in range. Summary: This expands on r269179 to fix an additional case that was not covered by our tests. The assembler temporary is not needed when the .cprestore offset fits inside a simm16 and it is not an error to use it inside a '.set noat' in this case. Reviewers: emaste, seanbruno, sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D20199 llvm-svn: 269295	2016-05-12 14:01:50 +00:00
Daniel Sanders	ace879545c	[mips][ias] Work around incorrect another microMIPS relocation evaluation exposed by r268900 As explained in r269196, microMIPS has a special case that is not correctly implemented in LLVM. If we have a symbol 'foo' which is equivalent to '.text+0x10'. The value of an R_MICROMIPS_LO16 relocation using 'foo' is 'foo+0x11' and not 'foo+0x10'. The in-place addend should therefore be 0x11. This commit reverts a little more of the effect of r268900 by keeping the symbol when the STO_MIPS_MICROMIPS flag is set for R_MIPS_GPREL32 relocations. This fixes SingleSource/UnitTests/2003-08-11-VaListArg, and SingleSource/UnitTests/2003-05-07-VarArgs for microMIPS. I believe there are additional relocations that have the same issue (e.g. R_MIPS_64, and R_MIPS_GPREL16) but for now I'm focusing on restoring our internal buildbots back to the green state we had in r268899. llvm-svn: 269294	2016-05-12 13:39:13 +00:00
Chad Rosier	c526d67f62	[AArch64] Remove command-line option use for testing. The EXTR combine has been in tree for over 2 years without complain, so go ahead and remove the option. llvm-svn: 269292	2016-05-12 13:27:24 +00:00
Hrvoje Varga	c4cdcea6eb	Revert "[mips][microMIPS] Implement CFC, CTC and LDC* instructions" This reverts commit r269176 as it caused test-suite failure. llvm-svn: 269287	2016-05-12 12:46:06 +00:00
Renato Golin	135b316516	[scan-build] fix warnings emitted on LLVM ARM code base Fix "Logic error" warnings of the type "Called C++ object pointer is null" reported by Clang Static Analyzer. Patch by Apelete Seketeli. llvm-svn: 269285	2016-05-12 12:33:33 +00:00
Daniel Sanders	202cd56665	[mips][ias] Correct ELF eflags when Octeon is the target. Reviewers: sdardis Subscribers: petarj, mpf, dsanders, spetrovic, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D18899 llvm-svn: 269283	2016-05-12 11:31:19 +00:00
Daniel Sanders	f64c0bb52e	[mips][ias] Handle N64 compound relocations and R_MIPS_SUB in needsRelocateWithSymbol() Summary: This eliminates the default case for N64 that was left out of r269047. The change to R_MIPS_SUB is needed in this patch to make this testable since %lo(%neg(%gp_rel(foo))) and %hi(%neg(%gp_rel(foo))) remain the only ways to get a compound relocation from the assembler. Reviewers: sdardis, rafael Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D20097 llvm-svn: 269280	2016-05-12 10:55:00 +00:00
Dan Gohman	fffd2940a1	[WebAssembly] Fast-isel support for calls, arguments, and selects. llvm-svn: 269273	2016-05-12 04:19:09 +00:00
Hal Finkel	ff8397dabb	[PowerPC] Fix a DAG replacement bug in PPCTargetLowering::DAGCombineExtBoolTrunc While promoting nodes in PPCTargetLowering::DAGCombineExtBoolTrunc, it is possible for one of the nodes to be replaced by another. To make sure we do not visit the deleted nodes, and to make sure we visit the replacement nodes, use a list of HandleSDNodes to track the to-be-promoted nodes during the promotion process. The same fix has been applied to the analogous code in PPCTargetLowering::DAGCombineTruncBoolExt. Fixes PR26985. llvm-svn: 269272	2016-05-12 04:00:56 +00:00
Matt Arsenault	13446c4387	AMDGPU: Fix getIntegerAttribute type and error message llvm-svn: 269268	2016-05-12 02:45:18 +00:00
Matt Arsenault	ac3313688f	AMDGPU: Fix breaking IR on instructions with multiple pointer operands The promote alloca pass would attempt to promote an alloca with a select, icmp, or phi user, even though the other operand was from a non-promotable source, producing a select on two different pointer types. Only do this if we know that both operands derive from the same alloca. In the future we should be able to relax this to an alloca which will also be promoted. llvm-svn: 269265	2016-05-12 01:58:58 +00:00
Chad Rosier	95d924439b	[AArch64] Add support for unscaled narrow stores in getUsefulBitsForUse. llvm-svn: 269263	2016-05-12 01:42:01 +00:00
Chad Rosier	6c16f1042e	[AArch64] Remove floating-point narrow stores from getUsefulBitsForUse. While not impossible, it's unlikely we'd be performing bitwise operations on FP values. llvm-svn: 269260	2016-05-12 01:04:15 +00:00
Justin Bogner	8b8b978841	SDAG: Implement Select instead of SelectImpl in ARMDAGToDAGISel This is a large change, but it's pretty mechanical: - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269258	2016-05-12 00:31:09 +00:00
Justin Bogner	7757871659	SDAG: Clean up dangling nodes in ARMISelDAGToDAG::SelectImpl When we convert to the void Select interface, leaving unreferenced nodes around won't be allowed anymore. Part of llvm.org/pr26808. llvm-svn: 269256	2016-05-12 00:20:19 +00:00
Justin Bogner	b8461809ec	SDAG: Use ReplaceNode here, not ReplaceUses This was a typo in an earlier commit - there's no point in keeping the old node around here. Noticed by Meador Inge. Thanks! llvm-svn: 269245	2016-05-11 22:21:50 +00:00
Justin Bogner	68015d402f	SDAG: Add a helper to replace and remove a node during ISel It's very common to want to replace a node and then remove it since it's dead, especially as we port backends from the SDNode *Select API to the void Select one. This helper makes this sequence a bit less verbose. llvm-svn: 269236	2016-05-11 21:13:17 +00:00

... 3 4 5 6 7 ...

37826 Commits