llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-02 00:42:52 +01:00

Author	SHA1	Message	Date
Benjamin Kramer	b101f55fa4	Silence false positive uninitialized variable warnings from GCC. llvm-svn: 139573	2011-09-13 01:59:24 +00:00
Jakob Stoklund Olesen	198a5a56f2	Extract live range calculations from SplitKit. SplitKit will soon need two copies of these data structures, and the algorithms will also be useful when LiveIntervalAnalysis becomes independent of LiveVariables. llvm-svn: 139572	2011-09-13 01:34:21 +00:00
Eli Friedman	3f92f87989	Add comment to clarify the behavior of a helper in DSE. llvm-svn: 139571	2011-09-13 01:28:59 +00:00
Eli Friedman	ce1cbc7db5	Correct grammar. llvm-svn: 139565	2011-09-13 00:44:16 +00:00
Eli Friedman	34ffc961d7	Fix the assembler strings for a couple of atomic instructions. Doesn't really matter much in practice, but it's a bit cleaner. llvm-svn: 139563	2011-09-13 00:27:04 +00:00
Jim Grosbach	0b277b24ea	Tidy up a bit. llvm-svn: 139559	2011-09-12 23:36:42 +00:00
Bruno Cardoso Lopes	a4d2bdfa40	Fix PR10845. SUBREG_TO_REG shouldn't be used when the input and destination types are equal! llvm-svn: 139553	2011-09-12 22:59:23 +00:00
Bill Wendling	836c413ca3	Introduce a bit of a hack. Splitting a landing pad takes considerable care because of PHIs and other nasties. The problem is that the jump table needs to jump to the landing pad block. However, the landing pad block can be jumped to only by an invoke instruction. So we clone the landingpad instruction into its own basic block, have the invoke jump to there. The landingpad instruction's basic block's successor is now the target for the jump table. But because of PHI nodes, we need to create another basic block for the jump table to jump to. This is definitely a hack, because the values for the PHI nodes may not be defined on the edge from the jump table. But that's okay, because the jump table is simply a construct to mimic what is happening in the CFG. So the values are mysteriously there, even though there is no value for the PHI from the jump table's edge (hence calling this a hack). llvm-svn: 139545	2011-09-12 21:56:59 +00:00
Owen Anderson	a1a10ed5c6	Thumb2 POP's don't allow the PC as an operand, and PUSH's don't allow the SP either. llvm-svn: 139542	2011-09-12 21:28:46 +00:00
Bruno Cardoso Lopes	64e2e852f9	Revert the wrong part of r139528, and fix testcases. llvm-svn: 139541	2011-09-12 21:24:07 +00:00
Owen Anderson	0081444d87	Fix encoding of PC-relative LDRSHW with an immediate offset. llvm-svn: 139537	2011-09-12 20:36:51 +00:00
Eli Friedman	047d3de417	Change a bunch of isVolatile() checks to check for atomic load/store as well. No tests; these changes aren't really interesting in the sense that the logic is the same for volatile and atomic. I believe this completes all of the changes necessary for the optimizer to handle loads and stores correctly. I'm going to try and come up with some additional testing, though. llvm-svn: 139533	2011-09-12 20:23:13 +00:00
Owen Anderson	b1d401e514	There's no need to add additional predicate operands when converting a tB to a tBfar now. Fixes nightly test failures on armv6 Thumb. <rdar://problem/10110404> llvm-svn: 139531	2011-09-12 20:07:22 +00:00
Eric Christopher	665ace6bce	Fix typo. llvm-svn: 139530	2011-09-12 19:58:22 +00:00
Bruno Cardoso Lopes	c67e996fc3	Not sure how CMPPS and CMPPD had already ever worked, I guess it didn't. However with this fix it does now. Basically the operand order for the x86 target specific node is not the same as the instruction, but since the intrinsic need that specific order at the instruction definition, just change the order during legalization. Also, there were some wrong invertions of condition codes, such as GE => LE, GT => LT, fix that too. Fix PR10907. llvm-svn: 139528	2011-09-12 19:30:40 +00:00
Bruno Cardoso Lopes	e2fc394ed2	Organize a bit the operand names for CMPPS and CMPPD llvm-svn: 139527	2011-09-12 19:30:36 +00:00
Bruno Cardoso Lopes	fc1c90ac48	Realign BLEND patterns to match the general style for patterns in .td file. llvm-svn: 139526	2011-09-12 19:30:33 +00:00
Bruno Cardoso Lopes	f0e65e0f13	Fix 80-columns llvm-svn: 139525	2011-09-12 19:30:29 +00:00
Owen Anderson	05ef2c122d	Port more encoding tests to decoding tests, and correct an improper Thumb2 pre-indexed load decoding this uncovered. llvm-svn: 139522	2011-09-12 18:56:30 +00:00
Andrew Trick	09cf4287c2	Rename -disable-iv-rewrite to -enable-iv-rewrite=false in preparation for default change. llvm-svn: 139517	2011-09-12 18:28:44 +00:00
Devang Patel	c7a1210467	Add asserts to keep front-ends honest while encoding debug info into LLVM IR using DIBuilder. llvm-svn: 139515	2011-09-12 18:26:08 +00:00
Devang Patel	484cb2a602	Add DW_ATE_UTF, which clang started using in my previous commit! llvm-svn: 139503	2011-09-12 17:18:20 +00:00
Jakob Stoklund Olesen	a147642106	Remove the -compact-regions flag. It has been enabled by default for a while, it was only there to allow performance comparisons. llvm-svn: 139501	2011-09-12 16:54:42 +00:00
Jakob Stoklund Olesen	e43edf1b4a	Add an interface for SplitKit complement spill modes. SplitKit always computes a complement live range to cover the places where the original live range was live, but no explicit region has been allocated. Currently, the complement live range is created to be as small as possible - it never overlaps any of the regions. This minimizes register pressure, but if the complement is going to be spilled anyway, that is not very important. The spiller will eliminate redundant spills, and hoist others by making the spill slot live range overlap some of the regions created by splitting. Stack slots are cheap. This patch adds the interface to enable spill modes in SplitKit. In spill mode, SplitKit will assume that the complement is going to spill, so it will allow it to overlap regions in order to avoid back-copies. By doing some of the spiller's work early, the complement live range becomes simpler. In some cases, it can become much simpler because no extra PHI-defs are required. This will speed up both splitting and spilling. This is only the interface to enable spill modes, no implementation yet. llvm-svn: 139500	2011-09-12 16:49:21 +00:00
Jakob Stoklund Olesen	fed2345086	Update comments to reflect some (not so) recent changes. llvm-svn: 139498	2011-09-12 16:03:26 +00:00
Richard Osborne	962b1ca071	Associate a MemOperand with LDWCP nodes introduced during ISel. This information is required if we want LDWCP to be hoisted out of loops. llvm-svn: 139495	2011-09-12 14:43:23 +00:00
Richard Osborne	05cda7958d	Mark LDWCP as having no side effects. llvm-svn: 139494	2011-09-12 14:41:31 +00:00
Nadav Rotem	06ce2ac074	Format patterns, remove unused X86blend patterns llvm-svn: 139491	2011-09-12 08:41:50 +00:00
Craig Topper	5ffd0cb080	Fix disassembling of one of the register/register forms of MOVUPS/MOVUPD/MOVAPS/MOVAPD/MOVSS/MOVSD and their VEX equivalents. Fixes PR10877. llvm-svn: 139486	2011-09-11 23:19:54 +00:00
Craig Topper	a9b27eecc9	Fix disassembling of reverse register/register forms of ADD/SUB/XOR/OR/AND/SBB/ADC/CMP/MOV. llvm-svn: 139485	2011-09-11 21:41:45 +00:00
Nadav Rotem	abb5bb41d4	CR fixes per Bruno's request. Undo the changes from r139285 which added custom lowering to vselect. Add tablegen lowering for vselect. llvm-svn: 139479	2011-09-11 15:02:23 +00:00
Eli Friedman	c79e318f02	r139454 activates an assert in a case where we were doing the right thing anyway. Make that explicit, and un-XFAIL the testcase. llvm-svn: 139458	2011-09-10 02:01:42 +00:00
Richard Trieu	8b6890f67e	Fix the asserts in lib/Target/X86/X86ELFWriterInfo.cpp and lib/ExecutionEngine/MCJIT/MCJIT.cpp from: assert("error"); to: assert(0 && "error"); llvm-svn: 139456	2011-09-10 01:42:07 +00:00
Richard Trieu	0485e133f2	Fixed an assert from: assert("not implemented for target shuffle node"); to: assert(0 && "not implemented for target shuffle node"); This causes a test failure in CodeGen/X86/palignr.ll which has been marked as XFAIL for the time being. Test failure filed at PR10901. llvm-svn: 139454	2011-09-10 01:26:21 +00:00
Andrew Trick	cf4ef9bded	[disable-iv-rewrite] Allow WidenIV to handle NSW/NUW operations better. Don't immediately give up when an add operation can't be trivially sign/zero-extended within a loop. If it has NSW/NUW flags, generate a new expression with sign extended (non-recurrent) operand. As before, if SCEV says that all sign extends are loop invariant, then we can widen the operation. llvm-svn: 139453	2011-09-10 01:24:17 +00:00
Andrew Trick	8af62b87e4	Set NSW/NUW flags on SCEVAddExpr when the operation is flagged as such. I'm doing this now for completeness because I can't think of/remember any reason that it was left out. I'm not sure it will help anything, but if we don't do it we need to explain why in comments. llvm-svn: 139450	2011-09-10 01:09:50 +00:00
Richard Trieu	43ea533a5e	Fix asserts in CodeGen from: assert("error"); to: assert(0 && "error"); llvm-svn: 139449	2011-09-10 01:07:54 +00:00
Jim Grosbach	52492b1cf3	Thumb2 parsing and encoding for MOV(immediate). Some aliases for MOV(register) also to keep existing T1 tests happy when run in thumbv7 mode. llvm-svn: 139440	2011-09-10 00:15:36 +00:00
Owen Anderson	9cd21ce8c9	LDM writeback is not allowed if Rn is in the target register list. llvm-svn: 139432	2011-09-09 23:13:33 +00:00
Owen Anderson	ca4447e808	Fix an ambiguously nested if. llvm-svn: 139431	2011-09-09 23:13:02 +00:00
Owen Anderson	58bb862098	Fix buildbot breakage caused by r139415. I missed one instance of a manually create ARM::tB. llvm-svn: 139429	2011-09-09 23:05:14 +00:00
Owen Anderson	dbe77fc5a1	Fix assembly/disassembly of Thumb2 ADR instructions with immediate operands. llvm-svn: 139422	2011-09-09 22:24:36 +00:00
Akira Hatanaka	45bb471537	O64 will not be supported. llvm-svn: 139421	2011-09-09 22:22:48 +00:00
Akira Hatanaka	da477aa5eb	Make F31 and D15 non-reserved registers. llvm-svn: 139420	2011-09-09 22:11:26 +00:00
Chris Lattner	a1676de9bd	tidy up a bit llvm-svn: 139419	2011-09-09 22:06:59 +00:00
Owen Anderson	a7838cb723	Thumb unconditional branches are allowed in IT blocks, and therefore should have a predicate operand, unlike conditional branches. llvm-svn: 139415	2011-09-09 21:48:23 +00:00
Akira Hatanaka	be07ce941b	Mips32 does not reserve even-numbered floating point registers. llvm-svn: 139412	2011-09-09 21:31:46 +00:00
Eli Friedman	4bae1c4f70	Make the SelectionDAG verify that all the operands of BUILD_VECTOR have the same type. Teach DAGCombiner::visitINSERT_VECTOR_ELT not to make invalid BUILD_VECTORs. Fixes PR10897. llvm-svn: 139407	2011-09-09 21:04:06 +00:00
Akira Hatanaka	f65d050693	Drop support for Mips1 and Mips2. llvm-svn: 139405	2011-09-09 20:45:50 +00:00
Nadav Rotem	ccb46031e6	Implement vector-select support for avx256. Refactor the vblend implementation to have tablegen match the instruction by the node type llvm-svn: 139400	2011-09-09 20:29:17 +00:00
Jim Grosbach	6225a96bf5	Thumb2 assembly parsing and encoding for MLA and MLS. llvm-svn: 139399	2011-09-09 20:24:45 +00:00
Duncan Sands	3311da4d79	Don't tack "Instruction not interpretable yet!" onto the end of the instruction. llvm-svn: 139398	2011-09-09 20:22:48 +00:00
Jim Grosbach	915ba5189e	Thumb2 assembly parsing and encoding for LDRSB. llvm-svn: 139389	2011-09-09 19:42:40 +00:00
Akira Hatanaka	17df2dfe8c	Drop support for Allegrex. Allegrex implements a variant of Mips2. llvm-svn: 139383	2011-09-09 19:00:51 +00:00
Jim Grosbach	eb2d668899	Thumb2 assembly parsing and encoding for LDREX/LDREXB/LDREXD/LDREXH. llvm-svn: 139381	2011-09-09 18:37:27 +00:00
Jakob Stoklund Olesen	659d713274	Reapply r139247: Cache intermediate results during traceSiblingValue. In some cases such as interpreters using indirectbr, the CFG can be very complicated, and live range splitting may be forced to insert a large number of phi-defs. When that happens, traceSiblingValue can spend a lot of time zipping around in the CFG looking for defs and reloads. This patch causes more information to be cached in SibValues, and the cached values are used to terminate searches early. This speeds up spilling by 20x in one interpreter test case. For more typical code, this is just a 10% speedup of spilling. The previous version had bugs that caused miscompilations. They have been fixed. llvm-svn: 139378	2011-09-09 18:11:41 +00:00
Andrew Trick	77fa88a786	Comment formatting. llvm-svn: 139375	2011-09-09 17:35:10 +00:00
Craig Topper	18cbd5db26	Fix handling of Intel syntax disassembling of movs and stos to stop being blank. Also fixed scas, and cmps to always print size suffix in Intel syntax since its abiguous without arguments. Fixes PR10875. llvm-svn: 139353	2011-09-09 05:40:53 +00:00
Akira Hatanaka	e1eb015eb9	Change default target architecture from Mips1 to Mips32r1 in preparation for removing support for Mips1 and Mips2. This change and the ones that follow have been discussed with and approved by Bruno. llvm-svn: 139344	2011-09-09 01:13:27 +00:00
Benjamin Kramer	da0ca686c9	Remove dead code. llvm-svn: 139343	2011-09-09 00:22:05 +00:00
Nick Lewycky	ec5437bfc4	Fix release build: MachOObjectFile.cpp:524: error: unused variable 'NumLoadCommands' [-Wunused-variable] llvm-svn: 139341	2011-09-09 00:16:50 +00:00
Akira Hatanaka	94deb5f3f9	80 columns. llvm-svn: 139339	2011-09-09 00:13:35 +00:00
Devang Patel	ba2d56b1ef	Directly point debug info to the stack slot of the arugment, instead of trying to keep track of vreg in which it the arugment is copied. The LiveDebugVariable can keep track of variable's ranges. llvm-svn: 139330	2011-09-08 22:59:09 +00:00
Owen Anderson	99ad1a853e	All conditional branches are disallowed in IT blocks, not just CBZ/CBNZ. llvm-svn: 139329	2011-09-08 22:48:37 +00:00
Owen Anderson	d7127e0c27	Soft fail CBZ/CBNZ in the disassembler if they appear inside an IT block. llvm-svn: 139328	2011-09-08 22:42:49 +00:00
Eric Christopher	fc8e09962f	Formatting and typo. llvm-svn: 139325	2011-09-08 22:17:40 +00:00
Nadav Rotem	2f256b7f9f	Dix the 80-columns and remove unsupported v8i16 type from the list of legal vselect types. llvm-svn: 139324	2011-09-08 22:17:35 +00:00
Jim Grosbach	9f150bfedf	Thumb2 assembly parsing and encoding for LDRD(immediate). Refactor operand handling for STRD as well. Tests for that forthcoming. llvm-svn: 139322	2011-09-08 22:07:06 +00:00
Bruno Cardoso Lopes	54962ac233	Add a AVX version of a simple i64 -> f64 bitcast. This could be triggered using llc with -O0, which wouldn't let it be folded and expose the lack of this pattern. llvm-svn: 139320	2011-09-08 21:52:33 +00:00
Kevin Enderby	16f9df1f05	Fix a Darwin x86_64 special case of a jmp to a temporary symbol from an atom without a base symbol that must not have a relocation entry. llvm-svn: 139316	2011-09-08 20:53:44 +00:00
Benjamin Kramer	3c40c2100b	Add support for relocations to ObjectFile. Patch by Danil Malyshev! llvm-svn: 139314	2011-09-08 20:52:17 +00:00
Bruno Cardoso Lopes	2f07ca9728	* Combines Alignment, AuxInfo, and TB_NOT_REVERSABLE flag into a single field (Flags), which is a bitwise OR of items from the TB_* enum. This makes it easier to add new information in the future. * Gives every static array an equivalent layout: { RegOp, MemOp, Flags } * Adds a helper function, AddTableEntry, to avoid duplication of the insertion code. * Renames TB_NOT_REVERSABLE to TB_NO_REVERSE. * Adds TB_NO_FORWARD, which is analogous to TB_NO_REVERSE, except that it prevents addition of the Reg->Mem entry. (This is going to be used by Native Client, in the next CL). Patch by David Meyer llvm-svn: 139311	2011-09-08 18:35:57 +00:00
Bruno Cardoso Lopes	74a67e22b0	Add AVX versions of blend vector operations and fix some issues noticed in Nadav's r139285 and r139287 commits. 1) Rename vsel.ll to a more descriptive name 2) Change the order of BLEND operands to "Op1, Op2, Cond", this is necessary because PBLENDVB is already used in different places with this order, and it was being emitted in the wrong way for vselect 3) Add AVX patterns and tests for the same SSE41 instructions llvm-svn: 139305	2011-09-08 18:05:08 +00:00
Bruno Cardoso Lopes	84c53e3965	Fix PR10844: Add patterns to cover non foldable versions of X86vzmovl. Triggered using llc -O0. Also fix some SET0PS patterns to their AVX forms and test it on the testcase. llvm-svn: 139304	2011-09-08 18:05:02 +00:00
Nadav Rotem	b461f2190e	Add X86-SSE4 codegen support for vector-select. llvm-svn: 139285	2011-09-08 08:11:19 +00:00
Eli Friedman	c933295353	A couple minor corrections to r139276. llvm-svn: 139277	2011-09-08 02:37:07 +00:00
Eli Friedman	6e9cab83b0	Fix the logic in BasicAliasAnalysis::aliasGEP for comparing GEP's with variable differences so that it actually does something sane. Fixes PR10881. llvm-svn: 139276	2011-09-08 02:23:31 +00:00
Jim Grosbach	5ac3aa158b	Thumb2 assembly parsing and encoding for LDR post-indexed. More cleanup of the general indexed addressing T2 instructions. Still more to do, especially for stores. llvm-svn: 139272	2011-09-08 01:01:32 +00:00
Jim Grosbach	1aa191032a	Thumb2 assembly parsing and encoding for LDR pre-indexed w/ writeback. Adjust encoding of writeback load/store instructions to better reflect the way the operand types are represented. llvm-svn: 139270	2011-09-08 00:39:19 +00:00
Owen Anderson	4a5ec6836f	Remove the "common" set of instructions shared between ARM and Thumb2 modes. This is no longer needed now that Thumb2 has its own copy of the STC/LDC instructions. llvm-svn: 139268	2011-09-08 00:11:18 +00:00
Jim Grosbach	8b54d19514	Thumb2 assembly parsing and encoding for LDRBT. llvm-svn: 139267	2011-09-07 23:39:14 +00:00
Jim Grosbach	a3ff9eeb85	Thumb2 assembly parsing and encoding for LDR(register). llvm-svn: 139264	2011-09-07 23:10:15 +00:00
Benjamin Kramer	f4e9cbfc05	Add two notes for correlated-expression optimizations. llvm-svn: 139263	2011-09-07 22:49:26 +00:00
Jakob Stoklund Olesen	6853bdde5b	Revert r139247 "Cache intermediate results during traceSiblingValue." It broke the self host and clang-x86_64-darwin10-RA. llvm-svn: 139259	2011-09-07 21:43:52 +00:00
Jim Grosbach	d640c62856	Thumb2 assembly parsing and encoding for LDRB(immediate). llvm-svn: 139258	2011-09-07 21:41:25 +00:00
Owen Anderson	26467730c1	Create Thumb2 versions of STC/LDC, and reenable the relevant tests. llvm-svn: 139256	2011-09-07 21:10:42 +00:00
Jim Grosbach	20642fb479	Thumb2 parsing and encoding for LDR(immediate). The immediate offset of the non-writeback i8 form (encoding T4) allows negative offsets only. The positive offset form of the encoding is the LDRT instruction. Immediate offsets in the range [0,255] use encoding T3 instead. llvm-svn: 139254	2011-09-07 20:58:57 +00:00
Jim Grosbach	054b346e46	Thumb2 parsing and encoding for LDMDB. llvm-svn: 139251	2011-09-07 19:57:53 +00:00
James Molloy	ac057f13a5	Second of a three-patch series aiming to fix MSR/MRS on Cortex-M. This adds predicate checking to the Disassembler. llvm-svn: 139250	2011-09-07 19:42:28 +00:00
Jakob Stoklund Olesen	46444bd655	Cache intermediate results during traceSiblingValue. In some cases such as interpreters using indirectbr, the CFG can be very complicated, and live range splitting may be forced to insert a large number of phi-defs. When that happens, traceSiblingValue can spend a lot of time zipping around in the CFG looking for defs and reloads. This patch causes more information to be cached in SibValues, and the cached values are used to terminate searches early. This speeds up spilling by 20x in one interpreter test case. For more typical code, this is just a 10% speedup of spilling. llvm-svn: 139247	2011-09-07 19:07:31 +00:00
Eli Friedman	9ea5599729	Fix atomic load and store on x86 to pass -verify-machineinstrs (and possibly fix some subtle bugs involving passes which check mayStore()). This isn't exactly ideal, but it is good enough for the moment. llvm-svn: 139245	2011-09-07 18:48:32 +00:00
Jim Grosbach	bd018fd94f	Thumb2 ldm/stm 'db' mnemonics don't have a '.w' suffix. There is no 16-bit wide encoding, so the .w suffix isn't needed (indeed, isn't documented as allowed). Also add the missing '!' token on the _UPD variant. llvm-svn: 139243	2011-09-07 18:39:47 +00:00
Jim Grosbach	20689d28e7	Thumb2 parsing and encoding for LDMIA. Choose 32-bit vs. 16-bit encoding when there's no .w suffix in post-processing as match classes are insufficient to handle the context-sensitiveness of the writeback operand's legality for the 16-bit encodings. llvm-svn: 139242	2011-09-07 18:05:34 +00:00
Owen Anderson	4106b9fb31	Port more assembler tests over to disassembler tests, and fix a minor logic error that exposed. llvm-svn: 139240	2011-09-07 17:55:19 +00:00
James Molloy	f781d3d8e9	Refactor instprinter and mcdisassembler to take a SubtargetInfo. Add -mattr= handling to llvm-mc. Reviewed by Owen Anderson. llvm-svn: 139237	2011-09-07 17:24:38 +00:00
Jim Grosbach	c0ebdea61f	Thumb2 use 'ldm' as default mnemonic. Handle explicit 'ia' suffix via a MnemonicAlias (pre-existing). llvm-svn: 139234	2011-09-07 16:22:42 +00:00
Rafael Espindola	1cca4f99bd	Detect attempt to use segmented stacks on non ELF systems and error (not assert) early. llvm-svn: 139233	2011-09-07 16:10:57 +00:00
Jim Grosbach	7969f880c0	Better diagnostic location information for mnemonic suffices. llvm-svn: 139232	2011-09-07 16:06:04 +00:00
Eli Friedman	6a45370c0f	Relax the MemOperands on atomics a bit. Fixes -verify-machineinstrs failures for atomic laod/store on ARM. (The fix for the related failures on x86 is going to be nastier because we actually need Acquire memoperands attached to the atomic load instrs, etc.) llvm-svn: 139221	2011-09-07 02:23:42 +00:00
Devang Patel	f4483238b6	While sinking machine instructions, sink matching DBG_VALUEs also otherwise live debug variable pass will drop DBG_VALUEs on the floor. llvm-svn: 139208	2011-09-07 00:07:58 +00:00
Bill Wendling	763ed58408	Reenable compact unwind by default. However, also emit the old version of unwind information for older linkers. llvm-svn: 139206	2011-09-06 23:47:14 +00:00
Owen Anderson	9ae90800a2	memset_pattern16 uses a 16 BYTE pattern, not a 16 BIT pattern. Add comments to that effect. llvm-svn: 139205	2011-09-06 23:43:26 +00:00
Owen Anderson	483f94e8d1	Teach BasicAA about the aliasing properties of memset_pattern16. Fixes PR10872 and <rdar://problem/10065079>. llvm-svn: 139204	2011-09-06 23:33:25 +00:00
Jim Grosbach	2b87e14298	ISB is HasDB, not just HasV7. llvm-svn: 139202	2011-09-06 23:09:19 +00:00
Jim Grosbach	14720bed32	Thumb2 parsing and encoding for ISB. llvm-svn: 139200	2011-09-06 22:53:27 +00:00
Jim Grosbach	276e51888c	Thumb2 parsing and encoding for DMB. llvm-svn: 139193	2011-09-06 22:14:58 +00:00
Jim Grosbach	c0aaa747a1	Thumb2 parsing and encoding for DBG. llvm-svn: 139191	2011-09-06 22:06:40 +00:00
Jim Grosbach	4258d5ffba	Thumb2 parsing and encoding for CMN and CMP. llvm-svn: 139188	2011-09-06 21:44:58 +00:00
Nick Lewycky	8203bcfd03	This transform only handles two-operand AddRec's. Prevent it from trying to handle anything more complex. Fixes PR10383 again! llvm-svn: 139186	2011-09-06 21:42:18 +00:00
Eli Friedman	33a078523a	Add mayLoad/mayStore markings to ARM 64-bit atomic pseudo-instructions. llvm-svn: 139179	2011-09-06 20:53:37 +00:00
Jim Grosbach	b5dcc965a7	Thumb2 parsing and encoding for CLREX. llvm-svn: 139172	2011-09-06 20:27:04 +00:00
Andrew Trick	8145f71bab	Add -verify-indvars for imperfect SCEV trip count verification after indvars. llvm-svn: 139169	2011-09-06 20:20:38 +00:00
Rafael Espindola	9182560b8f	Fix comment. Noticed by Duncan. llvm-svn: 139161	2011-09-06 19:29:31 +00:00
Duncan Sands	d1311488fe	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Evan Cheng	891e9696ea	Fix fall outs from my recent change on how carry bit is modeled during isel. Now the 'S' instructions, e.g. ADDS, treat S bit as optional operand as well. Also fix isel hook to correctly set the optional operand. rdar://10073745 llvm-svn: 139157	2011-09-06 18:52:20 +00:00
Devang Patel	1366637777	Use IRBuilder. llvm-svn: 139156	2011-09-06 18:49:53 +00:00
Jim Grosbach	86c318e475	ARM .code directive should always go to the streamer. Even if there's no mode switch performed, the .code directive should still be sent to the output streamer. Otherwise, for example, an output asm stream is not equivalent to the input stream which generated it (a dependency on the input target triple arm vs. thumb is introduced which was not originally there). llvm-svn: 139155	2011-09-06 18:46:23 +00:00
Rafael Espindola	9d9df4bc1a	Fix style issues and typos found by Duncan. llvm-svn: 139154	2011-09-06 18:43:08 +00:00
Bill Wendling	32608b1900	As a first step, emit both the compact unwind and CIE/FDEs for a function. llvm-svn: 139152	2011-09-06 18:37:11 +00:00
Owen Anderson	ca0326a423	Try again at r138809 (make DSE more aggressive in removing dead stores at the end of a function), now with less deleting stores before memcpy's. llvm-svn: 139150	2011-09-06 18:14:09 +00:00
Jakob Stoklund Olesen	7994269719	Atomic pseudos don't use (as in read) CPSR. They clobber it. llvm-svn: 139148	2011-09-06 17:40:35 +00:00
Devang Patel	2c2dd9114e	Now, named mdnode llvm.dbg.cu keeps track of all compile units in a module. Update DebugInfoFinder to collect compile units from llvm.dbg.cu. llvm-svn: 139147	2011-09-06 17:40:08 +00:00
Duncan Sands	6939ae53ac	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Nick Lewycky	e1c0b41d41	Fix typo in comment again. llvm-svn: 139139	2011-09-06 07:02:40 +00:00
Nick Lewycky	700f71a0ac	Apparently we compile the code, not the comments. Thanks Eli! llvm-svn: 139138	2011-09-06 06:56:00 +00:00
Nick Lewycky	4add6eec38	Fix typo in comment. llvm-svn: 139137	2011-09-06 06:46:01 +00:00
Nick Lewycky	0c9df5d6c2	Nope! I had it right the first time. Revert the operative part of r139135 and add more showing of my work. llvm-svn: 139136	2011-09-06 06:39:54 +00:00
Nick Lewycky	39b165bb7d	Fix flipped sign. While there, show my math. llvm-svn: 139135	2011-09-06 05:33:18 +00:00
Nick Lewycky	fdc650ea7a	No no no, fix typo properly! llvm-svn: 139134	2011-09-06 05:08:09 +00:00
Nick Lewycky	3823432a57	The logic inside getMulExpr to simplify {a,+,b}*{c,+,d} was wrong, which was visible given a=b=c=d=1, on iteration #1 (the second iteration). Replace it with correct math. Fixes PR10383! llvm-svn: 139133	2011-09-06 05:05:14 +00:00
Nick Lewycky	18c0b01a56	Revert r139126 due to selfhost failures reported by buildbots. llvm-svn: 139130	2011-09-06 02:43:13 +00:00
Nick Lewycky	30dcc754df	Teach SCEV to report a max backedge count in one interesting case in HowFarToZero; the case for a canonical loop. llvm-svn: 139126	2011-09-05 23:25:16 +00:00
Nick Lewycky	9b5a242546	Add a new MC bit for NaCl (Native Client) mode. NaCl requires that certain instructions are more aligned than the CPU requires, and adds some additional directives, to follow in future patches. Patch by David Meyer! llvm-svn: 139125	2011-09-05 21:51:43 +00:00
Nick Lewycky	cf82a5a673	Update the C++ backend to use the new ArrayRef'ified APIs. Patch by arrowdodger! llvm-svn: 139124	2011-09-05 18:50:59 +00:00
Nick Lewycky	c10a9bb850	Fix typo in comment. llvm-svn: 139122	2011-09-05 18:35:03 +00:00
Benjamin Kramer	ec933b857e	InstSimplify: Don't try to replace an extractvalue/insertvalue pair with the original value if types don't match. Fixes clang selfhost. llvm-svn: 139120	2011-09-05 18:16:19 +00:00
Duncan Sands	d883f9f371	Delete trivial landing pads that just continue unwinding the caught exception. llvm-svn: 139117	2011-09-05 12:57:57 +00:00
Duncan Sands	a74d10bb60	Add some simple insertvalue simplifications, for the purpose of cleaning up do-nothing exception handling code produced by dragonegg. llvm-svn: 139113	2011-09-05 06:52:48 +00:00
Benjamin Kramer	0c1a5d2067	Use canonical forms for the branch probability zero heutistic. - Drop support for X >u 0, it's equivalent to X != 0 and should be canonicalized into the latter. - Add X < 1 -> unlikely, which is what instcombine canonicalizes X <= 0 into. - Add X > -1 -> likely, which is what instcombine canonicalizes X >= 0 into. llvm-svn: 139110	2011-09-04 23:53:04 +00:00
Bill Wendling	0506959970	Use Duncan's patch to delete the instructions in reverse order (minus the landingpad and terminator). llvm-svn: 139090	2011-09-04 09:43:36 +00:00
Bill Wendling	dbea8de893	The insertion point for the loads is right before the llvm.eh.exception call. The call may be in the same BB as the landingpad instruction. If that's the case, then inserting the loads after the landingpad inst, but before the extractvalues, causes undefined behavior. llvm-svn: 139088	2011-09-04 09:02:18 +00:00
Benjamin Kramer	902004dcd8	Use internal storage for command line option. llvm-svn: 139079	2011-09-03 03:45:06 +00:00
Bill Wendling	c273c3f456	Don't reload the values that are already there. The llvm.eh.resume uses the same values that the resume instruction uses. PR10850 llvm-svn: 139076	2011-09-03 01:38:17 +00:00
Bruno Cardoso Lopes	02157d584a	Add AVX versions to match AESENC/AESDEC intrinsics. This hopefully ends the cycle of missing AVX counterparts of already present SSE* patterns llvm-svn: 139073	2011-09-03 00:47:08 +00:00
Bruno Cardoso Lopes	c72ce24240	Add AVX version of a SSE4.1 VPBLENDVB pattern llvm-svn: 139072	2011-09-03 00:47:05 +00:00
Bruno Cardoso Lopes	a25fc6f941	Add AVX versions of SSE4.1 EXTRACTPS patterns llvm-svn: 139071	2011-09-03 00:47:03 +00:00
Bruno Cardoso Lopes	45d02d5eca	Add AVX versions for SSE4.1 MOVZX* patterns llvm-svn: 139070	2011-09-03 00:47:01 +00:00
Bruno Cardoso Lopes	cadec3711c	Add one more AVX pattern for MOVZPQILo2PQI llvm-svn: 139069	2011-09-03 00:46:58 +00:00
Bruno Cardoso Lopes	48eeb79003	Move PUNPCKLQDQ splat pattern close to the instruction definition and duplicate it for AVX mode. llvm-svn: 139068	2011-09-03 00:46:56 +00:00
Bruno Cardoso Lopes	ca90af60bd	Add AVX pattern versions for PSHUFB,PSIGN{B,W,D} llvm-svn: 139067	2011-09-03 00:46:54 +00:00
Bruno Cardoso Lopes	7fae5ca308	Add AVX versions of MOVZDI2PDI patterns. Use SUBREG_TO_REG to indicate that the AVX versions (even the 128-bit ones) all clear the upper part of the destination register. llvm-svn: 139066	2011-09-03 00:46:51 +00:00
Bruno Cardoso Lopes	e749426ece	Enforce subtarget checks in a few places to be explicit when the pattern should be matched llvm-svn: 139065	2011-09-03 00:46:49 +00:00
Bruno Cardoso Lopes	323a5b334e	Tidy up code moving patterns to their appropriate place! llvm-svn: 139064	2011-09-03 00:46:47 +00:00
Bruno Cardoso Lopes	ea1931b9d0	Add AVX versions of FsMOVAPS and FsMOVAPS. Teach X86InstrInfo how to use it! llvm-svn: 139063	2011-09-03 00:46:45 +00:00
Bruno Cardoso Lopes	eb041875c1	Teach X86FastISel to use AVX versions of instructions when possible llvm-svn: 139062	2011-09-03 00:46:42 +00:00
Bruno Cardoso Lopes	86c67e11c9	Fix 80-column and style llvm-svn: 139061	2011-09-03 00:46:40 +00:00
Bruno Cardoso Lopes	beb7a448e7	Tidy up some SSE/AVX convert intrinsics. Also add an AVX version of OptForSize pattern llvm-svn: 139060	2011-09-03 00:46:38 +00:00
Owen Anderson	05f809efff	Fix a truly heinous bug in DAGCombine related to AssertZext. If we have a chain of zext -> assert_zext -> zext -> use, the first zext would get simplified away because of the later zext, and then the later zext would get simplified away because of the assert. The solution is to teach SimplifyDemandedBits that assert_zext demands all of the high bits of its input, rather than only those demanded by its users. No testcase because the only example I have manifests as llvm-gcc miscompiling LLVM, and I haven't found a smaller case that reproduces this problem. Fixes <rdar://problem/10063365>. llvm-svn: 139059	2011-09-03 00:26:49 +00:00
Jakob Stoklund Olesen	ef8527b836	Pseudo CMOV instructions don't clobber EFLAGS. The explanation about a 0 argument being materialized as xor is no longer valid. Rematerialization will check if EFLAGS is live before clobbering it. The code produced by X86TargetLowering::EmitLoweredSelect does not clobber EFLAGS. This causes one less testb instruction to be generated in the cmov.ll test case. llvm-svn: 139057	2011-09-02 23:52:55 +00:00
Jakob Stoklund Olesen	29145a3de1	Check for EFLAGS live-out before clobbering it. It is only allowed to clobber EFLAGS at the end of a block if it isn't live-in to any successor. llvm-svn: 139056	2011-09-02 23:52:52 +00:00
Jakob Stoklund Olesen	6d5d51f687	Use existing function. llvm-svn: 139055	2011-09-02 23:52:49 +00:00
Jim Grosbach	fb5e64e731	Thumb2 parsing and encoding for BXJ. llvm-svn: 139053	2011-09-02 23:43:09 +00:00
Jim Grosbach	44483a9ba5	Thumb2 parsing and encoding of B instruction. Tweak handling of IT blocks a bit to enable this. The differentiation between B and Bcc needs special sauce. llvm-svn: 139049	2011-09-02 23:22:08 +00:00
Jakob Stoklund Olesen	c710d8fdc7	Remove unused variables. llvm-svn: 139047	2011-09-02 22:41:25 +00:00
Eli Friedman	383a3c76b2	Don't fast-isel for atomic load/store; some cases require extra handling missing from fast-isel. llvm-svn: 139044	2011-09-02 22:33:24 +00:00
Jim Grosbach	ba4ceeaae6	Thumb2 parsing and encoding for ASR. For other shift and rotate instructions, too. Tests for those forthcoming as I work my way through the ISA. llvm-svn: 139040	2011-09-02 21:28:54 +00:00
Andrew Trick	43d88c3879	Comment and clarifying assert. llvm-svn: 139036	2011-09-02 21:20:46 +00:00
Bill Wendling	e35fdee39e	No need to get fancy inserting a PHI node when the values are stored in stack slots. This fixes a bug where the number of nodes coming into the PHI node may not equal the number of predecessors. E.g., two or more landingpad instructions may require a PHI before reaching the eh.exception and eh.selector instructions. llvm-svn: 139035	2011-09-02 21:17:08 +00:00
Kevin Enderby	90a1526592	Change X86 disassembly to print immediates values as signed by default. Special case those instructions that the immediate is not sign-extend. radr://8795217 llvm-svn: 139028	2011-09-02 20:01:23 +00:00
Jim Grosbach	20ed697ea7	Tidy up. Formatting. llvm-svn: 139024	2011-09-02 18:46:15 +00:00
Bill Wendling	3033d7846d	Update comments to reflect reality. llvm-svn: 139023	2011-09-02 18:43:33 +00:00
Jim Grosbach	a93f292add	Tidy up. 80 columns. llvm-svn: 139022	2011-09-02 18:43:25 +00:00
Jim Grosbach	cbf37eebff	Thumb2 parsing and encoding for AND (register). llvm-svn: 139021	2011-09-02 18:41:35 +00:00
Jakob Stoklund Olesen	d10a0768cb	Simplify by using isFullCopy(). llvm-svn: 139019	2011-09-02 18:18:29 +00:00
Bill Wendling	991a1dab16	Revert r138826 until PR10834 can be fixed. llvm-svn: 139018	2011-09-02 18:15:04 +00:00
Jim Grosbach	dd0421034a	Thumb2 parsing and encoding for ADD (register). llvm-svn: 139017	2011-09-02 18:14:46 +00:00
Duncan Sands	33f33411e8	Darwin wants ctors/dtors to be ordered the other way round to linux. llvm-svn: 139015	2011-09-02 18:07:19 +00:00
Andrew Trick	36b96e4619	Enable SCEV-based unrolling by default. This changes loop unrolling to use the same mechanism for trip count computation as indvars. This is a stronger check that tends to unroll more loops. A very common side-effect is that many single iteration loops will be removed sooner. The real goal was simply to remove dependence on canonical IVs. x86 is break even. ARM performance changes to expect (+ is good): External/SPEC/CFP2000/183.equake/183.equake +13% SingleSource/Benchmarks/Dhrystone/fldry +21% MultiSource/Applications/spiff/spiff +3% SingleSource/Benchmarks/Stanford/Puzzle -14% The Puzzle regression is actually an improvement in loop optimization that defeats GVN: rdar://problem/10065079. llvm-svn: 139009	2011-09-02 17:26:28 +00:00
Jakub Staszak	4df162e09b	Return undef value (instead of arbitrary) for wrong or undef index in ConstantVector. llvm-svn: 139007	2011-09-02 17:01:40 +00:00
Jakub Staszak	132e24bf28	ConstantVector returns arbitrary value for the wrong index. This fixes PR10813. llvm-svn: 139006	2011-09-02 15:43:43 +00:00
Jakub Staszak	b82758ae9b	Compare type size instead of type _store_ size to make sure that BitCastInst will be valid. This fixes PR10820. llvm-svn: 139005	2011-09-02 14:57:37 +00:00
Kalle Raiskila	7c154fe467	Pass signed (not unsigned) 10 bit field to SPU 'ori' instruction. llvm-svn: 139004	2011-09-02 10:05:01 +00:00
Bill Wendling	66d5793dcf	Perform the upgrading of the old EH to the new EH in a more sane manner. Perform the upgrading in steps. * First, create a map of the invokes to the EH intrinsics. * Next, take that mapping and determine if the invoke's unwind destination has a single predecessor. If not, then create a new empty block to hold the new landingpad instruction. * Create a landingpad instruction into the uwnind destination. Fill it with the values from the old selector. Map the old intrinsic calls to the new landingpad values (there may be multiple landingpad instructions per instrinic call pairs). * Go through the old intrinsic calls, create a PHI node when necessary, and then replace their values with the new values from the landingpad instructions. * Delete all dead instructions. * ??? * Profit! llvm-svn: 138990	2011-09-02 01:30:08 +00:00
Owen Anderson	a319b9901d	Merge the ARM disassembler header into the implementation file, since it is not externally exposed. llvm-svn: 138982	2011-09-01 23:35:51 +00:00
Owen Anderson	c4ec9cc45f	Fix 80 columns violations. llvm-svn: 138980	2011-09-01 23:23:50 +00:00
Dan Gohman	6d0230847c	Revert r131152, r129796, r129761. This code is currently considered to be unreliable on platforms which require memcpy calls, and it is complicating broader legalize cleanups. It is hoped that these cleanups will make memcpy byval easier to implement in the future. llvm-svn: 138977	2011-09-01 23:07:08 +00:00
Benjamin Kramer	bd939ad83e	Don't drop alignment info on local common symbols. - On COFF the .lcomm directive has an alignment argument. - On ELF we fall back to .local + .comm Based on a patch by NAKAMURA Takumi. Fixes PR9337, PR9483 and PR10128. llvm-svn: 138976	2011-09-01 23:04:27 +00:00
Eli Friedman	1400b24e06	Null-initialize to shut up -Wuninitialized warnings. llvm-svn: 138974	2011-09-01 22:27:41 +00:00
James Molloy	5f19051fbd	Fix apparent build error caused by r138948 on certain versions of GCC with -Werror. Sorry for the inconvenience. llvm-svn: 138973	2011-09-01 22:01:14 +00:00
Bill Wendling	b6a419d0f0	Reduce indentation. No functionality change. llvm-svn: 138968	2011-09-01 21:29:49 +00:00
Bill Wendling	759eb19f0b	Change worklist driven deletion to be an iterative process. Duncan noticed this! llvm-svn: 138967	2011-09-01 21:28:33 +00:00
Eli Friedman	00a62b2122	Fix an issue with the IR sink pass found by inspection. (I'm not sure anyone is actually using this, but might as well fix it since I found the issue.) llvm-svn: 138965	2011-09-01 21:21:24 +00:00
Nick Lewycky	99efd4b3ac	Fix the build for us -Werror users. Remove broken emacs mode major notation marking a C++ file as C. No functionality change. llvm-svn: 138963	2011-09-01 21:09:04 +00:00
Eli Friedman	3e1fc84a39	Make isSafeToSpeculativelyExecute() return the right answer for some new instructions. Found by inspection; not sure what practical impact, if any, this has. llvm-svn: 138962	2011-09-01 21:03:03 +00:00
Jakob Stoklund Olesen	c26e2e6221	Permit remat of partial register defs when it is safe. An instruction may define part of a register where the other bits are undefined. In that case, it is safe to rematerialize the instruction. For example: %vreg2:ssub_0<def> = VLDRS <cp#0>, 0, pred:14, pred:%noreg, %vreg2<imp-def> The extra <imp-def> operand indicates that the instruction does not read the other parts of the virtual register, so a remat is safe. This patch simply allows multiple def operands for the virtual register. It is MI->readsVirtualRegister() that determines if we depend on a previous value so remat is impossible. llvm-svn: 138953	2011-09-01 18:27:51 +00:00
Jim Grosbach	36ea6726dd	ARM 'rscs' mnemonic is carry-setting 'rsc', not 'rs' with a 'cs' condition code. llvm-svn: 138952	2011-09-01 18:22:13 +00:00
Bruno Cardoso Lopes	10f234f1a7	Fix vbroadcast matching logic to early unmatch if the node doesn't have only one use. Fix PR10825. llvm-svn: 138951	2011-09-01 18:15:06 +00:00
James Molloy	4a63186421	Fix up r137380 based on post-commit review by Jim Grosbach. llvm-svn: 138948	2011-09-01 18:02:14 +00:00
Owen Anderson	d8157fabfb	t2Bcc is allowed to have a predicate without a preceding IT instruction. llvm-svn: 138946	2011-09-01 17:47:45 +00:00
Jakob Stoklund Olesen	ff7cf9e336	Revert r138794, "Do not try to rematerialize a value from a partial definition." The problem is fixed for all register allocators by r138944, so this patch is no longer necessary. <rdar://problem/10032939> llvm-svn: 138945	2011-09-01 17:25:18 +00:00

... 2 3 4 5 6 ...

49748 Commits