llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 22:12:57 +02:00

Author	SHA1	Message	Date
Akira Hatanaka	51dccb32d0	Make function loadImmediate a member of MipsSEInstrInfo and change it to return the temporary register that was used to load the immediate. Currently, it always returns register $at, but this will change if, in the future, we decide to use another register. No changes in functionality. llvm-svn: 162417	2012-08-23 00:21:05 +00:00
Akira Hatanaka	679d5c8fd7	Add a member of type Mips16InstrInfo/MipsSEInstrInfo to class Mips16RegisterInfo/MipsSERegisterInfo. No changes in functionality. llvm-svn: 162413	2012-08-22 23:58:53 +00:00
Chad Rosier	437076336a	[ms-inline asm] Avoid a false positive assertion Assertion failed: (Start.isValid() == End.isValid() && "Start and end should either both be valid or both be invalid!") when parsing inline asm. SMLoc assumes that the first char * in the source is invalid. However, when parsing an inline asm the mnemonic is at this location. I don't want to change SMLoc, so use a trivial workaround. llvm-svn: 162381	2012-08-22 19:14:29 +00:00
Benjamin Kramer	e09e72a083	Reduce duplicated hash map lookups. llvm-svn: 162362	2012-08-22 15:37:57 +00:00
Craig Topper	d66ff79b2c	Add a getName function to MachineFunction. Use it in places that previously did getFunction()->getName(). Remove includes of Function.h that are no longer needed. llvm-svn: 162347	2012-08-22 06:07:19 +00:00
Craig Topper	ba3d5bef9f	Don't cache the MBB in the class. It's only used by one function. Change a for loop over operands to use unsigned instead of int. llvm-svn: 162344	2012-08-22 05:59:59 +00:00
Craig Topper	37bdfa3177	Mark a function as static since it doesn't use anything in the class. llvm-svn: 162342	2012-08-22 05:36:44 +00:00
Akira Hatanaka	24b722f476	Add register Mips::GP to the list of reserved registers if target is bare-metal to prevent it from being clobbered. mips uses $gp to access small data section. This bug was originally reported by Carl Norum. llvm-svn: 162340	2012-08-22 03:18:13 +00:00
Akira Hatanaka	0602c4e928	Add option disable-mips-delay-filler. Turn on mips' delay slot filler by default. Patch by Carl Norum. llvm-svn: 162339	2012-08-22 02:51:28 +00:00
Jack Carter	1b099ac7c7	For mips64 switch statements in subroutines could generate within the codegen EK_GPRel64BlockAddress. This was not supported for direct object output and resulted in an assertion. This change adds support for EK_GPRel64BlockAddress for direct object. One fallout from this is to turn on rela relocations for mips64 to match gas. llvm-svn: 162334	2012-08-22 00:49:30 +00:00
Chad Rosier	3f65a99bf7	Add a few functions to TargetLibraryInfo as part of PR13574. Patch by Weiming Zhao <weimingz@codeaurora.org>. llvm-svn: 162329	2012-08-21 23:28:56 +00:00
Richard Smith	d1addbb679	Fix unaligned memory accesses when performing relocations in X86 JIT. There's no cost to using memcpy here: the fixed code is optimized by LLVM to perfect machine code. llvm-svn: 162311	2012-08-21 20:48:36 +00:00
Chad Rosier	92debd58d9	[ms-inline asm] Do not report a Parser error when matching inline assembly. llvm-svn: 162306	2012-08-21 19:36:59 +00:00
Chad Rosier	72a2747c53	[ms-inline asm] Expose the ErrorInfo from the MatchInstructionImpl. In general, this is the index of the operand that failed to match. Note: This may cause a buildbot failure due to an API mismatch in clang. Should recover with my next commit to clang. llvm-svn: 162295	2012-08-21 18:14:59 +00:00
Craig Topper	45eeb13dea	Fix up indentation and remove a couple else's after returns. llvm-svn: 162270	2012-08-21 08:29:51 +00:00
Craig Topper	aba5024223	Use uint16_t for tables of opcodes. llvm-svn: 162267	2012-08-21 08:23:21 +00:00
Craig Topper	9831045ed8	Fix up indentation. No functional change. llvm-svn: 162264	2012-08-21 08:17:07 +00:00
Craig Topper	63ef1d8341	Add a couple llvm_unreachables. Add a message to several others. llvm-svn: 162263	2012-08-21 08:16:16 +00:00
Craig Topper	3ba0ae7ec3	Replace a break with llvm_unreachable in the default case of a nested switch. Condense code a bit. No functional change. llvm-svn: 162261	2012-08-21 07:32:16 +00:00
Craig Topper	e432edabf1	Cleanup the scalar FMA3 definitions. Add patterns to fold loads with scalar forms. llvm-svn: 162260	2012-08-21 07:11:11 +00:00
Craig Topper	2e63b3ea18	Merge FMA3 instructions with and without patterns into single classes using null_frag. llvm-svn: 162257	2012-08-21 05:56:45 +00:00
Jakob Stoklund Olesen	4403f82dbf	Add a missing def flag. * Bad machine code: Explicit definition marked as use * - function: test_cos - basic block: BB#0 L.entry (0x7ff2a2024fd0) - instruction: VSETLNi32 %D11, %D11<undef>, %R0, 0, pred:14, pred:%noreg, %Q5<imp-use,kill>, %Q5<imp-def> - operand 0: %D11 llvm-svn: 162247	2012-08-21 00:34:53 +00:00
Jakob Stoklund Olesen	4d875f1e57	Use a SmallPtrSet to dedup successors in EmitSjLjDispatchBlock. The test case ARM/2011-05-04-MultipleLandingPadSuccs.ll was creating duplicate successor list entries. llvm-svn: 162222	2012-08-20 20:52:03 +00:00
Sebastian Pop	2f1237d5f4	fix HexagonSubtarget parsing of -mv flag llvm-svn: 162217	2012-08-20 19:56:47 +00:00
Michael Liao	3d421a0c4d	fix a case where all operands of BUILD_VECTOR are undefined llvm-svn: 162214	2012-08-20 17:59:18 +00:00
Akira Hatanaka	b64681df6d	Fix coding style violations in 162135 and 162136. Patch by Petar Jovanovic. llvm-svn: 162213	2012-08-20 17:53:24 +00:00
Craig Topper	77406bef3b	Remove FMA3 intrinsic instructions in favor of patterns. llvm-svn: 162194	2012-08-20 06:21:25 +00:00
Craig Topper	64c93f9d07	Use correct intrinsic for 256-bit VFMSUBADDPS. llvm-svn: 162193	2012-08-20 06:03:04 +00:00
Craig Topper	832951e7da	Remove trailing white space and tab characters. No functional change. llvm-svn: 162192	2012-08-19 23:37:46 +00:00
Nadav Rotem	589dc766e0	When unsafe math is used, we can use commutative FMAX and FMIN. In some cases this allows for better code generation. Added a new DAGCombine transformation to convert FMAX and FMIN to FMAXC and FMINC, which are commutative. For example: movaps %xmm0, %xmm1 movsd LC(%rip), %xmm0 minsd %xmm1, %xmm0 becomes: minsd LC(%rip), %xmm0 llvm-svn: 162187	2012-08-19 13:06:16 +00:00
Benjamin Kramer	dca12ad159	Fabs folding is implemented. llvm-svn: 162186	2012-08-19 09:51:44 +00:00
Jakob Stoklund Olesen	abf0a9ec82	Remove the CAND/COR/CXOR custom ISD nodes and their select code. These nodes are no longer needed because the peephole pass can fold CMOV+AND into ANDCC etc. llvm-svn: 162179	2012-08-18 21:49:50 +00:00
Craig Topper	4362ba5082	Remove virtual from many methods. These methods replace methods in the base class, but the base class methods aren't virtual so it just increased call overhead. llvm-svn: 162178	2012-08-18 21:38:45 +00:00
Jakob Stoklund Olesen	e78d4a5b08	Also combine zext/sext into selects for ARM. This turns common i1 patterns into predicated instructions: (add (zext cc), x) -> (select cc (add x, 1), x) (add (sext cc), x) -> (select cc (add x, -1), x) For a function like: unsigned f(unsigned s, int x) { return s + (x>0); } We now produce: cmp r1, #0 it gt addgt.w r0, r0, #1 Instead of: movs r2, #0 cmp r1, #0 it gt movgt r2, #1 add r0, r2 llvm-svn: 162177	2012-08-18 21:25:22 +00:00
Jakob Stoklund Olesen	ece4a53017	Also pass logical ops to combineSelectAndUse. Add these transformations to the existing add/sub ones: (and (select cc, -1, c), x) -> (select cc, x, (and, x, c)) (or (select cc, 0, c), x) -> (select cc, x, (or, x, c)) (xor (select cc, 0, c), x) -> (select cc, x, (xor, x, c)) The selects can then be transformed to a single predicated instruction by peephole. This transformation will make it possible to eliminate the ISD::CAND, COR, and CXOR custom DAG nodes. llvm-svn: 162176	2012-08-18 21:25:16 +00:00
Nadav Rotem	d01a7b5942	Reapply r162160 with a fix: Optimize Arith->Trunc->SETCC sequence to allow better compare/branch code. llvm-svn: 162172	2012-08-18 17:53:03 +00:00
Anton Korobeynikov	c0e610e681	fp16-to-fp32 conversion instructions are available in Thumb mode as well. Make sure the generic pattern is used. llvm-svn: 162170	2012-08-18 13:08:43 +00:00
Craig Topper	e341db552a	Refactor code a bit to reduce number of calls in the final compiled code. No functional change intended. llvm-svn: 162166	2012-08-18 06:39:34 +00:00
Craig Topper	d35582ae96	Reorder initialization list to silence -Wreorder llvm-svn: 162165	2012-08-18 06:20:54 +00:00
Nadav Rotem	e9cdefa762	Revert r162160 because it made a few buildbots fail. llvm-svn: 162164	2012-08-18 05:02:36 +00:00
Nadav Rotem	76f1b84f58	The X86 backend has a number of optimizations for SETCC nodes which use arithmetic instructions. However, when small data types are used, a truncate node appears between the SETCC node and the arithmetic operation. This patch adds support for this pattern. Before: xorl %esi, %edi testb %dil, %dil setne %al ret After: xorb %dil, %sil setne %al ret rdar://12081007 llvm-svn: 162160	2012-08-18 02:43:28 +00:00
Akira Hatanaka	ab6dca06f4	Add MipsELFWriterInfo.{h,cpp}. llvm-svn: 162136	2012-08-17 21:38:47 +00:00
Akira Hatanaka	a50e7bd0a6	Correct MCJIT functionality for MIPS32 architecture. No new tests are added. All tests in ExecutionEngine/MCJIT that have been failing pass after this patch is applied (when "make check" is done on a mips board). Patch by Petar Jovanovic. llvm-svn: 162135	2012-08-17 21:28:04 +00:00
Jakob Stoklund Olesen	40eb30013e	Avoid folding ADD instructions with FI operands. PEI can't handle the pseudo-instructions. This can be removed when the pseudo-instructions are replaced by normal predicated instructions. Fixes PR13628. llvm-svn: 162130	2012-08-17 20:55:34 +00:00
Akira Hatanaka	4e1b032521	Add stub methods for mips assembly matcher. Patch by Vladimir Medic. llvm-svn: 162124	2012-08-17 20:16:42 +00:00
Bill Wendling	0569e9a6f3	Change the `linker_private_weak_def_auto' linkage to `linkonce_odr_auto_hide' to make it more consistent with its intended semantics. The `linker_private_weak_def_auto' linkage type was meant to automatically hide globals which never had their addresses taken. It has nothing to do with the `linker_private' linkage type, which outputs the symbols with a `l' (ell) prefix among other things. The intended semantic is more like the `linkonce_odr' linkage type. Change the name of the linkage type to `linkonce_odr_auto_hide', and change the semantics so that it produces the correct output for the linker. Note: The old linkage name `linker_private_weak_def_auto' will still parse but is now a synonym for `linkonce_odr_auto_hide'. This should be removed in 4.0. <rdar://problem/11754934> llvm-svn: 162114	2012-08-17 18:33:14 +00:00
Jakob Stoklund Olesen	36d81e300e	Add comment, clean up code. No functional change. llvm-svn: 162107	2012-08-17 16:59:09 +00:00
Tim Northover	1de091468c	Implement NEON domain switching for scalar <-> S-register vmovs on ARM llvm-svn: 162094	2012-08-17 11:32:52 +00:00
Craig Topper	efc1bf9ee1	Use nested switch to select arguments to reduce calls to EmitPCMP. llvm-svn: 162089	2012-08-17 07:15:56 +00:00
Craig Topper	8fa010b216	Make ReplaceATOMIC_BINARY_64 a static function. Use a nested switch to reduce to only a single call to it thus allowing it to be inlined by the compiler. llvm-svn: 162088	2012-08-17 06:55:11 +00:00
Craig Topper	117916e06d	Remove unnecessary include of ARMGenInstrInfo.inc. llvm-svn: 162086	2012-08-17 06:21:09 +00:00
Jakob Stoklund Olesen	88217b055d	Add ADD and SUB to the predicable ARM instructions. It is not my plan to duplicate the entire ARM instruction set with predicated versions. We need a way of representing predicated instructions in SSA form without requiring a separate opcode. Then the pseudo-instructions can go away. llvm-svn: 162061	2012-08-16 23:21:55 +00:00
Jakob Stoklund Olesen	aca66722c2	Handle ARM MOVCC optimization in PeepholeOptimizer. Use the target independent select analysis hooks. llvm-svn: 162060	2012-08-16 23:14:20 +00:00
Roman Divacky	b95259c849	Revert r162034, r162035 and r162037. llvm-svn: 162039	2012-08-16 19:07:59 +00:00
Roman Divacky	831ddb548a	Define and handle additional fixup kinds. By Adhemerval Zanella. llvm-svn: 162037	2012-08-16 18:37:52 +00:00
Roman Divacky	3a41549e6a	Fix typo and grammar. By Adhemerval Zanella. llvm-svn: 162032	2012-08-16 18:19:29 +00:00
Jush Lu	767c82d4e0	[arm-fast-isel] Add support for fastcc. Without fastcc support, the caller just falls through to CallingConv::C for fastcc, but callee still uses fastcc, this inconsistency of calling convention is a problem, and fastcc support can fix it. llvm-svn: 162013	2012-08-16 05:15:53 +00:00
Anitha Boyapati	161fc750a1	Patch to enable FMA on bdver2 target. Make XOP feature enable FMA4 as well. llvm-svn: 162012	2012-08-16 04:04:02 +00:00
Anitha Boyapati	5443ee0d76	(no commit message) llvm-svn: 162010	2012-08-16 03:50:04 +00:00
Akira Hatanaka	623a561154	Add Android ABI to Mips backend to handle functions returning vectors of four floats. llvm-svn: 162008	2012-08-16 03:48:05 +00:00
Jakob Stoklund Olesen	55aee8b58a	Fold predicable instructions into MOVCC / t2MOVCC. The ARM select instructions are just predicated moves. If the select is the only use of an operand, the instruction defining the operand can be predicated instead, saving one instruction and decreasing register pressure. This implementation can turn AND/ORR/EOR instructions into their corresponding ANDCC/ORRCC/EORCC variants. Ideally, we should be able to predicate any instruction, but we don't yet support predicated instructions in SSA form. llvm-svn: 161994	2012-08-15 22:16:39 +00:00
Evan Cheng	625c0ca5ee	Use vld1/vst1 to load/store f64 if alignment is < 4 and the target allows unaligned access. rdar://12091029 llvm-svn: 161962	2012-08-15 17:44:53 +00:00
Jakob Stoklund Olesen	6639cea68f	Add missing Rfalse operand to the predicated pseudo-instructions. When predicating this instruction: Rd = ADD Rn, Rm We need an extra operand to represent the value given to Rd when the predicate is false: Rd = ADDCC Rfalse, Rn, Rm, pred The Rd and Rfalse operands are different registers while in SSA form. Rfalse is tied to Rd to make sure they get the same register during register allocation. Previously, Rd and Rn were tied, but that is not required. Compare to MOVCC: Rd = MOVCC Rfalse, Rtrue, pred llvm-svn: 161955	2012-08-15 16:17:24 +00:00
Anton Korobeynikov	d13403fbd1	The names of VFP variants of half-to-float conversion instructions were reversed. This leads to wrong codegen for float-to-half conversion intrinsics which are used to support storage-only fp16 type. NEON variants of same instructions are fine. llvm-svn: 161907	2012-08-14 23:36:01 +00:00
Eric Christopher	47fee59c73	This needs braces. Spotted by Bill. llvm-svn: 161906	2012-08-14 23:32:15 +00:00
Michael Liao	f763f96863	minor fix of X86ISD::VSEXT_MOVL dump llvm-svn: 161902	2012-08-14 22:53:17 +00:00
Michael Liao	daebe04c2f	fix PR11334 - FP_EXTEND only supports extending from vectors with matching elements. This results in the scalarization of extending to v2f64 from v2f32, which will be legalized to v4f32 not matching with v2f64. - add X86-specific VFPEXT supporting extending from v4f32 to v2f64. - add BUILD_VECTOR lowering helper to recover back the original extending from v4f32 to v2f64. - test case is enhanced to include different vector width. llvm-svn: 161894	2012-08-14 21:24:47 +00:00
Jim Grosbach	53796945f5	Switch the fixed-length disassembler to be table-driven. Refactor the TableGen'erated fixed length disassembler to use a table-driven state machine rather than a massive set of nested switch() statements. As a result, the ARM Disassembler (ARMDisassembler.cpp) builds much more quickly and generates a smaller end result. For a Release+Asserts build on a 16GB 3.4GHz i7 iMac w/ SSD: Time to compile at -O2 (averaged w/ hot caches): Previous: 35.5s New: 8.9s TEXT size: Previous: 447,251 New: 297,661 Builds in 25% of the time previously required and generates code 66% of the size. Execution time of the disassembler is only slightly slower (7% disassembling 10 million ARM instructions, 19.6s vs 21.0s). The new implementation has not yet been tuned, however, so the performance should almost certainly be recoverable should it become a concern. llvm-svn: 161888	2012-08-14 19:06:05 +00:00
Craig Topper	e7ac4d1df1	Factor duplicate calls to getUNDEF in several functions. llvm-svn: 161860	2012-08-14 08:18:43 +00:00
Craig Topper	a3795f6791	Re-factor intrinsic lowering to combine common parts of similar intrinsics. Reduces compiled code size a little bit. llvm-svn: 161859	2012-08-14 07:43:25 +00:00
Jakob Stoklund Olesen	33e364a3df	Remove the TII::scheduleTwoAddrSource() hook. It never does anything when running 'make check', and it gets in the way of updating live intervals in 2-addr. The hook was originally added to help form IT blocks in Thumb2 code before register allocation, but the pass ordering has changed since then, and we run if-conversion after register allocation now. When the MI scheduler is enabled, there will be no less than two schedulers between 2-addr and Thumb2ITBlockPass, so this hook is unlikely to help anything. llvm-svn: 161794	2012-08-13 21:52:57 +00:00
Manman Ren	159ae3b3bc	ARM: enable struct byval for AAPCS-VFP. This change is to be enabled in clang. rdar://9877866 llvm-svn: 161789	2012-08-13 21:22:50 +00:00
Arnold Schwaighofer	dbdb2581b8	[Hexagon] Don't mark callee saved registers as clobbered by a tail call This was causing unnecessary spills/restores of callee saved registers. Fixes PR13572. Patch by Pranav Bhandarkar! llvm-svn: 161778	2012-08-13 19:54:01 +00:00
Nadav Rotem	03c4d5f036	Do not optimize (or (and X,Y), Z) into BFI and other sequences if the AND ISDNode has more than one user. rdar://11876519 llvm-svn: 161775	2012-08-13 18:52:44 +00:00
Manman Ren	cb05c49c64	X86: move Int_CVTSD2SSrr, Int_CVTSI2SSrr, Int_CVTSI2SDrr, Int_CVTSS2SDrr from OpTbl1 to OpTbl2 since they have 3 operands and the last operand can be changed to a memory operand. PR13576 llvm-svn: 161769	2012-08-13 18:29:41 +00:00
Eric Christopher	3aea549423	Add support for the %H output modifier. Patch by Weiming Zhao. llvm-svn: 161768	2012-08-13 18:18:52 +00:00
Manman Ren	c9f5387a5c	X86: when auto-detecting the subtarget features, make sure use IsIntel to detect Nehalem, Westmere and Sandy Bridge. AMD also has processor family 6. llvm-svn: 161763	2012-08-13 17:26:46 +00:00
Tim Northover	b1f8be6cbe	Use correct loads for vector types during extending-load operations. Previously, we used VLD1.32 in all cases, however there are both 16 and 64-bit accesses being selected, so we need to use an appropriate width load in those cases. llvm-svn: 161748	2012-08-13 09:06:31 +00:00
Craig Topper	4fc08044be	Tidy up VSETCC lowering code a bit more by adding an llvm_unreachable and putting a couple of if conditions in a better order. llvm-svn: 161746	2012-08-13 03:42:38 +00:00
Craig Topper	a438ea46bf	Refactor code a bit to share commonalities. No functional change intended. llvm-svn: 161745	2012-08-13 02:34:03 +00:00
Craig Topper	bb92d94049	Fix an unused variable warning from r161742. llvm-svn: 161743	2012-08-13 01:26:45 +00:00
Craig Topper	1032fcf6da	Remove the LowerMMXCONCAT_VECTORS function. It could never execute because there are no legal 64-bit vector types that could be used as inputs to a 128-bit concat_vectors. Remove a target specific SDNode and its patterns that become unused as a result. llvm-svn: 161742	2012-08-13 01:23:55 +00:00
Craig Topper	5a5ed2d691	Remove call to setOperationAction for SETCC of v4f32. SETCC returns an integer type not an FP type. llvm-svn: 161738	2012-08-12 05:31:32 +00:00
Craig Topper	1292e1f43c	Remove unnecessary call to setOperationAction for SETCC of v2i64 under SSE42. It was already called for the same under SSE2. llvm-svn: 161737	2012-08-12 05:15:16 +00:00
Arnold Schwaighofer	c751a25aed	Revert 161581: Patch to implement UMLAL/SMLAL instructions for the ARM architecture It broke MultiSource/Applications/JM/ldecod/ldecod on armv7 thumb O0 g and armv7 thumb O3. llvm-svn: 161736	2012-08-12 05:11:56 +00:00
Craig Topper	4d9cbceefd	Change addTypeForNeon to use MVT instead of EVT so all the calls to getSimpleVT can be removed. llvm-svn: 161735	2012-08-12 03:16:37 +00:00
Craig Topper	709114d67f	Replace many calls to getSizeInBits() with is128BitVector/is256BitVector llvm-svn: 161734	2012-08-12 02:23:29 +00:00
Craig Topper	a52fcd0a14	Use MVT.isXBitVector instead of EVT.isXBitVector when setting up operation actions. Compiles to smaller code. llvm-svn: 161733	2012-08-12 00:34:56 +00:00
Michael Liao	4b95cb463a	fix PR13577, an issue introduced by r161687 - FCMOV only supports a subset of X86 conditions. Skip boolean simplification if X86 condition is not valid for FCMOV. - add a minimal test case for PR13577. llvm-svn: 161732	2012-08-11 23:47:06 +00:00
Craig Topper	93e2521659	Move setOperationAction for CONCAT_VECTORS for 256-bit vectors into loop since all 256-bit types are supported. llvm-svn: 161730	2012-08-11 22:34:26 +00:00
Craig Topper	b7f7fa86ec	Tidy up indentation. No functional change. llvm-svn: 161727	2012-08-11 17:53:00 +00:00
Craig Topper	ba0c3ebe9e	Fix a cast that was casting away 'const' unnecessarily llvm-svn: 161726	2012-08-11 17:46:16 +00:00
Craig Topper	3929432178	Add a couple default: llvm_unreachable() to some switch statements. Fix a bad message in an existing llvm_unreachable. llvm-svn: 161725	2012-08-11 17:44:14 +00:00
Manman Ren	9bd686f936	X86: when we are auto-detecting the subtarget features, make sure we turn on FeatureFastUAMem for Nehalem, Westmere and Sandy Bridge. FeatureFastUAMem is already on if we pass in nehalem or westmere as a command argument. rdar: 7252306 llvm-svn: 161717	2012-08-10 23:43:32 +00:00
Manman Ren	500d45c3d9	ARM: enable struct byval for AAPCS. This change is to be enabled in clang. rdar://9877866 PR://13350 llvm-svn: 161693	2012-08-10 20:39:38 +00:00
Michael Liao	97334a5c5f	add X86-specific DAG optimization to simplify boolean test - if a boolean test (X86ISD::CMP or X86ISD::SUB) checks a boolean value generated from X86ISD::SETCC, try to simplify the boolean value generation and checking by reusing the original EFLAGS with proper condition code - add hooks to X86 specific SETCC/BRCOND/CMOV, the major 3 places consuming EFLAGS. Part of patches fixing PR12312 llvm-svn: 161687	2012-08-10 19:58:13 +00:00
Michael Liao	81be965deb	remove trailing whitespace and test commit llvm-svn: 161664	2012-08-10 14:39:24 +00:00
Joerg Sonnenberger	f07e1e10a6	Add some missing includes for the build against stdcxx. llvm-svn: 161657	2012-08-10 10:53:56 +00:00
Eric Christopher	77ae8ee419	Remove getARMRegisterNumbering and replace with calls into the register info for getEncodingValue. This builds on the small patch of yesterday to set HWEncoding in the register file. One (deprecated) use was turned into a hard number to avoid needing register info in the old JIT. llvm-svn: 161628	2012-08-09 22:10:21 +00:00
Jakob Stoklund Olesen	c8bcc2518d	Don't modify MO while use_iterator is still pointing to it. llvm-svn: 161626	2012-08-09 22:08:24 +00:00
Chad Rosier	5efd936f43	[ms-inline asm] Extend the MC AsmParser API to match MCInsts (but not emit). This new API will be used by clang to parse ms-style inline asms. One goal of this project is to use this style of inline asm for targets other than x86. Therefore, this API needs to be implemented for non-x86 targets at some point in the future. llvm-svn: 161624	2012-08-09 22:04:55 +00:00
Jack Carter	5ce6f4b4e5	Another 32 to 64 bit sign extension bug. The fields in the td definition were switched. llvm-svn: 161607	2012-08-09 19:43:18 +00:00
Arnold Schwaighofer	f3d4d73157	Patch to implement UMLAL/SMLAL instructions for the ARM architecture This patch corrects the definition of umlal/smlal instructions and adds support for matching them to the ARM dag combiner. Bug 12213 Patch by Yin Ma! llvm-svn: 161581	2012-08-09 15:25:52 +00:00
Eric Christopher	286507043a	This field isn't used anymore, use it with HWEncoding instead. llvm-svn: 161564	2012-08-09 01:39:32 +00:00
Jakob Stoklund Olesen	74359693c4	Don't use getNextOperandForReg(). This way of using getNextOperandForReg() was unlikely to work as intended. We don't give any guarantees about the order of operands in the use-def chains, so looking only at operands following a given operand in the chain doesn't make sense. llvm-svn: 161542	2012-08-08 23:44:04 +00:00
Andrew Trick	75af469e99	Added MispredictPenalty to SchedMachineModel. This replaces an existing subtarget hook on ARM and allows standard CodeGen passes to potentially use the property. llvm-svn: 161471	2012-08-08 02:44:16 +00:00
Andrew Trick	749aa4269e	whitespace llvm-svn: 161469	2012-08-08 02:44:08 +00:00
Manman Ren	967804ad0a	X86: enable CSE between CMP and SUB We perform the following: 1> Use SUB instead of CMP for i8,i16,i32 and i64 in ISel lowering. 2> Modify MachineCSE to correctly handle implicit defs. 3> Convert SUB back to CMP if possible at peephole. Removed pattern matching of (a>b) ? (a-b):0 and like, since they are handled by peephole now. rdar://11873276 llvm-svn: 161462	2012-08-08 00:51:41 +00:00
Jakob Stoklund Olesen	924ff06fdf	Don't scan physreg use-def chains looking for a PIC base. We can't rematerialize a PIC base after register allocation anyway, and scanning physreg use-def chains is very expensive in a function with many calls. <rdar://problem/12047515> llvm-svn: 161461	2012-08-08 00:40:47 +00:00
Evan Cheng	96c6741fad	X86 cmp lowering is looking past truncate on the condition node. It should only do so when the high bits are known zero. This caused a subtle miscompilation. rdar://12027825 llvm-svn: 161451	2012-08-07 22:21:00 +00:00
Hal Finkel	aa174abb14	Add a comment about mftb vs. mfspr on PPC. Thanks to Alex Rosenberg for the suggestion. llvm-svn: 161428	2012-08-07 17:04:20 +00:00
Bill Wendling	69f9777937	Revert r161371. Removing the 'const' before Type is a "good thing". --- Reverse-merging r161371 into '.': U include/llvm/Target/TargetData.h U lib/Target/TargetData.cpp llvm-svn: 161394	2012-08-07 05:51:59 +00:00
Jack Carter	32420dd092	The define for 64 bit sign extension neglected to initialize fields of the class that it used. The result was nonsense code. Before: 0000000000000000 <foo>: 0: 00441100 0x441100 4: 03e00008 jr ra 8: 00000000 nop After: 0000000000000000 <foo>: 0: 00041000 sll v0,a0,0x0 4: 03e00008 jr ra 8: 00000000 nop llvm-svn: 161377	2012-08-07 00:35:22 +00:00
Bill Wendling	dc532577fd	Constify the Type parameter to some methods (which are const anyway). llvm-svn: 161371	2012-08-07 00:26:35 +00:00
Andrew Trick	35b938c991	Allow x86 subtargets to use the GenericModel defined in X86Schedule.td. This allows codegen passes to query properties like InstrItins->SchedModel->IssueWidth. It also ensures that computeOperandLatency returns the X86 defaults for loads and "high latency ops". This should have no significant impact on existing schedulers because X86 defaults happen to be the same as global defaults. llvm-svn: 161370	2012-08-07 00:25:30 +00:00
Jack Carter	d768975885	Mips relocation R_MIPS_64 relocates a 64 bit double word. I hit this in a very large program (spirit.cpp), but have not figured out how to make a small make check test for it. llvm-svn: 161366	2012-08-07 00:01:14 +00:00
Jack Carter	3f30c3effe	The Mips64InstrInfo.td definitions DynAlloc64 LEA_ADDiu64 were using a class defined for 32 bit instructions and thus the instruction was for addiu instead of daddiu. This was corrected by adding the instruction opcode as a field in the base class to be filled in by the defs. llvm-svn: 161359	2012-08-06 23:29:06 +00:00
Jack Carter	fdb00bef02	Mips relocations R_MIPS_HIGHER and R_MIPS_HIGHEST. These 2 relocations gain access to the highest and the second highest 16 bits of a 64 bit object. R_MIPS_HIGHER %higher(A+S) The %higher(x) function is [ (((long long) x + 0x80008000LL) >> 32) & 0xffff ]. R_MIPS_HIGHEST %highest(A+S) The %highest(x) function is [ (((long long) x + 0x800080008000LL) >> 48) & 0xffff ]. llvm-svn: 161348	2012-08-06 21:26:03 +00:00
Hal Finkel	15265edebe	MFTB on PPC64 should really be encoded using MFSPR. The MFTB instruction itself is being phased out, and its functionality is provided by MFSPR. According to the ISA docs, using MFSPR works on all known chips except for the 601 (which did not have a timebase register anyway) and the POWER3. Thanks to Adhemerval Zanella for pointing this out! llvm-svn: 161346	2012-08-06 21:21:44 +00:00
Eric Christopher	f5132794cd	Add support for the OpenBSD fork Bitrig. Patch by David Hill. llvm-svn: 161344	2012-08-06 20:52:18 +00:00
Roman Divacky	59eec94f55	Remove empty overrides of processFunctionBeforeFrameFinalized(). llvm-svn: 161328	2012-08-06 18:14:18 +00:00
Craig Topper	dc1b95e7de	Implement proper handling for pcmpistri/pcmpestri intrinsics. Requires custom handling in DAGISelToDAG due to limitations in TableGen's implicit def handling. Fixes PR11305. llvm-svn: 161318	2012-08-06 06:22:36 +00:00
Craig Topper	e26b30c830	Remove custom inserter for MWAIT. It doesn't do anything that couldn't be represented in a pattern. llvm-svn: 161306	2012-08-05 00:36:57 +00:00
Craig Topper	c716a3f554	Use a COPY node instead of an explicit MOVA opcode in the custom inserter for pcmpestrm/pcmpistrm. Allows the register allocator to handle it better and prevent wasted identity moves. llvm-svn: 161305	2012-08-05 00:17:48 +00:00
Hal Finkel	aadd19de06	Add readcyclecounter lowering on PPC64. On PPC64, this can be done with a simple TableGen pattern. To enable this, I've added the (otherwise missing) readcyclecounter SDNode definition to TargetSelectionDAG.td. llvm-svn: 161302	2012-08-04 14:10:46 +00:00
Anton Korobeynikov	b0d4fe0a5e	Skip impdef regs during eabi save/restore list emission to workaround PR11902 llvm-svn: 161301	2012-08-04 13:25:58 +00:00
Anton Korobeynikov	dca34647bc	Recognize vst1.64 / vld1.64 with 3 and 4 regs as load from / store to stack stuff (this corresponds to spilling/reloading regs in DTriple / DQuad reg classes). No testcase, found by inspection. llvm-svn: 161300	2012-08-04 13:22:14 +00:00
Anton Korobeynikov	6dd5c91aae	Add stack spill / reload instructions for DTriple and DQuad register classes, which were missed for no reason. This fixes PR13377 llvm-svn: 161299	2012-08-04 13:16:12 +00:00
Akira Hatanaka	ebbe0eff91	1. Redo mips16 instructions to avoid multiple opcodes for same instruction. Change these to patterns. 2. Add another 16 instructions. Patch by Reed Kotler. llvm-svn: 161272	2012-08-03 22:57:02 +00:00
Gabor Greif	0c8708dd91	allow 'make CPPFLAGS=<something>' to work again. This makes this hack a bit more bearable for poor souls who need to pass custom preprocessor flags to the build process llvm-svn: 161240	2012-08-03 13:31:24 +00:00
Bob Wilson	d1eefbeac2	Fall back to selection DAG isel for calls to builtin functions. Fast isel doesn't currently have support for translating builtin function calls to target instructions. For embedded environments where the library functions are not available, this is a matter of correctness and not just optimization. Most of this patch is just arranging to make the TargetLibraryInfo available in fast isel. <rdar://problem/12008746> llvm-svn: 161232	2012-08-03 04:06:28 +00:00
Bob Wilson	627fc538a7	Add new getLibFunc method to TargetLibraryInfo. This just provides a way to look up a LibFunc::Func enum value for a function name. Alphabetize the enums and function names so we can use a binary search. llvm-svn: 161231	2012-08-03 04:06:22 +00:00
Jush Lu	ca5d760bf2	[arm-fast-isel] Add support for shl, lshr, and ashr. llvm-svn: 161230	2012-08-03 02:37:48 +00:00
Eric Christopher	714e032a36	Add support for the ARM GHC calling convention, this patch was in 3.0, but somehow managed to be dropped later. Patch by Karel Gardas. llvm-svn: 161226	2012-08-03 00:05:53 +00:00
Jim Grosbach	1663895970	ARM: Tidy up. Remove unused template parameters. llvm-svn: 161222	2012-08-02 22:08:27 +00:00
Jim Grosbach	42ef3d66ab	ARM: More InstAlias refactors to use #NAME#. llvm-svn: 161220	2012-08-02 21:59:52 +00:00
Jim Grosbach	b3f7932a3d	ARM: Refactor instaliases using TableGen support for #NAME#. Now that TableGen supports references to NAME w/o it being explicitly referenced in the definition's own name, use that to simplify assembly InstAlias definitions in multiclasses. llvm-svn: 161218	2012-08-02 21:50:41 +00:00
Manman Ren	2c236a30f6	X86 Peephole: fold loads to the source register operand if possible. Add more comments and use early returns to reduce nesting in isLoadFoldable. Also disable folding for V_SET0 to avoid introducing a const pool entry and a const pool load. rdar://10554090 and rdar://11873276 llvm-svn: 161207	2012-08-02 19:37:32 +00:00
Akira Hatanaka	6b71c77901	Move the code that creates instances of MipsInstrInfo and MipsFrameLowering out of MipsTargetMachine.cpp. llvm-svn: 161191	2012-08-02 18:21:47 +00:00
Akira Hatanaka	a1ef6b9f53	Set transient stack alignment in constructor of MipsFrameLowering and re-enable test o32_cc_vararg.ll. llvm-svn: 161189	2012-08-02 18:15:13 +00:00
Jiangning Liu	b12a230a17	Support fpv4 for ARM Cortex-M4. llvm-svn: 161163	2012-08-02 08:35:55 +00:00
Jiangning Liu	a806f72610	Fix #13035, a bug around Thumb instruction LDRD/STRD with negative #0 offset index issue. llvm-svn: 161162	2012-08-02 08:29:50 +00:00
Jiangning Liu	69f0770c21	Fix #13138, a bug around ARM instruction DSB encoding and decoding issue. llvm-svn: 161161	2012-08-02 08:21:27 +00:00
Jiangning Liu	e4c91e239f	Fix #13241, a bug around shift immediate operand for ARM instruction ADR. llvm-svn: 161159	2012-08-02 08:13:13 +00:00
Manman Ren	78b8d454cc	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. This patch is a rework of r160919 and was tested on clang self-host on my local machine. rdar://10554090 and rdar://11873276 llvm-svn: 161152	2012-08-02 00:56:42 +00:00
Manman Ren	1c73b43c93	X86: mark GATHER instructions as mayLoad llvm-svn: 161143	2012-08-01 23:28:59 +00:00
Jim Grosbach	b60d4c1110	ARM: Remove redundant instalias. llvm-svn: 161134	2012-08-01 20:33:05 +00:00
Jim Grosbach	32cbdce015	Clean up formatting. llvm-svn: 161133	2012-08-01 20:33:02 +00:00
Jim Grosbach	c4b3246b76	Tidy up. llvm-svn: 161132	2012-08-01 20:33:00 +00:00
Chad Rosier	078db59861	Whitespace. llvm-svn: 161122	2012-08-01 18:39:17 +00:00
Elena Demikhovsky	0fec7026d9	Added FMA functionality to X86 target. llvm-svn: 161110	2012-08-01 12:06:00 +00:00
Craig Topper	3f200a7638	Add more indirection to the disassembler tables to reduce amount of space used to store the operand types and encodings. Store only the unique combinations in a separate table and store indices in the instruction table. Saves about 32K of static data. llvm-svn: 161101	2012-08-01 07:39:18 +00:00
Akira Hatanaka	731add334a	Implement MipsJITInfo::replaceMachineCodeForFunction. No new test case is added. This patch makes test JITTest.FunctionIsRecompiledAndRelinked pass on mips platform. Patch by Petar Jovanovic. llvm-svn: 161098	2012-08-01 02:29:24 +00:00
Akira Hatanaka	7c93354aff	Remove unused variable. llvm-svn: 161095	2012-08-01 00:37:53 +00:00
Akira Hatanaka	c43e6b2166	Implement MipsSERegisterInfo::eliminateCallFramePseudoInstr. The function emits instructions that decrement and increment the stack pointer before and after a call when the function does not have a reserved call frame. llvm-svn: 161093	2012-07-31 23:52:55 +00:00
Akira Hatanaka	24dddbed36	Add definitions of two subclasses of MipsRegisterInfo, Mips16RegisterInfo and MipsSERegisterInfo. llvm-svn: 161092	2012-07-31 23:41:32 +00:00
Akira Hatanaka	87fd9a992a	Add definitions of two subclasses of MipsFrameLowering, Mips16FrameLowering and MipsSEFrameLowering. Implement MipsSEFrameLowering::hasReservedCallFrame. Call frames will not be reserved if there is a call with a large call frame or there are variable sized objects on the stack. llvm-svn: 161090	2012-07-31 22:50:19 +00:00
Akira Hatanaka	45e66997d4	Add Mips16InstrInfo.cpp and MipsSEInstrInfo.cpp to CMakeLists.txt. llvm-svn: 161083	2012-07-31 22:11:05 +00:00
Akira Hatanaka	46388c74c0	Add definitions of two subclasses of MipsInstrInfo, Mips16InstrInfo (for mips16), and MipsSEInstrInfo (for mips32/64). llvm-svn: 161081	2012-07-31 21:49:49 +00:00
Akira Hatanaka	44f5fec97d	Delete mips64 target machine classes. mips target machines can be used in place of them. llvm-svn: 161080	2012-07-31 21:39:17 +00:00
Akira Hatanaka	ad80f510bc	Let PEI::calculateFrameObjectOffsets compute the final stack size rather than computing it in MipsFrameLowering::emitPrologue. llvm-svn: 161078	2012-07-31 21:28:49 +00:00
Akira Hatanaka	d43e99897c	Expand DYNAMIC_STACKALLOC nodes rather than doing custom-lowering. The frame object which points to the dynamically allocated area will not be needed after changes are made to cease reserving call frames. llvm-svn: 161076	2012-07-31 20:54:48 +00:00
Akira Hatanaka	e1beddb7e8	Define ADJCALLSTACKDOWN/UP nodes. These nodes are emitted regardless of whether or not it is in mips16 mode. Define MipsPseudo (mode-independent pseudo) and PseudoSE (mips32/64 pseudo) classes. llvm-svn: 161071	2012-07-31 19:13:07 +00:00
Akira Hatanaka	4a17cb84f3	Change name of class MipsInst to InstSE to distinguish it from mips16's instruction class. SE stands for standard encoding. llvm-svn: 161069	2012-07-31 18:55:01 +00:00
Akira Hatanaka	85ddbf2e38	When store nodes or memcpy nodes are created to copy the function call arguments to the stack in MipsISelLowering::LowerCall, use stack pointer and integer offset operands rather than frame object operands. llvm-svn: 161068	2012-07-31 18:46:41 +00:00
Chad Rosier	9a4ff99710	[x86 frame lowering] In 32-bit mode, use ESI as the base pointer. Previously, we were using EBX, but PIC requires the GOT to be in EBX before function calls via PLT GOT pointer. llvm-svn: 161066	2012-07-31 18:29:21 +00:00
Akira Hatanaka	aab47c049b	Fix type of LUXC1 and SUXC1. These instructions were incorrectly defined as single-precision load and store. Also avoid selecting LUXC1 and SUXC1 instructions during isel. It is incorrect to map unaligned floating point load/store nodes to these instructions. llvm-svn: 161063	2012-07-31 18:16:49 +00:00
Craig Topper	4f038a7a70	Make INSTRUCTION_SPECIFIER_FIELDS match X86DisassemblerCommon.h. Also remove trailing whitespace. llvm-svn: 161029	2012-07-31 05:18:26 +00:00
Craig Topper	62ece8c8d5	Tidy up trailing whitespace llvm-svn: 161027	2012-07-31 04:58:05 +00:00
Craig Topper	676ae779ef	Tidy up trailing whitespace llvm-svn: 161026	2012-07-31 04:38:27 +00:00
Kevin Enderby	cde92a2741	Fix a bug in ARMMachObjectWriter::RecordRelocation() in ARMMachObjectWriter.cpp where the other_half of the movt and movw relocation entries needs to get set and only with the 16 bits of the other half. rdar://10038370 llvm-svn: 160978	2012-07-30 18:46:15 +00:00
Craig Topper	f374fd0e17	Mark MOVZX16/MOVSX16 as neverHasSideEffects/mayLoad llvm-svn: 160953	2012-07-30 07:14:07 +00:00
Craig Topper	19fc5055ea	Mark MOVZX32_NOREX as isCodeGenOnly and neverHasSideEffects. The isCodeGenOnly change allows special detection of _NOREX instructions to be removed from tablegen disassembler code. llvm-svn: 160951	2012-07-30 06:48:11 +00:00
Craig Topper	80fdfb7f56	Give VCVTTPD2DQ priority over CVTTPD2DQ. llvm-svn: 160942	2012-07-30 02:20:32 +00:00
Craig Topper	492a7af190	Fix patterns for CVTTPS2DQ to specify SSE2 instead of SSE1. llvm-svn: 160941	2012-07-30 02:14:02 +00:00
Craig Topper	9050c15c71	Fix up patterns for VCVTSS2SD. Specifically give it priority over SSE form. Add an OptForSpeed to explicitly pair up with an OptForSize that was already on another pattern. llvm-svn: 160939	2012-07-30 01:38:57 +00:00
Craig Topper	147248a6a0	Fix load types on intrinsic forms of SS2SD and SD2SS AVX/SSE convert instruction patterns. llvm-svn: 160938	2012-07-29 23:26:34 +00:00
Craig Topper	293e781ba6	Move more SSE/AVX convert instruction patterns into their definitions. llvm-svn: 160937	2012-07-29 22:30:06 +00:00
Manman Ren	ceef7c4d9b	Revert r160920 and r160919 due to dragonegg and clang selfhost failure llvm-svn: 160927	2012-07-29 02:44:09 +00:00
Craig Topper	e75418242a	Fold patterns for some of the SSE/AVX convert instructions into their instruction definitions. llvm-svn: 160922	2012-07-28 18:59:19 +00:00
Craig Topper	189349dab2	Mark some of the SSE/AVX convert instructions as mayLoad/neverHasSideEffects. llvm-svn: 160921	2012-07-28 18:36:39 +00:00
Manman Ren	ea77f9076b	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. rdar://10554090 and rdar://11873276 llvm-svn: 160919	2012-07-28 16:48:01 +00:00
Craig Topper	3c15b4afd4	Make CVTSS2SI instruction definition consistent with CVTSD2SI. llvm-svn: 160914	2012-07-28 08:28:23 +00:00
Craig Topper	8121932592	Fix up memory load types for SSE scalar convert intrinsic patterns. llvm-svn: 160913	2012-07-28 07:59:59 +00:00
Manman Ren	fbc9fcdbf2	X86 Peephole: fix PR13475 in optimizeCompare. It is possible that an instruction can use and update EFLAGS. When checking the safety, we should check the usage of EFLAGS first before declaring it is safe to optimize due to the update. llvm-svn: 160912	2012-07-28 03:15:46 +00:00
Akira Hatanaka	9a1f4df56a	Pass the correct call frame size to callseq_start node. This is needed to replace uses of function getMaxCallFrameSize defined in MipsFunctionInfo with the one MachineFrameInfo has. llvm-svn: 160841	2012-07-26 23:27:01 +00:00
Jakob Stoklund Olesen	4b1105c720	Remove the X86 sub_ss and sub_sd sub-register indexes completely. llvm-svn: 160833	2012-07-26 23:07:20 +00:00
Jakob Stoklund Olesen	008b36037d	Remove the last mentions of sub_ss and sub_sd from patterns. I'll remove these two sub-register indexes shortly. llvm-svn: 160831	2012-07-26 23:03:08 +00:00
Jakob Stoklund Olesen	c1dd7a213d	Eliminate sub_ss, sub_sd from broadcast patterns. The (COPY_TO_REGCLASS GR32:$src, VR128) pattern looks odd, but copyPhysReg does the right thing with it. (The old pattern would eventually produce the same cross-class copy). llvm-svn: 160830	2012-07-26 22:59:06 +00:00
Jakob Stoklund Olesen	419ccae442	Eliminate more sub_ss / sub_sd patterns. This gets rid of some more INSERT_SUBREG - IMPLICIT_DEF patterns, simplifying the emitted code a bit. llvm-svn: 160820	2012-07-26 22:30:18 +00:00
Jakob Stoklund Olesen	0a41a45c36	Eliminate some SUBREG_TO_REG patterns with sub_ss and sub_sd. The SUBREG_TO_REG instruction has magic semantics asserting that the source value was defined by an instruction that cleared the high half of the register. Those semantics are never actually exploited for xmm registers. llvm-svn: 160818	2012-07-26 22:03:21 +00:00
Jakob Stoklund Olesen	44868927b9	Eliminate a batch of uses of sub_ss and sub_sd in the X86 target. These idempotent sub-register indices don't do anything --- They simply map XMM registers to themselves. They no longer affect register classes either since the SubRegClasses field has been removed from Target.td. This patch replaces XMM->XMM EXTRACT_SUBREG and INSERT_SUBREG patterns with COPY_TO_REGCLASS patterns which simply become COPY instructions. The number of IMPLICIT_DEF instructions before register allocation is reduced, and that is the cause of the test case changes. llvm-svn: 160816	2012-07-26 21:40:42 +00:00
Craig Topper	9667d599eb	Make l/q suffixes on AVX forms of scalar convert instructions consistent with their non-AVX forms. llvm-svn: 160775	2012-07-26 07:48:28 +00:00
Akira Hatanaka	92df819965	Fix call setup for PIC. Patch by Reed Kotler. llvm-svn: 160774	2012-07-26 02:24:43 +00:00
Jim Grosbach	a7de0b586b	ARM: Don't assume an SDNode is a constant. Before accessing a node as a ConstantSDNode, make sure it actually is one. No testcase of non-trivial size. rdar://11948669 llvm-svn: 160735	2012-07-25 17:02:47 +00:00
Nuno Lopes	537a3395e5	make all Emit*() functions consult the TargetLibraryInfo information before creating a call to a library function. Update all clients to pass the TLI information around. Previous draft reviewed by Eli. llvm-svn: 160733	2012-07-25 16:46:31 +00:00
Rafael Espindola	062cf9ccf9	Fix typos. Thanks to Matt Beaumont-Gay for noticing it. llvm-svn: 160731	2012-07-25 15:42:45 +00:00
Rafael Espindola	8dfc815c3f	When a return struct pointer is passed in registers, the callee has nothing to pop. llvm-svn: 160725	2012-07-25 13:41:10 +00:00
Rafael Espindola	e58779ca1b	Factor a long list of conditions into a predicate function. No functionality change. llvm-svn: 160724	2012-07-25 13:35:45 +00:00
Akira Hatanaka	f52e519898	Eliminate the stack slot used to save the global base register. The long branch pass (fixed in r160601) no longer uses the global base register to compute addresses of branch destinations, so it is not necessary to reserve a slot on the stack. llvm-svn: 160703	2012-07-25 03:16:47 +00:00
Kevin Enderby	9a7bc24c01	Fix a bug in the x86 disassembler's symbolic disassembly support for Jcc-Jump if Condition Is Met instructions that was not correctly determining the target instruction. So for a jne rel32 instruction: % cat x.s .byte 0x0f, 0x85, 0x09, 0x00, 0x00, 0x00 % as x.s it was incorrectly determining the target: % otool -q -tv a.out a.out: (__TEXT,__text) section 0000000000000000 jne 0xd and with the fix it gets this correct as: % otool -q -tv a.out a.out: (__TEXT,__text) section 0000000000000000 jne 0xf rdar://11505997 llvm-svn: 160694	2012-07-24 21:40:01 +00:00
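The target arithmetic in the Jcc fix above can be checked by hand. The following is a minimal sketch, not the disassembler's actual code: `jccRel32Target` is a hypothetical helper that assumes the 6-byte `0f 8x rel32` encoding and computes the target as the address of the next instruction plus the sign-extended little-endian displacement.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not an LLVM API): computes the branch target of a
// 6-byte "0f 8x rel32" conditional jump located at instrAddress.
std::uint64_t jccRel32Target(std::uint64_t instrAddress,
                             const std::uint8_t (&bytes)[6]) {
  // The displacement is stored little-endian in bytes 2..5.
  std::uint32_t raw = static_cast<std::uint32_t>(bytes[2]) |
                      (static_cast<std::uint32_t>(bytes[3]) << 8) |
                      (static_cast<std::uint32_t>(bytes[4]) << 16) |
                      (static_cast<std::uint32_t>(bytes[5]) << 24);
  std::int64_t rel = static_cast<std::int32_t>(raw); // sign-extend
  // The displacement is relative to the end of the instruction (6 bytes).
  return instrAddress + 6 + static_cast<std::uint64_t>(rel);
}
```

For the bytes in the commit message at address 0, this gives 6 + 9 = 0xf, which matches the corrected otool output.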
Nuno Lopes	6319e86c31	add a few more functions to TargetLibraryInfo: fputc, memchr, memcmp, putchar, puts, strchr, strncmp llvm-svn: 160690	2012-07-24 21:00:36 +00:00
David Chisnall	c21e3b3ce1	ELF does not imply GNU/Linux. Do not assume GNU conventions just because we are targeting an ELF platform. Only fold gs-relative (and fs-relative) loads if it is actually sensible to do so for the target platform. This fixes PR13438. llvm-svn: 160687	2012-07-24 20:04:16 +00:00
Nuno Lopes	aa177bbdbc	TargetLibraryInfo: add strn?cat, strn?cpy, and strn?len llvm-svn: 160678	2012-07-24 17:25:06 +00:00
Akira Hatanaka	dba3c8d511	Fix function MipsCodeEmitter::emitExternalSymbolAddress to pass test ExecutionEngine/test-fp.ll. Patch by Petar Jovanovic. llvm-svn: 160653	2012-07-24 00:08:26 +00:00
Akira Hatanaka	b1ba835c42	Add basic ability to setup call frame, and make procedure calls. Hello world will compile and execute with this patch. Patch by Reed Kotler. llvm-svn: 160651	2012-07-23 23:45:54 +00:00
Akira Hatanaka	50170a31a8	Add comment for relocations MO_HIGHER and HIGHEST in MipsBaseInfo.h. llvm-svn: 160636	2012-07-23 19:19:20 +00:00
Micah Villmow	759c8c46ac	Test revert of test changes. llvm-svn: 160632	2012-07-23 16:42:45 +00:00
Micah Villmow	0482f60db4	Test commit. llvm-svn: 160631	2012-07-23 16:37:24 +00:00
Sylvestre Ledru	bf8acb65ac	Fix a typo (the the => the) llvm-svn: 160621	2012-07-23 08:51:15 +00:00
Akira Hatanaka	68623fbfd7	Fix Mips long branch pass. This pass no longer requires that the global pointer value be saved to the stack or register since it uses bal instruction to compute branch distance. llvm-svn: 160601	2012-07-21 03:30:44 +00:00
Akira Hatanaka	508804a633	Add HIGHER and HIGHEST relocations to Mips backend. llvm-svn: 160599	2012-07-21 03:09:04 +00:00
Akira Hatanaka	734a4a8569	Revert accidental commit. llvm-svn: 160598	2012-07-21 02:20:33 +00:00
Akira Hatanaka	e87a027c19	Add VK_Mips_HIGHER and VK_Mips_HIGHEST to MCSymbolRefExpr::VariantKind. Test case will be added later when long branch patch is checked in. llvm-svn: 160597	2012-07-21 02:15:19 +00:00
Craig Topper	015d52b2dc	Don't use implicit register operands to calculate L-bit for AVX instructions. Needed because super reg defs and kills are added as implicit operands on 128-bit instructions. Fixes PR13349. Patch by Jose Fonseca. llvm-svn: 160543	2012-07-20 07:03:46 +00:00
Preston Gurd	6d82adeada	Adds the family codes for the Midview Atom processors so that the Atom buildbot will auto-detect Atom. llvm-svn: 160521	2012-07-19 19:05:37 +00:00
Sebastian Pop	07b782daeb	Default to -mv4 when no version of Hexagon has been specified. This fixes a bunch of make check failures of the form: Unknown Architecture Version. UNREACHABLE executed at ../lib/Target/Hexagon/HexagonSubtarget.cpp:60! llvm-svn: 160518	2012-07-19 18:24:50 +00:00
Jush Lu	54c7329b88	[arm-fast-isel] Add support for vararg function calls. llvm-svn: 160500	2012-07-19 09:49:00 +00:00
Bill Wendling	d091d1863b	Remove tabs. llvm-svn: 160483	2012-07-19 00:25:04 +00:00
Bill Wendling	0b007e009e	Remove tabs. llvm-svn: 160479	2012-07-19 00:15:11 +00:00
Bill Wendling	17b12b72bc	Remove tabs. llvm-svn: 160477	2012-07-19 00:11:40 +00:00
Bill Wendling	b1bd365dfc	Remove tabs. llvm-svn: 160476	2012-07-19 00:06:06 +00:00
Manman Ren	dd8d9c10a3	X86: remove redundant cmp against zero. Updated OptimizeCompare in peephole to remove redundant cmp against zero. We only remove Compare if CF and OF are not used. rdar://11855129 llvm-svn: 160454	2012-07-18 21:40:01 +00:00
Preston Gurd	d2b344c685	This patch fixes 8 out of 20 unexpected failures in "make check" when run on an Intel Atom processor. The failures have arisen due to changes elsewhere in the trunk over the past 8 weeks or so. These failures were not detected by the Atom buildbot because the CPU on the Atom buildbot was not being detected as an Atom CPU. The fix for this problem is in Host.cpp and X86Subtarget.cpp, but shall remain commented out until the current set of Atom test failures are fixed. Patch by Andy Zhang and Tyler Nowicki! llvm-svn: 160451	2012-07-18 20:49:17 +00:00
Andrew Trick	f21192c005	Fix ARMTargetLowering::isLegalAddImmediate to consider thumb encodings. Based on Evan's suggestion without a committable test. llvm-svn: 160441	2012-07-18 18:34:27 +00:00
Andrew Trick	b611feef0c	whitespace llvm-svn: 160440	2012-07-18 18:34:24 +00:00
Nadav Rotem	03d2729392	The vbroadcast family of instructions has 'fallback patterns' in case where the load source operand is used by multiple nodes. The v2i64 broadcast was emulated by shuffling the two lower i32 elements to the upper two. We had a bug in the immediate used for the broadcast. Replacing 0 with 0x44. 0x44 means [01|00|01|00] which corresponds to the correct lane. Patch by Michael Kuperstein. llvm-svn: 160430	2012-07-18 08:14:48 +00:00
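To see why 0x44 is the right immediate, here is a small sketch that models a shuffle with a 2-bits-per-element immediate, assuming PSHUFD-style semantics (result element i takes the source element selected by bits 2i+1:2i). The names `V4i32` and `shuffleDwords` are illustrative only.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using V4i32 = std::array<std::uint32_t, 4>;

// Illustrative model of a 4 x i32 shuffle driven by an 8-bit immediate:
// result[i] = src[(imm >> 2*i) & 3].
V4i32 shuffleDwords(const V4i32 &src, std::uint8_t imm) {
  V4i32 out{};
  for (unsigned i = 0; i < 4; ++i)
    out[i] = src[(imm >> (2 * i)) & 0x3];
  return out;
}
```

With an immediate of 0 every result element is element 0, so only half of the 64-bit value is replicated. With 0x44 the result is {e0, e1, e0, e1}, that is, the low 64-bit lane copied into the high lane.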
Jack Carter	7f725ae6fe	Mips specific inline asm operand modifier 'M': Print the high order register of a double word register operand. In 32 bit mode, a 64 bit double word integer will be represented by 2 32 bit registers. This modifier causes the high order register to be used in the asm expression. It is useful if you are using doubles in assembler and continue to control register to variable relationships. This patch also fixes a related bug in a previous patch: case 'D': // Second part of a double word register operand case 'L': // Low order register of a double word register operand case 'M': // High order register of a double word register operand I got 'D' and 'M' confused. The second part of a double word operand will only match 'M' for one of the endiannesses. I had 'L' and 'D' be the opposite twins when 'L' and 'M' are. llvm-svn: 160429	2012-07-18 06:41:36 +00:00
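The relationship between the three modifiers described above can be sketched as a lookup of which register in the pair is printed. This is an illustration, not the backend's code: `pairRegisterIndex` is a hypothetical name, and the sketch assumes the low word lives in the first register of the pair on little-endian targets and in the second on big-endian targets.

```cpp
#include <cassert>

// Hypothetical helper: returns 0 for the first register of the pair that
// holds a 64-bit value in 32-bit mode, or 1 for the second register.
unsigned pairRegisterIndex(char modifier, bool littleEndian) {
  switch (modifier) {
  case 'D': // Second part of the double word, regardless of endianness.
    return 1;
  case 'L': // Low order register.
    return littleEndian ? 0 : 1;
  case 'M': // High order register.
    return littleEndian ? 1 : 0;
  default:  // No modifier: first register of the pair.
    return 0;
  }
}
```

Under that assumption 'L' and 'M' always select different registers, while 'D' coincides with 'M' on little-endian targets and with 'L' on big-endian ones.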
Craig Topper	6150f43b28	Remove tab characters. llvm-svn: 160425	2012-07-18 04:59:16 +00:00
Craig Topper	b086c8faf2	Fix typo in error message and remove some tab characters. llvm-svn: 160423	2012-07-18 04:36:35 +00:00
Craig Topper	b144f3b6db	Make x86 asm parser check for xmm vs ymm for index register in gather instructions. Also fix Intel syntax for gather instructions to use 'DWORD PTR' or 'QWORD PTR' to match gas. llvm-svn: 160420	2012-07-18 04:11:12 +00:00
Joel Jones	4ce75efda5	More replacing of target-dependent intrinsics with target-independent intrinsics. The second instruction(s) to be handled are the vector versions of count set bits (ctpop). The changes here are to clang so that it generates a target independent vector ctpop when it sees an ARM dependent vector bits set count. The changes in llvm are to match the target independent vector ctpop and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector pop counts with target-independent ctpops. There are also changes to an existing test case in llvm for ARM vector count instructions and to a test for the bitcode upgrade. <rdar://problem/11892519> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160410	2012-07-18 00:02:16 +00:00
Akira Hatanaka	25d4c684e9	Clean up Mips16InstrFormats.td and Mips16InstrInfo.td. Patch by Reed Kotler. llvm-svn: 160403	2012-07-17 22:55:34 +00:00
Evan Cheng	5e82ad04d5	Back out r160101 and instead implement a dag combine to recover from instcombine transformation. llvm-svn: 160387	2012-07-17 18:54:11 +00:00
Evan Cheng	f84dd0cf40	Implement r160312 as target-independent dag combine. llvm-svn: 160354	2012-07-17 08:31:11 +00:00
Evan Cheng	0b6bcb6e06	This is another case where instcombine demanded bits optimization created large immediates. Add dag combine logic to recover in case the large immediate doesn't fit in the cmp immediate operand field. int foo(unsigned long l) { return (l>> 47) == 1; } we produce %shr.mask = and i64 %l, -140737488355328 %cmp = icmp eq i64 %shr.mask, 140737488355328 %conv = zext i1 %cmp to i32 ret i32 %conv which codegens to movq $0xffff800000000000,%rax andq %rdi,%rax movq $0x0000800000000000,%rcx cmpq %rcx,%rax sete %al movzbl %al,%eax ret TargetLowering::SimplifySetCC would transform (X & -256) == 256 -> (X >> 8) == 1 if the immediate fails the isLegalICmpImmediate() test. For x86, that's immediates which are not a signed 32-bit immediate. Based on a patch by Eli Friedman. PR10328 rdar://9758774 llvm-svn: 160346	2012-07-17 06:53:39 +00:00
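The rewrite mentioned above relies on a masked equality being the same predicate as a shifted equality when the mask clears exactly the low bits that the shift discards. A minimal sketch of both forms, with illustrative function names that are not part of LLVM:

```cpp
#include <cassert>
#include <cstdint>

// (X & -256) == 256, written with an unsigned 64-bit mask.
bool maskedEq256(std::uint64_t x) {
  return (x & ~std::uint64_t{0xff}) == 256;
}

// The equivalent shifted form: (X >> 8) == 1.
bool shiftedEq1(std::uint64_t x) { return (x >> 8) == 1; }

// The two forms of foo() from the commit message, using the 47-bit shift.
bool maskedFoo(std::uint64_t l) {
  return (l & 0xffff800000000000ULL) == 0x0000800000000000ULL;
}
bool shiftedFoo(std::uint64_t l) { return (l >> 47) == 1; }
```

The shifted form compares against a small constant, so it avoids materializing the 64-bit immediates that do not fit a signed 32-bit field.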
Evan Cheng	b409a61574	For something like uint32_t hi(uint64_t res) { uint32_t hi = res >> 32; return !hi; } llvm IR looks like this: define i32 @hi(i64 %res) nounwind uwtable ssp { entry: %lnot = icmp ult i64 %res, 4294967296 %lnot.ext = zext i1 %lnot to i32 ret i32 %lnot.ext } The optimizer has optimized away the right shift and truncate but the resulting constant is too large to fit in the 32-bit immediate field. The resulting x86 code is worse as a result: movabsq $4294967296, %rax ## imm = 0x100000000 cmpq %rax, %rdi sbbl %eax, %eax andl $1, %eax This patch teaches the x86 lowering code to handle ult against a large immediate with trailing zeros. It will issue a right shift and a truncate followed by a comparison against a shifted immediate. shrq $32, %rdi testl %edi, %edi sete %al movzbl %al, %eax It also handles a ugt comparison against a large immediate with trailing bits set. i.e. X > 0x0ffffffff -> (X >> 32) >= 1 rdar://11866926 llvm-svn: 160312	2012-07-16 19:35:43 +00:00
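The two comparisons handled by this patch can be written side by side with their shifted equivalents. This is a sketch of the arithmetic identity only, with illustrative names; it does not reproduce the lowering code itself.

```cpp
#include <cassert>
#include <cstdint>

// ult against an immediate with 32 trailing zero bits.
bool ultDirect(std::uint64_t x) { return x < 0x100000000ULL; }
// Shift, truncate, and compare against the shifted immediate (1), which for
// an unsigned value is the same as testing for zero.
bool ultShifted(std::uint64_t x) {
  return static_cast<std::uint32_t>(x >> 32) == 0;
}

// ugt against an immediate with 32 trailing one bits.
bool ugtDirect(std::uint64_t x) { return x > 0x0ffffffffULL; }
bool ugtShifted(std::uint64_t x) { return (x >> 32) >= 1; }
```

Both shifted forms compare against constants that fit easily in an immediate field, which is what lets the backend avoid the `movabsq`.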
Tom Stellard	39f7e52397	Revert "AMDGPU: Add core backend files for R600/SI codegen v6" This reverts commit 4ea70107c5e51230e9e60f0bf58a0f74aa4885ea. llvm-svn: 160303	2012-07-16 18:19:53 +00:00
Tom Stellard	715b7811c2	Revert "Build script changes for R600/SI Codegen v6" This reverts commit e3013202259ed1e006c21817c63cf25d75982721. llvm-svn: 160301	2012-07-16 18:19:46 +00:00
Tom Stellard	9dc4728c5c	Revert "Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h>" This reverts commit 0258a6bdd30802f5cc0e8e57c8e768fde2aef590. llvm-svn: 160299	2012-07-16 18:19:41 +00:00
Tom Stellard	5013977c33	Revert "Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen." This reverts commit ebc934ba32ee71abbb8f0f2eb6a0fbaa613ba0d2. llvm-svn: 160298	2012-07-16 18:19:40 +00:00
Tom Stellard	9c4f5d8855	Revert "Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator..." This reverts commit 29f28bc14ad5a907f5dc849f004fafeec0aab33a. llvm-svn: 160297	2012-07-16 18:19:38 +00:00
Tom Stellard	428cc1034f	Revert "Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0)." This reverts commit 4ba4acc1bc2561b944a571edbb6a2dc78e357dfe. llvm-svn: 160296	2012-07-16 18:19:37 +00:00
Tom Stellard	5637c04c6b	Revert "Target/AMDGPU: Fix includes, or msvc build failed." This reverts commit fef4aa1b16fcf7a472559abbbcf4c1adc9eb5ca6. llvm-svn: 160295	2012-07-16 18:19:32 +00:00
Chad Rosier	16c9db9ad6	With r160248 in place this code is no longer needed. llvm-svn: 160293	2012-07-16 17:42:13 +00:00
NAKAMURA Takumi	cd72e724ac	Target/AMDGPU: Fix includes, or msvc build failed. llvm-svn: 160280	2012-07-16 15:43:50 +00:00
NAKAMURA Takumi	48743bc036	Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0). llvm-svn: 160279	2012-07-16 15:43:09 +00:00
NAKAMURA Takumi	877e9fac64	Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator... llvm-svn: 160278	2012-07-16 15:42:35 +00:00
Jack Carter	f2bf098c4f	Doubleword Shift Left Logical Plus 32. Mips shift instructions DSLL, DSRL and DSRA are transformed into DSLL32, DSRL32 and DSRA32 respectively if the shift amount is between 32 and 63. Here is a description of DSLL32: Purpose: Doubleword Shift Left Logical Plus 32. To execute a left-shift of a doubleword by a fixed amount--32 to 63 bits. Description: GPR[rd] <- GPR[rt] << (sa+32) The 64-bit doubleword contents of GPR rt are shifted left, inserting zeros into the emptied bits; the result is placed in GPR rd. The bit-shift amount in the range 0 to 31 is specified by sa. This patch implements the direct object output of these instructions. llvm-svn: 160277	2012-07-16 15:14:51 +00:00
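The selection rule described above can be sketched as follows, assuming a shift amount in the range 0 to 63. `ShiftEncoding`, `selectDoublewordShift` and `dsll32` are illustrative names, not part of the Mips backend.

```cpp
#include <cassert>
#include <cstdint>

// Illustrative result type: which opcode variant to use and the 5-bit sa field.
struct ShiftEncoding {
  bool plus32;  // true selects the "32" variant (for example DSLL32)
  unsigned sa;  // value encoded in the sa field, 0..31
};

// Precondition (assumed): amount is in 0..63.
ShiftEncoding selectDoublewordShift(unsigned amount) {
  if (amount >= 32)
    return {true, amount - 32};
  return {false, amount};
}

// Semantics of DSLL32 as quoted: GPR[rd] <- GPR[rt] << (sa + 32).
std::uint64_t dsll32(std::uint64_t rt, unsigned sa) {
  return rt << ((sa & 31u) + 32u);
}
```

For example, a shift by 40 would be emitted as the plus-32 variant with sa = 8, and applying it to 1 yields 1 << 40.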
NAKAMURA Takumi	2d04e559df	Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen. llvm-svn: 160276	2012-07-16 15:09:11 +00:00
NAKAMURA Takumi	4fd62f7458	Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h> llvm-svn: 160275	2012-07-16 15:08:47 +00:00
Tom Stellard	c75d49d526	Build script changes for R600/SI Codegen v6 llvm-svn: 160272	2012-07-16 14:17:16 +00:00
Tom Stellard	9f326179fc	AMDGPU: Add core backend files for R600/SI codegen v6 llvm-svn: 160270	2012-07-16 14:17:08 +00:00
Nadav Rotem	67ff66bd0c	Fix a bug in the 3-address conversion of LEA when one of the operands is an undef virtual register. The problem is that ProcessImplicitDefs removes the definition of the register and marks all uses as undef. If we lose the undef marker then we get a register which has no def and is not marked as undef. The live interval analysis does not collect information for these virtual registers and we crash in later passes. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160260	2012-07-16 10:52:25 +00:00
Alexey Samsonov	c68bb48704	This CL changes the function prologue and epilogue emitted on X86 when stack needs realignment. It is intended to fix PR11468. Old prologue and epilogue looked like this: push %rbp mov %rsp, %rbp and $alignment, %rsp push %r14 push %r15 ... pop %r15 pop %r14 mov %rbp, %rsp pop %rbp The problem was to reference the locations of callee-saved registers in exception handling: locations of callee-saved registers had to be re-calculated regarding the stack alignment operation. It would take some effort to implement this in LLVM, as currently MachineLocation can only have the form "Register + Offset". Function prologue and epilogue are now changed to: push %rbp mov %rsp, %rbp push %r14 push %r15 and $alignment, %rsp ... lea -$size_of_saved_registers(%rbp), %rsp pop %r15 pop %r14 pop %rbp Reviewed by Chad Rosier. llvm-svn: 160248	2012-07-16 06:54:09 +00:00
Nadav Rotem	a09775b875	Teach getTargetVShiftNode about TargetConstant nodes. llvm-svn: 160234	2012-07-15 20:27:43 +00:00
Nadav Rotem	0377e0d234	Rename VBROADCASTSDrm into VBROADCASTSDYrm to match the naming convention. Allow the folding of vbroadcastRR to vbroadcastRM, where the memory operand is a spill slot. PR12782. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160230	2012-07-15 12:26:30 +00:00
Nadav Rotem	c40d85dda5	AVX: Fix a bug in getTargetVShiftNode. The shift amount has to be a 128bit vector with the same element type as the input vector. This is needed because of the patterns we have for the VP[SLL/SRA/SRL][W/D/Q] instructions. llvm-svn: 160222	2012-07-14 22:26:05 +00:00
Joel Jones	12ea066486	This is one of the first steps at moving to replace target-dependent intrinsics with target-independent intrinsics. The first instruction(s) to be handled are the vector versions of count leading zeros (ctlz). The changes here are to clang so that it generates a target independent vector ctlz when it sees an ARM dependent vector ctlz. The changes in llvm are to match the target independent vector ctlz and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector ctlzs with target-independent ctlzs. There are also changes to an existing test case in llvm for ARM vector count instructions and a new test for the bitcode upgrade. <rdar://problem/11831778> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160200	2012-07-13 23:25:25 +00:00
Jakob Stoklund Olesen	b8af245a15	Remove variable_ops from call instructions in most targets. Call instructions are no longer required to be variadic, and variable_ops should only be used for instructions that encode a variable number of arguments, like the ARM stm/ldm instructions. llvm-svn: 160189	2012-07-13 20:44:29 +00:00
Jakob Stoklund Olesen	6944c56847	Remove variable_ops from ARM call instructions. Function argument registers are added to the call SDNode, but InstrEmitter now knows how to make those operands implicit, and the call instruction doesn't have to be variadic. Explicit register operands should only be those that are encoded in the instruction, implicit register operands are for extra dependencies like call argument and return values. llvm-svn: 160188	2012-07-13 20:27:00 +00:00
Jack Carter	b8e3cf5fbc	The Mips specific relocation R_MIPS_GOT_DISP is used in cases where global symbols are directly represented in the GOT and we use an offset into the global offset table. This patch adds direct object support for R_MIPS_GOT_DISP. llvm-svn: 160183	2012-07-13 19:15:47 +00:00
Benjamin Kramer	308eb1b4c0	Make helper functions static. llvm-svn: 160173	2012-07-13 13:25:15 +00:00
Craig Topper	a75da664ff	Mark VINSERTI128rm as MayLoad=1. Fixes PR13348. llvm-svn: 160162	2012-07-13 05:46:28 +00:00
Benjamin Kramer	558c56f216	Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and MachineLICM don't touch it. I already had the necessary things in place for IR-level passes but missed the machine passes. llvm-svn: 160137	2012-07-12 18:14:57 +00:00
Benjamin Kramer	f8e67a04f4	Add intrinsics for Ivy Bridge's rdrand instruction. The rdrand/cmov sequence is the same that is emitted by both GCC and ICC. Fixes PR13284. llvm-svn: 160117	2012-07-12 09:31:43 +00:00
Craig Topper	6eef81b65b	Update GATHER instructions to support 2 read-write operands. Patch from myself and Manman Ren. llvm-svn: 160110	2012-07-12 06:52:41 +00:00
Manman Ren	6b6d3e2854	ARM: fix typo in comments llvm-svn: 160093	2012-07-11 23:47:00 +00:00
Manman Ren	0479bff92d	ARM: Fix optimizeCompare to correctly check safe condition. It is safe if CPSR is killed or re-defined. When we are done with the basic block, check whether CPSR is live-out. Do not optimize away cmp if CPSR is live-out. llvm-svn: 160090	2012-07-11 22:51:44 +00:00
Jack Carter	7eebd50da0	Patch for Mips direct object generation. When the FT_align case in WriteFragmentData() calls Asm.getBackend().writeNopData(), nothing is done since the Mips implementation of writeNopData just returned "true". For some reason this has not caused problems in 32 bit mode, but in 64 bit mode it caused an assert when processing multiple function units. The test case included will assert without this patch. It runs twice with different flags to prevent false positives due to changes in code generation over time. llvm-svn: 160084	2012-07-11 22:17:39 +00:00
Jack Carter	d3a7595f31	This change removes an "initialization" warning. Even though variable in question could not be initialized before use, the code was such that the compiler had no way of knowing that. llvm-svn: 160081	2012-07-11 21:41:49 +00:00
Akira Hatanaka	e0fbd238e2	In register classes in MipsRegisterInfo.td, list the registers in ascending order of binary encoding. Patch by Vladimir Medic. llvm-svn: 160073	2012-07-11 20:51:50 +00:00
Chad Rosier	75817295b6	[x86 fast-isel] Per discussion with Eric, add all cases to switch with verbose comments. llvm-svn: 160069	2012-07-11 19:58:38 +00:00
Manman Ren	93ef864f3c	X86: Update to peephole optimization to move Movr0 before (Sub, Cmp) pair. When Movr0 is between sub and cmp, we move Movr0 before sub if it enables removal of Cmp. llvm-svn: 160066	2012-07-11 19:35:12 +00:00
Akira Hatanaka	2e26e543b9	Implement MipsTargetLowering::LowerSELECT_CC to custom lower SELECT_CC. llvm-svn: 160064	2012-07-11 19:32:27 +00:00
Chad Rosier	aaccad80b4	[x86 fast-isel] Rather than call llvm_unreachable() have fast-isel fall back to Selection DAG isel. Patch by Andrew Kaylor <andrew.kaylor@intel.com>. llvm-svn: 160055	2012-07-11 17:23:17 +00:00
Nadav Rotem	22652c85bc	When ext-loading and trunc-storing vectors to memory, on x86 32bit systems, allow loads/stores of 64bit values from xmm registers. llvm-svn: 160044	2012-07-11 13:27:05 +00:00
Akira Hatanaka	aad21ac7f2	Lower RETURNADDR node in Mips backend. Patch by Sasa Stankovic. llvm-svn: 160031	2012-07-11 00:53:32 +00:00
Jack Carter	639a740a15	Mips specific inline asm operand modifier 'L'. Low order register of a double word register operand. Operands are defined by the name of the variable they are marked with in the inline assembler code. This is a way to specify that the operand just refers to the low order register for that variable. It is the opposite of modifier 'D' which specifies the high order register. Example: main() { long long ll_input = 0x1111222233334444LL; long long ll_val = 3; int i_result = 0; __asm__ __volatile__( "or %0, %L1, %2" : "=r" (i_result) : "r" (ll_input), "r" (ll_val)); } Which results in: lui $2, %hi(_gp_disp) addiu $2, $2, %lo(_gp_disp) addiu $sp, $sp, -8 addu $2, $2, $25 sw $2, 0($sp) lui $2, 13107 ori $3, $2, 17476 <-- Low 32 bits of ll_input lui $2, 4369 ori $4, $2, 8738 <-- High 32 bits of ll_input addiu $5, $zero, 3 <-- Low 32 bits of ll_val addiu $2, $zero, 0 <-- High 32 bits of ll_val #APP or $3, $4, $5 <-- or i_result, high 32 ll_input, low 32 of ll_val #NO_APP addiu $sp, $sp, 8 jr $ra If no modifier is given for a long long operand in 32 bit mode, the low 32 bits are used, as ll_val shows. There is an existing bug if 'L' or 'D' is used for the destination register for 32 bit long longs in that the target value will be updated incorrectly for the non-specified part unless explicitly set within the inline asm code. llvm-svn: 160028	2012-07-10 22:41:20 +00:00
Chad Rosier	3273667edf	Move [get\|set]BasePtrStackAdjustment() from MachineFrameInfo to X86MachineFunctionInfo as this is currently only used by X86. If this ever becomes an issue on another arch (e.g., ARM) then we can hoist it back out. llvm-svn: 160009	2012-07-10 18:27:15 +00:00
Chad Rosier	5395ec6ee4	Add support for dynamic stack realignment in the presence of dynamic allocas on X86. Basically, this is a reapplication of r158087 with a few fixes. Specifically, (1) the stack pointer is restored from the base pointer before popping callee-saved registers and (2) in obscure cases (see comments in patch) we must cache the value of the original stack adjustment in the prologue and apply it in the epilogue. rdar://11496434 llvm-svn: 160002	2012-07-10 17:45:53 +00:00
Nadav Rotem	5f6e9d5ffe	Improve the loading of load-anyext vectors by allowing the codegen to load multiple scalars and insert them into a vector. Next, we shuffle the elements into the correct places, as before. Also fix a small dagcombine bug in SimplifyBinOpWithSameOpcodeHands, when the migration of bitcasts happened too late in the SelectionDAG process. llvm-svn: 159991	2012-07-10 13:25:08 +00:00
Richard Barton	2bacde8589	Fix instruction description of VMOV (between two ARM core registers and two single-precision registers) (and do it properly this time!) llvm-svn: 159989	2012-07-10 12:51:09 +00:00
Craig Topper	b346ce8240	Reverse assembler/disassembler operand order for gather instructions. llvm-svn: 159983	2012-07-10 06:38:33 +00:00
Jim Grosbach	83589b60dc	ARM: Allow more flexible patterns in NEON formats. Some NEON instructions want to match against normal SDNodes for some operand types and Intrinsics for others. For example, CTLZ. To enable this, switch from explicitly requiring Intrinsic on the class templates to using SDPatternOperator instead. llvm-svn: 159974	2012-07-10 00:51:13 +00:00
Akira Hatanaka	96b3eb563a	Make register Mips::RA allocatable if not in mips16 mode. llvm-svn: 159971	2012-07-10 00:19:06 +00:00
Chad Rosier	b986265e3b	Revert r159938 (and r159945) to appease the buildbots. llvm-svn: 159960	2012-07-09 20:43:34 +00:00
Manman Ren	dc41586be4	X86: implement functions to analyze & synthesize CMOV\|SET\|Jcc getCondFromSETOpc, getCondFromCMovOpc, getSETFromCond, getCMovFromCond No functional change intended. If we want to update the condition code of CMOV\|SET\|Jcc, we first analyze the opcode to get the condition code, then update the condition code, finally synthesize the new opcode from the new condition code. llvm-svn: 159955	2012-07-09 18:57:12 +00:00
Akira Hatanaka	3d2bcefaf1	Reapply r158846. Access mips register classes via MCRegisterInfo's functions instead of via the TargetRegisterClasses defined in MipsGenRegisterInfo.inc. llvm-svn: 159953	2012-07-09 18:46:47 +00:00
Richard Barton	de6e2755f9	Some formatting to keep Clang happy llvm-svn: 159948	2012-07-09 18:30:56 +00:00
Richard Barton	1f07c6525e	Oops - correct broken disassembly for VMOV llvm-svn: 159945	2012-07-09 18:20:02 +00:00
Richard Barton	cb28956a79	Fix instruction description of VMOV (between two ARM core registers and two single-precision registers) llvm-svn: 159938	2012-07-09 16:41:33 +00:00
Richard Barton	58c6ccbb1c	Prevent ARM assembler from losing a right shift by #32 applied to a register llvm-svn: 159937	2012-07-09 16:31:14 +00:00
Richard Barton	957a588c71	Spelling! llvm-svn: 159936	2012-07-09 16:14:28 +00:00
Richard Barton	2ca50f6513	Teach the assembler to use the narrow thumb encodings of various three-register dp instructions where permissible. llvm-svn: 159935	2012-07-09 16:12:24 +00:00
Andrew Trick	b9c8074dcd	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraries for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex constraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Manman Ren	eca5886e50	X86: Fix optimizeCompare to correctly check safe condition. It is safe if EFLAGS is killed or re-defined. When we are done with the basic block, check whether EFLAGS is live-out. Do not optimize away cmp if EFLAGS is live-out. llvm-svn: 159888	2012-07-07 03:34:46 +00:00
Chad Rosier	a9d216beac	Fix the naming of ensureAlignment. Per the coding standard function names should be camel case, and start with a lower case letter. llvm-svn: 159877	2012-07-06 23:13:38 +00:00
Jim Grosbach	b9fd88619e	ARM: Add test cleanup entry to the README. llvm-svn: 159864	2012-07-06 21:52:04 +00:00
Akira Hatanaka	37565e70b6	revert r159851. llvm-svn: 159854	2012-07-06 20:16:48 +00:00

22190 Commits