llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 22:12:57 +02:00

Author	SHA1	Message	Date
Logan Chien	a15abb3d65	Fix UseInitArray option for MIPS target. llvm-svn: 163193	2012-09-05 06:17:17 +00:00
Craig Topper	6274d26545	Convert vextracti128/vextractf128 intrinsics to extract_subvector at DAG build time. Similar was previously done for vinserti128/vinsertf128. Add patterns for folding these extract_subvectors with stores. llvm-svn: 163192	2012-09-05 05:48:09 +00:00
Richard Smith	8213d2a51b	Remove redundant semicolons to fix -pedantic-errors build. llvm-svn: 163190	2012-09-05 01:41:37 +00:00
Chad Rosier	b75afa43e4	Fix function name per coding standard. llvm-svn: 163187	2012-09-05 01:15:43 +00:00
Preston Gurd	c80dc7d214	Generic Bypass Slow Div - CodeGenPrepare pass for identifying div/rem ops - Backend specifies the type mapping using addBypassSlowDivType - Enabled only for Intel Atom with O2 32-bit -> 8-bit - Replace IDIV with instructions which test its value and use DIVB if the value is positive and less than 256. - In the case when the quotient and remainder of a divide are used a DIV and a REM instruction will be present in the IR. In the non-Atom case they are both lowered to IDIVs and CSE removes the redundant IDIV instruction, using the quotient and remainder from the first IDIV. However, due to this optimization CSE is not able to eliminate redundant IDIV instructions because they are located in different basic blocks. This is overcome by calculating both the quotient (DIV) and remainder (REM) in each basic block that is inserted by the optimization and reusing the result values when a subsequent DIV or REM instruction uses the same operands. - Test cases check for the presence of the optimization when calculating either the quotient, remainder, or both. Patch by Tyler Nowicki! llvm-svn: 163150	2012-09-04 18:22:17 +00:00
Sergei Larin	905bc1964f	Porting Hexagon MI Scheduler to the new API. Change current Hexagon MI scheduler to use new converging scheduler. Integrates DFA resource model into it. llvm-svn: 163137	2012-09-04 14:49:56 +00:00
Arnold Schwaighofer	d606c6fcdf	Patch to implement UMLAL/SMLAL instructions for the ARM architecture This patch corrects the definition of umlal/smlal instructions and adds support for matching them to the ARM dag combiner. Bug 12213 Patch by Yin Ma! llvm-svn: 163136	2012-09-04 14:37:49 +00:00
Elena Demikhovsky	61924c155d	This patch optimizes shuffle instruction - generates 2 instructions instead of 4. Since this specific shuffle is widely used in many workloads we see a ~10% performance improvement on them. shufflevector <8 x float> %A, <8 x float> %B, <8 x i32> <i32 0, i32 8, i32 2, i32 10, i32 4, i32 12, i32 6, i32 14> Before: vmovaps (%rdx), %ymm0 vshufps $8, %ymm0, %ymm0, %ymm0 vmovaps (%rcx), %ymm1 vshufps $8, %ymm0, %ymm1, %ymm1 vunpcklps %ymm0, %ymm1, %ymm0 After: vmovaps (%rcx), %ymm0 vmovsldup (%rdx), %ymm1 vblendps $85, %ymm0, %ymm1, %ymm0 llvm-svn: 163134	2012-09-04 12:49:02 +00:00
Chad Rosier	294688cf56	[ms-inline asm] Asm operands can map to one or more MCOperands. Therefore, add the NumMCOperands argument to the GetMCInstOperandNum() function that is set to the number of MCOperands this asm operand mapped to. llvm-svn: 163124	2012-09-03 20:31:23 +00:00
Chad Rosier	6d692c7883	[ms-inline asm] Add a comment. llvm-svn: 163123	2012-09-03 19:04:35 +00:00
Chad Rosier	bd31fcd8a9	[ms-inline asm] Add an interface to the GetMCInstOperandNum() function in the MCTargetAsmParser class. llvm-svn: 163122	2012-09-03 18:47:45 +00:00
Roman Divacky	1a4b67cd3a	Remove always true checks. Noticed by Adhemerval Zanella. llvm-svn: 163117	2012-09-03 16:55:42 +00:00
Chad Rosier	bb0dcf509a	Add braces to the case statement. llvm-svn: 163116	2012-09-03 16:21:15 +00:00
Chad Rosier	fac2e7b419	Removed unused argument. llvm-svn: 163104	2012-09-03 03:16:09 +00:00
Chris Lattner	4a8f2bcb32	some peepholes that should match horizontal add/sub operations. llvm-svn: 163103	2012-09-03 02:58:21 +00:00
Chad Rosier	6fbf85d859	[ms-inline asm] Expose the Kind and Opcode variables from the MatchInstructionImpl() function. These values are used by the ConvertToMCInst() function to index into the ConversionTable. The values are also needed to call the GetMCInstOperandNum() function. llvm-svn: 163101	2012-09-03 02:06:46 +00:00
Chad Rosier	ee2993d684	Move ErrorLoc decl into the scope where it's actually used. llvm-svn: 163100	2012-09-03 01:55:11 +00:00
Nadav Rotem	d1815a0763	Not all targets have efficient ISel code generation for select instructions. For example, the ARM target does not have efficient ISel handling for vector selects with scalar conditions. This patch adds a TLI hook which allows the different targets to report which selects are supported well and which selects should be converted to CF during codegen prepare. llvm-svn: 163093	2012-09-02 12:10:19 +00:00
Tim Northover	316bfd78cd	Limit domain conversion to cases where it won't break dep chains. NEON domain conversion was too heavy-handed with its widened registers, which could have stripped existing instructions of their dependency, leaving them vulnerable to scheduling errors. llvm-svn: 163070	2012-09-01 18:07:29 +00:00
Logan Chien	b022dbf7dc	Fix Thumb2 fixup kind in the integrated-as. llvm-svn: 163063	2012-09-01 15:06:36 +00:00
Craig Topper	0791e3f380	Typos llvm-svn: 163053	2012-09-01 06:33:50 +00:00
Manman Ren	9afdad8207	SelectionDAG: when constructing VZEXT_LOAD from other loads, make sure its output chain is correctly setup. As an example, if the original load must happen before later stores, we need to make sure the constructed VZEXT_LOAD is constrained to be before the stores. rdar://11457792 llvm-svn: 163036	2012-08-31 23:16:57 +00:00
Craig Topper	2e53378ff6	Mark FMA4 instructions as commutable and add them to the folding tables. llvm-svn: 163035	2012-08-31 23:10:34 +00:00
Chad Rosier	1335fb4cf0	Remove an unused argument. The MCInst opcode is set in the ConvertToMCInst() function nowadays. llvm-svn: 163030	2012-08-31 22:12:31 +00:00
Craig Topper	4a81c1cbe0	Add selection of RegOp2MemOpTable3 to canFoldMemoryOperand llvm-svn: 163029	2012-08-31 22:12:16 +00:00
Michael Liao	6f4b3f358d	Fix PR12359 - In addition to undefined, if V2 is zero vector, skip 2nd PSHUFB and POR as well as PSHUFB will zero elements with negative indices. Patch by Sriram Murali <sriram.murali@intel.com> llvm-svn: 163018	2012-08-31 20:12:31 +00:00
Jack Carter	a986033975	The instruction DINS may be transformed into DINSU or DINSM depending on the size of the extraction and its position in the 64 bit word. This patch allows support of the dins transformations with mips64 direct object output. 0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32 DINS The field is entirely contained in the right-most word of the doubleword 32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64 DINSM The field straddles the words of the doubleword 32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32 DINSU The field is entirely contained in the left-most word of the doubleword llvm-svn: 163010	2012-08-31 18:06:48 +00:00
Chad Rosier	9367dbd900	Add a comment to explain what's really going on. llvm-svn: 163005	2012-08-31 17:24:10 +00:00
Chad Rosier	5e5a7c4932	The ConvertToMCInst() function can't fail, so remove the now dead Match_ConversionFail enum. llvm-svn: 163002	2012-08-31 16:41:07 +00:00
Craig Topper	917333c8c7	Mark FMA3 instructions as commutable so that the operands to the multiply part can be commuted. llvm-svn: 163001	2012-08-31 16:31:13 +00:00
Craig Topper	6bb3145d0d	Add support for converting llvm.fma to fma4 instructions. llvm-svn: 162999	2012-08-31 15:40:30 +00:00
Michael Liao	43c7369b24	Clean up AddedComplexity further after adding UseSSEx llvm-svn: 162973	2012-08-31 03:01:35 +00:00
Jakob Stoklund Olesen	eb687a399c	Fix a couple of typos in EmitAtomic. Thumb2 instructions are mostly constrained to rGPR, not tGPR which is for Thumb1. rdar://problem/12203728 llvm-svn: 162968	2012-08-31 02:08:34 +00:00
Jim Grosbach	6d3cb70105	X86: Fix encoding of 'movd %xmm0, %rax' The assembly string for the VMOVPQIto64rr instruction incorrectly lacked the 'v' prefix, resulting in mis-assembly of the vanilla movd instruction. llvm-svn: 162963	2012-08-31 00:30:30 +00:00
Chad Rosier	802539bb46	With the fix in r162954/162955 every cvt function returns true. Thus, have the ConvertToMCInst() return void, rather than a bool. Update all the cvt functions as well. llvm-svn: 162961	2012-08-31 00:03:31 +00:00
Chad Rosier	495e9f8b7b	Fix for r162954. Return the Error. llvm-svn: 162955	2012-08-30 23:22:05 +00:00
Chad Rosier	1421e2d649	Move a check to the validateInstruction() function where it more properly belongs. llvm-svn: 162954	2012-08-30 23:20:38 +00:00
Chad Rosier	54ce68581e	Typo. llvm-svn: 162952	2012-08-30 23:00:00 +00:00
Michael Liao	b6735b87b0	Introduce 'UseSSEx' to force SSE legacy encoding - Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is enabled. As the penalty of inter-mixing SSE and AVX instructions, we need prevent SSE legacy insn from being generated except explicitly specified through some intrinsics. For patterns supported by both SSE and AVX, so far, we force AVX insn will be tried first relying on AddedComplexity or position in td file. It's error-prone and introduces bugs accidentally. 'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited by AVX, we need this predicate to force VEX encoding or SSE legacy encoding only. For insns not inherited by AVX, we still use the previous predicates, i.e. 'HasSSEx'. So far, these insns fall into the following categories: * SSE insns with MMX operands * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH, CRC, and etc.) * SSE4A insns. * MMX insns. * x87 insns added by SSE. 2 test cases are modified: - test/CodeGen/X86/fast-isel-x86-64.ll AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be selected by fast-isel due to complicated pattern and fast-isel fallback to materialize it from constant pool. - test/CodeGen/X86/widen_load-1.ll AVX code generation is different from SSE one after fixing SSE/AVX inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of 'vmovaps'. llvm-svn: 162919	2012-08-30 16:54:46 +00:00
NAKAMURA Takumi	80e2544fa6	PPCISelLowering.cpp: Fix r162725. [Tobias von Koch] What's happening here is that the CR6SET/CR6UNSET is breaking the chain of register copies glued to the function call (BL_SVR4 node). The scheduler then moves other instructions in between those and the function call, which isn't good! Right. That's the case where there is no chain of register copies before the call, so InFlag == 0... Attached is a new revision of the patch which should fix this for good. llvm-svn: 162916	2012-08-30 15:52:29 +00:00
NAKAMURA Takumi	df4cfcd69b	PPCISelLowering.cpp: Whitespace. llvm-svn: 162915	2012-08-30 15:52:23 +00:00
Tim Northover	627f946e05	Add support for moving pure S-register to NEON pipeline if desired llvm-svn: 162898	2012-08-30 10:17:45 +00:00
Craig Topper	3bc01e8fa4	Only perform DAG combine on FMAs of legal types. llvm-svn: 162892	2012-08-30 06:56:15 +00:00
Michael Liao	0e40defe86	Fix PR13727 - The root cause is that target constant materialization in X86 fast-isel creates a PC-rel addressing which may overflow 32-bit range in non-Small code model if .rodata section is allocated too far away from code segment in MCJIT, which uses Large code model so far. - Follow the similar logic to fix non-Small code model in fast-isel by skipping non-Small code model. llvm-svn: 162881	2012-08-30 00:30:16 +00:00
Jakob Stoklund Olesen	50309198d1	Rename hasVolatileMemoryRef() to hasOrderedMemoryRef(). Ordered memory operations are more constrained than volatile loads and stores because they must be ordered with respect to all other memory operations. llvm-svn: 162861	2012-08-29 21:19:21 +00:00
Hal Finkel	b356af14b1	Reserve space for the mandatory traceback fields on PPC64. We need to reserve space for the mandatory traceback fields, though leaving them as zero is appropriate for now. Although the ABI calls for these fields to be filled in fully, no compiler on Linux currently does this, and GDB does not read these fields. GDB uses the first word of zeroes during exception handling to find the end of the function and the size field, allowing it to compute the beginning of the function. DWARF information is used for everything else. We need the extra 8 bytes of pad so the size field is found in the right place. As a comparison, GCC fills in a few of the fields -- language, number of saved registers -- but ignores the rest. IBM's proprietary OSes do make use of the full traceback table facility. Patch by Bill Schmidt. llvm-svn: 162854	2012-08-29 20:22:24 +00:00
Tim Northover	692b4c6860	Refactor setExecutionDomain to be clearer about what it's doing and more robust. llvm-svn: 162844	2012-08-29 16:36:07 +00:00
Benjamin Kramer	49d736fb29	Make helper function static. llvm-svn: 162843	2012-08-29 16:17:01 +00:00
Benjamin Kramer	b92d13cc42	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Craig Topper	aa2444a397	Convert FMA4 patterns to use target specific nodes instead of intrinsics to align with FMA3. llvm-svn: 162829	2012-08-29 07:18:25 +00:00
Andrew Trick	66d93eaf98	Cleanup sloppy code. Jakob's review. llvm-svn: 162825	2012-08-29 04:41:37 +00:00
Jush Lu	5a78c68e1d	[arm-fast-isel] Add support for ARM PIC. llvm-svn: 162823	2012-08-29 02:41:21 +00:00
Andrew Trick	48b2b90d4d	Fix ARM vector copies of overlapping register tuples. I have tested the fix, but have not been successful in generating a robust unit test. This can only be exposed through particular register assignments. llvm-svn: 162821	2012-08-29 01:58:55 +00:00
Andrew Trick	e8b0d4d64e	cleanup llvm-svn: 162820	2012-08-29 01:58:52 +00:00
Chad Rosier	eed9ef7a03	Typo. llvm-svn: 162807	2012-08-28 23:57:47 +00:00
Michael Liao	2136b1b1ed	Add comments on the literal value used. llvm-svn: 162805	2012-08-28 23:42:17 +00:00
Jack Carter	c918c7a81f	The instruction DEXT may be transformed into DEXTU or DEXTM depending on the size of the extraction and its position in the 64 bit word. This patch allows support of the dext transformations with mips64 direct object output. 0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32 DINS The field is entirely contained in the right-most word of the doubleword 32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64 DINSM The field straddles the words of the doubleword 32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32 DINSU The field is entirely contained in the left-most word of the doubleword llvm-svn: 162782	2012-08-28 20:07:41 +00:00
Michael Liao	32ad80c81f	Explicitly update the number of nodes to be traversed llvm-svn: 162780	2012-08-28 19:20:29 +00:00
Jack Carter	a525a54e64	Some instructions are passed to the assembler to be transformed to the final instruction variant. An example would be dsrll which is transformed into dsll32 if the shift value is greater than 32. For direct object output we need to do this transformation in the codegen. If the instruction was inside branch delay slot, it was being missed. This patch corrects this oversight. llvm-svn: 162779	2012-08-28 19:07:39 +00:00
Roman Divacky	7c3f29735a	Emit word of zeroes after the last instruction as a start of the mandatory traceback table on PowerPC64. This helps gdb handle exceptions. The other mandatory fields are ignored by gdb and harder to implement so just add there a FIXME. Patch by Bill Schmidt. PR13641. llvm-svn: 162778	2012-08-28 19:06:55 +00:00
Akira Hatanaka	d8b83a17c8	Follow-up patch to r162731. Fix a couple of bugs in mips' long branch pass. This patch was supposed to be committed along with r162731, so I don't have a new test case. llvm-svn: 162777	2012-08-28 18:58:57 +00:00
Hal Finkel	0673920af6	Add PPC Freescale e500mc and e5500 subtargets. Add subtargets for Freescale e500mc (32-bit) and e5500 (64-bit) to the PowerPC backend. Patch by Tobias von Koch. llvm-svn: 162764	2012-08-28 16:12:39 +00:00
Bill Wendling	6488dc22bb	The commutative flag is already correctly set within the multiclass. If we set it here, then a 'register-memory' version would wrongly get the commutative flag. <rdar://problem/12180135> llvm-svn: 162741	2012-08-28 07:36:46 +00:00
Craig Topper	803047a9bb	Convert V_SETALLONES/AVX_SETALLONES/AVX2_SETALLONES to Post-RA pseudos. llvm-svn: 162740	2012-08-28 07:30:47 +00:00
Craig Topper	02bb8ce5e0	Merge AVX_SET0PSY/AVX_SET0PDY/AVX2_SET0 into a single post-RA pseudo. llvm-svn: 162738	2012-08-28 07:05:28 +00:00
Michael Liao	1f793b9c47	Fix PR12312 - Add a target-specific DAG optimization to recognize a pattern PTEST-able. Such a pattern is a OR'd tree with X86ISD::OR as the root node. When X86ISD::OR node has only its flag result being used as a boolean value and all its leaves are extracted from the same vector, it could be folded into an X86ISD::PTEST node. llvm-svn: 162735	2012-08-28 03:34:40 +00:00
Jakob Stoklund Olesen	eefb981463	Revert r162713: "Add ATOMIC_LDR* pseudo-instructions to model atomic_load on ARM." This wasn't the right way to enforce ordering of atomics. We are already setting the isVolatile bit on memory operands of atomic operations which is good enough to enforce the correct ordering. llvm-svn: 162732	2012-08-28 03:11:27 +00:00
Akira Hatanaka	ab45f57419	Fix mips' long branch pass. Instructions emitted to compute branch offsets now use immediate operands instead of symbolic labels. This change was needed because there were problems when R_MIPS_HI16/LO16 relocations were used to make shared objects. llvm-svn: 162731	2012-08-28 03:03:05 +00:00
Hal Finkel	a65f8ac557	Split several PPC instruction classes. Slight reorganisation of PPC instruction classes for scheduling. No functionality change for existing subtargets. - Clearly separate load/store-with-update instructions from regular loads and stores. - Split IntRotateD -> IntRotateD and IntRotateDI - Split out fsub and fadd from FPGeneral -> FPAddSub - Update existing itineraries Patch by Tobias von Koch. llvm-svn: 162729	2012-08-28 02:49:14 +00:00
Hal Finkel	367c494415	Allow remat of LI on PPC. Allow load-immediates to be rematerialised in the register coalescer for PPC. This makes test/CodeGen/PowerPC/big-endian-formal-args.ll fail, because it relies on a register move getting emitted. The immediate load is equivalent, so change this test case. Patch by Tobias von Koch. llvm-svn: 162727	2012-08-28 02:10:33 +00:00
Hal Finkel	d28587407f	Eliminate redundant CR moves on PPC32. The 32-bit ABI requires CR bit 6 to be set if the call has fp arguments and unset if it doesn't. The solution up to now was to insert a MachineNode to set/unset the CR bit, which produces a CR vreg. This vreg was then copied into CR bit 6. When the register allocator saw a bunch of these in the same function, it allocated the set/unset CR bit in some random CR register (1 extra instruction) and then emitted CR moves before every vararg function call, rather than just setting and unsetting CR bit 6 directly before every vararg function call. This patch instead inserts a PPCcrset/PPCcrunset instruction which are then matched by a dedicated instruction pattern. Patch by Tobias von Koch. llvm-svn: 162725	2012-08-28 02:10:27 +00:00
Hal Finkel	caa4701e37	Optimize zext on PPC64. The zeroextend IR instruction is lowered to an 'and' node with an immediate mask operand, which in turn gets legalised to a sequence of ori's & ands. This can be done more efficiently using the rldicl instruction. Patch by Tobias von Koch. llvm-svn: 162724	2012-08-28 02:10:15 +00:00
Jakob Stoklund Olesen	882cb360be	More missing mayLoad flags on AVX multiclasses. llvm-svn: 162714	2012-08-28 00:02:01 +00:00
Jakob Stoklund Olesen	b91754771a	Add ATOMIC_LDR* pseudo-instructions to model atomic_load on ARM. It is not safe to use normal LDR instructions because they may be reordered by the scheduler. The ATOMIC_LDR pseudos have a mayStore flag that prevents reordering. Atomic loads are also prevented from participating in rematerialization and load folding. llvm-svn: 162713	2012-08-27 23:58:52 +00:00
Bill Wendling	d49e183a6f	Make sure we add the predicate after all of the registers are added. <rdar://problem/12183003> llvm-svn: 162703	2012-08-27 22:12:44 +00:00
Craig Topper	3e5376d85a	Remove MMX shift intrinsic handling code that also exists in SelectionDAGBuilder. llvm-svn: 162661	2012-08-27 08:08:30 +00:00
Craig Topper	bbee14ad9d	Don't allow vextractf128 to be folded with unaligned stores. We don't fold unaligned loads so shouldn't fold unaligned stores as it can cause an alignment fault to occur. llvm-svn: 162658	2012-08-27 07:19:59 +00:00
Craig Topper	57dd6db42e	Fold some patterns into instruction definitions so tablegen can infer flags removing the need for an explicit 'neverHasSideEffects = 1' llvm-svn: 162656	2012-08-27 07:04:50 +00:00
Craig Topper	b524d2e36d	Add HasAVX1Only predicate and use it for patterns that have an AVX1 instruction and an AVX2 instruction rather than relying on AddedComplexity. llvm-svn: 162654	2012-08-27 06:08:57 +00:00
Richard Smith	865f47cbb6	Fix integer undefined behavior due to signed left shift overflow in LLVM. Reviewed offline by chandlerc. llvm-svn: 162623	2012-08-24 23:29:28 +00:00
Jakob Stoklund Olesen	d1820cea0b	Add missing mayLoad flags to a large class of AVX *_Int instructions. llvm-svn: 162622	2012-08-24 23:29:07 +00:00
Jakob Stoklund Olesen	5eccfd2aed	Missed tLEApcrelJT. ARMConstantIslandPass expects this instruction to stay in the same basic block as the jump table branch. llvm-svn: 162615	2012-08-24 22:46:55 +00:00
Jakob Stoklund Olesen	38fa28fb10	Explicitly mark LEApcrel pseudos with hasSideEffects. It's not clear that they should be marked as such, but tbb formation fails if t2LEApcrelJT is hoisted out of a loop. This doesn't change the flags on these instructions, UnmodeledSideEffects was already inferred from the missing pattern. llvm-svn: 162603	2012-08-24 21:44:11 +00:00
Jakob Stoklund Olesen	708279db06	Fix call instruction operands in ARMFastISel. The ARM BL and BLX instructions don't have predicate operands, but the thumb variants tBL and tBLX do. The argument registers should be added as implicit uses. llvm-svn: 162593	2012-08-24 20:52:46 +00:00
Jakob Stoklund Olesen	9ebe947bb0	Mark X86::RET and RETI instructions as variadic. There is special magic happening when returning floating point values on the x87 stack. The RET instructions get extra f80 operands. llvm-svn: 162592	2012-08-24 20:52:44 +00:00
Akira Hatanaka	8411cfdb72	Disable Mips' delay slot filler when optimization level is O0. llvm-svn: 162589	2012-08-24 20:40:15 +00:00
Akira Hatanaka	8e8bb580a8	In MipsDAGToDAGISel::SelectAddr, fold add node into address operand, if its second operand is MipsISD::GPRel. llvm-svn: 162584	2012-08-24 20:21:49 +00:00
Roman Divacky	eab620e38c	Lower constant pools and jump tables via TOC on PPC64/SVR4. In collaboration with Adhemerval Zanella. llvm-svn: 162562	2012-08-24 16:26:02 +00:00
Jakob Stoklund Olesen	02cb24658a	Fix load/store SDNode flags. llvm-svn: 162558	2012-08-24 14:43:30 +00:00
Jakob Stoklund Olesen	4da790818a	Add missing SDNPSideEffect flags. llvm-svn: 162557	2012-08-24 14:43:27 +00:00
Jakob Stoklund Olesen	48bb81b28a	Remove more mayLoad workarounds. llvm-svn: 162556	2012-08-24 14:43:22 +00:00
Craig Topper	aa57ba3944	Custom lower FMA intrinsics to target specific nodes and remove the patterns. llvm-svn: 162534	2012-08-24 04:03:22 +00:00
Richard Smith	188ddbae92	Fix undefined behavior (negation of INT_MIN) in ARM backend. llvm-svn: 162520	2012-08-24 00:35:46 +00:00
Jakob Stoklund Olesen	3739d6ca99	Remove some spurious mayLoad = 0 flags. They were inserted to silence TableGen's warning about redundant properties. That warning is now gone. llvm-svn: 162517	2012-08-24 00:31:20 +00:00
Jakob Stoklund Olesen	e9fa31838d	Add missing SDNP properties on the flushw node. llvm-svn: 162515	2012-08-24 00:31:13 +00:00
Jakob Stoklund Olesen	2f512d8eba	X86MemBarrier has unmodeled side effects. llvm-svn: 162514	2012-08-24 00:31:10 +00:00
Jakob Stoklund Olesen	16126ffe0d	Preserve operand flags in convertToThreeAddress() by copying operands. No test case, this is a generalization of r160260. llvm-svn: 162485	2012-08-23 22:36:31 +00:00
Craig Topper	3d4254e5b4	Favor FMA3 over FMA4 if both are enabled. llvm-svn: 162454	2012-08-23 18:14:30 +00:00
Craig Topper	528004fc78	Use a switch statement instead of a bunch of if-else checks and pull out the common function call. llvm-svn: 162428	2012-08-23 04:57:36 +00:00
Craig Topper	68f6b47a37	Remove unused private field to silence build warning. llvm-svn: 162426	2012-08-23 04:45:31 +00:00
Akira Hatanaka	51dccb32d0	Make function loadImmediate a member of MipsSEInstrInfo and change it to return the temporary register that was used to load the immediate. Currently, it always returns register $at, but this will change if, in the future, we decide to use another register. No changes in functionality. llvm-svn: 162417	2012-08-23 00:21:05 +00:00
Akira Hatanaka	679d5c8fd7	Add a member of type Mips16InstrInfo/MipsSEInstrInfo to class Mips16RegisterInfo/MipsSERegisterInfo. No changes in functionality. llvm-svn: 162413	2012-08-22 23:58:53 +00:00
Chad Rosier	437076336a	[ms-inline asm] Avoid a false positive assertion Assertion failed: (Start.isValid() == End.isValid() && "Start and end should either both be valid or both be invalid!") when parsing inline asm. SMLoc assumes that the first char * in the source is invalid. However, when parsing an inline asm the mnemonic is at this location. I don't want to change SMLoc, so use a trivial workaround. llvm-svn: 162381	2012-08-22 19:14:29 +00:00
Benjamin Kramer	e09e72a083	Reduce duplicated hash map lookups. llvm-svn: 162362	2012-08-22 15:37:57 +00:00
Craig Topper	d66ff79b2c	Add a getName function to MachineFunction. Use it in places that previously did getFunction()->getName(). Remove includes of Function.h that are no longer needed. llvm-svn: 162347	2012-08-22 06:07:19 +00:00
Craig Topper	ba3d5bef9f	Don't cache the MBB in the class. It's only used by one function. Change a for loop over operands to use unsigned instead of int. llvm-svn: 162344	2012-08-22 05:59:59 +00:00
Craig Topper	37bdfa3177	Mark a function as static since it doesn't use anything in the class. llvm-svn: 162342	2012-08-22 05:36:44 +00:00
Akira Hatanaka	24b722f476	Add register Mips::GP to the list of reserved registers if target is bare-metal to prevent it from being clobbered. mips uses $gp to access small data section. This bug was originally reported by Carl Norum. llvm-svn: 162340	2012-08-22 03:18:13 +00:00
Akira Hatanaka	0602c4e928	Add option disable-mips-delay-filler. Turn on mips' delay slot filler by default. Patch by Carl Norum. llvm-svn: 162339	2012-08-22 02:51:28 +00:00
Jack Carter	1b099ac7c7	For mips64 switch statements in subroutines could generate within the codegen EK_GPRel64BlockAddress. This was not supported for direct object output and resulted in an assertion. This change adds support for EK_GPRel64BlockAddress for direct object. One fallout from this is to turn on rela relocations for mips64 to match gas. llvm-svn: 162334	2012-08-22 00:49:30 +00:00
Chad Rosier	3f65a99bf7	Add a few functions to TargetLibraryInfo as part of PR13574. Patch by Weiming Zhao <weimingz@codeaurora.org>. llvm-svn: 162329	2012-08-21 23:28:56 +00:00
Richard Smith	d1addbb679	Fix unaligned memory accesses when performing relocations in X86 JIT. There's no cost to using memcpy here: the fixed code is optimized by LLVM to perfect machine code. llvm-svn: 162311	2012-08-21 20:48:36 +00:00
Chad Rosier	92debd58d9	[ms-inline asm] Do not report a Parser error when matching inline assembly. llvm-svn: 162306	2012-08-21 19:36:59 +00:00
Chad Rosier	72a2747c53	[ms-inline asm] Expose the ErrorInfo from the MatchInstructionImpl. In general, this is the index of the operand that failed to match. Note: This may cause a buildbot failure due to an API mismatch in clang. Should recover with my next commit to clang. llvm-svn: 162295	2012-08-21 18:14:59 +00:00
Craig Topper	45eeb13dea	Fix up indentation and remove a couple else's after returns. llvm-svn: 162270	2012-08-21 08:29:51 +00:00
Craig Topper	aba5024223	Use uint16_t for tables of opcodes. llvm-svn: 162267	2012-08-21 08:23:21 +00:00
Craig Topper	9831045ed8	Fix up indentation. No functional change. llvm-svn: 162264	2012-08-21 08:17:07 +00:00
Craig Topper	63ef1d8341	Add a couple llvm_unreachables. Add a message to several others. llvm-svn: 162263	2012-08-21 08:16:16 +00:00
Craig Topper	3ba0ae7ec3	Replace a break with llvm_unreachable in the default case of a nested switch. Condense code a bit. No functional change. llvm-svn: 162261	2012-08-21 07:32:16 +00:00
Craig Topper	e432edabf1	Cleanup the scalar FMA3 definitions. Add patterns to fold loads with scalar forms. llvm-svn: 162260	2012-08-21 07:11:11 +00:00
Craig Topper	2e63b3ea18	Merge FMA3 instructions with and without patterns into single classes using null_frag. llvm-svn: 162257	2012-08-21 05:56:45 +00:00
Jakob Stoklund Olesen	4403f82dbf	Add a missing def flag. * Bad machine code: Explicit definition marked as use * - function: test_cos - basic block: BB#0 L.entry (0x7ff2a2024fd0) - instruction: VSETLNi32 %D11, %D11<undef>, %R0, 0, pred:14, pred:%noreg, %Q5<imp-use,kill>, %Q5<imp-def> - operand 0: %D11 llvm-svn: 162247	2012-08-21 00:34:53 +00:00
Jakob Stoklund Olesen	4d875f1e57	Use a SmallPtrSet to dedup successors in EmitSjLjDispatchBlock. The test case ARM/2011-05-04-MultipleLandingPadSuccs.ll was creating duplicate successor list entries. llvm-svn: 162222	2012-08-20 20:52:03 +00:00
Sebastian Pop	2f1237d5f4	fix HexagonSubtarget parsing of -mv flag llvm-svn: 162217	2012-08-20 19:56:47 +00:00
Michael Liao	3d421a0c4d	fix a case where all operands of BUILD_VECTOR are undefined llvm-svn: 162214	2012-08-20 17:59:18 +00:00
Akira Hatanaka	b64681df6d	Fix coding style violations in 162135 and 162136. Patch by Petar Jovanovic. llvm-svn: 162213	2012-08-20 17:53:24 +00:00
Craig Topper	77406bef3b	Remove FMA3 intrinsic instructions in favor of patterns. llvm-svn: 162194	2012-08-20 06:21:25 +00:00
Craig Topper	64c93f9d07	Use correct intrinsic for 256-bit VFMSUBADDPS. llvm-svn: 162193	2012-08-20 06:03:04 +00:00
Craig Topper	832951e7da	Remove trailing white space and tab characters. No functional change. llvm-svn: 162192	2012-08-19 23:37:46 +00:00
Nadav Rotem	589dc766e0	When unsafe math is used, we can use commutative FMAX and FMIN. In some cases this allows for better code generation. Added a new DAGCombine transformation to convert FMAX and FMIN to FMAXC and FMINC, which are commutative. For example: movaps %xmm0, %xmm1 movsd LC(%rip), %xmm0 minsd %xmm1, %xmm0 becomes: minsd LC(%rip), %xmm0 llvm-svn: 162187	2012-08-19 13:06:16 +00:00
Benjamin Kramer	dca12ad159	Fabs folding is implemented. llvm-svn: 162186	2012-08-19 09:51:44 +00:00
Jakob Stoklund Olesen	abf0a9ec82	Remove the CAND/COR/CXOR custom ISD nodes and their select code. These nodes are no longer needed because the peephole pass can fold CMOV+AND into ANDCC etc. llvm-svn: 162179	2012-08-18 21:49:50 +00:00
Craig Topper	4362ba5082	Remove virtual from many methods. These methods replace methods in the base class, but the base class methods aren't virtual so it just increased call overhead. llvm-svn: 162178	2012-08-18 21:38:45 +00:00
Jakob Stoklund Olesen	e78d4a5b08	Also combine zext/sext into selects for ARM. This turns common i1 patterns into predicated instructions: (add (zext cc), x) -> (select cc (add x, 1), x) (add (sext cc), x) -> (select cc (add x, -1), x) For a function like: unsigned f(unsigned s, int x) { return s + (x>0); } We now produce: cmp r1, #0 it gt addgt.w r0, r0, #1 Instead of: movs r2, #0 cmp r1, #0 it gt movgt r2, #1 add r0, r2 llvm-svn: 162177	2012-08-18 21:25:22 +00:00
Jakob Stoklund Olesen	ece4a53017	Also pass logical ops to combineSelectAndUse. Add these transformations to the existing add/sub ones: (and (select cc, -1, c), x) -> (select cc, x, (and, x, c)) (or (select cc, 0, c), x) -> (select cc, x, (or, x, c)) (xor (select cc, 0, c), x) -> (select cc, x, (xor, x, c)) The selects can then be transformed to a single predicated instruction by peephole. This transformation will make it possible to eliminate the ISD::CAND, COR, and CXOR custom DAG nodes. llvm-svn: 162176	2012-08-18 21:25:16 +00:00
Nadav Rotem	d01a7b5942	Reapply r162160 with a fix: Optimize Arith->Trunc->SETCC sequence to allow better compare/branch code. llvm-svn: 162172	2012-08-18 17:53:03 +00:00
Anton Korobeynikov	c0e610e681	fp16-to-fp32 conversion instructions are available in Thumb mode as well. Make sure the generic pattern is used. llvm-svn: 162170	2012-08-18 13:08:43 +00:00
Craig Topper	e341db552a	Refactor code a bit to reduce number of calls in the final compiled code. No functional change intended. llvm-svn: 162166	2012-08-18 06:39:34 +00:00
Craig Topper	d35582ae96	Reorder initialization list to silence -Wreorder llvm-svn: 162165	2012-08-18 06:20:54 +00:00
Nadav Rotem	e9cdefa762	Revert r162160 because it made a few buildbots fail. llvm-svn: 162164	2012-08-18 05:02:36 +00:00
Nadav Rotem	76f1b84f58	The X86 backend has a number of optimizations for SETCC nodes which use arithmetic instructions. However, when small data types are used, a truncate node appears between the SETCC node and the arithmetic operation. This patch adds support for this pattern. Before: xorl %esi, %edi testb %dil, %dil setne %al ret After: xorb %dil, %sil setne %al ret rdar://12081007 llvm-svn: 162160	2012-08-18 02:43:28 +00:00
Akira Hatanaka	ab6dca06f4	Add MipsELFWriterInfo.{h,cpp}. llvm-svn: 162136	2012-08-17 21:38:47 +00:00
Akira Hatanaka	a50e7bd0a6	Correct MCJIT functionality for MIPS32 architecture. No new tests are added. All tests in ExecutionEngine/MCJIT that have been failing pass after this patch is applied (when "make check" is done on a mips board). Patch by Petar Jovanovic. llvm-svn: 162135	2012-08-17 21:28:04 +00:00
Jakob Stoklund Olesen	40eb30013e	Avoid folding ADD instructions with FI operands. PEI can't handle the pseudo-instructions. This can be removed when the pseudo-instructions are replaced by normal predicated instructions. Fixes PR13628. llvm-svn: 162130	2012-08-17 20:55:34 +00:00
Akira Hatanaka	4e1b032521	Add stub methods for mips assembly matcher. Patch by Vladimir Medic. llvm-svn: 162124	2012-08-17 20:16:42 +00:00
Bill Wendling	0569e9a6f3	Change the `linker_private_weak_def_auto' linkage to `linkonce_odr_auto_hide' to make it more consistent with its intended semantics. The `linker_private_weak_def_auto' linkage type was meant to automatically hide globals which never had their addresses taken. It has nothing to do with the `linker_private' linkage type, which outputs the symbols with a `l' (ell) prefix among other things. The intended semantic is more like the `linkonce_odr' linkage type. Change the name of the linkage type to `linkonce_odr_auto_hide'. And therefore changing the semantics so that it produces the correct output for the linker. Note: The old linkage name `linker_private_weak_def_auto' will still parse but is not a synonym for `linkonce_odr_auto_hide'. This should be removed in 4.0. <rdar://problem/11754934> llvm-svn: 162114	2012-08-17 18:33:14 +00:00
Jakob Stoklund Olesen	36d81e300e	Add comment, clean up code. No functional change. llvm-svn: 162107	2012-08-17 16:59:09 +00:00
Tim Northover	1de091468c	Implement NEON domain switching for scalar <-> S-register vmovs on ARM llvm-svn: 162094	2012-08-17 11:32:52 +00:00
Craig Topper	efc1bf9ee1	Use nested switch to select arguments to reduce calls to EmitPCMP. llvm-svn: 162089	2012-08-17 07:15:56 +00:00
Craig Topper	8fa010b216	Make ReplaceATOMIC_BINARY_64 a static function. Use a nested switch to reduce to only a single call to it thus allowing it to be inlined by the compiler. llvm-svn: 162088	2012-08-17 06:55:11 +00:00
Craig Topper	117916e06d	Remove unnecessary include of ARMGenInstrInfo.inc. llvm-svn: 162086	2012-08-17 06:21:09 +00:00
Jakob Stoklund Olesen	88217b055d	Add ADD and SUB to the predicable ARM instructions. It is not my plan to duplicate the entire ARM instruction set with predicated versions. We need a way of representing predicated instructions in SSA form without requiring a separate opcode. Then the pseudo-instructions can go away. llvm-svn: 162061	2012-08-16 23:21:55 +00:00
Jakob Stoklund Olesen	aca66722c2	Handle ARM MOVCC optimization in PeepholeOptimizer. Use the target independent select analysis hooks. llvm-svn: 162060	2012-08-16 23:14:20 +00:00
Roman Divacky	b95259c849	Revert r162034, r162035 and r162037. llvm-svn: 162039	2012-08-16 19:07:59 +00:00
Roman Divacky	831ddb548a	Define and handle additional fixup kinds. By Adhemerval Zanella. llvm-svn: 162037	2012-08-16 18:37:52 +00:00
Roman Divacky	3a41549e6a	Fix typo and grammar. By Adhemerval Zanella. llvm-svn: 162032	2012-08-16 18:19:29 +00:00
Jush Lu	767c82d4e0	[arm-fast-isel] Add support for fastcc. Without fastcc support, the caller just falls through to CallingConv::C for fastcc, but callee still uses fastcc, this inconsistency of calling convention is a problem, and fastcc support can fix it. llvm-svn: 162013	2012-08-16 05:15:53 +00:00
Anitha Boyapati	161fc750a1	Patch to enable FMA on bdver2 target. Make XOP feature enable FMA4 as well. llvm-svn: 162012	2012-08-16 04:04:02 +00:00
Anitha Boyapati	5443ee0d76	(no commit message) llvm-svn: 162010	2012-08-16 03:50:04 +00:00
Akira Hatanaka	623a561154	Add Android ABI to Mips backend to handle functions returning vectors of four floats. llvm-svn: 162008	2012-08-16 03:48:05 +00:00
Jakob Stoklund Olesen	55aee8b58a	Fold predicable instructions into MOVCC / t2MOVCC. The ARM select instructions are just predicated moves. If the select is the only use of an operand, the instruction defining the operand can be predicated instead, saving one instruction and decreasing register pressure. This implementation can turn AND/ORR/EOR instructions into their corresponding ANDCC/ORRCC/EORCC variants. Ideally, we should be able to predicate any instruction, but we don't yet support predicated instructions in SSA form. llvm-svn: 161994	2012-08-15 22:16:39 +00:00
Evan Cheng	625c0ca5ee	Use vld1/vst1 to load/store f64 if alignment is < 4 and the target allows unaligned access. rdar://12091029 llvm-svn: 161962	2012-08-15 17:44:53 +00:00
Jakob Stoklund Olesen	6639cea68f	Add missing Rfalse operand to the predicated pseudo-instructions. When predicating this instruction: Rd = ADD Rn, Rm We need an extra operand to represent the value given to Rd when the predicate is false: Rd = ADDCC Rfalse, Rn, Rm, pred The Rd and Rfalse operands are different registers while in SSA form. Rfalse is tied to Rd to make sure they get the same register during register allocation. Previously, Rd and Rn were tied, but that is not required. Compare to MOVCC: Rd = MOVCC Rfalse, Rtrue, pred llvm-svn: 161955	2012-08-15 16:17:24 +00:00
Anton Korobeynikov	d13403fbd1	The names of VFP variants of half-to-float conversion instructions were reversed. This leads to wrong codegen for float-to-half conversion intrinsics which are used to support storage-only fp16 type. NEON variants of same instructions are fine. llvm-svn: 161907	2012-08-14 23:36:01 +00:00
Eric Christopher	47fee59c73	This needs braces. Spotted by Bill. llvm-svn: 161906	2012-08-14 23:32:15 +00:00
Michael Liao	f763f96863	minor fix of X86ISD::VSEXT_MOVL dump llvm-svn: 161902	2012-08-14 22:53:17 +00:00
Michael Liao	daebe04c2f	fix PR11334 - FP_EXTEND only supports extending from vectors with matching elements. This results in the scalarization of extending to v2f64 from v2f32, which will be legalized to v4f32 not matching with v2f64. - add X86-specific VFPEXT supporting extending from v4f32 to v2f64. - add BUILD_VECTOR lowering helper to recover back the original extending from v4f32 to v2f64. - test case is enhanced to include different vector width. llvm-svn: 161894	2012-08-14 21:24:47 +00:00
Jim Grosbach	53796945f5	Switch the fixed-length disassembler to be table-driven. Refactor the TableGen'erated fixed length disassembler to use a table-driven state machine rather than a massive set of nested switch() statements. As a result, the ARM Disassembler (ARMDisassembler.cpp) builds much more quickly and generates a smaller end result. For a Release+Asserts build on a 16GB 3.4GHz i7 iMac w/ SSD: Time to compile at -O2 (averaged w/ hot caches): Previous: 35.5s New: 8.9s TEXT size: Previous: 447,251 New: 297,661 Builds in 25% of the time previously required and generates code 66% of the size. Execution time of the disassembler is only slightly slower (7% disassembling 10 million ARM instructions, 19.6s vs 21.0s). The new implementation has not yet been tuned, however, so the performance should almost certainly be recoverable should it become a concern. llvm-svn: 161888	2012-08-14 19:06:05 +00:00
Craig Topper	e7ac4d1df1	Factor duplicate calls to getUNDEF in several functions. llvm-svn: 161860	2012-08-14 08:18:43 +00:00
Craig Topper	a3795f6791	Re-factor intrinsic lowering to combine common parts of similar intrinsics. Reduces compiled code size a little bit. llvm-svn: 161859	2012-08-14 07:43:25 +00:00
Jakob Stoklund Olesen	33e364a3df	Remove the TII::scheduleTwoAddrSource() hook. It never does anything when running 'make check', and it gets in the way of updating live intervals in 2-addr. The hook was originally added to help form IT blocks in Thumb2 code before register allocation, but the pass ordering has changed since then, and we run if-conversion after register allocation now. When the MI scheduler is enabled, there will be no less than two schedulers between 2-addr and Thumb2ITBlockPass, so this hook is unlikely to help anything. llvm-svn: 161794	2012-08-13 21:52:57 +00:00
Manman Ren	159ae3b3bc	ARM: enable struct byval for AAPCS-VFP. This change is to be enabled in clang. rdar://9877866 llvm-svn: 161789	2012-08-13 21:22:50 +00:00
Arnold Schwaighofer	dbdb2581b8	[Hexagon] Don't mark callee saved registers as clobbered by a tail call This was causing unnecessary spills/restores of callee saved registers. Fixes PR13572. Patch by Pranav Bhandarkar! llvm-svn: 161778	2012-08-13 19:54:01 +00:00
Nadav Rotem	03c4d5f036	Do not optimize (or (and X,Y), Z) into BFI and other sequences if the AND ISDNode has more than one user. rdar://11876519 llvm-svn: 161775	2012-08-13 18:52:44 +00:00
Manman Ren	cb05c49c64	X86: move Int_CVTSD2SSrr, Int_CVTSI2SSrr, Int_CVTSI2SDrr, Int_CVTSS2SDrr from OpTbl1 to OpTbl2 since they have 3 operands and the last operand can be changed to a memory operand. PR13576 llvm-svn: 161769	2012-08-13 18:29:41 +00:00
Eric Christopher	3aea549423	Add support for the %H output modifier. Patch by Weiming Zhao. llvm-svn: 161768	2012-08-13 18:18:52 +00:00
Manman Ren	c9f5387a5c	X86: when auto-detecting the subtarget features, make sure use IsIntel to detect Nehalem, Westmere and Sandy Bridge. AMD also has processor family 6. llvm-svn: 161763	2012-08-13 17:26:46 +00:00
Tim Northover	b1f8be6cbe	Use correct loads for vector types during extending-load operations. Previously, we used VLD1.32 in all cases, however there are both 16 and 64-bit accesses being selected, so we need to use an appropriate width load in those cases. llvm-svn: 161748	2012-08-13 09:06:31 +00:00
Craig Topper	4fc08044be	Tidy up VSETCC lowering code a bit more by adding an llvm_unreachable and putting a couple of if conditions in a better order. llvm-svn: 161746	2012-08-13 03:42:38 +00:00
Craig Topper	a438ea46bf	Refactor code a bit to share commonalities. No functional change intended. llvm-svn: 161745	2012-08-13 02:34:03 +00:00
Craig Topper	bb92d94049	Fix an unused variable warning from r161742. llvm-svn: 161743	2012-08-13 01:26:45 +00:00
Craig Topper	1032fcf6da	Remove the LowerMMXCONCAT_VECTORS function. It could never execute because there are no legal 64-bit vector types that could be used as inputs to a 128-bit concat_vectors. Remove a target specific SDNode and its patterns that become unused as a result. llvm-svn: 161742	2012-08-13 01:23:55 +00:00
Craig Topper	5a5ed2d691	Remove call to setOperationAction for SETCC of v4f32. SETCC returns an integer type not an FP type. llvm-svn: 161738	2012-08-12 05:31:32 +00:00
Craig Topper	1292e1f43c	Remove unnecessary call to setOperationAction for SETCC of v2i64 under SSE42. It was already called for the same under SSE2. llvm-svn: 161737	2012-08-12 05:15:16 +00:00
Arnold Schwaighofer	c751a25aed	Revert 161581: Patch to implement UMLAL/SMLAL instructions for the ARM architecture It broke MultiSource/Applications/JM/ldecod/ldecod on armv7 thumb O0 g and armv7 thumb O3. llvm-svn: 161736	2012-08-12 05:11:56 +00:00
Craig Topper	4d9cbceefd	Change addTypeForNeon to use MVT instead of EVT so all the calls to getSimpleVT can be removed. llvm-svn: 161735	2012-08-12 03:16:37 +00:00
Craig Topper	709114d67f	Replace many calls to getSizeInBits() with is128BitVector/is256BitVector llvm-svn: 161734	2012-08-12 02:23:29 +00:00
Craig Topper	a52fcd0a14	Use MVT.isXBitVector instead of EVT.isXBitVector when setting up operation actions. Compiles to smaller code. llvm-svn: 161733	2012-08-12 00:34:56 +00:00
Michael Liao	4b95cb463a	fix PR13577, an issue introduced by r161687 - FCMOV only supports a subset of X86 conditions. Skip boolean simplification if X86 condition is not valid for FCMOV. - add a minimal test case for PR13577. llvm-svn: 161732	2012-08-11 23:47:06 +00:00
Craig Topper	93e2521659	Move setOperationAction for CONCAT_VECTORS for 256-bit vectors into loop since all 256-bit types are supported. llvm-svn: 161730	2012-08-11 22:34:26 +00:00
Craig Topper	b7f7fa86ec	Tidy up indentation. No functional change. llvm-svn: 161727	2012-08-11 17:53:00 +00:00
Craig Topper	ba0c3ebe9e	Fix a cast that was casting away 'const' unnecessarily llvm-svn: 161726	2012-08-11 17:46:16 +00:00
Craig Topper	3929432178	Add a couple default: llvm_unreachable() to some switch statements. Fix a bad message in an existing llvm_unreachable. llvm-svn: 161725	2012-08-11 17:44:14 +00:00
Manman Ren	9bd686f936	X86: when we are auto-detecting the subtarget features, make sure we turn on FeatureFastUAMem for Nehalem, Westmere and Sandy Bridge. FeatureFastUAMem is already on if we pass in nehalem or westmere as a command argument. rdar: 7252306 llvm-svn: 161717	2012-08-10 23:43:32 +00:00
Manman Ren	500d45c3d9	ARM: enable struct byval for AAPCS. This change is to be enabled in clang. rdar://9877866 PR://13350 llvm-svn: 161693	2012-08-10 20:39:38 +00:00
Michael Liao	97334a5c5f	add X86-specific DAG optimization to simplify boolean test - if a boolean test (X86ISD::CMP or X86ISD::SUB) checks a boolean value generated from X86ISD::SETCC, try to simplify the boolean value generation and checking by reusing the original EFLAGS with proper condition code - add hooks to X86 specific SETCC/BRCOND/CMOV, the major 3 places consuming EFLAGS part of patches fixing PR12312 llvm-svn: 161687	2012-08-10 19:58:13 +00:00
Michael Liao	81be965deb	remove trailing whitespace and test commit llvm-svn: 161664	2012-08-10 14:39:24 +00:00
Joerg Sonnenberger	f07e1e10a6	Add some missing includes for the build against stdcxx. llvm-svn: 161657	2012-08-10 10:53:56 +00:00
Eric Christopher	77ae8ee419	Remove getARMRegisterNumbering and replace with calls into the register info for getEncodingValue. This builds on the small patch of yesterday to set HWEncoding in the register file. One (deprecated) use was turned into a hard number to avoid needing register info in the old JIT. llvm-svn: 161628	2012-08-09 22:10:21 +00:00
Jakob Stoklund Olesen	c8bcc2518d	Don't modify MO while use_iterator is still pointing to it. llvm-svn: 161626	2012-08-09 22:08:24 +00:00
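
Note on the select rewrites listed in r162177 and r162176 above: the five DAG identities quoted in those commit messages can be checked exhaustively on a small bit width. The sketch below is not LLVM code; it models the nodes with plain Python integers, and the helper names (select, zext, sext, check_identities) and the 4-bit default width are assumptions made purely for illustration.

```python
from itertools import product


def select(cc, if_true, if_false):
    """Model of a select node: pick one of two values based on a boolean."""
    return if_true if cc else if_false


def check_identities(bits=4):
    """Exhaustively check the rewrites quoted in r162177 and r162176.

    Values are modelled as unsigned integers of the given width, so -1 is
    the all-ones mask. Returns the number of (cc, x, c) cases checked.
    """
    mask = (1 << bits) - 1

    def zext(cc):
        # Zero-extended i1: 0 or 1.
        return 1 if cc else 0

    def sext(cc):
        # Sign-extended i1: 0 or all ones (-1).
        return mask if cc else 0

    checked = 0
    for cc, x, c in product((False, True), range(mask + 1), range(mask + 1)):
        # (add (zext cc), x) -> (select cc (add x, 1), x)
        assert (zext(cc) + x) & mask == select(cc, (x + 1) & mask, x)
        # (add (sext cc), x) -> (select cc (add x, -1), x)
        assert (sext(cc) + x) & mask == select(cc, (x - 1) & mask, x)
        # (and (select cc, -1, c), x) -> (select cc, x, (and, x, c))
        assert select(cc, mask, c) & x == select(cc, x, x & c)
        # (or (select cc, 0, c), x) -> (select cc, x, (or, x, c))
        assert select(cc, 0, c) | x == select(cc, x, x | c)
        # (xor (select cc, 0, c), x) -> (select cc, x, (xor, x, c))
        assert select(cc, 0, c) ^ x == select(cc, x, x ^ c)
        checked += 1
    return checked


if __name__ == "__main__":
    print("cases checked:", check_identities())
```

Each identity moves the arithmetic or logical op inside one arm of the select, which is what lets a later peephole fold it into a single predicated instruction, as the commit messages describe.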


22190 Commits