llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Lang Hames	7d83af4ed0	Fix the order of the operands in the llvm.fma intrinsic patterns for ARM, <rdar://problem/11325085>. llvm-svn: 155724	2012-04-27 18:51:24 +00:00
Richard Barton	f9237b25e6	Fix ARM assembly parsing for upper case condition codes on IT instructions. llvm-svn: 155720	2012-04-27 17:34:01 +00:00
Benjamin Kramer	1380494168	X86: Don't emit conditional floating point moves on when targeting pre-pentiumpro architectures. * Model FPSW (the FPU status word) as a register. * Add ISel patterns for the FUCOM, FNSTSW and SAHF instructions. During Legalize/Lowering, build a node sequence to transfer the comparison result from FPSW into EFLAGS. If you're wondering about the right-shift: That's an implicit sub-register extraction (%ax -> %ah) which is handled later on by the instruction selector. Fixes PR6679. Patch by Christoph Erhardt! llvm-svn: 155704	2012-04-27 12:07:43 +00:00
Richard Barton	ca70156ab3	Refactor IT handling not to store the bottom bit of the condition code in the mask operand in the MCInst. llvm-svn: 155700	2012-04-27 08:42:59 +00:00
Evan Cheng	f35523d08a	Implement a bastardized ABI. llvm-svn: 155686	2012-04-27 02:11:10 +00:00
Evan Cheng	594fb11f12	- thumbv6 shouldn't imply +thumb2. Cortex-M0 doesn't suppport 32-bit Thumb2 instructions. - However, it does support dmb, dsb, isb, mrs, and msr. rdar://11331541 llvm-svn: 155685	2012-04-27 01:27:19 +00:00
Jim Grosbach	bf9adf2ab5	ARM: Thumb ldr(literal) base address alignment is 32-bits. The base address for the PC-relative load is Align(PC,4), so it's the address of the word containing the 16-bit instruction, not the address of the instruction itself. Ugh. rdar://11314619 llvm-svn: 155659	2012-04-26 20:48:12 +00:00
Preston Gurd	fb1760744d	Trivial change to set UseLeaForSP flag in addition to toggling the FeatureLeaForSP feature bit when llvm auto detects Intel Atom. Patch by Andy Zhang llvm-svn: 155655	2012-04-26 19:52:27 +00:00
Tim Northover	876c151146	Use VLD1 in NEON extenting-load patterns instead of VLDR. On some cores it's a bad idea for performance to mix VFP and NEON instructions and since these patterns are NEON anyway, the NEON load should be used. llvm-svn: 155630	2012-04-26 08:46:29 +00:00
Tim Northover	b83dc53c3a	Test commit. llvm-svn: 155626	2012-04-26 08:24:07 +00:00
Craig Topper	f883096ff7	Enable detection of AVX and AVX2 support through CPUID. Add AVX/AVX2 to corei7-avx, core-avx-i, and core-avx2 cpu names. llvm-svn: 155618	2012-04-26 06:40:15 +00:00
Evan Cheng	4d570a3f0e	If triple is armv7 / thumbv7 and a CPU is specified, do not automatically assume the feature set of v7a. This comes about if the user specifies something like -arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as uxtab in this case. rdar://11318438 llvm-svn: 155601	2012-04-26 01:13:36 +00:00
Richard Barton	e9a972bbe3	Unify internal representation of ARM instructions with a register right-shifted by #32 . These are stored as shifts by #0 in the MCInst and correctly marshalled when transforming from or to assembly representation. llvm-svn: 155565	2012-04-25 18:00:18 +00:00
Craig Topper	5828c654b9	Add ifdef around getSubtargetFeatureName in tablegen output file so that only targets that want the function get it. This prevents other targets from getting an unused function warning. llvm-svn: 155538	2012-04-25 06:56:34 +00:00
Craig Topper	1a016fd95d	Use vector_shuffles instead of target specific unpack nodes for AVX ZERO_EXTEND/ANY_EXTEND combine. These will be converted to target specific nodes during lowering. This is more consistent with other code. llvm-svn: 155537	2012-04-25 06:39:39 +00:00
Akira Hatanaka	b3ecf903f1	Do not use $gp as a dedicated global register if the target ABI is not O32. llvm-svn: 155522	2012-04-25 01:24:52 +00:00
Jim Grosbach	7ac2ac85a8	ARM: improved assembler diagnostics for missing CPU features. When an instruction match is found, but the subtarget features it requires are not available (missing floating point unit, or thumb vs arm mode, for example), issue a diagnostic that identifies what the feature mismatch is. rdar://11257547 llvm-svn: 155499	2012-04-24 22:40:08 +00:00
Jim Grosbach	69d9654f7c	ARM: Nuke remnant bogus code. r154362 was supposed to delete this bit, but obviously didn't. rdar://11305594 llvm-svn: 155465	2012-04-24 18:39:47 +00:00
Nadav Rotem	021d75713c	AVX: Add additional vbroadcast replacement sequences for integers. Remove the v2f64 patterns because it does not match any vbroadcast instruction. llvm-svn: 155461	2012-04-24 18:09:59 +00:00
Nadav Rotem	f2756d7e7f	AVX2: The BLENDPW instruction selects between vectors of v16i16 using an i8 immediate. We can't use it here because the shuffle code does not check that the lower part of the word is identical to the upper part. llvm-svn: 155440	2012-04-24 11:27:53 +00:00
Richard Barton	543457a8c8	Refactor Thumb ITState handling in ARM Disassembler to more efficiently use its vector llvm-svn: 155439	2012-04-24 11:13:20 +00:00
Nadav Rotem	d060c25823	AVX: We lower VECTOR_SHUFFLE and BUILD_VECTOR nodes into vbroadcast instructions using the pattern (vbroadcast (i32load src)). In some cases, after we generate this pattern new users are added to the load node, which prevent the selection of the blend pattern. This commit provides fallback patterns which perform in-vector broadcast (using in-vector vbroadcast in AVX2 and pshufd on AVX1). llvm-svn: 155437	2012-04-24 11:07:03 +00:00
Craig Topper	dae7196823	Remove dangling spaces. Fix some other formatting. llvm-svn: 155429	2012-04-24 06:36:35 +00:00
Craig Topper	61065e271e	Simplify code a bit and make it compile better. Remove unused parameters. llvm-svn: 155428	2012-04-24 06:02:29 +00:00
Jim Grosbach	66edf44403	Tidy up. 80 columns, whitespace, et. al. llvm-svn: 155399	2012-04-23 22:04:10 +00:00
Nadav Rotem	c60ef21760	Optimize the vector UINT_TO_FP, SINT_TO_FP and FP_TO_SINT operations where the integer type is i8 (commonly used in graphics). llvm-svn: 155397	2012-04-23 21:53:37 +00:00
Preston Gurd	0a730de3c3	This patch fixes a problem which arose when using the Post-RA scheduler on X86 Atom. Some of our tests failed because the tail merging part of the BranchFolding pass was creating new basic blocks which did not contain live-in information. When the anti-dependency code in the Post-RA scheduler ran, it would sometimes rename the register containing the function return value because the fact that the return value was live-in to the subsequent block had been lost. To fix this, it is necessary to run the RegisterScavenging code in the BranchFolding pass. This patch makes sure that the register scavenging code is invoked in the X86 subtarget only when post-RA scheduling is being done. Post RA scheduling in the X86 subtarget is only done for Atom. This patch adds a new function to the TargetRegisterClass to control whether or not live-ins should be preserved during branch folding. This is necessary in order for the anti-dependency optimizations done during the PostRASchedulerList pass to work properly when doing Post-RA scheduling for the X86 in general and for the Intel Atom in particular. The patch adds and invokes the new function trackLivenessAfterRegAlloc() instead of using the existing requiresRegisterScavenging(). It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of requiresRegisterScavenging(). It changes the all the targets that implemented requiresRegisterScavenging() to also implement trackLivenessAfterRegAlloc(). It adds an assertion in the Post RA scheduler to make sure that post RA liveness information is available when it is needed. It changes the X86 break-anti-dependencies test to use –mcpu=atom, in order to avoid running into the added assertion. Finally, this patch restores the use of anti-dependency checking (which was turned off temporarily for the 3.1 release) for Intel Atom in the Post RA scheduler. Patch by Andy Zhang! Thanks to Jakob and Anton for their reviews. llvm-svn: 155395	2012-04-23 21:39:35 +00:00
Jim Grosbach	4221412829	ARM: VSLI two-operand assmebly aliases are tblgen'erated. llvm-svn: 155393	2012-04-23 21:22:04 +00:00
Jim Grosbach	8aac7f6a7c	ARM: tblgen'erate VSRA/VRSRA/VSRI assembly two-operand aliases. llvm-svn: 155392	2012-04-23 21:00:49 +00:00
Jim Grosbach	d377bc4e77	ARM: vqdmulh two-operand aliases are tblgen'erated now. llvm-svn: 155387	2012-04-23 20:37:20 +00:00
Chandler Carruth	9460759e4f	Revert r155365, r155366, and r155367. All three of these have regression test suite failures. The failures occur at each stage, and only get worse, so I'm reverting all of them. Please resubmit these patches, one at a time, after verifying that the regression test suite passes. Never submit a patch without running the regression test suite. llvm-svn: 155372	2012-04-23 18:25:57 +00:00
Sirish Pande	9f4844f7da	Hexagon V5 (floating point) support. llvm-svn: 155367	2012-04-23 17:49:40 +00:00
Sirish Pande	4bcbe40295	Support for Hexagon architectural feature, new value jump. llvm-svn: 155366	2012-04-23 17:49:28 +00:00
Sirish Pande	2230f1957e	Support for Hexagon VLIW Packetizer. llvm-svn: 155365	2012-04-23 17:49:20 +00:00
Craig Topper	95fa5a8765	Use MVT instead of EVT through all of LowerVECTOR_SHUFFLEtoBlend and not just the switch. Saves a little bit of binary size. llvm-svn: 155339	2012-04-23 07:36:33 +00:00
Craig Topper	4e6deec5d8	Make getZeroVector and getOnesVector more alike as far as how they detect 128-bit versus 256-bit vectors. Be explicit about both sizes and use llvm_unreachable. Similar changes to getLegalSplat. llvm-svn: 155337	2012-04-23 07:24:41 +00:00
Craig Topper	f9811e8f28	Tidy up by removing some 'else' after 'return' llvm-svn: 155336	2012-04-23 06:57:04 +00:00
Craig Topper	c315e7b6db	Tidy up spacing in LowerVECTOR_SHUFFLEtoBlend. Remove code that checks if shuffle operand has a different type than the the shuffle result since it can never happen. llvm-svn: 155333	2012-04-23 06:38:28 +00:00
Craig Topper	f27c3223f7	Add a couple llvm_unreachables. llvm-svn: 155332	2012-04-23 03:42:40 +00:00
Craig Topper	6c6ee67efe	Remove some tab characers. llvm-svn: 155331	2012-04-23 03:28:34 +00:00
Craig Topper	16829bb004	Remove some 'else' after 'return'. No functional change. llvm-svn: 155330	2012-04-23 03:26:18 +00:00
Craig Topper	2dedfa7805	Make Extract128BitVector and Insert128BitVector take an unsigned instead of an ConstantNode SDValue. getConstant was almost always called just before only to have the functions take it apart and build a new ConstantSDNode. llvm-svn: 155325	2012-04-22 20:55:18 +00:00
Craig Topper	5669044c57	Convert getNode(UNDEF) to getUNDEF. llvm-svn: 155321	2012-04-22 19:29:34 +00:00
Craig Topper	a9994377f2	Make calls to getVectorShuffle more consistent. Use shuffle VT for calls to getUNDEF instead of requerying. Use &Mask[0] instead of Mask.data(). llvm-svn: 155320	2012-04-22 19:17:57 +00:00
Craig Topper	5c4c8b1f81	Tidy up. 80 columns and argument alignment. llvm-svn: 155319	2012-04-22 18:51:37 +00:00
Craig Topper	58aeb7b7c3	Simplify code by converting multiple places that were manually concatenating 128-bit vectors to use either CONCAT_VECTORS or a helper function. CONCAT_VECTORS will itself be lowered to the same pattern as before. The helper function is needed for concats of BUILD_VECTORs since getNode(CONCAT_VECTORS) will just return a large BUILD_VECTOR and we may be trying to lower large BUILD_VECTORS when this occurs. llvm-svn: 155318	2012-04-22 18:15:59 +00:00
Benjamin Kramer	76a9040c03	ARM: Initialize the HasRAS bit. Found by valgrind. llvm-svn: 155313	2012-04-22 11:52:41 +00:00
Elena Demikhovsky	35721fc4f8	ZERO_EXTEND/SIGN_EXTEND/TRUNCATE optimization for AVX2 llvm-svn: 155309	2012-04-22 09:39:03 +00:00
Bill Wendling	86e03eac0d	Remove some potential warnings about variables used uninitialized. llvm-svn: 155307	2012-04-22 07:23:04 +00:00
Craig Topper	96407e19f5	Make some fixed arrays const. Use array_lengthof in a couple places instead of a hardcoded number. llvm-svn: 155294	2012-04-21 18:58:38 +00:00

1 2 3 4 5 ...

21179 Commits