llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Michael Ilseman	5db40ba98e	Added a slew of SimplifyInstruction floating-point optimizations, many of which take advantage of fast-math flags. Test cases included. fsub X, +0 ==> X fsub X, -0 ==> X, when we know X is not -0 fsub +/-0.0, (fsub -0.0, X) ==> X fsub nsz +/-0.0, (fsub +/-0.0, X) ==> X fsub nnan ninf X, X ==> 0.0 fadd nsz X, 0 ==> X fadd [nnan ninf] X, (fsub [nnan ninf] 0, X) ==> 0 where nnan and ninf have to occur at least once somewhere in this expression fmul X, 1.0 ==> X llvm-svn: 169940	2012-12-12 00:27:46 +00:00
Michael Ilseman	515ea526f3	Pattern matchers for floating point values m_ConstantFP - match and bind a float constant m_SpecificConstantFP - match a specific floating point value or vector of floats of that value m_FPOne - match a floating point 1.0 or vector of 1.0s m_NegZero - match -0.0 m_AnyZero - match 0 or -0.0 llvm-svn: 169939	2012-12-12 00:23:43 +00:00
Michael Ilseman	42042d4bd8	Remove FIXMEs surrounding Constant[Data]Vectors, instead llvm-svn: 169938	2012-12-12 00:21:43 +00:00
Jim Grosbach	529ac76fa2	Trim unneeded header #include. llvm-svn: 169933	2012-12-11 23:39:51 +00:00
Dmitri Gribenko	d5a3fbf03a	Documentation: cleanup: remove useless anchors and write :ref:s explicitly. llvm-svn: 169932	2012-12-11 23:35:23 +00:00
Jim Grosbach	5c24898d2a	ARM: Remove old testing option. Pre-regalloc frame allocation and referencing has been on by default for ages. No need for the testing option that disables it. llvm-svn: 169931	2012-12-11 23:31:12 +00:00
Jim Grosbach	969f11d9cd	ARM: Remove old testing options. Base pointer referencing has been enabled for ages. llvm-svn: 169930	2012-12-11 23:31:10 +00:00
Evan Cheng	b9b90d7aed	Replace TargetLowering::isIntImmLegal() with ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined term for something like integer immediate materialization. It is always possible to materialize an integer immediate. Whether to use it for memcpy expansion is more a "cost" conceern. llvm-svn: 169929	2012-12-11 23:26:14 +00:00
Dmitri Gribenko	8321a7c2f4	Documentation: Lexicon.rst: add 'SLP' acronym llvm-svn: 169928	2012-12-11 23:13:23 +00:00
Nadav Rotem	054379720d	PR14574. Fix a bug in the code that calculates the mask the converted PHIs in if-conversion. llvm-svn: 169916	2012-12-11 21:30:14 +00:00
Tom Stellard	6f17e7033b	Add R600 backend A new backend supporting AMD GPUs: Radeon HD2XXX - HD7XXX llvm-svn: 169915	2012-12-11 21:25:42 +00:00
Bill Schmidt	45b56f7632	This patch implements the general dynamic TLS model for 64-bit PowerPC. Given a thread-local symbol x with global-dynamic access, the generated code to obtain x's address is: Instruction Relocation Symbol addis ra,r2,x@got@tlsgd@ha R_PPC64_GOT_TLSGD16_HA x addi r3,ra,x@got@tlsgd@l R_PPC64_GOT_TLSGD16_L x bl __tls_get_addr(x@tlsgd) R_PPC64_TLSGD x R_PPC64_REL24 __tls_get_addr nop <use address in r3> The implementation borrows from the medium code model work for introducing special forms of ADDIS and ADDI into the DAG representation. This is made slightly more complicated by having to introduce a call to the external function __tls_get_addr. Using the full call machinery is overkill and, more importantly, makes it difficult to add a special relocation. So I've introduced another opcode GET_TLS_ADDR to represent the function call, and surrounded it with register copies to set up the parameter and return value. Most of the code is pretty straightforward. I ran into one peculiarity when I introduced a new PPC opcode BL8_NOP_ELF_TLSGD, which is just like BL8_NOP_ELF except that it takes another parameter to represent the symbol ("x" above) that requires a relocation on the call. Something in the TblGen machinery causes BL8_NOP_ELF and BL8_NOP_ELF_TLSGD to be treated identically during the emit phase, so this second operand was never visited to generate relocations. This is the reason for the slightly messy workaround in PPCMCCodeEmitter.cpp:getDirectBrEncoding(). Two new tests are included to demonstrate correct external assembly and correct generation of relocations using the integrated assembler. Comments welcome! Thanks, Bill llvm-svn: 169910	2012-12-11 20:30:11 +00:00
Eric Christopher	1a93fc9d67	Update some comments. llvm-svn: 169907	2012-12-11 19:42:09 +00:00
Nadav Rotem	fb45c4d6b4	Loop Vectorize: optimize the vectorization of trunc(induction_var). The truncation is now done on scalars. llvm-svn: 169904	2012-12-11 18:58:10 +00:00
Eli Bendersky	97c09cbfb6	Remove the RelaxAll overrule in MCAssembler::fixupNeedsRelaxation, because that method is only getting called for MCInstFragment. These fragments aren't even generated when RelaxAll is set, which is why the flag reference here is superfluous. Removing it simplifies the code with no harmful effects. An assertion is added higher up to make sure this path is never reached. llvm-svn: 169886	2012-12-11 17:16:00 +00:00
Rafael Espindola	03394e4dc6	Use an ArrayRef instead of a std::vector&. llvm-svn: 169881	2012-12-11 16:36:02 +00:00
Joel Jones	0038ea3653	Add comment for load folding llvm-svn: 169880	2012-12-11 16:10:25 +00:00
Dmitri Gribenko	b9ee3b0c67	Documentation: convert Passes.html to reST. Since now we have an autogenerated TOC, a manually written table of all passes was removed. Patch by Anthony Mykhailenko with small fixes by me. llvm-svn: 169867	2012-12-11 15:29:37 +00:00
NAKAMURA Takumi	231a6d09fc	llvm/test/TableGen: Remove XFAIL:vg_leak in dozen of tests, according to llvm-x86_64-linux-vg_leak. llvm-svn: 169862	2012-12-11 13:14:16 +00:00
Evgeniy Stepanov	fee15f88f9	[msan] Use explicitely aligned stores and loads with function argument shadow. Use explicitely aligned store and load instructions to deal with argument and retval shadow. This matters when an argument's alignment is higher than __msan_param_tls alignment (which is the case with __m128i). llvm-svn: 169859	2012-12-11 12:34:09 +00:00
Patrik Hagglund	caaedc6ade	Revert EVT->MVT changes, r169836-169851, due to buildbot failures. llvm-svn: 169854	2012-12-11 11:14:33 +00:00
Chandler Carruth	a503c881e5	Holding my nose and moving the accumulation routine to GEPOperator instead of the instruction. I've left a forwarding wrapper for the instruction so users with the instruction don't need to create a GEPOperator themselves. This lets us remove the copy of this code in instsimplify. I've looked at most of the other copies of similar code, and this is the only one I've found that is actually exactly the same. The one in InlineCost is very close, but it requires re-mapping non-constant indices through the cost analysis value simplification map. I could add direct support for this to the generic routine, but it seems overly specific. llvm-svn: 169853	2012-12-11 11:05:15 +00:00
Chandler Carruth	ffab924447	Hoist the GEP constant address offset computation to a common home on the GEP instruction class. This is part of the continued refactoring and cleaning of the infrastructure used by SROA. This particular operation is also done in a few other places which I'll try to refactor to share this implementation. llvm-svn: 169852	2012-12-11 10:29:10 +00:00
Patrik Hagglund	d09c604a20	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 169851	2012-12-11 10:24:48 +00:00
Patrik Hagglund	f45125a118	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. Accordingly, add bitsLT (and similar) to MVT. llvm-svn: 169850	2012-12-11 10:20:51 +00:00
Patrik Hagglund	4dc66d3907	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 169849	2012-12-11 10:16:19 +00:00
Patrik Hagglund	19b28301f3	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 169848	2012-12-11 10:09:23 +00:00
Patrik Hagglund	48f063d9a8	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 169847	2012-12-11 10:05:04 +00:00
Patrik Hagglund	39281bd5fb	Change TargetLowering::getRepRegClassCostFor, getIndexedLoadAction, getIndexedStoreAction, and addRegisterClass to take an MVT, instead of EVT. llvm-svn: 169846	2012-12-11 10:00:35 +00:00
Patrik Hagglund	9597517d65	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 169845	2012-12-11 09:57:18 +00:00
Patrik Hagglund	4ab6c88920	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 169844	2012-12-11 09:54:23 +00:00
Patrik Hagglund	7692ba3a13	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 169843	2012-12-11 09:51:27 +00:00
Patrik Hagglund	cfd4d97792	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 169842	2012-12-11 09:48:14 +00:00
Patrik Hagglund	dec1aa5bc5	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 169841	2012-12-11 09:42:24 +00:00
Patrik Hagglund	6c9d0f4058	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 169840	2012-12-11 09:39:09 +00:00
Patrik Hagglund	8fcc9acaaa	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 169839	2012-12-11 09:32:56 +00:00
Patrik Hagglund	0b24527a59	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 169838	2012-12-11 09:31:43 +00:00
Patrik Hagglund	758f9c5011	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. llvm-svn: 169837	2012-12-11 09:10:33 +00:00
Hao Liu	e24e94d1cb	revert the test change llvm-svn: 169823	2012-12-11 06:25:18 +00:00
Hao Liu	a4b44869a6	A newbie try a test commit llvm-svn: 169821	2012-12-11 06:22:54 +00:00
NAKAMURA Takumi	8f3b9c1a45	[CMake] Remove dependencies to intrinsics_gen I introduced in r169724. llvm-svn: 169819	2012-12-11 05:53:54 +00:00
NAKAMURA Takumi	c27d555bb0	llvm/Target/TargetMachine.h: Remove two dependent headers. -#include "llvm/Target/TargetTransformImpl.h" -#include "llvm/TargetTransformInfo.h" llvm-svn: 169818	2012-12-11 05:53:43 +00:00
NAKAMURA Takumi	3e21082d83	llvm/tools: Add #include "llvm/TargetTransformInfo.h" llvm-svn: 169817	2012-12-11 05:53:37 +00:00
Jyotsna Verma	7fb1e065a0	Use multiclass for new-value store instructions with MEMri operand. llvm-svn: 169814	2012-12-11 05:12:25 +00:00
Nadav Rotem	0715a221d8	Fix PR14565. Don't if-convert loops that have switch statements in them. llvm-svn: 169813	2012-12-11 04:55:10 +00:00
Rafael Espindola	caab8c1242	Change some functions to take const pointers. llvm-svn: 169812	2012-12-11 03:10:43 +00:00
Evan Cheng	a13f64ff50	Stylistic tweak. llvm-svn: 169811	2012-12-11 02:31:57 +00:00
Chad Rosier	175af3914a	Add a triple to this test. llvm-svn: 169803	2012-12-11 00:51:36 +00:00
Chandler Carruth	ac8f03ddc1	Fix a miscompile in the DAG combiner. Previously, we would incorrectly try to reduce the width of this load, and would end up transforming: (truncate (lshr (sextload i48 <ptr> as i64), 32) to i32) to (truncate (zextload i32 <ptr+4> as i64) to i32) We lost the sext attached to the load while building the narrower i32 load, and replaced it with a zext because lshr always zext's the results. Instead, bail out of this combine when there is a conflict between a sextload and a zext narrowing. The rest of the DAG combiner still optimize the code down to the proper single instruction: movswl 6(...),%eax Which is exactly what we wanted. Previously we read past the end and missed the sign extension: movl 6(...), %eax llvm-svn: 169802	2012-12-11 00:36:57 +00:00
Paul Redmond	fde20fa567	move X86-specific test This test case uses -mcpu=corei7 so it belongs in CodeGen/X86 Reviewed by: Nadav llvm-svn: 169801	2012-12-11 00:36:43 +00:00

1 2 3 4 5 ...

87301 Commits