llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 22:12:57 +02:00

Author	SHA1	Message	Date
Akira Hatanaka	8d412f5a8a	[mips] Print move instructions. "move $4, $5" is printed instead of "or $4, $5, $zero". llvm-svn: 176455	2013-03-04 22:25:01 +00:00
Jack Carter	44abaa390d	Mips specific inline assembler constraint 'R' 'R' An address that can be sued in a non-macro load or store. This patch includes a positive test case. llvm-svn: 176452	2013-03-04 21:33:15 +00:00
Preston Gurd	66b9c4fcf9	Bypass Slow Divides * Only apply divide bypass optimization when not optimizing for size. * Fixed bug caused by constant for 0 value of type Int32, used dividend type to generate the constant instead. * For atom x86-64 apply the divide bypass to use 16-bit divides instead of 64-bit divides when operand values are small enough. * Added lit tests for 64-bit divide bypass. Patch by Tyler Nowicki! llvm-svn: 176442	2013-03-04 18:13:57 +00:00
Tom Stellard	6cdfe5698a	R600: Clean up datalayout strings so they better match hardware capabilities llvm-svn: 176439	2013-03-04 17:40:28 +00:00
Jia Liu	d8829e76b3	Mips ISD typo llvm-svn: 176426	2013-03-04 01:06:54 +00:00
Jim Grosbach	2b831fb8d3	ARM: Creating a vector from a lane of another. The VDUP instruction source register doesn't allow a non-constant lane index, so make sure we don't construct a ARM::VDUPLANE node asking it to do so. rdar://13328063 http://llvm.org/bugs/show_bug.cgi?id=13963 llvm-svn: 176413	2013-03-02 20:16:24 +00:00
Jim Grosbach	c4e1223217	Clean up code format a bit. llvm-svn: 176412	2013-03-02 20:16:19 +00:00
Jim Grosbach	a2c026c2f1	Tidy up. Trailing whitespace. llvm-svn: 176411	2013-03-02 20:16:15 +00:00
Arnold Schwaighofer	c633bf302e	ARM NEON: Fix v2f32 float intrinsics Mark them as expand, they are not legal as our backend does not match them. llvm-svn: 176410	2013-03-02 19:38:33 +00:00
Arnold Schwaighofer	e60e6fc70f	X86 cost model: Adjust cost for custom lowered vector multiplies This matters for example in following matrix multiply: int mmult(int rows, int cols, int m1, int m2, int m3) { int i, j, k, val; for (i=0; i<rows; i++) { for (j=0; j<cols; j++) { val = 0; for (k=0; k<cols; k++) { val += m1[i][k] * m2[k][j]; } m3[i][j] = val; } } return(m3); } Taken from the test-suite benchmark Shootout. We estimate the cost of the multiply to be 2 while we generate 9 instructions for it and end up being quite a bit slower than the scalar version (48% on my machine). Also, properly differentiate between avx1 and avx2. On avx-1 we still split the vector into 2 128bits and handle the subvector muls like above with 9 instructions. Only on avx-2 will we have a cost of 9 for v4i64. I changed the test case in test/Transforms/LoopVectorize/X86/avx1.ll to use an add instead of a mul because with a mul we now no longer vectorize. I did verify that the mul would be indeed more expensive when vectorized with 3 kernels: for (i ...) r += a[i] * 3; for (i ...) m1[i] = m1[i] * 3; // This matches the test case in avx1.ll and a matrix multiply. In each case the vectorized version was considerably slower. radar://13304919 llvm-svn: 176403	2013-03-02 04:02:52 +00:00
Andrew Trick	bc662b6282	Added FIXME for future Hexagon cleanup. llvm-svn: 176400	2013-03-02 01:43:08 +00:00
Akira Hatanaka	d2f7ed089c	[mips] Fix inefficient code generation. This patch eliminates the need to emit a constant move instruction when this pattern is matched: (select (setgt a, Constant), T, F) The pattern above effectively turns into this: (conditional-move (setlt a, Constant + 1), F, T) llvm-svn: 176384	2013-03-01 21:52:08 +00:00
Akira Hatanaka	a064b57260	Fix indentation. llvm-svn: 176380	2013-03-01 21:22:21 +00:00
Michael Liao	1e621fbd2f	Fix PR10475 - ISD::SHL/SRL/SRA must have either both scalar or both vector operands but TLI.getShiftAmountTy() so far only return scalar type. As a result, backend logic assuming that breaks. - Rename the original TLI.getShiftAmountTy() to TLI.getScalarShiftAmountTy() and re-define TLI.getShiftAmountTy() to return target-specificed scalar type or the same vector type as the 1st operand. - Fix most TICG logic assuming TLI.getShiftAmountTy() a simple scalar type. llvm-svn: 176364	2013-03-01 18:40:30 +00:00
Chad Rosier	313ffa4bc0	Add support for using non-pic code for arm and thumb1 when emitting the sjlj dispatch code. As far as I can tell the thumb2 code is behaving as expected. I was able to compile and run the associated test case for both arm and thumb1. rdar://13066352 llvm-svn: 176363	2013-03-01 18:30:38 +00:00
Jyotsna Verma	359daa8ad9	Hexagon: Add constant extender support framework. llvm-svn: 176358	2013-03-01 17:37:13 +00:00
Christian Konig	bb4b8f7965	R600/SI: handle all registers in copyPhysReg v2 v2: based on Michels patch, but now allows copying of all registers sizes. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176346	2013-03-01 09:46:27 +00:00
Christian Konig	ce629dcbb4	R600/SI: remove S_MOV immediate patterns They won't match anyway. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176345	2013-03-01 09:46:22 +00:00
Christian Konig	d6999b3354	R600/SI: remove GPR*AlignEncode It's much easier to specify the encoding with tablegen directly. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176344	2013-03-01 09:46:17 +00:00
Christian Konig	b528622393	R600/SI: fix warning about overloaded virtual Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176343	2013-03-01 09:46:11 +00:00
Christian Konig	6ddcef3c30	R600/SI: fix inserting waits for unordered defines Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176342	2013-03-01 09:46:04 +00:00
Duncan Sands	268f52a52f	GCC thinks that this variable might be used uninitialized (it isn't). llvm-svn: 176341	2013-03-01 09:46:03 +00:00
Akira Hatanaka	04b651d332	[mips] Remove unused option. Fix 80-column violations. llvm-svn: 176330	2013-03-01 02:17:02 +00:00
Akira Hatanaka	9449ff1c6e	[mips] Add the capability to search delay slot filling instructions in successor basic blocks. Currently this is off by default. llvm-svn: 176329	2013-03-01 02:03:51 +00:00
Akira Hatanaka	b55d677ce5	[mips] Do not add SecondLastInst to list BranchInstrs if there is only one terminator. No functionality change. llvm-svn: 176326	2013-03-01 01:22:26 +00:00
Akira Hatanaka	ec7322cdd6	[mips] Define an overloaded version of function MipsInstrInfo::AnalyzeBranchAdd. This function will be used later when the capability to search delay slot filling instructions in successor blocks is added. No intended functionality changes. llvm-svn: 176325	2013-03-01 01:10:17 +00:00
Akira Hatanaka	411670a07b	[mips] Add options to disable searching backward and in successor blocks. llvm-svn: 176321	2013-03-01 01:02:36 +00:00
Akira Hatanaka	01c253f718	[mips] Add capability to search in the forward direction for instructions that can fill the delay slot. Currently, this is off by default. llvm-svn: 176320	2013-03-01 00:50:52 +00:00
Akira Hatanaka	a433f695dd	[mips] Define helper function searchRange No functionality change. llvm-svn: 176318	2013-03-01 00:26:14 +00:00
Akira Hatanaka	99e76c0e98	[mips] Rename function findDelayInstr to searchBackward. llvm-svn: 176317	2013-03-01 00:20:16 +00:00
Akira Hatanaka	581402232c	[mips] Define class MemDefsUses. This class tracks dependence between memory instructions using underlying objects of memory operands. llvm-svn: 176313	2013-03-01 00:16:31 +00:00
Chad Rosier	ab7a6a51de	Tidy up; no functional change. llvm-svn: 176288	2013-02-28 19:16:42 +00:00
Chad Rosier	c33dea7e4d	Style; no functional change. llvm-svn: 176285	2013-02-28 18:54:27 +00:00
Yiannis Tsiouris	b2a123a008	Re-format comments (and check commit access) llvm-svn: 176270	2013-02-28 16:59:10 +00:00
Tim Northover	e7cedcf871	AArch64: remove post-encoder method from FCMP (immediate) instructions. The work done by the post-encoder (setting architecturally unused bits to 0 as required) can be done by the existing operand that covers the "#0.0". This removes at least one use of the discouraged PostEncoderMethod uses. llvm-svn: 176261	2013-02-28 14:46:14 +00:00
Tim Northover	779708f861	AArch64: be more careful resorting to inefficient addressing for weak vars. If an otherwise weak var is actually defined in this unit, it can't be undefined at runtime so we can use normal global variable sequences (ADRP/ADD) to access it. llvm-svn: 176259	2013-02-28 14:36:31 +00:00
Tim Northover	b24657b0c5	AArch64: don't drop GlobalAddress offset when handling extern_weak decls. llvm-svn: 176258	2013-02-28 14:36:24 +00:00
Tim Northover	e2cf283c3e	AArch64: Use cbnz instead of cmp/b.ne pair for atomic operations. llvm-svn: 176253	2013-02-28 13:52:07 +00:00
Jim Grosbach	4d945565f7	ARM: FMA is legal only if VFP4 is available. rdar://13306723 llvm-svn: 176212	2013-02-27 21:31:12 +00:00
Chad Rosier	fb2e02e725	Remove this instance of dl as it's defined in a previous scope. llvm-svn: 176208	2013-02-27 20:34:14 +00:00
Tim Northover	281992500a	ARM: permit full range of valid ADR immediates. This fixes an issue where trying to assemlbe valid ADR instructions would cause LLVM to hit a failed assertion. Patch by Keith Walker. llvm-svn: 176189	2013-02-27 16:43:09 +00:00
Nadav Rotem	6489b3c2b7	Revert r176166 because it broke one of the lit tests. llvm-svn: 176171	2013-02-27 05:56:20 +00:00
Nadav Rotem	c64a5a0435	std::string to StringRef. llvm-svn: 176166	2013-02-27 05:23:56 +00:00
Reed Kotler	fc840cf992	Fix cut/paste error in a comment. llvm-svn: 176165	2013-02-27 04:20:14 +00:00
Reed Kotler	ffc8d6b3af	Add the skeleton for the Mips constant island pass. It will only be used for Mips 16 at this time. llvm-svn: 176161	2013-02-27 03:33:58 +00:00
Bill Schmidt	5440b8eaca	Fix PR15332 (patch by Florian Zeitz). There's no need to generate a stack frame for PPC32 SVR4 when there are no local variables assigned to the stack, i.e., when no red zone is needed. (PPC64 supports a red zone, but PPC32 does not.) llvm-svn: 176124	2013-02-26 21:28:57 +00:00
Christian Konig	1d0c9b5395	R600/SI: Add promotion of e32 to e64 in operand folding Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176105	2013-02-26 17:52:47 +00:00
Christian Konig	e5dbe6105f	R600/SI: add VOP mapping functions Make it possible to map between e32 and e64 encoding opcodes. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176104	2013-02-26 17:52:42 +00:00
Christian Konig	32f29baf4e	R600/SI: swap operands if it helps folding Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176103	2013-02-26 17:52:36 +00:00
Christian Konig	2c9a510876	R600/SI: add some more instruction flags Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176102	2013-02-26 17:52:29 +00:00

1 2 3 4 5 ...

23482 Commits