llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Vincent Lejeune	0db35fba3a	R600: initial scheduler code This is a skeleton for a pre-RA MachineInstr scheduler strategy. Currently it only tries to expose more parallelism for ALU instructions (this also makes the distribution of GPR channels more uniform and increases the chances of ALU instructions to be packed together in a single VLIW group). Also it tries to reduce clause switching by grouping instructions of the same kind (ALU/FETCH/CF) together. Vincent Lejeune: - Support for VLIW4 Slot assignment - Recomputation of ScheduleDAG to get more parallelism opportunities Tom Stellard: - Fix assertion failure when trying to determine an instruction's slot based on its destination register's class - Fix some compiler warnings Vincent Lejeune: [v2] - Remove recomputation of ScheduleDAG (will be provided in a later patch) - Improve estimation of an ALU clause size so that heuristic does not emit cf instructions at the wrong position. - Make schedule heuristic smarter using SUnit Depth - Take constant read limitations into account Vincent Lejeune: [v3] - Fix some uninitialized values in ConstPair - Add asserts to ensure an ALU slot is always populated llvm-svn: 176498	2013-03-05 18:41:32 +00:00
Arnold Schwaighofer	7475aaf23e	Clarify comment for function getObjectSize Clarify that we mean the object starting at the pointer to the end of the underlying object and not the size of the whole allocated object. llvm-svn: 176491	2013-03-05 16:53:24 +00:00
David Sehr	5f7e2bf434	Add a test that .align directives on capable processors use long NOPs. llvm-svn: 176490	2013-03-05 16:46:54 +00:00
Vincent Lejeune	a008982a6c	R600: Remove LowerConstCopyPass and lower CONST_COPY right after ISel. Maintaining CONST_COPY instructions until Pre Emit may prevent some ifcvt cases, and taking them into account for scheduling is difficult for no real benefit. llvm-svn: 176488	2013-03-05 15:04:55 +00:00
Vincent Lejeune	9ca3635aac	R600: Turn BUILD_VECTOR into Reg_Sequence Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176487	2013-03-05 15:04:49 +00:00
Vincent Lejeune	03f33e1dfc	R600: CONST_ADDRESS node is not marked as mayLoad anymore Reviewed-by: Tom Stellard <thomas.stellard at amd.com> mayLoad complicates scheduling and does not bring any useful info, as the location is not writable at all. llvm-svn: 176486	2013-03-05 15:04:42 +00:00
Vincent Lejeune	d0d37f790e	R600: Use MUL_IEEE for trig/fdiv intrinsic Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176485	2013-03-05 15:04:37 +00:00
Vincent Lejeune	484e21aa38	R600: Add support for indirect addressing of non default const buffer NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 176484	2013-03-05 15:04:29 +00:00
Alexey Samsonov	a707dc5c34	Print a warning message if compiler-rt can't be built because of old CMake version to make this requirement more visible to users llvm-svn: 176481	2013-03-05 14:43:07 +00:00
NAKAMURA Takumi	7ab1b62c2b	llvm/test/CodeGen/Mips/mips64-f128.ll: Add explicit -mtriple=mips64el-unknown-unknown to appease win32. FIXME: Is it expected for win32 to affect mips targets? llvm-svn: 176471	2013-03-05 02:18:59 +00:00
NAKAMURA Takumi	7c27fc3cf0	llvm/test/CodeGen/Thumb/iabs.ll: Add explicit -mtriple=thumb-unknown-unknown to appease win32 hosts. llvm-svn: 176470	2013-03-05 02:18:52 +00:00
Bill Wendling	58256ba413	Remove unused #includes. llvm-svn: 176467	2013-03-05 01:00:45 +00:00
David Sehr	e37f7ab590	The current X86 NOP padding uses one long NOP followed by the remainder in one-byte NOPs. If the processor actually executes those NOPs, as it sometimes does with aligned bundling, this can have a performance impact. From my micro-benchmarks run on my one machine, a 15-byte NOP followed by twelve one-byte NOPs is about 20% worse than a 15 followed by a 12. This patch changes NOP emission to emit as many 15-byte (the maximum) as possible followed by at most one shorter NOP. llvm-svn: 176464	2013-03-05 00:02:23 +00:00
Lang Hames	ba8dee024f	Check isDiscardableIfUnused, rather than hasLocalLinkage, when bumping GlobalValue linkage up to ExternalLinkage in the ExtractGV pass. This prevents linkonce and linkonce_odr symbols from being DCE'd. llvm-svn: 176459	2013-03-04 22:40:44 +00:00
Akira Hatanaka	8d412f5a8a	[mips] Print move instructions. "move $4, $5" is printed instead of "or $4, $5, $zero". llvm-svn: 176455	2013-03-04 22:25:01 +00:00
Jack Carter	44abaa390d	Mips specific inline assembler constraint 'R'. 'R': An address that can be used in a non-macro load or store. This patch includes a positive test case. llvm-svn: 176452	2013-03-04 21:33:15 +00:00
Eli Bendersky	f241522533	Reapply r176381, writing the CHECKs in a more forgiving manner to account for running llvm-objdump on Darwin. llvm-svn: 176443	2013-03-04 18:20:31 +00:00
Preston Gurd	66b9c4fcf9	Bypass Slow Divides * Only apply divide bypass optimization when not optimizing for size. * Fixed bug caused by constant for 0 value of type Int32, used dividend type to generate the constant instead. * For atom x86-64 apply the divide bypass to use 16-bit divides instead of 64-bit divides when operand values are small enough. * Added lit tests for 64-bit divide bypass. Patch by Tyler Nowicki! llvm-svn: 176442	2013-03-04 18:13:57 +00:00
Tom Stellard	6cdfe5698a	R600: Clean up datalayout strings so they better match hardware capabilities llvm-svn: 176439	2013-03-04 17:40:28 +00:00
Jia Liu	d8829e76b3	Mips ISD typo llvm-svn: 176426	2013-03-04 01:06:54 +00:00
Jim Grosbach	2b831fb8d3	ARM: Creating a vector from a lane of another. The VDUP instruction source register doesn't allow a non-constant lane index, so make sure we don't construct an ARM::VDUPLANE node asking it to do so. rdar://13328063 http://llvm.org/bugs/show_bug.cgi?id=13963 llvm-svn: 176413	2013-03-02 20:16:24 +00:00
Jim Grosbach	c4e1223217	Clean up code format a bit. llvm-svn: 176412	2013-03-02 20:16:19 +00:00
Jim Grosbach	a2c026c2f1	Tidy up. Trailing whitespace. llvm-svn: 176411	2013-03-02 20:16:15 +00:00
Arnold Schwaighofer	c633bf302e	ARM NEON: Fix v2f32 float intrinsics Mark them as expand, they are not legal as our backend does not match them. llvm-svn: 176410	2013-03-02 19:38:33 +00:00
Nuno Lopes	fc752c7658	recommit r172363 & r171325 (reverted in r172756) This adds minimalistic support for PHI nodes to llvm.objectsize() evaluation. Fingers crossed so that it doesn't break clang bootstrap again. llvm-svn: 176408	2013-03-02 11:36:24 +00:00
Nuno Lopes	a2fd2b65d3	add getUnderlyingObjectSize(). This is similar to getObjectSize(), but doesn't subtract the offset. Tweak the BasicAA code accordingly (per PR14988) llvm-svn: 176407	2013-03-02 11:23:34 +00:00
Arnold Schwaighofer	e60e6fc70f	X86 cost model: Adjust cost for custom lowered vector multiplies This matters for example in the following matrix multiply: int **mmult(int rows, int cols, int **m1, int **m2, int **m3) { int i, j, k, val; for (i=0; i<rows; i++) { for (j=0; j<cols; j++) { val = 0; for (k=0; k<cols; k++) { val += m1[i][k] * m2[k][j]; } m3[i][j] = val; } } return(m3); } Taken from the test-suite benchmark Shootout. We estimate the cost of the multiply to be 2 while we generate 9 instructions for it and end up being quite a bit slower than the scalar version (48% on my machine). Also, properly differentiate between avx1 and avx2. On avx-1 we still split the vector into 2 128bits and handle the subvector muls like above with 9 instructions. Only on avx-2 will we have a cost of 9 for v4i64. I changed the test case in test/Transforms/LoopVectorize/X86/avx1.ll to use an add instead of a mul because with a mul we now no longer vectorize. I did verify that the mul would be indeed more expensive when vectorized with 3 kernels: for (i ...) r += a[i] * 3; for (i ...) m1[i] = m1[i] * 3; // This matches the test case in avx1.ll and a matrix multiply. In each case the vectorized version was considerably slower. radar://13304919 llvm-svn: 176403	2013-03-02 04:02:52 +00:00
Andrew Trick	bc662b6282	Added FIXME for future Hexagon cleanup. llvm-svn: 176400	2013-03-02 01:43:08 +00:00
Nadav Rotem	6d803820f8	PR14448 - prevent the loop vectorizer from vectorizing the same loop twice. The LoopVectorizer often runs multiple times on the same function due to inlining. When this happens the loop vectorizer often vectorizes the same loops multiple times, increasing code size and adding unneeded branches. With this patch, the vectorizer during vectorization puts metadata on scalar loops and marks them as 'already vectorized' so that it knows to ignore them when it sees them a second time. PR14448. llvm-svn: 176399	2013-03-02 01:33:49 +00:00
Peter Collingbourne	8b72c382d6	Modify {Call,Invoke}Inst::addAttribute to take an AttrKind. llvm-svn: 176397	2013-03-02 01:20:18 +00:00
Jordan Rose	43df9a115c	CMake: Always include the CheckCXXCompilerFlag in HandleLLVMOptions.cmake. Previously we relied on it being included by config-ix.cmake. llvm-svn: 176396	2013-03-02 01:00:40 +00:00
Michael Gottesman	a4c89f27cc	Revert "Rewrite a test to count emitted instructions without using -stats" This reverts commit aac7922b8fe7ae733d3fe6697e6789fd730315dc. I am reverting the commit since it broke the phase 1 public buildbot for a few hours. http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RA/builds/2137 llvm-svn: 176394	2013-03-02 00:53:20 +00:00
Eli Bendersky	29ed4d2427	Remove duplicate line and move another closer to its actual use llvm-svn: 176391	2013-03-01 23:32:40 +00:00
Andrew Trick	ab2046b460	MIsched machine model: tablegen subtarget emitter improvement. Fix the way resources are counted. I'm taking some time to cleanup the way MachineScheduler handles in-order machine resources. Eventually we'll need more PPC/Atom test cases in tree. llvm-svn: 176390	2013-03-01 23:31:26 +00:00
Argyrios Kyrtzidis	697931fd3d	In llvm::MemoryBuffer::getFile() remove an unnecessary stat call check. The sys::fs::is_directory() check is unnecessary because, if the filename is a directory, the function will fail anyway with the same error code returned. Remove the check to avoid an unnecessary stat call. Someone needs to review on windows and see if the check is necessary there or not. llvm-svn: 176386	2013-03-01 22:48:51 +00:00
Stefanus Du Toit	e13d557d54	Fix my email address in CREDITS.TXT. Checking to see if svn notifications also use correct address now. llvm-svn: 176385	2013-03-01 22:20:03 +00:00
Akira Hatanaka	d2f7ed089c	[mips] Fix inefficient code generation. This patch eliminates the need to emit a constant move instruction when this pattern is matched: (select (setgt a, Constant), T, F) The pattern above effectively turns into this: (conditional-move (setlt a, Constant + 1), F, T) llvm-svn: 176384	2013-03-01 21:52:08 +00:00
Jean-Luc Duprat	29a7c88237	Removed extraneous #include "LLVMContextImpl.h" from lib/IR/Module.cpp llvm-svn: 176382	2013-03-01 21:37:24 +00:00
Eli Bendersky	f364fecbb8	Rewrite a test to count emitted instructions without using -stats Also removed the comments of "should produce..." because they completely don't match the actually produced output. llvm-svn: 176381	2013-03-01 21:34:37 +00:00
Akira Hatanaka	a064b57260	Fix indentation. llvm-svn: 176380	2013-03-01 21:22:21 +00:00
Akira Hatanaka	e2ccd1b4d6	Set properties for f128 type. llvm-svn: 176378	2013-03-01 21:11:44 +00:00
Eli Bendersky	b43cf3bd32	Rewrite a test to check actual output rather than intermediate implementation detail. The way this test was written, it was relying on an implementation detail (fixups) and hence was very brittle (relying, among other things, on the exact ordering of statistics printed by MC). The test was rewritten to check a more observable output difference. While it doesn't cover 100% of the things the original test covered, it's a good practice to write regression tests this way. If we want to check that internal details and invariants hold, such tests should be expressed as unit tests. llvm-svn: 176377	2013-03-01 20:54:00 +00:00
Edwin Vane	d84583a773	No need to force-create clang-tools-extra lit.site.cfg The make (all) target takes care of creating lit configs and auto-generating tests. The problem with the original 'lit.site.cfg' target is it's not recursive and doesn't fully create everything necessary for testing clang-tools-extra. llvm-svn: 176374	2013-03-01 19:58:58 +00:00
Michael Liao	fde72e5106	Add regression tests (WORKSFORME) - These tests won't crash on trunk, but it is better to add them so that they don't break again in the future. llvm-svn: 176369	2013-03-01 19:23:37 +00:00
Chad Rosier	25ffc43c38	Generate an error message instead of asserting or segfaulting when we can't handle indirect register inputs. rdar://13322011 llvm-svn: 176367	2013-03-01 19:12:05 +00:00
Benjamin Kramer	a462070739	LoopVectorize: Don't hang forever if a PHI only has skipped PHI uses. Fixes PR15384. llvm-svn: 176366	2013-03-01 19:07:31 +00:00
Michael Ilseman	6bd55f4125	Cache the result of Function::getIntrinsicID() in a DenseMap attached to the LLVMContext. This reduces the time actually spent doing string to ID conversion and shows a 10% improvement in compile time for a particularly bad case that involves ARM Neon intrinsics (these have many overloads). Patch by Jean-Luc Duprat! llvm-svn: 176365	2013-03-01 18:48:54 +00:00
Michael Liao	1e621fbd2f	Fix PR10475 - ISD::SHL/SRL/SRA must have either both scalar or both vector operands, but TLI.getShiftAmountTy() so far only returns a scalar type. As a result, backend logic assuming that breaks. - Rename the original TLI.getShiftAmountTy() to TLI.getScalarShiftAmountTy() and re-define TLI.getShiftAmountTy() to return a target-specified scalar type or the same vector type as the 1st operand. - Fix most TICG logic assuming TLI.getShiftAmountTy() is a simple scalar type. llvm-svn: 176364	2013-03-01 18:40:30 +00:00
Chad Rosier	313ffa4bc0	Add support for using non-pic code for arm and thumb1 when emitting the sjlj dispatch code. As far as I can tell the thumb2 code is behaving as expected. I was able to compile and run the associated test case for both arm and thumb1. rdar://13066352 llvm-svn: 176363	2013-03-01 18:30:38 +00:00
Christian Konig	1a86119413	R600/SI: fix sampler tests after fixing wait insertions Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176359	2013-03-01 17:39:05 +00:00
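
The NOP padding change in r176464 (e37f7ab590) describes emitting as many 15-byte NOPs as possible, followed by at most one shorter NOP. A minimal sketch of that arithmetic follows. The function name `splitNopPadding` and its parameters are hypothetical and are not taken from the LLVM sources; only the 15-byte maximum comes from the commit message.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper (not LLVM's actual code): split a padding request of
// `count` bytes into individual NOP lengths. It uses as many maximum-length
// NOPs as possible, followed by at most one shorter NOP for the remainder.
std::vector<std::uint64_t> splitNopPadding(std::uint64_t count,
                                           std::uint64_t maxNopLen = 15) {
  assert(maxNopLen > 0 && "maximum NOP length must be positive");
  std::vector<std::uint64_t> lengths;
  while (count >= maxNopLen) {
    lengths.push_back(maxNopLen);  // one full-length NOP
    count -= maxNopLen;
  }
  if (count > 0)
    lengths.push_back(count);      // single shorter NOP for the remainder
  return lengths;
}
```

Under these assumptions, a 27-byte request yields one 15-byte NOP and one 12-byte NOP, which is the "15 followed by a 12" case the commit message compares against a 15-byte NOP plus twelve one-byte NOPs.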
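
The Mips change in r176384 (d2f7ed089c) rewrites `(select (setgt a, Constant), T, F)` as `(conditional-move (setlt a, Constant + 1), F, T)`. The sketch below checks that the two forms agree for integers. The function names are hypothetical, and the guard against `Constant + 1` overflowing is an assumption added here, since the commit message does not say how that case is handled.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Original pattern: (select (setgt a, C), T, F).
std::int32_t selectGt(std::int32_t a, std::int32_t c,
                      std::int32_t t, std::int32_t f) {
  return a > c ? t : f;
}

// Rewritten pattern: (conditional-move (setlt a, C + 1), F, T).
// For integers, a > C holds exactly when a < C + 1 does not, so swapping
// the two result operands preserves the selected value.
std::int32_t selectViaLtPlusOne(std::int32_t a, std::int32_t c,
                                std::int32_t t, std::int32_t f) {
  // Assumption made for this sketch: C + 1 must not overflow.
  assert(c != std::numeric_limits<std::int32_t>::max());
  return a < c + 1 ? f : t;
}
```

For example, with `a = 5` and `C = 4` both functions return `T`, and with `a = 4` and `C = 4` both return `F`.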

89957 Commits