llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Jakub Staszak	e47df808ad	unHECKify test. It was fixed by Chris in 2009. llvm-svn: 170017	2012-12-12 20:43:00 +00:00
Jakub Staszak	60715f510b	Fix typo in test-case. llvm-svn: 170015	2012-12-12 20:29:06 +00:00
Jakub Staszak	980c2687c8	Fix typo. llvm-svn: 170006	2012-12-12 19:47:04 +00:00
Nadav Rotem	2c25a05088	LoopVectorizer: Use the "optsize" attribute to decide if we are allowed to increase the function size. llvm-svn: 170004	2012-12-12 19:29:45 +00:00
Shuxin Yang	0b24765e3f	- Fix a problematic way in creating all-the-1 APInt. - Propagate "exact" bit of [l|a]shr instruction. llvm-svn: 169942	2012-12-12 00:29:03 +00:00
Michael Ilseman	5db40ba98e	Added a slew of SimplifyInstruction floating-point optimizations, many of which take advantage of fast-math flags. Test cases included. fsub X, +0 ==> X fsub X, -0 ==> X, when we know X is not -0 fsub +/-0.0, (fsub -0.0, X) ==> X fsub nsz +/-0.0, (fsub +/-0.0, X) ==> X fsub nnan ninf X, X ==> 0.0 fadd nsz X, 0 ==> X fadd [nnan ninf] X, (fsub [nnan ninf] 0, X) ==> 0 where nnan and ninf have to occur at least once somewhere in this expression fmul X, 1.0 ==> X llvm-svn: 169940	2012-12-12 00:27:46 +00:00
Nadav Rotem	054379720d	PR14574. Fix a bug in the code that calculates the mask of the converted PHIs in if-conversion. llvm-svn: 169916	2012-12-11 21:30:14 +00:00
Nadav Rotem	fb45c4d6b4	Loop Vectorize: optimize the vectorization of trunc(induction_var). The truncation is now done on scalars. llvm-svn: 169904	2012-12-11 18:58:10 +00:00
Nadav Rotem	0715a221d8	Fix PR14565. Don't if-convert loops that have switch statements in them. llvm-svn: 169813	2012-12-11 04:55:10 +00:00
Nadav Rotem	196fc7cc8c	Add support for reverse induction variables. For example: while (i--) sum+=A[i]; llvm-svn: 169752	2012-12-10 19:25:06 +00:00
Chandler Carruth	c9b6bd9712	Fix PR14548: SROA was crashing on a mixture of i1 and i8 loads and stores. When SROA was evaluating a mixture of i1 and i8 loads and stores, in just a particular case, it would tickle a latent bug where we compared bits to bytes rather than bits to bits. As a consequence of the latent bug, we would allow integers through which were not byte-size multiples, a situation the later rewriting code was never intended to handle. In release builds this could trigger all manner of oddities, but the reported issue in PR14548 was forming invalid bitcast instructions. The only downside of this fix is that it makes it more clear that SROA in its current form is not capable of handling mixed i1 and i8 loads and stores. Sometimes with the previous code this would work by luck, but usually it would crash, so I'm not terribly worried. I'll watch the LNT numbers just to be sure. llvm-svn: 169719	2012-12-10 00:54:45 +00:00
Paul Redmond	e43761293d	LoopVectorize: support vectorizing intrinsic calls - added function to VectorTargetTransformInfo to query cost of intrinsics - vectorize trivially vectorizable intrinsic calls such as sin, cos, log, etc. Reviewed by: Nadav llvm-svn: 169711	2012-12-09 20:42:17 +00:00
Shuxin Yang	7221b14d96	- Re-enable population count loop idiom recognition - fix a bug which caused a segfault. - add two test cases which were causing crashes llvm-svn: 169687	2012-12-09 03:12:46 +00:00
Chandler Carruth	329a5c1e03	Revert the patches adding a popcount loop idiom recognition pass. There are still bugs in this pass, as well as other issues that are being worked on, but the bugs are crashers that occur pretty easily in the wild. Test cases have been sent to the original commit's review thread. This reverts the commits: r169671: Fix a logic error. r169604: Move the popcnt tests to an X86 subdirectory. r168931: Initial commit adding the pass. llvm-svn: 169683	2012-12-08 22:18:29 +00:00
David Tweed	74d6b6b73b	The test unconditionally assumes a particular cpu has a backend built in the target. Buildbots for some hosts may choose to build only their own backend in order to maximise testing-turnaround time. Move the test into a prefixed directory so lit's standard "backend specific" suppression can be done. llvm-svn: 169604	2012-12-07 15:57:45 +00:00
Chandler Carruth	9290708acb	Add support to ValueTracking for determining that a pointer is non-null by virtue of inbounds GEPs that preclude a null pointer. This is a very common pattern in the code generated by std::vector and other standard library routines which use allocators that test for null pervasively. This is one step closer to teaching Clang+LLVM to be able to produce an empty function for: void f() { std::vector<int> v; v.push_back(1); v.push_back(2); v.push_back(3); v.push_back(4); } Which is related to getting them to completely fold SmallVector push_back sequences into constants when inlining and other optimizations make that a possibility. llvm-svn: 169573	2012-12-07 02:08:58 +00:00
Dmitri Gribenko	d81733f67d	Fix typos in CHECK lines. Patch by Alexander Zinenko. llvm-svn: 169547	2012-12-06 21:24:47 +00:00
Shuxin Yang	b226a9f2b3	fix a typo llvm-svn: 169345	2012-12-05 00:33:16 +00:00
Nadav Rotem	452993ad1a	Fix a bug in vectorization of if-converted reduction variables. If the reduction variable is not used outside the loop then we ran into an endless loop. This change checks if we found the original PHI. llvm-svn: 169324	2012-12-04 22:40:22 +00:00
Shuxin Yang	c390be6a5d	For rdar://12329730, last piece. This change attempts to simplify (X^Y) -> X or Y in the user's context if we know that only bits from X or Y are demanded. A minimized case is provided below. This change will simplify "t >> 16" into "val1 >> 16". ============================================================= unsigned foo (unsigned val1, unsigned val2) { unsigned t = val1 ^ 1234; return (t >> 16) | t; // NOTE: t is used more than once. } ============================================================= Note that if the "t" were used only once, the expression would be finally optimized as well. However, with this change, the optimization will take place earlier. Reviewed by Nadav, Thanks a lot! llvm-svn: 169317	2012-12-04 22:15:32 +00:00
Nadav Rotem	4f22c83996	Add support for reduction variables when IF-conversion is enabled. llvm-svn: 169288	2012-12-04 18:17:33 +00:00
Nadav Rotem	43d200ded1	Add the last part that is needed for vectorization of if-converted code. Added the code that actually performs the if-conversion during vectorization. We can now vectorize this code: for (int i=0; i<n; ++i) { unsigned k = 0; if (a[i] > b[i]) <------ IF inside the loop. k = k * 5 + 3; a[i] = k; <---- K is a phi node that becomes vector-select. } llvm-svn: 169217	2012-12-04 06:15:11 +00:00
Shuxin Yang	ac685f44b0	rdar://12329730 (2nd part, revised) The type of shift-right (logical or arithmetic) should remain unchanged when transforming "X << C1 >> C2" into "X << (C1-C2)" llvm-svn: 169209	2012-12-04 03:28:32 +00:00
Shuxin Yang	f6948fd368	rdar://12329730 (2nd part) This change tries to simplify E1 = "X >> C1 << C2" into: - E2 = "X << (C2 - C1)" if C2 > C1, or - E2 = "X >> (C1 - C2)" if C1 > C2, or - E2 = X if C1 == C2. Reviewed by Nadav. Thanks! llvm-svn: 169182	2012-12-04 00:04:54 +00:00
Benjamin Kramer	b7100504d2	SROA: Avoid struct and array types early to avoid creating an overly large integer type. Fixes PR14465. Differential Revision: http://llvm-reviews.chandlerc.com/D148 llvm-svn: 169084	2012-12-01 11:53:32 +00:00
Zhou Sheng	b1eda65988	Revert previous check in r168581, r169079 as they are still in code review status. llvm-svn: 169083	2012-12-01 10:54:28 +00:00
Zhou Sheng	f4e0514807	This patch improves the memory footprint of the GlobalOpt pass. Also check in a test case that reproduces the issue, on which 'opt -globalopt' consumes 1.6GB memory. The cause of the large memory footprint is that GlobalOpt currently hoists and stores the leaf element constants into the global array one by one; in each iteration, it recreates the global array initializer constant and leaves the old initializer alone. This may result in many obsolete constants left. For example: we have global array @rom = global [16 x i32] zeroinitializer After the first element value is hoisted and installed: @rom = global [16 x i32] [ 1, 0, 0, ... ] After the second element value is installed: @rom = global [16 x i32] [ 1, 2, 0, 0, ... ] // here the previous initializer is obsolete ... When the transform is done, we have 15 obsolete initializers left useless. llvm-svn: 169079	2012-12-01 04:38:53 +00:00
Evan Cheng	af9b73ef6f	Fix logic to determine whether to turn a switch into a lookup table. When the tables cannot fit in registers (i.e. bitmap), do not emit the table if it's using an illegal type. rdar://12779436 llvm-svn: 168970	2012-11-30 02:02:42 +00:00
Shuxin Yang	a7c032d8b5	rdar://12100355 (part 1) This revision attempts to recognize following population-count pattern: while(a) { c++; ... ; a &= a - 1; ... }, where <c> and <a> could be used multiple times in the loop body. TODO: On X86-64 and ARM, __buildin_ctpop() are not expanded to an efficient instruction sequence, which needs to be improved in the following commits. Reviewed by Nadav, really appreciate! llvm-svn: 168931	2012-11-29 19:38:54 +00:00
Meador Inge	3524aece42	instcombine: Migrate puts optimizations This patch migrates the puts optimizations from the simplify-libcalls pass into the instcombine library call simplifier. All the simplifiers from simplify-libcalls have now been migrated to instcombine. Yay! Just a few other bits to migrate (prototype attribute inference and a few statistics) and simplify-libcalls can finally be put to rest. llvm-svn: 168925	2012-11-29 19:15:17 +00:00
Benjamin Kramer	0bcd999459	Follow up to 168711: It's safe to base this analysis on the found compare, just return the value for the right predicate. Thanks to Andy for catching this. llvm-svn: 168921	2012-11-29 19:07:57 +00:00
Shuxin Yang	fd7c5c30c7	fix a typo llvm-svn: 168909	2012-11-29 18:09:37 +00:00
Meador Inge	95a0f6df53	instcombine: Migrate fputs optimizations This patch migrates the fputs optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168893	2012-11-29 15:45:43 +00:00
Meador Inge	787f51971a	instcombine: Migrate fwrite optimizations This patch migrates the fwrite optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168892	2012-11-29 15:45:39 +00:00
Meador Inge	5553b265a0	instcombine: Migrate fprintf optimizations This patch migrates the fprintf optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168891	2012-11-29 15:45:33 +00:00
Shuxin Yang	106133b571	Instruction::isAssociative() returns true for fmul/fadd if they are tagged "unsafe" mode. Approved by: Eli and Michael. llvm-svn: 168848	2012-11-29 01:47:31 +00:00
Patrik Hägglund	9c1279a58f	Add error handling in getInt. Accordingly, update a testcase with a broken datalayout string. Also, we never parse negative numbers, because '-' is used as a separator. Therefore, use unsigned as result type. llvm-svn: 168785	2012-11-28 12:13:12 +00:00
Hal Finkel	e25b9ebee4	BBVectorize: Correctly merge SubclassOptionalData When two instructions are combined into a vector instruction, the resulting instruction must have the most-conservative flags. llvm-svn: 168765	2012-11-28 03:04:10 +00:00
Meador Inge	4275530cf4	instcombine: Don't replace all uses for instructions with no uses My commit to migrate the printf simplifiers from the simplify-libcalls in r168604 introduced a regression reported by Duncan [1]. The problem is that in some cases the library call simplifier can return a new value that has no uses and the new value's type is different than the old value's type (which is fine because there are no uses). The specific case that triggered the bug looked something like: declare void @printf(i8, ...) ... call void (i8, ...)* @printf(i8* %fmt) Which we want to optimize into: call i32 @putchar(i32 104) However, the code was attempting to replace all uses of the printf with the putchar and the types differ, hence a crash. This is fixed by just deleting the original instruction when there are no uses. The old simplify-libcalls pass is already doing something similar. [1] http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-November/056338.html llvm-svn: 168716	2012-11-27 18:52:49 +00:00
Benjamin Kramer	dd7fb68c76	SCEV: Even if the latch terminator is foldable we can't deduce the result of an unrelated condition with it. Fixes PR14432. llvm-svn: 168711	2012-11-27 18:16:32 +00:00
Meador Inge	9e1b661bc5	Move sprintf simplifier tests to test/Transforms/InstCombine The tests from SPrintF.ll should have been migrated to sprintf-1.ll in r168677, but I forgot to do it. llvm-svn: 168702	2012-11-27 15:35:58 +00:00
Bill Wendling	bdeb3167f1	Remove the dependent libraries feature. The dependent libraries feature was never used and has bit-rotted. Remove it. llvm-svn: 168694	2012-11-27 09:55:56 +00:00
NAKAMURA Takumi	efe26468d2	llvm/test/Transforms/SimplifyLibCalls: FileCheck-ize 3 tests. llvm-svn: 168691	2012-11-27 08:18:23 +00:00
NAKAMURA Takumi	fa7eb216cb	llvm/test/Transforms/SimplifyLibCalls/SPrintF.ll: Handle @sprintf() with -instcombine, not -simplify-libcalls. llvm-svn: 168690	2012-11-27 08:18:15 +00:00
NAKAMURA Takumi	fb7695e133	llvm/test/Transforms/SimplifyLibCalls/SPrintF.ll: Fix datalayout since r168516. llvm-svn: 168689	2012-11-27 08:18:08 +00:00
NAKAMURA Takumi	72c1315003	Trailing linefeeds. llvm-svn: 168688	2012-11-27 08:17:58 +00:00
NAKAMURA Takumi	b929c6efc4	test/Transforms/SimplifyLibCalls/SPrintF.ll: Suppress this for now. r168677 unveiled another failure. FYI, this test makes no sense with "not grep"... I saw "assertion failure" in stderr. llvm-svn: 168679	2012-11-27 06:42:48 +00:00
Meador Inge	7ed4062656	instcombine: Migrate sprintf optimizations This patch migrates the sprintf optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 168677	2012-11-27 05:57:54 +00:00
Michael Ilseman	6f28be8b63	Fast-math test for SimplifyInstruction: fold multiply by 0 Applied the patch, rather than committing it. llvm-svn: 168656	2012-11-27 01:00:22 +00:00
Eli Friedman	dd1df015f4	Get rid of the getPointeeAlignment helper function from InstCombineLoadStoreAlloca.cpp, which had many issues. (At least two bugs were noted on llvm-commits, and it was overly conservative.) Instead, use getOrEnforceKnownAlignment. llvm-svn: 168629	2012-11-26 23:04:53 +00:00
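
As a rough illustration of the loop shape described in r168931 above (`while(a) { c++; ... ; a &= a - 1; ... }`), the C sketch below counts set bits by clearing the lowest set bit on each iteration. The function names `popcount_loop` and `popcount_reference` are hypothetical and do not come from LLVM. The sketch only shows the source-level pattern the commit message refers to, not how the idiom-recognition pass itself is implemented.

```c
#include <stdio.h>

/* Hypothetical example: the population-count loop shape quoted in r168931.
   Each iteration clears the lowest set bit of a, so the loop body runs
   once per set bit. */
static unsigned popcount_loop(unsigned a) {
    unsigned c = 0;
    while (a) {
        c++;
        a &= a - 1; /* clear the lowest set bit */
    }
    return c;
}

/* Hypothetical reference: inspect one bit position at a time. */
static unsigned popcount_reference(unsigned a) {
    unsigned c = 0;
    for (; a; a >>= 1)
        c += a & 1u;
    return c;
}
```

Both functions return the same count for any input; the first is the form the commit message says the pass attempts to recognize.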


3327 Commits