llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Duncan Sands	a7198342e7	If a function does a volatile load from a global constant, do not consider it to be readonly. In fact, don't even consider it to be readonly if it does a volatile load from an AllocaInst either (it is debatable as to whether readonly would be correct or not in this case; play safe for the moment). This fixes PR8279. llvm-svn: 117783	2010-10-30 12:59:44 +00:00
Bob Wilson	d7f24e831f	Change instcombine's getShuffleMask to represent undef with negative values. This code had previously used 2*N, where N is the mask length, to represent undef. That is not safe because the shufflevector operands may have more than N elements -- they don't have to match the result type. llvm-svn: 117721	2010-10-29 22:03:05 +00:00
Bob Wilson	996353fb5d	Make instcombine a little more aggressive in combining vector shuffles. Allow splats even if they don't match either of the original shuffles, possibly due to undef entries in the shuffles masks. Radar 8597790. Also fix some 80-column violations. llvm-svn: 117719	2010-10-29 22:02:50 +00:00
Owen Anderson	14cf6bfa0f	Update testcase since we're no longer doing the constant forwarding inline with correlated value propagation. llvm-svn: 117712	2010-10-29 21:18:23 +00:00
NAKAMURA Takumi	b89aaebdde	test/Transforms/SimplifyLibCalls/floor.ll: Mark as XFAIL:win32 due to lack of nearbyintf on MSVC. [PR8466] llvm-svn: 117529	2010-10-28 06:46:04 +00:00
Dale Johannesen	454b9243bd	Teach InstCombine not to use Add and Neg on FP. PR 8490. llvm-svn: 117510	2010-10-27 23:45:18 +00:00
Dan Gohman	96e34e87ca	Fix a case where instcombine was stripping metadata (and alignment) from stores when folding in bitcasts. llvm-svn: 117265	2010-10-25 16:16:27 +00:00
Duncan Sands	5b25503aab	Fix PR8445: a block with no predecessors may be the entry block, in which case it isn't unreachable and should not be zapped. The check for the entry block was missing in one case: a block containing a unwind instruction. While there, do some small cleanups: "M" is not a great name for a Function* (it would be more appropriate for a Module*), change it to "Fn"; use Fn in more places. llvm-svn: 117224	2010-10-24 12:23:30 +00:00
Bob Wilson	0290dbe7d4	Teach instcombine to set the alignment arguments for NEON load/store intrinsics. llvm-svn: 117154	2010-10-22 21:41:48 +00:00
Mikhail Glushenkov	0c09a4b97f	GlobalOpt: EvaluateFunction() must not evaluate stores to weak_odr globals. Fixes PR8389. llvm-svn: 116812	2010-10-19 16:47:23 +00:00
Dan Gohman	6aff5b94ff	Make BasicAliasAnalysis a normal AliasAnalysis implementation which does normal initialization and normal chaining. Change the default AliasAnalysis implementation to NoAlias. Update StandardCompileOpts.h and friends to explicitly request BasicAliasAnalysis. Update tests to explicitly request -basicaa. llvm-svn: 116720	2010-10-18 18:04:47 +00:00
Owen Anderson	4373b4516b	Generalize MemCpyOpt's handling of call slot forwarding to function properly when the call slot forwarding is implemented with a load/store pair rather than a memcpy. llvm-svn: 116637	2010-10-15 22:52:12 +00:00
Chris Lattner	27d8b68afa	fix a bug I introduced, no idea how this didn't repro right. llvm-svn: 116462	2010-10-14 00:30:00 +00:00
Chris Lattner	7c5912d186	hack to unbreak buildbots llvm-svn: 116461	2010-10-14 00:26:10 +00:00
Chris Lattner	451a0accb5	add uadd_ov/usub_ov to apint, consolidate constant folding logic to use the new APInt methods. Among other things this implements rdar://8501501 - llvm.smul.with.overflow.i32 should constant fold which comes from "clang -ftrapv", originally brought to my attention from PR8221. llvm-svn: 116457	2010-10-14 00:05:07 +00:00
Kenneth Uildriks	e9771f15f7	Now using a variant of the existing inlining heuristics to decide whether to create a given specialization of a function in PartialSpecialization. If the total performance bonus across all callsites passing the same constant exceeds the specialization cost, we create the specialization. llvm-svn: 116158	2010-10-09 22:06:36 +00:00
Devang Patel	35201e0fd6	Remove LoopIndexSplit pass. It is neither maintained nor used by anyone. llvm-svn: 116004	2010-10-07 23:29:37 +00:00
Owen Anderson	a88628cd72	Now that the profitable bits of EnableFullLoadPRE have been enabled by default, rip out the remainder. Anyone interested in more general PRE would be better served by implementing it separately, to get real anticipation calculation, etc. llvm-svn: 115337	2010-10-01 20:02:55 +00:00
Chris Lattner	bf0f375aba	fix PR8267 - Instcombine shouldn't optimizer away volatile memcpy's. llvm-svn: 115296	2010-10-01 05:51:02 +00:00
Chris Lattner	c131f7d23b	upgrade this test. llvm-svn: 115295	2010-10-01 05:47:16 +00:00
Owen Anderson	5adba2c2ff	We do want to allow LoadPRE to perform LICM-like transformations: we already consider PHI nodes to be negligible for code size (making this transform code size neutral), and it allows us to hoist values out of loops, which is always a good thing. llvm-svn: 115205	2010-09-30 20:53:04 +00:00
Benjamin Kramer	2a44a539e2	Add constant folding for strspn and strcspn to SimplifyLibCalls. llvm-svn: 115116	2010-09-30 00:58:35 +00:00
Benjamin Kramer	476bfb7a10	Add strpbrk folding to SimplifyLibCalls. llvm-svn: 115111	2010-09-29 23:52:12 +00:00
Benjamin Kramer	cec2603ec2	Simplify the loop in StrChrOptimizer. FileCheckize test. llvm-svn: 115095	2010-09-29 22:29:12 +00:00
Benjamin Kramer	75a825ff6b	Teach SimplifyLibCalls how to optimize strrchr. llvm-svn: 115091	2010-09-29 21:50:51 +00:00
Owen Anderson	8e70968a13	Fix PR8247: JumpThreading can cause a block to become unreachable while still having predecessor, if it is part of a self-loop. Because of this, we cannot use the Simplify* APIs, as they can assert-fail on unreachable code. Since it's not easy to determine if a given threading will cause a block to become unreachable, simply defer simplifying simplification to later InstCombine and/or DCE passes. llvm-svn: 115082	2010-09-29 20:34:41 +00:00
Jakob Stoklund Olesen	c9755c5213	Don't try to constant fold libm functions with non-finite arguments. Usually we wouldn't do this anyway because llvm_fenv_testexcept would return an exception, but we have seen some cases where neither errno nor fenv detect an exception on arm-linux. llvm-svn: 114893	2010-09-27 21:29:20 +00:00
Owen Anderson	856fcd57d1	LoadPRE was not properly checking that the load it was PRE'ing post-dominated the block it was being hoisted to. Splitting critical edges at the merge point only addressed part of the issue; it is also possible for non-post-domination to occur when the path from the load to the merge has branches in it. Unfortunately, full anticipation analysis is time-consuming, so for now approximate it. This is strictly more conservative than real anticipation, so we will miss some cases that real PRE would allow, but we also no longer insert loads into paths where they didn't exist before. :-) This is a very slight net positive on SPEC for me (0.5% on average). Most of the benchmarks are largely unaffected, but when it pays off it pays off decently: 181.mcf improves by 4.5% on my machine. llvm-svn: 114785	2010-09-25 05:26:18 +00:00
Jakob Stoklund Olesen	eb9c5129d1	Be more precise when trying to XFAIL this tester: http://google1.osuosl.org:8011/builders/llvm-arm-linux llvm-svn: 114755	2010-09-24 20:34:49 +00:00
Dan Gohman	e96841854e	Attempt to XFAIL this test on arm-linux, which is inexplicably failing. llvm-svn: 114241	2010-09-18 00:04:37 +00:00
Dan Gohman	4487f42592	Fix this test to avoid an "inexact" fold. llvm-svn: 114202	2010-09-17 20:25:43 +00:00
Dan Gohman	0e6744d219	Fix this test so that folding doesn't depend on a potentially "inexact" result. llvm-svn: 114198	2010-09-17 20:15:53 +00:00
Dan Gohman	9dc559bdef	Fix the folding of floating-point math library calls, like sin(infinity), so that it detects errors on platforms where libm doesn't set errno. It's still subject to host libm details though. llvm-svn: 114148	2010-09-17 01:38:06 +00:00
Owen Anderson	37a6d67bd6	Add missing RUN line to this test. llvm-svn: 114106	2010-09-16 18:46:23 +00:00
Owen Anderson	6f3516065f	It is possible, under specific circumstances involving ptrtoint ConstantExpr's, for LVI to end up trying to merge a Constant into a ConstantRange. Handle this conservatively for now, rather than asserting. The testcase is more complex that I would like, but the manifestation of the problem is sensitive to iteration orders and the state of the LVI cache, and I have not been able to reproduce it with manually constructed or simplified cases. Fixes PR8162. llvm-svn: 114103	2010-09-16 18:28:33 +00:00
Owen Anderson	521e8dfef8	Fix PR8161, in which an unreachable loop causes recursive instruction simplification to try to replace an instruction with itself. Add a predicate to the simplifier to prevent this case. llvm-svn: 114097	2010-09-16 17:42:36 +00:00
Chris Lattner	8729e47b8f	fix PR8144, a bug where constant merge would merge globals marked attribute(used). llvm-svn: 113911	2010-09-15 00:30:11 +00:00
Owen Anderson	788b93febd	Remove dead option from tests. llvm-svn: 113855	2010-09-14 21:03:40 +00:00
Chris Lattner	0718ff9be2	fix PR8102, a case where we'd copyValue from a value that we already deleted. Fix this by doing the copyValue's before we delete stuff! The testcase only repros the problem on my system with valgrind. llvm-svn: 113820	2010-09-14 00:19:00 +00:00
Owen Anderson	e305c119b9	Add a reduced testcase for the infinite loop fixed in r113763. llvm-svn: 113770	2010-09-13 18:28:40 +00:00
Owen Anderson	9c34a7831d	Re-apply r113679, which was reverted in r113720, which added a paid of new instcombine transforms to expose greater opportunities for store narrowing in codegen. This patch fixes a potential infinite loop in instcombine caused by one of the introduced transforms being overly aggressive. llvm-svn: 113763	2010-09-13 17:59:27 +00:00
Eric Christopher	d4aaabfa74	Revert 113679, it was causing an infinite loop in a testcase that I've sent on to Owen. llvm-svn: 113720	2010-09-12 06:09:23 +00:00
Owen Anderson	d4ebde12ce	Invert and-of-or into or-of-and when doing so would allow us to clear bits of the and's mask. This can result in increased opportunities for store narrowing in code generation. Update a number of tests for this change. This fixes <rdar://problem/8285027>. Additionally, because this inverts the order of ors and ands, some patterns for optimizing or-of-and-of-or no longer fire in instances where they did originally. Add a simple transform which recaptures most of these opportunities: if we have an or-of-constant-or and have failed to fold away the inner or, commute the order of the two ors, to give the non-constant or a chance for simplification instead. llvm-svn: 113679	2010-09-11 05:48:06 +00:00
Benjamin Kramer	6110efbf8c	Teach InstructionSimplify to fold (A & B) & A -> A & B and (A \| B) \| A -> A \| B. Reassociate does this but it doesn't catch all cases (e.g. if the operands are i1). llvm-svn: 113651	2010-09-10 22:39:55 +00:00
Owen Anderson	db6a08beef	Revert r113439, which relaxed the requirement that loops containing calls cannot be unrolled. After some discussion, there seems to be a better way to achieve the same effect. llvm-svn: 113528	2010-09-09 20:02:23 +00:00
Owen Anderson	956afdd1f2	Relax the "don't unroll loops containing calls" rule. Instead, when a loop contains a call, lower the unrolling threshold to the optimize-for-size threshold. Basically, for loops containing calls, unrolling can still be profitable as long as the loop is REALLY small. llvm-svn: 113439	2010-09-08 23:10:07 +00:00
Owen Anderson	c51d7d1a8d	Generalize instcombine's support for combining multiple bit checks into a single test. Patch by Dirk Steinke! llvm-svn: 113423	2010-09-08 22:16:17 +00:00
Chris Lattner	a58a97dafc	Fix a serious performance regression introduced by r108687 on linux: turning (fptrunc (sqrt (fpext x))) -> (sqrtf x) is great, but we have to delete the original sqrt as well. Not doing so causes us to do two sqrt's when building with -fmath-errno (the default on linux). llvm-svn: 113260	2010-09-07 20:01:38 +00:00
Chris Lattner	f2534b401f	rename test. llvm-svn: 113257	2010-09-07 19:57:06 +00:00
Chris Lattner	6e6a535055	fix PR8067, an over-aggressive assertion in LICM. llvm-svn: 113146	2010-09-06 05:11:24 +00:00

1 2 3 4 5 ...

1906 Commits