llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Daniel Dunbar	097da598fb	Add InlineCost class for represent the estimated cost of inlining a function. - This explicitly models the costs for functions which should "always" or "never" be inlined. This fixes bugs where such costs were not previously respected. llvm-svn: 58450	2008-10-30 19:26:59 +00:00
Chris Lattner	4af91a146a	Fix PR2967 by not deleting volatile load/stores that occur before unreachable. I don't really see this as being needed, but there is little harm from doing it. llvm-svn: 58385	2008-10-29 17:46:26 +00:00
Dan Gohman	3ceee36545	(A & sext(C)) \| (B & ~sext(C) -> C ? A : B llvm-svn: 58351	2008-10-28 22:38:57 +00:00
Chris Lattner	a57ac16183	no need to print output llvm-svn: 58228	2008-10-27 06:56:35 +00:00
Nick Lewycky	44356e13da	Don't try to create a mask when we don't need one. Fixes a crash. llvm-svn: 58075	2008-10-24 06:14:27 +00:00
Chris Lattner	3048ccc1f9	apply Eli's patch for PR2165 and provide a testcase. llvm-svn: 57625	2008-10-16 05:26:51 +00:00
Dan Gohman	6f40163d83	Teach instcombine's visitLoad to scan back several instructions to find opportunities for store-to-load forwarding or load CSE, in the same way that visitStore scans back to do DSE. Also, define a new helper function for testing whether the addresses of two memory accesses are known to have the same value, and use it in both visitStore and visitLoad. These two changes allow instcombine to eliminate loads in code produced by front-ends that frequently emit obviously redundant addressing for memory references. llvm-svn: 57608	2008-10-15 23:19:35 +00:00
Evan Cheng	591baeed7c	Combine (fcmp cc0 x, y) \| (fcmp cc1 x, y) into a single fcmp when possible. llvm-svn: 57515	2008-10-14 18:44:08 +00:00
Evan Cheng	778b47e6c0	- Somehow I forgot about one / une. - Renumber fcmp predicates to match their icmp counterparts. - Try swapping operands to expose more optimization opportunities. llvm-svn: 57513	2008-10-14 18:13:38 +00:00
Evan Cheng	91528965e7	Optimize anding of two fcmp into a single fcmp if the operands are the same. e.g. uno && ueq -> ueq ord && olt -> olt ord && ueq -> oeq llvm-svn: 57507	2008-10-14 17:15:11 +00:00
Chris Lattner	7a61ef92f5	Fix PR2697 by rewriting the '(X / pos) op neg' logic. This also changes a couple other cases for clarity, but shouldn't affect correctness. Patch by Eli Friedman! llvm-svn: 57387	2008-10-11 22:55:00 +00:00
Devang Patel	255fee3bce	Check loop exit predicate properly while eliminating one iteration loop. This patch fixes PR 2869 llvm-svn: 57369	2008-10-10 22:02:57 +00:00
Devang Patel	378c8633e5	Fix typo, fix PR 2865. llvm-svn: 57221	2008-10-06 23:22:54 +00:00
Matthijs Kooijman	12cd5d041d	Allow scalarrepl to treat an all-zero GEP just as bitcast. This includes not marking a GEP involving a vector as unsafe, but only when it has all zero indices. This allows scalarrepl to work in a few more cases. llvm-svn: 57177	2008-10-06 16:23:31 +00:00
Chris Lattner	107e8f8b60	rewrite bswap matching to be more general, allowing arbitrary shifting and masking inside a bswap expr. This allows it to handle the cases from PR2842, which involve the intermediate 'or' expressions being shifted, not just the input value. llvm-svn: 57095	2008-10-05 02:13:19 +00:00
Duncan Sands	e22385630e	Ignore loads from and stores to local memory (i.e. allocas) when deciding whether to mark a function readnone/readonly. Since the pass is currently run before SROA, this may be quite helpful. Requested by Chris on IRC. llvm-svn: 57050	2008-10-04 13:24:24 +00:00
Nick Lewycky	53e751fcba	Allow the construction of SCEVs with SCEVCouldNotCompute operands, by implementing folding. Fixes PR2857. llvm-svn: 57049	2008-10-04 11:19:07 +00:00
Devang Patel	55401473f9	Nick Lewycky's patch. While hosting instruction check PHI node. llvm-svn: 57025	2008-10-03 18:57:37 +00:00
Nick Lewycky	9e918179c8	Fix misoptimization of: xor i1 (icmp eq (X, C1), icmp s[lg]t (X, C2)) llvm-svn: 56834	2008-09-30 06:08:34 +00:00
Devang Patel	7536edca09	Support inreg, zext and sext as return value attributes. llvm-svn: 56801	2008-09-29 20:49:50 +00:00
Matthijs Kooijman	9fb8d5f5b1	Add a testcase showing that scalarrepl supports first class structs. I originally made this script to show that scalarrepl didn't support them, but it turned out it does. Better to still add the testcase then. llvm-svn: 56781	2008-09-29 10:42:13 +00:00
Devang Patel	47a504c87c	Implement function notes as function attributes. llvm-svn: 56716	2008-09-26 23:51:19 +00:00
Duncan Sands	2c9f6ab06d	Rationalize the names of passes that print information: -callgraph => print-callgraph -callscc => print-callgraph-sccs -cfgscc => print-cfg-sccs -externalfnconstants => print-externalfnconstants -print => print-function -print-alias-sets (no change) -print-callgraph => dot-callgraph -print-cfg => dot-cfg -print-cfg-only => dot-cfg-only -print-dom-info (no change) -printm => print-module -printusedtypes => print-used-types llvm-svn: 56487	2008-09-23 12:47:39 +00:00
Duncan Sands	df9829409c	Add test for improvement of readonly to readnone, and non-demotion of readnone to readonly. llvm-svn: 56344	2008-09-19 09:20:05 +00:00
Duncan Sands	6c31b40cef	Turn on these tests! llvm-svn: 56343	2008-09-19 09:16:32 +00:00
Duncan Sands	c2ff9ca568	Add a new pass AddReadAttrs which works out which functions can get the readnone/readonly attributes, and gives them it. The plan is to remove markmodref (which did the same thing by querying GlobalsModRef) and delete the analogous functionality from GlobalsModRef. llvm-svn: 56341	2008-09-19 08:17:05 +00:00
Duncan Sands	bb48df5839	Test the callgraph directly for the missing edge. llvm-svn: 56338	2008-09-19 08:01:57 +00:00
Devang Patel	1e7ddf5d31	splitLoop does not handle split condition EQ. Fixes PR 2805 llvm-svn: 56321	2008-09-18 23:45:14 +00:00
Devang Patel	88efee2dcc	Do not hoist instruction above branch condition. The instruction may use branch condition. llvm-svn: 56286	2008-09-17 18:21:49 +00:00
Devang Patel	50c22bf69b	Do not ignore iv uses outside the loop. This one slipped through cracks very well. llvm-svn: 56284	2008-09-17 17:53:47 +00:00
Dan Gohman	c598e29a1c	Improve instcombine's handling of integer min and max in two ways: - Recognize expressions like "x > -1 ? x : 0" as min/max and turn them into expressions like "x < 0 ? 0 : x", which is easily recognizable as a min/max operation. - Refrain from folding expression like "y/2 < 1" to "y < 2" when the comparison is being used as part of a min or max idiom, like "y/2 < 1 ? 1 : y/2". In that case, the division has another use, so folding doesn't eliminate it, and obfuscates the min/max, making it harder to recognize as a min/max operation. These benefit ScalarEvolution, CodeGen, and anything else that wants to recognize integer min and max. llvm-svn: 56246	2008-09-16 18:46:06 +00:00
Dan Gohman	0b6d3a9a9b	On 64-bit targets, change 32-bit getelementptr indices to be 64-bit getelementptr indices, inserting an explicit cast if necessary. This helps expose the sign-extension operation to other optimizations. llvm-svn: 56133	2008-09-11 23:06:38 +00:00
Dan Gohman	5e154a591d	Fix a vectorshuffle instcombine bug introduced by r55995. Patch by Nicolas Capens! llvm-svn: 56129	2008-09-11 22:47:57 +00:00
Dan Gohman	ebfb483309	Fix an icmp+sdiv optimization to check for and handle an overflow condition. This fixes PR2740. llvm-svn: 56076	2008-09-10 23:30:57 +00:00
Devang Patel	ed0ea8049f	Remove. llvm-svn: 56018	2008-09-09 21:41:34 +00:00
Devang Patel	7365754925	if loop induction variable is always sign or zero extended then extend the type of induction variable. llvm-svn: 56017	2008-09-09 21:41:07 +00:00
Devang Patel	36ad68421e	fix overflow check. llvm-svn: 56011	2008-09-09 20:54:34 +00:00
Anton Korobeynikov	4676c09dc8	Resolve aliases, when possible llvm-svn: 56001	2008-09-09 19:04:59 +00:00
Dan Gohman	28c911b79b	Make SimplifyDemandedVectorElts simplify vectors with multiple users, and teach it about shufflevector instructions. Also, fix a subtle bug in SimplifyDemandedVectorElts' insertelement code. This is a patch that was originally written by Eli Friedman, with some fixes and cleanup by me. llvm-svn: 55995	2008-09-09 18:11:14 +00:00
Devang Patel	b6380f607d	Fix simplifycfg crash in handing block merge. llvm-svn: 55971	2008-09-09 01:06:56 +00:00
Devang Patel	c60c8e35eb	xfail llvm-svn: 55914	2008-09-08 16:24:30 +00:00
Duncan Sands	62431a29a5	Update the callgraph correctly in ArgumentPromotion. llvm-svn: 55895	2008-09-08 11:07:35 +00:00
Duncan Sands	c4ec7871bf	When PruneEH turned an invoke into an ordinary call (thus changing the call site) it didn't inform the callgraph about this. But the call site does matter - as shown by the testcase, the callgraph become invalid after the inliner ran (with an edge between two functions simply missing), resulting in wrong deductions by GlobalsModRef. llvm-svn: 55872	2008-09-06 17:19:29 +00:00
Nick Lewycky	57cebeaeba	Don't crash when trying to constant fold a vector with some elements that can't be folded. Instead, fail to fold the entire vector. We could also return a vector with some elements folded and some not. If anyone thinks that's a better approach, please speak up! llvm-svn: 55689	2008-09-03 05:54:33 +00:00
Devang Patel	4dfecae8fe	Check iteration count. llvm-svn: 55680	2008-09-03 00:10:56 +00:00
Devang Patel	a1e2066b1d	If all IV uses are extending integer IV then change the type of IV itself, if possible. llvm-svn: 55674	2008-09-02 22:18:08 +00:00
Devang Patel	cda9086d29	respect inline=never and inline=always notes. llvm-svn: 55673	2008-09-02 22:16:13 +00:00
Devang Patel	211c05f7a3	If IV is used in a int-to-float cast inside the loop then try to eliminate the cast operation. llvm-svn: 55374	2008-08-26 17:57:54 +00:00
Chris Lattner	72ee6ebb0a	Fix PR2423 by checking all indices for out of range access, not only indices that start with an array subscript. x->field[10000] is just as bad as (*X)[14][10000]. llvm-svn: 55226	2008-08-23 05:21:06 +00:00
Nick Lewycky	7b87c4d8a4	Revert r54876 r54877 r54906 and r54907. Evan found that these caused a 20% slowdown in bzip2. llvm-svn: 55113	2008-08-21 05:56:10 +00:00

1 2 3 4 5 ...

692 Commits