llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 06:22:51 +01:00

Author	SHA1	Message	Date
Duncan Sands	c2b128ad7d	Add a rather pointless InstructionSimplify transform, inspired by recent constant folding improvements: if P points to a type of size zero, turn "gep P, N" into "P". More generally, if a gep index type has size zero, instcombine could replace the index with zero, but that is not done here. llvm-svn: 119942	2010-11-21 13:53:09 +00:00
Bill Wendling	a472e7be70	Add encoding for ARM "trap" instruction. llvm-svn: 119938	2010-11-21 11:05:29 +00:00
Chris Lattner	4aaa7fbb98	implement PR8524, apparently mainline gas accepts movq as an alias for movd when transfering between i64 gprs and mmx regs. llvm-svn: 119931	2010-11-21 08:18:57 +00:00
Chris Lattner	3a0edfb37c	implement PR8576, deleting dead stores with intervening may-alias stores. llvm-svn: 119927	2010-11-21 07:34:32 +00:00
Chris Lattner	32a16bce7a	file checkize llvm-svn: 119926	2010-11-21 07:32:40 +00:00
Chris Lattner	908a01328c	optimize: void a(int x) { if (((1<<x)&8)==0) b(); } into "x != 3", which occurs over 100 times in 403.gcc but in no other program in llvm-test. llvm-svn: 119922	2010-11-21 06:44:42 +00:00
Rafael Espindola	ee6aea622f	Handle PCRel relocations with absolute values. Fixes PR8656. llvm-svn: 119917	2010-11-21 00:48:25 +00:00
Chris Lattner	ba1cc33676	Implement PR8644: forwarding a memcpy value to a byval, allowing the memcpy to be eliminated. Unfortunately, the requirements on byval's without explicit alignment are really weak and impossible to predict in the mid-level optimizer, so this doesn't kick in much with current frontends. The fix is to change clang to set alignment on all byval arguments. llvm-svn: 119916	2010-11-21 00:28:59 +00:00
Andrew Trick	3166f72d7a	Removing the useless test that I added recently. It was meant as an example, but not complicated enough to merit another test. llvm-svn: 119898	2010-11-20 07:26:51 +00:00
Owen Anderson	5ee547b9d5	Add a test for CodeGenPrepare's ability to look through PHI nodes when performing addressing mode folding, introduced in r119853. llvm-svn: 119857	2010-11-19 22:34:53 +00:00
Dale Johannesen	6399550f2f	Prefetch has a MemOperand now. FileCheckize a test. This finishes up 8460971. llvm-svn: 119848	2010-11-19 21:49:38 +00:00
Mon P Wang	4965983b22	Make isScalarToVector to return false if the node is a scalar. This will prevent DAGCombine from making an illegal transformation of bitcast of a scalar to a vector into a scalar_to_vector. llvm-svn: 119819	2010-11-19 19:08:12 +00:00
Kevin Enderby	214e641d8d	Added support for the Mach-O .symbol_resolver directive. rdar://8673046 llvm-svn: 119816	2010-11-19 18:39:33 +00:00
Bill Wendling	50a1812023	Add MC encodings for some Thumb instructions. Test for a few of them. The "bx lr" instruction cannot be tested just yet. It requires matching a "condition code", but adding one of those makes things go south quickly... llvm-svn: 119774	2010-11-19 01:33:10 +00:00
Bill Wendling	ef43273c51	Add support for parsing the writeback ("!") token. llvm-svn: 119761	2010-11-18 23:43:05 +00:00
Owen Anderson	c0d5d13769	More tests. llvm-svn: 119756	2010-11-18 23:30:10 +00:00
Owen Anderson	ca14474db4	Fix encodings for pkhbt, and fix some tests where I accidentally tested ARM mode instead of Thumb2. llvm-svn: 119755	2010-11-18 23:29:56 +00:00
Tanya Lattner	9cac0edef3	Fix bug in DAGCombiner for ARM that was trying to do a ShiftCombine on illegal types (vector should be split first). Added test case. llvm-svn: 119749	2010-11-18 22:06:46 +00:00
Owen Anderson	25dc3a4fe6	More Thumb2 encodings. llvm-svn: 119737	2010-11-18 21:15:19 +00:00
Owen Anderson	eec8c82d32	Fill out the set of Thumb2 multiplication operator encodings. llvm-svn: 119733	2010-11-18 20:32:18 +00:00
Duncan Sands	a61bc1a41a	The DAGCombiner was threading select over pairs of extending loads even if the extension types were not the same. The result was that if you fed a select with sext and zext loads, as in the testcase, then it would get turned into a zext (or sext) of the select, which is wrong in the cases when it should have been an sext (resp. zext). Reported and diagnosed by Sebastien Deldon. llvm-svn: 119728	2010-11-18 20:05:18 +00:00
Duncan Sands	4562d3b919	Factor code for testing whether replacing one value with another preserves LCSSA form out of ScalarEvolution and into the LoopInfo class. Use it to check that SimplifyInstruction simplifications are not breaking LCSSA form. Fixes PR8622. llvm-svn: 119727	2010-11-18 19:59:41 +00:00
Eric Christopher	bc6a51d63f	Rewrite stack callee saved spills and restores to use push/pop instructions. Remove movePastCSLoadStoreOps and associated code for simple pointer increments. Update routines that depended upon other opcodes for save/restore. Adjust all testcases accordingly. llvm-svn: 119725	2010-11-18 19:40:05 +00:00
Owen Anderson	c2db966e5e	Completely rework the datastructure GVN uses to represent the value number to leader mapping. Previously, this was a tree of hashtables, and a query recursed into the table for the immediate dominator ad infinitum if the initial lookup failed. This led to really bad performance on tall, narrow CFGs. We can instead replace it with what is conceptually a multimap of value numbers to leaders (actually represented by a hashtable with a list of Value*'s as the value type), and then determine which leader from that set to use very cheaply thanks to the DFS numberings maintained by DominatorTree. Because there are typically few duplicates of a given value, this scan tends to be quite fast. Additionally, we use a custom linked list and BumpPtr allocation to avoid any unnecessary allocation in representing the value-side of the multimap. This change brings with it a 15% (!) improvement in the total running time of GVN on 403.gcc, which I think is pretty good considering that includes all the "real work" being done by MemDep as well. The one downside to this approach is that we can no longer use GVN to perform simple conditional progation, but that seems like an acceptable loss since we now have LVI and CorrelatedValuePropagation to pick up the slack. If you see conditional propagation that's not happening, please file bugs against LVI or CVP. llvm-svn: 119714	2010-11-18 18:32:40 +00:00
Dan Gohman	ec75e876ab	Add support for PHI-translating sext, zext, and trunc instructions, enabling more PRE. PR8586. llvm-svn: 119704	2010-11-18 17:05:13 +00:00
Chris Lattner	c752718881	remove a pointless restriction from memcpyopt. It was refusing to optimize two memcpy's like this: copy A <- B copy C <- A if it couldn't prove that noalias(B,C). We can eliminate the copy by producing a memmove instead of memcpy. llvm-svn: 119694	2010-11-18 08:00:57 +00:00
Chris Lattner	1000d06bee	filecheckize, this is still not optimal, see PR8643 llvm-svn: 119693	2010-11-18 07:49:32 +00:00
Chris Lattner	6048697a30	allow eliminating an alloca that is just copied from an constant global if it is passed as a byval argument. The byval argument will just be a read, so it is safe to read from the original global instead. This allows us to promote away the %agg.tmp alloca in PR8582 llvm-svn: 119686	2010-11-18 06:41:51 +00:00
Chris Lattner	791e914b1b	enhance the "alloca is just a memcpy from constant global" to ignore calls that obviously can't modify the alloca because they are readonly/readnone. llvm-svn: 119683	2010-11-18 06:26:49 +00:00
Chris Lattner	44ccd4643d	fix a small oversight in the "eliminate memcpy from constant global" optimization. If the alloca that is "memcpy'd from constant" also has a memcpy from it, ignore it: it is a load. We now optimize the testcase to: define void @test2() { %B = alloca %T %a = bitcast %T* @G to i8* %b = bitcast %T* %B to i8* call void @llvm.memcpy.p0i8.p0i8.i64(i8* %b, i8* %a, i64 124, i32 4, i1 false) call void @bar(i8* %b) ret void } previously we would generate: define void @test() { %B = alloca %T %b = bitcast %T* %B to i8* %G.0 = getelementptr inbounds %T* @G, i32 0, i32 0 %tmp3 = load i8* %G.0, align 4 %G.1 = getelementptr inbounds %T* @G, i32 0, i32 1 %G.15 = bitcast [123 x i8]* %G.1 to i8* %1 = bitcast [123 x i8]* %G.1 to i984* %srcval = load i984* %1, align 1 %B.0 = getelementptr inbounds %T* %B, i32 0, i32 0 store i8 %tmp3, i8* %B.0, align 4 %B.1 = getelementptr inbounds %T* %B, i32 0, i32 1 %B.12 = bitcast [123 x i8]* %B.1 to i8* %2 = bitcast [123 x i8]* %B.1 to i984* store i984 %srcval, i984* %2, align 1 call void @bar(i8* %b) ret void } llvm-svn: 119682	2010-11-18 06:20:47 +00:00
Chris Lattner	c1e63bb987	filecheckize llvm-svn: 119681	2010-11-18 06:16:43 +00:00
Rafael Espindola	93a07b464e	Change CodeGen to use .loc directives. This produces a lot more readable output and testing is easier. A good example is the unknown-location.ll test that now can just look for ".loc 1 0 0". We also don't use a DW_LNE_set_address for every address change anymore. llvm-svn: 119613	2010-11-18 02:04:25 +00:00
Dale Johannesen	06f479d543	Do not throw away alignment when generating the DAG for memset; we may need it to decide between MOVAPS and MOVUPS later. Adjust a test that was looking for wrong code. PR 3866 / 8675131. llvm-svn: 119605	2010-11-18 01:35:23 +00:00
Owen Anderson	b7970b2c6a	Try again at providing Thumb2 encodings for basic multiplication operators. llvm-svn: 119601	2010-11-18 01:08:42 +00:00
John Thompson	8c52bd2004	Fixed to use input redirection for source - to eliminate .s output. llvm-svn: 119599	2010-11-18 00:50:20 +00:00
Owen Anderson	e8906ba112	Revert r119593 while I figure out my testing disagrees with the buildbot. llvm-svn: 119597	2010-11-18 00:42:51 +00:00
Owen Anderson	47a64ab90c	Provide correct Thumb2 encodings for basic multiplication operators. llvm-svn: 119593	2010-11-18 00:19:10 +00:00
John Thompson	b33f935bc3	Bug 8621 fix - pointer cast stripped from inline asm constraint argument. llvm-svn: 119590	2010-11-17 23:58:47 +00:00
Wesley Peck	890a8357af	Now that the MBlaze backend is in its own directory, split the test cases into multiple files for different types of instructions. llvm-svn: 119580	2010-11-17 22:54:43 +00:00
Owen Anderson	ea6ac4cdff	Second attempt at correct encodings for Thumb2 bitfield instructions. llvm-svn: 119575	2010-11-17 22:16:31 +00:00
Dale Johannesen	a87210e350	These tests are looking for library function names that appear to differ on Linux. Try to make them pass on Linux. Would be good for a Linux person to review this. llvm-svn: 119572	2010-11-17 21:57:32 +00:00
Bob Wilson	a217a6e40b	Change ARMGlobalMerge to keep BSS globals in separate pools. This completes the fixes for Radar 8673120. llvm-svn: 119566	2010-11-17 21:25:39 +00:00
Bob Wilson	819660b716	Fix ARMGlobalMerge pass to check if globals are entirely within range. It is generally not sufficient to check if the starting offset is in range of the maximum offset that can be efficiently used for the target. llvm-svn: 119565	2010-11-17 21:25:36 +00:00
Bob Wilson	49bf8702f0	Change the symbol for merged globals from "merged" to "_MergedGlobals". This makes it more clear that the symbol is an internal, compiler-generated name and gives a little more description about its contents. llvm-svn: 119564	2010-11-17 21:25:33 +00:00
Bob Wilson	fd7399b6a6	Fix the ARMGlobalMerge pass to look at variable sizes instead of pointer sizes. It was mistakenly looking at the pointer type when checking for the size of global variables. This is a partial fix for Radar 8673120. llvm-svn: 119563	2010-11-17 21:25:27 +00:00
Owen Anderson	2adebbb603	Revert r119551, which broke buildbots. llvm-svn: 119555	2010-11-17 20:48:51 +00:00
Owen Anderson	c7750780fc	Provide Thumb2 encodings for bitfield instructions. llvm-svn: 119551	2010-11-17 20:35:29 +00:00
Evan Cheng	ce610bd6b3	Remove ARM isel hacks that fold large immediates into a pair of add, sub, and, and xor. The 32-bit move immediates can be hoisted out of loops by machine LICM but the isel hacks were preventing them. Instead, let peephole optimization pass recognize registers that are defined by immediates and the ARM target hook will fold the immediates in. Other changes include 1) do not fold and / xor into cmp to isel TST / TEQ instructions if there are multiple uses. This happens when the 'and' is live out, machine sink would have sinked the computation and that ends up pessimizing code. The peephole pass would recognize situations where the 'and' can be toggled to define CPSR and eliminate the comparison anyway. 2) Move peephole pass to after machine LICM, sink, and CSE to avoid blocking important optimizations. rdar://8663787, rdar://8241368 llvm-svn: 119548	2010-11-17 20:13:28 +00:00
Owen Anderson	d88cfe5453	More miscellaneous Thumb2 encodings. llvm-svn: 119546	2010-11-17 19:57:38 +00:00
Benjamin Kramer	1b330efb46	InstCombine: Add a missing irem identity (X % X -> 0). llvm-svn: 119538	2010-11-17 19:11:46 +00:00

1 2 3 4 5 ...

11543 Commits