llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	5b9d14b55e	Always normalize spill weights, also for intervals created by spilling. Moderate the weight given to very small intervals. The spill weight given to new intervals created when spilling was not normalized in the same way as the original spill weights calculated by CalcSpillWeights. That meant that restored registers would tend to hang around because they had a much higher spill weight than unspilled registers. This improves the runtime of a few tests by up to 10%, and there are no significant regressions. llvm-svn: 96613	2010-02-18 21:33:05 +00:00
Dan Gohman	34b5cb7deb	Make CodePlacementOpt detect special EH control flow by checking whether AnalyzeBranch disagrees with the CFG directly, rather than looking for EH_LABEL instructions. EH_LABEL instructions aren't always at the end of the block, due to FP_REG_KILL and other things. This fixes an infinite loop compiling MultiSource/Benchmarks/Bullet. llvm-svn: 96611	2010-02-18 21:25:53 +00:00
Chris Lattner	a2e094064f	remove empty file llvm-svn: 96573	2010-02-18 06:29:06 +00:00
Bob Wilson	84fc0200bd	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
Evan Cheng	9af06dfc83	Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" llvm-svn: 96556	2010-02-18 02:13:50 +00:00
Dan Gohman	3cb7dc5912	Don't check for comments, which vary between subtargets. llvm-svn: 96434	2010-02-17 01:08:57 +00:00
Dan Gohman	493a1fcbe0	Don't attempt to divide INT_MIN by -1; consider such cases to have overflowed. llvm-svn: 96428	2010-02-17 00:41:53 +00:00
Chris Lattner	c87f9d6d1a	roundss is an sse 4 thing, fix the test on non-sse41 builders like llvm-gcc-x86_64-darwin10-selfhost llvm-svn: 96417	2010-02-17 00:29:06 +00:00
Dale Johannesen	d147b9a4d4	Make g5 target explicit; scheduling affects register choice. llvm-svn: 96413	2010-02-16 23:25:23 +00:00
Chris Lattner	0d35c68d5c	fix rdar://7653908, a crash on a case where we would fold a load into a roundss intrinsic, producing a cyclic dag. The root cause of this is badness handling ComplexPattern nodes in the old dagisel that I noticed through inspection. Eliminate a copy of the code that handled ComplexPatterns by making EmitChildMatchCode call into EmitMatchCode. llvm-svn: 96408	2010-02-16 22:35:06 +00:00
Dale Johannesen	60d48aef7b	Adjust register numbers in tests to compensate for the new lack of R2. llvm-svn: 96407	2010-02-16 22:31:31 +00:00
Chris Lattner	008f62bfa2	filecheckize llvm-svn: 96404	2010-02-16 22:13:43 +00:00
Evan Cheng	ee44d6a752	Look for SSE and instructions of this form: (and x, (build_vector c1,c2,c3,c4)). If there exists a use of a build_vector that's the bitwise complement of the mask, then transform the node to (and (xor x, (build_vector -1,-1,-1,-1)), (build_vector ~c1,~c2,~c3,~c4)). Since this transformation is only useful when 1) the given build_vector will become a load from constpool, and 2) (and (xor x -1), y) matches to a single instruction, I decided this is appropriate as a x86 specific transformation. rdar://7323335 llvm-svn: 96389	2010-02-16 21:09:44 +00:00
David Greene	c10133139e	Add support for emitting non-temporal stores for DAGs marked non-temporal. Fix from r96241 for botched encoding of MOVNTDQ. Add documentation for !nontemporal metadata. Add a simpler movnt testcase. llvm-svn: 96386	2010-02-16 20:50:18 +00:00
Bob Wilson	94eef3fc13	Fix pr6111: Avoid using the LR register for the target address of an indirect branch in ARM v4 code, since it gets clobbered by the return address before it is used. Instead of adding a new register class containing all the GPRs except LR, just use the existing tGPR class. llvm-svn: 96360	2010-02-16 17:24:15 +00:00
Dan Gohman	d19ecedc40	Split the main for-each-use loop again, this time for GenerateTruncates, as it also peeks at which registers are being used by other uses. This makes LSR less sensitive to use-list order. llvm-svn: 96308	2010-02-16 01:42:53 +00:00
Anton Korobeynikov	dccd240998	Preliminary patch to improve dwarf EH generation - Hooks to return Personality / FDE / LSDA / TType encoding depending on target / options (e.g. code model / relocation model) - MCIzation of Dwarf EH printer to use encoding information - Stub generation for ELF target (needed for indirect references) - Some other small changes here and there llvm-svn: 96285	2010-02-15 22:35:59 +00:00
Jakob Stoklund Olesen	143339a43a	Fix PR6300. A virtual register can be used before it is defined in the same MBB if the MBB is part of a loop. Teach the implicit-def pass about this case. llvm-svn: 96279	2010-02-15 22:03:29 +00:00
Bob Wilson	01e8d35855	Last week we were generating code with duplicate induction variables in this test, but the problem seems to have gone away today. Add a check to make sure it doesn't come back. llvm-svn: 96277	2010-02-15 21:56:40 +00:00
Chris Lattner	2ce5f89c01	remove empty file. llvm-svn: 96271	2010-02-15 21:14:50 +00:00
Chris Lattner	d7470aa340	revert r96241. It breaks two regression tests, isn't documented, and the testcase needs improvement. llvm-svn: 96265	2010-02-15 20:53:01 +00:00
Chris Lattner	a8505609fe	fix PR6305 by handling BlockAddress in a helper function called by jump threading. llvm-svn: 96263	2010-02-15 20:47:49 +00:00
David Greene	ba8bac644b	Add support for emitting non-temporal stores for DAGs marked non-temporal. llvm-svn: 96241	2010-02-15 17:02:56 +00:00
Jakob Stoklund Olesen	0a65533a38	Fix PR6283. When coalescing with a physreg, remember to add imp-def and imp-kill when dealing with sub-registers. Also fix a related bug in VirtRegRewriter where substitutePhysReg may reallocate the operand list on an instruction and invalidate the reg_iterator. This can happen when a register is mentioned twice on the same instruction. llvm-svn: 96072	2010-02-13 02:06:10 +00:00
Bob Wilson	5d66f81412	Besides removing phi cycles that reduce to a single value, also remove dead phi cycles. Adjust a few tests to keep dead instructions from being optimized away. This (together with my previous change for phi cycles) fixes Apple radar 7627077. llvm-svn: 96057	2010-02-13 00:31:44 +00:00
Dale Johannesen	ea96b2974f	When save/restoring CR at prolog/epilog, in a large stack frame, the prolog/epilog code was using the same register for the copy of CR and the address of the save slot. Oops. This is fixed here for Darwin, sort of, by reserving R2 for this case. A better way would be to do the store before the decrement of SP, which is safe on Darwin due to the red zone. SVR4 probably has the same problem, but I don't know how to fix it; there is no red zone and R2 is already used for something else. I'm going to leave it to someone interested in that target. Better still would be to rewrite the CR-saving code completely; spilling each CR subregister individually is horrible code. llvm-svn: 96015	2010-02-12 21:35:34 +00:00
Anton Korobeynikov	c66de6687b	Testcases for recent stdcall / fastcall mangling improvements llvm-svn: 95982	2010-02-12 15:29:13 +00:00
Anton Korobeynikov	7073515c86	Cleanup stdcall / fastcall name mangling. This should fix a lot of problems we saw so far, e.g. PRs 5851 & 2936 llvm-svn: 95980	2010-02-12 15:28:40 +00:00
Dan Gohman	c40eb525ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Bob Wilson	2fd80c3d94	Add a new pass on machine instructions to optimize away PHI cycles that reduce down to a single value. InstCombine already does this transformation but DAG legalization may introduce new opportunities. This has turned out to be important for ARM where 64-bit values are split up during type legalization: InstCombine is not able to remove the PHI cycles on the 64-bit values but the separate 32-bit values can be optimized. I measured the compile time impact of this (running llc on 176.gcc) and it was not significant. llvm-svn: 95951	2010-02-12 01:30:21 +00:00
Jakob Stoklund Olesen	b800ff8ca9	Reapply coalescer fix for better cross-class coalescing. This time with fixed test cases. llvm-svn: 95938	2010-02-11 23:55:29 +00:00
Mon P Wang	c17e781f35	The previous fix of widening divides that trap was too fragile as it depends on custom lowering and requires that certain types exist in ValueTypes.h. Modified widening to check if an op can trap and if so, the widening algorithm will apply only the op on the defined elements. It is safer to do this in widening because the optimizer can't guarantee removing unused ops in some cases. llvm-svn: 95823	2010-02-10 23:37:45 +00:00
Bob Wilson	82d5534acc	Delete dead PHI machine instructions. These can be created due to type legalization even when the IR-level optimizer has removed dead phis, such as when the high half of an i64 value is unused on a 32-bit target. I had to adjust a few test cases that had dead phis. This is a partial fix for Radar 7627077. llvm-svn: 95816	2010-02-10 22:58:57 +00:00
Evan Cheng	8bee7fb61d	Now that ShrinkDemandedOps() is separated out from DAG combine, it sometimes leaves some obvious nops which dag combine used to clean up afterwards, e.g. (trunc (ext n)) -> n. Look for them and squash them. llvm-svn: 95757	2010-02-10 02:17:34 +00:00
Chris Lattner	340fe1f187	move tests that depend on the x86 backend out of codegen/generic, and remove a few old and unreduced ones. Fixes PR5624. llvm-svn: 95656	2010-02-09 06:41:03 +00:00
Chris Lattner	fdb9fda4af	make target independent. llvm-svn: 95655	2010-02-09 06:36:30 +00:00
Chris Lattner	28b79c686d	merge a target-specific add test into x86 directory. llvm-svn: 95654	2010-02-09 06:35:50 +00:00
Chris Lattner	34c420d11b	merge another test in, drop the trivially constant folded cases. llvm-svn: 95653	2010-02-09 06:33:27 +00:00
Chris Lattner	ddb2e5a05c	consolidate and filecheckize two tests. llvm-svn: 95652	2010-02-09 06:24:00 +00:00
Chris Lattner	e669789912	merge two tests, make target independent. llvm-svn: 95651	2010-02-09 06:19:20 +00:00
Chris Lattner	20be5fb012	convert to filecheck. llvm-svn: 95608	2010-02-08 23:47:34 +00:00
Chris Lattner	6162c89bfe	add an x86 implementation of MCTargetExpr for representing @GOT and friends. Use it for personality references as a first use. llvm-svn: 95588	2010-02-08 22:09:08 +00:00
Dan Gohman	56b9ea088b	When CodeGen'ing unoptimized code, there may be unfolded constant expressions in global initializers. Instead of aborting, attempt to fold them on the spot. If folding succeeds, emit the folded expression instead. This fixes PR6255. llvm-svn: 95583	2010-02-08 22:02:38 +00:00
Dan Gohman	f113e5466c	In guaranteed tailcall mode, don't decline the tailcall optimization for blocks ending in "unreachable". llvm-svn: 95565	2010-02-08 20:34:14 +00:00
Evan Cheng	5541068ad3	Run codegen dce pass for all targets at all optimization levels. Previously it was only run for x86 with fastisel. I've found it to be very effective in eliminating some obvious dead code as a result of formal parameter lowering, especially when tail call optimization eliminated the need for some of the loads from fixed frame objects. It also shrinks a number of the tests. A couple of tests no longer make sense and are now eliminated. llvm-svn: 95493	2010-02-06 09:07:11 +00:00
Evan Cheng	c3cfda4e7e	Remove a large test case that (soon will) no longer make sense. llvm-svn: 95492	2010-02-06 09:00:30 +00:00
Rafael Espindola	b0bb1ddfe3	Fix alignment on ppc linux. This fixes the build of crtend.o llvm-svn: 95477	2010-02-06 03:32:21 +00:00
Evan Cheng	de1a4726e6	Do not emit callseq instructions around sibcalls. This eliminated some unnecessary stack adjustments. llvm-svn: 95475	2010-02-06 03:28:46 +00:00
Bob Wilson	1a324958d6	Handle AddrMode6 (for NEON load/stores) in Thumb2's rewriteT2FrameIndex. Radar 7614112. llvm-svn: 95456	2010-02-06 00:24:38 +00:00
Jakob Stoklund Olesen	7b4c60adae	Don't unroll loops containing function calls. llvm-svn: 95454	2010-02-05 23:21:31 +00:00
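
The branch folds named in commit 9af06dfc83 rest on small Boolean identities. The following is a minimal Python sketch, not LLVM code, that checks them exhaustively. It assumes the xor operands are 1-bit values (as branch conditions are in LLVM IR); the function names `check_branch_folds` and `check_and_one_normalization` are hypothetical and exist only for this illustration.

```python
from itertools import product


def check_branch_folds():
    """Exhaustively verify the two xor-based branch folds for 1-bit x and y."""
    for x, y in product((0, 1), repeat=2):
        # br (xor x, y) is taken exactly when x != y
        assert bool(x ^ y) == (x != y)
        # br (xor (xor x, y), 1) is taken exactly when x == y
        assert bool((x ^ y) ^ 1) == (x == y)
    return True


def check_and_one_normalization(width=8):
    """Verify that comparing (X & 1) against 1 can be rewritten as a compare against 0."""
    for value in range(1 << width):
        masked = value & 1
        # (and X, 1) == 1  is equivalent to  (and X, 1) != 0
        assert (masked == 1) == (masked != 0)
        # (and X, 1) != 1  is equivalent to  (and X, 1) == 0
        assert (masked != 1) == (masked == 0)
    return True


if __name__ == "__main__":
    check_branch_folds()
    check_and_one_normalization()
    print("all identities hold")
```

Comparing against zero is what lets the result map onto a flag-setting test instruction, which is the motivation the commit message gives for the normalization.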


2903 Commits
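
Commit 493a1fcbe0 above treats INT_MIN divided by -1 as an overflow. The reason is that the mathematical result, -INT_MIN, is one larger than the largest representable signed value. The following is a rough Python sketch of such a guard, emulating a fixed-width signed type; it is not the code from that commit, and the name `checked_sdiv` and its `bits` parameter are assumptions made for this illustration.

```python
def checked_sdiv(a, b, bits=32):
    """Signed division truncating toward zero on a `bits`-wide integer type.

    Returns (quotient, overflowed). The quotient is None when the
    division overflows. Division by zero raises ZeroDivisionError.
    """
    int_min = -(1 << (bits - 1))
    int_max = (1 << (bits - 1)) - 1
    if not (int_min <= a <= int_max and int_min <= b <= int_max):
        raise ValueError("operand does not fit in the requested width")
    if b == 0:
        raise ZeroDivisionError("division by zero")
    # The only overflowing case: -INT_MIN is not representable.
    if a == int_min and b == -1:
        return None, True
    # Python's // floors, so divide magnitudes and reapply the sign
    # to get truncation toward zero.
    quotient = abs(a) // abs(b)
    if (a < 0) != (b < 0):
        quotient = -quotient
    return quotient, False
```

For example, `checked_sdiv(-2**31, -1)` reports an overflow, while `checked_sdiv(-2**31, 1)` returns the dividend unchanged.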