llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Evan Cheng	073659986f	Be more careful with insert_subreg and extract_subreg where either source or destination operand has already been coalesced with another register that's defined by a insert_subreg or extract_subreg. llvm-svn: 49843	2008-04-17 07:58:04 +00:00
Evan Cheng	7c2c3333ca	Fix a sub-register indice propagation bug. llvm-svn: 49832	2008-04-17 00:06:42 +00:00
Evan Cheng	e2e899b5c2	Don't forget about sub-register indices when rematting instructions. llvm-svn: 49830	2008-04-16 23:44:44 +00:00
Evan Cheng	44a0a0c8ee	After reading memory that's already freed. llvm-svn: 49810	2008-04-16 20:24:25 +00:00
Evan Cheng	4b16ea6247	Really test what's intended. llvm-svn: 49802	2008-04-16 18:21:55 +00:00
Evan Cheng	6d05ce493b	Rewrite LiveVariable liveness computation. The new implementation is much simplified. It eliminated the nasty recursive routines and removed the partial def / use bookkeeping. There is also potential for performance improvement by replacing the conservative handling of partial physical register definitions. The code is currently disabled until live interval analysis is taught of the name scheme. This patch also fixed a couple of nasty corner cases. llvm-svn: 49784	2008-04-16 09:46:40 +00:00
Dan Gohman	be8f2b452b	Add support for the form of the SSE41 extractps instruction that puts its result in a 32-bit GPR. llvm-svn: 49762	2008-04-16 02:32:24 +00:00
Dan Gohman	cf79877623	Recreate the size SDNode instead of reusing the old one in the x86 memcpy lowering code; this ensures that the size node has the desired result type. This fixes a regression from r49572 with @llvm.memcpy.i64 on x86-32. llvm-svn: 49761	2008-04-16 01:32:32 +00:00
Dan Gohman	7d27552962	Add movd instructions to move from MMX registers to 64-bit GPR registers on x86-64. llvm-svn: 49757	2008-04-15 23:55:07 +00:00
Dan Gohman	3b99b3c807	Treat EntryToken nodes as "passive" so that they aren't added to the ScheduleDAG; they don't correspond to any actual instructions so they don't need to be scheduled. This fixes a bug where the EntryToken was being scheduled multiple times in some cases, though it ended up not causing any trouble because EntryToken doesn't expand into anything. With this fixed the schedulers reliably schedule the expected number of units, so we can check this with an assertion. This requires a tweak to test/CodeGen/X86/loop-hoist.ll because it ends up getting scheduled differently in a trivial way, though it was enough to fool the prcontext+grep that the test does. llvm-svn: 49701	2008-04-15 01:22:18 +00:00
Dan Gohman	cce2b42edc	Upgrade these tests for the current intrinsic prototypes. llvm-svn: 49669	2008-04-14 18:19:18 +00:00
Dale Johannesen	d9a9c746d8	Remove -unwind-tables-optional everywhere, since this is now the default. llvm-svn: 49667	2008-04-14 17:56:54 +00:00
Arnold Schwaighofer	82af0e6a43	This patch corrects the handling of byval arguments for tailcall optimized x86-64 (and x86) calls so that they work (... at least for my test cases). Should fix the following problems: Problem 1: When i introduced the optimized handling of arguments for tail called functions (using a sequence of copyto/copyfrom virtual registers instead of always lowering to top of the stack) i did not handle byval arguments correctly e.g they did not work at all :). Problem 2: On x86-64 after the arguments of the tail called function are moved to their registers (which include ESI/RSI etc), tail call optimization performs byval lowering which causes xSI,xDI, xCX registers to be overwritten. This is handled in this patch by moving the arguments to virtual registers first and after the byval lowering the arguments are moved from those virtual registers back to RSI/RDI/RCX. llvm-svn: 49584	2008-04-12 18:11:06 +00:00
Dan Gohman	15edbf989f	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Dan Gohman	41f9d24d52	Fix a bug that prevented x86-64 from using rep.movsq for 8-byte-aligned data. llvm-svn: 49571	2008-04-12 02:35:39 +00:00
Evan Cheng	6e52146f16	If a PHI node has a single implicit_def source, replace it with an implicit_def instead of a copy. llvm-svn: 49543	2008-04-11 17:54:45 +00:00
Evan Cheng	56ca7e285a	New test. llvm-svn: 49514	2008-04-10 23:49:09 +00:00
Evan Cheng	6f164e3814	A copy instruction may use a register multiple times on some targets. Change them all. llvm-svn: 49491	2008-04-10 18:38:47 +00:00
Chris Lattner	3b289289a7	Fix the x86-64 side of PR2108 by adding a v2f64 version of MOVZQI2PQIrr. This would be better handled as a dag combine (with the goal of eliminating the bitconvert) but I don't know how to do that safely. Thoughts welcome. llvm-svn: 49463	2008-04-10 05:13:43 +00:00
Evan Cheng	1803e20a62	Teach branch folding pass about implicit_def instructions. Unfortunately we can't just eliminate them since register scavenger expects every register use to be defined. However, we can delete them when there are no intra-block uses. Carefully removing some implicit def's which enable more blocks to be optimized away. llvm-svn: 49461	2008-04-10 02:32:10 +00:00
Evan Cheng	def576f9e6	- More aggressively coalescing away copies whose source is defined by an implicit_def. - Added insert_subreg coalescing support. llvm-svn: 49448	2008-04-09 20:57:25 +00:00
Evan Cheng	f35cc57821	Missed a hasInterval check. llvm-svn: 49415	2008-04-09 01:30:15 +00:00
Dale Johannesen	5ac0a0ed21	Rename -disable-required-unwind-tables to -unwind-tables-optional. llvm-svn: 49391	2008-04-08 18:10:08 +00:00
Dale Johannesen	576a7685f2	Missed one. llvm-svn: 49365	2008-04-08 00:14:59 +00:00
Dale Johannesen	3f992b224e	Add -disable-required-unwind-tables to tests that need it (usually, grepping for some string found in unwind info) llvm-svn: 49364	2008-04-08 00:14:17 +00:00
Evan Cheng	6c58f2397d	Fix test. llvm-svn: 49343	2008-04-07 17:02:18 +00:00
Chris Lattner	f88214caca	fix this testcase to pass and remove a duplicate instance of itself. llvm-svn: 49281	2008-04-06 21:39:17 +00:00
Torok Edwin	34e6889671	Prefer to expand mask for xor to -1, so we have a chance to turn it into a not. If it cannot be expanded, it will keep the old behaviour and try to shrink the constant. Part of enhancement for PR2191. llvm-svn: 49280	2008-04-06 21:23:02 +00:00
Evan Cheng	d7d1c94e67	1. IMPLICIT_DEF can re-define any register. 2. Coalescer can now create an interesting situation where a register def can reaches itself without being killed. llvm-svn: 49246	2008-04-05 01:27:09 +00:00
Evan Cheng	4d7b2ab16f	Favors pshufd over shufps when shuffling elements from one vector. pshufd is faster than shufps. llvm-svn: 49244	2008-04-05 00:30:36 +00:00
Evan Cheng	f045d86660	New test case. llvm-svn: 49190	2008-04-03 21:25:03 +00:00
Dale Johannesen	ebfa6edc65	Testcase for EH with functions whose names are stripped. llvm-svn: 49111	2008-04-02 20:16:41 +00:00
Dan Gohman	168b2b1300	Speculatively micro-optimize memory-zeroing calls on Darwin 10. llvm-svn: 49048	2008-04-01 20:38:36 +00:00
Evan Cheng	c2f298f318	More soft fp fixes. llvm-svn: 49016	2008-04-01 02:18:22 +00:00
Evan Cheng	a38ae9c502	Unbreak ARM / Thumb soft FP support. llvm-svn: 49012	2008-04-01 01:50:16 +00:00
Dale Johannesen	d9a5b77269	Mark functions in some tests as 'nounwind'. Generating EH info for these functions causes the tests to fail for random reasons (e.g. looking for 'or' or counting lines with asm-printer; labels count as lines.) llvm-svn: 49003	2008-03-31 23:20:09 +00:00
Evan Cheng	a3ce7b4c76	It's not safe to fold a load from GV stub or constantpool into a two-address use. llvm-svn: 49002	2008-03-31 23:19:51 +00:00
Dan Gohman	f223eaafcd	Fix a DAGCombiner optimization to respect volatile qualification. llvm-svn: 48994	2008-03-31 20:32:52 +00:00
Dan Gohman	227e702cae	Fix a tokenfactor node to use the load chain rather than the load value. This fixes PR2177. llvm-svn: 48932	2008-03-28 23:45:16 +00:00
Evan Cheng	6cbce6b602	Fix a memory bug: increment an iterator of a deleted machine instr. llvm-svn: 48853	2008-03-27 01:27:25 +00:00
Evan Cheng	6fc37c8f25	One more coalescer fix wrt deadness propagation. llvm-svn: 48837	2008-03-26 20:15:49 +00:00
Evan Cheng	8d222d6221	Avoid commuting a def MI in order to coalesce a copy instruction away if any use of the same val# is a copy instruction that has already been coalesced. llvm-svn: 48833	2008-03-26 19:03:01 +00:00
Dale Johannesen	8c1e95810f	Use ## for comment delimiter on darwin x86-32, so llvm's output .s files will go through gcc -std=c99 without triggering preprocesser errors. Approach suggested by Daveed Vandevoorde. llvm-svn: 48808	2008-03-25 23:29:30 +00:00
Evan Cheng	8cb64d8e8b	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48792	2008-03-25 20:08:07 +00:00
Dan Gohman	58ad056286	Add CMP32mr and friends to the load-unfolding table. Among other things, this allows the scheduler to unfold a load operand in the 2008-01-08-SchedulerCrash.ll testcase, so it now successfully clones the comparison to avoid a pushf+popf. llvm-svn: 48777	2008-03-25 16:53:19 +00:00
Tanya Lattner	b6a27ed83f	Byebye llvm-upgrade! llvm-svn: 48762	2008-03-25 04:26:08 +00:00
Evan Cheng	7c1dcd8371	lastRegisterUse() should ignore identity copies. Those will be erased. llvm-svn: 48759	2008-03-25 02:02:19 +00:00
Bill Wendling	2097b72649	Use the bit size of the operand instead of the hard-coded 32 to generate the mask. llvm-svn: 48750	2008-03-24 23:16:37 +00:00
Evan Cheng	dbdf48276a	- SSE4.1 extractfps extracts a f32 into a gr32 register. Very useful! Not. Fix the instruction specification and teaches lowering code to use it only when the only use is a store instruction. llvm-svn: 48746	2008-03-24 21:52:23 +00:00
Dan Gohman	b9c5e6258f	APIntify SelectionDAG's EXTRACT_ELEMENT code. llvm-svn: 48726	2008-03-24 16:38:05 +00:00

1 2 3 4 5 ...

842 Commits