llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Nadav Rotem	7a79f94aad	Fix pr11193. SHL inserts zeros from the right, thus even when the original sign_extend_inreg value was of 1-bit, we need to sra. llvm-svn: 142724	2011-10-22 12:39:25 +00:00
Nadav Rotem	57f652cfe4	Fix pr11194. When promoting and splitting integers we need to use ZExtPromotedInteger and SExtPromotedInteger based on the operation we legalize. SetCC return type needs to be legalized via PromoteTargetBoolean. llvm-svn: 142660	2011-10-21 17:35:19 +00:00
Chandler Carruth	2f20f63a01	Don't hard code the desired alignment for loops -- it isn't 16-bytes on all x86 systems. Sorry for the breakage. llvm-svn: 142656	2011-10-21 16:41:39 +00:00
Nadav Rotem	52d820c0dd	1. Fix the widening of SETCC in WidenVecOp_SETCC. Use the correct return CC type. 2. Fix a typo in CONCAT_VECTORS which exposed the bug in #1. llvm-svn: 142648	2011-10-21 11:42:07 +00:00
Chandler Carruth	21c689d1ac	Add loop aligning to MachineBlockPlacement based on review discussion so it's a bit more plausible to use this instead of CodePlacementOpt. The code for this was shamelessly stolen from CodePlacementOpt, and then trimmed down a bit. There doesn't seem to be much utility in returning true/false from this pass as we may or may not have rewritten all of the blocks. Also, the statistic of counting how many loops were aligned doesn't seem terribly important so I removed it. If folks would like it to be included, I'm happy to add it back. This was probably the most egregious of the missing features, and now I'm going to start gathering some performance numbers and looking at specific loop structures that have different layout between the two. Test is updated to include both basic loop alignment and nested loop alignment. llvm-svn: 142645	2011-10-21 08:57:37 +00:00
Chandler Carruth	f352d2d7e3	Add a very basic test for MachineBlockPlacement. This is essentially the canonical example I used when developing it, and is one of the primary motivating real-world use cases for __builtin_expect (when burried under a macro). I'm working on more test cases here, but I'm trying to make sure both that the pass is doing the right thing with the test cases and that they aren't too brittle to changes elsewhere in the code generation pipeline. Feedback and/or suggestions on how to test this are very welcome. Especially feedback on whether testing the block comments is a good strategy; I couldn't find any good examples to steal from but all the other ideas I had were a lot uglier or more fragile. llvm-svn: 142644	2011-10-21 08:01:56 +00:00
Craig Topper	fd96157f13	Remove intrinsics for X86 BLSI, BLSMSK, and BLSR intrinsics and replace with custom isel lowering code. llvm-svn: 142642	2011-10-21 06:55:01 +00:00
Chad Rosier	38661ab3ce	Revert 142337. Thumb1 still doesn't support dynamic stack realignment. :( llvm-svn: 142557	2011-10-20 00:07:12 +00:00
Evan Cheng	057c12c2a0	Fix TLS lowering bug. The CopyFromReg must be glued to the TLSCALL. rdar://10291355 llvm-svn: 142550	2011-10-19 22:22:54 +00:00
Nadav Rotem	df65a641dd	Improve code generation for vselect on SSE2: When checking the availability of instructions using the TLI, a 'promoted' instruction IS available. It means that the value is bitcasted to another type for which there is an operation. The correct check for the availablity of an instruction is to check if it should be expanded. llvm-svn: 142542	2011-10-19 20:43:16 +00:00
James Molloy	73a2a8a45e	Use literal pool loads instead of MOVW/MOVT for materializing global addresses when optimizing for size. On spec/gcc, this caused a codesize improvement of ~1.9% for ARM mode and ~4.9% for Thumb(2) mode. This is codesize including literal pools. The pools themselves doubled in size for ARM mode and quintupled for Thumb mode, leaving suggestion that there is still perhaps redundancy in LLVM's use of constant pools that could be decreased by sharing entries. Fixes PR11087. llvm-svn: 142530	2011-10-19 14:11:07 +00:00
Nadav Rotem	05587f317b	Add support for the vector-widening of vselect and vector-setcc llvm-svn: 142488	2011-10-19 09:45:11 +00:00
Craig Topper	b1fa647871	Rename PEXTR to PEXT. Add intrinsics for BMI instructions. llvm-svn: 142480	2011-10-19 07:48:35 +00:00
Lang Hames	03f36ab3f6	Added testcase for <rdar://problem/10215997> llvm-svn: 142462	2011-10-18 23:50:52 +00:00
Nadav Rotem	f9d8f801d9	Add additional element-promotion tests. llvm-svn: 142442	2011-10-18 23:05:33 +00:00
Nadav Rotem	e435b9e2fd	Fix a bug in the legalization of vector anyext-load and trunc-store. Mem Index starts with zero. llvm-svn: 142434	2011-10-18 22:32:43 +00:00
Bob Wilson	f78f688c02	Fix incorrect check for sign-extended constant BUILD_VECTOR. <rdar://problem/10298332> llvm-svn: 142371	2011-10-18 17:34:51 +00:00
Bob Wilson	0273c767c8	Fix a DAG combiner assertion failure when constant folding BUILD_VECTORS. svn r139159 caused SelectionDAG::getConstant() to promote BUILD_VECTOR operands with illegal types, even before type legalization. For this testcase, that led to one BUILD_VECTOR with i16 operands and another with promoted i32 operands, which triggered the assertion. llvm-svn: 142370	2011-10-18 17:34:47 +00:00
Justin Holewinski	361b3c9ff2	PTX: Fix disabling of MAD instruction selection llvm-svn: 142352	2011-10-18 13:39:20 +00:00
Chad Rosier	eb469f466b	Add support for dynamic stack realignment when in thumb1 mode. rdar://10288916 llvm-svn: 142337	2011-10-18 05:28:00 +00:00
Nick Lewycky	c551c1c5f9	Add support for a new extension to the .file directive: .file filenumber "directory" "filename" This removes one join+split of the directory+filename in MC internals. Because bitcode files have independent fields for directory and filenames in debug info, this patch may change the .o files written by existing .bc files. llvm-svn: 142300	2011-10-17 23:05:28 +00:00
Nadav Rotem	b03ef2cd0b	Add CHECKs and document PR11158. llvm-svn: 142240	2011-10-17 20:23:23 +00:00
Nadav Rotem	2d3d696b51	stabalize tests by specifying the exact sse level llvm-svn: 142229	2011-10-17 19:45:38 +00:00
Hal Finkel	d65adcde2d	use FileCheck and not grep in new tests llvm-svn: 142189	2011-10-17 16:01:41 +00:00
Nadav Rotem	b2979db708	Clean the triple, add check lines. llvm-svn: 142183	2011-10-17 07:07:51 +00:00
Nadav Rotem	77004e845a	Previously v2i32 vectors were legalized to v4i32. Now, they are legalized to v2i64. These tests do not check MMX nor zmoving into them. llvm-svn: 142182	2011-10-17 06:59:01 +00:00
Hal Finkel	8be5b30fa8	Test case for CanLowerReturn fix (r141981) llvm-svn: 142172	2011-10-17 04:03:59 +00:00
Hal Finkel	b128cda81b	Add PPC 440 scheduler and some associated tests (new files) llvm-svn: 142171	2011-10-17 04:03:55 +00:00
Nadav Rotem	eeb400322e	Add tripple and stabalize a few more tests. llvm-svn: 142158	2011-10-16 21:20:54 +00:00
Nadav Rotem	d3f24f0fe5	Add triple to tests. llvm-svn: 142154	2011-10-16 20:53:20 +00:00
Nadav Rotem	60c9b81158	fix a typo in the test llvm-svn: 142153	2011-10-16 20:43:41 +00:00
Nadav Rotem	8ed6f090ef	Enable element promotion type legalization by deafault. Changed tests which assumed that vectors are legalized by widening them. llvm-svn: 142152	2011-10-16 20:31:33 +00:00
Nadav Rotem	22398874f8	Remove the the test which checks the saving of a vector of booleans into memory. The decision was to pack the bits. Currently no codegen supports this. Currently, all of the bits in the vector are saved into the same address in memory. llvm-svn: 142149	2011-10-16 19:06:06 +00:00
Nadav Rotem	a733f43137	Fix a bug in LowerV2I64Splat, which generated a BUILD_VECTOR for which there was no pattern. llvm-svn: 142130	2011-10-16 10:02:06 +00:00
Eli Friedman	c8d798d91e	Add missing correctness check to ARMTargetLowering::ReconstructShuffle. Fixes PR11129. llvm-svn: 142022	2011-10-14 23:58:49 +00:00
Jakob Stoklund Olesen	6f0c76740b	Update live-in lists when splitting critical edges. Fixes PR10814. Patch by Jan Sjödin! llvm-svn: 141960	2011-10-14 17:25:46 +00:00
Craig Topper	0a11eb1b21	Add X86 ANDN instruction. Including instruction selection. llvm-svn: 141947	2011-10-14 07:06:56 +00:00
Craig Topper	6b2120a8e1	Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell. llvm-svn: 141939	2011-10-14 03:21:46 +00:00
Jakob Stoklund Olesen	98e48aef6f	Add value numbers when spilling dead defs. When spilling around an instruction with a dead def, remember to add a value number for the def. The missing value number wouldn't normally create problems since there would be an incoming live range as well. However, due to another bug we could spill a dead V_SET0 instruction which doesn't read any values. The missing value number caused an empty live range to be created which is dangerous since it doesn't interfere with anything. This fixes part of PR11125. llvm-svn: 141923	2011-10-14 00:34:31 +00:00
Benjamin Kramer	a705595096	Force CPU type on test so it doesn't accidentally emit movbe instead of bswap on Intel Atom CPUs. llvm-svn: 141863	2011-10-13 14:27:54 +00:00
Kalle Raiskila	15993a5d28	Mark 'branch indirect' instruction as an indirect branch. Not having it confused assembly printing of jumptables. llvm-svn: 141862	2011-10-13 11:40:03 +00:00
Bill Wendling	dcd9c25744	More closely follow libgcc, which has code after the `ret' instruction to release the stack segment and reset the stack pointer. Place the code in its own MBB to make the verifier happy. llvm-svn: 141859	2011-10-13 08:24:19 +00:00
Bill Wendling	2a571af745	Revert r141854 because it was causing failures: http://lab.llvm.org:8011/builders/llvm-x86_64-linux/builds/101 --- Reverse-merging r141854 into '.': U test/MC/Disassembler/X86/x86-32.txt U test/MC/Disassembler/X86/simple-tests.txt D test/CodeGen/X86/bmi.ll U lib/Target/X86/X86InstrInfo.td U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86.td U lib/Target/X86/X86Subtarget.h llvm-svn: 141857	2011-10-13 07:48:07 +00:00
Bill Wendling	a6bf25f30a	Should not add instructions to a BB after a return instruction. The machine instruction verifier doesn't like this, nor do I. llvm-svn: 141856	2011-10-13 07:42:32 +00:00
Craig Topper	eb29e18c9b	Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell. llvm-svn: 141854	2011-10-13 07:09:14 +00:00
Jakob Stoklund Olesen	f38db50fed	Also inflate register classes around inline asm. Now that MI->getRegClassConstraint() can also handle inline assembly, don't bail when recomputing the register class of a virtual register used by inline asm. This fixes PR11078. llvm-svn: 141836	2011-10-12 23:37:40 +00:00
Bill Wendling	60bfb5d707	We need to verify that the machine instruction we're using as a replacement for our current machine instruction defines a register with the same register class as what's being replaced. This showed up in the SPEC 403.gcc benchmark, where it would ICE because a tail call was expecting one register class but was given another. (The machine instruction verifier catches this situation.) <rdar://problem/10270968> llvm-svn: 141830	2011-10-12 23:03:40 +00:00
Jakob Stoklund Olesen	2611cc2520	Fix -widen-vmovs liveness issues. When widening a copy, we are reading a larger register that may not be live. Use an <undef> flag to tell the register scavenger and machine code verifier that we know the value isn't defined. We now widen: %S6<def> = COPY %S4<kill>, %D3<imp-def> into: %D3<def> = VMOVD %D2<undef>, pred:14, pred:%noreg, %S4<imp-use,kill> This also keeps the <kill> flag on %S4 so we don't inadvertently kill a live value in %S5. Finally, ensure that ARMBaseInstrInfo::setExecutionDomain() preserves the <undef> flag when converting VMOVD to VORR. llvm-svn: 141746	2011-10-12 00:06:23 +00:00
Bob Wilson	20cc1d9f51	Make this test more specific. There are 3 stats that matched "machine-licm". llvm-svn: 141741	2011-10-11 23:34:31 +00:00
Eric Christopher	57c57a3260	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Bill Wendling	a1c238d8bf	Revert r141529. This is causing failures in the test-suite, like bigstack and ReedSolomon. Boo... llvm-svn: 141716	2011-10-11 21:40:47 +00:00
Devang Patel	3811b42be0	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141689	2011-10-11 18:09:58 +00:00
Nadav Rotem	6ddbd1308b	Add support for legalization of vector SHL/SRA/SRL instructions llvm-svn: 141667	2011-10-11 14:36:35 +00:00
Craig Topper	f95d9bd513	Test case for X86 LZCNT instruction selection. llvm-svn: 141652	2011-10-11 06:47:01 +00:00
Bill Wendling	2606813218	Test simplification that Ana Pazos noticed. llvm-svn: 141644	2011-10-11 04:43:15 +00:00
NAKAMURA Takumi	00636555f2	test/CodeGen/X86/movbe.ll: Give explicit -mtriple=x86_64-linux, to unbreak win32 hosts. llvm-svn: 141640	2011-10-11 03:41:03 +00:00
Akira Hatanaka	2da85501f4	Test cases for 64-bit load and store instructions. llvm-svn: 141631	2011-10-11 01:52:31 +00:00
Bill Wendling	cb617ae8e0	Add testcase for PR11107. llvm-svn: 141607	2011-10-11 00:26:57 +00:00
Devang Patel	e6091c61c7	Revert r141569 and r141576. llvm-svn: 141594	2011-10-10 23:18:02 +00:00
Bill Wendling	7121342ad5	Reapply r141365 now that PR11107 is fixed. llvm-svn: 141591	2011-10-10 22:59:55 +00:00
Eli Friedman	7188ba35cb	Make sure the X86 backend doesn't explode on 128-bit shuffles in AVX mode. Fixes PR11102. llvm-svn: 141585	2011-10-10 22:28:47 +00:00
Devang Patel	d6f05a47c1	If loop header is also loop exiting block then it may not be safe to hoist instructions. llvm-svn: 141576	2011-10-10 20:32:03 +00:00
Nadav Rotem	38187aec17	Fix 10892 - When lowering SIGN_EXTEND_INREG do not lower v2i64 because the instruction set has no 64-bit SRA support. llvm-svn: 141570	2011-10-10 19:31:45 +00:00
Devang Patel	c149f390c6	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141569	2011-10-10 19:09:20 +00:00
Benjamin Kramer	c498279bec	X86: Add patterns for the movbe instruction (mov + bswap, only available on atom) llvm-svn: 141563	2011-10-10 18:34:56 +00:00
Bill Wendling	7cba44defc	Revert r141365. It was causing MultiSource/Benchmarks/MiBench/consumer-lame to hang, and possibly SPEC/CINT2006/464_h264ref. llvm-svn: 141560	2011-10-10 18:27:30 +00:00
Bill Wendling	94258753c7	When getting the number of bits necessary for addressing mode ARMII::AddrModeT1_s, we need to take into account that if the frame register is ARM::SP, then the number of bits is 8. If it's not ARM::SP, then the number of bits is 5. llvm-svn: 141529	2011-10-10 07:24:23 +00:00
Jakob Stoklund Olesen	b49557d06d	Add TEST8ri_NOREX pseudo to constrain sub_8bit_hi copies. In 64-bit mode, sub_8bit_hi sub-registers can only be used by NOREX instructions. The COPY created from the EXTRACT_SUBREG DAG node cannot target all GR8 registers, only those in GR8_NOREX. TO enforce this, we ensure that all instructions using the EXTRACT_SUBREG are GR8_NOREX constrained. This fixes PR11088. llvm-svn: 141499	2011-10-08 18:28:28 +00:00
Jakob Stoklund Olesen	2817bbdeaf	Add missing test case for r141410. llvm-svn: 141498	2011-10-08 18:06:54 +00:00
Evan Cheng	99b25c827c	High bits of movmskp{s\|d} and pmovmskb are known zero. rdar://10247336 llvm-svn: 141371	2011-10-07 17:21:44 +00:00
Bob Wilson	b55a64ae72	Reenable tail calls for iOS 5.0 and later. llvm-svn: 141370	2011-10-07 17:17:49 +00:00
Bob Wilson	d8856e17fe	Reenable use of divmod compiler_rt functions for iOS 5.0 and later. llvm-svn: 141368	2011-10-07 16:59:21 +00:00
Anton Korobeynikov	0944a4c5cc	Peephole optimization for ABS on ARM. Patch by Ana Pazos! llvm-svn: 141365	2011-10-07 16:15:08 +00:00
Cameron Zwarich	cc5f846d58	Remove a check from ARM shifted operand isel helper methods, which were blocking merging an lsl #2 that has multiple uses on A9. This shift is free, so there is no problem merging it in multiple places. Other unprofitable shifts will not be merged. llvm-svn: 141247	2011-10-05 23:38:50 +00:00
Dan Gohman	779ae47721	Make this test less sensitive to codegen optimizations. llvm-svn: 141195	2011-10-05 18:13:08 +00:00
Akira Hatanaka	0a697c10ae	Move CHECK after entry label. llvm-svn: 141030	2011-10-03 21:24:30 +00:00
Akira Hatanaka	76c268271d	Add support for 64-bit logical NOR. llvm-svn: 141029	2011-10-03 21:23:18 +00:00
Akira Hatanaka	ae50204423	Add support for 64-bit count leading ones and zeros instructions. llvm-svn: 141028	2011-10-03 21:16:50 +00:00
Akira Hatanaka	6c6ff6fde7	Add support for 64-bit divide instructions. llvm-svn: 141024	2011-10-03 21:06:13 +00:00
Akira Hatanaka	e3c1d7eac0	Add support for 64-bit integer multiply instructions. llvm-svn: 141017	2011-10-03 20:01:11 +00:00
Bill Wendling	98db234c02	Filecheck-ize. llvm-svn: 140904	2011-09-30 23:40:29 +00:00
Bill Wendling	252b649025	Add new line at end of file. llvm-svn: 140903	2011-09-30 23:21:11 +00:00
Bill Wendling	fcf3096d9b	When inferring the pointer alignment, if the global doesn't have an initializer and the alignment is 0 (i.e., it's defined globally in one file and declared in another file) it could get an alignment which is larger than the ABI allows for that type, resulting in aligned moves being used for unaligned loads. For instance, in file A.c: struct S s; In file B.c: struct { // something long }; extern S s; void foo() { struct S p = s; // ... } this copy is a 'memcpy' which is turned into a series of 'movaps' instructions on X86. But this is wrong, because 'struct S' has alignment of 4, not 16. llvm-svn: 140902	2011-09-30 23:19:55 +00:00
Akira Hatanaka	5479850400	Remove unnecessary checking of register operands. llvm-svn: 140872	2011-09-30 19:18:24 +00:00
Akira Hatanaka	c9268767d6	Add definitions of Mips64 rotate instructions. llvm-svn: 140870	2011-09-30 18:51:46 +00:00
Jim Grosbach	44047da675	ARM Darwin default relocation model is PIC. This matches clang, so default options in llc and friends are now closer to clang's defaults. llvm-svn: 140863	2011-09-30 17:41:35 +00:00
Akira Hatanaka	ffa28d49f1	Check values of immediate operands. llvm-svn: 140860	2011-09-30 17:19:21 +00:00
Akira Hatanaka	dcd69c2e07	Mips64 shift instructions. llvm-svn: 140841	2011-09-30 03:18:46 +00:00
Akira Hatanaka	4bdb4b1958	Mips64 arithmetic and logical instructions with one source register and immediate. llvm-svn: 140839	2011-09-30 02:08:54 +00:00
Akira Hatanaka	50a4f6d570	Fill delay slot with useful instructions. Modified from Sparc's version of delay slot filler. Patch by Reed Kotler at Mips Technologies. llvm-svn: 140825	2011-09-29 23:52:13 +00:00
Akira Hatanaka	70be05d5d0	Mips64 arithmetic and logical instructions with two source registers. llvm-svn: 140806	2011-09-29 20:37:56 +00:00
Andrew Trick	e1d5ae73ac	LSR: rewrite inner loops only. Rewriting the entire loop nest now requires -enable-lsr-nested. See PR11035 for some performance data. A few unit tests specifically test nested LSR, and are now under a flag. llvm-svn: 140762	2011-09-29 01:33:38 +00:00
Andrew Trick	75c6a45d08	whitespace llvm-svn: 140761	2011-09-29 01:31:48 +00:00
Justin Holewinski	4966d44b44	PTX: Add new patterns for bitconvert and any_extend llvm-svn: 140753	2011-09-29 01:13:12 +00:00
Evan Cheng	1e53900b70	Tighten a ARM dag combine condition to avoid an identity transformation, which ends up introducing a cycle in the DAG. rdar://10196296 llvm-svn: 140733	2011-09-28 23:16:31 +00:00
Eli Friedman	81fc13efd2	PR11033: Make sure we don't generate PCMPGTQ and PCMPEQQ if the target CPU does not support them. llvm-svn: 140723	2011-09-28 21:00:25 +00:00
Justin Holewinski	2f96de340f	PTX: MC-ize the PTX back-end (patch 1 of N) Lay some groundwork for converting to MC-based asm printer. This is the first of probably many patches to bring the back-end back up-to-date with all of the recent MC changes. llvm-svn: 140697	2011-09-28 14:32:04 +00:00
Jakob Stoklund Olesen	2bf243f464	Remove X86-dependent stuff from SSEDomainFix. This also enables domain swizzling for AVX code which required a few trivial test changes. The pass will be moved to lib/CodeGen shortly. llvm-svn: 140659	2011-09-27 23:50:46 +00:00
Justin Holewinski	a50e29abd6	PTX: Add support for sitofp in backend llvm-svn: 140593	2011-09-27 01:04:47 +00:00
Eli Friedman	db69f84663	Last batch of test conversions to new atomic instructions. llvm-svn: 140585	2011-09-27 00:17:29 +00:00
Eli Friedman	3db429c878	Convert a bunch more tests over to the new atomic instructions. llvm-svn: 140582	2011-09-26 23:15:09 +00:00
Eli Friedman	d01fc33809	Convert more tests to new atomic instructions. llvm-svn: 140567	2011-09-26 21:36:10 +00:00
Eli Friedman	6aaaadc188	Convert more tests over to the new atomic instructions. I did not convert Atomics-32.ll and Atomics-64.ll by hand; the diff is autoupgrade output. The wmb test is gone because there isn't any way to express wmb with the new atomic instructions; if someone really needs a non-asm way to write a wmb on Alpha, a platform-specific intrisic could be added. llvm-svn: 140566	2011-09-26 21:30:17 +00:00
Eli Friedman	56e68f7271	Convert more tests over to the new atomic instructions. llvm-svn: 140559	2011-09-26 20:27:49 +00:00
Justin Holewinski	52c50104d7	PTX: Fix detection of stack load/store vs. global load/store, as well as fix the printing of local offsets llvm-svn: 140547	2011-09-26 18:57:22 +00:00
Justin Holewinski	443a122ac3	PTX: Add .align tests to stack object test file llvm-svn: 140537	2011-09-26 16:20:38 +00:00
Justin Holewinski	859dd9fa59	PTX: Fix some lingering issues with stack allocation llvm-svn: 140535	2011-09-26 16:20:34 +00:00
Justin Holewinski	83ae9143fd	PTX: Unify handling of loads/stores llvm-svn: 140533	2011-09-26 16:20:28 +00:00
David Meyer	90ed5fdd4f	Only run tests in test/CodeGen/CBackend/X86 when both X86 and CBackend are supported llvm-svn: 140517	2011-09-26 06:44:27 +00:00
David Meyer	a6e588d80c	PR11004: Inline memcpy to avoid generating nested call sequence. Un-XFAIL 2011-06-09-TailCallByVal and 2010-11-04-BigByval llvm-svn: 140516	2011-09-26 06:13:20 +00:00
Jakob Stoklund Olesen	59b2982dcf	Only run MF.verify() with EXPENSIVE_CHECKS=1. llvm-svn: 140441	2011-09-24 01:11:19 +00:00
Jakob Stoklund Olesen	bc6ae70907	Verify that terminators follow non-terminators. This exposes a -segmented-stacks bug. llvm-svn: 140429	2011-09-23 22:45:39 +00:00
Eli Friedman	a66a438876	PR10998: It is not legal to sink an instruction past the terminator of a block; make sure we don't do that. llvm-svn: 140428	2011-09-23 22:41:57 +00:00
Jakob Stoklund Olesen	ca6877343b	Also match negative offsets for addrmode3 and addrmode5. Math is hard, and isScaledConstantInRange() always returned false for negative constants. It was doing unsigned division of negative numbers before casting back to signed. llvm-svn: 140425	2011-09-23 22:10:33 +00:00
Justin Holewinski	1c0e0dcfbe	PTX: Handle function call return values llvm-svn: 140386	2011-09-23 16:48:41 +00:00
Justin Holewinski	0231798704	PTX: Start fixing function calls llvm-svn: 140378	2011-09-23 14:31:12 +00:00
Eli Friedman	6f0131b3a7	PR10989: Don't print .hidden on Windows. llvm-svn: 140356	2011-09-23 00:13:02 +00:00
Eli Friedman	31c7bde95a	PR10991: make fast-isel correctly check whether accessing a global through an alias involves thread-local storage. (I'm not entirely sure how this is supposed to work, but this patch makes fast-isel consistent with the normal isel path.) llvm-svn: 140355	2011-09-22 23:41:28 +00:00
Dan Gohman	d63418e497	Fix SimplifySelectCC to add newly created nodes to the DAGCombiner worklist, as it may be possible to perform further optimization on them. llvm-svn: 140349	2011-09-22 23:01:29 +00:00
Duncan Sands	1da590b589	Synthesize SSE3/AVX 128 bit horizontal add/sub instructions from floating point add/sub of appropriate shuffle vectors. Does not synthesize the 256 bit AVX versions because they work differently. llvm-svn: 140332	2011-09-22 20:15:48 +00:00
Justin Holewinski	9acce6aa64	PTX: fixup test cases for register changes llvm-svn: 140311	2011-09-22 16:45:51 +00:00
Devang Patel	5d43ab8434	Do not unnecessarily use AT_specification DIE because it does not add any value. Few weeks ago, llvm completely inverted the debug info graph. Earlier each debug info node used to keep track of its compile unit, now compile unit keeps track of important nodes. One impact of this change is that the global variable's do not have any context, which should be checked before deciding to use AT_specification DIE. llvm-svn: 140282	2011-09-21 23:41:11 +00:00
Akira Hatanaka	0c87291a10	Remove +. llvm-svn: 140266	2011-09-21 17:43:48 +00:00
Akira Hatanaka	d987b12b57	Re-enable some of the disabled tests. Use FileCheck instead of grep to check output. llvm-svn: 140263	2011-09-21 17:36:30 +00:00
Nadav Rotem	50430e8160	add another testcase for pr10902 llvm-svn: 140257	2011-09-21 17:13:40 +00:00
Nadav Rotem	af5643de3c	[VECTOR-SELECT] Address one of the bugs in pr10902. Vector SetCC result types need to be type-legalized. This code worked before because scalar result types are known to be legal. llvm-svn: 140249	2011-09-21 14:34:38 +00:00
Eric Christopher	9b721ff19e	Remove llvm-gcc and various compiler handling from llvm. It's not needed here anymore and has been migrated to the test-suite project. llvm-svn: 140216	2011-09-20 23:58:15 +00:00
Bill Wendling	67cf034fe3	This test is completely invalid with the modern EH model. Delete. llvm-svn: 140213	2011-09-20 23:52:09 +00:00
Bruno Cardoso Lopes	1ffbef8ad1	Add a DAGCombine for subvector extracts to remove useless chains of subvector inserts and extracts. Initial patch by Rackover, Zvi with some tweak done by me. llvm-svn: 140204	2011-09-20 23:19:33 +00:00
Bruno Cardoso Lopes	629e7c2410	Revert r140097, working on a better approach llvm-svn: 140203	2011-09-20 23:19:29 +00:00
Evan Cheng	ead45e2ba6	Fix a bug introduced during refactoring a couple of months ago. Cortex-M3 does not support Thumb2 dsp instructions. rdar://10152911. llvm-svn: 140181	2011-09-20 21:38:18 +00:00
NAKAMURA Takumi	595c0c8e15	test/CodeGen/X86/avx-minmax.ll: Unbreak Win32. On Windows x64, 128-bit arguments are not passed by reg but by indirect. eg. maxpd: vmovapd (%rcx), %xmm0 vmaxpd (%rdx), %xmm0, %xmm0 FIXME: I don't care YMM on x64 for now. llvm-svn: 140143	2011-09-20 14:11:35 +00:00
Craig Topper	df17f1cc99	Extend changes from r139986 to produce 256-bit AVX minps/minpd/maxps/maxpd. llvm-svn: 140140	2011-09-20 07:38:59 +00:00
Andrew Trick	53aeb9f663	ARM isel bug fix for adds/subs operands. Modified ARMISelLowering::AdjustInstrPostInstrSelection to handle the full gamut of CPSR defs/uses including instructins whose "optional" cc_out operand is not really optional. This allowed removal of the hasPostISelHook to simplify the .td files and make the implementation more robust. Fixes rdar://10137436: sqlite3 miscompile llvm-svn: 140134	2011-09-20 03:17:40 +00:00
Bruno Cardoso Lopes	bed7ef51b6	Attempt to fix -mtriple=i686-{cygwin\|mingw\|win32} regressions. Nakamura, if this doesn't work, please provide more details. llvm-svn: 140107	2011-09-20 00:08:12 +00:00
Bruno Cardoso Lopes	7cf7f02c3d	Based on the small opt Zvi's patch was trying to achieve, eliminate 128-bit undef subvector insertion into a 256-bit vector llvm-svn: 140097	2011-09-19 23:36:50 +00:00
Eli Friedman	b11676fb4b	Some additional tests for Thumb atomic load and store (which I somehow forgot to commit earlier). llvm-svn: 140074	2011-09-19 22:02:33 +00:00
Bruno Cardoso Lopes	9e5ef44daf	Match X86ISD::FSETCCsd and X86ISD::FSETCCss while in AVX mode. This fix PR10955 and PR10948. llvm-svn: 140069	2011-09-19 21:29:24 +00:00
Nadav Rotem	1cfdc59e94	setOperationAction should be done on the return value of the type, not the operands. llvm-svn: 140001	2011-09-18 14:57:03 +00:00
Nadav Rotem	cfc77bc719	When promoting integer vectors we often create ext-loads. This patch adds a dag-combine optimization to implement the ext-load efficiently (using shuffles). For example the type <4 x i8> is stored in memory as i32, but it needs to find its way into a <4 x i32> register. Previously we scalarized the memory access, now we use shuffles. llvm-svn: 139995	2011-09-18 10:39:32 +00:00
Benjamin Kramer	547157073b	Apply Duncan's test fix from r139986 to the avx version of that test too. llvm-svn: 139992	2011-09-18 00:41:38 +00:00
Duncan Sands	4149334f09	Synthesize x86 max/min instructions also for vectors (i.e. produce maxps and maxpd). This broke the sse41-blend.ll testcase by causing maxpd to be produced rather than a cmp+blend pair, which is the reason I tweaked it. Gives a small speedup on doduc with dragonegg when the GCC vectorizer is used. llvm-svn: 139986	2011-09-17 16:49:39 +00:00
Andrew Trick	10ea51b841	Test case trial and error. Not sure the proper way to check MBB names. llvm-svn: 139900	2011-09-16 03:57:19 +00:00
Andrew Trick	5be06c8057	Reduced a stronger test case for coalescer bug PR10920. llvm-svn: 139898	2011-09-16 03:46:49 +00:00
Eli Friedman	f7bb39b592	Some legalization fixes for atomic load and store. llvm-svn: 139851	2011-09-15 21:20:49 +00:00
Jakob Stoklund Olesen	b36a98d18f	VirtRegMap is counting spill slots, not register spills. Fix the stats counters to reflect that. llvm-svn: 139819	2011-09-15 18:31:13 +00:00
Bruno Cardoso Lopes	8e702bba63	Change all checks regarding the presence of any SSE level to always take into consideration the presence of AVX. This change, together with the SSEDomainFix enabled for AVX, makes AVX codegen to always (hopefully) emit the same code as SSE for 128-bit vector ops. I don't have a testcase for this, but AVX now beats SSE in performance for 128-bit ops in the majority of programas in the llvm testsuite llvm-svn: 139817	2011-09-15 18:27:36 +00:00
Andrew Trick	e5bb7267ff	[regcoalescing] bug fix for RegistersDefinedFromSameValue. An improper SlotIndex->VNInfo lookup was leading to unsafe copy removal. Fixes PR10920 401.bzip2 miscompile with no IV rewrite. llvm-svn: 139765	2011-09-15 01:09:33 +00:00
Nadav Rotem	8e3edccebe	Add integer promotion support for vselect llvm-svn: 139692	2011-09-14 14:42:15 +00:00
Bruno Cardoso Lopes	3e6b9661d1	Vector shuffle mask <i32 4, i32 5, i32 2, i32 3> should yield "movsd", not "movss". llvm-svn: 139686	2011-09-14 02:36:14 +00:00

1 2 3 4 5 ...

5243 Commits