llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Bruno Cardoso Lopes	9e5ef44daf	Match X86ISD::FSETCCsd and X86ISD::FSETCCss while in AVX mode. This fixes PR10955 and PR10948. llvm-svn: 140069	2011-09-19 21:29:24 +00:00
Nadav Rotem	a6af03c6fb	Fix typos in my prev commit, found by Tobi. llvm-svn: 140003	2011-09-18 19:00:23 +00:00
Nadav Rotem	1cfdc59e94	setOperationAction should be done on the return value of the type, not the operands. llvm-svn: 140001	2011-09-18 14:57:03 +00:00
Nadav Rotem	cfc77bc719	When promoting integer vectors we often create ext-loads. This patch adds a dag-combine optimization to implement the ext-load efficiently (using shuffles). For example the type <4 x i8> is stored in memory as i32, but it needs to find its way into a <4 x i32> register. Previously we scalarized the memory access, now we use shuffles. llvm-svn: 139995	2011-09-18 10:39:32 +00:00
Craig Topper	c5a97d12bb	Fix typo by changing Lower256IntVETCC to Lower256IntVSETCC. llvm-svn: 139993	2011-09-18 08:03:58 +00:00
Duncan Sands	4149334f09	Synthesize x86 max/min instructions also for vectors (i.e. produce maxps and maxpd). This broke the sse41-blend.ll testcase by causing maxpd to be produced rather than a cmp+blend pair, which is the reason I tweaked it. Gives a small speedup on doduc with dragonegg when the GCC vectorizer is used. llvm-svn: 139986	2011-09-17 16:49:39 +00:00
Bruno Cardoso Lopes	f611f6c371	Describe more AVX 128-bit convert instructions without patterns to have mayLoad = 1 llvm-svn: 139973	2011-09-16 23:41:29 +00:00
Bruno Cardoso Lopes	396b8136bf	Add mayLoad attribute to AVX convert instructions, since none of them are declared with load patterns. This fixes the crash in PR10941. No testcases, since a fold is triggered and then converted back to the register form afterwards. llvm-svn: 139953	2011-09-16 22:02:14 +00:00
Bruno Cardoso Lopes	a60e62ad02	Fix PR10884. This PR basically reports a problem where a crash in generated code happened due to %rbp being clobbered: pushq %rbp movq %rsp, %rbp .... vmovmskps %ymm12, %ebp .... movq %rbp, %rsp popq %rbp ret Since Eric's r123367 commit, the default stack alignment for x86 32-bit has changed to be 16-bytes. Since then, the MaxStackAlignmentHeuristicPass hasn't been really used, but with AVX it becomes useful again, since per ABI compliance we don't always align the stack to 256-bit, but only when there are 256-bit incoming arguments. ReserveFP was only used by this pass, but there's no RA target hook that uses getReserveFP() to check for the presence of FP (since nothing was triggering the pass to run, the uses of getReserveFP() were removed through time without being noticed). Change this pass to use setForceFramePointer, which is properly called by MachineFunction hasFP method. The testcase is very big and dependent on RA, not sure if it's worth adding to test/CodeGen/X86. llvm-svn: 139939	2011-09-16 20:58:28 +00:00
Owen Anderson	e54c4beb5a	Don't attach annotations to MCInst's. Instead, have the disassembler return, and the printer accept, an annotation string which can be passed through if the client cares about annotations. llvm-svn: 139876	2011-09-15 23:38:46 +00:00
Bruno Cardoso Lopes	1465f4d334	Add a fixme note! llvm-svn: 139872	2011-09-15 23:04:24 +00:00
Bruno Cardoso Lopes	7ad9ea026a	Add the remaining AVX versions of instructions to X86InstrInfo, this time for describing high latency ones and for recognizing loads from the same base pointer llvm-svn: 139864	2011-09-15 22:15:52 +00:00
Bruno Cardoso Lopes	901f6ff218	Factor out partial register update checks for some SSE instructions. Also add the AVX versions and add comments! llvm-svn: 139854	2011-09-15 21:42:23 +00:00
Owen Anderson	84d4e5d0e2	Add support for stored annotations to MCInst, and provide facilities for MC-based InstPrinters to print them out. Enhance the ARM and X86 InstPrinter's to do so in verbose mode. llvm-svn: 139820	2011-09-15 18:36:29 +00:00
Bruno Cardoso Lopes	8e702bba63	Change all checks regarding the presence of any SSE level to always take into consideration the presence of AVX. This change, together with the SSEDomainFix enabled for AVX, makes AVX codegen always (hopefully) emit the same code as SSE for 128-bit vector ops. I don't have a testcase for this, but AVX now beats SSE in performance for 128-bit ops in the majority of programs in the llvm testsuite llvm-svn: 139817	2011-09-15 18:27:36 +00:00
Bruno Cardoso Lopes	0fa8b71a55	Enable SSEDomainFix pass for AVX mode. llvm-svn: 139816	2011-09-15 18:27:32 +00:00
Eli Friedman	7cb90dcbce	Fix the code creating VZEXT_LOAD so that it creates the right memoperand. Issue spotted in -debug output. I can't think of any practical effects at the moment, but it might matter if we start doing more aggressive alias analysis in CodeGen. llvm-svn: 139758	2011-09-14 23:42:45 +00:00
Craig Topper	60719c7bfb	Fix mem type for VEX.128 form of VROUNDP*. Remove filter preventing VROUND from being recognized by disassembler. llvm-svn: 139691	2011-09-14 06:41:26 +00:00
Craig Topper	25e81ae604	Make disassembling of VBLEND* print immediate as a XMM/YMM register name. Fixes PR10917. llvm-svn: 139690	2011-09-14 05:55:28 +00:00
Bruno Cardoso Lopes	27a7ace4b4	Teach the foldable tables about 128-bit AVX instructions and make the alignment check for 256-bit classes more strict. There are no testcases but we catch more folding cases for AVX while running single and multi sources in the llvm testsuite. Since some 128-bit AVX instructions have a different number of operands than their SSE counterparts, they are placed in different tables. 256-bit AVX instructions should also be added in the table soon. And there are a few more 128-bit versions to be handled, which should come in the following commits. llvm-svn: 139687	2011-09-14 02:36:58 +00:00
Bruno Cardoso Lopes	3e6b9661d1	Vector shuffle mask <i32 4, i32 5, i32 2, i32 3> should yield "movsd", not "movss". llvm-svn: 139686	2011-09-14 02:36:14 +00:00
Nadav Rotem	f1730712f7	swap vselect operand order - pr10907 llvm-svn: 139630	2011-09-13 19:56:38 +00:00
Bruno Cardoso Lopes	f02589db47	Add 256-bit versions of alignedstore and alignedload, to be more strict about the alignment checking. This was found by inspection and I don't have any testcases so far, although the llvm testsuite runs without any problem. llvm-svn: 139625	2011-09-13 19:33:03 +00:00
Bruno Cardoso Lopes	6f299a4937	Revert the remaining part of r139528. According to PR10907 the bug seems to be in the VSELECT operands order, so I'll leave the fix for Nadav. llvm-svn: 139624	2011-09-13 19:33:00 +00:00
Nadav Rotem	60df99b809	Add vselect target support for targets that do not support blend but do support xor/and/or (For example SSE2). llvm-svn: 139623	2011-09-13 19:17:42 +00:00
Craig Topper	0f36afb30c	Only disassemble instructions with vvvv != 1111 if the instruction actually uses the vvvv field to encode an operand. Fixes PR10851. llvm-svn: 139591	2011-09-13 07:37:44 +00:00
Craig Topper	03c833ff84	Remove filter that was preventing MOVDQU/MOVDQA and their VEX forms from being disassembled. Also added encodings for the other register/register form of these instructions. Fixes PR10848. llvm-svn: 139588	2011-09-13 06:54:58 +00:00
Craig Topper	6eeb5396f8	Fix encoding of VMOVDQU to not simultaneously be 'TB OpSize' and 'XS'. 'XS' is correct and seems to have been taking priority. llvm-svn: 139587	2011-09-13 06:39:34 +00:00
Eli Friedman	34ffc961d7	Fix the assembler strings for a couple of atomic instructions. Doesn't really matter much in practice, but it's a bit cleaner. llvm-svn: 139563	2011-09-13 00:27:04 +00:00
Bruno Cardoso Lopes	a4d2bdfa40	Fix PR10845. SUBREG_TO_REG shouldn't be used when the input and destination types are equal! llvm-svn: 139553	2011-09-12 22:59:23 +00:00
Bruno Cardoso Lopes	64e2e852f9	Revert the wrong part of r139528, and fix testcases. llvm-svn: 139541	2011-09-12 21:24:07 +00:00
Bruno Cardoso Lopes	c67e996fc3	Not sure how CMPPS and CMPPD had already ever worked, I guess it didn't. However with this fix it does now. Basically the operand order for the x86 target specific node is not the same as the instruction, but since the intrinsics need that specific order at the instruction definition, just change the order during legalization. Also, there were some wrong inversions of condition codes, such as GE => LE, GT => LT, fix that too. Fix PR10907. llvm-svn: 139528	2011-09-12 19:30:40 +00:00
Bruno Cardoso Lopes	e2fc394ed2	Organize a bit the operand names for CMPPS and CMPPD llvm-svn: 139527	2011-09-12 19:30:36 +00:00
Bruno Cardoso Lopes	fc1c90ac48	Realign BLEND patterns to match the general style for patterns in .td file. llvm-svn: 139526	2011-09-12 19:30:33 +00:00
Bruno Cardoso Lopes	f0e65e0f13	Fix 80-columns llvm-svn: 139525	2011-09-12 19:30:29 +00:00
Nadav Rotem	06ce2ac074	Format patterns, remove unused X86blend patterns llvm-svn: 139491	2011-09-12 08:41:50 +00:00
Craig Topper	5ffd0cb080	Fix disassembling of one of the register/register forms of MOVUPS/MOVUPD/MOVAPS/MOVAPD/MOVSS/MOVSD and their VEX equivalents. Fixes PR10877. llvm-svn: 139486	2011-09-11 23:19:54 +00:00
Craig Topper	a9b27eecc9	Fix disassembling of reverse register/register forms of ADD/SUB/XOR/OR/AND/SBB/ADC/CMP/MOV. llvm-svn: 139485	2011-09-11 21:41:45 +00:00
Nadav Rotem	abb5bb41d4	CR fixes per Bruno's request. Undo the changes from r139285 which added custom lowering to vselect. Add tablegen lowering for vselect. llvm-svn: 139479	2011-09-11 15:02:23 +00:00
Eli Friedman	c79e318f02	r139454 activates an assert in a case where we were doing the right thing anyway. Make that explicit, and un-XFAIL the testcase. llvm-svn: 139458	2011-09-10 02:01:42 +00:00
Richard Trieu	8b6890f67e	Fix the asserts in lib/Target/X86/X86ELFWriterInfo.cpp and lib/ExecutionEngine/MCJIT/MCJIT.cpp from: assert("error"); to: assert(0 && "error"); llvm-svn: 139456	2011-09-10 01:42:07 +00:00
Richard Trieu	0485e133f2	Fixed an assert from: assert("not implemented for target shuffle node"); to: assert(0 && "not implemented for target shuffle node"); This causes a test failure in CodeGen/X86/palignr.ll which has been marked as XFAIL for the time being. Test failure filed at PR10901. llvm-svn: 139454	2011-09-10 01:26:21 +00:00
Nadav Rotem	ccb46031e6	Implement vector-select support for avx256. Refactor the vblend implementation to have tablegen match the instruction by the node type llvm-svn: 139400	2011-09-09 20:29:17 +00:00
Craig Topper	18cbd5db26	Fix handling of Intel syntax disassembling of movs and stos to stop being blank. Also fixed scas, and cmps to always print size suffix in Intel syntax since it's ambiguous without arguments. Fixes PR10875. llvm-svn: 139353	2011-09-09 05:40:53 +00:00
Nadav Rotem	2f256b7f9f	Fix the 80-columns and remove unsupported v8i16 type from the list of legal vselect types. llvm-svn: 139324	2011-09-08 22:17:35 +00:00
Bruno Cardoso Lopes	54962ac233	Add a AVX version of a simple i64 -> f64 bitcast. This could be triggered using llc with -O0, which wouldn't let it be folded and expose the lack of this pattern. llvm-svn: 139320	2011-09-08 21:52:33 +00:00
Bruno Cardoso Lopes	2f07ca9728	* Combines Alignment, AuxInfo, and TB_NOT_REVERSABLE flag into a single field (Flags), which is a bitwise OR of items from the TB_* enum. This makes it easier to add new information in the future. * Gives every static array an equivalent layout: { RegOp, MemOp, Flags } * Adds a helper function, AddTableEntry, to avoid duplication of the insertion code. * Renames TB_NOT_REVERSABLE to TB_NO_REVERSE. * Adds TB_NO_FORWARD, which is analogous to TB_NO_REVERSE, except that it prevents addition of the Reg->Mem entry. (This is going to be used by Native Client, in the next CL). Patch by David Meyer llvm-svn: 139311	2011-09-08 18:35:57 +00:00
Bruno Cardoso Lopes	74a67e22b0	Add AVX versions of blend vector operations and fix some issues noticed in Nadav's r139285 and r139287 commits. 1) Rename vsel.ll to a more descriptive name 2) Change the order of BLEND operands to "Op1, Op2, Cond", this is necessary because PBLENDVB is already used in different places with this order, and it was being emitted in the wrong way for vselect 3) Add AVX patterns and tests for the same SSE41 instructions llvm-svn: 139305	2011-09-08 18:05:08 +00:00
Bruno Cardoso Lopes	84c53e3965	Fix PR10844: Add patterns to cover non foldable versions of X86vzmovl. Triggered using llc -O0. Also fix some SET0PS patterns to their AVX forms and test it on the testcase. llvm-svn: 139304	2011-09-08 18:05:02 +00:00
Nadav Rotem	b461f2190e	Add X86-SSE4 codegen support for vector-select. llvm-svn: 139285	2011-09-08 08:11:19 +00:00
Eli Friedman	9ea5599729	Fix atomic load and store on x86 to pass -verify-machineinstrs (and possibly fix some subtle bugs involving passes which check mayStore()). This isn't exactly ideal, but it is good enough for the moment. llvm-svn: 139245	2011-09-07 18:48:32 +00:00
James Molloy	f781d3d8e9	Refactor instprinter and mcdisassembler to take a SubtargetInfo. Add -mattr= handling to llvm-mc. Reviewed by Owen Anderson. llvm-svn: 139237	2011-09-07 17:24:38 +00:00
Rafael Espindola	1cca4f99bd	Detect attempt to use segmented stacks on non ELF systems and error (not assert) early. llvm-svn: 139233	2011-09-07 16:10:57 +00:00
Bill Wendling	763ed58408	Reenable compact unwind by default. However, also emit the old version of unwind information for older linkers. llvm-svn: 139206	2011-09-06 23:47:14 +00:00
Rafael Espindola	9182560b8f	Fix comment. Noticed by Duncan. llvm-svn: 139161	2011-09-06 19:29:31 +00:00
Duncan Sands	d1311488fe	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Rafael Espindola	9d9df4bc1a	Fix style issues and typos found by Duncan. llvm-svn: 139154	2011-09-06 18:43:08 +00:00
Duncan Sands	6939ae53ac	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Nick Lewycky	9b5a242546	Add a new MC bit for NaCl (Native Client) mode. NaCl requires that certain instructions are more aligned than the CPU requires, and adds some additional directives, to follow in future patches. Patch by David Meyer! llvm-svn: 139125	2011-09-05 21:51:43 +00:00
Benjamin Kramer	902004dcd8	Use internal storage for command line option. llvm-svn: 139079	2011-09-03 03:45:06 +00:00
Bruno Cardoso Lopes	02157d584a	Add AVX versions to match AESENC/AESDEC intrinsics. This hopefully ends the cycle of missing AVX counterparts of already present SSE* patterns llvm-svn: 139073	2011-09-03 00:47:08 +00:00
Bruno Cardoso Lopes	c72ce24240	Add AVX version of a SSE4.1 VPBLENDVB pattern llvm-svn: 139072	2011-09-03 00:47:05 +00:00
Bruno Cardoso Lopes	a25fc6f941	Add AVX versions of SSE4.1 EXTRACTPS patterns llvm-svn: 139071	2011-09-03 00:47:03 +00:00
Bruno Cardoso Lopes	45d02d5eca	Add AVX versions for SSE4.1 MOVZX* patterns llvm-svn: 139070	2011-09-03 00:47:01 +00:00
Bruno Cardoso Lopes	cadec3711c	Add one more AVX pattern for MOVZPQILo2PQI llvm-svn: 139069	2011-09-03 00:46:58 +00:00
Bruno Cardoso Lopes	48eeb79003	Move PUNPCKLQDQ splat pattern close to the instruction definition and duplicate it for AVX mode. llvm-svn: 139068	2011-09-03 00:46:56 +00:00
Bruno Cardoso Lopes	ca90af60bd	Add AVX pattern versions for PSHUFB,PSIGN{B,W,D} llvm-svn: 139067	2011-09-03 00:46:54 +00:00
Bruno Cardoso Lopes	7fae5ca308	Add AVX versions of MOVZDI2PDI patterns. Use SUBREG_TO_REG to indicate that the AVX versions (even the 128-bit ones) all clear the upper part of the destination register. llvm-svn: 139066	2011-09-03 00:46:51 +00:00
Bruno Cardoso Lopes	e749426ece	Enforce subtarget checks in a few places to be explicit when the pattern should be matched llvm-svn: 139065	2011-09-03 00:46:49 +00:00
Bruno Cardoso Lopes	323a5b334e	Tidy up code moving patterns to their appropriate place! llvm-svn: 139064	2011-09-03 00:46:47 +00:00
Bruno Cardoso Lopes	ea1931b9d0	Add AVX versions of FsMOVAPS and FsMOVAPS. Teach X86InstrInfo how to use it! llvm-svn: 139063	2011-09-03 00:46:45 +00:00
Bruno Cardoso Lopes	eb041875c1	Teach X86FastISel to use AVX versions of instructions when possible llvm-svn: 139062	2011-09-03 00:46:42 +00:00
Bruno Cardoso Lopes	86c67e11c9	Fix 80-column and style llvm-svn: 139061	2011-09-03 00:46:40 +00:00
Bruno Cardoso Lopes	beb7a448e7	Tidy up some SSE/AVX convert intrinsics. Also add an AVX version of OptForSize pattern llvm-svn: 139060	2011-09-03 00:46:38 +00:00
Jakob Stoklund Olesen	ef8527b836	Pseudo CMOV instructions don't clobber EFLAGS. The explanation about a 0 argument being materialized as xor is no longer valid. Rematerialization will check if EFLAGS is live before clobbering it. The code produced by X86TargetLowering::EmitLoweredSelect does not clobber EFLAGS. This causes one less testb instruction to be generated in the cmov.ll test case. llvm-svn: 139057	2011-09-02 23:52:55 +00:00
Jakob Stoklund Olesen	29145a3de1	Check for EFLAGS live-out before clobbering it. It is only allowed to clobber EFLAGS at the end of a block if it isn't live-in to any successor. llvm-svn: 139056	2011-09-02 23:52:52 +00:00
Jakob Stoklund Olesen	6d5d51f687	Use existing function. llvm-svn: 139055	2011-09-02 23:52:49 +00:00
Jakob Stoklund Olesen	c710d8fdc7	Remove unused variables. llvm-svn: 139047	2011-09-02 22:41:25 +00:00
Eli Friedman	383a3c76b2	Don't fast-isel for atomic load/store; some cases require extra handling missing from fast-isel. llvm-svn: 139044	2011-09-02 22:33:24 +00:00
Kevin Enderby	90a1526592	Change X86 disassembly to print immediate values as signed by default. Special case those instructions where the immediate is not sign-extended. radr://8795217 llvm-svn: 139028	2011-09-02 20:01:23 +00:00
Bill Wendling	991a1dab16	Revert r138826 until PR10834 can be fixed. llvm-svn: 139018	2011-09-02 18:15:04 +00:00
Bruno Cardoso Lopes	10f234f1a7	Fix vbroadcast matching logic to early unmatch if the node doesn't have only one use. Fix PR10825. llvm-svn: 138951	2011-09-01 18:15:06 +00:00
Bruno Cardoso Lopes	8771512b75	Move more code around and duplicate AVX patterns: MOVHPS and MOVLPS llvm-svn: 138897	2011-08-31 21:15:32 +00:00
Bruno Cardoso Lopes	22aceefbf7	Move MOVAPS,MOVUPS patterns close to the instructions definition llvm-svn: 138896	2011-08-31 21:15:29 +00:00
Bruno Cardoso Lopes	4823fe07e6	Remove "_Int" forms of MOVUPSmr and MOVAPSmr llvm-svn: 138895	2011-08-31 21:15:22 +00:00
Rafael Espindola	295e404961	Spelling and grammar fixes to problems found by Duncan. llvm-svn: 138858	2011-08-31 16:43:33 +00:00
Eli Friedman	4fefb0a561	Make sure we don't crash when -miphoneos-version-min is specified on x86. Hopefully this will fix gcc testsuite failures. llvm-svn: 138856	2011-08-31 16:19:51 +00:00
Eric Christopher	157bf8b08d	Rework this conditional a bit. Patch by Sanjoy Das llvm-svn: 138853	2011-08-31 04:17:21 +00:00
Bruno Cardoso Lopes	5bd6e92f99	- Move all MOVSS and MOVSD patterns close to their definitions - Duplicate some store patterns to their AVX forms! - Caught a bug while restricting the patterns subtarget, fix it and update a testcase to check it properly llvm-svn: 138851	2011-08-31 03:04:20 +00:00
Bruno Cardoso Lopes	a9c2c56e13	Remove unnecessary AVX checks llvm-svn: 138850	2011-08-31 03:04:14 +00:00
Bruno Cardoso Lopes	fe3f3344a6	Teach more places to use VMOVAPS,VMOVUPS instead of MOVAPS,MOVUPS, whenever AVX is enabled. llvm-svn: 138849	2011-08-31 03:04:09 +00:00
Evan Cheng	bbabe9ff60	Fix (movhps load) lowering / pattern to match more cases. rdar://10050549 llvm-svn: 138848	2011-08-31 02:05:24 +00:00
Bill Wendling	1e8c335302	Fix off-by-one error Benjamin noticed. llvm-svn: 138832	2011-08-30 21:23:24 +00:00
Bill Wendling	569b9fee87	Enable compact unwind info by default. This only applies to Darwin when CFI is disabled. llvm-svn: 138826	2011-08-30 20:54:11 +00:00
Jeffrey Yasskin	8f36e758c2	Fix C++0x narrowing errors when char is unsigned. In the case of EDInstInfo, this would actually cause a bug when -1 became 255 and was then compared >=0 in llvm-mc/Disassembler.cpp. llvm-svn: 138825	2011-08-30 20:53:29 +00:00
Rafael Espindola	9db302e741	Adds support for variable sized allocas. For a variable sized alloca, code is inserted to first check if the current stacklet has enough space. If so, space is allocated by simply decrementing the stack pointer. Otherwise a runtime routine (__morestack_allocate_stack_space in libgcc) is called which allocates the required memory from the heap. Patch by Sanjoy Das. llvm-svn: 138818	2011-08-30 19:47:04 +00:00
Rafael Espindola	7721c15106	Adds a SelectionDAG node X86SegAlloca which will be custom lowered from DYNAMIC_STACKALLOC. Two new pseudo instructions (SEG_ALLOCA_32 and SEG_ALLOCA_64) which will match X86SegAlloca (based on word size) are also added. They will be custom emitted to inject the actual stack handling code. Patch by Sanjoy Das. llvm-svn: 138814	2011-08-30 19:43:21 +00:00
Rafael Espindola	321e47cd0b	Emit segmented-stack specific code into function prologues for X86. Modify the pass added in the previous patch to call this new code. This new prologues generated will call a libgcc routine (__morestack) to allocate more stack space from the heap when required Patch by Sanjoy Das. llvm-svn: 138812	2011-08-30 19:39:58 +00:00
Eli Friedman	4d90e53381	Explicitly zero out parts of a vector which are required to be zero by the algorithm in LowerUINT_TO_FP_i32. This only has a substantial effect on the generated code when the input is extracted from a vector register; other ways of loading an i32 do the appropriate zeroing implicitly. Fixes PR10802. llvm-svn: 138768	2011-08-29 21:15:46 +00:00
Bruno Cardoso Lopes	3a09888a72	Move non-instruction patterns to a more appropriate place! llvm-svn: 138744	2011-08-29 17:51:24 +00:00
Nicolas Geoffray	74b006fe71	Remove premature previous commit. llvm-svn: 138725	2011-08-28 14:52:51 +00:00
Nicolas Geoffray	d30e51ca07	Encoding of instructions referencing segments has changed. Do what X86MCCodeEmitter does. llvm-svn: 138723	2011-08-28 13:07:57 +00:00
Benjamin Kramer	6411b8f81a	Silence GCC warnings and make an array const. llvm-svn: 138706	2011-08-27 17:36:14 +00:00
Eli Friedman	9f95c7d381	Add support for generating CMPXCHG16B on x86-64 for the cmpxchg IR instruction. llvm-svn: 138660	2011-08-26 21:21:21 +00:00
Craig Topper	b20cee1e19	Fix disassembling of VCVTSD2SI llvm-svn: 138623	2011-08-26 04:49:29 +00:00
Bruno Cardoso Lopes	e6119d18de	Do the same as r138461. Mark VZEROALL as clobbering all YMM registers llvm-svn: 138592	2011-08-25 22:23:58 +00:00
Bruno Cardoso Lopes	5b3d2c9e17	Add support for AVX 256-bit version of MOVDDUP! llvm-svn: 138588	2011-08-25 21:40:37 +00:00
Bruno Cardoso Lopes	dedd2ffa0b	Make isMOVDDUP mask check more strict and update comments! llvm-svn: 138587	2011-08-25 21:40:34 +00:00
Craig Topper	a6085b9757	Add more missing TB encodings to VEX instructions to allow them to be disassembled. Fixes remainder of PR10678. llvm-svn: 138553	2011-08-25 08:11:01 +00:00
Craig Topper	06ed6cb856	Add TB encoding to VZEROALL, VZEROUPPER, and VCVTPS2PD to allow them to be disassembled. Fixes PR10723. llvm-svn: 138551	2011-08-25 06:57:46 +00:00
Bruno Cardoso Lopes	5d34219953	Add support for 256-bit versions of VSHUFPD and VSHUFPS. llvm-svn: 138546	2011-08-25 02:58:26 +00:00
Bruno Cardoso Lopes	ccffe56b5d	Add memory version of SHUFPD to mask decoding! llvm-svn: 138545	2011-08-25 02:58:21 +00:00
Bruno Cardoso Lopes	dfa5cf4620	Create a section for non-instructions patterns in the beginning of the file, and move more code around! llvm-svn: 138521	2011-08-24 23:18:11 +00:00
Bruno Cardoso Lopes	719b357628	Move code around! llvm-svn: 138520	2011-08-24 23:18:09 +00:00
Bruno Cardoso Lopes	3824b766ac	Organize UNPCK* patterns, also add remaining for AVX. llvm-svn: 138519	2011-08-24 23:18:06 +00:00
Bruno Cardoso Lopes	82c8bc7efd	Move remaining MOVDDUP patterns close to MOVDDUP definition and duplicate the missing ones for AVX. llvm-svn: 138518	2011-08-24 23:18:04 +00:00
Bruno Cardoso Lopes	d315a6b6e6	Organize and tidy up MOVDDUP section. Also update comments! llvm-svn: 138517	2011-08-24 23:18:02 +00:00
Bruno Cardoso Lopes	762fb13cc9	Move MOVHLPS patterns close to MOVHLPS definition, and duplicate the pattern for 128-bit AVX mode. llvm-svn: 138516	2011-08-24 23:17:59 +00:00
Bruno Cardoso Lopes	d62766849f	Move all PSHUF* patterns close to the PSHUF* definitions. Also be explicit about which subtarget they refer to, and add AVX versions of the ones we currently don't. Remove old and now wrong comments! llvm-svn: 138515	2011-08-24 23:17:57 +00:00
Bruno Cardoso Lopes	122f7cfc92	Move all SHUFP* patterns close to the SHUFP* definitions. Also be explicit about which subtarget they refer to, and add AVX versions of the ones we currently don't. Make the mask check more strict, to be clear it won't be used to match to 256-bit versions! llvm-svn: 138514	2011-08-24 23:17:55 +00:00
Eli Friedman	b6597a2e70	Hook up 64-bit atomic load/store on x86-32. I plan to write more efficient implementations eventually. llvm-svn: 138505	2011-08-24 22:33:28 +00:00
Eli Friedman	e4cd816e7b	Fix whitespace. llvm-svn: 138487	2011-08-24 21:17:30 +00:00
Eli Friedman	6f95a6ae1b	Basic x86 code generation for atomic load and store instructions. llvm-svn: 138478	2011-08-24 20:50:09 +00:00
Bruno Cardoso Lopes	734febce18	Mark VZEROALL as clobbering all YMM registers llvm-svn: 138461	2011-08-24 18:48:33 +00:00
Evan Cheng	420bf5446c	Move TargetRegistry and TargetSelect from Target to Support where they belong. These are strictly utilities for registering targets and components. llvm-svn: 138450	2011-08-24 18:08:43 +00:00
Craig Topper	1da38a34a6	Break 256-bit vector int add/sub/mul into two 128-bit operations to avoid costly scalarization. Fixes PR10711. llvm-svn: 138427	2011-08-24 06:14:18 +00:00
Bruno Cardoso Lopes	8959b54713	Fix a nasty bug where a v4i64 was being wrongly emitted with 32-bit permutations. Also tidy up some patterns and make them close to their instruction definition! llvm-svn: 138392	2011-08-23 22:06:37 +00:00
Evan Cheng	ed13551c1d	Some refactoring so TargetRegistry.h no longer has to include any files from MC. llvm-svn: 138367	2011-08-23 20:15:21 +00:00
Nick Lewycky	11874a4e0a	PerformSubCombine to work on integers larger than i128. Fixes a crasher. llvm-svn: 138354	2011-08-23 19:01:24 +00:00
Craig Topper	67b22aedb4	Add support for breaking 256-bit v16i16 and v32i8 VSETCC into two 128-bit ones, avoiding scalarization. Add vex form of pcmpeqq and pcmpgtq. Fixes more cases for PR10712. llvm-svn: 138321	2011-08-23 04:36:33 +00:00
Bruno Cardoso Lopes	8024703a16	Introduce a pass to insert vzeroupper instructions to avoid AVX to SSE transition penalty. The pass is enabled through the "x86-use-vzeroupper" llc command line option. This is only the first step (very naive and conservative one) to sketch out the idea, but proper DFA is coming next to allow smarter decisions. Comments and ideas now and in further commits will be very appreciated. llvm-svn: 138317	2011-08-23 01:14:17 +00:00
Benjamin Kramer	bd13a6a319	X86: Add some operand types required to identify calls. llvm-svn: 138285	2011-08-22 22:55:32 +00:00
Bruno Cardoso Lopes	8007165688	Add support for breaking 256-bit int VSETCC into two 128-bit ones, avoiding scalarization of the compare. Reduces code from 59 to 6 instructions. Fix PR10712. llvm-svn: 138271	2011-08-22 20:31:04 +00:00
Bruno Cardoso Lopes	23ff325f5b	Add 128-bit AVX codegen for PCMP* family of integer instructions llvm-svn: 138270	2011-08-22 20:31:00 +00:00
Bruno Cardoso Lopes	9979e44f1b	Re-write part of VEX encoding logic, to be easier to read! Also fix a bug and add a testcase! llvm-svn: 138123	2011-08-19 22:27:29 +00:00
Craig Topper	f68d77215d	Add TB encoding to VEX versions of SSE fp logical operations to fix disassembler llvm-svn: 138034	2011-08-19 05:28:50 +00:00
Bruno Cardoso Lopes	306110c29a	Fix PR10677. Initial patch and idea by Peter Cooper but I've changed the implementation! llvm-svn: 138029	2011-08-19 02:23:56 +00:00
Bruno Cardoso Lopes	0d458d4bb3	Re-encoded 128-bit AVX versions of SQRT, RSQRT, RCP have 3 operands instead of 2. They were already defined this way in their regular version, but not for the intrinsics versions (_Int), and that would work for assembly emission but not for object code, since a MachineOperand would be missing. This commit fixes PR10697. Also removed the {VSQRT,VRSQRT,VRCP}r_Int forms and match the intrinsic via INSERT_SUBREG+EXTRACT_SUBREG patterns. The same couldn't be done for memory versions because sse_load_f32/sse_load_f64 operands need special handling and don't work like regular "addr" operands. There are right now 114 "_Int" and 98 "Int_*" forms! I'm slowly removing them as I step through, but hope we can get rid of these someday, they are really annoying :) llvm-svn: 138012	2011-08-18 23:59:21 +00:00
Bruno Cardoso Lopes	c174d8ac48	Cleanup vector logical ops in AVX and add use int versions for simple v2i64 llvm-svn: 137919	2011-08-18 02:11:34 +00:00
Bruno Cardoso Lopes	82795e6b41	Fix PR10688. Add support for splitting 256-bit vector shifts when the shift amount is variable llvm-svn: 137885	2011-08-17 22:12:20 +00:00
Owen Anderson	3146968039	Allow the MCDisassembler to return a "soft fail" status code, indicating an instruction that is disassemblable, but invalid. Only used for ARM UNPREDICTABLE instructions at the moment. Patch by James Molloy. llvm-svn: 137830	2011-08-17 17:44:15 +00:00
Bruno Cardoso Lopes	98531dfd08	Introduce matching patterns for vbroadcast AVX instruction. The idea is to match splats in the form (splat (scalar_to_vector (load ...))) whenever the load can be folded. All the logic and instruction emission is working but because of PR8156, there are no ways to match loads, cause they can never be folded for splats. Thus, the tests are XFAILed, but I've tested and exercised all the logic using a relaxed version for checking the foldable loads, as if the bug was already fixed. This should work out of the box once PR8156 gets fixed since MayFoldLoad will work as expected. llvm-svn: 137810	2011-08-17 02:29:19 +00:00
Bruno Cardoso Lopes	e3bab71a4b	Update comments about vector splat handling in x86 llvm-svn: 137808	2011-08-17 02:29:13 +00:00
Bruno Cardoso Lopes	4ff4ed28af	Now that we have a canonical way to handle 256-bit splats: vinsertf128 $1 + vpermilps $0, remove the old code that used to first do the splat in a 128-bit vector and then insert it into a larger one. This is better because the handling code gets simpler and also makes a better room for the upcoming vbroadcast! llvm-svn: 137807	2011-08-17 02:29:10 +00:00
Bruno Cardoso Lopes	d64294fb0a	Instead of always leaving the work to the generic legalizer when there is no support for native 256-bit shuffles, be more smart in some cases, for example, when you can extract specific 128-bit parts and use regular 128-bit shuffles for them. Example: For this shuffle: shufflevector <4 x i64> %a, <4 x i64> %b, <4 x i32> <i32 1, i32 0, i32 7, i32 6> This was expanded to: vextractf128 $1, %ymm1, %xmm2 vpextrq $0, %xmm2, %rax vmovd %rax, %xmm1 vpextrq $1, %xmm2, %rax vmovd %rax, %xmm2 vpunpcklqdq %xmm1, %xmm2, %xmm1 vpextrq $0, %xmm0, %rax vmovd %rax, %xmm2 vpextrq $1, %xmm0, %rax vmovd %rax, %xmm0 vpunpcklqdq %xmm2, %xmm0, %xmm0 vinsertf128 $1, %xmm1, %ymm0, %ymm0 ret Now we get: vshufpd $1, %xmm0, %xmm0, %xmm0 vextractf128 $1, %ymm1, %xmm1 vshufpd $1, %xmm1, %xmm1, %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 llvm-svn: 137733	2011-08-16 18:21:54 +00:00
Bruno Cardoso Lopes	f026c60f3d	While I'm here, remove the "_alt" hacks to a series of INSERT_SUBREG and also add the AVX versions of the 128-bit patterns llvm-svn: 137685	2011-08-15 23:36:51 +00:00
Bruno Cardoso Lopes	1e817d1451	Reorder declarations of vmovmskp* and also put the necessary AVX predicate and TB encoding fields. This fixes the encoding for the attached testcase. This fixes PR10625. llvm-svn: 137684	2011-08-15 23:36:45 +00:00
Jim Grosbach	31c0c9a1f6	MCTargetAsmParser target match predicate support. Allow a target assembly parser to do context sensitive constraint checking on a potential instruction match. This will be used, for example, to handle Thumb2 IT block parsing. llvm-svn: 137675	2011-08-15 23:03:29 +00:00
Bruno Cardoso Lopes	b81c3ed76d	Fix PR10656. It's only profitable to use 128-bit inserts and extracts when AVX mode is on. Otherwise it's just more work for the type legalizer. llvm-svn: 137661	2011-08-15 21:45:54 +00:00
Bruno Cardoso Lopes	8bdbc680ea	Fix comment! llvm-svn: 137521	2011-08-12 21:54:42 +00:00
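
Note on the assert fixes in 8b6890f67e and 0485e133f2: both commits change assert("message") to assert(0 && "message"). A minimal C++ sketch of why that matters follows. The function names are hypothetical and exist only for illustration; they are not taken from the LLVM sources.

```cpp
#include <cassert>

// Condition tested by the old form, assert("error"): a string literal
// decays to a non-null pointer, so the condition is always true and the
// assert can never fire.
bool old_condition() {
    const char *message = "error";
    return message != nullptr;
}

// Condition tested by the fixed form, assert(0 && "error"): the leading 0
// makes the whole expression false, so the assert always fires (unless
// NDEBUG is defined) and the message text shows up in the diagnostic.
bool fixed_condition() {
    const char *message = "error";
    return false && message != nullptr;
}

// Hypothetical helper marking a code path that is not implemented yet.
// Calling it aborts in builds where assertions are enabled.
void not_implemented() {
    assert(0 && "not implemented for target shuffle node");
}
```

This would also explain why 0485e133f2 reports a new test failure: once the assert is able to fire, any input that reaches the unimplemented path aborts instead of silently continuing.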

7631 Commits