llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Bruno Cardoso Lopes	27a7ace4b4	Teach the foldable tables about 128-bit AVX instructions and make the alignment check for 256-bit classes more strict. There are no testcases but we catch more folding cases for AVX while running single and multi sources in the llvm testsuite. Since some 128-bit AVX instructions have a different number of operands than their SSE counterparts, they are placed in different tables. 256-bit AVX instructions should also be added in the table soon. There are a few more 128-bit versions to be handled, which should come in the following commits. llvm-svn: 139687	2011-09-14 02:36:58 +00:00
Bruno Cardoso Lopes	3e6b9661d1	Vector shuffle mask <i32 4, i32 5, i32 2, i32 3> should yield "movsd", not "movss". llvm-svn: 139686	2011-09-14 02:36:14 +00:00
Nadav Rotem	f1730712f7	swap vselect operand order - pr10907 llvm-svn: 139630	2011-09-13 19:56:38 +00:00
Bruno Cardoso Lopes	f02589db47	Add 256-bit versions of alignedstore and alignedload, to be more strict about the alignment checking. This was found by inspection and I don't have any testcases so far, although the llvm testsuite runs without any problem. llvm-svn: 139625	2011-09-13 19:33:03 +00:00
Bruno Cardoso Lopes	6f299a4937	Revert the remaining part of r139528. According to PR10907 the bug seems to be in the VSELECT operands order, so I'll leave the fix for Nadav. llvm-svn: 139624	2011-09-13 19:33:00 +00:00
Nadav Rotem	60df99b809	Add vselect target support for targets that do not support blend but do support xor/and/or (For example SSE2). llvm-svn: 139623	2011-09-13 19:17:42 +00:00
Craig Topper	0f36afb30c	Only disassemble instructions with vvvv != 1111 if the instruction actually uses the vvvv field to encode an operand. Fixes PR10851. llvm-svn: 139591	2011-09-13 07:37:44 +00:00
Craig Topper	03c833ff84	Remove filter that was preventing MOVDQU/MOVDQA and their VEX forms from being disassembled. Also added encodings for the other register/register form of these instructions. Fixes PR10848. llvm-svn: 139588	2011-09-13 06:54:58 +00:00
Craig Topper	6eeb5396f8	Fix encoding of VMOVDQU to not simultaneously be 'TB OpSize' and 'XS'. 'XS' is correct and seems to have been taking priority. llvm-svn: 139587	2011-09-13 06:39:34 +00:00
Eli Friedman	34ffc961d7	Fix the assembler strings for a couple of atomic instructions. Doesn't really matter much in practice, but it's a bit cleaner. llvm-svn: 139563	2011-09-13 00:27:04 +00:00
Bruno Cardoso Lopes	a4d2bdfa40	Fix PR10845. SUBREG_TO_REG shouldn't be used when the input and destination types are equal! llvm-svn: 139553	2011-09-12 22:59:23 +00:00
Bruno Cardoso Lopes	64e2e852f9	Revert the wrong part of r139528, and fix testcases. llvm-svn: 139541	2011-09-12 21:24:07 +00:00
Bruno Cardoso Lopes	c67e996fc3	Not sure how CMPPS and CMPPD had ever worked; I guess they didn't. However, with this fix they do now. Basically the operand order for the x86 target specific node is not the same as the instruction, but since the intrinsics need that specific order at the instruction definition, just change the order during legalization. Also, there were some wrong inversions of condition codes, such as GE => LE, GT => LT; fix that too. Fix PR10907. llvm-svn: 139528	2011-09-12 19:30:40 +00:00
Bruno Cardoso Lopes	e2fc394ed2	Organize the operand names for CMPPS and CMPPD a bit llvm-svn: 139527	2011-09-12 19:30:36 +00:00
Bruno Cardoso Lopes	fc1c90ac48	Realign BLEND patterns to match the general style for patterns in .td file. llvm-svn: 139526	2011-09-12 19:30:33 +00:00
Bruno Cardoso Lopes	f0e65e0f13	Fix 80-columns llvm-svn: 139525	2011-09-12 19:30:29 +00:00
Nadav Rotem	06ce2ac074	Format patterns, remove unused X86blend patterns llvm-svn: 139491	2011-09-12 08:41:50 +00:00
Craig Topper	5ffd0cb080	Fix disassembling of one of the register/register forms of MOVUPS/MOVUPD/MOVAPS/MOVAPD/MOVSS/MOVSD and their VEX equivalents. Fixes PR10877. llvm-svn: 139486	2011-09-11 23:19:54 +00:00
Craig Topper	a9b27eecc9	Fix disassembling of reverse register/register forms of ADD/SUB/XOR/OR/AND/SBB/ADC/CMP/MOV. llvm-svn: 139485	2011-09-11 21:41:45 +00:00
Nadav Rotem	abb5bb41d4	CR fixes per Bruno's request. Undo the changes from r139285 which added custom lowering to vselect. Add tablegen lowering for vselect. llvm-svn: 139479	2011-09-11 15:02:23 +00:00
Eli Friedman	c79e318f02	r139454 activates an assert in a case where we were doing the right thing anyway. Make that explicit, and un-XFAIL the testcase. llvm-svn: 139458	2011-09-10 02:01:42 +00:00
Richard Trieu	8b6890f67e	Fix the asserts in lib/Target/X86/X86ELFWriterInfo.cpp and lib/ExecutionEngine/MCJIT/MCJIT.cpp from: assert("error"); to: assert(0 && "error"); llvm-svn: 139456	2011-09-10 01:42:07 +00:00
Richard Trieu	0485e133f2	Fixed an assert from: assert("not implemented for target shuffle node"); to: assert(0 && "not implemented for target shuffle node"); This causes a test failure in CodeGen/X86/palignr.ll which has been marked as XFAIL for the time being. Test failure filed at PR10901. llvm-svn: 139454	2011-09-10 01:26:21 +00:00
Nadav Rotem	ccb46031e6	Implement vector-select support for avx256. Refactor the vblend implementation to have tablegen match the instruction by the node type llvm-svn: 139400	2011-09-09 20:29:17 +00:00
Craig Topper	18cbd5db26	Fix handling of Intel syntax disassembling of movs and stos to stop being blank. Also fixed scas and cmps to always print size suffix in Intel syntax since it's ambiguous without arguments. Fixes PR10875. llvm-svn: 139353	2011-09-09 05:40:53 +00:00
Nadav Rotem	2f256b7f9f	Fix the 80-columns and remove unsupported v8i16 type from the list of legal vselect types. llvm-svn: 139324	2011-09-08 22:17:35 +00:00
Bruno Cardoso Lopes	54962ac233	Add an AVX version of a simple i64 -> f64 bitcast. This could be triggered using llc with -O0, which wouldn't let it be folded and exposes the lack of this pattern. llvm-svn: 139320	2011-09-08 21:52:33 +00:00
Bruno Cardoso Lopes	2f07ca9728	* Combines Alignment, AuxInfo, and TB_NOT_REVERSABLE flag into a single field (Flags), which is a bitwise OR of items from the TB_* enum. This makes it easier to add new information in the future. * Gives every static array an equivalent layout: { RegOp, MemOp, Flags } * Adds a helper function, AddTableEntry, to avoid duplication of the insertion code. * Renames TB_NOT_REVERSABLE to TB_NO_REVERSE. * Adds TB_NO_FORWARD, which is analogous to TB_NO_REVERSE, except that it prevents addition of the Reg->Mem entry. (This is going to be used by Native Client, in the next CL). Patch by David Meyer llvm-svn: 139311	2011-09-08 18:35:57 +00:00
Bruno Cardoso Lopes	74a67e22b0	Add AVX versions of blend vector operations and fix some issues noticed in Nadav's r139285 and r139287 commits. 1) Rename vsel.ll to a more descriptive name 2) Change the order of BLEND operands to "Op1, Op2, Cond", this is necessary because PBLENDVB is already used in different places with this order, and it was being emitted in the wrong way for vselect 3) Add AVX patterns and tests for the same SSE41 instructions llvm-svn: 139305	2011-09-08 18:05:08 +00:00
Bruno Cardoso Lopes	84c53e3965	Fix PR10844: Add patterns to cover non foldable versions of X86vzmovl. Triggered using llc -O0. Also fix some SET0PS patterns to their AVX forms and test it on the testcase. llvm-svn: 139304	2011-09-08 18:05:02 +00:00
Nadav Rotem	b461f2190e	Add X86-SSE4 codegen support for vector-select. llvm-svn: 139285	2011-09-08 08:11:19 +00:00
Eli Friedman	9ea5599729	Fix atomic load and store on x86 to pass -verify-machineinstrs (and possibly fix some subtle bugs involving passes which check mayStore()). This isn't exactly ideal, but it is good enough for the moment. llvm-svn: 139245	2011-09-07 18:48:32 +00:00
James Molloy	f781d3d8e9	Refactor instprinter and mcdisassembler to take a SubtargetInfo. Add -mattr= handling to llvm-mc. Reviewed by Owen Anderson. llvm-svn: 139237	2011-09-07 17:24:38 +00:00
Rafael Espindola	1cca4f99bd	Detect attempt to use segmented stacks on non ELF systems and error (not assert) early. llvm-svn: 139233	2011-09-07 16:10:57 +00:00
Bill Wendling	763ed58408	Reenable compact unwind by default. However, also emit the old version of unwind information for older linkers. llvm-svn: 139206	2011-09-06 23:47:14 +00:00
Rafael Espindola	9182560b8f	Fix comment. Noticed by Duncan. llvm-svn: 139161	2011-09-06 19:29:31 +00:00
Duncan Sands	d1311488fe	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Rafael Espindola	9d9df4bc1a	Fix style issues and typos found by Duncan. llvm-svn: 139154	2011-09-06 18:43:08 +00:00
Duncan Sands	6939ae53ac	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Nick Lewycky	9b5a242546	Add a new MC bit for NaCl (Native Client) mode. NaCl requires that certain instructions are more aligned than the CPU requires, and adds some additional directives, to follow in future patches. Patch by David Meyer! llvm-svn: 139125	2011-09-05 21:51:43 +00:00
Benjamin Kramer	902004dcd8	Use internal storage for command line option. llvm-svn: 139079	2011-09-03 03:45:06 +00:00
Bruno Cardoso Lopes	02157d584a	Add AVX versions to match AESENC/AESDEC intrinsics. This hopefully ends the cycle of missing AVX counterparts of already present SSE* patterns llvm-svn: 139073	2011-09-03 00:47:08 +00:00
Bruno Cardoso Lopes	c72ce24240	Add AVX version of an SSE4.1 VPBLENDVB pattern llvm-svn: 139072	2011-09-03 00:47:05 +00:00
Bruno Cardoso Lopes	a25fc6f941	Add AVX versions of SSE4.1 EXTRACTPS patterns llvm-svn: 139071	2011-09-03 00:47:03 +00:00
Bruno Cardoso Lopes	45d02d5eca	Add AVX versions for SSE4.1 MOVZX* patterns llvm-svn: 139070	2011-09-03 00:47:01 +00:00
Bruno Cardoso Lopes	cadec3711c	Add one more AVX pattern for MOVZPQILo2PQI llvm-svn: 139069	2011-09-03 00:46:58 +00:00
Bruno Cardoso Lopes	48eeb79003	Move PUNPCKLQDQ splat pattern close to the instruction definition and duplicate it for AVX mode. llvm-svn: 139068	2011-09-03 00:46:56 +00:00
Bruno Cardoso Lopes	ca90af60bd	Add AVX pattern versions for PSHUFB,PSIGN{B,W,D} llvm-svn: 139067	2011-09-03 00:46:54 +00:00
Bruno Cardoso Lopes	7fae5ca308	Add AVX versions of MOVZDI2PDI patterns. Use SUBREG_TO_REG to indicate that the AVX versions (even the 128-bit ones) all clear the upper part of the destination register. llvm-svn: 139066	2011-09-03 00:46:51 +00:00
Bruno Cardoso Lopes	e749426ece	Enforce subtarget checks in a few places to be explicit when the pattern should be matched llvm-svn: 139065	2011-09-03 00:46:49 +00:00
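
The assert fixes in 8b6890f67e and 0485e133f2 replace `assert("error")` with `assert(0 && "error")`. The following is a minimal sketch of why the two forms behave differently. It is not code from the LLVM tree; the helper names `oldAssertCondition` and `newAssertCondition` are hypothetical and only expose the two conditions as booleans so they can be inspected without aborting.

```cpp
// Hypothetical helpers (not LLVM code) that evaluate the two assert
// conditions as plain booleans instead of aborting the program.

// Old form: assert("error"). A string literal decays to a non-null
// pointer, which converts to true, so the assertion can never fire.
bool oldAssertCondition() {
  const char *Msg = "error";
  return static_cast<bool>(Msg);
}

// New form: assert(0 && "error"). The leading 0 makes the whole
// expression false, so the assertion always fires; the literal only
// documents the failure in the diagnostic.
bool newAssertCondition() {
  const char *Msg = "error";
  return 0 && Msg;
}
```

This is consistent with 0485e133f2 reporting a new test failure after the fix: an assertion that previously could not fire became reachable.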


7512 Commits
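
Commit 60df99b809 adds vselect support for targets that lack a blend instruction but do have xor/and/or (for example SSE2). The log does not show the actual lowering, so the following is only a sketch of the general bitwise-select identity such a lowering can rely on, written as scalar C++ over lanes. The function `selectLanes` is a hypothetical name, and the sketch assumes every mask lane is either all-ones or all-zeros.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical helper (not LLVM code): lane-wise select built only from
// AND, OR and XOR. Assumes each mask lane is all-ones (take A) or
// all-zeros (take B).
template <std::size_t N>
std::array<std::uint32_t, N>
selectLanes(const std::array<std::uint32_t, N> &Mask,
            const std::array<std::uint32_t, N> &A,
            const std::array<std::uint32_t, N> &B) {
  std::array<std::uint32_t, N> Result{};
  for (std::size_t I = 0; I < N; ++I) {
    // NOT is expressed as XOR with all-ones, so no dedicated
    // complement operation is needed.
    const std::uint32_t NotMask = Mask[I] ^ 0xFFFFFFFFu;
    Result[I] = (Mask[I] & A[I]) | (NotMask & B[I]);
  }
  return Result;
}
```

With a mask of `{all-ones, 0, all-ones, 0}`, the result takes lanes 0 and 2 from `A` and lanes 1 and 3 from `B`.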