llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 04:52:54 +02:00

Author	SHA1	Message	Date
Benjamin Kramer	6a45984683	Remove a bunch of integer width predicate functions in favor of MathExtras. Most of these were unused, some of them were wrong and unused (isS16Constant<short>, isS10Constant<short>). llvm-svn: 99827	2010-03-29 19:07:58 +00:00
Chris Lattner	c0d5bcc160	From Kalle Raiskila: "the bigstack patch for SPU, with testcase. It is essentially the patch committed as 97091, and reverted as 97099, but with the following additions: -in vararg handling, registers are marked to be live, to not confuse the register scavenger -function prologue and epilogue are not emitted, if the stack size is 16. 16 means it is empty - there is only the register scavenger emergency spill slot, which is not used as there is no stack." llvm-svn: 99819	2010-03-29 17:38:47 +00:00
Chris Lattner	6a8b488d78	add a note. llvm-svn: 99815	2010-03-29 17:02:02 +00:00
Johnny Chen	03c2e8a36c	Add NVTBLFrm to represent A8.6.406 VTBL, VTBX Vector Table Lookup Instructions. These instructions use byte index in a control vector (M:Vm) to lookup byte values in a table and generate a new vector (D:Vd). The table is specified via a list of vectors, which can be: {Dn} {Dn D<n+1>} {Dn D<n+1> D<n+2>} {Dn D<n+1> D<n+2> D<n+3>} llvm-svn: 99789	2010-03-29 01:14:22 +00:00
Chris Lattner	87f1ea5d27	zap an extra line that Eli noticed! llvm-svn: 99770	2010-03-28 18:52:28 +00:00
Chris Lattner	4de7f7e862	fix a type contradition: XCoreISD::RETSP has one argument, not zero. llvm-svn: 99760	2010-03-28 08:47:39 +00:00
Chris Lattner	e522e8f94e	remove a pattern with no testcase that doesn't appear to be matchable: it seems like it would always constant fold. llvm-svn: 99758	2010-03-28 08:40:48 +00:00
Chris Lattner	7f2800ea6c	fix integer negates to use the proper type for the zero vectors, this also depends on the new "bitconvert dropping" behavior just added to tblgen. llvm-svn: 99757	2010-03-28 08:39:10 +00:00
Chris Lattner	0530bbf7ea	fix a typo, bitconvert from node to itself isn't valid. llvm-svn: 99755	2010-03-28 08:36:45 +00:00
Chris Lattner	36a0df5c96	fix vnot matching to explicitly specify the type of the input to be v8i8 or v16i8, which buildvectors get canonicalized to. This allows the patterns that were previously using a bare 'vnot' to match, before they couldn't. llvm-svn: 99754	2010-03-28 08:08:07 +00:00
Chris Lattner	167d09c397	fix up vnot matching, eliminating a dead pattern, correcting a couple of patterns that would never match because of bitcast, and eliminating use of vnot_conv. llvm-svn: 99753	2010-03-28 08:00:23 +00:00
Chris Lattner	ac16bb9827	stop using vnot_conv llvm-svn: 99750	2010-03-28 07:48:17 +00:00
Chris Lattner	22c84d79fa	revert r99743, this is saying that the repmovs instructinos have an input of other type, which is the VT. llvm-svn: 99749	2010-03-28 07:38:39 +00:00
Chris Lattner	59eebcbbde	remove a bunch of dead patterns. llvm-svn: 99748	2010-03-28 07:38:00 +00:00
Chris Lattner	941ab0b2d5	claiming to return other is pointless. llvm-svn: 99743	2010-03-28 05:57:36 +00:00
Chris Lattner	4a35b78926	Improve systemz to model cmp and ucmp nodes as returning their flags correctly. llvm-svn: 99738	2010-03-28 05:21:52 +00:00
Chris Lattner	99fafcf87e	the FPCmp node returns an i32. llvm-svn: 99737	2010-03-28 05:12:57 +00:00
Chris Lattner	c5499723d5	fix some modelling problems exposed by a patch I'm working on. bsr/bsf/ptest nodes all have an EFLAGS result when made by isel lowering. llvm-svn: 99736	2010-03-28 05:07:17 +00:00
Bob Wilson	6d9296abc6	Fix indentation. llvm-svn: 99705	2010-03-27 04:01:23 +00:00
Bob Wilson	be89c51aa3	Add a format argument to the N3V and N3VX classes, removing the N3Vf class. llvm-svn: 99704	2010-03-27 03:56:52 +00:00
Chris Lattner	154641e2ff	eliminate the last of the parallel's! llvm-svn: 99700	2010-03-27 02:47:14 +00:00
Johnny Chen	794a9a3941	Add NVMulSLFrm to represent "3-register multiply with scalar" operations and set it as the format for the appropriate N3VSL<> classes. These instructions require special handling of the M:Vm field which encodes the restricted Dm and the lane index within Dm. Examples are A8.6.325 VMLA, VMLAL, VMLS, VMLSL (by scalar): vmlal.s32 q3, d2, d10[0] llvm-svn: 99690	2010-03-27 01:03:13 +00:00
Chris Lattner	22dceb8eb0	eliminate almost all the rest of the x86-32 parallels. llvm-svn: 99686	2010-03-27 00:45:04 +00:00
Jim Grosbach	f884dba43d	Thumb2 storeFrom/LoadToStackSlot() need to handle tGPR regs directly, not pass through to the generic version. The generic functions use STR/LDR, but T2 needs the t2STR/t2LDR instead so we get the addressing mode correct. llvm-svn: 99678	2010-03-27 00:09:12 +00:00
Johnny Chen	0763231e02	Remove the duplicate multiclass N3VSh_QHSD and use N3VInt_QHSD which is modified to now take a format argument. N3VDInt<> and N3VQInt<> are modified to take a format argument as well. llvm-svn: 99676	2010-03-26 23:49:07 +00:00
Johnny Chen	ca1d7dcfe4	Add NVExtFrm to represent NEON Vector Extract Instructions, that uses Inst{11-8} to encode the byte location of the extracted result in the concatenation of the operands, from the least significant end. Modify VEXTd and VEXTq classes to use the format. llvm-svn: 99659	2010-03-26 22:28:56 +00:00
Johnny Chen	4a97a176e7	Add N3RegVShFrm to represent 3-Register Vector Shift Instructions, which do not follow the N3RegFrm's operand order of D:Vd N:Vn M:Vm. The operand order of N3RegVShFrm is D:Vd M:Vm N:Vn (notice that M:Vm is the first src operand). Add a parent class N3Vf which requires passing a Format argument and which the N3V class is modified to inherit from. N3V class represents the "normal" 3-Register NEON Instructions with N3RegFrm. Also add a multiclass N3VSh_QHSD to represent clusters of NEON 3-Register Shift Instructions and replace 8 invocations with it. llvm-svn: 99655	2010-03-26 21:26:28 +00:00
Jim Grosbach	97d626c850	vldm/vstm can only do up to 16 double-word registers at a time. Radar 7797856 llvm-svn: 99630	2010-03-26 18:41:09 +00:00
Johnny Chen	c986f10733	Add N3RegFrm to represent "NEON 3 vector register format" instructions. Examples are VABA (Vector Absolute Difference and Accumulate), VABAL (Vector Absolute Difference and Accumulate Long), and VABD (Vector Absolute Difference). llvm-svn: 99628	2010-03-26 18:32:20 +00:00
Evan Cheng	d1ee7e0ba3	Do not sibcall if stack needs to be dynamically aligned. llvm-svn: 99620	2010-03-26 16:26:03 +00:00
Evan Cheng	377bb993d8	Allow trivial sibcall of vararg callee when no arguments are being passed. llvm-svn: 99598	2010-03-26 02:13:13 +00:00
Johnny Chen	a8b02d6451	Add N2RegVShLFrm and N2RegVShRFrm formats so that the disassembler can easily dispatch to the appropriate routines to handle the different interpretations of the shift amount encoded in the imm6 field. The Vd, Vm fields are interpreted the same between the two, though. See, for example, A8.6.367 VQSHL, VQSHLU (immediate) for N2RegVShLFrm format and A8.6.368 VQSHRN, VQSHRUN for N2RegVShRFrm format. llvm-svn: 99590	2010-03-26 01:07:59 +00:00
Jim Grosbach	2a0b14a387	switch the flag for using NEON for SP floating point to a subtarget 'feature'. Re-commit. This time complete with testsuite updates. llvm-svn: 99570	2010-03-25 23:47:34 +00:00
Jim Grosbach	97d5bc2b86	need to fix 'make check' tests first. revert for a moment. llvm-svn: 99569	2010-03-25 23:34:05 +00:00
Jim Grosbach	7e87ba79e6	switch the flag for using NEON for SP floating point to a subtarget 'feature' llvm-svn: 99568	2010-03-25 23:32:19 +00:00
Johnny Chen	d56897bddc	Removed instruction class NI from ARMInstrFormats.td. It doesn't seem to be used anywhere. llvm-svn: 99566	2010-03-25 23:11:56 +00:00
Jim Grosbach	b97ff2a4c1	switch the use-vml[as] instructions flag to a subtarget 'feature' llvm-svn: 99565	2010-03-25 23:11:16 +00:00
Johnny Chen	38c9f64289	Add NVDupLnFrm and change NVDupLane class to use that format. llvm-svn: 99557	2010-03-25 21:49:12 +00:00
Jim Grosbach	0975d55c8e	ARM cortex-a8 doesn't do vmla/vmls well. disable them by default for that cpu llvm-svn: 99549	2010-03-25 20:48:50 +00:00
Johnny Chen	58278a364d	Add NVCVTFrm (NEON Convert with fractional bits immediate) and modify N2VImm to expect a Format arg. N2VCvtD/N2VCvtQ are modified to use the NVCVTFrm format. llvm-svn: 99548	2010-03-25 20:39:04 +00:00
Daniel Dunbar	aeb4d40a70	Fix -Asserts warning, again. llvm-svn: 99542	2010-03-25 19:35:53 +00:00
Jakob Stoklund Olesen	17f506ccdd	Tag SSE2 integer instructions as SSEPackedInt. llvm-svn: 99540	2010-03-25 18:52:04 +00:00
Jakob Stoklund Olesen	5a6e614de9	Teach TableGen to understand X.Y notation in the TSFlagsFields strings. Remove much horribleness from X86InstrFormats as a result. Similar simplifications are probably possible for other targets. llvm-svn: 99539	2010-03-25 18:52:01 +00:00
Jakob Stoklund Olesen	5ca19faccc	Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register in a different domain than where it was defined. Some instructions have equvivalents for different domains, like por/orps/orpd. The SSEDomainFix pass tries to minimize the number of domain crossings by changing between equvivalent opcodes where possible. This is a work in progress, in particular the pass doesn't do anything yet. SSE instructions are tagged with their execution domain in TableGen using the last two bits of TSFlags. Note that not all instructions are tagged correctly. Life just isn't that simple. The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline issue handled by NEONMoveFixPass. This pass may become target independent to handle both. llvm-svn: 99524	2010-03-25 17:25:00 +00:00
Johnny Chen	cc491eff10	Added a new instruction class NVDupLane to be inherited by VDUPLND and VDUPLNQ, instead of the current N2V. Format of NVDupLane instances are set to NEONFrm currently. llvm-svn: 99518	2010-03-25 17:01:27 +00:00
Bob Wilson	04e9ff15cb	Reapply Kevin's change 94440, now that Chris has fixed the limitation on opcode values fitting in one byte (svn r99494). llvm-svn: 99514	2010-03-25 16:36:14 +00:00
Chris Lattner	cda90fafdd	eliminate a bunch more parallels now that scheduling handles dead implicit results more aggressively. More to come, I think this is now just a data entry problem. llvm-svn: 99486	2010-03-25 05:44:01 +00:00
Evan Cheng	d663ac8306	Disable folding loads into tail call in 32-bit PIC mode. It can introduce illegal code like this: addl $12, %esp popl %esi popl %edi popl %ebx popl %ebp jmpl __Block_deallocator-L1$pb(%esi) # TAILCALL The problem is the global base register is assigned GR32 register class. TCRETURNmi needs the registers making up the address mode to have the GR32_TC register class. The proper* fix is for X86DAGToDAGISel::getGlobalBaseReg() to return a copy from the global base register of the machine function rather than returning the register itself. But that has the potential of causing it to be coalesced to a more restrictive register class: GR32_TC. It can introduce additional copies and spills. For something as important the PIC base, it's not worth it especially since this is not an issue on 64-bit. llvm-svn: 99455	2010-03-25 00:10:31 +00:00
Bob Wilson	d5673d9f1f	Speculatively revert this to see if it fixes buildbot failures. --- Reverse-merging r99440 into '.': U test/MC/AsmParser/X86/x86_32-bit_cat.s U test/MC/AsmParser/X86/x86_32-encoding.s U include/llvm/IntrinsicsX86.td U include/llvm/CodeGen/SelectionDAGNodes.h U lib/Target/X86/X86InstrSSE.td U lib/Target/X86/X86ISelLowering.h llvm-svn: 99450	2010-03-24 23:26:29 +00:00
Kevin Enderby	9cab7fdb12	Added the Advanced Encryption Standard (AES) Instructions. llvm-svn: 99440	2010-03-24 22:33:33 +00:00

1 2 3 4 5 ...

13651 Commits