llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Bruno Cardoso Lopes	b7b9688aa5	- Inspected an AVX code block added by someone in early Feb. This was never used and was actually very wrong; fix it and make it simpler. Also remove the ConcatVectors function, which is unused now. - Fix an introduction of useless nodes in r126664 and r126264. The VUNPCKL* should never be introduced because we don't want duplicate nodes for 128 AVX and non-AVX modes; the actual instruction difference only exists during isel, but not for target specific DAG nodes. We only introduce V* target nodes when there is no 128-bit version already there. - Fix a fragile test and make it more useful. llvm-svn: 135729	2011-07-22 00:15:07 +00:00
Bruno Cardoso Lopes	1ee6122518	Add a DAGCombine for transforming 128->256 casts into a simple vxorps + vinsertf128 pair of instructions llvm-svn: 135727	2011-07-22 00:15:00 +00:00
Bruno Cardoso Lopes	7e5f2950cc	Introduce a new function to lower 256-bit vectors which are not directly supported and should be promoted and handled by smaller shuffles llvm-svn: 135726	2011-07-22 00:14:56 +00:00
Bruno Cardoso Lopes	71a17c4789	Rename function to be more specific and be more strict about its usage llvm-svn: 135725	2011-07-22 00:14:53 +00:00
Bruno Cardoso Lopes	3691063149	- Register v16i16 as valid VR256 register class - Add more bitcasts for v16i16 - Since 135661 and 135662 already added the splat logic, just add one more splat test for v16i16 llvm-svn: 135663	2011-07-21 02:24:08 +00:00
Bruno Cardoso Lopes	ba1a2a9135	Add support for 256-bit versions of VPERMIL instruction. This is a new instruction introduced in AVX, which can operate on 128 and 256-bit vectors. It considers a 256-bit vector as two independent 128-bit lanes. It can permute any 32- or 64-bit elements inside a lane, and restricts the second lane to have the same permutation as the first one. With the improved splat support introduced earlier today, adding codegen for this instruction enables more efficient 256-bit code. Instead of: vextractf128 $0, %ymm0, %xmm0; punpcklbw %xmm0, %xmm0; punpckhbw %xmm0, %xmm0; vinsertf128 $0, %xmm0, %ymm0, %ymm1; vinsertf128 $1, %xmm0, %ymm1, %ymm0; vextractf128 $1, %ymm0, %xmm1; shufps $1, %xmm1, %xmm1; movss %xmm1, 28(%rsp); movss %xmm1, 24(%rsp); movss %xmm1, 20(%rsp); movss %xmm1, 16(%rsp); vextractf128 $0, %ymm0, %xmm0; shufps $1, %xmm0, %xmm0; movss %xmm0, 12(%rsp); movss %xmm0, 8(%rsp); movss %xmm0, 4(%rsp); movss %xmm0, (%rsp); vmovaps (%rsp), %ymm0. We get: vextractf128 $0, %ymm0, %xmm0; punpcklbw %xmm0, %xmm0; punpckhbw %xmm0, %xmm0; vinsertf128 $0, %xmm0, %ymm0, %ymm1; vinsertf128 $1, %xmm0, %ymm1, %ymm0; vpermilps $85, %ymm0, %ymm0. llvm-svn: 135662	2011-07-21 01:55:47 +00:00
Bruno Cardoso Lopes	b16371a45e	Improve splat promotion to handle AVX types: v32i8 and v16i16. Also refactor the code and add a bunch of comments. The final shuffle emitted by handling 256-bit types is suitable for the VPERM shuffle instruction which is going to be introduced in a later commit (with a testcase which covers this commit) llvm-svn: 135661	2011-07-21 01:55:42 +00:00
Bruno Cardoso Lopes	194507cc77	Add additional patterns for vextractf128 instruction llvm-svn: 135660	2011-07-21 01:55:39 +00:00
Bruno Cardoso Lopes	14c800c1e3	Add additional patterns for vinsertf128 instruction llvm-svn: 135659	2011-07-21 01:55:36 +00:00
Bruno Cardoso Lopes	a8244d4444	Add v16i16 type to VR256 class llvm-svn: 135658	2011-07-21 01:55:33 +00:00
Bruno Cardoso Lopes	e0d5bd467f	Move code around. No functionality changes llvm-svn: 135657	2011-07-21 01:55:30 +00:00
Bruno Cardoso Lopes	60093b6104	Tidy up code llvm-svn: 135656	2011-07-21 01:55:27 +00:00
Bill Wendling	1f46862df8	Mark instructions which are part of the frame setup with the MachineInstr::FrameSetup flag. llvm-svn: 135645	2011-07-21 00:44:56 +00:00
Bill Wendling	4958d250c9	Remove unused function. llvm-svn: 135635	2011-07-20 23:07:42 +00:00
Bill Wendling	55eb4a26d9	Remove the now defunct getCompactUnwindEncoding method from the frame lowering code. llvm-svn: 135634	2011-07-20 23:04:09 +00:00
Evan Cheng	c9bc5a9011	Goodbye TargetAsmInfo. This eliminates the last bit of CodeGen and Target in llvm-mc. There is still a bit more refactoring left to do in Targets. But we are now very close to fixing all the layering issues in MC. llvm-svn: 135611	2011-07-20 19:50:42 +00:00
Eli Friedman	7776a468cf	Extend the hack for _GLOBAL_OFFSET_TABLE_ slightly; PR10389. llvm-svn: 135607	2011-07-20 19:36:11 +00:00
Evan Cheng	55d7fcc5f7	- Move CodeModel from a TargetMachine global option to MCCodeGenInfo. - Introduce JITDefault code model. This tells targets to set different default code model for JIT. This eliminates the ugly hack in TargetMachine where code model is changed after construction. llvm-svn: 135580	2011-07-20 07:51:56 +00:00
NAKAMURA Takumi	19d48c106d	X86Subtarget.h: Assume "x86_64-cygwin", though it has not been released yet, to appease test/CodeGen/X86 on cygwin. llvm-svn: 135564	2011-07-20 04:02:20 +00:00
Evan Cheng	bfc0cac54d	Introduce MCCodeGenInfo, which keeps information that can affect codegen (including compilation, assembly). Move relocation model Reloc::Model from TargetMachine to MCCodeGenInfo so it's accessible even without TargetMachine. llvm-svn: 135468	2011-07-19 06:37:02 +00:00
Evan Cheng	10c6820ff4	Move getInitialFrameState from TargetFrameInfo to MCAsmInfo (suggestions for better location welcome). llvm-svn: 135438	2011-07-18 22:29:13 +00:00
Evan Cheng	561d71ce7b	Sink getDwarfRegNum, getLLVMRegNum, getSEHRegNum from TargetRegisterInfo down to MCRegisterInfo. Also initialize the mapping at construction time. This patch eliminates TargetRegisterInfo from TargetAsmInfo. It's another step towards fixing the layering violation. llvm-svn: 135424	2011-07-18 20:57:22 +00:00
Bruno Cardoso Lopes	bdf75dfa28	Be more smart with VCVTSS2SD. Also place the patterns close to the definitions. llvm-svn: 135407	2011-07-18 18:11:25 +00:00
Bruno Cardoso Lopes	da90f383ab	Add AVX 128-bit sqrt versions llvm-svn: 135404	2011-07-18 17:51:40 +00:00
Chris Lattner	e1fe7061ce	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Bruno Cardoso Lopes	d258749f73	Add AVX 128-bit patterns for sint_to_fp llvm-svn: 135332	2011-07-16 00:50:20 +00:00
Bruno Cardoso Lopes	d5b62f3403	Fix a couple of things: 1) Make non-legal 256-bit loads be promoted to v4i64. This lets us canonicalize the loads and handle things the same way we handle them for 128-bit registers. Despite what one of the removed comments explained, the load promotion would not mess with VPERM; it's only a matter of doing the appropriate bitcasts when this instruction is introduced. Also make LOAD v8i32 legal. 2) Doing 1) exposed two bugs: - v4i64 was being promoted to itself for several opcodes (introduced in r124447 by David Greene) causing endless recursion and the stack to explode. - there was no support for allOnes BUILD_VECTORs and ANDNP would fail to match because it was generating early target constant pools during lowering. 3) The testcases are already checked in; doing 1) exposed the bugs in the current testcases. 4) Tidy up code to be more clear and explicit about AVX. llvm-svn: 135313	2011-07-15 22:24:33 +00:00
Bruno Cardoso Lopes	2a23e486ad	Add a few patterns for 256-bit bitcasts. No testcases now, they are coming together with other tests. llvm-svn: 135312	2011-07-15 22:24:17 +00:00
Eli Friedman	6bd9cfed88	PR10370: Make sure we know how to relax push correctly on x86-64. llvm-svn: 135303	2011-07-15 21:28:39 +00:00
Chandler Carruth	89cfaf4305	Remove an unnecessary header from this file. I don't think this header was really intended, and it may have been required prior to some of the recent refactors. Including it however causes LLVMX86Desc to need symbols from LLVMX86CodeGen, forming a dependency cycle. This was masked in almost all builds: Clang, and GCC w/ optimizations didn't actually emit the symbols! llvm-svn: 135242	2011-07-15 04:16:38 +00:00
Evan Cheng	1ae06d95e0	Move some parts of TargetAsmInfo down to MCAsmInfo. This is not the greatest solution but it is a small step towards removing the horror that is TargetAsmInfo. llvm-svn: 135237	2011-07-15 02:09:41 +00:00
Chandler Carruth	ce38403d2d	Major update to CMake build to reflect changes in r135219 in the backend. Moved some MCAsmInfo files down into the MCTargetDesc sublibraries, removed some (I suspect long) dead files from other parts of the CMake build, etc. Also copied the include directory hack from the Makefile. Finally, updated the lib deps. I spot checked this, and think it's correct, but review appreciated there. llvm-svn: 135234	2011-07-15 00:40:52 +00:00
Evan Cheng	9e8f90a020	Rename createAsmInfo to createMCAsmInfo and move registration code to MCTargetDesc to prepare for next round of changes. llvm-svn: 135219	2011-07-14 23:50:31 +00:00
Bill Wendling	ed868b039b	* Redo the permutation encoding for frameless stacks to be more like what the unwind library expects. * Comment the permutation encoding for frameless stacks. llvm-svn: 135202	2011-07-14 22:01:34 +00:00
Benjamin Kramer	d88f66e018	Port operand types for ARM and X86 over from EDIS to the .td files. llvm-svn: 135198	2011-07-14 21:47:22 +00:00
Evan Cheng	24257cb9ea	Next round of MC refactoring. This patch factors MC table instantiations, MC registration and creation code into XXXMCDesc libraries. llvm-svn: 135184	2011-07-14 20:59:42 +00:00
Eric Christopher	ca7ae418a5	Check register class matching instead of width of type matching when determining validity of matching constraint. Allow i1 types access to the GR8 reg class for x86. Fixes PR10352 and rdar://9777108 llvm-svn: 135180	2011-07-14 20:13:52 +00:00
Bruno Cardoso Lopes	d24f039847	Add 256-bit load/store recognition and matching in several places. llvm-svn: 135171	2011-07-14 18:50:58 +00:00
Nadav Rotem	b93249b1e7	[VECTOR-SELECT] During type legalization we often use the SIGN_EXTEND_INREG SDNode. When this SDNode is legalized during the LegalizeVector phase, it is scalarized because non-simple types are automatically marked to be expanded. In this patch we add support for lowering SIGN_EXTEND_INREG manually. This fixes CodeGen/X86/vec_sext.ll when running with the '-promote-elements' flag. llvm-svn: 135144	2011-07-14 11:11:14 +00:00
Eli Friedman	a1db9f2fd5	Fix up assertion in r135018 so it doesn't trigger on 32-bit; when we're in 32-bit, it doesn't matter whether the operation overflows because the computed address is not wider than the immediate. llvm-svn: 135120	2011-07-14 00:22:31 +00:00
Bill Wendling	e896e46a8c	Add code to handle a "frameless" unwind stack. The frameless unwind stack has a special encoding, the algorithm for which is in "permuteEncode". llvm-svn: 135103	2011-07-13 23:03:31 +00:00
Bruno Cardoso Lopes	c0401dddf7	Make X86ISD::ANDNP more general and Codegen 256-bit VANDNP. A more general version of X86ISD::ANDNP also opened the room for a little bit of refactoring. llvm-svn: 135088	2011-07-13 21:36:51 +00:00
Bruno Cardoso Lopes	b98f50da03	The target specific node PANDN name is misleading. That happens because it's later selected to a ANDNPD/ANDNPS instruction instead of the PANDN instruction. Rename it. llvm-svn: 135087	2011-07-13 21:36:47 +00:00
Eli Friedman	30d557cc28	Make sure we don't combine a large displacement and a frame index in the same addressing mode on x86-64. It can overflow, leading to a crash/miscompile. <rdar://problem/9763308> llvm-svn: 135084	2011-07-13 21:29:53 +00:00
Eli Friedman	e0a117fbdf	Refactor out checking for displacements on x86-64 addressing modes. No functionality change. Refactoring in preparation for an additional safety check in FoldOffsetIntoAddress. Part of <rdar://problem/9763308>. llvm-svn: 135079	2011-07-13 20:44:23 +00:00
Jim Grosbach	e0fc4019f9	Update MCParsedAsmOperand debug methods. Update the debug output interface for MCParsedAsmOperand to have a print() method which takes an output stream argument, an << operator which invokes the print method using the given stream, and a dump() method which prints the operand to the dbgs() stream. This makes the interface more consistent with the rest of LLVM, and more convenient to use at the debugger command line. llvm-svn: 135043	2011-07-13 15:34:57 +00:00
Bruno Cardoso Lopes	cb49278ad6	AVX Codegen support for 256-bit versions of vandps, vandpd, vorps, vorpd, vxorps, vxorpd llvm-svn: 135023	2011-07-13 01:15:33 +00:00
Bill Wendling	e6de1eeb86	Don't emit the FDE end label if the last thing emitted was a compact unwind and not the FDE llvm-svn: 135020	2011-07-13 00:49:09 +00:00
Eli Friedman	8c4106f2a5	Add an assert (which should never trigger) that triggers on a testcase I'm looking at. llvm-svn: 135018	2011-07-13 00:44:29 +00:00
Bill Wendling	78fce7597f	Assign variable before we test it. llvm-svn: 135015	2011-07-13 00:23:39 +00:00
Bill Wendling	d61dd044e1	Fix obvious think-o. llvm-svn: 135014	2011-07-13 00:20:09 +00:00
Bill Wendling	9e1528dea4	Clean up the handling of an EBP/RBP unwind frame pointer. In particular, don't assert when the frame pointer is -1 (i.e., the function is "frameless"). Still to do: "frameless" unwind information. llvm-svn: 135013	2011-07-13 00:16:14 +00:00
Evan Cheng	1346a63a0f	- Eliminate MCCodeEmitter's dependency on TargetMachine. It now uses MCInstrInfo and MCSubtargetInfo. - Added methods to update subtarget features (used when targets automatically detect subtarget features or switch modes). - Teach X86Subtarget to update MCSubtargetInfo feature bits since the MCSubtargetInfo layer can be shared with other modules. - This fixes .code 16 / .code 32 support since the mode switch is updated in MCSubtargetInfo so the MC code emitter can do the right thing. llvm-svn: 134884	2011-07-11 03:57:24 +00:00
Evan Cheng	c9e252df68	Change createAsmParser to take a MCSubtargetInfo instead of triple, CPU, and feature string. Parsing some asm directives can change subtarget state (e.g. .code 16) and it must be reflected in other modules (e.g. MCCodeEmitter). That is, the MCSubtargetInfo instance must be shared. llvm-svn: 134795	2011-07-09 05:47:46 +00:00
Eli Friedman	1f8926e94d	Really force on 64bit for 64-bit targets. Should fix remaining failures on unknown x86/non-x86 targets. llvm-svn: 134773	2011-07-08 23:43:01 +00:00
Eli Friedman	6de12d7388	Revert earlier unnecessary hack. Make sure we correctly force on 64bit and cmov for 64-bit targets. llvm-svn: 134768	2011-07-08 23:07:42 +00:00
Evan Cheng	03af99dd82	Restore old behavior. Always auto-detect features unless cpu or features are specified. llvm-svn: 134757	2011-07-08 22:30:25 +00:00
Eli Friedman	0ea2c325a9	Default 64-bit target features and SSE2 on when a triple specifies x86-64. Clean up all the other hacks which are now unnecessary. llvm-svn: 134753	2011-07-08 22:16:47 +00:00
Julien Lerouge	75e462e164	Add _allrem, _aullrem and _allmul to the runtime for MSVC. http://llvm.org/bugs/show_bug.cgi?id=10305 llvm-svn: 134744	2011-07-08 21:40:25 +00:00
Cameron Zwarich	c23366d357	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Evan Cheng	69f14d6012	For non-x86 host, use generic as CPU name. llvm-svn: 134741	2011-07-08 21:14:14 +00:00
Benjamin Kramer	85b2770a1c	Plug a leak by giving the AsmParser ownership of the MCSubtargetInfo. Found by valgrind. llvm-svn: 134738	2011-07-08 21:06:23 +00:00
Evan Cheng	34f67f2dda	TargetAsmParser doesn't need reference to Target. llvm-svn: 134721	2011-07-08 19:33:14 +00:00
Evan Cheng	50f2d8d304	Eliminate asm parser's dependency on TargetMachine: - Each target asm parser now creates its own MCSubtargetInfo (if needed). - Changed AssemblerPredicate to take subtarget features which tablegen uses to generate asm matcher subtarget feature queries. e.g. "ModeThumb,FeatureThumb2" is translated to "(Bits & ModeThumb) != 0 && (Bits & FeatureThumb2) != 0". llvm-svn: 134678	2011-07-08 01:53:10 +00:00
Nick Lewycky	a82f7a687e	Let the inline asm 'q' constraint match float, and on 64-bit double too. Fixes PR9602! llvm-svn: 134665	2011-07-08 00:19:27 +00:00
Eric Christopher	5fb023bb10	Go ahead and emit the barrier on x86-64 even without sse2. The processor supports it just fine. Fixes PR9675 and rdar://9740801 llvm-svn: 134664	2011-07-08 00:04:56 +00:00
Eric Christopher	96527f39fd	Handle fpcr register. Part of PR10299 and rdar://9740322 llvm-svn: 134653	2011-07-07 22:54:12 +00:00
Eric Christopher	b7597bc669	Add support for the X86 'l' constraint. Fixes PR10149 and rdar://9738585 llvm-svn: 134648	2011-07-07 22:29:07 +00:00
Evan Cheng	bbed81df25	Add Mode64Bit feature and sink it down to MC layer. llvm-svn: 134641	2011-07-07 21:06:52 +00:00
Evan Cheng	18acf2200c	Compute feature bits at time of MCSubtargetInfo initialization. llvm-svn: 134606	2011-07-07 07:07:08 +00:00
Bill Wendling	2b47bfeaa9	Use ArrayRef instead of a std::vector&. llvm-svn: 134595	2011-07-07 04:42:01 +00:00
Bill Wendling	ba39846c2b	Add a target hook to encode the compact unwind information. llvm-svn: 134577	2011-07-07 00:54:13 +00:00
Evan Cheng	b0e0a318b7	Rename files for consistency. llvm-svn: 134546	2011-07-06 22:01:53 +00:00
Bill Wendling	479007f9af	Constify getCompactUnwindRegNum. llvm-svn: 134527	2011-07-06 20:33:48 +00:00
Evan Cheng	dcd3ea7062	createMCInstPrinter doesn't need TargetMachine anymore. llvm-svn: 134525	2011-07-06 19:45:42 +00:00
Kevin Enderby	59ba10f2ac	Changed the X86 PUSH64i8 record to use the i64i8imm ParserMatchClass so that a push with a small constant produces a 2-byte push. llvm-svn: 134501	2011-07-06 17:23:46 +00:00
Evan Cheng	1112260be0	Remove the (unused) AsmWriterEmitter feature that relies on TargetSubtargetInfo. llvm-svn: 134457	2011-07-06 02:02:33 +00:00
Eli Friedman	9765ae0015	Add assembler/disassembler support for non-AVX pclmulqdq. While I'm here, use proper aliases for the pclmullqlqdq and friends. PR10269. llvm-svn: 134424	2011-07-05 18:21:20 +00:00
Jakob Stoklund Olesen	4d72701c7e	Consistent diagnostic capitalization and redundant context elimination. llvm-svn: 134311	2011-07-02 07:23:40 +00:00
Jakob Stoklund Olesen	c19c47697f	Include a source location when complaining about bad inline assembly. Add a MI->emitError() method that the backend can use to report errors related to inline assembly. Call it from X86FloatingPoint.cpp when the constraints are wrong. This enables proper clang diagnostics from the backend: $ clang -c pr30848.c pr30848.c:5:12: error: Inline asm output regs must be last on the x87 stack __asm__ ("" : "=u" (d)); /* { dg-error "output regs" } */ ^ 1 error generated. llvm-svn: 134307	2011-07-02 03:53:34 +00:00
Eric Christopher	7260817287	TargetConstant immediates won't be placed into registers so tighten up the valid constant check earlier. rdar://9692967 llvm-svn: 134286	2011-07-01 23:04:38 +00:00
Evan Cheng	018b2055fc	Rename XXXGenSubtarget.inc to XXXGenSubtargetInfo.inc for consistency. llvm-svn: 134281	2011-07-01 22:36:09 +00:00
Evan Cheng	a230202d5e	Add MCSubtargetInfo target registry stuff. llvm-svn: 134279	2011-07-01 22:25:04 +00:00
Eli Friedman	c3fee5e2c7	Calling-convention specifications for illegal types are no-ops. Simplify based on this. llvm-svn: 134264	2011-07-01 21:33:28 +00:00
Evan Cheng	e7e74a3250	Rename TargetSubtarget to TargetSubtargetInfo for consistency. llvm-svn: 134259	2011-07-01 21:01:15 +00:00
Evan Cheng	771cdf9b5d	- Added MCSubtargetInfo to capture subtarget features and scheduling itineraries. - Refactor TargetSubtarget to be based on MCSubtargetInfo. - Change tablegen generated subtarget info to initialize MCSubtargetInfo and hide more details from targets. llvm-svn: 134257	2011-07-01 20:45:01 +00:00
Evan Cheng	157d40fba1	Hide the call to InitMCInstrInfo into tblgen generated ctor. llvm-svn: 134244	2011-07-01 17:57:27 +00:00
Bill Wendling	6aa9fb80dc	Use the correct registers on X86_64. llvm-svn: 134208	2011-06-30 23:47:14 +00:00
Jakob Stoklund Olesen	8b22811785	Fix a problem with fast-isel return values introduced in r134018. We would put the return value from long double functions in the wrong register. This fixes gcc.c-torture/execute/conversion.c llvm-svn: 134205	2011-06-30 23:42:18 +00:00
Bill Wendling	28c3cfe015	Add a target hook to get the register number used by the compact unwind encoding for the registers it knows about. Return -1 if it can't handle that register. llvm-svn: 134202	2011-06-30 23:20:32 +00:00
Jakob Stoklund Olesen	074d0abb1a	Tweak error messages to match GCC. Should fix gcc.target/i386/pr30848.c llvm-svn: 134193	2011-06-30 21:30:30 +00:00
Evan Cheng	034261674b	Fix the ridiculous SubtargetFeatures API where it implicitly expects the CPU name to be encoded as the first feature. It then uses the CPU name to look up features / scheduling itinerary even though clients know full well the CPU name being used to query these properties. The fix is to just have the clients explicitly pass the CPU name! llvm-svn: 134127	2011-06-30 01:53:36 +00:00
Joerg Sonnenberger	708b6e085d	Recognize the xstorerng alias for VIA PadLock's xstore instruction. llvm-svn: 134126	2011-06-30 01:38:03 +00:00
Eric Christopher	7ce905754f	Fix a small thinko for constant i64 lock/orq optimization where we didn't have an opcode for 64-bit constant or expressions. Fixes rdar://9692967 llvm-svn: 134121	2011-06-30 00:48:30 +00:00
Jakob Stoklund Olesen	6856a8e3c8	Always adjust the stack pointer immediately after the call. Some x86-32 calls pop values off the stack, and we need to readjust the stack pointer after the call. This happens when ADJCALLSTACKUP is eliminated. It could happen that spill code was inserted between the CALL and ADJCALLSTACKUP instructions, and we would compute wrong stack pointer offsets for those frame index references. Fix this by inserting the stack pointer adjustment immediately after the call instead of where the ADJCALLSTACKUP instruction was erased. I don't have a test case since we don't currently insert code in that position. We will soon, though. I am testing a regalloc patch that didn't work on Linux because of this. llvm-svn: 134113	2011-06-29 23:11:39 +00:00
Eric Christopher	3cd31a95dd	Use getRegForInlineAsmConstraint instead of custom defining regclasses via vectors. Part of rdar://9643582 llvm-svn: 134079	2011-06-29 17:23:50 +00:00
Evan Cheng	65e7766262	Move CallFrameSetupOpcode and CallFrameDestroyOpcode to TargetInstrInfo. llvm-svn: 134030	2011-06-28 21:14:33 +00:00
Evan Cheng	b83b307ae8	Hide more details in tablegen generated MCRegisterInfo ctor function. llvm-svn: 134027	2011-06-28 20:44:22 +00:00
Evan Cheng	61530114d5	Add MCInstrInfo registration machinery. llvm-svn: 134026	2011-06-28 20:29:03 +00:00
Evan Cheng	a115f77785	Merge XXXGenRegisterNames.inc into XXXGenRegisterInfo.inc llvm-svn: 134024	2011-06-28 20:07:07 +00:00
Evan Cheng	4a169be530	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Change TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Jakob Stoklund Olesen	7d3e1553d2	Clean up the handling of the x87 fp stack to make it more robust. Drop the FpMov instructions, use plain COPY instead. Drop the FpSET/GET instruction for accessing fixed stack positions. Instead use normal COPY to/from ST registers around inline assembly, and provide a single new FpPOP_RETVAL instruction that can access the return value(s) from a call. This is still necessary since you cannot tell from the CALL instruction alone if it returns anything on the FP stack. Teach fast isel to use this. This provides a much more robust way of handling fixed stack registers - we can tolerate arbitrary FP stack instructions inserted around calls and inline assembly. Live range splitting could sometimes break x87 code by inserting spill code in unfortunate places. As a bonus we handle floating point inline assembly correctly now. llvm-svn: 134018	2011-06-28 18:32:28 +00:00
Evan Cheng	2c06c8b3c2	More refactoring. Move getRegClass from TargetOperandInfo to TargetInstrInfo. llvm-svn: 133944	2011-06-27 21:26:13 +00:00
Evan Cheng	6fea701360	Merge XXXGenRegisterDesc.inc XXXGenRegisterNames.inc XXXGenRegisterInfo.h.inc into XXXGenRegisterInfo.inc. llvm-svn: 133922	2011-06-27 18:32:37 +00:00
Jakob Stoklund Olesen	80238b93fb	Grow the X86FloatingPoint register map to hold 16 registers. This allows for more live scratch registers which is needed to handle live ST registers before return and inline asm instructions. llvm-svn: 133903	2011-06-27 04:08:36 +00:00
Chad Rosier	9a11e3e082	Replace dyn_cast<> with cast<> since the cast is already guarded by the necessary check. llvm-svn: 133874	2011-06-25 18:51:28 +00:00
Chad Rosier	7c292f757f	Enable tail call optimization in the presence of a byval (x86-32 and x86-64). <rdar://problem/9483883> llvm-svn: 133858	2011-06-25 02:04:56 +00:00
Douglas Gregor	a1ab267c45	Unbreak CMake build llvm-svn: 133853	2011-06-25 00:51:50 +00:00
Evan Cheng	7b857b24bb	Add include guard. llvm-svn: 133847	2011-06-24 23:59:54 +00:00
Evan Cheng	2fa8b44985	Rename TargetDesc to MCTargetDesc llvm-svn: 133846	2011-06-24 23:53:19 +00:00
Jim Grosbach	440526a1e8	Refactor MachO relocation generation into the Target directories. Move the target-specific RecordRelocation logic out of the generic MC MachObjectWriter and into the target-specific object writers. This allows nuking quite a bit of target knowledge from the supposedly target-independent bits in lib/MC. llvm-svn: 133844	2011-06-24 23:44:37 +00:00
Chad Rosier	70f20abc37	Hoist simple check above more complex checking to avoid unnecessary overheads. No functional change intended. llvm-svn: 133824	2011-06-24 21:15:36 +00:00
Evan Cheng	391461842d	- Add MCRegisterInfo registration machinery. Also added x86 registration routines. - Rename TargetRegisterDesc to MCRegisterDesc. llvm-svn: 133820	2011-06-24 20:42:09 +00:00
Evan Cheng	e0801b07e0	Starting to refactor Target to separate out code that's needed to fully describe target machine from those that are only needed by codegen. The goal is to sink the essential target description into MC layer so we can start building MC based tools without needing to link in the entire codegen. First step is to refactor TargetRegisterInfo. This patch added a base class MCRegisterInfo which TargetRegisterInfo is derived from. Changed TableGen to separate register description from the rest of the stuff. llvm-svn: 133782	2011-06-24 01:44:41 +00:00
Eli Friedman	802029c494	Add support for movntil/movntiq mnemonics. Reported on llvmdev. llvm-svn: 133759	2011-06-23 21:07:47 +00:00
Evan Cheng	ed34559fcd	Rename TargetOptions::StackAlignment to StackAlignmentOverride. llvm-svn: 133739	2011-06-23 18:15:47 +00:00
Evan Cheng	f86a6485e7	Remove TargetOptions.h dependency from X86Subtarget. llvm-svn: 133726	2011-06-23 17:54:54 +00:00
Evan Cheng	71256b6030	Get rid of one getStackAlignment(). RegisterInfo shouldn't need to know about stack alignment. llvm-svn: 133679	2011-06-23 01:53:43 +00:00
Nick Lewycky	8e5c09b7dc	Add support for assembling "movq" when it's correct to do so, while continuing to emit "movd" across the board to continue supporting a Darwin assembler bug. This is the reincarnation of r133452. llvm-svn: 133565	2011-06-21 22:45:41 +00:00
Bob Wilson	5b04895bb8	Revert r133452: "Emit movq for 64-bit register to XMM register moves..." This is breaking compiler-rt and llvm-gcc builds on MacOSX when not using the integrated assembler. llvm-svn: 133524	2011-06-21 17:35:13 +00:00
Nick Lewycky	831fb8200d	Emit movq for 64-bit register to XMM register moves, but continue to accept movd when assembling. llvm-svn: 133452	2011-06-20 18:33:26 +00:00
Benjamin Kramer	8fa1866146	Remove unused but set variables. llvm-svn: 133347	2011-06-18 11:09:41 +00:00
Jakob Stoklund Olesen	434b0e8aef	Switch x86 to using AltOrders instead of MethodBodies. llvm-svn: 133325	2011-06-18 01:14:43 +00:00
Jakob Stoklund Olesen	e01d928b0f	SI, DI, BP, and SP don't have 8-bit sub-registers in x86 mode. llvm-svn: 133308	2011-06-17 23:15:00 +00:00
Dan Gohman	4762d28ff9	Add a comment describing why transforming (shl x, 1) to (add x, x) is to be considered safe enough in this context. llvm-svn: 133159	2011-06-16 15:55:48 +00:00
Bruno Cardoso Lopes	f52f4dd0b8	Add AVX support for fpextend. Original patch by Syoyo Fujita with more comments by me. llvm-svn: 133153	2011-06-16 07:03:21 +00:00
Jakob Stoklund Olesen	d89900e14c	Use set operations instead of plain lists to enumerate register classes. This simplifies many of the target description files since it is common for register classes to be related or contain sequences of numbered registers. I have verified that this doesn't change the files generated by TableGen for ARM and X86. It alters the allocation order of MBlaze GPR and Mips FGR32 registers, but I believe the change is benign. llvm-svn: 133105	2011-06-15 23:28:14 +00:00
John McCall	e6835ee44e	Add a new function attribute, nonlazybind, which inhibits lazy-loading optimizations when emitting calls to the function; instead those calls may use faster relocations which require the function to be immediately resolved upon loading the dynamic object featuring the call. This is useful when it is known that the function will be called frequently and pervasively and therefore there is no merit in delaying binding of the function. Currently only implemented for x86-64, where it turns into a call through the global offset table. Patch by Dan Gohman, who assures me that he's going to add LangRef documentation for this once it's committed. llvm-svn: 133080	2011-06-15 20:36:13 +00:00
Bruno Cardoso Lopes	b6afc5168f	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Nick Lewycky	6a95970b19	Fit banner in 80-col and adjust whitespace. No functionality changes. llvm-svn: 132964	2011-06-14 03:23:52 +00:00
Rafael Espindola	db58547906	AnalyzeBranch doesn't change which successors a bb has, just the order we try to branch to them. Before we were creating successor lists with duplicated entries. Fixing that found a bug in isBlockOnlyReachableByFallthrough that would cause it to return the wrong answer for ----------- ... jne foo jmp bar foo: ---------- llvm-svn: 132882	2011-06-12 03:20:32 +00:00
Charles Davis	27dba856ab	Put FrameSetup flag on x86 instructions that set up the call frame. No functionality change. Later on, we'll use the flag to emit SEH pseudo-ops that describe how the call frame was built. llvm-svn: 132880	2011-06-12 01:45:54 +00:00
Eli Friedman	cbadeac131	Make sure to pass OpFlags into MachineInstrBuilder::addExternalSymbol; the memcpy/memset symbol doesn't get marked up correctly in PIC modes otherwise. Should fix llvm-x86_64-linux-checks buildbot. Followup to r132864. llvm-svn: 132869	2011-06-11 01:55:07 +00:00
Eli Friedman	0bb1c525fd	Add full x86 fast-isel support for memcpy and memset. rdar://9431466 llvm-svn: 132864	2011-06-10 23:39:36 +00:00
Eli Friedman	950df94d25	PR10092 (second try): Don't crash on a load without a memoperand; fast-isel creates loads like this. llvm-svn: 132826	2011-06-10 01:13:01 +00:00
Eli Friedman	66d3e9e11f	Chris fixed this README a while back by changing how clang generates code for structs like the given struct. llvm-svn: 132815	2011-06-09 23:02:19 +00:00
Eli Friedman	f2dbd3e767	Revert 132789; it breaks tests. My mistake. llvm-svn: 132795	2011-06-09 19:33:30 +00:00
Eli Friedman	d04e75fca2	Add a check to make sure we don't crash with strange configurations where we do fast-isel, then try to fold instructions. PR10092. llvm-svn: 132789	2011-06-09 18:55:00 +00:00
Jakob Stoklund Olesen	164dc685e5	Remove custom allocation order boilerplate that is no longer needed. The register allocators automatically filter out reserved registers and place the callee saved registers last in the allocation order, so custom methods are no longer necessary just for that. Some targets still use custom allocation orders: ARM/Thumb: The high registers are removed from GPR in thumb mode. The NEON allocation orders prefer to use non-VFP2 registers first. X86: The GR8 classes omit AH-DH in x86-64 mode to avoid REX trouble. SystemZ: Some of the allocation orders are omitting R12 aliases without explanation. I don't understand this target well enough to fix that. It looks like all the boilerplate could be removed by reserving the right registers. llvm-svn: 132781	2011-06-09 16:56:59 +00:00
Eric Christopher	1ae9ec6124	Add a parameter to CCState so that it can access the MachineFunction. No functional change. Part of PR6965 llvm-svn: 132763	2011-06-08 23:55:35 +00:00
Stuart Hastings	d044ba7a9f	Followup to 132458, omit unnecessary stack copy when x87 input is a load. rdar://problem/6373334 llvm-svn: 132696	2011-06-06 23:15:58 +00:00
Stuart Hastings	ea8b49dff3	Reapply 132424 with fixes. This fixes PR10068. rdar://problem/5993888 llvm-svn: 132606	2011-06-03 23:53:54 +00:00
Eric Christopher	d68494ffdd	Have LowerOperandForConstraint handle multiple character constraints. Part of rdar://9119939 llvm-svn: 132510	2011-06-02 23:16:42 +00:00
Jakob Stoklund Olesen	409986a648	Flag unallocatable register classes instead of giving them empty allocation orders. llvm-svn: 132509	2011-06-02 23:07:24 +00:00
Rafael Espindola	1299f014d4	Revert 132424 to fix PR10068. llvm-svn: 132479	2011-06-02 19:57:47 +00:00
Stuart Hastings	8447f18f85	Omit unnecessary stack copy when x87 input is a load. rdar://problem/6373334 llvm-svn: 132458	2011-06-02 15:57:11 +00:00
Jakob Stoklund Olesen	25716baae0	Use TRI::has{Sub,Super}ClassEq() where possible. No functional change. llvm-svn: 132455	2011-06-02 05:43:46 +00:00
Rafael Espindola	ee123951a2	Don't hardcode the %reg format in the streamer. llvm-svn: 132451	2011-06-02 02:34:55 +00:00
Stuart Hastings	9a085fb9d8	Recommit 132404 with fixes. rdar://problem/5993888 llvm-svn: 132424	2011-06-01 21:33:14 +00:00
Stuart Hastings	4b33767382	Revert 132404 to appease a buildbot. rdar://problem/5993888 llvm-svn: 132419	2011-06-01 19:52:20 +00:00
Stuart Hastings	23f5ceda96	Add support for x86 CMPEQSS and friends. These instructions do a floating-point comparison, generate a mask of 0s or 1s, and generally DTRT with NaNs. Only profitable when the user wants a materialized 0 or 1 at runtime. rdar://problem/5993888 llvm-svn: 132404	2011-06-01 17:17:45 +00:00
Jakob Stoklund Olesen	283a7e46b5	Fix PR10059 and future variations by handling all register subclasses. Add TargetRegisterInfo::hasSubClassEq and use it to check for compatible register classes instead of trying to list all register classes in X86's getLoadStoreRegOpcode. llvm-svn: 132398	2011-06-01 15:32:10 +00:00
Stuart Hastings	fdc9e4af68	FGETSIGN support for x86, using movmskps/pd. Will be enabled with a patch to TargetLowering.cpp. rdar://problem/5660695 llvm-svn: 132388	2011-06-01 04:39:42 +00:00
Rafael Espindola	33f7d7f9fa	Use the dwarf->llvm mapping to print register names in the cfi directives. Fixes PR9826. llvm-svn: 132317	2011-05-30 20:20:15 +00:00
Rafael Espindola	5917c1f6ec	Introduce the DwarfRegAlias class for declaring that two registers have the same dwarf number. This will be used for creating a dwarf number to register mapping. The only case that needs this so far is the XMM/YMM registers that unfortunately do have the same numbers. llvm-svn: 132314	2011-05-30 17:49:59 +00:00
Rafael Espindola	00ba4a56e0	Mark the 32 bit registers as invalid in 64 bit mode. In 64 bit mode they are subregisters of the 64 bit ones. llvm-svn: 132313	2011-05-30 16:04:54 +00:00
Rafael Espindola	707fa44bc0	Add 132187 back now that the real problem is fixed. llvm-svn: 132238	2011-05-28 00:24:37 +00:00
Rafael Espindola	8ed6285c8d	It looks like 132187 might have broken the llvm-gcc bootstrap. Revert while I check. llvm-svn: 132230	2011-05-27 23:36:02 +00:00
Cameron Zwarich	ded03d4e24	Add a GR32_NOREX_NOSP register class and fix a bug where getMatchingSuperRegClass() was saying that the matching superregister class of GR32_NOREX in GR64_NOREX_NOSP is GR64_NOREX, which drops the NOSP constraint. This fixes PR10032. llvm-svn: 132225	2011-05-27 22:26:04 +00:00
Jakob Stoklund Olesen	021b1ff0c7	Delete MethodBodies that only filtered reserved registers. The register allocators know to filter reserved registers from the allocation orders, so we don't need all of this boilerplate. llvm-svn: 132199	2011-05-27 18:27:13 +00:00
Rafael Espindola	7e68d3bf57	Remove dwarf numbers from subregs. We should use DW_OP_bit_piece to refer to them. I tested this with both check-all and the gdb testsuite. llvm-svn: 132187	2011-05-27 15:08:24 +00:00
Chad Rosier	b87c4a6945	Renamed llvm.x86.sse42.crc32 intrinsics; crc64 doesn't exist. crc32.[8|16|32] have been renamed to .crc32.32.[8|16|32] and crc64.[8|16|32] have been renamed to .crc32.64.[8|64]. llvm-svn: 132163	2011-05-26 23:13:19 +00:00
Stuart Hastings	837a958ff6	Reverting 132105: it broke some LLVM-GCC DejaGNU tests. llvm-svn: 132108	2011-05-26 04:09:49 +00:00
Stuart Hastings	e704bfb21e	Correctly handle a one-word struct passed byval on x86_64. rdar://problem/6920088 llvm-svn: 132105	2011-05-26 02:44:56 +00:00
Eli Friedman	93ffb875ad	Rewrite fast-isel integer cast handling to handle more cases, and to be simpler and more consistent. The practical effects here are that x86-64 fast-isel can now handle trunc from i8 to i1, and ARM fast-isel can handle many more constructs involving integers narrower than 32 bits (including loads, stores, and many integer casts). rdar://9437928 . llvm-svn: 132099	2011-05-25 23:49:02 +00:00
Francois Pichet	b2042fbfe2	Remove unused OpcodeMask enumerator. llvm-svn: 132062	2011-05-25 17:02:53 +00:00
Francois Pichet	fab3c58733	Fix MSVC warning: "is out of range for enum constant" MSVC doesn't support 64 bit enum. OpcodeMask is not used anywhere in the code base. llvm-svn: 132057	2011-05-25 15:58:10 +00:00
Rafael Espindola	70213c7c5f	Replace the -unwind-tables option with a per function flag. This is more LTO friendly as we can now correctly merge files compiled with or without -fasynchronous-unwind-tables. llvm-svn: 132033	2011-05-25 03:44:17 +00:00
Charles Davis	3ac82d9bb2	Add a method to TargetRegisterInfo to get the register number that the Win64 EH scheme uses internally. Implement it for x86 (the only architecture that LLVM supports for which this matters right now). llvm-svn: 131969	2011-05-24 16:57:53 +00:00
Evan Cheng	b5950697e8	- Teach SelectionDAG::isKnownNeverZero to return true (op x, c) when c is non-zero. - Teach X86 cmov optimization to eliminate the cmov from ctlz, cttz extension when the source of X86ISD::BSR / X86ISD::BSF is proven to be non-zero. rdar://9490949 llvm-svn: 131948	2011-05-24 01:48:22 +00:00
Chris Lattner	5442c034a8	add a missing alias to make us more bug compatible with gcc, PR9378 llvm-svn: 131874	2011-05-22 22:31:57 +00:00
Benjamin Kramer	85e86083d5	X86: smulo -> add is now done target-independently in DAGCombiner, remove the patterns. llvm-svn: 131801	2011-05-21 18:32:01 +00:00
Cameron Zwarich	28ea8de263	Fix PR9978 by adding RIP to GR64_TC so it can be used as an address in PIC code. It is already in GR64 for the same reasons. Since it isn't allocatable it can't cause any problems. llvm-svn: 131787	2011-05-21 04:13:49 +00:00
Eli Friedman	dfd96ebe52	Add fast-isel support for byval calls on x86. llvm-svn: 131764	2011-05-20 22:21:04 +00:00
Stuart Hastings	e3158f93ec	Re-commit 131641 with fixes; de-pseudoize MOVSX16rr8 and friends. rdar://problem/8614450 llvm-svn: 131746	2011-05-20 19:04:40 +00:00
Benjamin Kramer	83096d1db1	Rename the "sandybridge" subtarget to "corei7-avx", for GCC compatibility. llvm-svn: 131730	2011-05-20 15:11:26 +00:00
Chad Rosier	a5f0bb3719	Don't attempt to tail call optimize for Win64. llvm-svn: 131709	2011-05-20 00:59:28 +00:00
Evan Cheng	a3f5204c82	Revert r131664 and fix it in instcombine instead. rdar://9467055 llvm-svn: 131708	2011-05-20 00:54:37 +00:00
Eli Friedman	ecdbb58b95	Add fast-isel support for zeroext and signext ret instructions on x86. llvm-svn: 131689	2011-05-19 22:16:13 +00:00
Eric Christopher	74a9e350d2	Oddly people want to use the 'r' constraint for fp constants on x86. Fixes rdar://9218925 Fixes PR9601 llvm-svn: 131682	2011-05-19 21:33:47 +00:00
Rafael Espindola	826d41a144	ADD64ri32 sign extends its argument, so we need to use a R_X86_64_32S. Fixes PR9934. We really need to start tblgening the relocation info :-( llvm-svn: 131669	2011-05-19 20:32:34 +00:00
Evan Cheng	efcc06b08f	crc32 with 64-bit output zeros upper 32-bits. rdar://9467055 llvm-svn: 131664	2011-05-19 18:57:12 +00:00
Stuart Hastings	ff15dfa12e	Reverting 131641 to investigate 'bot complaint. llvm-svn: 131654	2011-05-19 17:54:42 +00:00
Stuart Hastings	7baa1babdb	Revise MOVSX16rr8/MOVZX16rr8 (and rm variants) to no longer be pseudos. rdar://problem/8614450 llvm-svn: 131641	2011-05-19 16:59:50 +00:00
Eli Friedman	2bfd6b0b85	Revert unintentional commit. llvm-svn: 131597	2011-05-18 23:13:10 +00:00
Eli Friedman	2fa7bea638	More instcombine simplifications towards better debug locations. llvm-svn: 131596	2011-05-18 23:11:30 +00:00
Cameron Zwarich	8164175e57	Reserve the segment registers on x86 to fix verifier failures in any code that uses them. llvm-svn: 131591	2011-05-18 22:24:48 +00:00
Chad Rosier	be943c5d9a	Enables vararg functions that pass all arguments via registers to be optimized into tail-calls when possible. llvm-svn: 131560	2011-05-18 19:59:50 +00:00
Mon P Wang	602defb22e	Enable autodetect of popcnt llvm-svn: 131476	2011-05-17 18:33:37 +00:00
Eli Friedman	ba315a4fcc	Add x86 fast-isel for calls returning first-class aggregates. rdar://9435872. This is r131438 with a couple small fixes. llvm-svn: 131474	2011-05-17 18:29:03 +00:00
Eli Friedman	42d94ce561	Clean up the mess created by r131467+r131469. llvm-svn: 131471	2011-05-17 18:02:22 +00:00
Stuart Hastings	581113d8a0	Revert 131467 due to buildbot complaint. llvm-svn: 131469	2011-05-17 16:59:46 +00:00
Stuart Hastings	a2509a7ec3	Fix an obscure issue in X86_64 parameter passing: if a tiny byval is passed as the fifth parameter, ensure it's passed correctly (in R9). rdar://problem/6920088 llvm-svn: 131467	2011-05-17 16:45:55 +00:00
Nadav Rotem	1b263b575b	Fix a bug in PerformEXTRACT_VECTOR_ELTCombine. The code created an ADD SDNode with two different types, in cases where the index and the ptr had different types. llvm-svn: 131461	2011-05-17 08:31:57 +00:00
Eric Christopher	d613c05f26	Update comment. llvm-svn: 131459	2011-05-17 08:16:14 +00:00
Eric Christopher	c03ef7ebb3	Support XOR and AND optimization with no return value. Finishes off rdar://8470697 llvm-svn: 131458	2011-05-17 08:10:18 +00:00
Eric Christopher	f81a665961	Couple less magic numbers. llvm-svn: 131457	2011-05-17 07:50:41 +00:00
Eric Christopher	dc12267689	Make this code a little less magic number laden. llvm-svn: 131456	2011-05-17 07:47:55 +00:00
Chris Lattner	294ec479fb	add a note llvm-svn: 131455	2011-05-17 07:22:33 +00:00
Eli Friedman	3aa2fe389f	Back out r131444 and r131438; they're breaking nightly tests. I'll look into it more tomorrow. llvm-svn: 131451	2011-05-17 02:36:59 +00:00


7422 Commits