llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 05:52:53 +02:00

Author	SHA1	Message	Date
Chris Lattner	0304b82f80	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Bill Wendling	0984f4927e	Reapply r129401 with patch for clang. llvm-svn: 129419	2011-04-13 00:36:11 +00:00
Bill Wendling	f6446a0961	Revert r129401 for now. Clang is using the old way of doing things. llvm-svn: 129403	2011-04-12 22:59:27 +00:00
Bill Wendling	f9c9d3e05b	Remove the unaligned load intrinsics in favor of using native unaligned loads. Now that we have a first-class way to represent unaligned loads, the unaligned load intrinsics are superfluous. First part of <rdar://problem/8460511>. llvm-svn: 129401	2011-04-12 22:46:31 +00:00
Andrew Trick	7db197d209	Increased the register pressure limit on x86_64 from 8 to 12 regs. This is the only change in this checkin that may affects the default scheduler. With better register tracking and heuristics, it doesn't make sense to artificially lower the register limit so much. Added -sched-high-latency-cycles and X86InstrInfo::isHighLatencyDef to give the scheduler a way to account for div and sqrt on targets that don't have an itinerary. It is currently defaults to 10 (the actual number doesn't matter much), but only takes effect on non-default schedulers: list-hybrid and list-ilp. Added several heuristics that can be individually disabled for the non-default sched=list-ilp mode. This helps us determine how much better we can do on a given benchmark than the default scheduler. Certain compute intensive loops run much faster in this mode with the right set of heuristics, and it doesn't seem to have much negative impact elsewhere. Not all of the heuristics are needed, but we still need to experiment to decide which should be disabled by default for sched=list-ilp. llvm-svn: 127067	2011-03-05 08:00:22 +00:00
Evan Cheng	9db7b1367d	Fix bug in X86 folding / unfolding table. Int_CMPSDrm and Int_CMPSSrm memory operands starts at index 2, not 1. rdar://9045024 PR9305 llvm-svn: 126359	2011-02-24 02:36:52 +00:00
NAKAMURA Takumi	8ace7260cc	Target/X86: Tweak win64's tailcall. llvm-svn: 124272	2011-01-26 02:04:09 +00:00
NAKAMURA Takumi	066378440a	Fix whitespace. llvm-svn: 124270	2011-01-26 02:03:37 +00:00
Nate Begeman	4a62a3e229	Add support for AVX to materialize +0.0 when doing scalar FP. llvm-svn: 121415	2010-12-09 21:43:51 +00:00
Anton Korobeynikov	c87f68e32e	Move callee-saved regs spills / reloads to TFI llvm-svn: 120228	2010-11-27 23:05:03 +00:00
Evan Cheng	1c8dafd12a	Re-enable register pressure aware machine licm with fixes. Hoist() may have erased the instruction during LICM so UpdateRegPressureAfter() should not reference it afterwards. llvm-svn: 116845	2010-10-19 18:58:51 +00:00
Daniel Dunbar	6ff550c84d	Revert r116781 "- Add a hook for target to determine whether an instruction def is", which breaks some nightly tests. llvm-svn: 116816	2010-10-19 17:14:24 +00:00
Evan Cheng	9c3f6f486e	- Add a hook for target to determine whether an instruction def is "long latency" enough to hoist even if it may increase spilling. Reloading a value from spill slot is often cheaper than performing an expensive computation in the loop. For X86, that means machine LICM will hoist SQRT, DIV, etc. ARM will be somewhat aggressive with VFP and NEON instructions. - Enable register pressure aware machine LICM by default. llvm-svn: 116781	2010-10-19 00:55:07 +00:00
Jakob Stoklund Olesen	499fe39d23	Remove the x86 MOV{32,64}{rr,rm,mr}_TC instructions. The reg-reg copies were no longer being generated since copyPhysReg copies physical registers only. The loads and stores are not necessary - The TC constraint is imposed by the TAILJMP and TCRETURN instructions, there should be no need for constrained loads and stores. llvm-svn: 116314	2010-10-12 17:15:00 +00:00
Chris Lattner	82ce325f16	reapply: Use the new TB_NOT_REVERSABLE flag instead of special reapply: reimplement the second half of the or/add optimization. We should now with no changes. Turns out that one missing "Defs = [EFLAGS]" can upset things a bit. llvm-svn: 116040	2010-10-08 03:57:25 +00:00
Chris Lattner	fbdd285dd6	reapply the patch reverted in r116033: "Reimplement (part of) the or -> add optimization. Matching 'or' into 'add'" With a critical fix: the add pseudos clobber EFLAGS. llvm-svn: 116039	2010-10-08 03:54:52 +00:00
Daniel Dunbar	d3b6b8bf2b	Revert "Reimplement (part of) the or -> add optimization. Matching 'or' into 'add'", which seems to have broken just about everything. llvm-svn: 116033	2010-10-08 02:07:32 +00:00
Daniel Dunbar	59848f6703	Revert "Use the new TB_NOT_REVERSABLE flag instead of special ", which depends on r116007, which I am about to revert. llvm-svn: 116032	2010-10-08 02:07:29 +00:00
Daniel Dunbar	983fae5a86	Revert "reimplement the second half of the or/add optimization. We should now", which depends on r116007, which I am about to revert. llvm-svn: 116031	2010-10-08 02:07:26 +00:00
Chris Lattner	7577cb7b49	reimplement the second half of the or/add optimization. We should now only end up emitting LEA instead of OR. If we aren't able to promote something into an LEA, we should never be emitting it as an ADD. Add some testcases that we emit "or" in cases where we used to produce an "add". llvm-svn: 116026	2010-10-08 01:05:10 +00:00
Chris Lattner	d62e94b465	Use the new TB_NOT_REVERSABLE flag instead of special casing FsMOVAPDrr/FsMOVAPSrr. llvm-svn: 116016	2010-10-08 00:03:02 +00:00
Chris Lattner	72e7e84c3f	simplify some map operations. llvm-svn: 116014	2010-10-07 23:57:02 +00:00
Chris Lattner	d8f05bf65e	Reimplement (part of) the or -> add optimization. Matching 'or' into 'add' is general goodness because it allows ORs to be converted to LEA to avoid inserting copies. However, this is bad because it makes the generated .s file less obvious and gives valgrind heartburn (tons of false positives in bitfield code). While the general fix should be in valgrind, we can at least try to avoid emitting ADD instructions that don't get promoted to LEA. This is more work because it requires introducing pseudo instructions to represents "add that knows the bits are disjoint", but hey, people really love valgrind. This fixes this testcase: https://bugs.kde.org/show_bug.cgi?id=242137#c20 the add r/i cases are coming next. llvm-svn: 116007	2010-10-07 23:36:18 +00:00
Chris Lattner	2212441ff7	Reduce casting in various tables by defining the table with the right types. llvm-svn: 116001	2010-10-07 23:08:41 +00:00
Chris Lattner	17850b2677	simplify code: don't build up vector only to assert it is empty. llvm-svn: 115997	2010-10-07 22:26:19 +00:00
Jakob Stoklund Olesen	5e329859cd	Constrain the offset register to a *_NOSP register class when inserting LEA instructions. This unbreaks the machine code verifier and fixes PR8317. llvm-svn: 115879	2010-10-07 00:07:26 +00:00
Chris Lattner	195a9c3877	Use #NAME# to have the CMOV multiclass define things with the same names as before (e.g. CMOVBE16rr instead of CMOVBErr16). llvm-svn: 115705	2010-10-05 23:00:14 +00:00
Chris Lattner	c3c03dfeff	switch CMOVBE to the multipattern: 21 insertions(+), 53 deletions(-) Moar change coming before I switch the rest. llvm-svn: 115697	2010-10-05 22:23:58 +00:00
Chris Lattner	c14d59589c	add basic avx support to the disassembler, also teach it about ssmem/sdmem operands. With this done, we can remove the _Int suffixes from the round instructions without the disassembler blowing up. This allows the assembler to support them, implementing rdar://8456376 - llvm-mc rejects 'roundss' llvm-svn: 115019	2010-09-29 02:57:56 +00:00
Chris Lattner	f90296b045	add asmparser support for cvttpd2dq by removing some Int_ prefixes. Clean up cvttps2dq by removing some redundant implementations of the same instruction. rdar://8456382 llvm-svn: 115018	2010-09-29 02:36:32 +00:00
Chris Lattner	e5c5c8dc1f	implement rdar://8456382 - cvtsd2si support, by removing some Int_ prefixes. llvm-svn: 115017	2010-09-29 02:24:57 +00:00
Chris Lattner	1ff3935290	fix rdar://8456412 - llvm-mc crash in encoder on "mov %rdx, %cr8" Teaching the code generator about CR8-15, how to rex them up, etc. llvm-svn: 114533	2010-09-22 05:29:50 +00:00
Dan Gohman	aaed2c137f	Avoid emitting a PIC base register if no PIC addresses are needed. This fixes rdar://8396318. llvm-svn: 114201	2010-09-17 20:24:24 +00:00
Anton Korobeynikov	62a9879ef4	Properly handle passing of FP stuff to varargs function on Win64: value should be copied to the corresponding shadow reg as well. Patch by Cameron Esfahani! llvm-svn: 112262	2010-08-27 14:43:06 +00:00
Anton Korobeynikov	52c8ecf231	Revert part of one of the prev. patches - tailjmp will follow later. llvm-svn: 111291	2010-08-17 21:08:28 +00:00
Anton Korobeynikov	f1f88db4fd	Enable more win64 calls folding opportunities. Patch by Cameron Esfahani! llvm-svn: 111288	2010-08-17 21:06:01 +00:00
Bruno Cardoso Lopes	7cb26cb8be	- Teach SSEDomainFix to switch between different levels of AVX instructions. Here we guess that AVX will have domain issues, so just implement them for consistency and in the future we remove if it's unnecessary. - Make foldMemoryOperandImpl aware of 256-bit zero vectors folding and support the 128-bit counterparts of AVX too. - Make sure MOV[AU]PS instructions are only selected when SSE1 is enabled, and duplicate the patterns to match AVX. - Add a testcase for a simple 128-bit zero vector creation. llvm-svn: 110946	2010-08-12 20:20:53 +00:00
Bruno Cardoso Lopes	43a7ba2bbc	Fix comment order llvm-svn: 110898	2010-08-12 02:08:52 +00:00
Jakob Stoklund Olesen	5a62f10abc	Fix <rdar://problem/8282498> even if it doesn't reproduce on trunk. When a register is defined by a partial load: %reg1234:sub_32 = MOV32mr <fi#-1>; GR64:%reg1234 That load cannot be folded into an instruction using the full 64-bit register. It would become a 64-bit load. This is related to the recent change to have isLoadFromStackSlot return false on a sub-register load. llvm-svn: 110874	2010-08-11 23:08:22 +00:00
Owen Anderson	f2fea95f2f	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Owen Anderson	aadd8a89ca	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Owen Anderson	b9762c07cb	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Jakob Stoklund Olesen	1dee3913d6	Revert r109652, and remove the offending assert in loadRegFromStackSlot instead. We do sometimes load from a too small stack slot when dealing with x86 arguments (varargs and smaller-than-32-bit args). It looks like we know what we are doing in those cases, so I am going to remove the assert instead of artifically enlarging stack slot sizes. The assert in storeRegToStackSlot stays in. We don't want to write beyond the bounds of a stack slot. llvm-svn: 109764	2010-07-29 17:42:27 +00:00
Jakob Stoklund Olesen	b056152ccf	The isLoadFromStackSlot and isStoreToStackSlot have no way of reporting subregister operands like this: %reg1040:sub_32bit<def> = MOV32rm <fi#-2>, 1, %reg0, 0, %reg0, %reg1040<imp-def>; mem:LD4[FixedStack-2](align=8) Make them return false when subreg operands are present. VirtRegRewriter is making bad assumptions otherwise. This fixes PR7713. llvm-svn: 109489	2010-07-27 04:17:01 +00:00
Jakob Stoklund Olesen	fa4bcde9d9	Add assertions that expose the PR7713 miscompilation: Accessing a stack slot with a too-big register class. llvm-svn: 109488	2010-07-27 04:16:58 +00:00
Chris Lattner	65ad913bec	remove the JIT "NeedsExactSize" feature and supporting logic. llvm-svn: 109167	2010-07-22 21:17:55 +00:00
Chris Lattner	367aa754b1	instead of migrating it to the MC instruction encoder, just rip out the implementation of X86InstrInfo::GetInstSizeInBytes. The code being ripped out just implemented a copy and hacked up version of the (old) instruction encoder, and is buggy and terrible in other ways. Since "GetInstSizeInBytes" is really only there to support the JIT's "NeedsExactSize" hook (which noone is using), just rip out the code. I will rip out the NeedsExactSize hook next. This resolves rdar://7617809 - switch X86InstrInfo::GetInstSizeInBytes to use X86MCCodeEmitter llvm-svn: 109149	2010-07-22 21:05:13 +00:00
Rafael Espindola	c8342b43c4	Fixes win64. It was broken by a previous patch where I missed the !isWin64 and then forced every register to be a vr128 on win64. llvm-svn: 109060	2010-07-21 23:19:57 +00:00
Nate Begeman	e7ca21ab3b	Fix a couple issues with Win64 ABI 1) all registers were spilled as xmm, regardless of actual size 2) win64 abi doesn't do the varargs-size-in-%al thing Still to look into: xmm6-15 are marked as clobbered by call instructions on win64 even though they aren't. llvm-svn: 109035	2010-07-21 20:49:52 +00:00
Jakob Stoklund Olesen	44949b2e1b	Remove the isMoveInstr() hook. llvm-svn: 108567	2010-07-16 22:35:46 +00:00
Bill Wendling	e2833a21c2	Rename DBG_LABEL PROLOG_LABEL, because it's only used during prolog emission and thus is a much more meaningful name. llvm-svn: 108563	2010-07-16 22:20:36 +00:00
Jakob Stoklund Olesen	858d6bb512	Remove the X86::FP_REG_KILL pseudo-instruction and the X86FloatingPointRegKill pass that inserted it. It is no longer necessary to limit the live ranges of FP registers to a single basic block. llvm-svn: 108536	2010-07-16 17:41:44 +00:00
Dan Gohman	5a42173004	Check begin!=end, rather than !begin. llvm-svn: 108167	2010-07-12 18:12:35 +00:00
Rafael Espindola	16319e45c6	Convert getLoadStoreRegOpcode to use a switch. llvm-svn: 108123	2010-07-12 03:43:04 +00:00
Rafael Espindola	0c1a9aa248	Convert the last getPhysicalRegisterRegClass in VirtRegRewriter.cpp to getMinimalPhysRegClass. It was used to produce spills, and it is better to use the most specific class if possible. Update getLoadStoreRegOpcode to handle GR32_AD. llvm-svn: 108115	2010-07-12 00:52:33 +00:00
Jakob Stoklund Olesen	821d058fd2	X86InstrInfo::copyRegToReg is dead. Long live copyPhysReg! llvm-svn: 108076	2010-07-11 05:44:30 +00:00
Jakob Stoklund Olesen	b1e88a2725	Don't emit st(0)/st(1) copies as FpMOV instructions. Use FpSET_ST? instead. Based on a patch by Rafael Espíndola. Attempt to make the FpSET_ST1 hack more robust, but we are still relying on FpSET_ST0 preceeding it. This is only for supporting really weird x87 inline asm. We support: FpSET_ST0 INLINEASM FpSET_ST0 FpSET_ST1 INLINEASM with and without kills on the arguments. We don't support: FpSET_ST1 FpSET_ST0 INLINEASM nor FpSET_ST1 INLINEASM Just Don't Do It! llvm-svn: 108047	2010-07-10 17:42:34 +00:00
Dan Gohman	fef30fcd5e	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Jakob Stoklund Olesen	ef941722c5	Remember the *_TC opcodes for load/store llvm-svn: 108020	2010-07-09 21:27:55 +00:00
Jakob Stoklund Olesen	d7c882a505	Automatically fold COPY instructions into stack load/store. llvm-svn: 108012	2010-07-09 20:43:13 +00:00
Jakob Stoklund Olesen	53d777f3bd	Fix a few tests llvm-svn: 108011	2010-07-09 20:43:09 +00:00
Bruno Cardoso Lopes	3676e24b67	Start the support for AVX instructions with 256-bit %ymm registers. A couple of notes: - The instructions are being added with dummy placeholder patterns using some 256 specifiers, this is not meant to work now, but since there are some multiclasses generic enough to accept them, when we go for codegen, the stuff will be already there. - Add VEX encoding bits to support YMM - Add MOVUPS and MOVAPS in the first round - Use "Y" as suffix for those Instructions: MOVUPSYrr, ... - All AVX instructions in X86InstrSSE.td will move soon to a new X86InstrAVX file. llvm-svn: 107996	2010-07-09 18:27:43 +00:00
Chris Lattner	49ac65543c	Change LEA to have 5 operands for its memory operand, just like all other instructions, even though a segment is not allowed. This resolves a bunch of gross hacks in the encoder and makes LEA more consistent with the rest of the instruction set. No functionality change. llvm-svn: 107934	2010-07-08 23:46:44 +00:00
Chris Lattner	18802e1a55	add some long-overdue enums to refer to the parts of the 5-operand X86 memory operand. llvm-svn: 107925	2010-07-08 22:41:28 +00:00
Jakob Stoklund Olesen	1ae7342eaf	Remember the VR64 register class llvm-svn: 107920	2010-07-08 22:30:35 +00:00
Jakob Stoklund Olesen	aed86b1af7	Implement X86InstrInfo::copyPhysReg llvm-svn: 107898	2010-07-08 19:46:25 +00:00
Jakob Stoklund Olesen	30aacf68b9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Jakob Stoklund Olesen	8983ea915c	Remove references to INSERT_SUBREG after de-SSA. Fix X86InstrInfo::convertToThreeAddressWithLEA to generate COPY instead of INSERT_SUBREG. llvm-svn: 107878	2010-07-08 16:40:15 +00:00
Jakob Stoklund Olesen	6afcd69bee	fix copies to/from GR8_ABCD_H even more llvm-svn: 107832	2010-07-07 23:04:56 +00:00
Jakob Stoklund Olesen	34ec644313	Allow copies between GR8_ABCD_L and GR8_ABCD_H. This fixes PR7540. llvm-svn: 107809	2010-07-07 20:33:27 +00:00
Evan Cheng	2954bb814e	- Two-address pass should not assume unfolding is always successful. - X86 unfolding should check if the instructions being unfolded has memoperands. If there is no memoperands, then it must assume conservative alignment. If this would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand etc. should not unfold the instruction. llvm-svn: 107509	2010-07-02 20:36:18 +00:00
Bill Wendling	c29fd1120a	Fix the formatting of the switch statement and add a missing break. llvm-svn: 106586	2010-06-22 22:16:17 +00:00
Rafael Espindola	5e123a1745	Fix an unintentional commit. I think I typed "git svn dcommit" in the wrong branch. I was trying to do some refactoring on the copyRegToReg, but this is realyl a work in progress and not generally useful yet. llvm-svn: 106413	2010-06-21 13:31:32 +00:00
Rafael Espindola	84260ed15a	wip llvm-svn: 106408	2010-06-21 02:17:34 +00:00
Stuart Hastings	bd7194d21c	Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This addresses a longstanding deficiency noted in many FIXMEs scattered across all the targets. This effectively moves the problem up one level, replacing eleven FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path through FastISel where we actually supply a DebugLoc, fixing Radar 7421831. llvm-svn: 106243	2010-06-17 22:43:56 +00:00
Rafael Espindola	1f31d12e8f	Merge getStoreRegOpcode and getLoadRegOpcode. llvm-svn: 105900	2010-06-12 20:13:29 +00:00
Jakob Stoklund Olesen	f0226fee37	Slightly change the meaning of the reMaterialize target hook when the original instruction defines subregisters. Any existing subreg indices on the original instruction are preserved or composed with the new subreg index. Also substitute multiple operands mentioning the original register by using the new MachineInstr::substituteRegister() function. This is necessary because there will soon be <imp-def> operands added to non read-modify-write partial definitions. This instruction: %reg1234:foo = FLAP %reg1234<imp-def> will reMaterialize(%reg3333, bar) like this: %reg3333:bar-foo = FLAP %reg333:bar<imp-def> Finally, replace the TargetRegisterInfo pointer argument with a reference to indicate that it cannot be NULL. llvm-svn: 105358	2010-06-02 22:47:25 +00:00
Rafael Espindola	f7170870cf	Remove the TargetRegisterClass member from CalleeSavedInfo llvm-svn: 105344	2010-06-02 20:02:30 +00:00
Jakob Stoklund Olesen	201408751f	Use enums instead of literals for X86 subregisters. The cases in getMatchingSuperRegClass cannot be broken up until the enums have unique values. llvm-svn: 104611	2010-05-25 17:04:16 +00:00
Jakob Stoklund Olesen	f40bb16b94	Rename X86 subregister indices to something shorter. Use the tablegen-produced enums. llvm-svn: 104493	2010-05-24 14:48:17 +00:00
Jakob Stoklund Olesen	9a54fec092	Add the SubRegIndex TableGen class. This is the beginning of purely symbolic subregister indices, but we need a bit of jiggling before the explicit numeric indices can be completely removed. llvm-svn: 104492	2010-05-24 14:48:12 +00:00
Evan Cheng	241d2c434e	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Dan Gohman	c8b4555a94	Fix assembly parsing and encoding of the pushf and popf family of instructions. llvm-svn: 104231	2010-05-20 16:16:00 +00:00
Dan Gohman	42a89be283	Teach mode load folding and unfolding code about CMP32ri8 and friends. llvm-svn: 104068	2010-05-18 21:54:15 +00:00
Dan Gohman	eaa3ee65bd	When converting a test to a cmp to fold a load, use the cmp that has an 8-bit immediate field rather than one with a wider immediate field. llvm-svn: 104064	2010-05-18 21:42:03 +00:00
Dan Gohman	79efb5a2ad	When rematerializing, use the debug location of the original instruction, rather than a location near where the new instruction is being inserted. llvm-svn: 103232	2010-05-07 01:28:10 +00:00
Dan Gohman	497e752655	Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it doesn't have to guess. llvm-svn: 103194	2010-05-06 20:33:48 +00:00
Evan Cheng	80f3051bb7	Add argument TargetRegisterInfo to loadRegFromStackSlot and storeRegToStackSlot. llvm-svn: 103193	2010-05-06 19:06:44 +00:00
Evan Cheng	6764226f8c	Frame index can be negative. llvm-svn: 102577	2010-04-29 01:13:30 +00:00
Chris Lattner	9292bad5f5	on darwin empty functions need to codegen into something of non-zero length, otherwise labels get incorrectly merged. We handled this by emitting a ".byte 0", but this isn't correct on thumb/arm targets where the text segment needs to be a multiple of 2/4 bytes. Handle this by emitting a noop. This is more gross than it should be because arm/ppc are not fully mc'ized yet. This fixes rdar://7908505 llvm-svn: 102400	2010-04-26 23:37:21 +00:00
Evan Cheng	cedc094434	Remove a redundant comment. llvm-svn: 102326	2010-04-26 08:16:57 +00:00
Evan Cheng	dc0ce1eae8	- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue. - Teach spiller to modify DBG_VALUE instructions to reference spill slots. llvm-svn: 102323	2010-04-26 07:38:55 +00:00
Dan Gohman	0e0b8cf9fd	Add const qualifiers to CodeGen's use of LLVM IR constructs. llvm-svn: 101334	2010-04-15 01:51:59 +00:00
Evan Cheng	b8861dcb04	Re-apply 101075 and fix it properly. Just reuse the debug info of the branch instruction being optimized. There is no need to --I which can deref off start of the BB. llvm-svn: 101162	2010-04-13 18:50:27 +00:00
Eric Christopher	330ca0c937	Temporarily revert r101075, it's causing invalid iterator assertions in a nightly tester. llvm-svn: 101158	2010-04-13 18:37:58 +00:00
Bill Wendling	1f2e71928c	Micro-optimization: If we have this situation: jCC L1 jmp L2 L1: ... L2: ... We can get a small performance boost by emitting this instead: jnCC L2 L1: ... L2: ... This testcase shows an example of this: float func(float x, float y) { double product = (double)x * y; if (product == 0.0) return product; return product - 1.0; } llvm-svn: 101075	2010-04-12 22:19:57 +00:00
Chris Lattner	80b41881bc	rename llvm::llvm_report_error -> llvm::report_fatal_error llvm-svn: 100709	2010-04-07 22:58:41 +00:00
Dale Johannesen	8691bcbc2a	Educate GetInstrSizeInBytes implementations that DBG_VALUE does not generate code. llvm-svn: 100681	2010-04-07 19:51:44 +00:00
Jakob Stoklund Olesen	fcc2dfa1b4	Properly enable load clustering. Operand 2 on a load instruction does not have to be a RegisterSDNode for this to work. llvm-svn: 100497	2010-04-05 23:48:02 +00:00
Chris Lattner	58b7cca257	use DebugLoc default ctor instead of DebugLoc::getUnknownLoc() llvm-svn: 100214	2010-04-02 20:16:16 +00:00
Dale Johannesen	5b35f2ee86	Teach AnalyzeBranch, RemoveBranch and the branch folder to be tolerant of debug info following the branch(es) at the end of a block. llvm-svn: 100168	2010-04-02 01:38:09 +00:00
Jakob Stoklund Olesen	58296f9543	Replace V_SET0 with variants for each SSE execution domain. llvm-svn: 99975	2010-03-31 00:40:13 +00:00
Jakob Stoklund Olesen	1768b3d4a6	Renumber SSE execution domains for better code size. SSEDomainFix will collapse to the domain with the lower number when it has a choice. The SSEPackedSingle domain often has smaller instructions, so prefer that. llvm-svn: 99952	2010-03-30 22:46:53 +00:00
Eric Christopher	4c3a3208e3	Remove the pmulld intrinsic and autoupdate it as a vector multiply. Rewrite the pmulld patterns, and make sure that they fold in loads of arguments into the instruction. llvm-svn: 99910	2010-03-30 18:49:01 +00:00
Jakob Stoklund Olesen	d0d432022a	Basic implementation of SSEDomainFix pass. Cross-block inference is primitive and wrong, but the pass is working otherwise. llvm-svn: 99848	2010-03-29 23:24:21 +00:00
Jakob Stoklund Olesen	5ca19faccc	Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register in a different domain than where it was defined. Some instructions have equvivalents for different domains, like por/orps/orpd. The SSEDomainFix pass tries to minimize the number of domain crossings by changing between equvivalent opcodes where possible. This is a work in progress, in particular the pass doesn't do anything yet. SSE instructions are tagged with their execution domain in TableGen using the last two bits of TSFlags. Note that not all instructions are tagged correctly. Life just isn't that simple. The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline issue handled by NEONMoveFixPass. This pass may become target independent to handle both. llvm-svn: 99524	2010-03-25 17:25:00 +00:00
Jakob Stoklund Olesen	1dba4a4389	Revert "Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings." This reverts commit 99345. It was breaking buildbots. llvm-svn: 99352	2010-03-23 23:48:51 +00:00
Jakob Stoklund Olesen	9df76f18b4	Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. This is work in progress. So far, SSE execution domain tables are added to X86InstrInfo, and a skeleton pass is enabled with -sse-domain-fix. llvm-svn: 99345	2010-03-23 23:14:44 +00:00
Evan Cheng	e6099e3382	Teach isSafeToClobberEFLAGS to ignore dbg_value's. We need a MachineBasicBlock::iterator that does this automatically? llvm-svn: 99320	2010-03-23 20:35:45 +00:00
Evan Cheng	7d8c39bb1c	Do not force indirect tailcall through fixed registers: eax, r11. Add support to allow loads to be folded to tail call instructions. llvm-svn: 98465	2010-03-14 03:48:46 +00:00
Dan Gohman	18aa7d328e	Don't try to fold V_SET0 and V_SETALLONES to loads in medium and large code models. llvm-svn: 98042	2010-03-09 03:01:40 +00:00
Bill Wendling	a92a5b8a6c	Revert r97766. It's deleting a tag. llvm-svn: 97768	2010-03-05 00:33:59 +00:00
Bill Wendling	55b4dfcd9d	Micro-optimization: This code: float floatingPointComparison(float x, float y) { double product = (double)x * y; if (product == 0.0) return product; return product - 1.0; } produces this: _floatingPointComparison: 0000000000000000 cvtss2sd %xmm1,%xmm1 0000000000000004 cvtss2sd %xmm0,%xmm0 0000000000000008 mulsd %xmm1,%xmm0 000000000000000c pxor %xmm1,%xmm1 0000000000000010 ucomisd %xmm1,%xmm0 0000000000000014 jne 0x00000004 0000000000000016 jp 0x00000002 0000000000000018 jmp 0x00000008 000000000000001a addsd 0x00000006(%rip),%xmm0 0000000000000022 cvtsd2ss %xmm0,%xmm0 0000000000000026 ret The "jne/jp/jmp" sequence can be reduced to this instead: _floatingPointComparison: 0000000000000000 cvtss2sd %xmm1,%xmm1 0000000000000004 cvtss2sd %xmm0,%xmm0 0000000000000008 mulsd %xmm1,%xmm0 000000000000000c pxor %xmm1,%xmm1 0000000000000010 ucomisd %xmm1,%xmm0 0000000000000014 jp 0x00000002 0000000000000016 je 0x00000008 0000000000000018 addsd 0x00000006(%rip),%xmm0 0000000000000020 cvtsd2ss %xmm0,%xmm0 0000000000000024 ret for a savings of 2 bytes. This xform can happen when we recognize that jne and jp jump to the same "true" MBB, the unconditional jump would jump to the "false" MBB, and the "true" branch is the fall-through MBB. llvm-svn: 97766	2010-03-05 00:24:26 +00:00
Dan Gohman	3177cfd002	Implement XMM subregs. Extracting the low element of a vector is now done with EXTRACT_SUBREG, and the zero-extension performed by load movss is now modeled with SUBREG_TO_REG, and so on. Register-to-register movss and movsd are no longer considered copies; they are two-address instructions which insert a scalar into a vector. llvm-svn: 97354	2010-02-28 00:17:42 +00:00
Dan Gohman	e382448355	movl is a cheaper way to materialize 0 without clobbering EFLAGS than movabsq. llvm-svn: 97227	2010-02-26 16:49:27 +00:00
Dan Gohman	733388cdcf	Fix a typo in a comment. llvm-svn: 96778	2010-02-22 04:09:26 +00:00
Chris Lattner	c3dbd2e2fc	add a bunch of mod/rm encoding types for fixed mod/rm bytes. This will work better for the disassembler for modeling things like lfence/monitor/vmcall etc. llvm-svn: 95960	2010-02-12 02:06:33 +00:00
Chris Lattner	d69e1c1bb2	refactor the conditional jump instructions in the .td file to use a multipattern that generates both the 1-byte and 4-byte versions from the same defm llvm-svn: 95901	2010-02-11 19:25:55 +00:00
Chris Lattner	7acf9be6c4	move target-independent opcodes out of TargetInstrInfo into TargetOpcodes.h. #include the new TargetOpcodes.h into MachineInstr. Add new inline accessors (like isPHI()) to MachineInstr, and start using them throughout the codebase. llvm-svn: 95687	2010-02-09 19:54:29 +00:00
Chris Lattner	8db3500688	port X86InstrInfo::determineREX over to the new encoder. llvm-svn: 95440	2010-02-05 22:10:22 +00:00
Chris Lattner	a6d9d45f42	move functions for decoding X86II values into the X86II namespace. llvm-svn: 95410	2010-02-05 19:24:13 +00:00
Chris Lattner	d43a7714c9	change getSizeOfImm and getBaseOpcodeFor to just take TSFlags directly instead of a TargetInstrDesc. llvm-svn: 95405	2010-02-05 19:16:26 +00:00
Dale Johannesen	e26033402e	use findDebugLoc in more places. llvm-svn: 94477	2010-01-26 00:03:12 +00:00
Evan Cheng	1093210d30	Be more conservative with clustering f32 / f64 loads. llvm-svn: 94254	2010-01-22 23:49:11 +00:00
Evan Cheng	8e25244a48	Add two target hooks to determine whether two loads are near and should be scheduled together. llvm-svn: 94147	2010-01-22 03:34:51 +00:00
Evan Cheng	6133019b90	Fix a minor issue in x86 load / store folding table. movups does an unaligned load so it doesn't require 16-byte alignment. llvm-svn: 94058	2010-01-21 00:55:14 +00:00
Dale Johannesen	e6672a5bda	make findDebugLoc a class method llvm-svn: 94032	2010-01-20 21:36:02 +00:00
Dale Johannesen	ac7865b75b	Move findDebugLoc somewhere more central. Fix more cases where debug declarations affect debug line info. llvm-svn: 93953	2010-01-20 00:19:24 +00:00
Jim Grosbach	034f69e0aa	For aligned load/store instructions, it's only required to know whether a function can support dynamic stack realignment. That's a much easier question to answer at instruction selection stage than whether the function actually will have dynamic alignment prologue. This allows the removal of the stack alignment heuristic pass, and improves code quality for cases where the heuristic would result in dynamic alignment code being generated when it was not strictly necessary. llvm-svn: 93885	2010-01-19 18:31:11 +00:00
Evan Cheng	98af245e5f	For now, avoid issuing extract_subreg to reuse lower 8-bit, it's not safe in 32-bit. llvm-svn: 93307	2010-01-13 08:01:32 +00:00
Evan Cheng	76db3bb18e	Add a quick pass to optimize sign / zero extension instructions. For targets where the pre-extension values are available in the subreg of the result of the extension, replace the uses of the pre-extension value with the result + extract_subreg. For now, this pass is fairly conservative. It only perform the replacement when both the pre- and post- extension values are used in the block. It will miss cases where the post-extension values are live, but not used. llvm-svn: 93278	2010-01-13 00:30:23 +00:00
Dan Gohman	51b3e804dc	Reapply the MOV64r0 patch, with a fix: MOV64r0 clobbers EFLAGS. llvm-svn: 93229	2010-01-12 04:42:54 +00:00
Evan Cheng	e5b545fd60	Add TargetInstrInfo::isCoalescableInstr. It returns true if the specified instruction is copy like where the source and destination registers can overlap. This is to be used by the coalescable to coalesce the source and destination registers of instructions like X86::MOVSX64rr32. Apparently some crazy people believe the coalescer is too simple. llvm-svn: 93210	2010-01-12 00:09:37 +00:00
Evan Cheng	bc84a42d7b	Revert 93158. It's breaking quite a few x86_64 tests. llvm-svn: 93185	2010-01-11 21:13:41 +00:00
Dan Gohman	5b79391087	Re-instate MOV64r0 and MOV16r0, with adjustments to work with the new AsmPrinter. This is perhaps less elegant than describing them in terms of MOV32r0 and subreg operations, but it allows the current register to rematerialize them. llvm-svn: 93158	2010-01-11 17:37:57 +00:00
David Greene	05203dccf1	Change errs() to dbgs(). llvm-svn: 92653	2010-01-05 01:29:29 +00:00
Bill Wendling	b96be04b8a	Remove dead variable. llvm-svn: 92184	2009-12-28 01:36:02 +00:00
Chris Lattner	d7e8bd73fe	completely eliminate the MOV16r0 'instruction'. The only interesting part of this is the divrem changes, which are already tested by CodeGen/X86/divrem.ll. llvm-svn: 91975	2009-12-23 01:45:04 +00:00
Evan Cheng	7cd6bfe549	Remove target attribute break-sse-dep. Instead, do not fold load into sse partial update instructions unless optimizing for size. llvm-svn: 91910	2009-12-22 17:47:23 +00:00
Evan Cheng	d97d025eba	On recent Intel u-arch's, folding loads into some unary SSE instructions can be non-optimal. To be precise, we should avoid folding loads if the instructions only update part of the destination register, and the non-updated part is not needed. e.g. cvtss2sd, sqrtss. Unfolding the load from these instructions breaks the partial register dependency and it can improve performance. e.g. movss (%rdi), %xmm0 cvtss2sd %xmm0, %xmm0 instead of cvtss2sd (%rdi), %xmm0 An alternative method to break dependency is to clear the register first. e.g. xorps %xmm0, %xmm0 cvtss2sd (%rdi), %xmm0 llvm-svn: 91672	2009-12-18 07:40:29 +00:00
Sean Callanan	06b6feb2e1	Instruction fixes, added instructions, and AsmString changes in the X86 instruction tables. Also (while I was at it) cleaned up the X86 tables, removing tabs and 80-line violations. This patch was reviewed by Chris Lattner, but please let me know if there are any problems. * X86.td Removed tabs and fixed 80-line violations X86Instr64bit.td (IRET, POPCNT, BT_, LSL, SWPGS, PUSH_S, POP_S, L_S, SMSW) Added (CALL, CMOV) Added qualifiers (JMP) Added PC-relative jump instruction (POPFQ/PUSHFQ) Added qualifiers; renamed PUSHFQ to indicate that it is 64-bit only (ambiguous since it has no REX prefix) (MOV) Added rr form going the other way, which is encoded differently (MOV) Changed immediates to offsets, which is more correct; also fixed MOV64o64a to have to a 64-bit offset (MOV) Fixed qualifiers (MOV) Added debug-register and condition-register moves (MOVZX) Added more forms (ADC, SUB, SBB, AND, OR, XOR) Added reverse forms, which (as with MOV) are encoded differently (ROL) Made REX.W required (BT) Uncommented mr form for disassembly only (CVT__2__) Added several missing non-intrinsic forms (LXADD, XCHG) Reordered operands to make more sense for MRMSrcMem (XCHG) Added register-to-register forms (XADD, CMPXCHG, XCHG) Added non-locked forms * X86InstrSSE.td (CVTSS2SI, COMISS, CVTTPS2DQ, CVTPS2PD, CVTPD2PS, MOVQ) Added * X86InstrFPStack.td (COM_FST0, COMP_FST0, COM_FI, COM_FIP, FFREE, FNCLEX, FNOP, FXAM, FLDL2T, FLDL2E, FLDPI, FLDLG2, FLDLN2, F2XM1, FYL2X, FPTAN, FPATAN, FXTRACT, FPREM1, FDECSTP, FINCSTP, FPREM, FYL2XP1, FSINCOS, FRNDINT, FSCALE, FCOMPP, FXSAVE, FXRSTOR) Added (FCOM, FCOMP) Added qualifiers (FSTENV, FSAVE, FSTSW) Fixed opcode names (FNSTSW) Added implicit register operand * X86InstrInfo.td (opaque512mem) Added for FXSAVE/FXRSTOR (offset8, offset16, offset32, offset64) Added for MOV (NOOPW, IRET, POPCNT, IN, BTC, BTR, BTS, LSL, INVLPG, STR, LTR, PUSHFS, PUSHGS, POPFS, POPGS, LDS, LSS, LES, LFS, LGS, VERR, VERW, SGDT, SIDT, SLDT, LGDT, LIDT, LLDT, LODSD, OUTSB, OUTSW, OUTSD, HLT, RSM, FNINIT, CLC, STC, CLI, STI, CLD, STD, CMC, CLTS, XLAT, WRMSR, RDMSR, RDPMC, SMSW, LMSW, CPUID, INVD, WBINVD, INVEPT, INVVPID, VMCALL, VMCLEAR, VMLAUNCH, VMRESUME, VMPTRLD, VMPTRST, VMREAD, VMWRITE, VMXOFF, VMXON) Added (NOOPL, POPF, POPFD, PUSHF, PUSHFD) Added qualifier (JO, JNO, JB, JAE, JE, JNE, JBE, JA, JS, JNS, JP, JNP, JL, JGE, JLE, JG, JCXZ) Added 32-bit forms (MOV) Changed some immediate forms to offset forms (MOV) Added reversed reg-reg forms, which are encoded differently (MOV) Added debug-register and condition-register moves (CMOV) Added qualifiers (AND, OR, XOR, ADC, SUB, SBB) Added reverse forms, like MOV (BT) Uncommented memory-register forms for disassembler (MOVSX, MOVZX) Added forms (XCHG, LXADD) Made operand order make sense for MRMSrcMem (XCHG) Added register-register forms (XADD, CMPXCHG) Added unlocked forms * X86InstrMMX.td (MMX_MOVD, MMV_MOVQ) Added forms * X86InstrInfo.cpp: Changed PUSHFQ to PUSHFQ64 to reflect table change * X86RegisterInfo.td: Added debug and condition register sets * x86-64-pic-3.ll: Fixed testcase to reflect call qualifier * peep-test-3.ll: Fixed testcase to reflect test qualifier * cmov.ll: Fixed testcase to reflect cmov qualifier * loop-blocks.ll: Fixed testcase to reflect call qualifier * x86-64-pic-11.ll: Fixed testcase to reflect call qualifier * 2009-11-04-SubregCoalescingBug.ll: Fixed testcase to reflect call qualifier * x86-64-pic-2.ll: Fixed testcase to reflect call qualifier * live-out-reg-info.ll: Fixed testcase to reflect test qualifier * tail-opts.ll: Fixed testcase to reflect call qualifiers * x86-64-pic-10.ll: Fixed testcase to reflect call qualifier * bss-pagealigned.ll: Fixed testcase to reflect call qualifier * x86-64-pic-1.ll: Fixed testcase to reflect call qualifier * widen_load-1.ll: Fixed testcase to reflect call qualifier llvm-svn: 91638	2009-12-18 00:01:26 +00:00
Bill Wendling	e4328758f9	Whitespace changes, comment clarification. No functional changes. llvm-svn: 91274	2009-12-14 06:51:19 +00:00
Evan Cheng	ee5b5917fd	Disable r91104 for x86. It causes partial register stall which pessimize code in 32-bit. llvm-svn: 91223	2009-12-12 20:03:14 +00:00
Evan Cheng	5cd8cd2a5c	Add comment about potential partial register stall. llvm-svn: 91220	2009-12-12 18:55:26 +00:00
Evan Cheng	01a56041a5	Add support to 3-addressify 16-bit instructions. llvm-svn: 91104	2009-12-11 06:01:48 +00:00
Dan Gohman	f9654e9258	Remove the target hook TargetInstrInfo::BlockHasNoFallThrough in favor of MachineBasicBlock::canFallThrough(), which is target-independent and more thorough. llvm-svn: 90634	2009-12-05 00:44:40 +00:00
David Greene	755852d0c3	Remove an unneeded include. llvm-svn: 90625	2009-12-04 23:55:07 +00:00
David Greene	cb0611ec3b	Have hasLoad/StoreFrom/ToStackSlot return the relevant MachineMemOperand. llvm-svn: 90608	2009-12-04 22:38:46 +00:00
Chris Lattner	9ce833945e	improve portability to avoid conflicting with std::next in c++'0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Dan Gohman	b5ec39e2dc	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Evan Cheng	d7cf6167f1	Re-apply 89011. It's not to be blamed. llvm-svn: 89081	2009-11-17 09:51:18 +00:00
Evan Cheng	52159ba00a	Revert 89011. Buildbot thinks it might be breaking stuff. llvm-svn: 89076	2009-11-17 09:20:28 +00:00
Evan Cheng	382a91041b	A few more instructions that should be marked re-materializable. llvm-svn: 89011	2009-11-17 00:23:22 +00:00
Evan Cheng	78be20d62e	- Check memoperand alignment instead of checking stack alignment. Most load / store folding instructions are not referencing spill stack slots. - Mark MOVUPSrm re-materializable. llvm-svn: 88974	2009-11-16 21:56:03 +00:00
Evan Cheng	9b46e74f42	- Change TargetInstrInfo::reMaterialize to pass in TargetRegisterInfo. - If destination is a physical register and it has a subreg index, use the sub-register instead. This fixes PR5423. llvm-svn: 88745	2009-11-14 02:55:43 +00:00
David Greene	1a5969d74c	Fix a bootstrap failure. Provide special isLoadFromStackSlotPostFE and isStoreToStackSlotPostFE interfaces to explicitly request checking for post-frame ptr elimination operands. This uses a heuristic so it isn't reliable for correctness. llvm-svn: 87047	2009-11-13 00:29:53 +00:00
David Greene	ea251ed2b9	Add hasLoadFromStackSlot and hasStoreToStackSlot to return whether a machine instruction loads or stores from/to a stack slot. Unlike isLoadFromStackSlot and isStoreFromStackSlot, the instruction may be something other than a pure load/store (e.g. it may be an arithmetic operation with a memory operand). This helps AsmPrinter determine when to print a spill/reload comment. This is only a hint since we may not be able to figure this out in all cases. As such, it should not be relied upon for correctness. Implement for X86. Return false by default for other architectures. llvm-svn: 87026	2009-11-12 20:55:29 +00:00
Jeffrey Yasskin	23ac706aab	Fix DenseMap iterator constness. This patch forbids implicit conversion of DenseMap::const_iterator to DenseMap::iterator which was possible because DenseMapIterator inherited (publicly) from DenseMapConstIterator. Conversion the other way around is now allowed as one may expect. The template DenseMapConstIterator is removed and the template parameter IsConst which specifies whether the iterator is constant is added to DenseMapIterator. Actually IsConst parameter is not necessary since the constness can be determined from KeyT but this is not relevant to the fix and can be addressed later. Patch by Victor Zverovich! llvm-svn: 86636	2009-11-10 01:02:17 +00:00
Dan Gohman	ad6c6a3d33	Fix MachineLICM to use the correct virtual register class when unfolding loads for hoisting. getOpcodeAfterMemoryUnfold returns the opcode of the original operation without the load, not the load itself, MachineLICM needs to know the operand index in order to get the correct register class. Extend getOpcodeAfterMemoryUnfold to return this information. llvm-svn: 85622	2009-10-30 22:18:41 +00:00
Dan Gohman	76221cc874	Make isSafeToClobberEFLAGS more aggressive. Teach it to scan backwards (for uses marked kill and defs marked dead) a few instructions in addition to forwards. Also, increase the maximum number of instructions to scan, as it appears to help in a fair number of cases. llvm-svn: 84061	2009-10-14 00:08:59 +00:00
Dan Gohman	84a61978de	Remove a no-longer-necessary #include. llvm-svn: 83697	2009-10-10 00:36:09 +00:00
Dan Gohman	177b8de981	Replace X86's CanRematLoadWithDispOperand by calling the target-independent MachineInstr::isInvariantLoad instead, which has the benefit of being more complete. llvm-svn: 83696	2009-10-10 00:34:18 +00:00
Dan Gohman	4fe1a982ed	Add basic infrastructure and x86 support for preserving MachineMemOperand information when unfolding memory references. llvm-svn: 83656	2009-10-09 18:10:05 +00:00
Dan Gohman	b95136e649	Replace TargetInstrInfo::isInvariantLoad and its target-specific implementations with a new MachineInstr::isInvariantLoad, which uses MachineMemOperands and is target-independent. This brings MachineLICM and other functionality to targets which previously lacked an isInvariantLoad implementation. llvm-svn: 83475	2009-10-07 17:38:06 +00:00
Jakob Stoklund Olesen	31fcbdefbb	Introduce the TargetInstrInfo::KILL machine instruction and get rid of the unused DECLARE instruction. KILL is not yet used anywhere, it will replace TargetInstrInfo::IMPLICIT_DEF in the places where IMPLICIT_DEF is just used to alter liveness of physical registers. llvm-svn: 83006	2009-09-28 20:32:26 +00:00
Dan Gohman	0ac693a89e	Improve MachineMemOperand handling. - Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions. This eliminates MachineInstr's std::list member and allows the data to be created by isel and live for the remainder of codegen, avoiding a lot of copying and unnecessary translation. This also shrinks MemSDNode. - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated fields for MachineMemOperands. - Change MemSDNode to have a MachineMemOperand member instead of its own fields with the same information. This introduces some redundancy, but it's more consistent with what MachineInstr will eventually want. - Ignore alignment when searching for redundant loads for CSE, but remember the greatest alignment. Target-specific code which previously used MemOperandSDNodes with generic SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range so that the SelectionDAG framework knows that MachineMemOperand information is available. llvm-svn: 82794	2009-09-25 20:36:54 +00:00
Dan Gohman	0c4e55a2f8	Rename getTargetNode to getMachineNode, for consistency with the naming scheme used in SelectionDAG, where there are multiple kinds of "target" nodes, but "machine" nodes are nodes which represent a MachineInstr. llvm-svn: 82790	2009-09-25 18:54:59 +00:00
Dan Gohman	e7e623cd25	Fix X86's unfoldMemoryOperand to properly handle MachineMemOperands. llvm-svn: 82597	2009-09-23 01:29:41 +00:00
Dan Gohman	7a076e642e	Add support for rematerializing FsFLD0SS and FsFLD0SD as constant-pool loads in order to reduce register pressure. llvm-svn: 82470	2009-09-21 18:30:38 +00:00
Evan Cheng	c3ef5b0802	Follow up to 81494. When the folded reload is narrowed to a 32-bit load then change the destination register to a 32-bit one or add a sub-register index. llvm-svn: 81496	2009-09-11 01:01:31 +00:00
Evan Cheng	93831c07de	It's not legal to fold a load from a narrower stack slot into a wider instruction. If done, the instruction does a 64-bit load and that's not safe. This can happen we a subreg_to_reg 0 has been coalesced. One exception is when the instruction that folds the load is a move, then we can simply turn it into a 32-bit load from the stack slot. rdar://7170444 llvm-svn: 81494	2009-09-11 00:39:26 +00:00
Daniel Dunbar	9872eb764c	Remove Offset from ExternalSybmol MachineOperands, this is unused (and at least partly unsupported, in X86 encoding at least). llvm-svn: 80726	2009-09-01 22:06:46 +00:00
Anton Korobeynikov	8595d8549f	Short-term workaround for frame-related weirdness on win64. Some other minor win64 fixes as well. Patch by Michael Beck! llvm-svn: 80370	2009-08-28 16:06:41 +00:00
Chris Lattner	db2965c71f	remove various std::ostream version of printing methods from MachineInstr and MachineOperand. This required eliminating a bunch of stuff that was using DOUT, I hope that bill doesn't mind me stealing his fun. ;-) llvm-svn: 79813	2009-08-23 03:41:05 +00:00
Chris Lattner	5d8af49626	Rename TargetAsmInfo (and its subclasses) to MCAsmInfo. llvm-svn: 79763	2009-08-22 20:48:53 +00:00
Devang Patel	c071d6c1b4	Record variable debug info at ISel time directly. llvm-svn: 79742	2009-08-22 17:12:53 +00:00
Owen Anderson	9df206d02d	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
Owen Anderson	48f2f0ae72	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Dan Gohman	9fbdb3de0d	Simplify this code. The case where one class is GR64RegClass and the other is a subclass of it is effectively handled by the prior tests. llvm-svn: 78676	2009-08-11 15:59:48 +00:00
Owen Anderson	b4bce99769	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Eric Christopher	40031ed766	Add crc32 instruction and intrinsics. Add a new class of prefix bytes for F2 0F 38 and propagate. Add a FIXME for a set of possibilities which correspond to intrinsics already used. New test. llvm-svn: 78508	2009-08-08 21:55:08 +00:00
Dan Gohman	2bce18dae9	Use GR32 for copies between GR32_NOSP and GR32_NOREX, as neither is a subset of the other, but both are subsets of GR32. llvm-svn: 78250	2009-08-05 22:18:26 +00:00
Dan Gohman	892fcb1d57	hasSuperClass tests for a strict superset relation, rather than a superset relation. This code wants to test the regular superset relation. llvm-svn: 78236	2009-08-05 20:13:45 +00:00
Chris Lattner	c388490738	Move the getInlineAsmLength virtual method from TAI to TII, where the only real caller (GetFunctionSizeInBytes) uses it. The custom ARM implementation of this is basically reimplementing an assembler poorly for negligible gain. It should be removed IMNSHO, but I'll leave that to ARMish folks to decide. llvm-svn: 77877	2009-08-02 05:20:37 +00:00
Owen Anderson	1dc40e205b	Move a few more APIs back to 2.5 forms. The only remaining ones left to change back are metadata related, which I'm waiting on to avoid conflicting with Devang. llvm-svn: 77721	2009-07-31 20:28:14 +00:00
Dan Gohman	3c7e8160f6	Add a new register class to describe operands that can't be SP, due to x86 encoding restrictions. This is currently off by default because it may cause code quality regressions. This is for PR4572. llvm-svn: 77565	2009-07-30 01:56:29 +00:00
Chris Lattner	6c284cc8cd	1. Introduce a new TargetOperandInfo::getRegClass() helper method and convert code to using it, instead of having lots of things poke the isLookupPtrRegClass() method directly. 2. Make PointerLikeRegClass contain a 'kind' int, and store it in the existing regclass field of TargetOperandInfo when the isLookupPtrRegClass() predicate is set. Make getRegClass pass this into TargetRegisterInfo::getPointerRegClass(), allowing targets to have multiple ptr_rc things. llvm-svn: 77504	2009-07-29 21:10:12 +00:00
Owen Anderson	cc287b28c9	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Jakob Stoklund Olesen	c438f0ecfa	Silence warning in Linux builds: X86InstrInfo.cpp:2272: warning: suggest explicit braces to avoid ambiguous 'else' llvm-svn: 76105	2009-07-16 21:24:13 +00:00
Evan Cheng	39e5f6205a	With recent MC changes, RIP base register is explicitly modeled. Make sure we add it when x86 V_SET0 / V_SETALLONES (by transforming it into a constpool load) into the use instruction. llvm-svn: 76094	2009-07-16 18:44:05 +00:00
Evan Cheng	7a6b20df7f	Let callers decide the sub-register index on the def operand of rematerialized instructions. Avoid remat'ing instructions whose def have sub-register indices for now. It's just really really hard to get all the cases right. llvm-svn: 75900	2009-07-16 09:20:10 +00:00
Evan Cheng	6408cda38d	Move load / store folding alignment require into the table(s). llvm-svn: 75749	2009-07-15 06:10:07 +00:00
Chris Lattner	290c415b94	reapply r75408, which eliminates MOV64r0 in favor of using MOV32r0 + subregs to do the same thing. This should work now that PR4544 is fixed. Thanks Evan! llvm-svn: 75671	2009-07-14 20:19:57 +00:00
Torok Edwin	f955a6ef49	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Owen Anderson	1e5155161a	Move more functionality over to LLVMContext. llvm-svn: 75497	2009-07-13 20:58:05 +00:00
Owen Anderson	393d8b0a0c	Begin the painful process of tearing apart the rat'ss nest that is Constants.cpp and ConstantFold.cpp. This involves temporarily hard wiring some parts to use the global context. This isn't ideal, but it's the only way I could figure out to make this process vaguely incremental. llvm-svn: 75445	2009-07-13 04:09:18 +00:00
Bill Wendling	dcf4c8e237	Temporarily revert r75408. It appears to break the Apple-style builds: x86_64-apple-darwin10-gcc -c -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -pedantic -Wno-long-long -Wno-variadic-macros -Wno-overlength-strings -Wold-style-definition -Wmissing-format-attribute -mdynamic-no-pic -DHAVE_CONFIG_H -I. -I. -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/. -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/../include -I./../intl -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/../libcpp/include -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmCore.roots/llvmCore~dst/Developer/usr/local/include -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmCore.roots/llvmCore~obj/src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmCore.roots/llvmCore~dst/Developer/usr/local/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -DLLVM_VERSION_INFO='"9999"' -DBUILD_LLVM_APPLE_STYLE /Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/tree-ssa-alias.c -o tree-ssa-alias.o /var/tmp//ccJQ2JBT.s:4134:Incorrect register `%rcx' used with `l' suffix make[2]: * [tree-ssa-live.o] Error 1 make[2]: * Waiting for unfinished jobs.... llvm-svn: 75412	2009-07-12 02:49:22 +00:00
Chris Lattner	83effafb6b	eliminate MOV64r0 in favor of a Pat<> pattern. This is only nontrivial because the div lowering code explicitly references it. llvm-svn: 75408	2009-07-12 00:47:55 +00:00
Torok Edwin	ae8a3ff177	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Evan Cheng	94967abfe3	Undo my brain cramp. llvm-svn: 75290	2009-07-10 21:31:42 +00:00
Chris Lattner	933108f921	some minor simplifications. llvm-svn: 75274	2009-07-10 20:53:38 +00:00
Evan Cheng	7f57128609	CMOVxx doesn't swap operands which it's commuted. llvm-svn: 75266	2009-07-10 19:26:57 +00:00
Chris Lattner	4e8de888f2	change isGlobalStubReference to take target flags instead of a MachineOperand. llvm-svn: 75236	2009-07-10 06:29:59 +00:00
Chris Lattner	c4e0e9c987	convert some late code (called by regalloc and code emission) to use isGlobalStubReference instead of GVRequiresExtraLoad (which should really be part of isel). llvm-svn: 75234	2009-07-10 06:07:08 +00:00
Chris Lattner	41fccd30b7	GVRequiresExtraLoad is now never used for calls, simplify it based on this. llvm-svn: 75232	2009-07-10 05:52:02 +00:00
Evan Cheng	4f87295872	Targets sometimes assign fixed stack object to spill certain callee-saved registers based on dynamic conditions. For example, X86 EBP/RBP, when used as frame register has to be spilled in the first fixed object. It should inform PEI this so it doesn't get allocated another stack object. Also, it should not be spilled as other callee-saved registers but rather its spilling and restoring are being handled by emitPrologue and emitEpilogue. Avoid spilling it twice. llvm-svn: 75116	2009-07-09 06:53:48 +00:00
Chris Lattner	76adfe755d	simplify some code based on the fact that picstyles != none are only valid in pic or dynamic-no-pic mode. Also, x86-64 never used picstylegot. llvm-svn: 75101	2009-07-09 04:39:06 +00:00
Torok Edwin	ad3be984b7	Start converting to new error handling API. cerr+abort -> llvm_report_error assert(0)+abort -> LLVM_UNREACHABLE (assert(0)+llvm_unreachable-> abort() included) llvm-svn: 75018	2009-07-08 18:01:40 +00:00
Evan Cheng	c6c942b70f	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile to having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Chris Lattner	e711b85035	factor some logic out into a helper function, allow remat of loads from constant globals. This implements remat-constant.ll even without aggressive-remat. llvm-svn: 74373	2009-06-27 04:38:55 +00:00
Chris Lattner	19eb0dad26	Reimplement rip-relative addressing in the X86-64 backend. The new implementation primarily differs from the former in that the asmprinter doesn't make a zillion decisions about whether or not something will be RIP relative or not. Instead, those decisions are made by isel lowering and propagated through to the asm printer. To achieve this, we: 1. Represent RIP relative addresses by setting the base of the X86 addr mode to X86::RIP. 2. When ISel Lowering decides that it is safe to use RIP, it lowers to X86ISD::WrapperRIP. When it is unsafe to use RIP, it lowers to X86ISD::Wrapper as before. 3. This removes isRIPRel from X86ISelAddressMode, representing it with a basereg of RIP instead. 4. The addressing mode matching logic in isel is greatly simplified. 5. The asmprinter is greatly simplified, notably the "NotRIPRel" predicate passed through various printoperand routines is gone now. 6. The various symbol printing routines in asmprinter now no longer infer when to emit (%rip), they just print the symbol. I think this is a big improvement over the previous situation. It does have two small caveats though: 1. I implemented a horrible "no-rip" modifier for the inline asm "P" constraint modifier. This is a short term hack, there is a much better, but more involved, solution. 2. I had to xfail an -aggressive-remat testcase because it isn't handling the use of RIP in the constant-pool reading instruction. This specific test is easy to fix without -aggressive-remat, which I intend to do next. llvm-svn: 74372	2009-06-27 04:16:01 +00:00
Chris Lattner	8c562c2f76	Use target-specific machine operand flags to eliminate a gross hack from the asmprinter. llvm-svn: 74184	2009-06-25 17:38:33 +00:00
Chris Lattner	f70f8cc399	just eliminate the code entirely! llvm-svn: 74183	2009-06-25 17:28:07 +00:00
Eli Friedman	11070e275f	PR3739, part 2: Use an explicit store to spill XMM registers. (Previously, the code tried to use "push", which doesn't exist for XMM registers.) llvm-svn: 72836	2009-06-04 02:32:04 +00:00
Bill Wendling	dd6cbdb28c	The MONITOR and MWAIT instructions have insufficient information for decoding. Essentially, they both map to the same column in the "opcode extensions for one- and two-byte opcodes" table in the x86 manual. The RawFrm complicates decoding this. Instead, use opcode 0x01, prefix 0x01, and form MRM1r. Then have the code emitter special case these, a la [SML]FENCE. llvm-svn: 72556	2009-05-28 23:40:46 +00:00
Bill Wendling	e421c8f63d	Change MachineInstrBuilder::addReg() to take a flag instead of a list of booleans. This gives a better indication of what the "addReg()" is doing. Remembering what all of those booleans mean isn't easy, especially if you aren't spending all of your time in that code. I took Jakob's suggestion and made it illegal to pass in "true" for the flag. This should hopefully prevent any unintended misuse of this (by reverting to the old way of using addReg()). llvm-svn: 71722	2009-05-13 21:33:08 +00:00
Evan Cheng	96cd1decc6	Avoid unneeded SIB byte encoding. Patch by Zoltan Varga. llvm-svn: 71520	2009-05-12 00:07:35 +00:00
Evan Cheng	2a1d20b0fb	Optimize code placement in loop to eliminate unconditional branches or move unconditional branch to the outside of the loop. e.g. /// A: /// ... /// <fallthrough to B> /// /// B: --> loop header /// ... /// jcc <cond> C, [exit] /// /// C: /// ... /// jmp B /// /// ==> /// /// A: /// ... /// jmp B /// /// C: --> new loop header /// ... /// <fallthough to B> /// /// B: /// ... /// jcc <cond> C, [exit] llvm-svn: 71209	2009-05-08 06:34:09 +00:00
Evan Cheng	138eed76c7	Revert part of 70929 that has to do with determining whether a SIB byte is needed. It causes a lot of x86_64 JIT failures. llvm-svn: 70986	2009-05-05 18:18:57 +00:00
Evan Cheng	6843d3293b	- Avoid the longer SIB encoding on x86_64 when it's not needed. - Synchronize instruction length computation code in X86InstrInfo with code in X86CodeEmitter.cpp Patch by Zoltan Varga. llvm-svn: 70929	2009-05-04 22:49:16 +00:00
Dan Gohman	a241dec2fc	Rename GR8_ABCD to GR8_ABCD_L and create GR8_ABCD_H, and use these to precisely describe the h-register subreg register classes. Thanks to Jakob Stoklund Olesen for spotting this and for the initial patch! Also, make getStoreRegOpcode and getLoadRegOpcode aware of the needs of h registers. llvm-svn: 70211	2009-04-27 16:41:36 +00:00
Dan Gohman	180fa04e35	Rename GR8_, GR16_, GR32_, and GR64_ to GR8_ABCD, GR16_ABCD, GR32_ABCD, and GR64_ABCD, respectively, to help describe them. llvm-svn: 70210	2009-04-27 16:33:14 +00:00
Dan Gohman	de72d5129b	Make X86's copyRegToReg able to handle copies to and from subclasses. This makes the extra copyRegToReg calls in ScheduleDAGSDNodesEmit.cpp unnecessary. Derived from a patch by Jakob Stoklund Olesen. llvm-svn: 69635	2009-04-20 22:54:34 +00:00
Mon P Wang	4db825e615	Fixed a few 64 bit cases in X86InstrInfo::commuteInstruction llvm-svn: 69417	2009-04-18 05:16:01 +00:00
Bill Wendling	0476a0acf3	Recommit r69335 and r69336. These were not causing problems. llvm-svn: 69394	2009-04-17 22:40:38 +00:00
Bill Wendling	073e1c91dd	Revert r69335 and r69336. They were causing build failures. llvm-svn: 69347	2009-04-17 04:19:22 +00:00
Dan Gohman	d254f36e54	MOV8rr_NOREX is a "Move" instruction. This doesn't currently matter, because this instruction isn't generated until after things that care. llvm-svn: 69336	2009-04-17 00:45:17 +00:00
Dan Gohman	2349973ff3	Don't use MOV8rr_NOREX on x86-32. It doesn't actually hurt anything at present, but it's inconsistent. llvm-svn: 69335	2009-04-17 00:43:09 +00:00
Dan Gohman	38bc0faa22	Fix 80-column violations. llvm-svn: 69204	2009-04-15 19:48:57 +00:00
Dan Gohman	a2ec3156eb	Add a folding table entry for MOV8rr_NOREX. llvm-svn: 69203	2009-04-15 19:48:28 +00:00
Dan Gohman	a1fe2a3741	Add a new MOV8rr_NOREX, and make X86's copyRegToReg use it when either the source or destination is a physical h register. This fixes sqlite3 with the post-RA scheduler enabled. llvm-svn: 69111	2009-04-15 00:04:23 +00:00
Dan Gohman	be7227005f	Implement x86 h-register extract support. - Add patterns for h-register extract, which avoids a shift and mask, and in some cases a temporary register. - Add address-mode matching for turning (X>>(8-n))&(255<<n), where n is a valid address-mode scale value, into an h-register extract and a scaled-offset address. - Replace X86's MOV32to32_ and related instructions with the new target-independent COPY_TO_SUBREG instruction. On x86-64 there are complicated constraints on h registers, and CodeGen doesn't currently provide a high-level way to express all of them, so they are handled with a bunch of special code. This code currently only supports extracts where the result is used by a zero-extend or a store, though these are fairly common. These transformations are not always beneficial; since there are only 4 h registers, they sometimes require extra move instructions, and this sometimes increases register pressure because it can force out values that would otherwise be in one of those registers. However, this appears to be relatively uncommon. llvm-svn: 68962	2009-04-13 16:09:41 +00:00
Dan Gohman	65bafadd2b	Fix another hard-coded constant to use X86AddrNumOperands. This unbreaks the JIT on x86-64. llvm-svn: 68948	2009-04-13 15:04:25 +00:00
Chris Lattner	e0d0edaf3f	Fix code size computation on x86-64, patch by Zoltan Varga! llvm-svn: 68690	2009-04-09 06:10:51 +00:00
Rafael Espindola	7eb72dc5f2	Re-apply 68552. Tested by bootstrapping llvm-gcc and using that to build llvm. llvm-svn: 68645	2009-04-08 21:14:34 +00:00
Bill Wendling	6e702cf68c	Temporarily revert r68552. This was causing a failure in the self-hosting LLVM builds. --- Reverse-merging (from foreign repository) r68552 into '.': U test/CodeGen/X86/tls8.ll U test/CodeGen/X86/tls10.ll U test/CodeGen/X86/tls2.ll U test/CodeGen/X86/tls6.ll U lib/Target/X86/X86Instr64bit.td U lib/Target/X86/X86InstrSSE.td U lib/Target/X86/X86InstrInfo.td U lib/Target/X86/X86RegisterInfo.cpp U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86CodeEmitter.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86InstrInfo.h U lib/Target/X86/X86ISelDAGToDAG.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp U lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.h U lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.h U lib/Target/X86/X86ISelLowering.h U lib/Target/X86/X86InstrInfo.cpp U lib/Target/X86/X86InstrBuilder.h U lib/Target/X86/X86RegisterInfo.td llvm-svn: 68560	2009-04-07 22:35:25 +00:00
Rafael Espindola	0324937229	Reduce code duplication on the TLS implementation. This introduces a small regression on the generated code quality in the case we are just computing addresses, not loading values. Will work on it and on X86-64 support. llvm-svn: 68552	2009-04-07 21:37:46 +00:00
Rafael Espindola	37522e768a	Have only one definition of X86AddrNumOperands. llvm-svn: 67949	2009-03-28 18:55:31 +00:00
Rafael Espindola	884992a7e9	Make code a bit less brittle by no hardcoding the number of operands in an address in so many places. llvm-svn: 67945	2009-03-28 17:03:24 +00:00
Rafael Espindola	83ee99d7b0	Avoid hardcoding that X86 addresses have 4 operands. llvm-svn: 67848	2009-03-27 15:57:50 +00:00
Evan Cheng	f9951d1557	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Dan Gohman	f41e54c5af	Correct this comment. llvm-svn: 66057	2009-03-04 19:24:25 +00:00
Dan Gohman	04453ca36c	When using MachineInstr operand indices on SDNodes, the number of MachineInstr def operands must be subtracted out. This bug was uncovered by the recent x86 EFLAGS optimization. Before that, the only instructions that ever needed unfolding were things like CMP32rm, where NumDefs is zero. llvm-svn: 66056	2009-03-04 19:23:38 +00:00
Evan Cheng	9d9688ec15	Do not consider MMX_MOVD64rr a move instructions. The source register is in GR32, the destination is VR64. They are not compatible. llvm-svn: 65273	2009-02-22 08:04:23 +00:00
Dan Gohman	9c258bd2ec	Factor out the code to add a MachineOperand to a MachineInstrBuilder. llvm-svn: 64891	2009-02-18 05:45:50 +00:00
Dale Johannesen	560b03bbcd	Remove non-DebugLoc versions of BuildMI from X86. There were some that might even matter in X86FastISel. llvm-svn: 64437	2009-02-13 02:33:27 +00:00
Dale Johannesen	5a21722625	Eliminate a couple of non-DebugLoc BuildMI variants. Modify callers. llvm-svn: 64409	2009-02-12 23:08:38 +00:00
Bill Wendling	49baa5465c	Propagate DebugLoc info for spiller call-backs. llvm-svn: 64329	2009-02-11 21:51:19 +00:00
Evan Cheng	3b84024598	Implement FpSET_ST1_*. llvm-svn: 64186	2009-02-09 23:32:07 +00:00
Evan Cheng	9dc1507838	Turns out AnalyzeBranch can modify the mbb being analyzed. This is a nasty suprise to some callers, e.g. register coalescer. For now, add an parameter that tells AnalyzeBranch whether it's safe to modify the mbb. A better solution is out there, but I don't have time to deal with it right now. llvm-svn: 64124	2009-02-09 07:14:22 +00:00

... 3 4 5 6 7 ...

698 Commits