llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 06:22:51 +01:00

Author	SHA1	Message	Date
Evan Cheng	6a824a7741	Forgot this in last commit. llvm-svn: 58527	2008-10-31 19:11:09 +00:00
Evan Cheng	afe2deb372	Encode PICADD; some code clean up. llvm-svn: 58526	2008-10-31 19:10:44 +00:00
Bill Wendling	f8f6ed82f1	Revert r58489. It isn't correct for all cases. llvm-svn: 58523	2008-10-31 18:30:19 +00:00
Evan Cheng	168bd3be1b	Change x86 register allocation ordering to match that of gcc. Otherwise some tools get confused by prologue generated by llvm. llvm-svn: 58517	2008-10-31 16:52:57 +00:00
Bill Wendling	0f1f4f8bb1	Don't skip over all "terminator" instructions when determining where to put the callee-saved restore code. It could skip over conditional jumps accidentally. Instead, just skip the "return" instructions. llvm-svn: 58489	2008-10-31 04:00:23 +00:00
Dan Gohman	481e1fd0a6	Use MOVSSmr instead of EXTRACTPSmr in the case of extracting vector element 0 for a store, as it's smaller and faster. llvm-svn: 58483	2008-10-31 00:57:24 +00:00
Evan Cheng	56f4944f9a	I think we got non-machine specific constpool entries covered. llvm-svn: 58474	2008-10-30 23:43:36 +00:00
Duncan Sands	720e5548cb	Shift amounts should have type getShiftAmountTy (i32 for PPC, not i8). Correct this, and some formatting while there. llvm-svn: 58451	2008-10-30 19:28:32 +00:00
Duncan Sands	ae31c14963	Shift amounts should have the type given by getShiftAmountTy (i32 in the case of CellSPU). llvm-svn: 58449	2008-10-30 19:24:28 +00:00
Evan Cheng	6f41528b91	ARM JIT should observe -relocation-model command line option. llvm-svn: 58433	2008-10-30 16:10:54 +00:00
Mon P Wang	d7e34cd378	Add initial support for vector widening. Logic is set to widen for X86. One will only see an effect if legalizetype is not active. Will move support to LegalizeType soon. llvm-svn: 58426	2008-10-30 08:01:45 +00:00
Scott Michel	5b588212d8	Resolve bug 2947: vararg-marked functions must spill registers R3-R79 to stack so that va_start/va_arg/et.al. will walk arguments correctly for Cell SPU. N.B.: Because neither clang nor llvm-gcc-4.2 can be built for CellSPU, this is still unexorcised code. llvm-svn: 58415	2008-10-30 01:51:48 +00:00
Evan Cheng	69c2588244	Correct way to handle CONSTPOOL_ENTRY instructions. llvm-svn: 58409	2008-10-29 23:55:43 +00:00
Evan Cheng	5e8fa6ef36	Add debugging support. llvm-svn: 58408	2008-10-29 23:55:17 +00:00
Nate Begeman	e621f0539e	Fix PEXTRQ encoding llvm-svn: 58403	2008-10-29 23:07:17 +00:00
Dale Johannesen	ff738e8897	Add a RM pseudoreg for the rounding mode, which allows ppcf128->int conversion to work with DeadInstructionElimination. This is now turned off but RM is harmless. It does not do a complete job of modeling the rounding mode. Revert marking MFCR as using all 7 CR subregisters; while correct, this caused the problem in PR 2964, plus the local RA crash noted in the comments. This was needed to make DeadInstructionElimination, but as we are not running that, it is backed out for now. Eventually it should go back in and the other problems fixed where they're broken. llvm-svn: 58391	2008-10-29 18:26:45 +00:00
Jim Grosbach	d735f403a0	Support for constant islands in the ARM JIT. Since the ARM constant pool handling supercedes the standard LLVM constant pool entirely, the JIT emitter does not allocate space for the constants, nor initialize the memory. The constant pool is considered part of the instruction stream. Likewise, when resolving relocations into the constant pool, a hook into the target back end is used to resolve from the constant ID# to the address where the constant is stored. For now, the support in the ARM emitter is limited to 32-bit integer. Future patches will expand this to the full range of constants necessary. llvm-svn: 58338	2008-10-28 18:25:49 +00:00
Duncan Sands	a64641fbd2	Fix darwin ppc llvm-gcc build breakage: intercept ppcf128 to i32 conversion and expand it into a code sequence like in LegalizeDAG. This needs custom ppc lowering of FP_ROUND_INREG, so turn that on and make it work with LegalizeTypes. Probably PPC should simply custom lower the original conversion. llvm-svn: 58329	2008-10-28 15:00:32 +00:00
Chris Lattner	63e92876e0	Fix a nasty miscompilation of 176.gcc on linux/x86 where we synthesized a memset using 16-byte XMM stores, but where the stack realignment code didn't work. Until it does (PR2962) disable use of xmm regs in memcpy and memset formation for linux and other targets with insufficiently aligned stacks. This is part of PR2888 llvm-svn: 58317	2008-10-28 05:49:35 +00:00
David Greene	93f9f0f718	Have TableGen emit setSubgraphColor calls under control of a -gen-debug flag. Then in a debugger developers can set breakpoints at these calls to see waht is about to be selected and what the resulting subgraph looks like. This really helps when debugging instruction selection. llvm-svn: 58278	2008-10-27 21:56:29 +00:00
Evan Cheng	3bcbccf563	For now, don't split live intervals around x87 stack register barriers. FpGET_ST0_80 must be right after a call instruction (and ADJCALLSTACKUP) so we need to find a way to prevent reload of x87 registers between them. llvm-svn: 58230	2008-10-27 07:14:50 +00:00
Dan Gohman	66e878f316	Move the code that adds the DeadMachineInstructionElimPass from target-independent code to target-specific code. This prevents it from running on targets that aren't using fast-isel. In addition to saving compile time, this addresses the problem that not all targets are prepared for it. In order to use this pass, all instructions must declare all their fixed uses and defs of physical registers. llvm-svn: 58144	2008-10-25 17:46:52 +00:00
Nicolas Geoffray	ce30b5caf0	Support for allocation of TLS variables in the JIT. Allocation of a global variable is moved to the execution engine. The JIT calls the TargetJITInfo to allocate thread local storage. Currently, only linux/x86 knows how to allocate thread local global variables. llvm-svn: 58142	2008-10-25 15:41:43 +00:00
Nicolas Geoffray	323dc44a69	Generate code for TLS instructions. llvm-svn: 58141	2008-10-25 15:22:06 +00:00
Oscar Fuentes	51e77b801a	CMake: lib/Target/ARM/AsmPrinter/CMakeLists.txt added. llvm-svn: 58133	2008-10-25 03:40:32 +00:00
Dale Johannesen	20f93b45e7	Mark MFCR as reading all condition code registers. Prevents some more overzealous deletions (mostly in AltiVec code). llvm-svn: 58121	2008-10-24 22:08:01 +00:00
Dale Johannesen	a3d7e7e900	Rewrite logic to figure out whether LR needs to be saved/restored in the prolog/epilog. We need to do this iff something in the function stores into it. llvm-svn: 58116	2008-10-24 21:24:23 +00:00
Torok Edwin	e0ecce06a0	move the note to the correct README llvm-svn: 58104	2008-10-24 19:23:07 +00:00
Torok Edwin	5560590122	add note about va_arg code on x86 and x86-64 llvm-svn: 58103	2008-10-24 19:20:05 +00:00
Duncan Sands	4b148a29ef	Fix translateX86CC: if SetCCOpcode is SETULE and LHS is a foldable load, then LHS and RHS are swapped and SetCCOpcode is changed to SETUGT. But the later code is expecting operands to be the wrong way round for SETUGT, but they are not in this case, resulting in an inverted compare. The solution is to move the load normalization before the correction for SETUGT. This bug was tickled by LegalizeTypes which happened to legalize the testcase slightly differently to LegalizeDAG. llvm-svn: 58092	2008-10-24 13:03:10 +00:00
Dan Gohman	ed90fd3ecf	Fix constant-offset emission for x86-64 absolute addresses. This fixes a bunch of test-suite JIT failures on x86-64 in -relocation-model=static mode. llvm-svn: 58066	2008-10-24 01:57:54 +00:00
Dale Johannesen	b79ddda5bf	Mark defs and uses of CTR and LR correctly. Prevents DeadMachineInstructionElim from thinking things like MTCTR are dead (fixes massive testsuite breakage at -O0). llvm-svn: 58043	2008-10-23 20:41:28 +00:00
Jim Grosbach	a8a40398e8	remove extraneous #ifdef's llvm-svn: 58006	2008-10-22 22:27:51 +00:00
Dale Johannesen	c146b1b281	Remove allocation of unused stack slot. llvm-svn: 57987	2008-10-22 17:26:06 +00:00
Duncan Sands	9d8f7ab614	Get this working with LegalizeTypes: (1) don't assume that i64 has been turned into a BUILD_PAIR node (when called from LegalizeTypes this hasn't happened yet) and don't use a vector shuffle mask with an illegal element type. llvm-svn: 57972	2008-10-22 11:24:12 +00:00
Chris Lattner	cf48fee0c7	Fix PR2907 by digging through constant expressions to find FP constants that are their operands. llvm-svn: 57956	2008-10-22 04:53:16 +00:00
Oscar Fuentes	a932cae97a	CMake: Turned some libraries into partially linked objects. Corrected names of LLVMCore and ARMCodeGen. llvm-svn: 57943	2008-10-22 02:51:53 +00:00
Dale Johannesen	3bd1c1e5cd	Adjust comments for pedantic satisfaction. llvm-svn: 57940	2008-10-22 00:02:32 +00:00
Dale Johannesen	9185d28b4b	Add comments to explain uint64->f64 algorithm, well, sort of. (Algorithm by Ian Ollmann.) llvm-svn: 57932	2008-10-21 23:07:49 +00:00
Dale Johannesen	eb7e2deb1d	Add an SSE2 algorithm for uint64->f64 conversion. The same one Apple gcc uses, faster. Also gets the extreme case in gcc.c-torture/execute/ieee/rbug.c correct which we weren't before; this is not sufficient to get the test to pass though, there is another bug. llvm-svn: 57926	2008-10-21 20:50:01 +00:00
Dan Gohman	34306e122d	Implement the optimized FCMP_OEQ/FCMP_UNE code for x86 fast-isel. llvm-svn: 57915	2008-10-21 18:24:51 +00:00
Jim Grosbach	24a4744d53	use pre-UAL mnemonics for push/pop for compilaton callback function llvm-svn: 57911	2008-10-21 16:54:12 +00:00
Dan Gohman	e49a93ccea	Disable constant-offset folding for PowerPC, as the PowerPC target isn't yet prepared for it. llvm-svn: 57886	2008-10-21 03:41:46 +00:00
Dan Gohman	847a83dbad	Don't create TargetGlobalAddress nodes with offsets that don't fit in the 32-bit signed offset field of addresses. Even though this may be intended, some linkers refuse to relocate code where the relocated address computation overflows. Also, fix the sign-extension of constant offsets to use the actual pointer size, rather than the size of the GlobalAddress node, which may be different, for example on x86-64 where MVT::i32 is used when the address is being fit into the 32-bit displacement field. llvm-svn: 57885	2008-10-21 03:38:42 +00:00
Dan Gohman	281881b8e2	Optimized FCMP_OEQ and FCMP_UNE for x86. Where previously LLVM might emit code like this: ucomisd %xmm1, %xmm0 setne %al setp %cl orb %al, %cl jne .LBB4_2 it now emits this: ucomisd %xmm1, %xmm0 jne .LBB4_2 jp .LBB4_2 It has fewer instructions and uses fewer registers, but it does have more branches. And in the case that this code is followed by a non-fallthrough edge, it may be followed by a jmp instruction, resulting in three branch instructions in sequence. Some effort is made to avoid this situation. To achieve this, X86ISelLowering.cpp now recognizes FCMP_OEQ and FCMP_UNE in lowered form, and replace them with code that emits two branches, except in the case where it would require converting a fall-through edge to an explicit branch. Also, X86InstrInfo.cpp's branch analysis and transform code now knows now to handle blocks with multiple conditional branches. It uses loops instead of having fixed checks for up to two instructions. It can now analyze and transform code generated from FCMP_OEQ and FCMP_UNE. llvm-svn: 57873	2008-10-21 03:29:32 +00:00
Dan Gohman	d692070372	When the coalescer is doing rematerializing, have it remove the copy instruction from the instruction list before asking the target to create the new instruction. This gets the old instruction out of the way so that it doesn't interfere with the target's rematerialization code. In the case of x86, this helps it find more cases where EFLAGS is not live. Also, in the X86InstrInfo.cpp, teach isSafeToClobberEFLAGS to check to see if it reached the end of the block after scanning each instruction, instead of just before. This lets it notice when the end of the block is only two instructions away, without doing any additional scanning. These changes allow rematerialization to clobber EFLAGS in more cases, for example using xor instead of mov to set the return value to zero in the included testcase. llvm-svn: 57872	2008-10-21 03:24:31 +00:00
Jim Grosbach	1de8b23129	Update the stub and callback code to handle lazy compilation. The stub is re-written by the callback to branch directly to the compiled code in future invocations. Added back in range-based memory permission functions for the updating of the stub on Darwin. llvm-svn: 57846	2008-10-20 21:39:23 +00:00
Duncan Sands	98fc39f607	Have X86 custom lowering for LegalizeTypes use LowerOperation if it doesn't know what else to do. This methods should probably be factorized some, but this is good enough for the moment. Have LowerATOMIC_BINARY_64 use EXTRACT_ELEMENT rather than assuming the operand is a BUILD_PAIR (if it is then getNode will automagically simplify the EXTRACT_ELEMENT). This way LowerATOMIC_BINARY_64 usable from LegalizeTypes. llvm-svn: 57831	2008-10-20 15:56:33 +00:00
Dan Gohman	15597f07b2	Teach DAGCombine to fold constant offsets into GlobalAddress nodes, and add a TargetLowering hook for it to use to determine when this is legal (i.e. not in PIC mode, etc.) This allows instruction selection to emit folded constant offsets in more cases, such as the included testcase, eliminating the need for explicit arithmetic instructions. This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp that attempted to achieve the same effect, but wasn't as effective. Also, fix handling of offsets in GlobalAddressSDNodes in several places, including changing GlobalAddressSDNode's offset from int to int64_t. The Mips, Alpha, Sparc, and CellSPU targets appear to be unaware of GlobalAddress offsets currently, so set the hook to false on those targets. llvm-svn: 57748	2008-10-18 02:06:02 +00:00
Dan Gohman	ac8c7772ba	This is now partly done. llvm-svn: 57734	2008-10-17 21:39:27 +00:00
Dan Gohman	69ac9cc00f	This is done. llvm-svn: 57733	2008-10-17 21:38:40 +00:00
Evan Cheng	08d0796cf5	Add implicit defs of XMM8 to XMM15 on 32-bit call instructions. While this is not technically true, it tells tblgen that these instructions "clobber" the entire XMM register file. llvm-svn: 57723	2008-10-17 21:02:22 +00:00
Chris Lattner	d96b8d12bc	add support for 128 bit inputs on both x86-64 and x86-32. llvm-svn: 57709	2008-10-17 18:15:05 +00:00
Chris Lattner	231a9466df	Fix a bug where the x86 backend would reject 64-bit r constraints when in 32-bit mode instead of assigning a register pair. This has nothing to do with PR2356, but I happened to notice it while working on it. llvm-svn: 57704	2008-10-17 17:59:52 +00:00
Evan Cheng	fa61b6a4ba	Fix lfence and mfence encoding. These look like MRM5r and MRM6r instructions except they do not have any operands. The RegModRM byte is encoded with register number 0. llvm-svn: 57692	2008-10-17 17:14:20 +00:00
Evan Cheng	733b305f24	getX86RegNum has long been moved to X86RegisterInfo. llvm-svn: 57691	2008-10-17 17:12:18 +00:00
Chris Lattner	e087da1d39	add some simple hacky long double support for the CBE. This should work for intel long double, but ppc long double aborts in convert. llvm-svn: 57672	2008-10-17 06:11:48 +00:00
Dan Gohman	268cfea6bc	Fun x86 encoding tricks: when adding an immediate value of 128, use a SUB instruction instead of an ADD, because -128 can be encoded in an 8-bit signed immediate field, while +128 can't be. This avoids the need for a 32-bit immediate field in this case. A similar optimization applies to 64-bit adds with 0x80000000, with the 32-bit signed immediate field. To support this, teach tablegen how to handle 64-bit constants. llvm-svn: 57663	2008-10-17 01:33:43 +00:00
Dan Gohman	5d83bd89a5	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Dan Gohman	90f776986d	Trim #includes. llvm-svn: 57649	2008-10-16 20:18:31 +00:00
Chris Lattner	39f881ab69	fix typo noticed by sdt llvm-svn: 57644	2008-10-16 17:02:50 +00:00
Duncan Sands	1349af7df4	Fix warnings about mb/me being potentially used uninitialized in these functions with gcc-4.3. llvm-svn: 57635	2008-10-16 13:02:33 +00:00
Chris Lattner	9afb6adf17	add some notes llvm-svn: 57631	2008-10-16 07:04:06 +00:00
Chris Lattner	9d39b11d10	add some notes and a file to collect unimplemented features in the x86 backend. These will all be answered with "patches welcome", so a PR doesn't help drive them along. llvm-svn: 57630	2008-10-16 06:46:12 +00:00
Chris Lattner	562984c110	mark some targets as experimental. Andrew, if you think that Alpha is basically working, feel free to remove the tag. The other targets have really basic things that break them. llvm-svn: 57628	2008-10-16 06:16:50 +00:00
Dan Gohman	86527c1834	Const-ify several TargetInstrInfo methods. llvm-svn: 57622	2008-10-16 01:49:15 +00:00
Dan Gohman	991376be85	Remove an unused variable. llvm-svn: 57621	2008-10-16 01:47:47 +00:00
Dan Gohman	6dba6b2384	Fix the predicate for memop64 to be a regular load, not just an unindexed load. llvm-svn: 57612	2008-10-16 00:03:00 +00:00
Chris Lattner	2ce4f1e7ad	move PR1941 here. llvm-svn: 57586	2008-10-15 16:33:52 +00:00
Chris Lattner	866578b51b	move PR1604 here. llvm-svn: 57582	2008-10-15 16:06:03 +00:00
Chris Lattner	4ccc775d89	move PR1488 into this file. llvm-svn: 57579	2008-10-15 16:02:15 +00:00
Dan Gohman	65702b2eb8	Now that predicates can be composed, simplify several of the predicates by extending simple predicates to create more complex predicates instead of duplicating the logic for the simple predicates. This doesn't reduce much redundancy in DAGISelEmitter.cpp's generated source yet; that will require improvements to DAGISelEmitter.cpp's instruction sorting, to make it more effectively group nodes with similar predicates together. llvm-svn: 57565	2008-10-15 06:50:19 +00:00
Chris Lattner	d91c01484c	add a note llvm-svn: 57557	2008-10-15 05:53:25 +00:00
Chris Lattner	7194e8406a	add support for folding immediates into stores when they are due to argument passing in calls. This is significant because it hits all immediate arguments to calls on x86-32. llvm-svn: 57556	2008-10-15 05:38:32 +00:00
Chris Lattner	214643296b	fold immediates into stores in simple cases, this produces diffs like this: - movl $0, %eax - movl %eax, _yy_n_chars + movl $0, _yy_n_chars llvm-svn: 57555	2008-10-15 05:30:52 +00:00
Chris Lattner	5716b8daa4	fold compare of null pointer into compare with 0. llvm-svn: 57553	2008-10-15 05:18:04 +00:00
Chris Lattner	928e8e5092	Some minor cleanups: 1. Compute action in X86SelectSelect based on MVT instead of type. 2. Use TLI.getValueType(..) instead of MVT::getVT(..) because the former handles pointers and the later doesn't. 3. Don't pass TLI into isTypeLegal, since it already has access to it as an ivar. #2 gives fast isel some minor new functionality: handling load/stores of pointers. llvm-svn: 57552	2008-10-15 05:07:36 +00:00
Chris Lattner	da69b5e401	Use switch on VT instead of Type* comparisons. llvm-svn: 57551	2008-10-15 04:32:45 +00:00
Chris Lattner	052e062f08	Use X86FastEmitCompare for FCMP_OEQ and FCMP_UNE: it doesn't change the generated code, but makes the code simpler. llvm-svn: 57550	2008-10-15 04:29:23 +00:00
Chris Lattner	d7024dafbf	refactor compare emission out into a new X86FastEmitCompare method, which makes it easy to share the compare/imm folding logic with 'setcc'. This shaves a bunch of instructions off the common select case, which happens a lot in llvm-gcc. llvm-svn: 57549	2008-10-15 04:26:38 +00:00
Chris Lattner	cbea6edde5	Fold immediates into compares when possible, producing "cmp $4, %eax" instead of loading 4 into a register and then doing the compare. llvm-svn: 57548	2008-10-15 04:13:29 +00:00
Chris Lattner	905dbd0dea	more minor refactoring of X86SelectBranch, no functionality change. llvm-svn: 57547	2008-10-15 04:02:26 +00:00
Chris Lattner	99d88895b6	factor buildmi calls in X86SelectBranch llvm-svn: 57546	2008-10-15 03:58:05 +00:00
Chris Lattner	97e96bcca0	factor some more BuildMI's in X86SelectCmp llvm-svn: 57545	2008-10-15 03:52:54 +00:00
Chris Lattner	e66c12b3f5	factor some BuildMI calls, no functionality change. llvm-svn: 57544	2008-10-15 03:47:17 +00:00
Evan Cheng	cb8b4e9dd4	- Add target lowering hooks that specify which setcc conditions are illegal, i.e. conditions that cannot be checked with a single instruction. For example, SETONE and SETUEQ on x86. - Teach legalizer to implement illegal setcc as a and / or of a number of legal setcc nodes. For now, only implement FP conditions. e.g. SETONE is implemented as SETO & SETNE, SETUEQ is SETUO \| SETEQ. - Move x86 target over. llvm-svn: 57542	2008-10-15 02:05:31 +00:00
Dan Gohman	c070ffc493	FastISel support for exception-handling constructs. - Move the EH landing-pad code and adjust it so that it works with FastISel as well as with SDISel. - Add FastISel support for @llvm.eh.exception and @llvm.eh.selector. llvm-svn: 57539	2008-10-14 23:54:11 +00:00
Dale Johannesen	19407daf55	Accept -march=i586, because gcc does (a synonym for pentium). Fixes gcc.target/i386/20000720-1.c gcc.target/i386/pr26826.c llvm-svn: 57528	2008-10-14 22:06:33 +00:00
Evan Cheng	3faedff2de	Rename LoadX to LoadExt. llvm-svn: 57526	2008-10-14 21:26:46 +00:00
Jim Grosbach	d0ff59ec42	Update ARM Insn encoding to get endian-ness to match the documentation (31-0 left to right) llvm-svn: 57524	2008-10-14 20:36:24 +00:00
Dan Gohman	9543edc4ef	Fix command-line option printing to print two spaces where needed, instead of requiring all "short description" strings to begin with two spaces. This makes these strings less mysterious, and it fixes some cases where short description strings mistakenly did not begin with two spaces. llvm-svn: 57521	2008-10-14 20:25:08 +00:00
Evan Cheng	39a819027c	Fix indentation. llvm-svn: 57508	2008-10-14 17:15:39 +00:00
Dan Gohman	e08e0dcfcc	When doing the very-late shift-and address-mode optimization, create a new DAG node to represent the new shift to keep the DAG consistent, even though it'll almost always be folded into the address. If a user of the resulting address has multiple uses, the nodes may get revisited by a later MatchAddress call, in which case DAG inconsistencies do matter. This fixes PR2849. llvm-svn: 57465	2008-10-13 20:52:04 +00:00
Anton Korobeynikov	6204f4ac43	Update size of inst correctly with segment override. llvm-svn: 57414	2008-10-12 10:30:11 +00:00
Chris Lattner	7910d59d44	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Duncan Sands	f175d9eae0	Fix comment typo. llvm-svn: 57381	2008-10-11 19:34:24 +00:00
Anton Korobeynikov	d8b3e6f0ce	Add ability to override segment (mostly for code emitter purposes). llvm-svn: 57380	2008-10-11 19:09:15 +00:00
Dale Johannesen	03d0a7d70c	Fix SSE4.1 roundss, roundsd. While the instructions have the same pattern as roundpd/roundps, the Intel compiler builtins do not: rounds* has an extra operand. Fixes gcc.target/i386/sse4_1-rounds[sd]-[1234].c llvm-svn: 57370	2008-10-10 23:51:03 +00:00
Anton Korobeynikov	e44fccb2f0	Fix a thinko and unbreak sparc default CC llvm-svn: 57368	2008-10-10 21:47:37 +00:00
Anton Korobeynikov	f5398df73c	Extend set of return registers on sparc until someone will implement MRV support there. At least, this will allow libgcc compile, however we are not ABI-compatible with stuff compiled with native gcc. llvm-svn: 57364	2008-10-10 20:30:14 +00:00
Anton Korobeynikov	1ead869e67	Ignore extra 'r' modifier for now llvm-svn: 57363	2008-10-10 20:29:50 +00:00
Anton Korobeynikov	ffdc3498b4	Use expand for smul_lohi for now llvm-svn: 57362	2008-10-10 20:29:31 +00:00
Anton Korobeynikov	8ed548b24a	Add rudimentary support for 'r' register operand llvm-svn: 57359	2008-10-10 20:28:10 +00:00
Anton Korobeynikov	ba8bb3e12f	Cleanup llvm-svn: 57358	2008-10-10 20:27:31 +00:00
Anton Korobeynikov	6054846989	Add rudimentary asmprinter support for printing inline asm operands for sparc. llvm-svn: 57346	2008-10-10 10:15:03 +00:00
Anton Korobeynikov	a12d563611	Add dummy 'm' inline asm constraint handler for Sparc. I'm not sure, whether it is correct, however :) llvm-svn: 57345	2008-10-10 10:14:47 +00:00
Anton Korobeynikov	8cfd14b138	Cleanup llvm-svn: 57344	2008-10-10 10:14:15 +00:00
Dale Johannesen	075a62519f	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Dale Johannesen	9e57068854	Rename APFloat::convertToAPInt to bitcastToAPInt to make it clearer what the function does. No functional change. llvm-svn: 57325	2008-10-09 18:53:47 +00:00
Chris Lattner	284ae75537	get CodeGen/Alpha/mul128.ll to work. llvm-svn: 57318	2008-10-09 04:50:56 +00:00
Dale Johannesen	817eff07b1	(re)Put const weak strings in appropriate section on Darwin. g++dg/abi/key2.C llvm-svn: 57309	2008-10-08 21:49:47 +00:00
Jim Grosbach	974799922e	Comment to be explicit that the enumeration values for CondCodes matter. llvm-svn: 57295	2008-10-08 16:24:35 +00:00
Duncan Sands	bc53ce331a	Use template to distinguish between function variants. GCC 4.4.0 gives an error on the "int" declaration for example saying that it has already been declared (using the "short" one). Using templates here allow the compiler to distinguish between the function to choose. Also, "llvm/Support/DataTypes.h" was not included, leading to error messages about not knowing "uint32_t" for example. Patch by Samuel Tardieu. llvm-svn: 57292	2008-10-08 07:44:52 +00:00
Duncan Sands	8f296a3788	Add <cstdio> include where needed by gcc-4.4. Patch by Samuel Tardieu. llvm-svn: 57291	2008-10-08 07:23:46 +00:00
Dan Gohman	c65ca09308	Add MBB successors and physreg Uses in the same order that SDISel typically adds them in. This makes it a little easier to compare FastISel output with SDISel output. llvm-svn: 57266	2008-10-07 22:10:33 +00:00
Dan Gohman	2561119124	Instead of emitting an implicit use for the super-register of X86::CL that was used, emit an EXTRACT_SUBREG from the CL super-register to CL. This more precisely describes how the CL register is being used. llvm-svn: 57264	2008-10-07 21:50:36 +00:00
Jim Grosbach	8ac554209f	Unconditional branch instruction encoding fix. Needs to use ABI, not AXI, to get the proper opcode bits. llvm-svn: 57262	2008-10-07 21:08:09 +00:00
Jim Grosbach	5fb8bb7434	need ARM.h for ARMCC definition llvm-svn: 57261	2008-10-07 21:01:51 +00:00
Owen Anderson	c9a628af26	Add an option to enable StrongPHIElimination, for ease of testing. llvm-svn: 57259	2008-10-07 20:22:28 +00:00
Jim Grosbach	d44d20be6e	Encode the conditional execution predicate when JITing. llvm-svn: 57258	2008-10-07 19:05:35 +00:00
Dale Johannesen	3d2e449078	Model hardwired inputs & outputs of x86 8-bit divides correctly. Fixes local RA miscompilation of gcc.c-torture/execute/20020904-1.c -O0. llvm-svn: 57257	2008-10-07 18:54:28 +00:00
Jim Grosbach	61f8207ac9	Clarify naming and correct conditional so that CMP and CMN instructions get the Rn operand encoded properly llvm-svn: 57252	2008-10-07 17:42:09 +00:00
Jim Grosbach	a52546fec4	Fix Opcode values of CMP and CMN llvm-svn: 57251	2008-10-07 17:40:46 +00:00
Anders Carlsson	a9c42526f8	Certain patterns involving the "movss" instruction were marked as requiring SSE2, when in reality movss is an SSE1 instruction. llvm-svn: 57246	2008-10-07 16:14:11 +00:00
Andrew Lenharth	7a997d1cfd	Note that ADDC and company don't actually expand yet (missing in legalize llvm-svn: 57226	2008-10-07 02:10:26 +00:00
Evan Cheng	88d76ffe8a	Fix PR2850 and PR2863. Only generate movddup for 128-bit SSE vector shuffles. llvm-svn: 57210	2008-10-06 21:13:08 +00:00
Devang Patel	07a9ce22e4	It is possible that all functions in one module are not being optimized for size. Set OptForSize for each function separately. llvm-svn: 57182	2008-10-06 18:03:39 +00:00
Devang Patel	4f91b766d2	Remove unncessary isDeclaration() checks. llvm-svn: 57179	2008-10-06 17:30:07 +00:00
Anton Korobeynikov	8106145480	Emit type-correct constant null. Also fix a typo. Patch by Robert G. Jakabosky! llvm-svn: 57110	2008-10-05 15:07:06 +00:00
Anton Korobeynikov	1f7e8162d9	Fix weird think-o and unbreak build on all gcc-3.4.x-based platforms (e.g. mingw) llvm-svn: 57106	2008-10-05 08:53:29 +00:00
Chris Lattner	ed0f84ee2d	this case is matched now. llvm-svn: 57096	2008-10-05 02:16:12 +00:00
Anton Korobeynikov	4cc9051fbb	Revert r56675 - it breaks unwinding runtime everywhere. llvm-svn: 57048	2008-10-04 11:09:36 +00:00
Dale Johannesen	dc83b95ba5	Make atomic Swap work, 64-bit on x86-32. Make it all work in non-pic mode. llvm-svn: 57034	2008-10-03 22:25:52 +00:00
Dale Johannesen	27d8955b8f	Pass MemOperand through for 64-bit atomics on 32-bit, incidentally making the case where the memop is a pointer deref work. Fix cmp-and-swap regression. llvm-svn: 57027	2008-10-03 19:41:08 +00:00
Dan Gohman	00034b1416	Avoid creating two TargetLowering objects for each target. Instead, just create one, and make sure everything that needs it can access it. Previously most of the SelectionDAGISel subclasses all had their own TargetLowering object, which was redundant with the TargetLowering object in the TargetMachine subclasses, except on Sparc, where SparcTargetMachine didn't have a TargetLowering object. Change Sparc to work more like the other targets here. llvm-svn: 57016	2008-10-03 16:55:19 +00:00
Dan Gohman	e41ac02dce	Remove an unused field. llvm-svn: 57014	2008-10-03 16:17:33 +00:00
Jim Grosbach	d9ff019d3e	Indexing off by one resulted in errant encoding of source register for reg->reg moves. llvm-svn: 57011	2008-10-03 15:53:56 +00:00
Jim Grosbach	e398f553b2	NeedStub/DoesntNeedStub logic was reversed, leading to not using a stub for global relocations that do need them (libc calls, for example). llvm-svn: 57010	2008-10-03 15:52:42 +00:00
Dan Gohman	30c5ce1b7d	Switch the MachineOperand accessors back to the short names like isReg, etc., from isRegister, etc. llvm-svn: 57006	2008-10-03 15:45:36 +00:00
Dan Gohman	78ef9889c9	Fix X86FastISel to handle dynamic allocas that have avoided getting inserted into the ValueMap. This avoids infinite recursion in some rare cases. llvm-svn: 56989	2008-10-03 01:27:49 +00:00
Dan Gohman	e75d14f8b0	Optimize conditional branches in X86FastISel. This replaces sequences like this: sete %al testb %al, %al jne LBB11_1 with this: je LBB11_1 llvm-svn: 56969	2008-10-02 22:15:21 +00:00
Dale Johannesen	dbd7b1bd33	Handle some 64-bit atomics on x86-32, some of the time. llvm-svn: 56963	2008-10-02 18:53:47 +00:00
Dan Gohman	bba3fb6d18	Work around an interaction between fast-isel and regalloc=local. The local register allocator's physreg liveness doesn't recognize subregs, so it doesn't know that defs of %ecx that are immediately followed by uses of %cl aren't dead. This comes up due to the way fast-isel emits shift instructions. This is a temporary workaround. Arguably, local regalloc should handle subreg references correctly. On the other hand, perhaps fast-isel should use INSERT_SUBREG instead of just assigning to the most convenient super-register of %cl when lowering shifts. This fixes MultiSource/Benchmarks/MallocBench/espresso, MultiSource/Applications/hexxagon, and others, under -fast. llvm-svn: 56947	2008-10-02 14:56:12 +00:00
Bill Wendling	1c97db4090	"The original bug was a complaint that _mm_srli_si128 mis-compiled when passed a constant vector ("{0x123, 0x456}" syntax). The fix is to simplify the _mm_srli_si128 macro, and move the "* 8" from the macro into the compiler back-end. I can't change the existing __builtins because so many people are using them :-(." Patch by Stuart Hastings! llvm-svn: 56944	2008-10-02 05:56:52 +00:00
Devang Patel	a5cda569d3	Remove OptimizeForSize global. Use function attribute optsize. llvm-svn: 56937	2008-10-01 23:18:38 +00:00
Dan Gohman	4aacc3ab83	Split x86's ADJCALLSTACK instructions into 32-bit and 64-bit forms. This allows the 64-bit forms to use+def RSP instead of ESP. This doesn't fix any real bugs today, but it is more precise and it makes the debug dumps on x86-64 look more consistent. Also, add some comments describing the CALL instructions' physreg operand uses and defs. llvm-svn: 56925	2008-10-01 18:28:06 +00:00
Jim Grosbach	0268ee4c25	Fix typo s/ther/there/ llvm-svn: 56924	2008-10-01 18:16:49 +00:00
Dan Gohman	d6e96e9888	Mark CALL instructions as having a Use of ESP/RSP. llvm-svn: 56911	2008-10-01 04:14:30 +00:00
Bill Wendling	d7effcf8da	Implement the -fno-builtin option in the front-end, not in the back-end. llvm-svn: 56900	2008-10-01 00:59:58 +00:00
Bill Wendling	618d422cdd	Just don't transform this memset into "bzero" if no-builtin is specified. llvm-svn: 56888	2008-09-30 22:05:33 +00:00

1 2 3 4 5 ...

9257 Commits