llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Duncan Sands	4b148a29ef	Fix translateX86CC: if SetCCOpcode is SETULE and LHS is a foldable load, then LHS and RHS are swapped and SetCCOpcode is changed to SETUGT. But the later code is expecting operands to be the wrong way round for SETUGT, but they are not in this case, resulting in an inverted compare. The solution is to move the load normalization before the correction for SETUGT. This bug was tickled by LegalizeTypes which happened to legalize the testcase slightly differently to LegalizeDAG. llvm-svn: 58092	2008-10-24 13:03:10 +00:00
Dan Gohman	ed90fd3ecf	Fix constant-offset emission for x86-64 absolute addresses. This fixes a bunch of test-suite JIT failures on x86-64 in -relocation-model=static mode. llvm-svn: 58066	2008-10-24 01:57:54 +00:00
Dale Johannesen	b79ddda5bf	Mark defs and uses of CTR and LR correctly. Prevents DeadMachineInstructionElim from thinking things like MTCTR are dead (fixes massive testsuite breakage at -O0). llvm-svn: 58043	2008-10-23 20:41:28 +00:00
Jim Grosbach	a8a40398e8	remove extraneous #ifdef's llvm-svn: 58006	2008-10-22 22:27:51 +00:00
Dale Johannesen	c146b1b281	Remove allocation of unused stack slot. llvm-svn: 57987	2008-10-22 17:26:06 +00:00
Duncan Sands	9d8f7ab614	Get this working with LegalizeTypes: (1) don't assume that i64 has been turned into a BUILD_PAIR node (when called from LegalizeTypes this hasn't happened yet) and don't use a vector shuffle mask with an illegal element type. llvm-svn: 57972	2008-10-22 11:24:12 +00:00
Chris Lattner	cf48fee0c7	Fix PR2907 by digging through constant expressions to find FP constants that are their operands. llvm-svn: 57956	2008-10-22 04:53:16 +00:00
Oscar Fuentes	a932cae97a	CMake: Turned some libraries into partially linked objects. Corrected names of LLVMCore and ARMCodeGen. llvm-svn: 57943	2008-10-22 02:51:53 +00:00
Dale Johannesen	3bd1c1e5cd	Adjust comments for pedantic satisfaction. llvm-svn: 57940	2008-10-22 00:02:32 +00:00
Dale Johannesen	9185d28b4b	Add comments to explain uint64->f64 algorithm, well, sort of. (Algorithm by Ian Ollmann.) llvm-svn: 57932	2008-10-21 23:07:49 +00:00
Dale Johannesen	eb7e2deb1d	Add an SSE2 algorithm for uint64->f64 conversion. The same one Apple gcc uses, faster. Also gets the extreme case in gcc.c-torture/execute/ieee/rbug.c correct which we weren't before; this is not sufficient to get the test to pass though, there is another bug. llvm-svn: 57926	2008-10-21 20:50:01 +00:00
Dan Gohman	34306e122d	Implement the optimized FCMP_OEQ/FCMP_UNE code for x86 fast-isel. llvm-svn: 57915	2008-10-21 18:24:51 +00:00
Jim Grosbach	24a4744d53	use pre-UAL mnemonics for push/pop for compilaton callback function llvm-svn: 57911	2008-10-21 16:54:12 +00:00
Dan Gohman	e49a93ccea	Disable constant-offset folding for PowerPC, as the PowerPC target isn't yet prepared for it. llvm-svn: 57886	2008-10-21 03:41:46 +00:00
Dan Gohman	847a83dbad	Don't create TargetGlobalAddress nodes with offsets that don't fit in the 32-bit signed offset field of addresses. Even though this may be intended, some linkers refuse to relocate code where the relocated address computation overflows. Also, fix the sign-extension of constant offsets to use the actual pointer size, rather than the size of the GlobalAddress node, which may be different, for example on x86-64 where MVT::i32 is used when the address is being fit into the 32-bit displacement field. llvm-svn: 57885	2008-10-21 03:38:42 +00:00
Dan Gohman	281881b8e2	Optimized FCMP_OEQ and FCMP_UNE for x86. Where previously LLVM might emit code like this: ucomisd %xmm1, %xmm0 setne %al setp %cl orb %al, %cl jne .LBB4_2 it now emits this: ucomisd %xmm1, %xmm0 jne .LBB4_2 jp .LBB4_2 It has fewer instructions and uses fewer registers, but it does have more branches. And in the case that this code is followed by a non-fallthrough edge, it may be followed by a jmp instruction, resulting in three branch instructions in sequence. Some effort is made to avoid this situation. To achieve this, X86ISelLowering.cpp now recognizes FCMP_OEQ and FCMP_UNE in lowered form, and replace them with code that emits two branches, except in the case where it would require converting a fall-through edge to an explicit branch. Also, X86InstrInfo.cpp's branch analysis and transform code now knows now to handle blocks with multiple conditional branches. It uses loops instead of having fixed checks for up to two instructions. It can now analyze and transform code generated from FCMP_OEQ and FCMP_UNE. llvm-svn: 57873	2008-10-21 03:29:32 +00:00
Dan Gohman	d692070372	When the coalescer is doing rematerializing, have it remove the copy instruction from the instruction list before asking the target to create the new instruction. This gets the old instruction out of the way so that it doesn't interfere with the target's rematerialization code. In the case of x86, this helps it find more cases where EFLAGS is not live. Also, in the X86InstrInfo.cpp, teach isSafeToClobberEFLAGS to check to see if it reached the end of the block after scanning each instruction, instead of just before. This lets it notice when the end of the block is only two instructions away, without doing any additional scanning. These changes allow rematerialization to clobber EFLAGS in more cases, for example using xor instead of mov to set the return value to zero in the included testcase. llvm-svn: 57872	2008-10-21 03:24:31 +00:00
Jim Grosbach	1de8b23129	Update the stub and callback code to handle lazy compilation. The stub is re-written by the callback to branch directly to the compiled code in future invocations. Added back in range-based memory permission functions for the updating of the stub on Darwin. llvm-svn: 57846	2008-10-20 21:39:23 +00:00
Duncan Sands	98fc39f607	Have X86 custom lowering for LegalizeTypes use LowerOperation if it doesn't know what else to do. This methods should probably be factorized some, but this is good enough for the moment. Have LowerATOMIC_BINARY_64 use EXTRACT_ELEMENT rather than assuming the operand is a BUILD_PAIR (if it is then getNode will automagically simplify the EXTRACT_ELEMENT). This way LowerATOMIC_BINARY_64 usable from LegalizeTypes. llvm-svn: 57831	2008-10-20 15:56:33 +00:00
Dan Gohman	15597f07b2	Teach DAGCombine to fold constant offsets into GlobalAddress nodes, and add a TargetLowering hook for it to use to determine when this is legal (i.e. not in PIC mode, etc.) This allows instruction selection to emit folded constant offsets in more cases, such as the included testcase, eliminating the need for explicit arithmetic instructions. This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp that attempted to achieve the same effect, but wasn't as effective. Also, fix handling of offsets in GlobalAddressSDNodes in several places, including changing GlobalAddressSDNode's offset from int to int64_t. The Mips, Alpha, Sparc, and CellSPU targets appear to be unaware of GlobalAddress offsets currently, so set the hook to false on those targets. llvm-svn: 57748	2008-10-18 02:06:02 +00:00
Dan Gohman	ac8c7772ba	This is now partly done. llvm-svn: 57734	2008-10-17 21:39:27 +00:00
Dan Gohman	69ac9cc00f	This is done. llvm-svn: 57733	2008-10-17 21:38:40 +00:00
Evan Cheng	08d0796cf5	Add implicit defs of XMM8 to XMM15 on 32-bit call instructions. While this is not technically true, it tells tblgen that these instructions "clobber" the entire XMM register file. llvm-svn: 57723	2008-10-17 21:02:22 +00:00
Chris Lattner	d96b8d12bc	add support for 128 bit inputs on both x86-64 and x86-32. llvm-svn: 57709	2008-10-17 18:15:05 +00:00
Chris Lattner	231a9466df	Fix a bug where the x86 backend would reject 64-bit r constraints when in 32-bit mode instead of assigning a register pair. This has nothing to do with PR2356, but I happened to notice it while working on it. llvm-svn: 57704	2008-10-17 17:59:52 +00:00
Evan Cheng	fa61b6a4ba	Fix lfence and mfence encoding. These look like MRM5r and MRM6r instructions except they do not have any operands. The RegModRM byte is encoded with register number 0. llvm-svn: 57692	2008-10-17 17:14:20 +00:00
Evan Cheng	733b305f24	getX86RegNum has long been moved to X86RegisterInfo. llvm-svn: 57691	2008-10-17 17:12:18 +00:00
Chris Lattner	e087da1d39	add some simple hacky long double support for the CBE. This should work for intel long double, but ppc long double aborts in convert. llvm-svn: 57672	2008-10-17 06:11:48 +00:00
Dan Gohman	268cfea6bc	Fun x86 encoding tricks: when adding an immediate value of 128, use a SUB instruction instead of an ADD, because -128 can be encoded in an 8-bit signed immediate field, while +128 can't be. This avoids the need for a 32-bit immediate field in this case. A similar optimization applies to 64-bit adds with 0x80000000, with the 32-bit signed immediate field. To support this, teach tablegen how to handle 64-bit constants. llvm-svn: 57663	2008-10-17 01:33:43 +00:00
Dan Gohman	5d83bd89a5	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Dan Gohman	90f776986d	Trim #includes. llvm-svn: 57649	2008-10-16 20:18:31 +00:00
Chris Lattner	39f881ab69	fix typo noticed by sdt llvm-svn: 57644	2008-10-16 17:02:50 +00:00
Duncan Sands	1349af7df4	Fix warnings about mb/me being potentially used uninitialized in these functions with gcc-4.3. llvm-svn: 57635	2008-10-16 13:02:33 +00:00
Chris Lattner	9afb6adf17	add some notes llvm-svn: 57631	2008-10-16 07:04:06 +00:00
Chris Lattner	9d39b11d10	add some notes and a file to collect unimplemented features in the x86 backend. These will all be answered with "patches welcome", so a PR doesn't help drive them along. llvm-svn: 57630	2008-10-16 06:46:12 +00:00
Chris Lattner	562984c110	mark some targets as experimental. Andrew, if you think that Alpha is basically working, feel free to remove the tag. The other targets have really basic things that break them. llvm-svn: 57628	2008-10-16 06:16:50 +00:00
Dan Gohman	86527c1834	Const-ify several TargetInstrInfo methods. llvm-svn: 57622	2008-10-16 01:49:15 +00:00
Dan Gohman	991376be85	Remove an unused variable. llvm-svn: 57621	2008-10-16 01:47:47 +00:00
Dan Gohman	6dba6b2384	Fix the predicate for memop64 to be a regular load, not just an unindexed load. llvm-svn: 57612	2008-10-16 00:03:00 +00:00
Chris Lattner	2ce4f1e7ad	move PR1941 here. llvm-svn: 57586	2008-10-15 16:33:52 +00:00
Chris Lattner	866578b51b	move PR1604 here. llvm-svn: 57582	2008-10-15 16:06:03 +00:00
Chris Lattner	4ccc775d89	move PR1488 into this file. llvm-svn: 57579	2008-10-15 16:02:15 +00:00
Dan Gohman	65702b2eb8	Now that predicates can be composed, simplify several of the predicates by extending simple predicates to create more complex predicates instead of duplicating the logic for the simple predicates. This doesn't reduce much redundancy in DAGISelEmitter.cpp's generated source yet; that will require improvements to DAGISelEmitter.cpp's instruction sorting, to make it more effectively group nodes with similar predicates together. llvm-svn: 57565	2008-10-15 06:50:19 +00:00
Chris Lattner	d91c01484c	add a note llvm-svn: 57557	2008-10-15 05:53:25 +00:00
Chris Lattner	7194e8406a	add support for folding immediates into stores when they are due to argument passing in calls. This is significant because it hits all immediate arguments to calls on x86-32. llvm-svn: 57556	2008-10-15 05:38:32 +00:00
Chris Lattner	214643296b	fold immediates into stores in simple cases, this produces diffs like this: - movl $0, %eax - movl %eax, _yy_n_chars + movl $0, _yy_n_chars llvm-svn: 57555	2008-10-15 05:30:52 +00:00
Chris Lattner	5716b8daa4	fold compare of null pointer into compare with 0. llvm-svn: 57553	2008-10-15 05:18:04 +00:00
Chris Lattner	928e8e5092	Some minor cleanups: 1. Compute action in X86SelectSelect based on MVT instead of type. 2. Use TLI.getValueType(..) instead of MVT::getVT(..) because the former handles pointers and the later doesn't. 3. Don't pass TLI into isTypeLegal, since it already has access to it as an ivar. #2 gives fast isel some minor new functionality: handling load/stores of pointers. llvm-svn: 57552	2008-10-15 05:07:36 +00:00
Chris Lattner	da69b5e401	Use switch on VT instead of Type* comparisons. llvm-svn: 57551	2008-10-15 04:32:45 +00:00
Chris Lattner	052e062f08	Use X86FastEmitCompare for FCMP_OEQ and FCMP_UNE: it doesn't change the generated code, but makes the code simpler. llvm-svn: 57550	2008-10-15 04:29:23 +00:00

1 2 3 4 5 ...

9128 Commits