llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 06:22:51 +01:00

Author	SHA1	Message	Date
Dan Gohman	fd00e20872	The 32-bit displacement field in an x86 address is signed. Arrange for it to be sign-extended when it is promoted to 64 bits for intermediate offset calculations. The offset calculations are done as uint64_t so that overflow conditions are well defined. This fixes a problem which is currently hidden by the x86 AsmPrinter but which was exposed by r58917 (which is temporarily reverted). See PR3027 for details. llvm-svn: 59044	2008-11-11 15:52:29 +00:00
Dan Gohman	b5857c2859	Fix indentation. llvm-svn: 59004	2008-11-10 22:09:58 +00:00
Evan Cheng	080c1a9fac	Rename isGVNonLazyPtr to isIndirectSym to reflect how it will be used. llvm-svn: 58949	2008-11-10 01:08:07 +00:00
Anton Korobeynikov	a4933e408d	Temporarily revert my last commit: it seems to trigger some subtle bug in the backend and breaks llvm-gcc. llvm-svn: 58926	2008-11-08 23:05:05 +00:00
Anton Korobeynikov	e3b9284fa8	Factor out offset printing code into generic AsmPrinter. FIXME: it seems, that most of targets don't support offsets wrt CPI/GlobalAddress', was it intentional? llvm-svn: 58917	2008-11-08 17:21:38 +00:00
Evan Cheng	3d9c134136	Rename startFunctionStub to startGVStub since it's also used for GV non-lazy ptr. llvm-svn: 58897	2008-11-08 08:02:53 +00:00
Dan Gohman	1418c52e1a	Flush the raw_ostream after emitting the assembly for a function. This is a temporary fix for the -print-emitted-asm option, where errs() is used as the stream, in the case where other code is using stderr without using errs()' buffer. Hopefully soon we'll fix errs() to be non-buffered instead. Patch by Preston Gurd. llvm-svn: 58859	2008-11-07 19:49:17 +00:00
Mon P Wang	41f90a3ee5	Widening cleanup llvm-svn: 58796	2008-11-06 05:31:54 +00:00
Evan Cheng	1378d6c7a9	Add more vector move low and zero-extend patterns. llvm-svn: 58752	2008-11-05 06:04:51 +00:00
Evan Cheng	fdd6d65e39	Indentation. llvm-svn: 58750	2008-11-05 06:03:38 +00:00
Dan Gohman	cd4b68bee9	Eliminate the ISel priority queue, which used the topological order for a priority function. Instead, just iterate over the AllNodes list, which is already in topological order. This eliminates a fair amount of bookkeeping, and speeds up the isel phase by about 15% on many testcases. The impact on most targets is that AddToISelQueue calls can be simply removed. In the x86 target, there are two additional notable changes. The rule-bending AND+SHIFT optimization in MatchAddress that creates new pre-isel nodes during isel is now a little more verbose, but more robust. Instead of either creating an invalid DAG or creating an invalid topological sort, as it has historically done, it can now just insert the new nodes into the node list at a position where they will be consistent with the topological ordering. Also, the address-matching code has logic that checked to see if a node was "already selected". However, when a node is selected, it has all its uses taken away via ReplaceAllUsesWith or equivalent, so it won't receive any further visits from MatchAddress. This code is now removed. llvm-svn: 58748	2008-11-05 04:14:16 +00:00
Evan Cheng	2702e22b83	Rename isGVLazyPtr to isGVNonLazyPtr relocation. This represents Mac OS X indirect gv reference. Please don't call it lazy. llvm-svn: 58746	2008-11-05 01:50:32 +00:00
Dan Gohman	0ba8aad1af	The ANDMask node folds to a constant, and isn't the node that needs to have its node id set. The new and and shift nodes are the nodes that need the IDs. This fixes PR2982. llvm-svn: 58655	2008-11-03 23:43:55 +00:00
Dan Gohman	155df8a79e	Refactor various TargetAsmInfo subclasses' TargetMachine members away adding a TargetMachine member to the base TargetAsmInfo class instead. llvm-svn: 58624	2008-11-03 18:22:42 +00:00
Mon P Wang	6363e90277	x86_64 rip-relative and magic mode address llvm-svn: 58528	2008-10-31 19:13:42 +00:00
Bill Wendling	f8f6ed82f1	Revert r58489. It isn't correct for all cases. llvm-svn: 58523	2008-10-31 18:30:19 +00:00
Evan Cheng	168bd3be1b	Change x86 register allocation ordering to match that of gcc. Otherwise some tools get confused by prologue generated by llvm. llvm-svn: 58517	2008-10-31 16:52:57 +00:00
Bill Wendling	0f1f4f8bb1	Don't skip over all "terminator" instructions when determining where to put the callee-saved restore code. It could skip over conditional jumps accidentally. Instead, just skip the "return" instructions. llvm-svn: 58489	2008-10-31 04:00:23 +00:00
Dan Gohman	481e1fd0a6	Use MOVSSmr instead of EXTRACTPSmr in the case of extracting vector element 0 for a store, as it's smaller and faster. llvm-svn: 58483	2008-10-31 00:57:24 +00:00
Mon P Wang	d7e34cd378	Add initial support for vector widening. Logic is set to widen for X86. One will only see an effect if legalizetype is not active. Will move support to LegalizeType soon. llvm-svn: 58426	2008-10-30 08:01:45 +00:00
Nate Begeman	e621f0539e	Fix PEXTRQ encoding llvm-svn: 58403	2008-10-29 23:07:17 +00:00
Chris Lattner	63e92876e0	Fix a nasty miscompilation of 176.gcc on linux/x86 where we synthesized a memset using 16-byte XMM stores, but where the stack realignment code didn't work. Until it does (PR2962) disable use of xmm regs in memcpy and memset formation for linux and other targets with insufficiently aligned stacks. This is part of PR2888 llvm-svn: 58317	2008-10-28 05:49:35 +00:00
David Greene	93f9f0f718	Have TableGen emit setSubgraphColor calls under control of a -gen-debug flag. Then in a debugger developers can set breakpoints at these calls to see what is about to be selected and what the resulting subgraph looks like. This really helps when debugging instruction selection. llvm-svn: 58278	2008-10-27 21:56:29 +00:00
Evan Cheng	3bcbccf563	For now, don't split live intervals around x87 stack register barriers. FpGET_ST0_80 must be right after a call instruction (and ADJCALLSTACKUP) so we need to find a way to prevent reload of x87 registers between them. llvm-svn: 58230	2008-10-27 07:14:50 +00:00
Dan Gohman	66e878f316	Move the code that adds the DeadMachineInstructionElimPass from target-independent code to target-specific code. This prevents it from running on targets that aren't using fast-isel. In addition to saving compile time, this addresses the problem that not all targets are prepared for it. In order to use this pass, all instructions must declare all their fixed uses and defs of physical registers. llvm-svn: 58144	2008-10-25 17:46:52 +00:00
Nicolas Geoffray	ce30b5caf0	Support for allocation of TLS variables in the JIT. Allocation of a global variable is moved to the execution engine. The JIT calls the TargetJITInfo to allocate thread local storage. Currently, only linux/x86 knows how to allocate thread local global variables. llvm-svn: 58142	2008-10-25 15:41:43 +00:00
Nicolas Geoffray	323dc44a69	Generate code for TLS instructions. llvm-svn: 58141	2008-10-25 15:22:06 +00:00
Torok Edwin	e0ecce06a0	move the note to the correct README llvm-svn: 58104	2008-10-24 19:23:07 +00:00
Duncan Sands	4b148a29ef	Fix translateX86CC: if SetCCOpcode is SETULE and LHS is a foldable load, then LHS and RHS are swapped and SetCCOpcode is changed to SETUGT. But the later code is expecting operands to be the wrong way round for SETUGT, but they are not in this case, resulting in an inverted compare. The solution is to move the load normalization before the correction for SETUGT. This bug was tickled by LegalizeTypes which happened to legalize the testcase slightly differently to LegalizeDAG. llvm-svn: 58092	2008-10-24 13:03:10 +00:00
Dan Gohman	ed90fd3ecf	Fix constant-offset emission for x86-64 absolute addresses. This fixes a bunch of test-suite JIT failures on x86-64 in -relocation-model=static mode. llvm-svn: 58066	2008-10-24 01:57:54 +00:00
Dale Johannesen	c146b1b281	Remove allocation of unused stack slot. llvm-svn: 57987	2008-10-22 17:26:06 +00:00
Duncan Sands	9d8f7ab614	Get this working with LegalizeTypes: (1) don't assume that i64 has been turned into a BUILD_PAIR node (when called from LegalizeTypes this hasn't happened yet) and (2) don't use a vector shuffle mask with an illegal element type. llvm-svn: 57972	2008-10-22 11:24:12 +00:00
Oscar Fuentes	a932cae97a	CMake: Turned some libraries into partially linked objects. Corrected names of LLVMCore and ARMCodeGen. llvm-svn: 57943	2008-10-22 02:51:53 +00:00
Dale Johannesen	3bd1c1e5cd	Adjust comments for pedantic satisfaction. llvm-svn: 57940	2008-10-22 00:02:32 +00:00
Dale Johannesen	9185d28b4b	Add comments to explain uint64->f64 algorithm, well, sort of. (Algorithm by Ian Ollmann.) llvm-svn: 57932	2008-10-21 23:07:49 +00:00
Dale Johannesen	eb7e2deb1d	Add an SSE2 algorithm for uint64->f64 conversion. The same one Apple gcc uses, faster. Also gets the extreme case in gcc.c-torture/execute/ieee/rbug.c correct which we weren't before; this is not sufficient to get the test to pass though, there is another bug. llvm-svn: 57926	2008-10-21 20:50:01 +00:00
Dan Gohman	34306e122d	Implement the optimized FCMP_OEQ/FCMP_UNE code for x86 fast-isel. llvm-svn: 57915	2008-10-21 18:24:51 +00:00
Dan Gohman	847a83dbad	Don't create TargetGlobalAddress nodes with offsets that don't fit in the 32-bit signed offset field of addresses. Even though this may be intended, some linkers refuse to relocate code where the relocated address computation overflows. Also, fix the sign-extension of constant offsets to use the actual pointer size, rather than the size of the GlobalAddress node, which may be different, for example on x86-64 where MVT::i32 is used when the address is being fit into the 32-bit displacement field. llvm-svn: 57885	2008-10-21 03:38:42 +00:00
Dan Gohman	281881b8e2	Optimized FCMP_OEQ and FCMP_UNE for x86. Where previously LLVM might emit code like this: ucomisd %xmm1, %xmm0; setne %al; setp %cl; orb %al, %cl; jne .LBB4_2 it now emits this: ucomisd %xmm1, %xmm0; jne .LBB4_2; jp .LBB4_2 It has fewer instructions and uses fewer registers, but it does have more branches. And in the case that this code is followed by a non-fallthrough edge, it may be followed by a jmp instruction, resulting in three branch instructions in sequence. Some effort is made to avoid this situation. To achieve this, X86ISelLowering.cpp now recognizes FCMP_OEQ and FCMP_UNE in lowered form, and replaces them with code that emits two branches, except in the case where it would require converting a fall-through edge to an explicit branch. Also, X86InstrInfo.cpp's branch analysis and transform code now knows how to handle blocks with multiple conditional branches. It uses loops instead of having fixed checks for up to two instructions. It can now analyze and transform code generated from FCMP_OEQ and FCMP_UNE. llvm-svn: 57873	2008-10-21 03:29:32 +00:00
Dan Gohman	d692070372	When the coalescer is doing rematerializing, have it remove the copy instruction from the instruction list before asking the target to create the new instruction. This gets the old instruction out of the way so that it doesn't interfere with the target's rematerialization code. In the case of x86, this helps it find more cases where EFLAGS is not live. Also, in the X86InstrInfo.cpp, teach isSafeToClobberEFLAGS to check to see if it reached the end of the block after scanning each instruction, instead of just before. This lets it notice when the end of the block is only two instructions away, without doing any additional scanning. These changes allow rematerialization to clobber EFLAGS in more cases, for example using xor instead of mov to set the return value to zero in the included testcase. llvm-svn: 57872	2008-10-21 03:24:31 +00:00
Duncan Sands	98fc39f607	Have X86 custom lowering for LegalizeTypes use LowerOperation if it doesn't know what else to do. These methods should probably be factorized some, but this is good enough for the moment. Have LowerATOMIC_BINARY_64 use EXTRACT_ELEMENT rather than assuming the operand is a BUILD_PAIR (if it is then getNode will automagically simplify the EXTRACT_ELEMENT). This way LowerATOMIC_BINARY_64 is usable from LegalizeTypes. llvm-svn: 57831	2008-10-20 15:56:33 +00:00
Dan Gohman	15597f07b2	Teach DAGCombine to fold constant offsets into GlobalAddress nodes, and add a TargetLowering hook for it to use to determine when this is legal (i.e. not in PIC mode, etc.) This allows instruction selection to emit folded constant offsets in more cases, such as the included testcase, eliminating the need for explicit arithmetic instructions. This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp that attempted to achieve the same effect, but wasn't as effective. Also, fix handling of offsets in GlobalAddressSDNodes in several places, including changing GlobalAddressSDNode's offset from int to int64_t. The Mips, Alpha, Sparc, and CellSPU targets appear to be unaware of GlobalAddress offsets currently, so set the hook to false on those targets. llvm-svn: 57748	2008-10-18 02:06:02 +00:00
Dan Gohman	69ac9cc00f	This is done. llvm-svn: 57733	2008-10-17 21:38:40 +00:00
Evan Cheng	08d0796cf5	Add implicit defs of XMM8 to XMM15 on 32-bit call instructions. While this is not technically true, it tells tblgen that these instructions "clobber" the entire XMM register file. llvm-svn: 57723	2008-10-17 21:02:22 +00:00
Chris Lattner	d96b8d12bc	add support for 128 bit inputs on both x86-64 and x86-32. llvm-svn: 57709	2008-10-17 18:15:05 +00:00
Chris Lattner	231a9466df	Fix a bug where the x86 backend would reject 64-bit r constraints when in 32-bit mode instead of assigning a register pair. This has nothing to do with PR2356, but I happened to notice it while working on it. llvm-svn: 57704	2008-10-17 17:59:52 +00:00
Evan Cheng	fa61b6a4ba	Fix lfence and mfence encoding. These look like MRM5r and MRM6r instructions except they do not have any operands. The RegModRM byte is encoded with register number 0. llvm-svn: 57692	2008-10-17 17:14:20 +00:00
Evan Cheng	733b305f24	getX86RegNum has long been moved to X86RegisterInfo. llvm-svn: 57691	2008-10-17 17:12:18 +00:00
Dan Gohman	268cfea6bc	Fun x86 encoding tricks: when adding an immediate value of 128, use a SUB instruction instead of an ADD, because -128 can be encoded in an 8-bit signed immediate field, while +128 can't be. This avoids the need for a 32-bit immediate field in this case. A similar optimization applies to 64-bit adds with 0x80000000, with the 32-bit signed immediate field. To support this, teach tablegen how to handle 64-bit constants. llvm-svn: 57663	2008-10-17 01:33:43 +00:00
Dan Gohman	5d83bd89a5	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
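
Note on 268cfea6bc (llvm-svn: 57663): the encoding trick described in that message can be illustrated with a small standalone sketch. This is not the LLVM implementation, which is expressed as TableGen patterns; the names `fitsInSImm8`, `Op`, `Encoded`, and `selectAddImm` are hypothetical and exist only for this illustration. The sketch covers only the 8-bit case and ignores any difference in flag results between ADD and SUB.

```cpp
#include <cstdint>

// Hypothetical helper: true if v fits in a sign-extended 8-bit immediate,
// whose range is [-128, 127].
inline bool fitsInSImm8(int64_t v) { return v >= -128 && v <= 127; }

enum class Op { Add, Sub };

// Hypothetical result record: which opcode to use, with which immediate,
// and whether the short (8-bit immediate) form applies.
struct Encoded {
  Op op;
  int64_t imm;
  bool shortForm;
};

// Choose an encoding for "x += imm" that prefers the 8-bit immediate form.
inline Encoded selectAddImm(int64_t imm) {
  if (fitsInSImm8(imm))
    return {Op::Add, imm, true};
  // +128 does not fit in a signed 8-bit field, but -128 does,
  // and x + 128 == x - (-128).
  if (imm == 128)
    return {Op::Sub, -128, true};
  // Otherwise fall back to the wider immediate form.
  return {Op::Add, imm, false};
}
```

Under these assumptions, `selectAddImm(128)` selects a SUB with immediate -128 in the short form, while `selectAddImm(129)` falls back to an ADD with a wider immediate.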

3838 Commits