llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 14:32:51 +01:00

Author	SHA1	Message	Date
Anton Korobeynikov	f42d75201a	Provide general hook for section name calculation llvm-svn: 53304	2008-07-09 13:22:46 +00:00
Anton Korobeynikov	c421fcddb4	Print entity size for mergeable sections llvm-svn: 53303	2008-07-09 13:22:17 +00:00
Anton Korobeynikov	849c8617be	Split PrintSectionFlags llvm-svn: 53302	2008-07-09 13:21:49 +00:00
Anton Korobeynikov	7f21791b33	Split UniqueSectionForGlobal() llvm-svn: 53301	2008-07-09 13:21:29 +00:00
Anton Korobeynikov	61aca29278	Split PreferredEHDataFormat hook llvm-svn: 53300	2008-07-09 13:21:08 +00:00
Anton Korobeynikov	32d3d15c2e	Split X86TargetAsmInfo into 4 subtarget-specific classes llvm-svn: 53299	2008-07-09 13:20:48 +00:00
Anton Korobeynikov	80f2417e3b	Whitespace cleanup llvm-svn: 53298	2008-07-09 13:20:27 +00:00
Anton Korobeynikov	059999d321	Move flag decoding stuff into special hook llvm-svn: 53297	2008-07-09 13:20:07 +00:00
Anton Korobeynikov	ca271dd426	Properly handle linkonce stuff llvm-svn: 53296	2008-07-09 13:19:38 +00:00
Anton Korobeynikov	782a69505d	Provide skeletone code for calculation of section, where global should be emitted into llvm-svn: 53295	2008-07-09 13:19:08 +00:00
Evan Cheng	f51c436a1b	Back out 53254. It broke ppc debug info codegen. llvm-svn: 53280	2008-07-09 06:36:53 +00:00
Dale Johannesen	d609d7166c	Make debug info come out in data-only files. This is a question of the debugging setup code not being called at the right time, and it's called from target-dependent code for some reason. I have only attempted to fix Darwin, but I'm pretty sure it's broken elsewhere; I'll leave that to people who can test it. llvm-svn: 53254	2008-07-08 21:56:22 +00:00
Evan Cheng	6af015292e	Unbreak C++ tests on x86 Darwin. llvm-svn: 53237	2008-07-08 16:40:43 +00:00
Evan Cheng	5be1103646	Avoid unnecessary string construction during asm printing. llvm-svn: 53215	2008-07-08 00:55:58 +00:00
Dan Gohman	cd25487258	Pool-allocation for MachineInstrs, MachineBasicBlocks, and MachineMemOperands. The pools are owned by MachineFunctions. This drastically reduces the number of calls to malloc/free made during the "Emit" phase of scheduling, as well as later phases in CodeGen. Combined with other changes, this speeds up the "instruction selection" phase of CodeGen by 10% in some cases. llvm-svn: 53212	2008-07-07 23:14:23 +00:00
Evan Cheng	688a8070f4	ATT asm printer just print register AsmName's instead of calling tolower on each charater of Name. This speeds it up by 10%. llvm-svn: 53208	2008-07-07 22:21:06 +00:00
Dan Gohman	c97817aac3	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Duncan Sands	3ea6f15708	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Duncan Sands	aac5c915ed	Linux also does not require exception handling moves in order to get correct debug info. Since I can't imagine how any target could possibly be any different, I've just stripped out the option: now all the world's like Darwin! llvm-svn: 53134	2008-07-04 09:55:48 +00:00
Evan Cheng	3e6a03a4b6	Back out 53091 for now. llvm-svn: 53109	2008-07-03 18:11:29 +00:00
Evan Cheng	1f6148a84c	- Remove calls to copyKillDeadInfo which is an N^2 function. Instead, propagate kill / dead markers as new instructions are constructed in foldMemoryOperand, convertToThressAddress, etc. - Also remove LiveVariables::instructionChanged, etc. Replace all calls with cheaper calls which update VarInfo kill list. llvm-svn: 53097	2008-07-03 09:09:37 +00:00
Anton Korobeynikov	f3fc979d9c	llvm-gcc sometimes marks external declarations hidden, because intializers are processed separately. Honour such situation and emit PIC relocations properly in such case. llvm-svn: 53091	2008-07-03 07:43:14 +00:00
Evan Cheng	6d84ad83ca	commuteInstruction should preserve dead markers. llvm-svn: 53060	2008-07-03 00:04:51 +00:00
Owen Anderson	604f9f722d	Make LiveVariables even more optional, by making it optional in the call to TargetInstrInfo::convertToThreeAddressInstruction Also, if LV isn't around, then TwoAddr doesn't need to be updating flags, since they won't have been set in the first place. llvm-svn: 53058	2008-07-02 23:41:07 +00:00
Duncan Sands	21e2a711e3	Add a new getMergeValues method that does not need to be passed the list of value types, and use this where appropriate. Inappropriate places are where the value type list is already known and may be long, in which case the existing method is more efficient. llvm-svn: 53035	2008-07-02 17:40:58 +00:00
Bill Wendling	27c38cee90	Darwin doesn't need exception handling information for the "move" info when debug information is being output, because it's leet! llvm-svn: 52994	2008-07-01 23:34:48 +00:00
Dan Gohman	83c1b4cede	Prune a few dependencies on MachineFunction.h. llvm-svn: 52976	2008-07-01 18:15:35 +00:00
Evan Cheng	67ce381ffe	Do not use computationally expensive scheduling heuristics with -fast. llvm-svn: 52971	2008-07-01 18:05:03 +00:00
Duncan Sands	d8d11501c9	Highlight that getMergeValues optimization is being suppressed here. llvm-svn: 52952	2008-07-01 08:00:49 +00:00
Dan Gohman	c8097f8c8c	Split ISD::LABEL into ISD::DBG_LABEL and ISD::EH_LABEL, eliminating the need for a flavor operand, and add a new SDNode subclass, LabelSDNode, for use with them to eliminate the need for a label id operand. Change instruction selection to let these label nodes through unmodified instead of creating copies of them. Teach the MachineInstr emitter how to emit a MachineInstr directly from an ISD label node. This avoids the need for allocating SDNodes for the label id and flavor value, as well as SDNodes for each of the post-isel label, label id, and label flavor. llvm-svn: 52943	2008-07-01 00:05:16 +00:00
Dan Gohman	c8c04b1ff4	std::ostream and std::string microoptimizations for asm printing. llvm-svn: 52929	2008-06-30 22:03:41 +00:00
Dan Gohman	e58f07e5d6	Update comments to new-style syntax. llvm-svn: 52925	2008-06-30 21:00:56 +00:00
Dan Gohman	6cc648891b	Rename ISD::LOCATION to ISD::DBG_STOPPOINT to better reflect its purpose, and give it a custom SDNode subclass so that it doesn't need to have line number, column number, filename string, and directory string, all existing as individual SDNodes to be the operands. This was the only user of ISD::STRING, StringSDNode, etc., so remove those and some associated code. This makes stop-points considerably easier to read in -view-legalize-dags output, and reduces overhead (creating new nodes and copying std::strings into them) on code containing debugging information. llvm-svn: 52924	2008-06-30 20:59:49 +00:00
Evan Cheng	3f664b6fd3	Split scheduling from instruction selection. llvm-svn: 52923	2008-06-30 20:45:06 +00:00
Duncan Sands	c882a4eba9	Revert the SelectionDAG optimization that makes it impossible to create a MERGE_VALUES node with only one result: sometimes it is useful to be able to create a node with only one result out of one of the results of a node with more than one result, for example because the new node will eventually be used to replace a one-result node using ReplaceAllUsesWith, cf X86TargetLowering::ExpandFP_TO_SINT. On the other hand, most users of MERGE_VALUES don't need this and for them the optimization was valuable. So add a new utility method getMergeValues for creating MERGE_VALUES nodes which by default performs the optimization. Change almost everywhere to use getMergeValues (and tidy some stuff up at the same time). llvm-svn: 52893	2008-06-30 10:19:09 +00:00
Anton Korobeynikov	0b708e559e	Unbreak llvm-svn: 52866	2008-06-28 11:10:06 +00:00
Anton Korobeynikov	8562255056	Temporary rever invalid commit llvm-svn: 52865	2008-06-28 11:09:48 +00:00
Anton Korobeynikov	ea88d91267	Move printing of module-level GVs into dedicated helper llvm-svn: 52864	2008-06-28 11:09:32 +00:00
Anton Korobeynikov	77c3528f69	Use common naming convention llvm-svn: 52863	2008-06-28 11:09:17 +00:00
Anton Korobeynikov	2331efee7e	Factor out stuff into helper function llvm-svn: 52862	2008-06-28 11:09:01 +00:00
Anton Korobeynikov	908c1fab55	Cleanup llvm-svn: 52861	2008-06-28 11:08:44 +00:00
Anton Korobeynikov	adec555f96	Remove X86SharedAsmPrinter llvm-svn: 52860	2008-06-28 11:08:27 +00:00
Anton Korobeynikov	dcc6a8314a	whitespace cleanup llvm-svn: 52859	2008-06-28 11:08:09 +00:00
Anton Korobeynikov	03a62267fe	Make intel asmprinter child of generic asmprinter, not x86 shared asm printer. This leads to some code duplication, which will be resolved later. llvm-svn: 52858	2008-06-28 11:07:54 +00:00
Anton Korobeynikov	b75aeb6b1a	Cleanup llvm-svn: 52857	2008-06-28 11:07:35 +00:00
Anton Korobeynikov	f4017f7d50	Whitespace cleanup llvm-svn: 52856	2008-06-28 11:07:18 +00:00
Anton Korobeynikov	e48fe3dde8	Use StringSet instead of std::set<std::string> llvm-svn: 52836	2008-06-27 21:22:49 +00:00
Dale Johannesen	f170e29cf5	Fixes the last x86-64 test failure in compat.exp: <16 x float> is 64-byte aligned (for some reason), which gets us into the stack realignment code. The computation changing FP-relative offsets to SP-relative was broken, assiging a spill temp to a location also used for parameter passing. This fixes it by rounding up the stack frame to a multiple of the largest alignment (I concluded it wasn't fixable without doing this, but I'm not very sure.) llvm-svn: 52750	2008-06-26 01:51:13 +00:00
Evan Cheng	71fbfe73c1	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Dan Gohman	404964dbc0	Remove the OrigVT member from AtomicSDNode, as it is redundant with the base SDNode's VTList. llvm-svn: 52722	2008-06-25 16:07:49 +00:00
Mon P Wang	7d89d61387	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	bab5925a0b	Enable two-address remat by default. llvm-svn: 52701	2008-06-25 01:16:38 +00:00
Dale Johannesen	fdf8fe6c03	Add v2f32 (MMX) type to X86. Support is primitive: load,store,call,return,bitcast. This is enough to make call and return work. llvm-svn: 52691	2008-06-24 22:01:44 +00:00
Evan Cheng	a62f5f0f82	If it's determined safe, remat MOV32r0 (i.e. xor r, r) and others as it is instead of using the longer MOV32ri instruction. llvm-svn: 52670	2008-06-24 07:10:51 +00:00
Dan Gohman	9941a2dab3	Add a note about a potential PIC optimization. llvm-svn: 52663	2008-06-24 00:53:07 +00:00
Dan Gohman	ebc59c90b7	Fixes for being compiled PIC on Linux. This isn't the most general solution possible, but it's a fairly simple one. Based on a patch from the OpenGTL project! llvm-svn: 52662	2008-06-24 00:50:01 +00:00
Dan Gohman	c1aa753f00	Remove unnecessary #includes. llvm-svn: 52613	2008-06-22 19:21:26 +00:00
Eli Friedman	570aa6f801	Fix a bug with <8 x i16> shuffle lowering on X86 where parts of the shuffle could be skipped. The check is invalid because the loop index i doesn't correspond to the element actually inserted. The correct check is already done a few lines earlier, for whether the element is already in the right spot, so this shouldn't have any effect on the codegen for code that was already correct. llvm-svn: 52486	2008-06-19 06:09:51 +00:00
Evan Cheng	0570953e28	XOR32rr, etc. are not AsCheapAsMove, but MOV32ri, etc. are. llvm-svn: 52454	2008-06-18 08:13:07 +00:00
Evan Cheng	deb754898b	Unbreak DECLARE isel in pic mode. llvm-svn: 52439	2008-06-18 02:48:27 +00:00
Evan Cheng	89e2e3292d	Rather than avoiding to wrap ISD::DECLARE GV operand in X86ISD::Wrapper, simply handle it at dagisel time with x86 specific isel code. llvm-svn: 52377	2008-06-17 02:01:22 +00:00
Evan Cheng	4e7b7b21a2	Horizontal-add instructions are not commutative. llvm-svn: 52363	2008-06-16 21:16:24 +00:00
Evan Cheng	acd614c262	mpsadbw is commutable. llvm-svn: 52352	2008-06-16 20:25:59 +00:00
Evan Cheng	2dfe8c2435	Add option to commuteInstruction() which forces it to create a new (commuted) instruction. llvm-svn: 52308	2008-06-16 07:33:11 +00:00
Andrew Lenharth	327c3e7559	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Duncan Sands	40c8db881a	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Anton Korobeynikov	74422b3cd0	Properly lower DYNAMIC_STACKALLOC - bracket all black magic with CALLSEQ_BEGIN & CALLSEQ_END. llvm-svn: 52225	2008-06-11 20:16:42 +00:00
Rafael Espindola	feaadb1e05	add support for PIC on linux x86-64 llvm-svn: 52139	2008-06-09 09:52:31 +00:00
Duncan Sands	fe2a970a5c	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Duncan Sands	d634afe3aa	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Evan Cheng	badbe3e3fa	Don't break strict aliasing. llvm-svn: 52026	2008-06-05 22:59:21 +00:00
Dale Johannesen	c0cd6cd4d4	Add StringConstantPrefix to control what the assembler names of string constants look like. llvm-svn: 51909	2008-06-03 18:09:06 +00:00
Rafael Espindola	feec40a71f	Don't use the GOT for symbols that are not externally visible. llvm-svn: 51865	2008-06-02 07:52:43 +00:00
Dan Gohman	00823cb0d4	Teach the DAGISelEmitter to not compute the variable_ops operand index for the input pattern in terms of the output pattern. Instead keep track of how many fixed operands the input pattern actually has, and have the input matching code pass the output-emitting function that index value. This simplifies the code, disentangles variables_ops from the support for predication operations, and makes variable_ops more robust. llvm-svn: 51808	2008-05-31 02:11:25 +00:00
Bill Wendling	244b4db58d	Add the "AsCheapAsAMove" flag to some 64-bit xor instructions. llvm-svn: 51761	2008-05-30 06:47:04 +00:00
Dan Gohman	aa8fcd5657	Add patterns for CALL32m and CALL64m. They aren't matched in most cases due to an isel deficiency already noted in lib/Target/X86/README.txt, but they can be matched in this fold-call.ll testcase, for example. This is interesting mainly because it exposes a tricky tblgen bug; tblgen was incorrectly computing the starting index for variable_ops in the case of a complex pattern. llvm-svn: 51706	2008-05-29 21:50:34 +00:00
Dan Gohman	4e87d82476	Fix a tblgen problem handling variable_ops in tblgen instruction definitions. This adds a new construct, "discard", for indicating that a named node in the input matching pattern is to be discarded, instead of corresponding to a node in the output pattern. This allows tblgen to know where the arguments for the varaible_ops are supposed to begin. This fixes "rdar://5791600", whatever that is ;-). llvm-svn: 51699	2008-05-29 19:57:41 +00:00
Dan Gohman	e256337a1a	Expand small memmovs using inline code. Set the X86 threshold for expanding memmove to a more plausible value, now that it's actually being used. llvm-svn: 51696	2008-05-29 19:42:22 +00:00
Evan Cheng	04c0915a2f	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. llvm-svn: 51667	2008-05-29 08:22:04 +00:00
Bill Wendling	81199f0cc8	XOR?RI instructions aren't as cheap as moves. llvm-svn: 51664	2008-05-29 03:46:36 +00:00
Bill Wendling	edb38e9410	Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the like. llvm-svn: 51662	2008-05-29 01:02:09 +00:00
Dan Gohman	a5549a2f9c	Fix the encoding for two more "rm" instructions that were using MRMSrcReg. llvm-svn: 51630	2008-05-28 01:50:19 +00:00
Mon P Wang	8e37b2d13e	Fixed X86 encoding error CVTPS2PD and CVTPD2PS when the source operand is a memory location llvm-svn: 51626	2008-05-28 00:42:27 +00:00
Nate Begeman	23dd264da6	Don't attempt to create VZEXT_LOAD out of an extload. This an issue where the code generator would do something like this: f64 = load f32 <anyext>, f32mem v2f64 = insertelt undef, %0, 0 v2f64 = insertelt %1, 0.0, 1 into v2f64 = vzext_load f32mem which on x86 is movsd, when you really wanted a cvtss2sd/movsd pair. llvm-svn: 51624	2008-05-28 00:24:25 +00:00
Evan Cheng	e5e0b4660d	Eliminate x86.sse2.punpckh.qdq and x86.sse2.punpckl.qdq. llvm-svn: 51533	2008-05-24 02:56:30 +00:00
Evan Cheng	564238c841	Eliminate x86.sse2.movs.d, x86.sse2.shuf.pd, x86.sse2.unpckh.pd, and x86.sse2.unpckl.pd intrinsics. These will be lowered into shuffles. llvm-svn: 51531	2008-05-24 02:14:05 +00:00
Evan Cheng	d312ced1cf	This is done. llvm-svn: 51526	2008-05-24 00:10:13 +00:00
Evan Cheng	98a292a302	Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into load and shuffle instructions. llvm-svn: 51522	2008-05-24 00:07:29 +00:00
Evan Cheng	4f660778f0	Use movlps / movhps to modify low / high half of 16-byet memory location. llvm-svn: 51501	2008-05-23 21:23:16 +00:00
Dan Gohman	e8422fc112	Elaborate on the entry on integer vector multiplication by constants. llvm-svn: 51491	2008-05-23 18:05:39 +00:00
Evan Cheng	ec8bd19399	Fix a duplicated pattern. llvm-svn: 51490	2008-05-23 18:00:18 +00:00
Dan Gohman	6cc0b4f262	Use PMULDQ for v2i64 multiplies when SSE4.1 is available. And add load-folding table entries for PMULDQ and PMULLD. llvm-svn: 51489	2008-05-23 17:49:40 +00:00
Evan Cheng	e7ec4690e1	New entry. llvm-svn: 51487	2008-05-23 17:28:11 +00:00
Chris Lattner	4c1ffef5af	we compile multiply-by-constant into horrible code. Doesn't sse4 have some instruction for doing this? llvm-svn: 51473	2008-05-23 04:29:53 +00:00
Evan Cheng	097e95b1f7	Bug: rcpps can only folds a load if the address is 16-byte aligned. Fixed many 'ps' load folding patterns in X86InstrSSE.td which are missing the proper alignment checks. Also fixed some 80 col. violations. llvm-svn: 51462	2008-05-23 00:37:07 +00:00
Dale Johannesen	7cc19db16f	Put const weak stuff in appropriate section on Darwin. g++.dg/abi/key2.C llvm-svn: 51458	2008-05-23 00:16:59 +00:00
Evan Cheng	2dc53b5d58	X86CodeEmitter should not set PIC style to None at initialization time. This will break codegen if relocation model is changed to PIC_ later. llvm-svn: 51455	2008-05-22 23:55:24 +00:00
Evan Cheng	d1373cd497	Add missing patterns. llvm-svn: 51435	2008-05-22 18:56:56 +00:00
Evan Cheng	d694e78e36	movsd and movq do not require 16-byte alignment. This fixes vec_set-5.ll on Linux. llvm-svn: 51327	2008-05-20 18:24:47 +00:00
Evan Cheng	e95fc3e83d	runOnMachineFunction should set IsPIC because relocation model may have been changed. llvm-svn: 51291	2008-05-20 01:56:59 +00:00
Dale Johannesen	e6977495aa	Handle quoted names when constructing $stub's, $non_lazy_ptr's and $lazy_ptr's. llvm-svn: 51277	2008-05-19 21:38:18 +00:00
Dale Johannesen	ebc511c6aa	Treat common as distinct from weak global on Darwin x86. llvm-svn: 51172	2008-05-16 00:52:06 +00:00
Evan Cheng	73dadf21ce	Fix typos and comments. llvm-svn: 51165	2008-05-15 22:13:02 +00:00
Evan Cheng	778a5e27b0	Make use of vector load and store operations to implement memcpy, memmove, and memset. Currently only X86 target is taking advantage of these. llvm-svn: 51140	2008-05-15 08:39:06 +00:00
Dale Johannesen	768b6f281e	Add CommonLinkage; currently tentative definitions are represented as "weak", but there are subtle differences in some cases on Darwin, so we need both. The intent is that "common" will behave identically to "weak" unless somebody changes their target to do something else. No functional change as yet. llvm-svn: 51118	2008-05-14 20:12:51 +00:00
Evan Cheng	95987c2586	Doh. Alignment is in bytes, not in bits. llvm-svn: 51092	2008-05-14 02:49:43 +00:00
Dan Gohman	f9d5689496	Change target-specific classes to use more precise static types. This eliminates the need for several awkward casts, including the last dynamic_cast under lib/Target. llvm-svn: 51091	2008-05-14 01:58:56 +00:00
Chris Lattner	a11adf725d	add a note llvm-svn: 51062	2008-05-13 19:56:20 +00:00
Evan Cheng	cb56638548	- Fix the pasto in the fix for a previous pasto. - Incorporate Chris' comment suggestion. llvm-svn: 51061	2008-05-13 18:59:59 +00:00
Chris Lattner	c9eb6a7d64	add a note llvm-svn: 51060	2008-05-13 18:48:54 +00:00
Nate Begeman	c290daf581	Fix one more encoding bug. llvm-svn: 51057	2008-05-13 17:52:09 +00:00
Evan Cheng	cf6928983b	- Don't treat anyext 16-bit load as a 32-bit load if it's volatile. - Correct a pasto. llvm-svn: 51054	2008-05-13 16:45:56 +00:00
Evan Cheng	9e15622879	Instead of a vector load, shuffle and then extract an element. Load the element from address with an offset. pshufd $1, (%rdi), %xmm0 movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Nate Begeman	b9a3d141aa	Fix and encoding error in the psrad xmm, imm8 instruction. llvm-svn: 51020	2008-05-13 01:47:52 +00:00
Evan Cheng	e4ee4c2870	On x86, it's safe to treat i32 load anyext as a normal i32 load. Ditto for i8 anyext load to i16. llvm-svn: 51019	2008-05-13 00:54:02 +00:00
Dan Gohman	bab18cae46	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	5d939498c3	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Evan Cheng	fcbdc8bd6e	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Bill Wendling	646f3458c4	Constify the machine instruction passed into the "is{Trivially,Really}ReMaterializable" methods. llvm-svn: 51001	2008-05-12 20:54:26 +00:00
Nate Begeman	2ae55cecc6	Initial X86 codegen support for VSETCC. llvm-svn: 51000	2008-05-12 20:34:32 +00:00
Dan Gohman	efa0925915	Fix a copy+paste bug; pseudo-instructions shouldn't have encoding information. llvm-svn: 50997	2008-05-12 20:22:45 +00:00
Evan Cheng	c7e9acfed7	Refactor isConsecutiveLoad from X86 to TargetLowering so DAG combiner can make use of it. llvm-svn: 50991	2008-05-12 19:56:52 +00:00
Dan Gohman	8212eaa43a	Fix a compile error on compilers that still want a return value in a non-void function that calls abort. llvm-svn: 50969	2008-05-12 16:17:19 +00:00
Anton Korobeynikov	ad83aeb489	Add note llvm-svn: 50959	2008-05-11 14:33:15 +00:00
Evan Cheng	c19c639ad7	When transforming a vector_shuffle to a load, the base address must not be an undef. llvm-svn: 50940	2008-05-10 06:46:49 +00:00
Dan Gohman	4b23d9e60a	For now, abort when an ISD::VAARG is encountered on x86-64, rather than silently generate invalid code. llvm-gcc does not currently use VAArgInst; it lowers va_arg in the front-end. llvm-svn: 50930	2008-05-10 01:26:14 +00:00
Evan Cheng	6a3fa28b38	Some clean up. llvm-svn: 50929	2008-05-10 00:59:18 +00:00
Evan Cheng	79230955a8	If movl top bits are undef, let it be selected to movlps, etc. llvm-svn: 50928	2008-05-10 00:58:41 +00:00
Evan Cheng	2adea48f7e	Add a pattern to do move the low element of a v4f32 and zero extend the rest. llvm-svn: 50922	2008-05-09 23:37:55 +00:00
Evan Cheng	3493e43afd	Handle a few more cases of folding load i64 into xmm and zero top bits. Note, some of the code will be moved into target independent part of DAG combiner in a subsequent patch. llvm-svn: 50918	2008-05-09 21:53:03 +00:00
Evan Cheng	f824b47188	Use movq to move low half of XMM register and zero-extend the rest. llvm-svn: 50874	2008-05-08 22:35:02 +00:00
Evan Cheng	f97e716511	Handle vector move / load which zero the destination register top bits (i.e. movd, movq, movss (addr), movsd (addr)) with X86 specific dag combine. llvm-svn: 50838	2008-05-08 00:57:18 +00:00
Duncan Sands	6f4e916c6a	Output correct exception handling and frame info on x86-64 linux. This causes no regressions on 32 bit linux and 32 bit ppc. More tests pass on 64 bit ppc with no regressions. I didn't turn on eh on 64 bit linux because the intrinsics needed to compile the eh runtime aren't done yet. But if you turn it on and link with the mainline runtime then eh seems to work fine on x86-64 linux with this patch. Thanks to Dale for testing. The main point of the patch is that if you output that some object is encoded using 4 bytes you had better not output 8 bytes for it: the patch makes everything consistent. llvm-svn: 50825	2008-05-07 19:11:09 +00:00
Chris Lattner	9f4f2444ea	add a micro optzn. llvm-svn: 50681	2008-05-05 23:19:45 +00:00
Mon P Wang	34b3f18a70	Improved generated code for atomic operators llvm-svn: 50677	2008-05-05 22:56:23 +00:00
Evan Cheng	44d49e72a1	Code clean up. No functionality change. llvm-svn: 50675	2008-05-05 22:12:23 +00:00
Mon P Wang	84a269e023	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	4a674dc536	Fix IsLinux being uninitialized on non-Linux targets. llvm-svn: 50660	2008-05-05 18:43:07 +00:00
Anton Korobeynikov	12c48230f9	Fix 80col violation llvm-svn: 50654	2008-05-05 17:08:59 +00:00
Dan Gohman	8ee7bf053e	Use a dedicated IsLinux flag instead of an ELFLinux TargetType. llvm-svn: 50649	2008-05-05 16:11:31 +00:00
Dan Gohman	c860d9c77c	Add AsmPrinter support for emitting a directive to declare that the code being generated does not require an executable stack. Also, add target-specific code to make use of this on Linux on x86. llvm-svn: 50634	2008-05-05 00:28:39 +00:00
Anton Korobeynikov	04c974b1b2	Add General Dynamic TLS model for X86-64. Some parts looks really ugly (look for tlsaddr pattern), but should work. Work is in progress, more models will follow llvm-svn: 50630	2008-05-04 21:36:32 +00:00
Evan Cheng	a7747df955	Select vector shift with non-immediate i32 shift amount operand by first moving the operand into the right register. llvm-svn: 50619	2008-05-04 09:15:50 +00:00
Evan Cheng	c1c2adbfc6	Add separate intrinsics for MMX / SSE shifts with i32 integer operands. This allow us to simplify the horribly complicated matching code. llvm-svn: 50601	2008-05-03 00:52:09 +00:00
Evan Cheng	90b9027f68	Undo r50574. We are already ensuring the folded load address is 16-byte aligned. llvm-svn: 50578	2008-05-02 17:01:01 +00:00
Evan Cheng	583a346ec6	80 column violation. llvm-svn: 50575	2008-05-02 07:53:32 +00:00
Evan Cheng	862e3a147c	Not safe folding a load + FsXORPSrr into FsXORPSrm. It's loading a FR64 value but the load folding variant expects a 16-byte aligned address. llvm-svn: 50574	2008-05-02 07:50:58 +00:00
Arnold Schwaighofer	f58a35e2ec	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Dan Gohman	0285c1e9bb	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Anton Korobeynikov	54791c2a43	Fix FP return for Win64 ABI llvm-svn: 50342	2008-04-28 07:40:07 +00:00
Anton Korobeynikov	1c5d228377	Properly lower vararg's FORMAL_ARGUMENTS node on win64 llvm-svn: 50325	2008-04-27 23:15:03 +00:00
Anton Korobeynikov	0df1f3bc6c	Handle fp80 for win64 llvm-svn: 50324	2008-04-27 22:54:09 +00:00
Chris Lattner	b5bd654163	A few inline asm cleanups: - Make targetlowering.h fit in 80 cols. - Make LowerAsmOperandForConstraint const. - Make lowerXConstraint -> LowerXConstraint - Make LowerXConstraint return a const char* instead of taking a string byref. llvm-svn: 50312	2008-04-26 23:02:14 +00:00
Evan Cheng	318e7e042c	Extract the lower 64-bit if a MMX value is passed in a XMM register. llvm-svn: 50292	2008-04-25 20:13:28 +00:00
Evan Cheng	eaaec15b4f	Fix illegal MMX_MOVDQ2Qrr pattern. vector_extract result must be a scalar value. llvm-svn: 50291	2008-04-25 20:12:46 +00:00
Evan Cheng	11f101a800	Special handling for MMX values being passed in either GPR64 or lower 64-bits of XMM registers. llvm-svn: 50289	2008-04-25 19:11:04 +00:00
Evan Cheng	0fe99f024d	Fix MMX_MOVQ2DQrr pattern. It's illegal to do a bitconvert from a smaller type to a larger one. llvm-svn: 50278	2008-04-25 18:19:54 +00:00
Evan Cheng	37ca5de3b7	Not checking for intrinsics which do not have a chain operand. llvm-svn: 50260	2008-04-25 08:55:28 +00:00
Evan Cheng	e177dc6696	- Switch from std::set to SmallPtrSet. - Add comments. llvm-svn: 50259	2008-04-25 08:22:20 +00:00
Evan Cheng	39ae78cadb	MMX argument passing fixes: On Darwin / Linux x86-32, v8i8, v4i16, v2i32 values are passed in MM[0-2]. On Darwin / Linux x86-32, v1i64 values are passed in memory. On Darwin x86-64, v8i8, v4i16, v2i32 values are passed in XMM[0-7]. On Darwin x86-64, v1i64 values are passed in 64-bit GPRs. llvm-svn: 50257	2008-04-25 07:56:45 +00:00
Chris Lattner	8c9f6c929a	Loosen up an assertion to allow intrinsics. I really have no idea what this code (findNonImmUse) does, so I'm only guessing that this is the right thing. It would be really really nice if this had comments and perhaps switched to SmallPtrSet (hint hint) :) This fixes rdar://5886601, a crash on gcc.target/i386/sse4_1-pblendw.c llvm-svn: 50252	2008-04-25 05:13:01 +00:00
Evan Cheng	484060ba4a	Fix bug in x86 memcpy / memset lowering. If there are trailing bytes not handled by rep instructions, a new memcpy / memset is introduced for them. However, since source / destination addresses are already adjusted, their offsets should be zero. llvm-svn: 50239	2008-04-25 00:26:43 +00:00
Anton Korobeynikov	4b572e0f73	Fix typo llvm-svn: 50169	2008-04-23 18:24:25 +00:00
Anton Korobeynikov	372e69e652	Only allow increase of max alignment value llvm-svn: 50168	2008-04-23 18:23:50 +00:00
Anton Korobeynikov	47a8e6d7a9	Be over-conservative: scan for all used virtual registers and calculate maximal stack alignment in assumption, that there will be spill of vector register. llvm-svn: 50167	2008-04-23 18:23:30 +00:00
Anton Korobeynikov	e7754f758b	Add X86 Maximal Stack Alignment Calculator Pass before RA llvm-svn: 50166	2008-04-23 18:23:05 +00:00
Anton Korobeynikov	158f614c67	Do proper book-keeping of offsets and prologue/epilogue code for stack realignment llvm-svn: 50163	2008-04-23 18:21:27 +00:00
Anton Korobeynikov	1f07315f47	If stack realignment is used - incoming args will use EBP as base register and locals - ESP llvm-svn: 50162	2008-04-23 18:21:02 +00:00
Anton Korobeynikov	5079553b9d	Eastimate required stack alignment early, so we can decide, whether we will need frame pointer or not llvm-svn: 50161	2008-04-23 18:20:17 +00:00
Anton Korobeynikov	492641d67f	Cleanup llvm-svn: 50159	2008-04-23 18:19:23 +00:00
Anton Korobeynikov	87325bfdf5	Simplify llvm-svn: 50158	2008-04-23 18:18:36 +00:00
Anton Korobeynikov	73935826d4	Make stack alignment options global for all targets llvm-svn: 50157	2008-04-23 18:18:10 +00:00
Anton Korobeynikov	6a59c959ca	Provide option for enabling-disabling stack realignment llvm-svn: 50156	2008-04-23 18:17:11 +00:00
Anton Korobeynikov	fc59ae78e0	Disable stack realignment for functions with dynamic-sized alloca's llvm-svn: 50155	2008-04-23 18:16:43 +00:00
Anton Korobeynikov	11851230a9	Provide ABI-correct stack alignment llvm-svn: 50154	2008-04-23 18:16:16 +00:00
Anton Korobeynikov	7e6850d1a1	Provide convenient helpers for some operations llvm-svn: 50153	2008-04-23 18:15:48 +00:00
Anton Korobeynikov	71adb49389	Whitespace cleanup llvm-svn: 50152	2008-04-23 18:15:11 +00:00
Dan Gohman	93b5be1824	Implement an x86-64 ABI detail of passing structs by hidden first argument. The x86-64 ABI requires the incoming value of %rdi to be copied to %rax on exit from a function that is returning a large C struct. Also, add a README-X86-64 entry detailing the missed optimization opportunity and proposing an alternative approach. llvm-svn: 50075	2008-04-21 23:59:07 +00:00
Dan Gohman	105b523786	Fix the encoding of the MMX movd that moves from MMX to 64-bit GPR. llvm-svn: 50053	2008-04-21 19:52:29 +00:00
Chris Lattner	3117d33f74	Add an ugly note. llvm-svn: 50029	2008-04-21 04:46:30 +00:00
Nicolas Geoffray	036fb2bebf	Don't forget to update the current operand when getting the size of an instruction. llvm-svn: 50007	2008-04-20 23:36:47 +00:00
Chris Lattner	2c5b96fbee	A better fix for my previous patch, MOVZQI2PQIrr just requires SSE2. llvm-svn: 49986	2008-04-20 05:52:46 +00:00
Chris Lattner	f390d62b7f	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Evan Cheng	1c54ebbe2f	Also LXCHG64 -> XCHG64rm. llvm-svn: 49948	2008-04-19 02:05:42 +00:00
Evan Cheng	b1d240f973	xchg which references a memory operand does not need to lock prefix. Atomicity is guaranteed. llvm-svn: 49946	2008-04-19 01:20:30 +00:00
Dan Gohman	98ca33cb59	Fix the handling of va_copy on x86-64. As of llvm-gcc r49920 llvm-gcc is now lowering va_copy on x86-64, so this completes the fix for PR2230. llvm-svn: 49922	2008-04-18 20:55:41 +00:00
Evan Cheng	a626e13995	- Fix atomic operation JIT encoding. - Remove unused instructions. llvm-svn: 49921	2008-04-18 20:55:36 +00:00
Evan Cheng	2b03674feb	Also support Intel asm syntax. llvm-svn: 49878	2008-04-17 23:35:10 +00:00
Evan Cheng	0b36ca5023	Fix assembly code for atomic operations. llvm-svn: 49869	2008-04-17 21:26:35 +00:00
Evan Cheng	e2e899b5c2	Don't forget about sub-register indices when rematting instructions. llvm-svn: 49830	2008-04-16 23:44:44 +00:00
Dale Johannesen	d19ab27ee1	Unbreak build on x86-64. llvm-svn: 49822	2008-04-16 22:24:33 +00:00
Nicolas Geoffray	1f3211af01	Correlate stubs with functions in JIT: when emitting a stub, the JIT tells the memory manager which function the stub will resolve. llvm-svn: 49814	2008-04-16 20:46:05 +00:00
Nicolas Geoffray	82baa2d2c6	Infrastructure for getting the machine code size of a function and an instruction. X86, PowerPC and ARM are implemented llvm-svn: 49809	2008-04-16 20:10:13 +00:00
Evan Cheng	341bed7210	Initialize X863DNowLevel. llvm-svn: 49808	2008-04-16 19:03:02 +00:00
Roman Levenstein	728d59166f	Ongoing work on improving the instruction selection infrastructure: Rename SDOperandImpl back to SDOperand. Introduce the SDUse class that represents a use of the SDNode referred by an SDOperand. Now it is more similar to Use/Value classes. Patch is approved by Dan Gohman. llvm-svn: 49795	2008-04-16 16:15:27 +00:00
Dan Gohman	be8f2b452b	Add support for the form of the SSE41 extractps instruction that puts its result in a 32-bit GPR. llvm-svn: 49762	2008-04-16 02:32:24 +00:00
Dan Gohman	cf79877623	Recreate the size SDNode instead of reusing the old one in the x86 memcpy lowering code; this ensures that the size node has the desired result type. This fixes a regression from r49572 with @llvm.memcpy.i64 on x86-32. llvm-svn: 49761	2008-04-16 01:32:32 +00:00
Dan Gohman	6f9b55bc7c	Remove X86_64SRet; it isn't used anymore. llvm-svn: 49759	2008-04-16 00:24:30 +00:00
Dan Gohman	7d27552962	Add movd instructions to move from MMX registers to 64-bit GPR registers on x86-64. llvm-svn: 49757	2008-04-15 23:55:07 +00:00
Dan Gohman	8d46278998	Fix const-correctness issues with the SrcValue handling in the memory intrinsic expansion code. llvm-svn: 49666	2008-04-14 17:55:48 +00:00
Dale Johannesen	edcba1161f	Reverse sense of unwind-tables option. This means stack tracebacks on Darwin x86-64 won't work by default; nevertheless, everybody but me thinks this is a good idea. llvm-svn: 49663	2008-04-14 17:54:17 +00:00
Anton Korobeynikov	ea8dbf596a	Provide option for stack alignment override llvm-svn: 49593	2008-04-12 22:12:22 +00:00
Arnold Schwaighofer	82af0e6a43	This patch corrects the handling of byval arguments for tailcall optimized x86-64 (and x86) calls so that they work (... at least for my test cases). Should fix the following problems: Problem 1: When i introduced the optimized handling of arguments for tail called functions (using a sequence of copyto/copyfrom virtual registers instead of always lowering to top of the stack) i did not handle byval arguments correctly e.g they did not work at all :). Problem 2: On x86-64 after the arguments of the tail called function are moved to their registers (which include ESI/RSI etc), tail call optimization performs byval lowering which causes xSI,xDI, xCX registers to be overwritten. This is handled in this patch by moving the arguments to virtual registers first and after the byval lowering the arguments are moved from those virtual registers back to RSI/RDI/RCX. llvm-svn: 49584	2008-04-12 18:11:06 +00:00
Dan Gohman	15edbf989f	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Dan Gohman	41f9d24d52	Fix a bug that prevented x86-64 from using rep.movsq for 8-byte-aligned data. llvm-svn: 49571	2008-04-12 02:35:39 +00:00
Nate Begeman	81586b24d6	80 col fix llvm-svn: 49569	2008-04-12 00:47:57 +00:00
Chris Lattner	9f994482f5	add a note, this is actually not too bad to implement. llvm-svn: 49466	2008-04-10 05:54:50 +00:00
Chris Lattner	869325c4c4	move the x86-32 part of PR2108 here. llvm-svn: 49465	2008-04-10 05:37:47 +00:00
Chris Lattner	3b289289a7	Fix the x86-64 side of PR2108 by adding a v2f64 version of MOVZQI2PQIrr. This would be better handled as a dag combine (with the goal of eliminating the bitconvert) but I don't know how to do that safely. Thoughts welcome. llvm-svn: 49463	2008-04-10 05:13:43 +00:00
Dan Gohman	b3a511b236	Make isVectorClearMaskLegal's operand list const. llvm-svn: 49446	2008-04-09 20:09:42 +00:00
Dan Gohman	f4cd5a4801	Add XMM1 as a second return value register for f32 and f64 on x86-64. This is needed for the x86-64-ABI handling of structs that contain floating-point members that are returned by value. llvm-svn: 49441	2008-04-09 17:54:37 +00:00
Dan Gohman	9f7c4f6e16	Add DX as a second return value register for i16 on x86. llvm-svn: 49440	2008-04-09 17:53:38 +00:00
Dale Johannesen	2d29b1c5bb	Handle the situation in 2008-01-25-EmptyFunction.ll correctly when unwind info is being generated. llvm-svn: 49366	2008-04-08 00:37:56 +00:00
Dale Johannesen	ec0fe04044	Implement new llc flag -disable-required-unwind-tables. Corresponds to -fno-unwind-tables (usually default in gcc). llvm-svn: 49361	2008-04-08 00:10:24 +00:00
Dan Gohman	d7301ea935	Rename MemOperand to MachineMemOperand. This was suggested by review feedback from Chris quite a while ago. No functionality change. llvm-svn: 49348	2008-04-07 19:35:22 +00:00
Roman Levenstein	b40d332929	Re-commit of the r48822, where the infinite looping problem discovered by Dan Gohman is fixed. llvm-svn: 49330	2008-04-07 10:06:32 +00:00
Gabor Greif	6c6b8a57f3	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Evan Cheng	4d7b2ab16f	Favors pshufd over shufps when shuffling elements from one vector. pshufd is faster than shufps. llvm-svn: 49244	2008-04-05 00:30:36 +00:00
Evan Cheng	0585b4bc2a	Re-enable SSE4. llvm-svn: 49158	2008-04-03 08:53:29 +00:00
Evan Cheng	cb5a5467dc	Fix x86-64 encoding bug. REX prefix must always follow 0x0F prefix. For example, extractps in 64bit mode: 66 REX 0F 3A 17, not 66 0F 3A REX 17. llvm-svn: 49157	2008-04-03 08:53:17 +00:00
Evan Cheng	12d2bbde0d	Cosmetic llvm-svn: 49156	2008-04-03 07:45:18 +00:00
Evan Cheng	f112eb3b1c	Temporarily disabling SSE4 until we fix the encoding issues. llvm-svn: 49129	2008-04-03 04:49:54 +00:00
Evan Cheng	497c607fae	Backing out 48222 temporarily. llvm-svn: 49124	2008-04-03 03:13:16 +00:00
Dale Johannesen	84a1314ea1	Cosmetic changes per EH patch review feedback. llvm-svn: 49096	2008-04-02 17:04:45 +00:00
Anton Korobeynikov	d3330dfbf6	Add new CC lowering rule: provide a list of registers, which can be 'shadowed', when some another register is used for argument passing. Currently is used on Win64. llvm-svn: 49079	2008-04-02 05:23:57 +00:00
Dale Johannesen	79633a914f	Recommitting EH patch; this should answer most of the review feedback. -enable-eh is still accepted but doesn't do anything. EH intrinsics use Dwarf EH if the target supports that, and are handled by LowerInvoke otherwise. The separation of the EH table and frame move data is, I think, logically figured out, but either one still causes full EH info to be generated (not sure how to split the metadata correctly). MachineModuleInfo::needsFrameInfo is no longer used and is removed. llvm-svn: 49064	2008-04-02 00:25:04 +00:00
Evan Cheng	e1eee9570f	ReMat of load from stub in pic mode extends the life of pic base. Currently spiller doesn't do a good job of estimating the impact. Disable for now. llvm-svn: 49059	2008-04-01 23:26:12 +00:00
Evan Cheng	5c98bdbc4f	Remove unnecessary and non-deterministic checking code. Re-enable remat of load from gv stub. llvm-svn: 49054	2008-04-01 21:38:20 +00:00
Dan Gohman	a3e01dc1ec	Don't use __bzero for memset if the second argument isn't zero. llvm-svn: 49050	2008-04-01 20:56:18 +00:00
Dan Gohman	168b2b1300	Speculatively micro-optimize memory-zeroing calls on Darwin 10. llvm-svn: 49048	2008-04-01 20:38:36 +00:00
Dale Johannesen	8813206b7f	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Evan Cheng	d7f2ac9a0a	Disabling remat of load from gv stub (temporarily) again to fix llvmgcc bootstrap miscompare. llvm-svn: 49037	2008-04-01 07:33:13 +00:00
Dale Johannesen	1336104c02	Accept 'y' constraint (MMX) in inline asm. llvm-svn: 49011	2008-04-01 00:57:48 +00:00
Dale Johannesen	fa4433be71	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Evan Cheng	a3ce7b4c76	It's not safe to fold a load from GV stub or constantpool into a two-address use. llvm-svn: 49002	2008-03-31 23:19:51 +00:00
Evan Cheng	38a755499d	Move reMaterialize() from TargetRegisterInfo to TargetInstrInfo. llvm-svn: 48995	2008-03-31 20:40:39 +00:00
Evan Cheng	38bfff8a16	Re-apply 48911. llvm-svn: 48977	2008-03-31 07:54:19 +00:00
Dan Gohman	227e702cae	Fix a tokenfactor node to use the load chain rather than the load value. This fixes PR2177. llvm-svn: 48932	2008-03-28 23:45:16 +00:00
Evan Cheng	10d0aba260	Backing out 48911 for now. It's breaking stuff. llvm-svn: 48922	2008-03-28 17:49:06 +00:00
Evan Cheng	3b54c5fa08	New entry. llvm-svn: 48912	2008-03-28 07:07:06 +00:00
Evan Cheng	d66e48366f	Load from stub is already re-materializable. llvm-svn: 48911	2008-03-28 06:49:25 +00:00
Evan Cheng	e66720fd57	Code clean up. llvm-svn: 48856	2008-03-27 01:45:11 +00:00
Evan Cheng	aca67f0b29	Allow certain lea instructions to be rematerialized. llvm-svn: 48855	2008-03-27 01:41:09 +00:00
Evan Cheng	1afaf3092f	Remove an unused command line option. llvm-svn: 48854	2008-03-27 01:30:24 +00:00
Roman Levenstein	55b8822511	Use a linked data structure for the uses lists of an SDNode, just like LLVM Value/Use does and MachineRegisterInfo/MachineOperand does. This allows constant time for all uses list maintenance operations. The idea was suggested by Chris. Reviewed by Evan and Dan. Patch is tested and approved by Dan. On normal use-cases compilation speed is not affected. On very big basic blocks there are compilation speedups in the range of 15-20% or even better. llvm-svn: 48822	2008-03-26 12:39:26 +00:00
Evan Cheng	6323ea8467	Fix some SSE4.1 instruction encoding bugs. llvm-svn: 48815	2008-03-26 08:11:49 +00:00
Dale Johannesen	8c1e95810f	Use ## for comment delimiter on darwin x86-32, so llvm's output .s files will go through gcc -std=c99 without triggering preprocesser errors. Approach suggested by Daveed Vandevoorde. llvm-svn: 48808	2008-03-25 23:29:30 +00:00
Evan Cheng	6226a78cb1	Smaller function alignment when optimizing for size. llvm-svn: 48805	2008-03-25 22:29:46 +00:00
Dan Gohman	2b96ce84aa	Add explicit keywords. llvm-svn: 48801	2008-03-25 22:06:05 +00:00
Dan Gohman	22002efa15	A quick nm audit turned up several fixed tables and objects that were marked read-write. Use const so that they can be allocated in a read-only segment. llvm-svn: 48800	2008-03-25 21:45:14 +00:00

... 3 4 5 6 7 ...

3657 Commits