llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 14:32:51 +01:00

Author	SHA1	Message	Date
Nate Begeman	73efed7a4c	Remove dead PatLeaf; there are a number of issues around MMX movl that need to be fixed. llvm-svn: 54026	2008-07-25 17:25:04 +00:00
Evan Cheng	20c9cdbe69	Fix PR2485: do all 4-element SSE shuffles in max. of 2 shuffle instructions. Based on patch by Nicolas Capens. llvm-svn: 53939	2008-07-23 00:22:17 +00:00
Evan Cheng	ff0bd19937	Factor out SSE 4 wide shuffle lowering code into its own function. No functionality changes. llvm-svn: 53933	2008-07-22 21:13:36 +00:00
Evan Cheng	901d469e05	Fix PR2574: implement v2f32 scalar_to_vector. llvm-svn: 53927	2008-07-22 18:39:19 +00:00
Anton Korobeynikov	f13fbd6879	Fix encoding of atomic compare and swap for i64 llvm-svn: 53911	2008-07-22 16:22:48 +00:00
Evan Cheng	a2bb31372d	Eliminate a compilation warning. llvm-svn: 53873	2008-07-21 20:02:45 +00:00
Dan Gohman	b91bef08a7	Add titles to the various SelectionDAG viewGraph calls that include useful information like the name of the block being viewed and the current phase of compilation. llvm-svn: 53872	2008-07-21 20:00:07 +00:00
Duncan Sands	6e31474e71	Add VerifyNode, a place to put sanity checks on generic SDNode's (nodes with their own constructors should do sanity checking in the constructor). Add sanity checks for BUILD_VECTOR and fix all the places that were producing bogus BUILD_VECTORs, as found by "make check". My favorite is the BUILD_VECTOR with only two operands that was being used to build a vector with four elements! llvm-svn: 53850	2008-07-21 10:20:31 +00:00
Evan Cheng	ffd51ccf6b	Use movaps instead of movups to spill 16-byte vector values when default alignment is >= 16. This fixes some massive performance regressions. llvm-svn: 53844	2008-07-21 06:34:17 +00:00
Bill Wendling	98b6e63176	Fix for first part of PR2562. Generate the "pinsrw" instruction for inserts into v4i16 vectors. llvm-svn: 53807	2008-07-20 02:32:23 +00:00
Anton Korobeynikov	449fb584e4	Fix a FIXME :) llvm-svn: 53789	2008-07-19 13:15:46 +00:00
Anton Korobeynikov	5c0eb7e991	Use generic ELFTargetAsmInfo and DarwinTargetAsmInfo for X86 code llvm-svn: 53788	2008-07-19 13:15:21 +00:00
Anton Korobeynikov	6e00357dd6	Use aligned stack spills, where possible. This fixes PR2549. llvm-svn: 53784	2008-07-19 06:30:51 +00:00
Dan Gohman	8981962672	Add a new function, ReplaceAllUsesOfValuesWith, which handles bulk replacement of multiple values. This is slightly more efficient than doing multiple ReplaceAllUsesOfValueWith calls, and theoretically could be optimized even further. However, an important property of this new function is that it handles the case where the source value set and destination value set overlap. This makes it feasible for isel to use SelectNodeTo in many very common cases, which is advantageous because SelectNodeTo avoids a temporary node and it doesn't require CSEMap updates for users of values that don't change position. Revamp MorphNodeTo, which is what does all the work of SelectNodeTo, to handle operand lists more efficiently, and to correctly handle a number of corner cases to which its new wider use exposes it. This commit also includes a change to the encoding of post-isel opcodes in SDNodes; now instead of being sandwiched between the target-independent pre-isel opcodes and the target-dependent pre-isel opcodes, post-isel opcodes are now represented as negative values. This makes it possible to test if an opcode is pre-isel or post-isel without having to know the size of the current target's post-isel instruction set. These changes speed up llc overall by 3% and reduce memory usage by 10% on the InstructionCombining.cpp testcase with -fast and -regalloc=local. llvm-svn: 53728	2008-07-17 19:10:17 +00:00
Nate Begeman	64f8f7f6bb	Remove unnecessary readme entry llvm-svn: 53722	2008-07-17 17:21:14 +00:00
Nate Begeman	61f6c21028	Fix a typo in last commit llvm-svn: 53720	2008-07-17 17:04:58 +00:00
Nate Begeman	af01bfff99	SSE codegen for vsetcc nodes llvm-svn: 53719	2008-07-17 16:51:19 +00:00
Mon P Wang	57cd9d6e5a	When lowering certain atomics, we need to copy the memoperand from the old atomic operation to the new one. llvm-svn: 53714	2008-07-17 04:54:06 +00:00
Devang Patel	a6c5ff690a	Mark function used by asm block as used, otherwise optimizer may not see the use and may delete the function. llvm-svn: 53692	2008-07-16 17:54:34 +00:00
Dan Gohman	4c8c8e3aad	Fix the result type of X86's truncate to i8. llvm-svn: 53688	2008-07-16 16:20:48 +00:00
Evan Cheng	face16f9d8	x86-64 PIC JIT fixes: do not generate the extra load for external GV's. llvm-svn: 53661	2008-07-16 01:34:02 +00:00
Evan Cheng	cabfd3f78c	X86-64 PIC jump table values are different from x86-32 cases, they are dest - table base. llvm-svn: 53660	2008-07-16 01:33:08 +00:00
Dan Gohman	bf47a27643	Add a utility function to MachineInstr for testing whether an instruction has exactly one MachineMemOperand, and change some X86 lowering code to make use of it. llvm-svn: 53498	2008-07-12 00:10:52 +00:00
Dan Gohman	4c18394001	Include a frame index in the "fixed stack" pseudo source value instead of using the frame index for the SVOffset, which was inconsistent. llvm-svn: 53486	2008-07-11 22:44:52 +00:00
Bill Wendling	9f17caa9a9	The frame address on an x86-64 box needs to be offset by -8, not -4. llvm-svn: 53450	2008-07-11 07:18:52 +00:00
Evan Cheng	02a618dc56	Fix for PR2472. Use movss to set lower 32-bits of a zero XMM vector. llvm-svn: 53386	2008-07-10 01:08:23 +00:00
Anton Korobeynikov	9eae9520a9	Remove a FIXME: we really need to use const_data section on darwin for constant pool, if relocation model is not static. This directly maps to the way how GCC works. llvm-svn: 53370	2008-07-09 21:54:26 +00:00
Anton Korobeynikov	a5955dc461	Add FIXME for future checking. llvm-svn: 53368	2008-07-09 21:38:28 +00:00
Dale Johannesen	36a38a5ba1	Emit debug info for data-only files. This version is X86 ATT only. llvm-svn: 53355	2008-07-09 20:55:35 +00:00
Anton Korobeynikov	61f4175d64	Add missed section llvm-svn: 53354	2008-07-09 20:47:55 +00:00
Anton Korobeynikov	57e9182691	Distinguish .const and .const_data on Darwin, when needed. This is somehow crazy :) llvm-svn: 53350	2008-07-09 20:01:42 +00:00
Anton Korobeynikov	67931c35bd	Weak stuff always goes to coalesced sections on Darwin llvm-svn: 53340	2008-07-09 19:06:02 +00:00
Dan Gohman	8a421248b9	Remove #include <iostream>. llvm-svn: 53333	2008-07-09 18:08:48 +00:00
Anton Korobeynikov	2b2543166d	Add FIXME needed to be resolved later llvm-svn: 53324	2008-07-09 13:30:02 +00:00
Anton Korobeynikov	32e4256260	Typo llvm-svn: 53322	2008-07-09 13:29:27 +00:00
Anton Korobeynikov	d31e7ad0cf	Revert accidentally added stuff llvm-svn: 53321	2008-07-09 13:29:08 +00:00
Anton Korobeynikov	03614f247c	First sketch of special section objects llvm-svn: 53320	2008-07-09 13:28:49 +00:00
Anton Korobeynikov	395dac000b	Honour text sections llvm-svn: 53319	2008-07-09 13:28:19 +00:00
Anton Korobeynikov	5ad0c235f1	Use isWeakForLinker() hook llvm-svn: 53318	2008-07-09 13:27:59 +00:00
Anton Korobeynikov	f16db15839	Switch to new section name handling facility llvm-svn: 53316	2008-07-09 13:27:16 +00:00
Anton Korobeynikov	1f697cd97b	Another bunch of hacks for named sections support llvm-svn: 53315	2008-07-09 13:26:52 +00:00
Anton Korobeynikov	d0f5cb4490	Typo llvm-svn: 53314	2008-07-09 13:26:24 +00:00
Anton Korobeynikov	93fe3c3fad	Drop mergeable flag, if size is not suitable llvm-svn: 53313	2008-07-09 13:26:05 +00:00
Anton Korobeynikov	df663a8ddf	Fix several bugs in named sections handling llvm-svn: 53312	2008-07-09 13:25:46 +00:00
Anton Korobeynikov	933bf0ecc4	Add hacky way to distinguish named and named sections. This will be generalized in the future. llvm-svn: 53311	2008-07-09 13:25:26 +00:00
Anton Korobeynikov	3bde8f2e24	Fix thinko llvm-svn: 53309	2008-07-09 13:24:38 +00:00
Anton Korobeynikov	d30979695f	Drop dead member reference llvm-svn: 53308	2008-07-09 13:24:18 +00:00
Anton Korobeynikov	9f05fccb88	Add funny darwin section selection logic llvm-svn: 53307	2008-07-09 13:23:57 +00:00
Anton Korobeynikov	751cfda7dd	Handle ELF mergeable sections llvm-svn: 53306	2008-07-09 13:23:37 +00:00
Anton Korobeynikov	dd347538c8	Provide section selection for X86 ELF targets llvm-svn: 53305	2008-07-09 13:23:08 +00:00
Anton Korobeynikov	f42d75201a	Provide general hook for section name calculation llvm-svn: 53304	2008-07-09 13:22:46 +00:00
Anton Korobeynikov	c421fcddb4	Print entity size for mergeable sections llvm-svn: 53303	2008-07-09 13:22:17 +00:00
Anton Korobeynikov	849c8617be	Split PrintSectionFlags llvm-svn: 53302	2008-07-09 13:21:49 +00:00
Anton Korobeynikov	7f21791b33	Split UniqueSectionForGlobal() llvm-svn: 53301	2008-07-09 13:21:29 +00:00
Anton Korobeynikov	61aca29278	Split PreferredEHDataFormat hook llvm-svn: 53300	2008-07-09 13:21:08 +00:00
Anton Korobeynikov	32d3d15c2e	Split X86TargetAsmInfo into 4 subtarget-specific classes llvm-svn: 53299	2008-07-09 13:20:48 +00:00
Anton Korobeynikov	80f2417e3b	Whitespace cleanup llvm-svn: 53298	2008-07-09 13:20:27 +00:00
Anton Korobeynikov	059999d321	Move flag decoding stuff into special hook llvm-svn: 53297	2008-07-09 13:20:07 +00:00
Anton Korobeynikov	ca271dd426	Properly handle linkonce stuff llvm-svn: 53296	2008-07-09 13:19:38 +00:00
Anton Korobeynikov	782a69505d	Provide skeleton code for calculation of section, where global should be emitted into llvm-svn: 53295	2008-07-09 13:19:08 +00:00
Evan Cheng	f51c436a1b	Back out 53254. It broke ppc debug info codegen. llvm-svn: 53280	2008-07-09 06:36:53 +00:00
Dale Johannesen	d609d7166c	Make debug info come out in data-only files. This is a question of the debugging setup code not being called at the right time, and it's called from target-dependent code for some reason. I have only attempted to fix Darwin, but I'm pretty sure it's broken elsewhere; I'll leave that to people who can test it. llvm-svn: 53254	2008-07-08 21:56:22 +00:00
Evan Cheng	6af015292e	Unbreak C++ tests on x86 Darwin. llvm-svn: 53237	2008-07-08 16:40:43 +00:00
Evan Cheng	5be1103646	Avoid unnecessary string construction during asm printing. llvm-svn: 53215	2008-07-08 00:55:58 +00:00
Dan Gohman	cd25487258	Pool-allocation for MachineInstrs, MachineBasicBlocks, and MachineMemOperands. The pools are owned by MachineFunctions. This drastically reduces the number of calls to malloc/free made during the "Emit" phase of scheduling, as well as later phases in CodeGen. Combined with other changes, this speeds up the "instruction selection" phase of CodeGen by 10% in some cases. llvm-svn: 53212	2008-07-07 23:14:23 +00:00
Evan Cheng	688a8070f4	ATT asm printer just prints register AsmName's instead of calling tolower on each character of Name. This speeds it up by 10%. llvm-svn: 53208	2008-07-07 22:21:06 +00:00
Dan Gohman	c97817aac3	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Duncan Sands	3ea6f15708	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Duncan Sands	aac5c915ed	Linux also does not require exception handling moves in order to get correct debug info. Since I can't imagine how any target could possibly be any different, I've just stripped out the option: now all the world's like Darwin! llvm-svn: 53134	2008-07-04 09:55:48 +00:00
Evan Cheng	3e6a03a4b6	Back out 53091 for now. llvm-svn: 53109	2008-07-03 18:11:29 +00:00
Evan Cheng	1f6148a84c	- Remove calls to copyKillDeadInfo which is an N^2 function. Instead, propagate kill / dead markers as new instructions are constructed in foldMemoryOperand, convertToThreeAddress, etc. - Also remove LiveVariables::instructionChanged, etc. Replace all calls with cheaper calls which update VarInfo kill list. llvm-svn: 53097	2008-07-03 09:09:37 +00:00
Anton Korobeynikov	f3fc979d9c	llvm-gcc sometimes marks external declarations hidden, because initializers are processed separately. Honour such situation and emit PIC relocations properly in such case. llvm-svn: 53091	2008-07-03 07:43:14 +00:00
Evan Cheng	6d84ad83ca	commuteInstruction should preserve dead markers. llvm-svn: 53060	2008-07-03 00:04:51 +00:00
Owen Anderson	604f9f722d	Make LiveVariables even more optional, by making it optional in the call to TargetInstrInfo::convertToThreeAddressInstruction Also, if LV isn't around, then TwoAddr doesn't need to be updating flags, since they won't have been set in the first place. llvm-svn: 53058	2008-07-02 23:41:07 +00:00
Duncan Sands	21e2a711e3	Add a new getMergeValues method that does not need to be passed the list of value types, and use this where appropriate. Inappropriate places are where the value type list is already known and may be long, in which case the existing method is more efficient. llvm-svn: 53035	2008-07-02 17:40:58 +00:00
Bill Wendling	27c38cee90	Darwin doesn't need exception handling information for the "move" info when debug information is being output, because it's leet! llvm-svn: 52994	2008-07-01 23:34:48 +00:00
Dan Gohman	83c1b4cede	Prune a few dependencies on MachineFunction.h. llvm-svn: 52976	2008-07-01 18:15:35 +00:00
Evan Cheng	67ce381ffe	Do not use computationally expensive scheduling heuristics with -fast. llvm-svn: 52971	2008-07-01 18:05:03 +00:00
Duncan Sands	d8d11501c9	Highlight that getMergeValues optimization is being suppressed here. llvm-svn: 52952	2008-07-01 08:00:49 +00:00
Dan Gohman	c8097f8c8c	Split ISD::LABEL into ISD::DBG_LABEL and ISD::EH_LABEL, eliminating the need for a flavor operand, and add a new SDNode subclass, LabelSDNode, for use with them to eliminate the need for a label id operand. Change instruction selection to let these label nodes through unmodified instead of creating copies of them. Teach the MachineInstr emitter how to emit a MachineInstr directly from an ISD label node. This avoids the need for allocating SDNodes for the label id and flavor value, as well as SDNodes for each of the post-isel label, label id, and label flavor. llvm-svn: 52943	2008-07-01 00:05:16 +00:00
Dan Gohman	c8c04b1ff4	std::ostream and std::string microoptimizations for asm printing. llvm-svn: 52929	2008-06-30 22:03:41 +00:00
Dan Gohman	e58f07e5d6	Update comments to new-style syntax. llvm-svn: 52925	2008-06-30 21:00:56 +00:00
Dan Gohman	6cc648891b	Rename ISD::LOCATION to ISD::DBG_STOPPOINT to better reflect its purpose, and give it a custom SDNode subclass so that it doesn't need to have line number, column number, filename string, and directory string, all existing as individual SDNodes to be the operands. This was the only user of ISD::STRING, StringSDNode, etc., so remove those and some associated code. This makes stop-points considerably easier to read in -view-legalize-dags output, and reduces overhead (creating new nodes and copying std::strings into them) on code containing debugging information. llvm-svn: 52924	2008-06-30 20:59:49 +00:00
Evan Cheng	3f664b6fd3	Split scheduling from instruction selection. llvm-svn: 52923	2008-06-30 20:45:06 +00:00
Duncan Sands	c882a4eba9	Revert the SelectionDAG optimization that makes it impossible to create a MERGE_VALUES node with only one result: sometimes it is useful to be able to create a node with only one result out of one of the results of a node with more than one result, for example because the new node will eventually be used to replace a one-result node using ReplaceAllUsesWith, cf X86TargetLowering::ExpandFP_TO_SINT. On the other hand, most users of MERGE_VALUES don't need this and for them the optimization was valuable. So add a new utility method getMergeValues for creating MERGE_VALUES nodes which by default performs the optimization. Change almost everywhere to use getMergeValues (and tidy some stuff up at the same time). llvm-svn: 52893	2008-06-30 10:19:09 +00:00
Anton Korobeynikov	0b708e559e	Unbreak llvm-svn: 52866	2008-06-28 11:10:06 +00:00
Anton Korobeynikov	8562255056	Temporarily revert invalid commit llvm-svn: 52865	2008-06-28 11:09:48 +00:00
Anton Korobeynikov	ea88d91267	Move printing of module-level GVs into dedicated helper llvm-svn: 52864	2008-06-28 11:09:32 +00:00
Anton Korobeynikov	77c3528f69	Use common naming convention llvm-svn: 52863	2008-06-28 11:09:17 +00:00
Anton Korobeynikov	2331efee7e	Factor out stuff into helper function llvm-svn: 52862	2008-06-28 11:09:01 +00:00
Anton Korobeynikov	908c1fab55	Cleanup llvm-svn: 52861	2008-06-28 11:08:44 +00:00
Anton Korobeynikov	adec555f96	Remove X86SharedAsmPrinter llvm-svn: 52860	2008-06-28 11:08:27 +00:00
Anton Korobeynikov	dcc6a8314a	whitespace cleanup llvm-svn: 52859	2008-06-28 11:08:09 +00:00
Anton Korobeynikov	03a62267fe	Make intel asmprinter child of generic asmprinter, not x86 shared asm printer. This leads to some code duplication, which will be resolved later. llvm-svn: 52858	2008-06-28 11:07:54 +00:00
Anton Korobeynikov	b75aeb6b1a	Cleanup llvm-svn: 52857	2008-06-28 11:07:35 +00:00
Anton Korobeynikov	f4017f7d50	Whitespace cleanup llvm-svn: 52856	2008-06-28 11:07:18 +00:00
Anton Korobeynikov	e48fe3dde8	Use StringSet instead of std::set<std::string> llvm-svn: 52836	2008-06-27 21:22:49 +00:00
Dale Johannesen	f170e29cf5	Fixes the last x86-64 test failure in compat.exp: <16 x float> is 64-byte aligned (for some reason), which gets us into the stack realignment code. The computation changing FP-relative offsets to SP-relative was broken, assigning a spill temp to a location also used for parameter passing. This fixes it by rounding up the stack frame to a multiple of the largest alignment (I concluded it wasn't fixable without doing this, but I'm not very sure.) llvm-svn: 52750	2008-06-26 01:51:13 +00:00
Evan Cheng	71fbfe73c1	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Dan Gohman	404964dbc0	Remove the OrigVT member from AtomicSDNode, as it is redundant with the base SDNode's VTList. llvm-svn: 52722	2008-06-25 16:07:49 +00:00
Mon P Wang	7d89d61387	Added MemOperands to Atomic operations since atomics touch memory. Added abstract class MemSDNode for any node that has an associated MemOperand. Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	bab5925a0b	Enable two-address remat by default. llvm-svn: 52701	2008-06-25 01:16:38 +00:00
Dale Johannesen	fdf8fe6c03	Add v2f32 (MMX) type to X86. Support is primitive: load,store,call,return,bitcast. This is enough to make call and return work. llvm-svn: 52691	2008-06-24 22:01:44 +00:00
Evan Cheng	a62f5f0f82	If it's determined safe, remat MOV32r0 (i.e. xor r, r) and others as it is instead of using the longer MOV32ri instruction. llvm-svn: 52670	2008-06-24 07:10:51 +00:00
Dan Gohman	9941a2dab3	Add a note about a potential PIC optimization. llvm-svn: 52663	2008-06-24 00:53:07 +00:00
Dan Gohman	ebc59c90b7	Fixes for being compiled PIC on Linux. This isn't the most general solution possible, but it's a fairly simple one. Based on a patch from the OpenGTL project! llvm-svn: 52662	2008-06-24 00:50:01 +00:00
Dan Gohman	c1aa753f00	Remove unnecessary #includes. llvm-svn: 52613	2008-06-22 19:21:26 +00:00
Eli Friedman	570aa6f801	Fix a bug with <8 x i16> shuffle lowering on X86 where parts of the shuffle could be skipped. The check is invalid because the loop index i doesn't correspond to the element actually inserted. The correct check is already done a few lines earlier, for whether the element is already in the right spot, so this shouldn't have any effect on the codegen for code that was already correct. llvm-svn: 52486	2008-06-19 06:09:51 +00:00
Evan Cheng	0570953e28	XOR32rr, etc. are not AsCheapAsMove, but MOV32ri, etc. are. llvm-svn: 52454	2008-06-18 08:13:07 +00:00
Evan Cheng	deb754898b	Unbreak DECLARE isel in pic mode. llvm-svn: 52439	2008-06-18 02:48:27 +00:00
Evan Cheng	89e2e3292d	Rather than avoiding to wrap ISD::DECLARE GV operand in X86ISD::Wrapper, simply handle it at dagisel time with x86 specific isel code. llvm-svn: 52377	2008-06-17 02:01:22 +00:00
Evan Cheng	4e7b7b21a2	Horizontal-add instructions are not commutative. llvm-svn: 52363	2008-06-16 21:16:24 +00:00
Evan Cheng	acd614c262	mpsadbw is commutable. llvm-svn: 52352	2008-06-16 20:25:59 +00:00
Evan Cheng	2dfe8c2435	Add option to commuteInstruction() which forces it to create a new (commuted) instruction. llvm-svn: 52308	2008-06-16 07:33:11 +00:00
Andrew Lenharth	327c3e7559	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Duncan Sands	40c8db881a	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. 
However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Anton Korobeynikov	74422b3cd0	Properly lower DYNAMIC_STACKALLOC - bracket all black magic with CALLSEQ_BEGIN & CALLSEQ_END. llvm-svn: 52225	2008-06-11 20:16:42 +00:00
Rafael Espindola	feaadb1e05	add support for PIC on linux x86-64 llvm-svn: 52139	2008-06-09 09:52:31 +00:00
Duncan Sands	fe2a970a5c	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Duncan Sands	d634afe3aa	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Evan Cheng	badbe3e3fa	Don't break strict aliasing. llvm-svn: 52026	2008-06-05 22:59:21 +00:00
Dale Johannesen	c0cd6cd4d4	Add StringConstantPrefix to control what the assembler names of string constants look like. llvm-svn: 51909	2008-06-03 18:09:06 +00:00
Rafael Espindola	feec40a71f	Don't use the GOT for symbols that are not externally visible. llvm-svn: 51865	2008-06-02 07:52:43 +00:00
Dan Gohman	00823cb0d4	Teach the DAGISelEmitter to not compute the variable_ops operand index for the input pattern in terms of the output pattern. Instead keep track of how many fixed operands the input pattern actually has, and have the input matching code pass the output-emitting function that index value. This simplifies the code, disentangles variables_ops from the support for predication operations, and makes variable_ops more robust. llvm-svn: 51808	2008-05-31 02:11:25 +00:00
Bill Wendling	244b4db58d	Add the "AsCheapAsAMove" flag to some 64-bit xor instructions. llvm-svn: 51761	2008-05-30 06:47:04 +00:00
Dan Gohman	aa8fcd5657	Add patterns for CALL32m and CALL64m. They aren't matched in most cases due to an isel deficiency already noted in lib/Target/X86/README.txt, but they can be matched in this fold-call.ll testcase, for example. This is interesting mainly because it exposes a tricky tblgen bug; tblgen was incorrectly computing the starting index for variable_ops in the case of a complex pattern. llvm-svn: 51706	2008-05-29 21:50:34 +00:00
Dan Gohman	4e87d82476	Fix a tblgen problem handling variable_ops in tblgen instruction definitions. This adds a new construct, "discard", for indicating that a named node in the input matching pattern is to be discarded, instead of corresponding to a node in the output pattern. This allows tblgen to know where the arguments for the variable_ops are supposed to begin. This fixes "rdar://5791600", whatever that is ;-). llvm-svn: 51699	2008-05-29 19:57:41 +00:00
Dan Gohman	e256337a1a	Expand small memmovs using inline code. Set the X86 threshold for expanding memmove to a more plausible value, now that it's actually being used. llvm-svn: 51696	2008-05-29 19:42:22 +00:00
Evan Cheng	04c0915a2f	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. llvm-svn: 51667	2008-05-29 08:22:04 +00:00
Bill Wendling	81199f0cc8	XOR?RI instructions aren't as cheap as moves. llvm-svn: 51664	2008-05-29 03:46:36 +00:00
Bill Wendling	edb38e9410	Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the like. llvm-svn: 51662	2008-05-29 01:02:09 +00:00
Dan Gohman	a5549a2f9c	Fix the encoding for two more "rm" instructions that were using MRMSrcReg. llvm-svn: 51630	2008-05-28 01:50:19 +00:00
Mon P Wang	8e37b2d13e	Fixed X86 encoding error CVTPS2PD and CVTPD2PS when the source operand is a memory location llvm-svn: 51626	2008-05-28 00:42:27 +00:00
Nate Begeman	23dd264da6	Don't attempt to create VZEXT_LOAD out of an extload. This is an issue where the code generator would do something like this: f64 = load f32 <anyext>, f32mem v2f64 = insertelt undef, %0, 0 v2f64 = insertelt %1, 0.0, 1 into v2f64 = vzext_load f32mem which on x86 is movsd, when you really wanted a cvtss2sd/movsd pair. llvm-svn: 51624	2008-05-28 00:24:25 +00:00
Evan Cheng	e5e0b4660d	Eliminate x86.sse2.punpckh.qdq and x86.sse2.punpckl.qdq. llvm-svn: 51533	2008-05-24 02:56:30 +00:00
Evan Cheng	564238c841	Eliminate x86.sse2.movs.d, x86.sse2.shuf.pd, x86.sse2.unpckh.pd, and x86.sse2.unpckl.pd intrinsics. These will be lowered into shuffles. llvm-svn: 51531	2008-05-24 02:14:05 +00:00
Evan Cheng	d312ced1cf	This is done. llvm-svn: 51526	2008-05-24 00:10:13 +00:00
Evan Cheng	98a292a302	Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into load and shuffle instructions. llvm-svn: 51522	2008-05-24 00:07:29 +00:00
Evan Cheng	4f660778f0	Use movlps / movhps to modify low / high half of 16-byte memory location. llvm-svn: 51501	2008-05-23 21:23:16 +00:00
Dan Gohman	e8422fc112	Elaborate on the entry on integer vector multiplication by constants. llvm-svn: 51491	2008-05-23 18:05:39 +00:00
Evan Cheng	ec8bd19399	Fix a duplicated pattern. llvm-svn: 51490	2008-05-23 18:00:18 +00:00
Dan Gohman	6cc0b4f262	Use PMULDQ for v2i64 multiplies when SSE4.1 is available. And add load-folding table entries for PMULDQ and PMULLD. llvm-svn: 51489	2008-05-23 17:49:40 +00:00
Evan Cheng	e7ec4690e1	New entry. llvm-svn: 51487	2008-05-23 17:28:11 +00:00
Chris Lattner	4c1ffef5af	we compile multiply-by-constant into horrible code. Doesn't sse4 have some instruction for doing this? llvm-svn: 51473	2008-05-23 04:29:53 +00:00
Evan Cheng	097e95b1f7	Bug: rcpps can only fold a load if the address is 16-byte aligned. Fixed many 'ps' load folding patterns in X86InstrSSE.td which are missing the proper alignment checks. Also fixed some 80 col. violations. llvm-svn: 51462	2008-05-23 00:37:07 +00:00
Dale Johannesen	7cc19db16f	Put const weak stuff in appropriate section on Darwin. g++.dg/abi/key2.C llvm-svn: 51458	2008-05-23 00:16:59 +00:00
Evan Cheng	2dc53b5d58	X86CodeEmitter should not set PIC style to None at initialization time. This will break codegen if relocation model is changed to PIC_ later. llvm-svn: 51455	2008-05-22 23:55:24 +00:00
Evan Cheng	d1373cd497	Add missing patterns. llvm-svn: 51435	2008-05-22 18:56:56 +00:00
Evan Cheng	d694e78e36	movsd and movq do not require 16-byte alignment. This fixes vec_set-5.ll on Linux. llvm-svn: 51327	2008-05-20 18:24:47 +00:00
Evan Cheng	e95fc3e83d	runOnMachineFunction should set IsPIC because relocation model may have been changed. llvm-svn: 51291	2008-05-20 01:56:59 +00:00
Dale Johannesen	e6977495aa	Handle quoted names when constructing $stub's, $non_lazy_ptr's and $lazy_ptr's. llvm-svn: 51277	2008-05-19 21:38:18 +00:00
Dale Johannesen	ebc511c6aa	Treat common as distinct from weak global on Darwin x86. llvm-svn: 51172	2008-05-16 00:52:06 +00:00
Evan Cheng	73dadf21ce	Fix typos and comments. llvm-svn: 51165	2008-05-15 22:13:02 +00:00
Evan Cheng	778a5e27b0	Make use of vector load and store operations to implement memcpy, memmove, and memset. Currently only X86 target is taking advantage of these. llvm-svn: 51140	2008-05-15 08:39:06 +00:00
Dale Johannesen	768b6f281e	Add CommonLinkage; currently tentative definitions are represented as "weak", but there are subtle differences in some cases on Darwin, so we need both. The intent is that "common" will behave identically to "weak" unless somebody changes their target to do something else. No functional change as yet. llvm-svn: 51118	2008-05-14 20:12:51 +00:00
Evan Cheng	95987c2586	Doh. Alignment is in bytes, not in bits. llvm-svn: 51092	2008-05-14 02:49:43 +00:00
Dan Gohman	f9d5689496	Change target-specific classes to use more precise static types. This eliminates the need for several awkward casts, including the last dynamic_cast under lib/Target. llvm-svn: 51091	2008-05-14 01:58:56 +00:00
Chris Lattner	a11adf725d	add a note llvm-svn: 51062	2008-05-13 19:56:20 +00:00
Evan Cheng	cb56638548	- Fix the pasto in the fix for a previous pasto. - Incorporate Chris' comment suggestion. llvm-svn: 51061	2008-05-13 18:59:59 +00:00
Chris Lattner	c9eb6a7d64	add a note llvm-svn: 51060	2008-05-13 18:48:54 +00:00
Nate Begeman	c290daf581	Fix one more encoding bug. llvm-svn: 51057	2008-05-13 17:52:09 +00:00
Evan Cheng	cf6928983b	- Don't treat anyext 16-bit load as a 32-bit load if it's volatile. - Correct a pasto. llvm-svn: 51054	2008-05-13 16:45:56 +00:00
Evan Cheng	9e15622879	Instead of a vector load, shuffle and then extract an element, load the element from the address with an offset: pshufd $1, (%rdi), %xmm0; movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Nate Begeman	b9a3d141aa	Fix an encoding error in the psrad xmm, imm8 instruction. llvm-svn: 51020	2008-05-13 01:47:52 +00:00
Evan Cheng	e4ee4c2870	On x86, it's safe to treat i32 load anyext as a normal i32 load. Ditto for i8 anyext load to i16. llvm-svn: 51019	2008-05-13 00:54:02 +00:00
Dan Gohman	bab18cae46	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	5d939498c3	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Evan Cheng	fcbdc8bd6e	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Bill Wendling	646f3458c4	Constify the machine instruction passed into the "is{Trivially,Really}ReMaterializable" methods. llvm-svn: 51001	2008-05-12 20:54:26 +00:00
Nate Begeman	2ae55cecc6	Initial X86 codegen support for VSETCC. llvm-svn: 51000	2008-05-12 20:34:32 +00:00
Dan Gohman	efa0925915	Fix a copy+paste bug; pseudo-instructions shouldn't have encoding information. llvm-svn: 50997	2008-05-12 20:22:45 +00:00
Evan Cheng	c7e9acfed7	Refactor isConsecutiveLoad from X86 to TargetLowering so DAG combiner can make use of it. llvm-svn: 50991	2008-05-12 19:56:52 +00:00
Dan Gohman	8212eaa43a	Fix a compile error on compilers that still want a return value in a non-void function that calls abort. llvm-svn: 50969	2008-05-12 16:17:19 +00:00
Anton Korobeynikov	ad83aeb489	Add note llvm-svn: 50959	2008-05-11 14:33:15 +00:00
Evan Cheng	c19c639ad7	When transforming a vector_shuffle to a load, the base address must not be an undef. llvm-svn: 50940	2008-05-10 06:46:49 +00:00
Dan Gohman	4b23d9e60a	For now, abort when an ISD::VAARG is encountered on x86-64, rather than silently generate invalid code. llvm-gcc does not currently use VAArgInst; it lowers va_arg in the front-end. llvm-svn: 50930	2008-05-10 01:26:14 +00:00
Evan Cheng	6a3fa28b38	Some clean up. llvm-svn: 50929	2008-05-10 00:59:18 +00:00
Evan Cheng	79230955a8	If movl top bits are undef, let it be selected to movlps, etc. llvm-svn: 50928	2008-05-10 00:58:41 +00:00
Evan Cheng	2adea48f7e	Add a pattern to move the low element of a v4f32 and zero-extend the rest. llvm-svn: 50922	2008-05-09 23:37:55 +00:00
Evan Cheng	3493e43afd	Handle a few more cases of folding load i64 into xmm and zero top bits. Note, some of the code will be moved into target independent part of DAG combiner in a subsequent patch. llvm-svn: 50918	2008-05-09 21:53:03 +00:00
Evan Cheng	f824b47188	Use movq to move low half of XMM register and zero-extend the rest. llvm-svn: 50874	2008-05-08 22:35:02 +00:00
Evan Cheng	f97e716511	Handle vector move / load which zero the destination register top bits (i.e. movd, movq, movss (addr), movsd (addr)) with X86 specific dag combine. llvm-svn: 50838	2008-05-08 00:57:18 +00:00
Duncan Sands	6f4e916c6a	Output correct exception handling and frame info on x86-64 linux. This causes no regressions on 32 bit linux and 32 bit ppc. More tests pass on 64 bit ppc with no regressions. I didn't turn on eh on 64 bit linux because the intrinsics needed to compile the eh runtime aren't done yet. But if you turn it on and link with the mainline runtime then eh seems to work fine on x86-64 linux with this patch. Thanks to Dale for testing. The main point of the patch is that if you output that some object is encoded using 4 bytes you had better not output 8 bytes for it: the patch makes everything consistent. llvm-svn: 50825	2008-05-07 19:11:09 +00:00
Chris Lattner	9f4f2444ea	add a micro optzn. llvm-svn: 50681	2008-05-05 23:19:45 +00:00
Mon P Wang	34b3f18a70	Improved generated code for atomic operators llvm-svn: 50677	2008-05-05 22:56:23 +00:00
Evan Cheng	44d49e72a1	Code clean up. No functionality change. llvm-svn: 50675	2008-05-05 22:12:23 +00:00
Mon P Wang	84a269e023	Added additional atomic intrinsics: and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	4a674dc536	Fix IsLinux being uninitialized on non-Linux targets. llvm-svn: 50660	2008-05-05 18:43:07 +00:00
Anton Korobeynikov	12c48230f9	Fix 80col violation llvm-svn: 50654	2008-05-05 17:08:59 +00:00
Dan Gohman	8ee7bf053e	Use a dedicated IsLinux flag instead of an ELFLinux TargetType. llvm-svn: 50649	2008-05-05 16:11:31 +00:00
Dan Gohman	c860d9c77c	Add AsmPrinter support for emitting a directive to declare that the code being generated does not require an executable stack. Also, add target-specific code to make use of this on Linux on x86. llvm-svn: 50634	2008-05-05 00:28:39 +00:00
Anton Korobeynikov	04c974b1b2	Add General Dynamic TLS model for X86-64. Some parts look really ugly (look for tlsaddr pattern), but should work. Work is in progress; more models will follow. llvm-svn: 50630	2008-05-04 21:36:32 +00:00
Evan Cheng	a7747df955	Select vector shift with non-immediate i32 shift amount operand by first moving the operand into the right register. llvm-svn: 50619	2008-05-04 09:15:50 +00:00
Evan Cheng	c1c2adbfc6	Add separate intrinsics for MMX / SSE shifts with i32 integer operands. This allows us to simplify the horribly complicated matching code. llvm-svn: 50601	2008-05-03 00:52:09 +00:00
Evan Cheng	90b9027f68	Undo r50574. We are already ensuring the folded load address is 16-byte aligned. llvm-svn: 50578	2008-05-02 17:01:01 +00:00
Evan Cheng	583a346ec6	80 column violation. llvm-svn: 50575	2008-05-02 07:53:32 +00:00
Evan Cheng	862e3a147c	Not safe folding a load + FsXORPSrr into FsXORPSrm. It's loading a FR64 value but the load folding variant expects a 16-byte aligned address. llvm-svn: 50574	2008-05-02 07:50:58 +00:00
Arnold Schwaighofer	f58a35e2ec	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if: caller and callee are fastcc; call is a tail call (in tail call position, call followed by ret); no variable argument lists or byval arguments; option -tailcallopt is enabled. Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC). If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Dan Gohman	0285c1e9bb	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Anton Korobeynikov	54791c2a43	Fix FP return for Win64 ABI llvm-svn: 50342	2008-04-28 07:40:07 +00:00

3657 Commits