llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 05:23:45 +02:00

Author	SHA1	Message	Date
Owen Anderson	35f6bae989	Allow FastISel of three-register-operand instructions. llvm-svn: 130934	2011-05-05 17:59:04 +00:00
Chandler Carruth	c83eb00361	Remove an unused variable in NDEBUG (found with -Wunused-variable). llvm-svn: 130688	2011-05-02 05:49:01 +00:00
Jakob Stoklund Olesen	e0ec7ed462	Add a SlotIndexes::insertMachineInstrInMaps to insert the instruction after any null indexes. This makes a difference if a live interval is referring to a deleted instruction. It can be important to insert an instruction before or after a deleted instruction to avoid interference. llvm-svn: 130686	2011-05-02 05:29:56 +00:00
Rafael Espindola	eb5d0cb4f4	GCC uses a different encoding of pointers in the FDE when using -fno-dwarf2-cfi-asm. Implement the same behavior. llvm-svn: 130637	2011-05-01 04:49:54 +00:00
Jakob Stoklund Olesen	4ec9e1c33a	Avoid using stale entries form the sibling value map. This could happen when trying to use a value that had been eliminated after dead code elimination and folding loads. llvm-svn: 130597	2011-04-30 06:42:21 +00:00
Rafael Espindola	16b23d9ff7	Factor some code to needsCFIMoves. Avoid printing moves when we don't have to. llvm-svn: 130501	2011-04-29 14:14:06 +00:00
Chris Lattner	52b19aa2b5	add a missing operator that caused us to have to use (*MIB).foo everywhere. llvm-svn: 130473	2011-04-29 05:24:07 +00:00
Devang Patel	900ceb725b	Teach dwarf writer to handle complex address expression for .debug_loc entries. This fixes clang generated blocks' variables' debug info. Radar 9279956. llvm-svn: 130373	2011-04-28 02:22:40 +00:00
Rafael Espindola	36e419b524	Remove unnecessary argument. llvm-svn: 130343	2011-04-27 23:17:57 +00:00
Rafael Espindola	0525497a16	Rename getPersonalityPICSymbol to getCFIPersonalitySymbol, document it, and give it a bit more responsibility. Also implement it for MachO. If hacked to use cfi, 32 bit MachO will produce .cfi_personality 155, L___gxx_personality_v0$non_lazy_ptr and 64 bit will produce .cfi_presonality ___gxx_personality_v0 The general idea is that .cfi_personality gets passed the final symbol. It is up to codegen to produce it if using indirect representation (like 32 bit MachO), but it is up to MC to decide which relocations to create. llvm-svn: 130341	2011-04-27 23:08:15 +00:00
Eli Friedman	c5406cdb50	Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common. rdar://problem/9303592 . llvm-svn: 130338	2011-04-27 22:41:55 +00:00
Eli Friedman	fc1152d772	Remove unused function. llvm-svn: 130337	2011-04-27 22:21:02 +00:00
Devang Patel	42f4a7ff92	Revert r130178. It turned out to be not the optimal path to emit complex location expressions. llvm-svn: 130326	2011-04-27 20:29:27 +00:00
Evan Cheng	dea3347167	Be careful about scheduling nodes above previous calls. It increase usages of more callee-saved registers and introduce copies. Only allows it if scheduling a node above calls would end up lessen register pressure. Call operands also has added ABI restrictions for register allocation, so be extra careful with hoisting them above calls. rdar://9329627 llvm-svn: 130245	2011-04-26 21:31:35 +00:00
Jakob Stoklund Olesen	c9cf507d93	Use the new TRI->getLargestLegalSuperClass hook to constrain register class inflation. This has two effects: 1. We never inflate to a larger register class than what the sub-target can handle. 2. Completely unconstrained virtual registers get the largest possible register class. llvm-svn: 130229	2011-04-26 18:52:36 +00:00
Devang Patel	4969322bc4	Let dwarf writer allocate extra space in the debug location expression. This space, if requested, will be used for complex addresses of the Blocks' variables. llvm-svn: 130178	2011-04-26 00:12:46 +00:00
Jay Foad	c146569beb	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Owen Anderson	e1b33b92a3	Teach FastISel to deal with instructions that have two immediate operands. llvm-svn: 130033	2011-04-22 23:38:06 +00:00
Eric Christopher	4de9ef5cf7	Fix comment. llvm-svn: 130027	2011-04-22 23:08:45 +00:00
Chris Lattner	d9c0db9bd7	Recommit the fix for rdar://9289512 with a couple tweaks to fix bugs exposed by the gcc dejagnu testsuite: 1. The load may actually be used by a dead instruction, which would cause an assert. 2. The load may not be used by the current chain of instructions, and we could move it past a side-effecting instruction. Change how we process uses to define the problem away. llvm-svn: 130018	2011-04-22 21:59:37 +00:00
Devang Patel	4f25432e4e	Refactor. llvm-svn: 129938	2011-04-21 21:07:35 +00:00
Daniel Dunbar	3a96439b36	Revert r1296656, "Fix rdar://9289512 - not folding load into compare at -O0...", which broke a couple GCC test suite tests at -O0. llvm-svn: 129914	2011-04-21 16:14:46 +00:00
Stuart Hastings	a552942e02	ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> llvm-svn: 129858	2011-04-20 16:47:52 +00:00
Chris Lattner	5e00f501ff	Fix rdar://9289512 - not folding load into compare at -O0 The basic issue here is that bottom-up isel is matching the branch and compare, and was failing to fold the load into the branch/compare combo. Fixing this (by allowing folding into any instruction of a sequence that is selected) allows us to produce things like: cmpb $0, 52(%rax) je LBB4_2 instead of: movb 52(%rax), %cl cmpb $0, %cl je LBB4_2 This makes the generated -O0 code run a bit faster, but also speeds up compile time by putting less pressure on the register allocator and generating less code. This was one of the biggest classes of missing load folding. Implementing this shrinks 176.gcc's c-decl.s (as a random example) by about 4% in (verbose-asm) line count. llvm-svn: 129656	2011-04-17 06:35:44 +00:00
Rafael Espindola	9e5aaa3b78	Put each personality function in a section. This fixes the gnu ld warning: error in foo.o; no .eh_frame_hdr table will be created. llvm-svn: 129635	2011-04-16 03:51:21 +00:00
Rafael Espindola	694ad2f25c	Some refactoring suggested by Anton Korobeynikov. llvm-svn: 129600	2011-04-15 20:32:03 +00:00
Rafael Espindola	99831068c8	Add 129518 back with a fix for when we are producing eh just because of debug info. Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129571	2011-04-15 15:11:06 +00:00
Chris Lattner	0304b82f80	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
NAKAMURA Takumi	7aed456653	Revert r129518, "Change ELF systems to use CFI for producing the EH tables. This reduces the" It broke several builds. llvm-svn: 129557	2011-04-15 03:35:57 +00:00
Rafael Espindola	d5eed657e2	Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129518	2011-04-14 15:18:53 +00:00
Andrew Trick	e89c19ab7b	In the pre-RA scheduler, maintain cmp+br proximity. This is done by pushing physical register definitions close to their use, which happens to handle flag definitions if they're not glued to the branch. This seems to be generally a good thing though, so I didn't need to add a target hook yet. The primary motivation is to generate code closer to what people expect and rule out missed opportunity from enabling macro-op fusion. As a side benefit, we get several 2-5% gains on x86 benchmarks. There is one regression: SingleSource/Benchmarks/Shootout/lists slows down be -10%. But this is an independent scheduler bug that will be tracked separately. See rdar://problem/9283108. Incidentally, pre-RA scheduling is only half the solution. Fixing the later passes is tracked by: <rdar://problem/8932804> [pre-RA-sched] on x86, attempt to schedule CMP/TEST adjacent with condition jump Fixes: <rdar://problem/9262453> Scheduler unnecessary break of cmp/jump fusion llvm-svn: 129508	2011-04-14 05:15:06 +00:00
Jay Foad	b215137166	Like the coding standards say, do not use "using namespace std". llvm-svn: 129435	2011-04-13 12:46:01 +00:00
Jakob Stoklund Olesen	a1ffedb740	Skip a binary search when possible. llvm-svn: 129293	2011-04-11 20:01:44 +00:00
Jakob Stoklund Olesen	5add6d16b7	Build the Hopfield network incrementally when splitting global live ranges. It is common for large live ranges to have few basic blocks with register uses and many live-through blocks without any uses. This approach grows the Hopfield network incrementally around the use blocks, completely avoiding checking interference for some through blocks. llvm-svn: 129188	2011-04-09 02:59:09 +00:00
Andrew Trick	36a1759769	Added a check in the preRA scheduler for potential interference on a induction variable. The preRA scheduler is unaware of induction vars, so we look for potential "virtual register cycles" instead. Fixes <rdar://problem/8946719> Bad scheduling prevents coalescing llvm-svn: 129100	2011-04-07 19:54:57 +00:00
Jakob Stoklund Olesen	731b0d77a2	Use std::unique instead of a SmallPtrSet to ensure unique instructions in UseSlots. This allows us to always keep the smaller slot for an instruction which is what we want when a register has early clobber defines. Drop the UsingInstrs set and the UsingBlocks map. They are no longer needed. llvm-svn: 128886	2011-04-05 15:18:18 +00:00
Jakob Stoklund Olesen	65c8f18b8d	Cache the fairly expensive last split point computation and provide a fast inlined path for the common case. Most basic blocks don't contain a call that may throw, so the last split point os simply the first terminator. llvm-svn: 128874	2011-04-05 04:20:27 +00:00
Jakob Stoklund Olesen	78d65c6632	Stop caching basic block index ranges now that SlotIndexes can keep up. llvm-svn: 128821	2011-04-04 15:32:15 +00:00
Jakob Stoklund Olesen	024a1de4ae	Use basic block numbers as indexes when mapping slot index ranges. This is more compact and faster than using DenseMap. llvm-svn: 128763	2011-04-02 06:03:31 +00:00
Evan Cheng	39574b2766	Issue libcalls __udivmodi4 / __divmodi4 for div / rem pairs. rdar://8911343 llvm-svn: 128696	2011-04-01 00:42:02 +00:00
Jakob Stoklund Olesen	446412de55	Collect and coalesce DBG_VALUE instructions before emitting the function. Correctly terminate the range of register DBG_VALUEs when the register is clobbered or when the basic block ends. The code is now ready to deal with variables that are sometimes in a register and sometimes on the stack. We just need to teach emitDebugLoc to say 'stack slot'. llvm-svn: 128327	2011-03-26 02:19:36 +00:00
Evan Cheng	9d660cc8b7	Add comment to clarify what MachineConstantPoolEntry::isMachineConstantPoolEntry() means. llvm-svn: 128204	2011-03-24 06:28:45 +00:00
Jakob Stoklund Olesen	047a25b0b0	Dead code elimination may separate the live interval into multiple connected components. I have convinced myself that it can only happen when a phi value dies. When it happens, allocate new virtual registers for the components. llvm-svn: 127827	2011-03-17 20:37:07 +00:00
Jakob Stoklund Olesen	2786187b43	Rewrite instructions as part of ConnectedVNInfoEqClasses::Distribute. llvm-svn: 127779	2011-03-17 00:23:45 +00:00
Jakob Stoklund Olesen	29a9539e7f	Place context in member variables instead of passing around pointers. Use the opportunity to get rid of the trailing underscore variable names. llvm-svn: 127618	2011-03-14 20:57:14 +00:00
Owen Anderson	78afadfa5d	Teach FastISel to support register-immediate-immediate instructions. llvm-svn: 127496	2011-03-11 21:33:55 +00:00
Jim Grosbach	39476d9010	80 columns. llvm-svn: 127495	2011-03-11 21:02:27 +00:00
Jim Grosbach	4ed235527d	Trailing whitespace. llvm-svn: 127493	2011-03-11 20:59:19 +00:00
Jakob Stoklund Olesen	70541686bf	Make SpillIs an optional pointer. Avoid creating a bunch of temporary SmallVectors. llvm-svn: 127388	2011-03-10 01:21:58 +00:00
Jakob Stoklund Olesen	f9401745e0	Let shrinkToUses optionally return a list of now dead machine instructions. llvm-svn: 127192	2011-03-07 23:29:10 +00:00
Eric Christopher	e3080a1de4	Typos. llvm-svn: 127186	2011-03-07 22:48:16 +00:00
Jim Grosbach	cbd2f07c7e	Tidy up. llvm-svn: 127169	2011-03-07 19:28:43 +00:00
Owen Anderson	11a49e845a	Use the correct LHS type when determining the legalization of a shift's RHS type. llvm-svn: 127163	2011-03-07 18:29:47 +00:00
Anton Korobeynikov	d7910e3cb6	Provide hooks to set MI flags in MachineInstrBuilder llvm-svn: 127100	2011-03-05 18:43:20 +00:00
Anton Korobeynikov	c746be3dc4	Add FrameSetup MI flags llvm-svn: 127098	2011-03-05 18:43:04 +00:00
Anton Korobeynikov	eb2742ccf2	Shorten AsmPrinterFlags filed to accomodate for future Flags field llvm-svn: 127097	2011-03-05 18:42:54 +00:00
Jim Grosbach	372394916e	Teach the register scavenger to take subregs into account when finding a free register. llvm-svn: 127049	2011-03-05 00:20:19 +00:00
Jakob Stoklund Olesen	8b66caf12a	Renumber slot indexes locally when possible. Initially, slot indexes are quad-spaced. There is room for inserting up to 3 new instructions between the original instructions. When we run out of indexes between two instructions, renumber locally using double-spaced indexes. The original quad-spacing means that we catch up quickly, and we only have to renumber a handful of instructions to get a monotonic sequence. This is much faster than renumbering the whole function as we did before. llvm-svn: 127023	2011-03-04 19:43:38 +00:00
Jakob Stoklund Olesen	61af2752b1	Symbolize the default instruction distance. llvm-svn: 127013	2011-03-04 18:36:51 +00:00
Jakob Stoklund Olesen	728fffc159	Deferred SlotIndex renumbering was a good idea but never used. llvm-svn: 127008	2011-03-04 18:08:32 +00:00
Jakob Stoklund Olesen	5bc7a96cca	Use an IndexedMap instead of a DenseMap for the live-out cache. This speeds up updateSSA() so it only accounts for 5% of the live range splitting time. llvm-svn: 126972	2011-03-04 00:15:36 +00:00
Bill Wendling	6e6a0422eb	There are times when the landing pad won't have a call to 'eh.selector' in it. It's been assumed up til now that it would be in its immediate successor. However, this isn't necessarily the case. It could be in one of its successor's successors. Modify the code to more thoroughly check for an 'eh.selector' call in successors. It only looks at a successor if we get there as a result of an unconditional branch. Testcase ObjC/exceptions-4.m in r126968. llvm-svn: 126969	2011-03-03 23:14:05 +00:00
Jakob Stoklund Olesen	d0930e03c1	Represent sentinel slot indexes with a null pointer. This is much faster than using a pointer to a ManagedStatic object accessed with a function call. The greedy register allocator is 5% faster overall just from the SlotIndex default constructor savings. llvm-svn: 126925	2011-03-03 05:40:04 +00:00
Jakob Stoklund Olesen	2e6407e9aa	Avoid comparing invalid slot indexes, and assert that it doesn't happen. The SlotIndex created by the default construction does not represent a position in the function, and it doesn't make sense to compare it to other indexes. llvm-svn: 126924	2011-03-03 05:18:19 +00:00
Jakob Stoklund Olesen	2a12e5adba	Optimize SlotIndex equality tests. IndexListEntries have unique indexes, so it is not necessary to dereference pointers to them. llvm-svn: 126923	2011-03-03 05:18:15 +00:00
Jakob Stoklund Olesen	4d3c996555	Move LiveIntervalMap::extendTo into LiveInterval itself. This method could probably be used by LiveIntervalAnalysis::shrinkToUses, and now it can use extendIntervalEndTo() which coalesces ranges. llvm-svn: 126803	2011-03-02 00:06:15 +00:00
Jim Grosbach	fbdcd70f4b	Generalize the register matching code in DAGISel a bit. llvm-svn: 126731	2011-03-01 01:37:19 +00:00
Cameron Zwarich	764320383d	Fix PR9324 / <rdar://problem/9052489> by handling the case where a PHI has no uses. llvm-svn: 126567	2011-02-27 08:06:01 +00:00
Cameron Zwarich	724eb8706a	Merge information about the number of zero, one, and sign bits of live-out registers at phis. This enables us to eliminate a lot of pointless zexts during the DAGCombine phase. This fixes <rdar://problem/8760114>. llvm-svn: 126380	2011-02-24 10:00:25 +00:00
Cameron Zwarich	e79a75febe	Add a mechanism for invalidating the LiveOutInfo of a PHI, and use it whenever a block is visited before all of its predecessors. llvm-svn: 126378	2011-02-24 10:00:16 +00:00
Cameron Zwarich	5c9384705f	Track blocks visited in reverse postorder. llvm-svn: 126377	2011-02-24 10:00:13 +00:00
Cameron Zwarich	3d2f99227a	Refactor the LiveOutInfo interface into a few methods on FunctionLoweringInfo and make the actual map private. llvm-svn: 126376	2011-02-24 10:00:08 +00:00
Stuart Hastings	c0c38e8673	Omit private_extern declarations of extern symbols; followup to r124468. Patch by Rafael Avila de Espindola! llvm-svn: 126297	2011-02-23 02:27:05 +00:00
Cameron Zwarich	bde7e8b3e0	MachineConstantPoolValues are not uniqued, so they need to be freed if they share entries. Add a DenseSet to MachineConstantPool for the MachineCPVs that it owns. This will hopefully fix the MC/ARM/elf-reloc-01.ll failure on the leaks bots. llvm-svn: 126218	2011-02-22 08:54:30 +00:00
Cameron Zwarich	c942ffcae4	Roll out r126169 and r126170 in an attempt to fix the selfhost bot. llvm-svn: 126185	2011-02-22 03:24:52 +00:00
Cameron Zwarich	63ed1f4c67	Merge information about the number of zero, one, and sign bits of live-out registers at phis. This enables us to eliminate a lot of pointless zexts during the DAGCombine phase. This fixes <rdar://problem/8760114>. llvm-svn: 126170	2011-02-22 00:46:27 +00:00
Devang Patel	d5c4589795	Revert r124611 - "Keep track of incoming argument's location while emitting LiveIns." In other words, do not keep track of argument's location. The debugger (gdb) is not prepared to see line table entries for arguments. For the debugger, "second" line table entry marks beginning of function body. This requires some coordination with debugger to get this working. - The debugger needs to be aware of prolog_end attribute attached with line table entries. - The compiler needs to accurately mark prolog_end in line table entries (at -O0 and at -O1+) llvm-svn: 126155	2011-02-21 23:21:26 +00:00
Devang Patel	d63bce18da	Do not lose debug info of an inlined function argument even if the argument is only used through GEPs. This time with a fix that avoids using invalidated DenseMap iterator. llvm-svn: 125984	2011-02-18 22:43:42 +00:00
Cameron Zwarich	f6fa19a03f	Roll out r125794 to help diagnose the llvm-gcc-i386-linux-selfhost failure. llvm-svn: 125830	2011-02-18 04:58:10 +00:00
Devang Patel	b6f55191b3	Do not lose debug info of an inlined function argument even if the argument is only used through GEPs. llvm-svn: 125794	2011-02-17 23:33:27 +00:00
Stuart Hastings	47e45a32a8	Swap VT and DebugLoc operands of getExtLoad() for consistency with other getNode() methods. Radar 9002173. llvm-svn: 125665	2011-02-16 16:23:55 +00:00
Jakob Stoklund Olesen	70f48f08c1	Move more fragments of spill weight calculation into CalcSpillWeights.h Simplify the spill weight calculation a bit by bypassing getApproximateInstructionCount() and using LiveInterval::getSize() directly. This changes the computed spill weights, but only by a constant factor in each function. It should not affect how spill weights compare against each other, and so it shouldn't affect code generation. llvm-svn: 125530	2011-02-14 23:15:38 +00:00
Chris Lattner	2552afcae6	fix two comment thinkos llvm-svn: 125481	2011-02-14 06:14:42 +00:00
Chris Lattner	f427f729cd	missed a header llvm-svn: 125471	2011-02-13 22:30:09 +00:00
Chris Lattner	4286019ee4	fix thinko :) llvm-svn: 125466	2011-02-13 19:53:36 +00:00
Chris Lattner	c9c0de6faf	Revisit my fix for PR9028: the issue is that DAGCombine was generating i8 shift amounts for things like i1024 types. Add an assert in getNode to prevent this from occuring in the future, fix the buggy transformation, revert my previous patch, and document this gotcha in ISDOpcodes.h llvm-svn: 125465	2011-02-13 19:09:16 +00:00
Jakob Stoklund Olesen	21624a745c	Move calcLiveBlockInfo() and the BlockInfo struct into SplitAnalysis. No functional changes intended. llvm-svn: 125231	2011-02-09 22:50:26 +00:00
Jakob Stoklund Olesen	fe0a7ea3aa	Add LiveIntervals::addKillFlags() to recompute kill flags after register allocation. This is a lot easier than trying to get kill flags right during live range splitting and rematerialization. llvm-svn: 125113	2011-02-08 21:13:03 +00:00
Jakob Stoklund Olesen	a5e0ea6e4e	Add LiveIntervals::shrinkToUses(). After uses of a live range are removed, recompute the live range to only cover the remaining uses. This is necessary after rematerializing the value before some (but not all) uses. llvm-svn: 125058	2011-02-08 00:03:05 +00:00
Devang Patel	930b4b16a1	Merge .debug_loc entries whenever possible to reduce debug_loc size. llvm-svn: 124904	2011-02-04 22:57:18 +00:00
Jakob Stoklund Olesen	bf833680ec	Add LiveIntervals::getLastSplitPoint(). A live range cannot be split everywhere in a basic block. A split must go before the first terminator, and if the variable is live into a landing pad, the split must happen before the call that can throw. llvm-svn: 124894	2011-02-04 19:33:11 +00:00
Andrew Trick	09aa9fe96b	Introducing a new method of tracking register pressure. We can't precisely track pressure on a selection DAG, but we can at least keep it balanced. This design accounts for various interesting aspects of selection DAGS: register and subregister copies, glued nodes, dead nodes, unused registers, etc. Added SUnit::NumRegDefsLeft and ScheduleDAGSDNodes::RegDefIter. Note: I disabled PrescheduleNodesWithMultipleUses when register pressure is enabled, based on no evidence other than I don't think it makes sense to have both enabled. llvm-svn: 124853	2011-02-04 03:18:17 +00:00
Eric Christopher	57e4dada99	Reapply this. llvm-svn: 124779	2011-02-03 06:18:29 +00:00
Eric Christopher	8082811b65	Temporarily revert 124765 in an attempt to find the cycle breaking bootstrap. llvm-svn: 124778	2011-02-03 05:40:54 +00:00
Jakob Stoklund Olesen	880fa5b5dc	Defer SplitKit value mapping until all defs are available. The greedy register allocator revealed some problems with the value mapping in SplitKit. We would sometimes start mapping values before all defs were known, and that could change a value from a simple 1-1 mapping to a multi-def mapping that requires ssa update. The new approach collects all defs and register assignments first without filling in any live intervals. Only when finish() is called, do we compute liveness and mapped values. At this time we know with certainty which values map to multiple values in a split range. This also has the advantage that we can compute live ranges based on the remaining uses after rematerializing at split points. The current implementation has many opportunities for compile time optimization. llvm-svn: 124765	2011-02-03 00:54:23 +00:00
Devang Patel	97c467ee47	Keep track of incoming argument's location while emitting LiveIns. llvm-svn: 124611	2011-01-31 21:38:14 +00:00
David Greene	5c173a307b	[AVX] Add INSERT_SUBVECTOR and support it on x86. This provides a default implementation for x86, going through the stack in a similr fashion to how the codegen implements BUILD_VECTOR. Eventually this will get matched to VINSERTF128 if AVX is available. llvm-svn: 124307	2011-01-26 19:13:22 +00:00
Devang Patel	134e5b7679	Provide an interface to transfer SDDbgValue from one SDNode to another. llvm-svn: 124245	2011-01-25 23:27:42 +00:00
Rafael Espindola	aefd549139	Delay the creation of eh_frame so that the user can change the defaults. Add support for SHT_X86_64_UNWIND. llvm-svn: 124059	2011-01-23 05:43:40 +00:00
Benjamin Kramer	beeff6bcf0	Remove dead ivar. llvm-svn: 124028	2011-01-22 12:13:28 +00:00
Andrew Trick	7155e98904	Convert -enable-sched-cycles and -enable-sched-hazard to -disable flags. They are still not enable in this revision. Added TargetInstrInfo::isZeroCost() to fix a fundamental problem with the scheduler's model of operand latency in the selection DAG. Generalized unit tests to work with sched-cycles. llvm-svn: 123969	2011-01-21 05:51:33 +00:00
Anton Korobeynikov	ef11a77938	Add CFI directives-based frame information emission. Not hooked yet. llvm-svn: 123474	2011-01-14 21:57:53 +00:00
Jakob Stoklund Olesen	0f2b9d9dc4	Teach frame lowering to ignore debug values after the terminators. llvm-svn: 123399	2011-01-13 21:28:52 +00:00
Jakob Stoklund Olesen	8c5c268f05	Annotate VirtRegRewriter debug output with slot indexes. llvm-svn: 123333	2011-01-12 22:28:48 +00:00
Jakob Stoklund Olesen	22bdcea2fd	Assert if anybody tries to put a slot index on a DBG_VALUE instruction. llvm-svn: 123323	2011-01-12 21:27:45 +00:00
Anton Korobeynikov	cf5967630b	Rename TargetFrameInfo into TargetFrameLowering. Also, put couple of FIXMEs and fixes here and there. llvm-svn: 123170	2011-01-10 12:39:04 +00:00
Jakob Stoklund Olesen	32f1783ca1	Simplify a bunch of isVirtualRegister() and isPhysicalRegister() logic. These functions not longer assert when passed 0, but simply return false instead. No functional change intended. llvm-svn: 123155	2011-01-10 02:58:51 +00:00
Jakob Stoklund Olesen	785d31a2d2	Remove MachineRegisterInfo::getLastVirtReg(), it was giving wrong results when no virtual registers have been allocated. It was only used to resize IndexedMaps, so provide an IndexedMap::resize() method such that Map.grow(MRI.getLastVirtReg()); can be replaced with the simpler Map.resize(MRI.getNumVirtRegs()); This works correctly when no virtuals are allocated, and it bypasses the to/from index conversions. llvm-svn: 123130	2011-01-09 21:58:20 +00:00
Jakob Stoklund Olesen	957748e7ac	Teach TargetRegisterInfo how to cram stack slot indexes in with the virtual and physical register numbers. This makes the hack used in LiveInterval official, and lets LiveInterval be oblivious of stack slots. The isPhysicalRegister() and isVirtualRegister() predicates don't know about this, so when a variable may contain a stack slot, isStackSlot() should always be tested first. llvm-svn: 123128	2011-01-09 21:17:37 +00:00
Jakob Stoklund Olesen	d4dcf22b65	Simplify LiveDebugVariables by storing MachineOperand copies locations instead of using a Location class with the same information. When making a copy of a MachineOperand that was already stored in a MachineInstr, it is necessary to clear the parent pointer on the copy. Otherwise the register use-def lists become inconsistent. Add MachineOperand::clearParent() to do that. An alternative would be a custom MachineOperand copy constructor that cleared ParentMI. I didn't want to do that because of the performance impact. llvm-svn: 123109	2011-01-09 05:33:21 +00:00
Jakob Stoklund Olesen	9a7e67d141	Use IndexedMap for MachineRegisterInfo as well. No functional change. llvm-svn: 123106	2011-01-09 03:05:46 +00:00
Jakob Stoklund Olesen	fb2b53c0de	Use an IndexedMap for LiveVariables::VirtRegInfo. Provide MRI::getNumVirtRegs() and TRI::index2VirtReg() functions to allow iteration over virtual registers without depending on the representation of virtual register numbers. llvm-svn: 123098	2011-01-08 23:10:57 +00:00
Jakob Stoklund Olesen	4bc0b62215	Do not talk about TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 123097	2011-01-08 23:10:53 +00:00
Jakob Stoklund Olesen	b3820cdc22	Use an IndexedMap for LiveOutRegInfo to hide its dependence on TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 123096	2011-01-08 23:10:50 +00:00
Evan Cheng	1afd04fc59	Recognize inline asm 'rev /bin/bash, ' as a bswap intrinsic call. llvm-svn: 123048	2011-01-08 01:24:27 +00:00
Evan Cheng	aa16fd02ad	Do not model all INLINEASM instructions as having unmodelled side effects. Instead encode llvm IR level property "HasSideEffects" in an operand (shared with IsAlignStack). Added MachineInstrs::hasUnmodeledSideEffects() to check the operand when the instruction is an INLINEASM. This allows memory instructions to be moved around INLINEASM instructions. llvm-svn: 123044	2011-01-07 23:50:32 +00:00
Evan Cheng	b960c4c2ae	Fix comment. INLINEASM node operand #3 is IsAlignStack bit. llvm-svn: 123036	2011-01-07 21:38:59 +00:00
Bob Wilson	bcbb3375dd	Change EXTRACT_SUBVECTOR to require a constant index. We were never generating any of these nodes with variable indices, and there was one legalizer function asserting on a non-constant index. If we ever have a need to support variable indices, we can add this back again. llvm-svn: 122993	2011-01-07 04:58:56 +00:00
Jakob Stoklund Olesen	7b1480ff12	Add the SpillPlacement analysis pass. This pass precomputes CFG block frequency information that can be used by the register allocator to find optimal spill code placement. Given an interference pattern, placeSpills() will compute which basic blocks should have the current variable enter or exit in a register, and which blocks prefer the stack. The algorithm is ready to consume block frequencies from profiling data, but for now it gets by with the static estimates used for spill weights. This is a work in progress and still not hooked up to RegAllocGreedy. llvm-svn: 122938	2011-01-06 01:21:53 +00:00
Wesley Peck	832c9e07a1	Fix small bug in setDebugInfoAvailability. llvm-svn: 122886	2011-01-05 17:01:57 +00:00
Jakob Stoklund Olesen	76e782c385	Use the EdgeBundles analysis in X86FloatingPoint instead of recomputing CFG bundles in the pass. llvm-svn: 122833	2011-01-04 21:10:11 +00:00
Jakob Stoklund Olesen	abf8941a60	Turn the EdgeBundles class into a stand-alone machine CFG analysis pass. The analysis will be needed by both the greedy register allocator and the X86FloatingPoint pass. It only needs to be computed once when the CFG doesn't change. This pass is very fast, usually showing up as 0.0% wall time. llvm-svn: 122832	2011-01-04 21:10:05 +00:00
Owen Anderson	8bc93c6fdf	Give MachineFunctionAnalysis a getPassName() implementation to make timing reports prettier. llvm-svn: 122816	2011-01-04 18:21:18 +00:00
Eric Christopher	61974eb1f8	Header warning patrol. llvm-svn: 122551	2010-12-25 02:38:01 +00:00
Andrew Trick	dfa31b1cf9	Minor cleanup related to my latest scheduler changes. llvm-svn: 122545	2010-12-24 07:10:19 +00:00
Andrew Trick	134b2a5907	Various bits of framework needed for precise machine-level selection DAG scheduling during isel. Most new functionality is currently guarded by -enable-sched-cycles and -enable-sched-hazard. Added InstrItineraryData::IssueWidth field, currently derived from ARM itineraries, but could be initialized differently on other targets. Added ScheduleHazardRecognizer::MaxLookAhead to indicate whether it is active, and if so how many cycles of state it holds. Added SchedulingPriorityQueue::HasReadyFilter to allowing gating entry into the scheduler's available queue. ScoreboardHazardRecognizer now accesses the ScheduleDAG in order to get information about it's SUnits, provides RecedeCycle for bottom-up scheduling, correctly computes scoreboard depth, tracks IssueCount, and considers potential stall cycles when checking for hazards. ScheduleDAGRRList now models machine cycles and hazards (under flags). It tracks MinAvailableCycle, drives the hazard recognizer and priority queue's ready filter, manages a new PendingQueue, properly accounts for stall cycles, etc. llvm-svn: 122541	2010-12-24 05:03:26 +00:00
Andrew Trick	53f4556c64	whitespace llvm-svn: 122539	2010-12-24 04:28:06 +00:00
Chris Lattner	b607e7deda	flags -> glue for selectiondag llvm-svn: 122509	2010-12-23 17:24:32 +00:00
Chris Lattner	fb9ff7a4ff	sdisel flag -> glue. llvm-svn: 122507	2010-12-23 17:13:18 +00:00
Chris Lattner	65c5243bd6	rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310	2010-12-21 02:38:05 +00:00
Jakob Stoklund Olesen	86786c46c2	Use IntEqClasses to compute connected components of live intervals. llvm-svn: 122296	2010-12-21 00:48:17 +00:00
Chris Lattner	47b7bc98ae	update comment. llvm-svn: 122212	2010-12-20 00:56:59 +00:00
Jakob Stoklund Olesen	2879da5e13	Pass a Banner argument to the machine code verifier both from createMachineVerifierPass and MachineFunction::verify. The banner is printed before the machine code dump, just like the printer pass. llvm-svn: 122113	2010-12-18 00:06:56 +00:00
Jakob Stoklund Olesen	6498db2c8c	Avoid dereferencing end() in collectInterferingVRegs() when there is no interference. llvm-svn: 122108	2010-12-17 23:16:38 +00:00
Jakob Stoklund Olesen	df9e162423	Enable loop splitting in RegAllocGreedy. The heuristics split around the largest loop where the current register may be allocated without interference. llvm-svn: 122106	2010-12-17 23:16:32 +00:00
Jakob Stoklund Olesen	f4a0c81371	Add MachineLoopRange comparators for sorting loop lists by number and by area. llvm-svn: 122073	2010-12-17 18:13:52 +00:00
Jakob Stoklund Olesen	40f23cd5ca	Provide LiveIntervalUnion::Query::checkLoopInterference. This is a three-way interval list intersection between a virtual register, a live interval union, and a loop. It will be used to identify interference-free loops for live range splitting. llvm-svn: 122034	2010-12-17 04:09:47 +00:00
Jakob Stoklund Olesen	d40af5ffbd	Add MachineLoopRanges analysis. A MachineLoopRange contains the intervals of slot indexes covered by the blocks in a loop. This representation of the loop blocks is more efficient to compare against interfering registers during register coalescing. llvm-svn: 121917	2010-12-15 23:41:23 +00:00
Jakob Stoklund Olesen	1fc1f0c4a0	Add SlotIndexes::getMBBRange() to get the range of a basic block in a single lookup. llvm-svn: 121893	2010-12-15 20:40:22 +00:00
Rafael Espindola	0e665e502d	Fixed version of 121434 with no new memory leaks. llvm-svn: 121471	2010-12-10 07:39:47 +00:00
Rafael Espindola	011e168728	Revert my previous patch to make the valgrind bots happy. llvm-svn: 121461	2010-12-10 04:01:09 +00:00
Rafael Espindola	03ad1e8f1f	Initial support for the cfi directives. This is just enough to get f: .cfi_startproc nop .cfi_endproc assembled (on ELF). llvm-svn: 121434	2010-12-09 23:48:29 +00:00
Lang Hames	334ef20886	Fixed some dependencies in RegAllocPBQP.h . Thanks to Borja Ferrer for pointing out this issue. llvm-svn: 121292	2010-12-08 22:15:32 +00:00
Andrew Trick	fb72ca2129	Generalize PostRAHazardRecognizer so it can be used in any pass for both forward and backward scheduling. Rename it to ScoreboardHazardRecognizer (Scoreboard is one word). Remove integer division from the scoreboard's critical path. llvm-svn: 121274	2010-12-08 20:04:29 +00:00
Jakob Stoklund Olesen	d638b989f2	Stub out RegAllocGreedy. This new register allocator is initially identical to RegAllocBasic, but it will receive all of the tricks that RegAllocBasic won't get. RegAllocGreedy will eventually replace linear scan. llvm-svn: 121234	2010-12-08 03:26:16 +00:00
Jakob Stoklund Olesen	54b6cd6d38	Implement the first half of LiveDebugVariables. Scan the MachineFunction for DBG_VALUE instructions, and replace them with a data structure similar to LiveIntervals. The live range of a DBG_VALUE is determined by propagating it down the dominator tree until a new DBG_VALUE is found. When a DBG_VALUE lives in a register, its live range is confined to the live range of the register's value. LiveDebugVariables runs before coalescing, so DBG_VALUEs are not artificially extended when registers are joined. The missing half will recreate DBG_VALUE instructions from the intervals when register allocation is complete. The pass is disabled by default. It can be enabled with the temporary command line option -live-debug-variables. llvm-svn: 120636	2010-12-02 00:37:37 +00:00
Evan Cheng	f7e586d749	Enable sibling call optimization of libcalls which are expanded during legalization time. Since at legalization time there is no mapping from SDNode back to the corresponding LLVM instruction and the return SDNode is target specific, this requires a target hook to check for eligibility. Only x86 and ARM support this form of sibcall optimization right now. rdar://8707777 llvm-svn: 120501	2010-11-30 23:55:39 +00:00
Michael J. Spencer	d5ec932c3a	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Benjamin Kramer	8689c9176b	SDep is POD-like. Shave off a few bytes from SUnit by moving a member around. llvm-svn: 120150	2010-11-25 17:50:19 +00:00
Wesley Peck	d589353ad0	Renaming ISD::BIT_CONVERT to ISD::BITCAST to better reflect the LLVM IR concept. llvm-svn: 119990	2010-11-23 03:31:01 +00:00
Chris Lattner	1161b003a0	add some helper methods for asmprinter flags, from PR8417 llvm-svn: 119932	2010-11-21 08:30:55 +00:00
Duncan Sands	028cf0619e	On X86, MEMBARRIER, MFENCE, SFENCE, LFENCE are not target memory intrinsics, so don't claim they are. They are allocated using DAG.getNode, so attempts to access MemSDNode fields results in reading off the end of the allocated memory. This fixes crashes with "llc -debug" due to debug code trying to print MemSDNode fields for these barrier nodes (since the crashes are not deterministic, use valgrind to see this). Add some nasty checking to try to catch this kind of thing in the future. llvm-svn: 119901	2010-11-20 11:25:00 +00:00
Dan Gohman	3998a0430f	Rename ExpandPseudos to ExpandISelPseudos to help clarify its role. llvm-svn: 119716	2010-11-18 18:45:06 +00:00
Chris Lattner	003e3db609	refactor the interface to EmitInlineAsm a bit, no functionality change. llvm-svn: 119482	2010-11-17 07:53:40 +00:00
Dan Gohman	52a761760d	Split pseudo-instruction expansion into a separate pass, to make it easier to debug, and to avoid complications when the CFG changes in the middle of the instruction selection process. llvm-svn: 119382	2010-11-16 21:02:37 +00:00
Chris Lattner	51168d6510	move the pic base symbol stuff up to MachineFunction since it is trivial and will be shared between ppc and x86. This substantially simplifies the X86 backend also. llvm-svn: 119089	2010-11-14 22:48:15 +00:00
Chris Lattner	ce47bb4409	add operand iterator apis to MachineInstr, patch by ether zhhb. llvm-svn: 118862	2010-11-12 00:00:21 +00:00
Jakob Stoklund Olesen	313b78d28e	Insert two blank SlotIndexes between basic blocks instead of just one. This is the first small step towards using closed intervals for liveness instead of the half-open intervals we're using now. We want to be able to distinguish between a SlotIndex that represents a variable being live-out of a basic block, and an index representing a variable live-in to its successor. That requires two separate indexes between blocks. One for live-outs and one for live-ins. With this change, getMBBEndIdx(MBB).getPrevSlot() becomes stable so it stays greater than any instructions inserted at the end of MBB. llvm-svn: 118747	2010-11-11 00:19:20 +00:00
Jakob Stoklund Olesen	3eb4a7b12d	Delete unused function. llvm-svn: 118743	2010-11-10 23:56:02 +00:00
Andrew Trick	9d60f59b55	RABasic is nearly functionally complete. There are a few remaining benchmarks hitting an assertion. Adds LiveIntervalUnion::collectInterferingVRegs. Fixes "late spilling" by checking for any unspillable live vregs among all physReg aliases. llvm-svn: 118701	2010-11-10 19:18:47 +00:00
Benjamin Kramer	96ac873014	Prune includes. llvm-svn: 118342	2010-11-06 11:45:59 +00:00
Duncan Sands	3bf2a701a5	In the calling convention logic, ValVT is always a legal type, and as such can be represented by an MVT - the more complicated EVT is not needed. Use MVT for ValVT everywhere. llvm-svn: 118245	2010-11-04 10:49:57 +00:00
Duncan Sands	41edf30895	Simplify uses of MVT and EVT. An MVT can be compared directly with a SimpleValueType, while an EVT supports equality and inequality comparisons with SimpleValueType. llvm-svn: 118169	2010-11-03 12:17:33 +00:00
Duncan Sands	4bbe978c7c	Fix a comment typo. llvm-svn: 118168	2010-11-03 11:55:03 +00:00
Duncan Sands	f6e5e02c9b	Inside the calling convention logic LocVT is always a simple value type, so there is no point in passing it around using an EVT. Use the simpler MVT everywhere. Rather than trying to propagate this information maximally in all the code that using the calling convention stuff, I chose to do a mainly low impact change instead. llvm-svn: 118167	2010-11-03 11:35:31 +00:00
Evan Cheng	67db408634	Two sets of changes. Sorry they are intermingled. 1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to "optimize for latency". Call instructions don't have the right latency and this is more likely to use introduce spills. 2. Fix if-converter cost function. For ARM, it should use instruction latencies, not # of micro-ops since multi-latency instructions is completely executed even when the predicate is false. Also, some instruction will be "slower" when they are predicated due to the register def becoming implicit input. rdar://8598427 llvm-svn: 118135	2010-11-03 00:45:17 +00:00
Duncan Sands	1651d8cdb3	Add some comments explaining what MVT and EVT are, and how they differ. llvm-svn: 118014	2010-11-02 13:57:09 +00:00
Duncan Sands	c56946f7c5	Remove trailing whitespace. llvm-svn: 118013	2010-11-02 13:43:07 +00:00
Nicolas Geoffray	6889997474	Attach a GCModuleInfo to a MachineFunction. llvm-svn: 117867	2010-10-31 20:38:38 +00:00
Duncan Sands	812f6878ea	Explain the return value of CCAssignFn. llvm-svn: 117854	2010-10-31 10:29:14 +00:00
Chris Lattner	ee8dea6453	Rename alignof -> alignOf to avoid irritating C++'0x compilers, PR8423, patch by nobled. llvm-svn: 117774	2010-10-30 05:14:01 +00:00
Jakob Stoklund Olesen	0ab92619d0	Add SkipPHIsAndLabels from PHIElimination to MachineBasicBlock. It is needed elsewhere. llvm-svn: 117763	2010-10-30 01:26:14 +00:00
John Thompson	6115a7f1d4	Inline asm multiple alternative constraints development phase 2 - improved basic logic, added initial platform support. llvm-svn: 117667	2010-10-29 17:29:13 +00:00
Jakob Stoklund Olesen	1210a5145a	Print out the connected components in the verifier after complaining about their multiplicity. llvm-svn: 117630	2010-10-29 00:40:57 +00:00
Dale Johannesen	e7f07349e4	Use a MemIntrinsicSDNode for ISD::PREFETCH, which touches memory, so a MachineMemOperand is useful (not propagated into the MachineInstr yet). No functional change except for dump output. llvm-svn: 117413	2010-10-26 23:11:10 +00:00
Jakob Stoklund Olesen	3a4c0c13eb	Teach MachineBasicBlock::print() to annotate instructions and blocks with SlotIndexes when available. llvm-svn: 117392	2010-10-26 20:21:46 +00:00
Jakob Stoklund Olesen	3988c3fb55	Make the spiller responsible for updating the LiveStacks analysis. llvm-svn: 117337	2010-10-26 00:11:33 +00:00
Devang Patel	fa145a94d1	Simplify. Do not count use of sdisel for single call instruction. llvm-svn: 117316	2010-10-25 21:31:46 +00:00
Devang Patel	206643ef76	Update SelectBasicBlock signature. This should have been committed with r117310. llvm-svn: 117312	2010-10-25 21:04:12 +00:00
Andrew Trick	7a1dadd47d	This is a prototype of an experimental register allocation framework. It's purpose is not to improve register allocation per se, but to make it easier to develop powerful live range splitting. I call it the basic allocator because it is as simple as a global allocator can be but provides the building blocks for sophisticated register allocation with live range splitting. A minimal implementation is provided that trivially spills whenever it runs out of registers. I'm checking in now to get high-level design and style feedback. I've only done minimal testing. The next step is implementing a "greedy" allocation algorithm that does some register reassignment and makes better splitting decisions. llvm-svn: 117174	2010-10-22 23:09:15 +00:00
Evan Cheng	20b70697bb	Transfer implicit ops when forming load multiple and return instructions. llvm-svn: 117151	2010-10-22 21:29:58 +00:00
Michael J. Spencer	b9cffadc06	CodeGen-Windows: Only emit _fltused if a VarArg function is called with floating point args. This should be the minimum set of functions that could possibly need it. llvm-svn: 116978	2010-10-21 00:08:21 +00:00
Dan Gohman	c781a28a1d	Make CodeGen TBAA-aware. llvm-svn: 116890	2010-10-20 00:31:05 +00:00
Jim Grosbach	b390dd1bd5	Spelling typo fix. s/incput/input/. Thanks, Bob! llvm-svn: 116880	2010-10-19 23:39:23 +00:00
Jim Grosbach	a8c0be5343	Add a pre-dispatch SjLj EH hook on the unwind edge for targets to do any setup they require. Use this for ARM/Darwin to rematerialize the base pointer from the frame pointer when required. rdar://8564268 llvm-svn: 116879	2010-10-19 23:27:08 +00:00
Jakob Stoklund Olesen	02d7f65c49	Shrink MachineOperand from 40 to 32 bytes on 64-bit hosts. Pull an unsigned out of the Contents union such that it has the same size as two pointers and no padding. Arrange members such that the Contents union and all pointers can be 8-byte aligned without padding. This speeds up code generation by 0.8% on a 64-bit host. 32-bit hosts should be unaffected. llvm-svn: 116857	2010-10-19 20:56:32 +00:00
Owen Anderson	46990c17f7	Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize the pass's dependencies. Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h before parsing commandline arguments. I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass registration/creation, please send the testcase to me directly. llvm-svn: 116820	2010-10-19 17:21:58 +00:00
Michael J. Spencer	e57b670425	X86-Windows: Emit an undefined global __fltused symbol when targeting Windows if any floating point arguments are passed to an external function. llvm-svn: 116665	2010-10-16 08:25:41 +00:00
Michael J. Spencer	16ad2c129c	Whitespace! llvm-svn: 116664	2010-10-16 08:25:21 +00:00
Dan Gohman	d904add908	Initial va_arg support for x86-64. Patch by David Meyer! llvm-svn: 116319	2010-10-12 18:00:49 +00:00
Chris Lattner	f97743c49d	tweak comment. llvm-svn: 116192	2010-10-11 05:48:00 +00:00
Jakob Stoklund Olesen	1d81101e97	After splitting, the remaining LiveInterval may be fragmented into multiple connected components. These components should be allocated different virtual registers because there is no reason for them to be allocated together. Add the ConnectedVNInfoEqClasses class to calculate the connected components, and move values to new LiveIntervals. Use it from SplitKit::rewrite by creating new virtual registers for the components. llvm-svn: 116006	2010-10-07 23:34:34 +00:00
Jakob Stoklund Olesen	6b4557461f	Add MachineRegisterInfo::constrainRegClass and use it in MachineCSE. This function is intended to be used when inserting a machine instruction that trivially restricts the legal registers, like LEA requiring a GR32_NOSP argument. llvm-svn: 115875	2010-10-06 23:54:39 +00:00
Dan Gohman	57f707c6a7	ComputeLinearIndex doesn't need its TLI argument. llvm-svn: 115792	2010-10-06 16:18:29 +00:00
Jakob Stoklund Olesen	cf5ec4b4cd	When RemoveCopyByCommutingDef is creating additional identity copies, just use LiveInterval::MergeValueNumberInto instead of trying to extend LiveRanges and getting it wrong. This fixed PR8249 where a valno with a multi-segment live range was defined by an identity copy created by RemoveCopyByCommutingDef. Some of the live segments disappeared. llvm-svn: 115385	2010-10-01 23:52:25 +00:00
Jakob Stoklund Olesen	53ffe6c58b	Avoid using VNInfo::getCopy as much as possible. I want to get rid of it. llvm-svn: 114794	2010-09-25 18:10:38 +00:00
Lang Hames	fb22f00975	Removed VNInfo::isDefAccurate(). Def "accuracy" can be checked by testing whether LiveIntervals::getInstructionFromIndex(def) returns NULL. llvm-svn: 114791	2010-09-25 12:04:16 +00:00
Jakob Stoklund Olesen	af7994784c	Remove SlotIndex::PHI_BIT. It is no longer used by anything. llvm-svn: 114779	2010-09-25 00:45:18 +00:00
Jakob Stoklund Olesen	794b5e00d7	Terminator gaps were unused. Might as well delete them. llvm-svn: 114776	2010-09-24 23:58:56 +00:00
Nicolas Geoffray	3a40b52aea	Attach a DebugLoc to a GC point in order to get precise information in the JIT of a GC point. llvm-svn: 114736	2010-09-24 17:27:50 +00:00
Lang Hames	f670bff621	Moved the PBQP allocator class out of the header and back in to the cpp file to hide the gory details. Allocator instances can now be created by calling createPBQPRegisterAllocator. Tidied up use of CoalescerPair as per Jakob's suggestions. Made the new PBQPBuilder based construction process the default. The internal construction process remains in-place and available via -pbqp-builder=false for now. It will be removed shortly if the new process doesn't cause any regressions. llvm-svn: 114626	2010-09-23 04:28:54 +00:00
Chris Lattner	6543dacfac	Rework passing parent pointers into complexpatterns, I forgot that complex patterns are matched after the entire pattern has a structural match, therefore the NodeStack isn't in a useful state when the actual call to the matcher happens. llvm-svn: 114489	2010-09-21 22:00:25 +00:00
Devang Patel	904f538a7a	Add insertAfter. This should have accompanied previous check-in. llvm-svn: 114481	2010-09-21 21:10:42 +00:00
Chris Lattner	a911c9ed3a	just like they can opt into getting the root of the pattern being matched, allow ComplexPatterns to opt into getting the parent node of the operand being matched. llvm-svn: 114472	2010-09-21 20:37:12 +00:00
Chris Lattner	32ec32b690	finish pushing MachinePointerInfo through selectiondags. At this point, I think I've audited all uses, so it should be dependable for address spaces, and the pointer+offset info should also be accurate when there. llvm-svn: 114464	2010-09-21 18:58:22 +00:00
Chris Lattner	3dde58c15a	convert a couple more places to use the new getStore() llvm-svn: 114463	2010-09-21 18:51:21 +00:00
Chris Lattner	86b3f287ce	eliminate an old SelectionDAG::getTruncStore method, propagating MachinePointerInfo around more. llvm-svn: 114452	2010-09-21 17:42:31 +00:00
Chris Lattner	bf98f86fed	eliminate last SelectionDAG::getLoad old entrypoint, on to stores. llvm-svn: 114450	2010-09-21 17:28:52 +00:00
Chris Lattner	8af4fb7aed	fix the code that infers SV info to be correct when dealing with an indexed load/store that has an offset in the index. llvm-svn: 114449	2010-09-21 17:24:05 +00:00
Jakob Stoklund Olesen	03451a0e51	Add LiveInterval::find and use it for most LiveRange searching operations instead of calling lower_bound or upper_bound directly. This cleans up the search logic a bit because {lower,upper}_bound compare LR->start by default, and it is usually simpler to search LR->end. Funnelling all searches through one function also makes it possible to replace the search algorithm with something faster than binary search. llvm-svn: 114448	2010-09-21 17:12:18 +00:00
Jakob Stoklund Olesen	73d2940daa	Remove dead method. llvm-svn: 114447	2010-09-21 17:12:15 +00:00
Chris Lattner	cdfd993df0	propagate MachinePointerInfo through various uses of the old SelectionDAG::getExtLoad overload, and eliminate it. llvm-svn: 114446	2010-09-21 17:04:51 +00:00
Chris Lattner	0d430648ae	continue MachinePointerInfo'izing, eliminating use of one of the old getLoad overloads. llvm-svn: 114443	2010-09-21 16:36:31 +00:00
Lang Hames	eae68e1117	Added an additional PBQP problem builder which adds coalescing costs (both between pairs of virtuals, and between virtuals and physicals). llvm-svn: 114429	2010-09-21 13:19:36 +00:00
Chris Lattner	1cad885bf7	add some accessors llvm-svn: 114409	2010-09-21 06:43:24 +00:00
Chris Lattner	112cf9bc89	it's more elegant to put the "getConstantPool" and "getFixedStack" on the MachinePointerInfo class. While this isn't the problem I'm setting out to solve, it is the right way to eliminate PseudoSourceValue, so lets go with it. llvm-svn: 114406	2010-09-21 06:22:23 +00:00
Chris Lattner	3496d7e718	ugh, missed a file. llvm-svn: 114405	2010-09-21 06:16:40 +00:00
Chris Lattner	f94de5bf46	reimplement memcpy/memmove/memset lowering to use MachinePointerInfo instead of srcvalue/offset pairs. This corrects SV info for mem operations whose size is > 32-bits. llvm-svn: 114401	2010-09-21 05:40:29 +00:00
Chris Lattner	b6d15db75c	add some helpful accessors. llvm-svn: 114400	2010-09-21 05:39:30 +00:00
Chris Lattner	dbe51ad1b8	add overloads for SelectionDAG::getLoad, getStore, getTruncStore that take a MachinePointerInfo. Among other virtues, this doesn't silently truncate the svoffset to 32-bits. llvm-svn: 114399	2010-09-21 05:10:45 +00:00
Chris Lattner	e1fc671030	simplify interface to SelectionDAG::getMemIntrinsicNode, making it take a MachinePointerInfo llvm-svn: 114397	2010-09-21 04:57:15 +00:00
Chris Lattner	e4db4cad3b	chagne interface to SelectionDAG::getAtomic to take a MachinePointerInfo, eliminating some weird "infer a frame address" logic which was dead. llvm-svn: 114396	2010-09-21 04:53:42 +00:00
Chris Lattner	af01f8d142	force clients of MachineFunction::getMachineMemOperand to provide a MachinePointerInfo, propagating the type out a level of API. Remove the old MachineFunction::getMachineMemOperand impl. llvm-svn: 114393	2010-09-21 04:46:39 +00:00
Chris Lattner	940c35a3c3	start pushing MachinePointerInfo out through the MachineMemOperand interface to the MachineFunction construction methods. llvm-svn: 114390	2010-09-21 04:32:08 +00:00
Chris Lattner	7fdf193383	refactor the Value/offset pair from MachineMemOperand out to a new MachinePointerInfo struct, no functionality change. This also adds an assert to MachineMemOperand::MachineMemOperand that verifies that the Value is either null or is an IR pointer type. llvm-svn: 114389	2010-09-21 04:23:39 +00:00
Lang Hames	4a8c999803	Added a separate class (PBQPBuilder) for PBQP Problem construction. This class can be extended to support custom constraints. For now the allocator still uses the old (internal) construction mechanism by default. This will be phased out soon assuming no issues with the builder system come up. To invoke the new construction mechanism just pass '-regalloc=pbqp -pbqp-builder' to llc. To provide custom constraints a Target just needs to extend PBQPBuilder and pass an instance of their derived builder to the RegAllocPBQP constructor. llvm-svn: 114272	2010-09-18 09:07:10 +00:00
Gabor Greif	61838d80cd	fix comments; patch by Edmund Grimley-Evans\! llvm-svn: 114189	2010-09-17 17:52:00 +00:00
Jim Grosbach	611e7708d3	trailing whitespace llvm-svn: 113975	2010-09-15 16:08:15 +00:00
Benjamin Kramer	9859d9eee4	Fix linux/msvc build, move include. llvm-svn: 113776	2010-09-13 20:04:49 +00:00
Owen Anderson	9134dcb6a7	Attempt to fix the Linux build. llvm-svn: 113773	2010-09-13 19:47:32 +00:00
Gabor Greif	dfe6dea95f	typoes llvm-svn: 113647	2010-09-10 22:25:58 +00:00
Dale Johannesen	545bd92baf	x86mmx is 64 bits. llvm-svn: 113594	2010-09-10 17:51:47 +00:00
Evan Cheng	c9cb37516d	Teach if-converter to be more careful with predicating instructions that would take multiple cycles to decode. For the current if-converter clients (actually only ARM), the instructions that are predicated on false are not nops. They would still take machine cycles to decode. Micro-coded instructions such as LDM / STM can potentially take multiple cycles to decode. If-converter should take treat them as non-micro-coded simple instructions. llvm-svn: 113570	2010-09-10 01:29:16 +00:00
Jakob Stoklund Olesen	6ddb47f4ee	Rearrange for better alignment and less padding llvm-svn: 113445	2010-09-08 23:54:00 +00:00
Jakob Stoklund Olesen	0d27bf88e2	Remove dead code and data. llvm-svn: 113411	2010-09-08 21:21:28 +00:00
Jakob Stoklund Olesen	db1636ff8c	Remove dead code. llvm-svn: 113386	2010-09-08 18:50:24 +00:00
Bill Wendling	aea333c247	Remove untrue comments. llvm-svn: 113287	2010-09-07 21:07:59 +00:00
Bill Wendling	9bb7ac566f	Add an MVT::x86mmx type. It will take the place of all current MMX vector types. llvm-svn: 113261	2010-09-07 20:03:56 +00:00
Chris Lattner	684ae57b8e	implement rdar://6653118 - fastisel should fold loads where possible. Since mem2reg isn't run at -O0, we get a ton of reloads from the stack, for example, before, this code: int foo(int x, int y, int z) { return x+y+z; } used to compile into: _foo: ## @foo subq $12, %rsp movl %edi, 8(%rsp) movl %esi, 4(%rsp) movl %edx, (%rsp) movl 8(%rsp), %edx movl 4(%rsp), %esi addl %edx, %esi movl (%rsp), %edx addl %esi, %edx movl %edx, %eax addq $12, %rsp ret Now we produce: _foo: ## @foo subq $12, %rsp movl %edi, 8(%rsp) movl %esi, 4(%rsp) movl %edx, (%rsp) movl 8(%rsp), %edx addl 4(%rsp), %edx ## Folded load addl (%rsp), %edx ## Folded load movl %edx, %eax addq $12, %rsp ret Fewer instructions and less register use = faster compiles. llvm-svn: 113102	2010-09-05 02:18:34 +00:00
Jakob Stoklund Olesen	14f6dc4465	Remove dead code. Clobber ranges are no longer used when joining physical registers. Instead, all aliases are checked for interference. llvm-svn: 113084	2010-09-04 21:09:33 +00:00
Jim Grosbach	1ebe0e2667	Add 'const' to getter function. llvm-svn: 112984	2010-09-03 18:17:16 +00:00
Devang Patel	bbbd35d042	Fix .debug_range for linux. Patch by Krister Wombell. llvm-svn: 112830	2010-09-02 16:43:44 +00:00
Devang Patel	46570e6783	Reapply r112623. Included additional check for unused byval argument. llvm-svn: 112659	2010-08-31 22:22:42 +00:00
Devang Patel	b94251aea0	Revert r112623. It is causing self host build failures. llvm-svn: 112631	2010-08-31 19:41:03 +00:00
Devang Patel	414cbc940a	Remember byval argument's frame index during argument lowering and use this info to emit debug info. Fixes Radar 8367011. llvm-svn: 112623	2010-08-31 18:50:09 +00:00
Duncan Sands	2a1c11e104	Stop using the dom frontier in DwarfEHPrepare by not promoting alloca's any more. I plan to reimplement alloca promotion using SSAUpdater later. It looks like Bill's URoR logic really always needs domtree, so the pass now always asks for domtree info. llvm-svn: 112597	2010-08-31 09:05:06 +00:00
Bruno Cardoso Lopes	ebe80d78ff	zap unused method. x86 is the only user and already has a more powerfull version llvm-svn: 112571	2010-08-31 02:36:20 +00:00
Chris Lattner	e667efb462	nuke dead ivar which was supposed to be committed with r112496 llvm-svn: 112497	2010-08-30 18:16:27 +00:00
Eric Christopher	67801775eb	Fix a couple of typos. Patch by Cameron Esfahani! llvm-svn: 112297	2010-08-27 21:38:11 +00:00
Bruno Cardoso Lopes	6150648a64	zap the now unused MVT::getIntVectorWithNumElements llvm-svn: 112218	2010-08-26 20:53:12 +00:00
Chris Lattner	56bc3bc1af	tidy up llvm-svn: 112099	2010-08-25 22:45:53 +00:00
Jim Grosbach	33f85d977a	Remove the MFI storage of the local allocation block size. It's not needed. llvm-svn: 111847	2010-08-23 21:29:29 +00:00
Bruno Cardoso Lopes	28d9071635	This is the first step towards refactoring the x86 vector shuffle code. The general idea here is to have a group of x86 target specific nodes which are going to be selected during lowering and then directly matched in isel. The commit includes the addition of those specific nodes and a bunch of patterns, and incrementally we're going to switch between them and what we have right now. Both the patterns and target specific nodes can change as we move forward with this work. llvm-svn: 111691	2010-08-20 22:55:05 +00:00
Jim Grosbach	079599c699	Add explicit initializer for UseLocalStackAllocationBlock in MFI constructor llvm-svn: 111655	2010-08-20 17:34:22 +00:00
Bob Wilson	0498520f7c	Update comment to remove special case for vector extending loads. An extending vector load should extend each element in the same way as the corresponding scalar extending load. llvm-svn: 111577	2010-08-19 23:39:00 +00:00
Jim Grosbach	d6e0ffd95b	Update local stack block allocation to let PEI do the allocs if no additional base registers were required. This will allow for slightly better packing of the locals when alignment padding is necessary after callee saved registers. llvm-svn: 111508	2010-08-19 02:47:08 +00:00
Jim Grosbach	ea414d3999	Better handle alignment requirements for local objects in pre-regalloc frame mapping. Have the local block track its alignment requirement, and then apply that when the block itself is allocated. Previously, offsets could get adjusted in PEI to be different, relative to one another, than the block allocation thought they would be, which defeats the point of doing the allocation this way. Continuing rdar://8277890 llvm-svn: 111197	2010-08-16 22:30:41 +00:00
Jim Grosbach	33f86ffe9f	track local frame size in MFI, not local to the pass, since PEI needs it. llvm-svn: 111164	2010-08-16 18:06:15 +00:00
Jim Grosbach	a4d3174cba	Add a local stack object block allocation pass. This is still an experimental pass that allocates locals relative to one another before register allocation and then assigns them to actual stack slots as a block later in PEI. This will eventually allow targets with limited index offset range to allocate additional base registers (not just FP and SP) to more efficiently reference locals, as well as handle situations where locals cannot be referenced via SP or FP at all (dynamic stack realignment together with variable sized objects, for example). It's currently incomplete and almost certainly buggy. Work in progress. Disabled by default and gated via the -enable-local-stack-alloc command line option. rdar://8277890 llvm-svn: 111059	2010-08-14 00:15:52 +00:00
Jim Grosbach	b1e8749e37	tidy up comments llvm-svn: 111040	2010-08-13 20:32:35 +00:00
Jim Grosbach	e76c0d6dee	tidy up 80 column and whitespace llvm-svn: 111033	2010-08-13 20:08:59 +00:00
Jakob Stoklund Olesen	1337aa8e38	Also recompute HasPHIKill flags in LiveInterval::RenumberValues. If a phi-def value were removed from the interval, the phi-kill flags are no longer valid. llvm-svn: 110949	2010-08-12 20:38:03 +00:00
Jakob Stoklund Olesen	cbb21e8c0e	Remove trailing whitespace. llvm-svn: 110944	2010-08-12 20:01:23 +00:00
Jakob Stoklund Olesen	ccf528b792	Fix a FIXME. The SlotIndex::Slot enum should be private. llvm-svn: 110826	2010-08-11 16:50:17 +00:00
Jakob Stoklund Olesen	245a1faf76	Implement register class inflation. When splitting a live range, the new registers have fewer uses and the permissible register class may be less constrained. Recompute the register class constraint from the uses of new registers created for a split. This may let them be allocated from a larger set, possibly avoiding a spill. llvm-svn: 110703	2010-08-10 18:37:40 +00:00
Jakob Stoklund Olesen	e51a747336	Recalculate the spill weight and allocation hint for virtual registers created during live range splitting. llvm-svn: 110686	2010-08-10 17:07:22 +00:00
Jakob Stoklund Olesen	1ab2fab3af	Transpose the calculation of spill weights such that we are calculating one register at a time. This turns out to be slightly faster than iterating over instructions, but more importantly, it allows us to compute spill weights for new registers created after the spill weight pass has run. Also compute the allocation hint at the same time as the spill weight. This allows us to use the spill weight as a cost metric for copies, and choose the most profitable hint if there is more than one possibility. The new hints provide a very small (< 0.1%) but universal code size improvement. llvm-svn: 110631	2010-08-10 00:02:26 +00:00
Bill Wendling	8a7a43a1cb	Merge the OptimizeExts and OptimizeCmps passes into one PeepholeOptimizer pass. This pass should expand with all of the small, fine-grained optimization passes to reduce compile time and increase happiment. llvm-svn: 110627	2010-08-09 23:59:04 +00:00
Dan Gohman	1d48a4b1d7	Tidy some #includes and forward-declarations, and move the C binding code out of PassManager.cpp and into Core.cpp with the rest of the C binding code. llvm-svn: 110494	2010-08-07 00:43:20 +00:00
Jim Grosbach	e4f646b03f	tidy up llvm-svn: 110476	2010-08-06 21:31:35 +00:00
Jakob Stoklund Olesen	a37c7509bf	Add LiveInterval::RenumberValues - Garbage collection for VNInfos. After heavy editing of a live interval, it is much easier to simply renumber the live values instead of trying to keep track of the unused ones. llvm-svn: 110463	2010-08-06 18:46:59 +00:00
Owen Anderson	f2fea95f2f	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Rafael Espindola	6d53fded19	Fix eabi calling convention when a 64 bit value shadows r3. Without this what was happening was: * R3 is not marked as "used" * ARM backend thinks it has to save it to the stack because of vaarg * Offset computation correctly ignores it * Offsets are wrong llvm-svn: 110446	2010-08-06 15:35:32 +00:00
Bill Wendling	0cd2ae5158	Add the Optimize Compares pass (disabled by default). This pass tries to remove comparison instructions when possible. For instance, if you have this code: sub r1, 1 cmp r1, 0 bz L1 and "sub" either sets the same flag as the "cmp" instruction or could be converted to set the same flag, then we can eliminate the "cmp" instruction all together. This is a important for ARM where the ALU instructions could set the CPSR flag, but need a special suffix ('s') to do so. llvm-svn: 110423	2010-08-06 01:32:48 +00:00
Owen Anderson	aadd8a89ca	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Owen Anderson	b9762c07cb	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Jakob Stoklund Olesen	21e64c3fae	Remove double-def checking from MachineVerifier, so a register does not have to be killed before being redefined. These checks are usually disabled, and usually fail when enabled. We de facto allow live registers to be redefined without a kill, the corresponding assertions in RegScavenger were removed long ago. llvm-svn: 110362	2010-08-05 18:59:59 +00:00
Bill Wendling	bb2398331b	It's better to have the arrays, which would trigger the creation of stack protectors, to be near the stack protectors on the stack. Accomplish this by tagging the stack object with a predicate that indicates that it would trigger this. In the prolog-epilog inserter, assign these objects to the stack after the stack protector but before the other objects. llvm-svn: 109481	2010-07-27 01:55:19 +00:00
Lang Hames	998b522009	Factored out a bit of common code to mark VNInfos for deletion. llvm-svn: 109388	2010-07-26 01:49:41 +00:00
Evan Cheng	a0b74d8804	Add an ILP scheduler. This is a register pressure aware scheduler that's appropriate for targets without detailed instruction iterineries. The scheduler schedules for increased instruction level parallelism in low register pressure situation; it schedules to reduce register pressure when the register pressure becomes high. On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2 by 16%. llvm-svn: 109300	2010-07-24 00:39:05 +00:00
Lang Hames	6c2677e83c	If 'other' was empty 'overlapsFrom(other, other.begin());' will segfault. This avoids that. llvm-svn: 109075	2010-07-22 02:05:10 +00:00
Jim Grosbach	489d758ea8	For ARM/Darwin, add a dwarf entry indicating whether a function is arm or thumb rdar://8202967 llvm-svn: 109057	2010-07-21 23:03:52 +00:00
Jim Grosbach	2dc4ae051d	tidy up llvm-svn: 109042	2010-07-21 22:04:53 +00:00
Eric Christopher	3ae12eb078	Formatting. llvm-svn: 108926	2010-07-20 21:05:58 +00:00
Lang Hames	304ecc0487	Render MachineFunctions to HTML pages, with options to render register pressure estimates and liveness alongside. Still experimental. llvm-svn: 108698	2010-07-19 15:22:28 +00:00
Lang Hames	48638f63ba	LoopSplitter - intended to split live intervals over loop boundaries. Still very much under development. Comments and fixes will be forthcoming. (This commit includes some small tweaks to LiveIntervals & LoopInfo to support the splitter) llvm-svn: 108615	2010-07-17 07:34:01 +00:00
Eric Christopher	b397b001b9	Propagate alloca alignment information via variable size object frame information. No functional change yet. llvm-svn: 108583	2010-07-17 00:28:22 +00:00
Bill Wendling	e2833a21c2	Rename DBG_LABEL PROLOG_LABEL, because it's only used during prolog emission and thus is a much more meaningful name. llvm-svn: 108563	2010-07-16 22:20:36 +00:00
Dan Gohman	444c76a3b1	Revert r108369, sorting llvm.dbg.declare information by source position, since it doesn't work for front-ends which don't emit column information (which includes llvm-gcc in its present configuration), and doesn't work for clang for K&R style variables where the variables are declared in a different order from the parameter list. Instead, make a separate pass through the instructions to collect the llvm.dbg.declare instructions in order. This ensures that the debug information for variables is emitted in this order. llvm-svn: 108538	2010-07-16 17:54:27 +00:00
Dan Gohman	07ef07c202	Make the order in which variables are described in debug information independent of the order that isel happens to visit the dbg_declare intrinsics. This fixes a bug in which the formal arguments were being printed in reverse order, now that fast isel is going bottom up. llvm-svn: 108369	2010-07-14 23:08:16 +00:00
Dan Gohman	8e01a639c0	Delete fast-isel's trivial load optimization; it breaks debugging because it can look past points where a debugger might modify user variables. llvm-svn: 108336	2010-07-14 17:25:37 +00:00
Evan Cheng	72e40c4e08	Teach ProcessImplicitDefs to transform more COPY instructions into IMPLICIT_DEF (and subsequently eliminate them). This allows machine LICM to hoist IMPLICIT_DEF's. PR7620. llvm-svn: 108304	2010-07-14 01:22:19 +00:00
Dan Gohman	18711b19c9	Don't propagate debug locations to instructions for materializing constants, since they may not be emited near the other instructions which get the same line, and this confuses debug info. llvm-svn: 108302	2010-07-14 01:07:44 +00:00
Jakob Stoklund Olesen	9b5d14c0f0	Remove vestigial decl. llvm-svn: 108278	2010-07-13 21:19:08 +00:00
Rafael Espindola	84716579d4	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Dan Gohman	fef30fcd5e	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Bob Wilson	9e8c9204ef	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Lang Hames	67f0ebc5c6	Added a support for inserting new MBBs into the numbering. Unlike insertMachineInstrInMaps this does not guarantee live intervals will remain correct. The caller will need to manually update intervals to account for the changes made to the CFG. llvm-svn: 107958	2010-07-09 09:19:23 +00:00
Dan Gohman	7e6e4dd058	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Jim Grosbach	8e0f47b37d	After r107880, findSurvivorReg() no longer needs to be public. llvm-svn: 107887	2010-07-08 17:27:23 +00:00
Jakob Stoklund Olesen	30aacf68b9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Dan Gohman	4dcc56a102	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Jim Grosbach	b934d25d5f	When processing frame index virtual registers, consider all available registers (if there are any) and use the one which remains available for the longest rather than just using the first one. This should help enable better re-use of the loaded frame index values. rdar://7318760 llvm-svn: 107847	2010-07-08 00:38:54 +00:00
Evan Cheng	22b3e8f3b1	Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency sake. llvm-svn: 107820	2010-07-07 22:15:37 +00:00
Dan Gohman	d0caefa601	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	424cc6b616	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Dan Gohman	b2d5b47efb	Give FunctionLoweringInfo an MBB member, avoiding the need to pass it around everywhere, and also give it an InsertPt member, to enable isel to operate at an arbitrary position within a block, rather than just appending to a block. llvm-svn: 107791	2010-07-07 16:47:08 +00:00
Dan Gohman	b87c534168	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	1c3ce1ccd5	Move FunctionLoweringInfo.h out into include/llvm/CodeGen. This will allow target-specific fast-isel code to make use of it directly. llvm-svn: 107787	2010-07-07 16:01:37 +00:00
Dan Gohman	c768525273	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Dan Gohman	f60a8be1d0	Move ArgFlagsTy, OutputArg, and InputArg out of SelectionDAGNodes.h and into a new header, TargetCallingConv.h. llvm-svn: 107782	2010-07-07 15:28:42 +00:00
Dan Gohman	28eddf12ea	Move CallingConvLower.cpp out of the SelectionDAG directory. llvm-svn: 107781	2010-07-07 15:15:27 +00:00
Dan Gohman	bbee8c93fb	Add a getFirstNonPHI utility function. llvm-svn: 107778	2010-07-07 14:33:51 +00:00
Dan Gohman	d409104054	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't needs its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Devang Patel	7ab104353b	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Jakob Stoklund Olesen	f86de96f78	Be more forgiving when calculating alias interference for physreg coalescing. It is OK for an alias live range to overlap if there is a copy to or from the physical register. CoalescerPair can work out if the copy is coalescable independently of the alias. This means that we can join with the actual destination interval instead of using the getOrigDstReg() hack. It is no longer necessary to merge clobber ranges into subregisters. llvm-svn: 107695	2010-07-06 20:31:51 +00:00
Dan Gohman	808f334f79	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Dan Gohman	4d264f7e51	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	38f2820fc3	Add versions of OutputArgReg, AnalyzeReturn, and AnalyzeCallOperands which do not depend on SelectionDAG. llvm-svn: 107666	2010-07-06 15:39:54 +00:00
Dan Gohman	66125b8df0	Add a new CCValAssign LocInfo value, and a comment explaining what it should be used for. llvm-svn: 107661	2010-07-06 15:35:06 +00:00
Dan Gohman	6a73079aba	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Evan Cheng	47f3a2db40	Remove isSS argument from CreateFixedObject. Fixed objects cannot be spill slots so it's always false. llvm-svn: 107550	2010-07-03 00:40:23 +00:00
Jakob Stoklund Olesen	dba28ee3d8	Detect and handle COPY in many places. This code is transitional, it will soon be possible to eliminate isExtractSubreg, isInsertSubreg, and isMoveInstr in most places. llvm-svn: 107547	2010-07-03 00:04:37 +00:00
Jakob Stoklund Olesen	8186b4c8d1	Add a new target independent COPY instruction and code to lower it. The COPY instruction is intended to replace the target specific copy instructions for virtual registers as well as the EXTRACT_SUBREG and INSERT_SUBREG instructions in MachineFunctions. It won't we used in a selection DAG. COPY is lowered to native register copies by LowerSubregs. llvm-svn: 107529	2010-07-02 22:29:50 +00:00
Jakob Stoklund Olesen	ab1145ea8e	Handle unindexed instructions in SlotIndices. SlotIndexes::insertMachineInstrInMaps would crash when trying to insert an instruction imediately after an unmapped debug value. llvm-svn: 107504	2010-07-02 19:54:45 +00:00
Jakob Stoklund Olesen	fc2900126b	Rematerialize as much as possible before inserting spills and reloads. This allows us to recognize the common case where all uses could be rematerialized, and no stack slot allocation is necessary. If some values could be fully rematerialized, remove them from the live range before allocating a stack slot for the rest. llvm-svn: 107492	2010-07-02 17:44:57 +00:00
Dan Gohman	5c4876b691	Comment a non-obvious member variable. llvm-svn: 107458	2010-07-02 01:20:16 +00:00
Dan Gohman	8022d8e885	Teach fast-isel to avoid loading a value from memory when it's already available in a register. This is pretty primitive, but it reduces the number of instructions in common testcases by 4%. llvm-svn: 107380	2010-07-01 03:49:38 +00:00
Mikhail Glushenkov	0163e1e289	Trailing whitespace. llvm-svn: 107360	2010-07-01 01:00:22 +00:00
Jakob Stoklund Olesen	8918e475af	Begin implementation of an inline spiller. InlineSpiller inserts loads and spills immediately instead of deferring to VirtRegMap. This is possible now because SlotIndexes allows instructions to be inserted and renumbered. This is work in progress, and is mostly a copy of TrivialSpiller so far. It works very well for functions that don't require spilling. llvm-svn: 107227	2010-06-29 23:58:39 +00:00
Bill Wendling	59ef9bcc6d	Revert r107205 and r107207. llvm-svn: 107215	2010-06-29 22:34:52 +00:00
Bill Wendling	05a4c0b1f2	Introducing the "linker_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". llvm-svn: 107205	2010-06-29 21:24:00 +00:00
Rafael Espindola	317a02739d	When splitting a VAARG, remember its alignment. This produces terrible but correct code. llvm-svn: 106952	2010-06-26 18:22:20 +00:00
Benjamin Kramer	b624350656	VNInfos don't need to be destructed anymore. llvm-svn: 106943	2010-06-26 11:30:59 +00:00
Jakob Stoklund Olesen	71ad56676d	Don't track kills in VNInfo. Use interval ends instead. The VNInfo.kills vector was almost unused except for all the code keeping it updated. The few places using it were easily rewritten to check for interval ends instead. The two new methods LiveInterval::killedAt and killedInRange are replacements. This brings us down to 3 independent data structures tracking kills. llvm-svn: 106905	2010-06-25 22:53:05 +00:00
Jakob Stoklund Olesen	7a5bf34236	Remove the now unused LiveIntervals::getVNInfoSourceReg(). This method was always a bit too simplistic for the real world. It didn't really deal with subregisters and such. llvm-svn: 106781	2010-06-24 20:18:15 +00:00
Jakob Stoklund Olesen	1a6a8cfc51	Remove the -fast-spill option. This code path has never really been used, and we are going to be handling spilling through the Spiller interface in the future. llvm-svn: 106777	2010-06-24 19:56:08 +00:00
Jakob Stoklund Olesen	9f8104463f	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. This second attempt fixes some crashes that only occurred Linux. llvm-svn: 106769	2010-06-24 18:15:01 +00:00
Jakob Stoklund Olesen	f01eb3aab6	Be more strict about subreg-to-subreg copies in CoalescerPair. Also keep track of the original DstREg before subregister adjustments. llvm-svn: 106753	2010-06-24 16:19:28 +00:00
Dan Gohman	a08a9b8a0b	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Jakob Stoklund Olesen	1c9d50ed92	Revert "Replace a big gob of old coalescer logic with the new CoalescerPair class." Whiny buildbots. llvm-svn: 106710	2010-06-24 00:52:22 +00:00
Jakob Stoklund Olesen	19abbf4387	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. llvm-svn: 106701	2010-06-24 00:12:39 +00:00
Daniel Dunbar	be50ef88bd	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Dan Gohman	2303159bba	Move PHIElimination's SplitCriticalEdge for MachineBasicBlocks out into a utility routine, teach it how to update MachineLoopInfo, and make use of it in MachineLICM to split critical edges on demand. llvm-svn: 106555	2010-06-22 17:25:57 +00:00
Dan Gohman	823dff64cd	Teach regular and fast isel to set dead flags on unused implicit defs on calls and similar instructions. llvm-svn: 106353	2010-06-18 23:28:01 +00:00
Jim Grosbach	b8c94667a8	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jim Grosbach	91aae1c534	Add Expand-to-libcall support for additional atomics. This covers the usual entries used by llvm-gcc. *_[U]MIN and such can be added later if needed. This enables the front ends to simplify handling of the atomic intrinsics by removing the target-specific decision about which targets can handle the intrinsics. llvm-svn: 106321	2010-06-18 21:43:38 +00:00
Dan Gohman	b220af45eb	Add explicit keywords. llvm-svn: 106300	2010-06-18 19:04:37 +00:00
Dan Gohman	1ccf40774e	Start TargetRegisterClass indices at 0 instead of 1, so that MachineRegisterInfo doesn't have to confusingly allocate an extra entry. llvm-svn: 106296	2010-06-18 18:13:55 +00:00
Jim Grosbach	60a3287950	Grammar. llvm-svn: 106292	2010-06-18 17:40:42 +00:00

... 5 6 7 8 9 ...

3786 Commits