llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 05:23:45 +02:00

Author	SHA1	Message	Date
Owen Anderson	285891eccf	Enhance both TargetLibraryInfo and SelectionDAGBuilder so that the latter can use the former to prevent the formation of libm SDNode's when -fno-builtin is passed. llvm-svn: 146193	2011-12-08 22:15:21 +00:00
Evan Cheng	320b2be38c	Make MachineInstr instruction property queries more flexible. This change all clients to decide whether to look inside bundled instructions and whether the query should return true if any / all bundled instructions have the queried property. llvm-svn: 146168	2011-12-08 19:23:10 +00:00
Evan Cheng	1acd685d87	Add bundle aware API for querying instruction properties and switch the code generator to it. For non-bundle instructions, these behave exactly the same as the MC layer API. For properties like mayLoad / mayStore, look into the bundle and if any of the bundled instructions has the property it would return true. For properties like isPredicable, only return true if all of the bundled instructions have the property. For properties like canFoldAsLoad, isCompare, conservatively return false for bundles. llvm-svn: 146026	2011-12-07 07:15:52 +00:00
Jakob Stoklund Olesen	e612fdbbab	Add MachineOperand IsInternalRead flag. This flag is used when bundling machine instructions. It indicates whether the operand reads a value defined inside or outside its bundle. llvm-svn: 145997	2011-12-07 00:22:07 +00:00
Evan Cheng	5061553f9d	First chunk of MachineInstr bundle support. 1. Added opcode BUNDLE 2. Taught MachineInstr class to deal with bundled MIs 3. Changed MachineBasicBlock iterator to skip over bundled MIs; added an iterator to walk all the MIs 4. Taught MachineBasicBlock methods about bundled MIs llvm-svn: 145975	2011-12-06 22:12:01 +00:00
Sebastian Pop	182ae6a6fa	use space star instead of star space llvm-svn: 145944	2011-12-06 17:34:16 +00:00
Sebastian Pop	cb55bb22ab	add missing point at the end of sentences llvm-svn: 145943	2011-12-06 17:34:11 +00:00
Jakob Stoklund Olesen	e53ed273d9	Use logarithmic units for basic block alignment. This was actually a bit of a mess. TLI.setPrefLoopAlignment was clearly documented as taking log2(bytes) units, but the x86 target would still set a preferred loop alignment of '16'. CodePlacementOpt passed this number on to the basic block, and AsmPrinter interpreted it as bytes. Now both MachineFunction and MachineBasicBlock use logarithmic alignments. Obviously, MachineConstantPool still measures alignments in bytes, so we can emulate the thrill of using as. llvm-svn: 145889	2011-12-06 01:26:19 +00:00
Jakob Stoklund Olesen	faa1b18b38	Fix unclear wording. llvm-svn: 145882	2011-12-06 00:51:09 +00:00
Anna Zaks	431b43fdbe	Change the Dominators recalculate() function to only rely on GraphTraits This is a patch by Guoping Long! As part of utilizing LLVM Dominator computation in Clang, made two changes to LLVM dominators tree implementation: - (1) Change the recalculate() template function to only rely on GraphTraits. - (2) Add a size() method to GraphTraits template class to query the number of nodes in the graph. llvm-svn: 145837	2011-12-05 19:17:04 +00:00
Nick Lewycky	7d0d3c2d58	Move global variables in TargetMachine into new TargetOptions class. As an API change, now you need a TargetOptions object to create a TargetMachine. Clang patch to follow. One small functionality change in PTX. PTX had commented out the machine verifier parts in their copy of printAndVerify. That now calls the version in LLVMTargetMachine. Users of PTX who need verification disabled should rely on not passing the command-line flag to enable it. llvm-svn: 145714	2011-12-02 22:16:29 +00:00
Anshuman Dasgupta	f754c8bf8e	Add a deterministic finite automaton based packetizer for VLIW architectures llvm-svn: 145629	2011-12-01 21:10:21 +00:00
Chad Rosier	0ff2f46d12	If fast-isel fails, remove dead instructions generated during the failed attempt. llvm-svn: 145425	2011-11-29 19:40:47 +00:00
Bill Wendling	8beab76b07	Remove dead llvm.eh.sjlj.dispatchsetup intrinsic. llvm-svn: 145263	2011-11-28 19:23:13 +00:00
Evan Cheng	2b239cbcf6	Sink codegen optimization level into MCCodeGenInfo along side relocation model and code model. This eliminates the need to pass OptLevel flag all over the place and makes it possible for any codegen pass to use this information. llvm-svn: 144788	2011-11-16 08:38:26 +00:00
Owen Anderson	48a129b50e	Rename MVT::untyped to MVT::Untyped to match similar nomenclature. llvm-svn: 144747	2011-11-16 01:02:57 +00:00
Benjamin Kramer	3eeef2e739	Twinify GraphWriter a little bit. llvm-svn: 144647	2011-11-15 16:26:38 +00:00
Benjamin Kramer	bff5aaee3f	Make headers standalone. llvm-svn: 144537	2011-11-14 17:45:03 +00:00
Chandler Carruth	f89087744e	Under the hood, MBPI is doing a linear scan of every successor every time it is queried to compute the probability of a single successor. This makes computing the probability of every successor of a block in sequence... really really slow. ;] This switches to a linear walk of the successors rather than a quadratic one. One of several quadratic behaviors slowing this pass down. I'm not really thrilled with moving the sum code into the public interface of MBPI, but I don't (at the moment) have ideas for a better interface. My direction I'm thinking in for a better interface is to have MBPI actually retain much more state and make all of these queries cheap. That's a lot of work, and would require invasive changes. Until then, this seems like the least bad (ie, least quadratic) solution. Suggestions welcome. llvm-svn: 144530	2011-11-14 09:12:57 +00:00
Chandler Carruth	09418993f8	Reuse the logic in getEdgeProbability within getHotSucc in order to correctly handle blocks whose successor weights sum to more than UINT32_MAX. This is slightly less efficient, but the entire thing is already linear on the number of successors. Calling it within any hot routine is a mistake, and indeed no one is calling it. It also simplifies the code. llvm-svn: 144527	2011-11-14 08:55:59 +00:00
Chandler Carruth	462bb16130	Fix an overflow bug in MachineBranchProbabilityInfo. This pass relied on the sum of the edge weights not overflowing uint32, and crashed when they did. This is generally safe as BranchProbabilityInfo tries to provide this guarantee. However, the CFG can get modified during codegen in a way that grows the sum of the edge weights. This doesn't seem unreasonable (imagine just adding more blocks all with the default weight of 16), but it is hard to come up with a case that actually triggers 32-bit overflow. Fortuately, the single-source GCC build is good at this. The solution isn't very pretty, but its no worse than the previous code. We're already summing all of the edge weights on each query, we can sum them, check for an overflow, compute a scale, and sum them again. I've included a greatly reduced test case out of the GCC source that triggers it. It's a pretty lame test, as it clearly is just barely triggering the overflow. I'd like to have something that is much more definitive, but I don't understand the fundamental pattern that triggers an explosion in the edge weight sums. The buggy code is duplicated within this file. I'll colapse them into a single implementation in a subsequent commit. llvm-svn: 144526	2011-11-14 08:50:16 +00:00
Chandler Carruth	5d03d0351f	Add a cautionary note to this API. It was not at all obvious to me how expensive the most useful interface to this analysis is. Fun story -- it's also not correct. That's getting fixed in another patch. llvm-svn: 144523	2011-11-14 06:51:49 +00:00
Jakob Stoklund Olesen	9b34607bdf	Rename SlotIndexes to match how they are used. The old naming scheme (load/use/def/store) can be traced back to an old linear scan article, but the names don't match how slots are actually used. The load and store slots are not needed after the deferred spill code insertion framework was deleted. The use and def slots don't make any sense because we are using half-open intervals as is customary in C code, but the names suggest closed intervals. In reality, these slots were used to distinguish early-clobber defs from normal defs. The new naming scheme also has 4 slots, but the names match how the slots are really used. This is a purely mechanical renaming, but some of the code makes a lot more sense now. llvm-svn: 144503	2011-11-13 20:45:27 +00:00
Jakob Stoklund Olesen	5a265aeb70	Delete the old spilling framework from LiveIntervalAnalysis. This is dead code, all register allocators use InlineSpiller. llvm-svn: 144478	2011-11-12 23:57:05 +00:00
Jakob Stoklund Olesen	78902f9088	Delete the linear scan register allocator. RegAllocGreedy has been the default for six months now. Deleting RegAllocLinearScan makes it possible to also delete VirtRegRewriter and clean up the spiller code. llvm-svn: 144475	2011-11-12 22:39:45 +00:00
Eli Friedman	8563e57e38	Don't try to form pre/post-indexed loads/stores until after LegalizeDAG runs. Fixes PR11029. llvm-svn: 144438	2011-11-12 00:35:34 +00:00
Nicolas Geoffray	98ef58406c	Add a custom safepoint method, in order for language implementers to decide which machine instruction gets to be a safepoint. llvm-svn: 144399	2011-11-11 18:32:52 +00:00
Owen Anderson	50d2e054f9	Add additional checking to ensure that MachineMemOperands are never set to null, which can happen in weird circumstances where target intrinsic hooks are implemented incorrectly. llvm-svn: 144303	2011-11-10 19:25:09 +00:00
Pete Cooper	224434deec	Added invariant field to the DAG.getLoad method and changed all calls. When this field is true it means that the load is from constant (runt-time or compile-time) and so can be hoisted from loops or moved around other memory accesses llvm-svn: 144100	2011-11-08 18:42:53 +00:00
Chandler Carruth	f95461f23b	Begin collecting some of the statistics for block placement discussed on the mailing list. Suggestions for other statistics to collect would be awesome. =] Currently these are implemented as a separate pass guarded by a separate flag. I'm not thrilled by that, but I wanted to be able to collect the statistics for the old code placement as well as the new in order to have a point of comparison. I'm planning on folding them into the single pass if / when there is only one pass of interest. llvm-svn: 143537	2011-11-02 07:17:12 +00:00
Nick Lewycky	651475977d	Teach our Dwarf emission to use the string pool. llvm-svn: 143097	2011-10-27 06:44:11 +00:00
Dan Gohman	b54d296fd4	Remove the SystemZ backend. llvm-svn: 142878	2011-10-24 23:48:32 +00:00
Dan Gohman	91adc96acd	Delete the top-down "Latency" scheduler. Top-down scheduling doesn't handle physreg dependencies, and upcoming codegen changes will require proper physreg dependence handling. llvm-svn: 142816	2011-10-24 18:01:06 +00:00
Chandler Carruth	380e0d5013	Implement a block placement pass based on the branch probability and block frequency analyses. This differs substantially from the existing block-placement pass in LLVM: 1) It operates on the Machine-IR in the CodeGen layer. This exposes much more (and more precise) information and opportunities. Also, the results are more stable due to fewer transforms ocurring after the pass runs. 2) It uses the generalized probability and frequency analyses. These can model static heuristics, code annotation derived heuristics as well as eventual profile loading. By basing the optimization on the analysis interface it can work from any (or a combination) of these inputs. 3) It uses a more aggressive algorithm, both building chains from tho bottom up to maximize benefit, and using an SCC-based walk to layout chains of blocks in a profitable ordering without O(N^2) iterations which the old pass involves. The pass is currently gated behind a flag, and not enabled by default because it still needs to grow some important features. Most notably, it needs to support loop aligning and careful layout of loop structures much as done by hand currently in CodePlacementOpt. Once it supports these, and has sufficient testing and quality tuning, it should replace both of these passes. Thanks to Nick Lewycky and Richard Smith for help authoring & debugging this, and to Jakob, Andy, Eric, Jim, and probably a few others I'm forgetting for reviewing and answering all my questions. Writing a backend pass is sooo much better now than it used to be. =D llvm-svn: 142641	2011-10-21 06:46:38 +00:00
Dan Gohman	666f749afc	Delete the list-tdrr scheduler. Top-down schedulers are going away because they don't support physical register dependencies. llvm-svn: 142620	2011-10-20 21:44:34 +00:00
Jakob Stoklund Olesen	1910496fb4	Admonish that MI is not IR and virtual registers have constraints. In machine code, you can't just replaceRegWith() the same way you can replaceAllUsesWith() in IR. Virtual registers may have different register classes that need to be merged first. llvm-svn: 142201	2011-10-17 17:33:39 +00:00
Jakob Stoklund Olesen	9349890ed8	Add MachineInstr::getRegClassConstraint(). Most instructions have some requirements for their register operands. Usually, this is expressed as register class constraints in the MCInstrDesc, but for inline assembly the constraints are encoded in the flag words. llvm-svn: 141835	2011-10-12 23:37:36 +00:00
Jakob Stoklund Olesen	d406c95461	Extract a method for finding the inline asm flag operand. llvm-svn: 141834	2011-10-12 23:37:33 +00:00
Bill Wendling	b52a154112	Add a bool value to set the IsLandingPad flag to. llvm-svn: 141435	2011-10-07 23:06:01 +00:00
Bill Wendling	0f5b533c48	Thread the chain through the eh.sjlj.setjmp intrinsic, like it's documented to do. This will be useful later on with the new SJLJ stuff. llvm-svn: 141416	2011-10-07 21:25:38 +00:00
Bill Wendling	729a01085f	Add accessor method to check if the landing pad symbol has call site information. llvm-svn: 141244	2011-10-05 23:26:10 +00:00
Bill Wendling	94f80818c1	Add an ivar that maps a landing pad's EH symbol to the call sites that may jump to the landing pad. This will be used by the back-end to generate the jump tables for dispatching the arriving longjmp in sjlj eh. llvm-svn: 141224	2011-10-05 22:20:38 +00:00
Jakob Stoklund Olesen	a6045464c8	Allow <undef> flags on def operands as well as uses. The <undef> flag says that a MachineOperand doesn't read its register, or doesn't depend on the previous value of its register. A full register def never depends on the previous register value. A partial register def may depend on the previous value if it is intended to update part of a register. For example: %vreg10:dsub_0<def,undef> = COPY %vreg1 %vreg10:dsub_1<def> = COPY %vreg2 The first copy instruction defines the full %vreg10 register with the bits not covered by dsub_0 defined as <undef>. It is not considered a read of %vreg10. The second copy modifies part of %vreg10 while preserving the rest. It has an implicit read of %vreg10. This patch adds a MachineOperand::readsReg() method to determine if an operand reads its register. Previously, this was modelled by adding a full-register <imp-def> operand to the instruction. This approach makes it possible to determine directly from a MachineOperand if it reads its register. No scanning of MI operands is required. llvm-svn: 141124	2011-10-04 21:49:33 +00:00
Bill Wendling	e8a545f260	Doxygen-ize comments. No functionality change. llvm-svn: 141122	2011-10-04 21:25:01 +00:00
Bill Wendling	83ccbf4f1c	Add method to determine if a begin label has a call site number associated with it. llvm-svn: 141107	2011-10-04 20:31:56 +00:00
Jakob Stoklund Olesen	b0b79fa82c	Move getCommonSubClass() into TRI. It will soon need the context. llvm-svn: 140896	2011-09-30 22:18:51 +00:00
Nick Lewycky	73dbedf97e	Fix typo. llvm-svn: 140807	2011-09-29 21:07:46 +00:00
Jakob Stoklund Olesen	49803374a4	Remove NumImplicitOps which is now unused. llvm-svn: 140767	2011-09-29 01:47:36 +00:00
Bill Wendling	b3656866e2	Create and use an llvm.eh.sjlj.functioncontext intrinsic. This intrinsic is used to pass the index of the function context to the back-end for further processing. The back-end is in charge of filling in the rest of the entries. llvm-svn: 140676	2011-09-28 03:36:43 +00:00
Jakob Stoklund Olesen	2bf243f464	Remove X86-dependent stuff from SSEDomainFix. This also enables domain swizzling for AVX code which required a few trivial test changes. The pass will be moved to lib/CodeGen shortly. llvm-svn: 140659	2011-09-27 23:50:46 +00:00
Jim Grosbach	e883e939a3	Rename AddSelectionDAGCSEId() to addSelectionDAGCSEId(). Naming conventions consistency. No functional change. llvm-svn: 140636	2011-09-27 20:59:33 +00:00
Nadav Rotem	5b6d21503b	Cleanup PromoteIntOp_EXTRACT_VECTOR_ELT and PromoteIntRes_SETCC. Add a new method: getAnyExtOrTrunc and use it to replace the manual check. llvm-svn: 140603	2011-09-27 11:16:47 +00:00
Jakob Stoklund Olesen	7e8448c147	Clean up code after renaming LowerSubregs -> ExpandPostRAPseudos. No functional change intended. llvm-svn: 140470	2011-09-25 16:46:08 +00:00
Jakob Stoklund Olesen	1d3105c3d3	Add a MinNumRegs argument to MRI::constrainRegClass(). The function will refuse to use a register class with fewer registers than MinNumRegs. This can be used by clients to avoid accidentally increase register pressure too much. The default value of MinNumRegs=0 doesn't affect how constrainRegClass() works. llvm-svn: 140339	2011-09-22 21:39:31 +00:00
Jakob Stoklund Olesen	93195a175c	Use getPrevSlot() instead of getPrevIndex(). The getPrevIndex() function moves to the same slot in the previous instruction. For getVNInfoBefore(), we just need the previous slot in the same instruction. llvm-svn: 139793	2011-09-15 15:31:49 +00:00
Jakob Stoklund Olesen	83f23f23cd	Stop verifying hasPHIKill() flags. There is only one legitimate use remaining, in addIntervalsForSpills(). All other calls to hasPHIKill() are only used to update PHIKill flags. The addIntervalsForSpills() function is part of the old spilling framework, only used by linearscan. llvm-svn: 139783	2011-09-15 05:16:30 +00:00
Jakob Stoklund Olesen	a7631a56b4	Leave hasPHIKill flags alone in LiveInterval::RenumberValues. It is conservatively correct to keep the hasPHIKill flags, even after deleting PHI-defs. The calculation can be very expensive after taildup has created a quadratic number of indirectbr edges in the CFG, and the hasPHIKill flag isn't used for anything after RenumberValues(). llvm-svn: 139780	2011-09-15 04:37:18 +00:00
Andrew Trick	e5bb7267ff	[regcoalescing] bug fix for RegistersDefinedFromSameValue. An improper SlotIndex->VNInfo lookup was leading to unsafe copy removal. Fixes PR10920 401.bzip2 miscompile with no IV rewrite. llvm-svn: 139765	2011-09-15 01:09:33 +00:00
Eric Christopher	25b7bedcf9	Fix indenting. llvm-svn: 139670	2011-09-13 23:45:39 +00:00
Jakob Stoklund Olesen	8e739db8a2	Switch extendInBlock() to take a kill slot instead of the last use slot. Three out of four clients prefer this interface which is consistent with extendIntervalEndTo() and LiveRangeCalc::extend(). llvm-svn: 139604	2011-09-13 16:47:56 +00:00
Devang Patel	ba2d56b1ef	Directly point debug info to the stack slot of the arugment, instead of trying to keep track of vreg in which it the arugment is copied. The LiveDebugVariable can keep track of variable's ranges. llvm-svn: 139330	2011-09-08 22:59:09 +00:00
Eli Friedman	6a45370c0f	Relax the MemOperands on atomics a bit. Fixes -verify-machineinstrs failures for atomic laod/store on ARM. (The fix for the related failures on x86 is going to be nastier because we actually need Acquire memoperands attached to the atomic load instrs, etc.) llvm-svn: 139221	2011-09-07 02:23:42 +00:00
Duncan Sands	d1311488fe	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Duncan Sands	6939ae53ac	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Eli Friedman	6f95a6ae1b	Basic x86 code generation for atomic load and store instructions. llvm-svn: 138478	2011-08-24 20:50:09 +00:00
Ivan Krasin	338df71d60	FastISel: avoid function calls between the materialization of the constant and its use. llvm-svn: 137993	2011-08-18 22:06:10 +00:00
Bill Wendling	71ce55ffd7	Add the support in code-gen for the landingpad instruction lowering. The landingpad instruction is lowered into the EXCEPTIONADDR and EHSELECTION SDNodes. The information from the landingpad instruction is harvested by the 'AddLandingPadInfo' function. The new EH uses the current EH scheme in the back-end. This will change once we switch over to the new scheme. (Reviewed by Jakob!) llvm-svn: 137880	2011-08-17 21:56:44 +00:00
Devang Patel	b79ed42390	Constify. llvm-svn: 137489	2011-08-12 18:18:02 +00:00
Devang Patel	5901733259	Use ArrayRef. llvm-svn: 137485	2011-08-12 18:10:19 +00:00
Duncan Sands	10a9e984bc	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Nick Lewycky	a4f1ff9b56	A virtual destructor for the class with virtual methods! llvm-svn: 137400	2011-08-12 00:32:15 +00:00
Devang Patel	504551c7bf	Stay within 80 columns. llvm-svn: 137283	2011-08-10 23:58:09 +00:00
Devang Patel	4459f63998	Provide utility to extract and use lexical scoping information from machine instructions. llvm-svn: 137237	2011-08-10 19:04:06 +00:00
Jakob Stoklund Olesen	f7f4398587	Trim an unneeded header. llvm-svn: 137184	2011-08-09 23:49:21 +00:00
Jakob Stoklund Olesen	cbd8bcf3b8	Move CalculateRegClass to MRI::recomputeRegClass. This function doesn't have anything to do with spill weights, and MRI already has functions for manipulating the register class of a virtual register. llvm-svn: 137123	2011-08-09 16:46:27 +00:00
Jakob Stoklund Olesen	2f58336f2f	Refer to the RegisterCoalescer pass by ID. A public interface is no longer needed since RegisterCoalescer is not an analysis any more. llvm-svn: 137082	2011-08-09 00:29:53 +00:00
Jakub Staszak	7b30ba0db8	Add more constantness in BlockFrequencyInfo. llvm-svn: 136816	2011-08-03 21:30:57 +00:00
Bill Wendling	57ddbb84ac	Revert r136253, r136263, r136269, r136313, r136325, r136326, r136329, r136338, r136339, r136341, r136369, r136387, r136392, r136396, r136429, r136430, r136444, r136445, r136446, r136253 pending review. llvm-svn: 136556	2011-07-30 05:42:50 +00:00
Jakob Stoklund Olesen	7b77f35a1b	Add an isSSA() flag to MachineRegisterInfo. This flag is true from isel to register allocation when the machine function is required to be in SSA form. The TwoAddressInstructionPass and PHIElimination passes clear the flag. The SSA flag wil be used by the machine code verifier to check for SSA form, and eventually an assertion can enforce it in +Asserts builds. This will catch the common target error of creating machine code with multiple defs of a virtual register. llvm-svn: 136532	2011-07-29 22:51:22 +00:00
Eli Friedman	6f2419f1a2	Misc optimizer+codegen work for 'cmpxchg' and 'atomicrmw'. They appear to be working on x86 (at least for trivial testcases); other architectures will need more work so that they actually emit the appropriate instructions for orderings stricter than 'monotonic'. (As far as I can tell, the ARM, PPC, Mips, and Alpha backends need such changes.) llvm-svn: 136457	2011-07-29 03:05:32 +00:00
Bill Wendling	e4090cc864	Add the AddLandingPadInfo function. AddLandingPadInfo takes a landingpad instruction and grabs all of the information from it that it needs for EH table generation. llvm-svn: 136429	2011-07-28 23:42:57 +00:00
Bill Wendling	dba29efd6f	Use ArrayRef instead of requiring an std::vector. llvm-svn: 136396	2011-07-28 21:25:33 +00:00
Eli Friedman	842ea169de	Code generation for 'fence' instruction. llvm-svn: 136283	2011-07-27 22:21:52 +00:00
Jakub Staszak	f5076015fc	Use BlockFrequency instead of uint32_t in BlockFrequencyInfo. llvm-svn: 136278	2011-07-27 22:05:51 +00:00
Jakub Staszak	2873980483	Fix #include guard directive. llvm-svn: 135947	2011-07-25 20:08:00 +00:00
Jakub Staszak	5c309cbead	Rename BlockFrequency to BlockFrequencyInfo and MachineBlockFrequency to MachineBlockFrequencyInfo. llvm-svn: 135937	2011-07-25 19:25:40 +00:00
Bill Wendling	a289f709aa	Add a method to set the compact unwind info. llvm-svn: 135806	2011-07-22 21:17:05 +00:00
Jakub Staszak	cf5ebedf56	Allow getBlockFreq to return 0. llvm-svn: 135742	2011-07-22 02:24:57 +00:00
Evan Cheng	c9bc5a9011	Goodbye TargetAsmInfo. This eliminate last bit of CodeGen and Target in llvm-mc. There is still a bit more refactoring left to do in Targets. But we are now very close to fixing all the layering issues in MC. llvm-svn: 135611	2011-07-20 19:50:42 +00:00
Evan Cheng	380dc98371	Add MCObjectFileInfo and sink the MCSections initialization code from TargetLoweringObjectFileImpl down to MCObjectFileInfo. TargetAsmInfo is done to one last method. It's almost gone! llvm-svn: 135569	2011-07-20 05:58:47 +00:00
Devang Patel	72886ba8d8	Revert r135423. llvm-svn: 135454	2011-07-19 00:28:24 +00:00
Bill Wendling	b1d5a0798a	Rename CompactEncoding to CompactUnwindEncoding. llvm-svn: 135448	2011-07-19 00:00:58 +00:00
Bill Wendling	203796ee7a	Move the compact encoding from the target-specific library to the code-gen library. llvm-svn: 135443	2011-07-18 23:38:40 +00:00
Evan Cheng	10c6820ff4	Move getInitialFrameState from TargetFrameInfo to MCAsmInfo (suggestions for better location welcome). llvm-svn: 135438	2011-07-18 22:29:13 +00:00
Evan Cheng	561d71ce7b	Sink getDwarfRegNum, getLLVMRegNum, getSEHRegNum from TargetRegisterInfo down to MCRegisterInfo. Also initialize the mapping at construction time. This patch eliminate TargetRegisterInfo from TargetAsmInfo. It's another step towards fixing the layering violation. llvm-svn: 135424	2011-07-18 20:57:22 +00:00
Devang Patel	389cb9d8c6	During bottom up fast-isel, instructions emitted to materalize registers are at top of basic block and do not have debug location. This may misguide debugger while entering the basic block and sometimes debugger provides semi useful view of current location to developer by picking up previous known location as current location. Assign a sensible location to the first instruction in a basic block, if it does not have one location derived from source file, so that debugger can provide meaningful user experience to developers in edge cases. [take 2] llvm-svn: 135423	2011-07-18 20:55:23 +00:00
Chris Lattner	e1fe7061ce	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Jakub Staszak	a0010953f7	Add MachineBlockFrequency analysis. llvm-svn: 135352	2011-07-16 20:23:20 +00:00
Jakob Stoklund Olesen	987cd08002	Extract parts of RAGreedy::splitAroundRegion as SplitKit methods. This gets rid of some of the gory splitting details in RAGreedy and makes them available to future SplitKit clients. Slightly generalize the functionality to support multi-way splitting. Specifically, SplitEditor::splitLiveThroughBlock() supports switching between different register intervals in a block. llvm-svn: 135307	2011-07-15 21:47:57 +00:00
Evan Cheng	7bdc771798	Fix up TargetLoweringObjectFile ctors to properly initialize fields. llvm-svn: 135068	2011-07-13 19:54:59 +00:00
Jay Foad	88fb4f4597	Convert InsertValueInst and ExtractValueInst APIs to use ArrayRef. llvm-svn: 135040	2011-07-13 10:26:04 +00:00
Jakub Staszak	a8987cb392	- Make BranchProbability constructor public. - Add getCompl() method. llvm-svn: 134857	2011-07-10 02:12:39 +00:00
Cameron Zwarich	c23366d357	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Jakob Stoklund Olesen	acaf9e9ce1	Be more aggressive about following hints. RAGreedy::tryAssign will now evict interference from the preferred register even when another register is free. To support this, add the EvictionCost struct that counts how many hints are broken by an eviction. We don't want to break one hint just to satisfy another. Rename canEvict to shouldEvict, and add the first bit of eviction policy that doesn't depend on spill weights: Always make room in the preferred register as long as the evictees can be split and aren't already assigned to their preferred register. Also make the CSR avoidance more accurate. When looking for a cheaper register it is OK to use a new volatile register. Only CSR aliases that have never been used before should be avoided. llvm-svn: 134735	2011-07-08 20:46:18 +00:00
Lang Hames	9e52663aa4	Add functions 'hasPredecessor' and 'hasPredecessorHelper' to SDNode. The hasPredecessorHelper function allows predecessors to be cached to speed up repeated invocations. This fixes PR10186. X.isPredecessorOf(Y) now just calls Y.hasPredecessor(X) Y.hasPredecessor(X) calls Y.hasPredecessorHelper(X, Visited, Worklist) with empty Visited and Worklist sets (i.e. no caching over invocations). Y.hasPredecessorHelper(X, Visited, Worklist) caches search state in Visited and Worklist to speed up repeated calls. The Visited set is searched for X before going to the worklist to further search the DAG if necessary. llvm-svn: 134592	2011-07-07 04:31:51 +00:00
Jakob Stoklund Olesen	c19c47697f	Include a source location when complaining about bad inline assembly. Add a MI->emitError() method that the backend can use to report errors related to inline assembly. Call it from X86FloatingPoint.cpp when the constraints are wrong. This enables proper clang diagnostics from the backend: $ clang -c pr30848.c pr30848.c:5:12: error: Inline asm output regs must be last on the x87 stack __asm__ ("" : "=u" (d)); /* { dg-error "output regs" } */ ^ 1 error generated. llvm-svn: 134307	2011-07-02 03:53:34 +00:00
Rafael Espindola	83789b3b8d	Create a isFullCopy predicate. llvm-svn: 134189	2011-06-30 21:15:52 +00:00
Devang Patel	66c4bc1dda	Revert r133953 for now. llvm-svn: 134116	2011-06-29 23:50:13 +00:00
Evan Cheng	4a169be530	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Chang TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Evan Cheng	f79231cbd4	Remove RegClass2VRegMap from MachineRegisterInfo. llvm-svn: 133967	2011-06-27 23:54:40 +00:00
Evan Cheng	7df851a4ff	Remove the experimental (and unused) pre-ra splitting pass. Greedy regalloc can split live ranges. llvm-svn: 133962	2011-06-27 23:40:45 +00:00
Devang Patel	8fbd4b55ea	During bottom up fast-isel, instructions emitted to materalize registers are at top of basic block and do not have debug location. This may misguide debugger while entering the basic block and sometimes debugger provides semi useful view of current location to developer by picking up previous known location as current location. Assign a sensible location to the first instruction in a basic block, if it does not have one location derived from source file, so that debugger can provide meaningful user experience to developers in edge cases. llvm-svn: 133953	2011-06-27 22:32:04 +00:00
Rafael Espindola	45a2fa5664	There is only one register coalescer. Merge it into the base class and remove the analysis group. llvm-svn: 133899	2011-06-26 22:34:10 +00:00
Rafael Espindola	7ad658a832	Move RegisterCoalescer.h to lib/CodeGen. llvm-svn: 133895	2011-06-26 21:41:06 +00:00
Devang Patel	91fee59b74	Handle debug info for i128 constants. llvm-svn: 133821	2011-06-24 20:46:11 +00:00
Jay Foad	9dc6571cbc	Fix a FIXME by making GlobalVariable::getInitializer() return a const Constant *. llvm-svn: 133400	2011-06-19 18:37:11 +00:00
Benjamin Kramer	0b4d4ce7c1	Don't allocate empty read-only SmallVectors during SelectionDAG deallocation. llvm-svn: 133348	2011-06-18 13:13:44 +00:00
Eric Christopher	25aa04466a	Lower multiply with overflow checking to __mulo<mode> calls if we haven't been able to lower them any other way. Fixes rdar://9090077 and rdar://9210061 llvm-svn: 133288	2011-06-17 20:41:29 +00:00
Lang Hames	20552cda1a	Add a hook for PBQP clients to run a custom pre-alloc pass to run prior to PBQP allocation. Patch by Arnaud Allard de Grandmaison. llvm-svn: 133249	2011-06-17 07:09:01 +00:00
Jakub Staszak	5c7b7d64ba	Introduce MachineBranchProbabilityInfo class, which has similar API to BranchProbabilityInfo (expect setEdgeWeight which is not available here). Branch Weights are kept in MachineBasicBlocks. To turn off this analysis set -use-mbpi=false. llvm-svn: 133184	2011-06-16 20:22:37 +00:00
Owen Anderson	a9bf21548f	Fix formatting. llvm-svn: 133164	2011-06-16 16:52:24 +00:00
Owen Anderson	f98c2ea49d	Add a new MVT::untyped. This will be used in future work for modelling ISA features like register pairs and lists with "interesting" constraints (such as ARM NEON contiguous register lists or even-odd paired registers). We need to be able to generate these instructions (often from intrinsics), but don't want to have to assign a legal type to them. Instead, we'll use an "untyped" edge to bypass the type-checking and simply ensure that the register classes match. llvm-svn: 133106	2011-06-15 23:35:18 +00:00
Andrew Trick	ce93f28a36	Added -stress-sched flag in the Asserts build. Added a test case for handling physreg aliases during pre-RA-sched. llvm-svn: 133063	2011-06-15 17:16:12 +00:00
Bruno Cardoso Lopes	b6afc5168f	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Bill Wendling	0f5a6fb66c	Reformatting. Moving class definitions to more natural places. No functionalogical changes. llvm-svn: 132876	2011-06-11 11:37:49 +00:00
Cameron Zwarich	a54eaeb7ae	Provide an ARMCCState subclass of CCState so that ARM clients will always set CallOrPrologue correctly and eliminate the existing setter. llvm-svn: 132856	2011-06-10 20:59:24 +00:00
Cameron Zwarich	7f353f2163	Rename the ParmContext enum values to make a bit more sense and add a small comment on their meaning. llvm-svn: 132854	2011-06-10 20:37:36 +00:00
Cameron Zwarich	cc21e3fc58	Remove tabs. llvm-svn: 132853	2011-06-10 20:31:39 +00:00
Eric Christopher	1ae9ec6124	Add a parameter to CCState so that it can access the MachineFunction. No functional change. Part of PR6965 llvm-svn: 132763	2011-06-08 23:55:35 +00:00
Lang Hames	642b95ac13	Switched to DenseMap for allowed sets in PBQP. Reduces total LLC time by 15% on CINT2006 for x86-32. llvm-svn: 132707	2011-06-07 06:05:58 +00:00
Devang Patel	7b9fc618b2	Remove dead code. llvm-svn: 132488	2011-06-02 21:31:00 +00:00
Chad Rosier	945a780b4e	Typos. llvm-svn: 132437	2011-06-01 23:32:40 +00:00
Charles Davis	6702c786ed	When generating code for Win64 EH, emit StartProc and EndProc directives. llvm-svn: 132250	2011-05-28 04:21:04 +00:00
Rafael Espindola	2230168a0f	Make size computation less brittle. llvm-svn: 132222	2011-05-27 22:05:41 +00:00
Charles Davis	cb20ea9935	Add the suffix to the Win64 EH data sections' names if given. Add a test for this. XFAIL'd, because the COFF AsmParser can't handle .section yet. llvm-svn: 132220	2011-05-27 21:38:47 +00:00
Charles Davis	f835c87c83	Add a parameter to the Win64 EH section getters to get a section with a suffix (e.g. .xdata$myfunc). The suffix part isn't implemented yet, but I'll get to it in the next patch. Fix up all callers of the affected functions. Make them pass said suffix to the function. llvm-svn: 132205	2011-05-27 19:09:24 +00:00
Eric Christopher	94fbcd8d81	Comment cleanup. llvm-svn: 132162	2011-05-26 22:54:27 +00:00
Devang Patel	0b44360610	Remove dead code. llvm-svn: 131974	2011-05-24 18:27:52 +00:00
Charles Davis	989cc73ef3	Add .pdata and .xdata sections to the COFF TLOF implementation. llvm-svn: 131763	2011-05-20 22:13:55 +00:00
Jim Grosbach	5b691ebabf	Frame indices are signed. Update MachineOperand methods accordingly. llvm-svn: 131475	2011-05-17 18:29:21 +00:00
Jakob Stoklund Olesen	16f11212fc	Teach LiveInterval::isZeroLength about null SlotIndexes. When instructions are deleted, they leave tombstone SlotIndex entries. The isZeroLength method should ignore these null indexes. This causes RABasic to sometimes spill a callee-saved register in the abi-isel.ll test, so don't run that test with -regalloc=basic. Prioritizing register allocation according to spill weight can cause more registers to be used. llvm-svn: 131436	2011-05-16 23:50:05 +00:00
Dan Gohman	9a55240376	Delete unused variables. llvm-svn: 131430	2011-05-16 22:19:54 +00:00
Eli Friedman	cb60e2293f	Make fast-isel work correctly s/uadd.with.overflow intrinsics. llvm-svn: 131420	2011-05-16 21:06:17 +00:00
Eli Friedman	5f1b7e4153	Basic fast-isel of extractvalue. Not too helpful on its own, given the IR clang generates for cases like this, but it should become more useful soon. llvm-svn: 131417	2011-05-16 20:27:46 +00:00
Evan Cheng	5ff60c7364	Re-commit 131172 with fix. MachineInstr identity checks should check dead markers. In some cases a register def is dead on one path, but not on another. This is passing Clang self-hosting. llvm-svn: 131214	2011-05-12 00:56:58 +00:00
Bill Wendling	68a363dd78	Fix comment. llvm-svn: 131173	2011-05-11 01:08:39 +00:00
Rafael Espindola	92dc58fea6	Use .cfi_sections to put the unwind info in .debug_frame when possible. With this clang will use .debug_frame in, for example, clang -g -c -m32 test.c This matches gcc's behaviour. It looks like .debug_frame is a bit bigger than .eh_frame, but has the big advantage of not being allocated. llvm-svn: 131140	2011-05-10 18:39:09 +00:00
Rafael Espindola	ef239d2fdc	Yet more dead code. llvm-svn: 130988	2011-05-06 15:31:55 +00:00
Rafael Espindola	4228ece8fd	Update comments. llvm-svn: 130987	2011-05-06 15:28:56 +00:00
Rafael Espindola	9b57a8739e	More dead code elimination. llvm-svn: 130985	2011-05-06 15:22:26 +00:00
Owen Anderson	35f6bae989	Allow FastISel of three-register-operand instructions. llvm-svn: 130934	2011-05-05 17:59:04 +00:00
Chandler Carruth	c83eb00361	Remove an unused variable in NDEBUG (found with -Wunused-variable). llvm-svn: 130688	2011-05-02 05:49:01 +00:00
Jakob Stoklund Olesen	e0ec7ed462	Add a SlotIndexes::insertMachineInstrInMaps to insert the instruction after any null indexes. This makes a difference if a live interval is referring to a deleted instruction. It can be important to insert an instruction before or after a deleted instruction to avoid interference. llvm-svn: 130686	2011-05-02 05:29:56 +00:00
Rafael Espindola	eb5d0cb4f4	GCC uses a different encoding of pointers in the FDE when using -fno-dwarf2-cfi-asm. Implement the same behavior. llvm-svn: 130637	2011-05-01 04:49:54 +00:00
Jakob Stoklund Olesen	4ec9e1c33a	Avoid using stale entries form the sibling value map. This could happen when trying to use a value that had been eliminated after dead code elimination and folding loads. llvm-svn: 130597	2011-04-30 06:42:21 +00:00
Rafael Espindola	16b23d9ff7	Factor some code to needsCFIMoves. Avoid printing moves when we don't have to. llvm-svn: 130501	2011-04-29 14:14:06 +00:00
Chris Lattner	52b19aa2b5	add a missing operator that caused us to have to use (*MIB).foo everywhere. llvm-svn: 130473	2011-04-29 05:24:07 +00:00
Devang Patel	900ceb725b	Teach dwarf writer to handle complex address expression for .debug_loc entries. This fixes clang generated blocks' variables' debug info. Radar 9279956. llvm-svn: 130373	2011-04-28 02:22:40 +00:00
Rafael Espindola	36e419b524	Remove unnecessary argument. llvm-svn: 130343	2011-04-27 23:17:57 +00:00
Rafael Espindola	0525497a16	Rename getPersonalityPICSymbol to getCFIPersonalitySymbol, document it, and give it a bit more responsibility. Also implement it for MachO. If hacked to use cfi, 32 bit MachO will produce .cfi_personality 155, L___gxx_personality_v0$non_lazy_ptr and 64 bit will produce .cfi_presonality ___gxx_personality_v0 The general idea is that .cfi_personality gets passed the final symbol. It is up to codegen to produce it if using indirect representation (like 32 bit MachO), but it is up to MC to decide which relocations to create. llvm-svn: 130341	2011-04-27 23:08:15 +00:00
Eli Friedman	c5406cdb50	Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common. rdar://problem/9303592 . llvm-svn: 130338	2011-04-27 22:41:55 +00:00
Eli Friedman	fc1152d772	Remove unused function. llvm-svn: 130337	2011-04-27 22:21:02 +00:00
Devang Patel	42f4a7ff92	Revert r130178. It turned out to be not the optimal path to emit complex location expressions. llvm-svn: 130326	2011-04-27 20:29:27 +00:00
Evan Cheng	dea3347167	Be careful about scheduling nodes above previous calls. It increase usages of more callee-saved registers and introduce copies. Only allows it if scheduling a node above calls would end up lessen register pressure. Call operands also has added ABI restrictions for register allocation, so be extra careful with hoisting them above calls. rdar://9329627 llvm-svn: 130245	2011-04-26 21:31:35 +00:00
Jakob Stoklund Olesen	c9cf507d93	Use the new TRI->getLargestLegalSuperClass hook to constrain register class inflation. This has two effects: 1. We never inflate to a larger register class than what the sub-target can handle. 2. Completely unconstrained virtual registers get the largest possible register class. llvm-svn: 130229	2011-04-26 18:52:36 +00:00
Devang Patel	4969322bc4	Let dwarf writer allocate extra space in the debug location expression. This space, if requested, will be used for complex addresses of the Blocks' variables. llvm-svn: 130178	2011-04-26 00:12:46 +00:00
Jay Foad	c146569beb	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Owen Anderson	e1b33b92a3	Teach FastISel to deal with instructions that have two immediate operands. llvm-svn: 130033	2011-04-22 23:38:06 +00:00
Eric Christopher	4de9ef5cf7	Fix comment. llvm-svn: 130027	2011-04-22 23:08:45 +00:00
Chris Lattner	d9c0db9bd7	Recommit the fix for rdar://9289512 with a couple tweaks to fix bugs exposed by the gcc dejagnu testsuite: 1. The load may actually be used by a dead instruction, which would cause an assert. 2. The load may not be used by the current chain of instructions, and we could move it past a side-effecting instruction. Change how we process uses to define the problem away. llvm-svn: 130018	2011-04-22 21:59:37 +00:00
Devang Patel	4f25432e4e	Refactor. llvm-svn: 129938	2011-04-21 21:07:35 +00:00
Daniel Dunbar	3a96439b36	Revert r1296656, "Fix rdar://9289512 - not folding load into compare at -O0...", which broke a couple GCC test suite tests at -O0. llvm-svn: 129914	2011-04-21 16:14:46 +00:00
Stuart Hastings	a552942e02	ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> llvm-svn: 129858	2011-04-20 16:47:52 +00:00
Chris Lattner	5e00f501ff	Fix rdar://9289512 - not folding load into compare at -O0 The basic issue here is that bottom-up isel is matching the branch and compare, and was failing to fold the load into the branch/compare combo. Fixing this (by allowing folding into any instruction of a sequence that is selected) allows us to produce things like: cmpb $0, 52(%rax) je LBB4_2 instead of: movb 52(%rax), %cl cmpb $0, %cl je LBB4_2 This makes the generated -O0 code run a bit faster, but also speeds up compile time by putting less pressure on the register allocator and generating less code. This was one of the biggest classes of missing load folding. Implementing this shrinks 176.gcc's c-decl.s (as a random example) by about 4% in (verbose-asm) line count. llvm-svn: 129656	2011-04-17 06:35:44 +00:00
Rafael Espindola	9e5aaa3b78	Put each personality function in a section. This fixes the gnu ld warning: error in foo.o; no .eh_frame_hdr table will be created. llvm-svn: 129635	2011-04-16 03:51:21 +00:00
Rafael Espindola	694ad2f25c	Some refactoring suggested by Anton Korobeynikov. llvm-svn: 129600	2011-04-15 20:32:03 +00:00
Rafael Espindola	99831068c8	Add 129518 back with a fix for when we are producing eh just because of debug info. Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129571	2011-04-15 15:11:06 +00:00
Chris Lattner	0304b82f80	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
NAKAMURA Takumi	7aed456653	Revert r129518, "Change ELF systems to use CFI for producing the EH tables. This reduces the" It broke several builds. llvm-svn: 129557	2011-04-15 03:35:57 +00:00
Rafael Espindola	d5eed657e2	Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129518	2011-04-14 15:18:53 +00:00
Andrew Trick	e89c19ab7b	In the pre-RA scheduler, maintain cmp+br proximity. This is done by pushing physical register definitions close to their use, which happens to handle flag definitions if they're not glued to the branch. This seems to be generally a good thing though, so I didn't need to add a target hook yet. The primary motivation is to generate code closer to what people expect and rule out missed opportunity from enabling macro-op fusion. As a side benefit, we get several 2-5% gains on x86 benchmarks. There is one regression: SingleSource/Benchmarks/Shootout/lists slows down be -10%. But this is an independent scheduler bug that will be tracked separately. See rdar://problem/9283108. Incidentally, pre-RA scheduling is only half the solution. Fixing the later passes is tracked by: <rdar://problem/8932804> [pre-RA-sched] on x86, attempt to schedule CMP/TEST adjacent with condition jump Fixes: <rdar://problem/9262453> Scheduler unnecessary break of cmp/jump fusion llvm-svn: 129508	2011-04-14 05:15:06 +00:00
Jay Foad	b215137166	Like the coding standards say, do not use "using namespace std". llvm-svn: 129435	2011-04-13 12:46:01 +00:00
Jakob Stoklund Olesen	a1ffedb740	Skip a binary search when possible. llvm-svn: 129293	2011-04-11 20:01:44 +00:00
Jakob Stoklund Olesen	5add6d16b7	Build the Hopfield network incrementally when splitting global live ranges. It is common for large live ranges to have few basic blocks with register uses and many live-through blocks without any uses. This approach grows the Hopfield network incrementally around the use blocks, completely avoiding checking interference for some through blocks. llvm-svn: 129188	2011-04-09 02:59:09 +00:00
Andrew Trick	36a1759769	Added a check in the preRA scheduler for potential interference on a induction variable. The preRA scheduler is unaware of induction vars, so we look for potential "virtual register cycles" instead. Fixes <rdar://problem/8946719> Bad scheduling prevents coalescing llvm-svn: 129100	2011-04-07 19:54:57 +00:00
Jakob Stoklund Olesen	731b0d77a2	Use std::unique instead of a SmallPtrSet to ensure unique instructions in UseSlots. This allows us to always keep the smaller slot for an instruction which is what we want when a register has early clobber defines. Drop the UsingInstrs set and the UsingBlocks map. They are no longer needed. llvm-svn: 128886	2011-04-05 15:18:18 +00:00
Jakob Stoklund Olesen	65c8f18b8d	Cache the fairly expensive last split point computation and provide a fast inlined path for the common case. Most basic blocks don't contain a call that may throw, so the last split point os simply the first terminator. llvm-svn: 128874	2011-04-05 04:20:27 +00:00
Jakob Stoklund Olesen	78d65c6632	Stop caching basic block index ranges now that SlotIndexes can keep up. llvm-svn: 128821	2011-04-04 15:32:15 +00:00
Jakob Stoklund Olesen	024a1de4ae	Use basic block numbers as indexes when mapping slot index ranges. This is more compact and faster than using DenseMap. llvm-svn: 128763	2011-04-02 06:03:31 +00:00
Evan Cheng	39574b2766	Issue libcalls __udivmodi4 / __divmodi4 for div / rem pairs. rdar://8911343 llvm-svn: 128696	2011-04-01 00:42:02 +00:00
Jakob Stoklund Olesen	446412de55	Collect and coalesce DBG_VALUE instructions before emitting the function. Correctly terminate the range of register DBG_VALUEs when the register is clobbered or when the basic block ends. The code is now ready to deal with variables that are sometimes in a register and sometimes on the stack. We just need to teach emitDebugLoc to say 'stack slot'. llvm-svn: 128327	2011-03-26 02:19:36 +00:00
Evan Cheng	9d660cc8b7	Add comment to clarify what MachineConstantPoolEntry::isMachineConstantPoolEntry() means. llvm-svn: 128204	2011-03-24 06:28:45 +00:00
Jakob Stoklund Olesen	047a25b0b0	Dead code elimination may separate the live interval into multiple connected components. I have convinced myself that it can only happen when a phi value dies. When it happens, allocate new virtual registers for the components. llvm-svn: 127827	2011-03-17 20:37:07 +00:00
Jakob Stoklund Olesen	2786187b43	Rewrite instructions as part of ConnectedVNInfoEqClasses::Distribute. llvm-svn: 127779	2011-03-17 00:23:45 +00:00
Jakob Stoklund Olesen	29a9539e7f	Place context in member variables instead of passing around pointers. Use the opportunity to get rid of the trailing underscore variable names. llvm-svn: 127618	2011-03-14 20:57:14 +00:00
Owen Anderson	78afadfa5d	Teach FastISel to support register-immediate-immediate instructions. llvm-svn: 127496	2011-03-11 21:33:55 +00:00
Jim Grosbach	39476d9010	80 columns. llvm-svn: 127495	2011-03-11 21:02:27 +00:00
Jim Grosbach	4ed235527d	Trailing whitespace. llvm-svn: 127493	2011-03-11 20:59:19 +00:00
Jakob Stoklund Olesen	70541686bf	Make SpillIs an optional pointer. Avoid creating a bunch of temporary SmallVectors. llvm-svn: 127388	2011-03-10 01:21:58 +00:00
Jakob Stoklund Olesen	f9401745e0	Let shrinkToUses optionally return a list of now dead machine instructions. llvm-svn: 127192	2011-03-07 23:29:10 +00:00

... 2 3 4 5 6 ...

3786 Commits