llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Rafael Espindola	6e5ce4f5db	Fix a lot of confusion around inserting nops on empty functions. On MachO, and MachO only, we cannot have a truly empty function since that breaks the linker logic for atomizing the section. When we are emitting a frame pointer, the presence of an unreachable will create a cfi instruction pointing past the last instruction. This is perfectly fine. The FDE information encodes the pc range it applies to. If some tool cannot handle this, we should explicitly say which bug we are working around and only work around it when it is actually relevant (not for ELF for example). Given the unreachable we could omit the .cfi_def_cfa_register, but then again, we could also omit the entire function prologue if we wanted to. llvm-svn: 217801	2014-09-15 18:32:58 +00:00
Chad Rosier	7f50169e13	[AArch64] Improve AA to remove unneeded edges in the AA MI scheduling graph. Patch by Sanjin Sijaric <ssijaric@codeaurora.org>! Phabricator Review: http://reviews.llvm.org/D5103 llvm-svn: 217371	2014-09-08 14:43:48 +00:00
Benjamin Kramer	e991977346	Add override to overriden virtual methods, remove virtual keywords. No functionality change. Changes made by clang-tidy + some manual cleanup. llvm-svn: 217028	2014-09-03 11:41:21 +00:00
Pete Cooper	92fc86558d	Change MCSchedModel to be a struct of statically initialized data. This removes static initializers from the backends which generate this data, and also makes this struct match the other Tablegen generated structs in behaviour Reviewed by Andy Trick and Chandler C llvm-svn: 216919	2014-09-02 17:43:54 +00:00
Quentin Colombet	1849edbdf6	Add isInsertSubreg property. This patch adds a new property: isInsertSubreg and the related target hooks: TargetIntrInfo::getInsertSubregInputs and TargetInstrInfo::getInsertSubregLikeInputs to specify that a target specific instruction is a (kind of) INSERT_SUBREG. The approach is similar to r215394. <rdar://problem/12702965> llvm-svn: 216139	2014-08-20 23:49:36 +00:00
Quentin Colombet	b02f26b10f	Add isExtractSubreg property. This patch adds a new property: isExtractSubreg and the related target hooks: TargetIntrInfo::getExtractSubregInputs and TargetInstrInfo::getExtractSubregLikeInputs to specify that a target specific instruction is a (kind of) EXTRACT_SUBREG. The approach is similar to r215394. <rdar://problem/12702965> llvm-svn: 216130	2014-08-20 21:51:26 +00:00
Quentin Colombet	022fe32e53	Add isRegSequence property. This patch adds a new property: isRegSequence and the related target hooks: TargetIntrInfo::getRegSequenceInputs and TargetInstrInfo::getRegSequenceLikeInputs to specify that a target specific instruction is a (kind of) REG_SEQUENCE. <rdar://problem/12702965> llvm-svn: 215394	2014-08-11 22:17:14 +00:00
NAKAMURA Takumi	3f092f7493	TargetInstrInfo::genAlternativeCodeSequence(): Fix a couple of \param(s). [-Wdocumentation] llvm-svn: 214708	2014-08-04 10:23:22 +00:00
Gerolf Hoflehner	b4a5e33ee0	MachineCombiner Pass for selecting faster instruction sequence - target independent framework When the DAGcombiner selects instruction sequences it could increase the critical path or resource len. For example, on arm64 there are multiply-accumulate instructions (madd, msub). If e.g. the equivalent multiply-add sequence is not on the crictial path it makes sense to select it instead of the combined, single accumulate instruction (madd/msub). The reason is that the conversion from add+mul to the madd could lengthen the critical path by the latency of the multiply. But the DAGCombiner would always combine and select the madd/msub instruction. This patch uses machine trace metrics to estimate critical path length and resource length of an original instruction sequence vs a combined instruction sequence and picks the faster code based on its estimates. This patch only commits the target independent framework that evaluates and selects code sequences. The machine instruction combiner is turned off for all targets and expected to evolve over time by gradually handling DAGCombiner pattern in the target specific code. This framework lays the groundwork for fixing rdar://16319955 llvm-svn: 214666	2014-08-03 21:35:39 +00:00
Jiangning Liu	aec69df4ca	Add TargetInstrInfo interface isAsCheapAsAMove. llvm-svn: 214158	2014-07-29 01:55:19 +00:00
Eric Christopher	2ea2870f36	The hazard recognizer only needs a subtarget, not a target machine so make it take one. Fix up all users accordingly. llvm-svn: 210948	2014-06-13 22:38:52 +00:00
Tom Roeder	740d86dc79	Add a new attribute called 'jumptable' that creates jump-instruction tables for functions marked with this attribute. It includes a pass that rewrites all indirect calls to jumptable functions to pass through these tables. This also adds backend support for generating the jump-instruction tables on ARM and X86. Note that since the jumptable attribute creates a second function pointer for a function, any function marked with jumptable must also be marked with unnamed_addr. llvm-svn: 210280	2014-06-05 19:29:43 +00:00
Craig Topper	30281a67fb	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206142	2014-04-14 00:51:57 +00:00
Andrew Trick	bd486c29f4	Added a size field to the stack map record to handle subregister spills. Implementing this on bigendian platforms could get strange. I added a target hook, getStackSlotRange, per Jakob's recommendation to make this as explicit as possible. llvm-svn: 194942	2013-11-17 01:36:23 +00:00
Andrew Trick	b65138d3af	Fix the ExecutionDepsFix pass to handle AVX instructions. This pass is needed to break false dependencies. Without it, unlucky register assignment can result in wild (5x) swings in performance. This pass was trying to handle AVX but not getting it right. AVX doesn't have partial register defs, it has unused register reads in which the high bits of a source operand are copied into the unused bits of the dest. Fixing this requires conservative liveness analysis. This is awkard because the pass already has its own pseudo-liveness. However, proper liveness is expensive, and we would like to use a generic utility to compute it. The fix only invokes liveness on-demand. It is rare to detect a case that needs undef-read dependence breaking, but when it happens, it can be needed many times within a very large block. I think the existing heuristic which uses a register window of 16 is too conservative for loop-carried false dependencies. If the loop is a reduction. The out-of-order engine may be able to execute several loop iterations in parallel. However, I'll leave this tuning exercise for next time. llvm-svn: 192635	2013-10-14 22:19:03 +00:00
Arnold Schwaighofer	47322176be	IfConverter: Use TargetSchedule for instruction latencies For targets that have instruction itineraries this means no change. Targets that move over to the new schedule model will use be able the new schedule module for instruction latencies in the if-converter (the logic is such that if there is no itineary we will use the new sched model for the latencies). Before, we queried "TTI->getInstructionLatency()" for the instruction latency and the extra prediction cost. Now, we query the TargetSchedule abstraction for the instruction latency and TargetInstrInfo for the extra predictation cost. The TargetSchedule abstraction will internally call "TTI->getInstructionLatency" if an itinerary exists, otherwise it will use the new schedule model. ATTENTION: Out of tree targets! (I will also send out an email later to LLVMDev) This means, if your target implements unsigned getInstrLatency(const InstrItineraryData ItinData, const MachineInstr MI, unsigned PredCost); and returns a value for "PredCost", you now also need to implement unsigned getPredictationCost(const MachineInstr MI); (if your target uses the IfConversion.cpp pass) radar://15077010 llvm-svn: 191671	2013-09-30 15:28:56 +00:00
Andrew Trick	8cb647b5c8	mi-sched: Load clustering is a bit to expensive to enable unconditionally. llvm-svn: 189990	2013-09-04 21:00:08 +00:00
Richard Sandiford	c0fe83c1b6	[SystemZ] Remove no-op MVCs The stack coloring pass has code to delete stores and loads that become trivially dead after coloring. Extend it to cope with single instructions that copy from one frame index to another. The testcase happens to show an example of this kicking in at the moment. It did occur in Real Code too though. llvm-svn: 185705	2013-07-05 14:38:48 +00:00
David Blaikie	813e6b3974	DebugInfo: remove target-specific Frame Index handling for DBG_VALUE MachineInstrs Frame index handling is now target-agnostic, so delete the target hooks for creation & asm printing of target-specific addressing in DBG_VALUEs and any related functions. llvm-svn: 184067	2013-06-16 20:34:27 +00:00
Andrew Trick	5d13fe97ed	Machine Model: Add MicroOpBufferSize and resource BufferSize. Replace the ill-defined MinLatency and ILPWindow properties with with straightforward buffer sizes: MCSchedMode::MicroOpBufferSize MCProcResourceDesc::BufferSize These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. llvm-svn: 184032	2013-06-15 04:49:57 +00:00
Hal Finkel	1f0b02ddbe	Add a comment to TargetInstrInfo about FoldImmediate This comment documents the current behavior of the ARM implementation of this callback, and also the soon-to-be-committed PPC version. llvm-svn: 178959	2013-04-06 19:30:20 +00:00
Jakob Stoklund Olesen	04ea4c6a39	Clean up some confusing language, and use more realistic examples. llvm-svn: 178828	2013-04-05 01:25:41 +00:00
Chandler Carruth	ca305491f6	Sort the #include lines for the include/... tree with the script. AKA: Recompile ALL the source code! This one went much better. No manual edits here. I spot-checked for silliness and grep-checked for really broken edits and everything seemed good. It all still compiles. Yell if you see something that looks goofy. llvm-svn: 169133	2012-12-03 17:02:12 +00:00
Jakob Stoklund Olesen	79c1e7f5af	Remove all references to TargetInstrInfoImpl. This class has been merged into its super-class TargetInstrInfo. llvm-svn: 168760	2012-11-28 02:35:17 +00:00
Jakob Stoklund Olesen	43c340f4df	Move the guts of TargetInstrInfoImpl into the TargetInstrInfo class. The *Impl class no longer serves a purpose now that the super-class implementation is in CodeGen. llvm-svn: 168759	2012-11-28 02:35:13 +00:00
Benjamin Kramer	9cc1f23013	Work around a layering violation from Target to CodeGen. Technically this is still a layering violation but it's header-only which makes it less harmful. No functionality change. llvm-svn: 168173	2012-11-16 17:32:33 +00:00
Andrew Trick	011684e920	misched: rename interfaceto avoid gcc warnings llvm-svn: 167753	2012-11-12 21:28:10 +00:00
Andrew Trick	fcdfc4cdf8	misched: Target-independent support for MacroFusion. Uses the infrastructure from r167742 to support clustering instructure that the target processor can "fuse". e.g. cmp+jmp. Next step: target hook implementations with test cases, and enable. llvm-svn: 167744	2012-11-12 19:52:20 +00:00
Andrew Trick	7f7b5f4787	misched: Target-independent support for load/store clustering. This infrastructure is generally useful for any target that wants to strongly prefer two instructions to be adjacent after scheduling. A following checkin will add target-specific hooks with unit tests. Then this feature will be enabled by default with misched. llvm-svn: 167742	2012-11-12 19:40:10 +00:00
Andrew Trick	4ca94d939c	misched: Use the TargetSchedModel interface wherever possible. Allows the new machine model to be used for NumMicroOps and OutputLatency. Allows the HazardRecognizer to be disabled along with itineraries. llvm-svn: 165603	2012-10-10 05:43:09 +00:00
Andrew Trick	e3e6fae309	TargetSchedModel API. Implement latency lookup, disabled. llvm-svn: 164098	2012-09-18 04:03:34 +00:00
Andrew Trick	150c97940b	Revert r164061-r164067. Most of the new subtarget emitter. I have to work out the Target/CodeGen header dependencies before putting this back. llvm-svn: 164072	2012-09-17 23:00:42 +00:00
Andrew Trick	3d3f796f3f	TargetSchedModel API. Implement latency lookup, disabled. llvm-svn: 164065	2012-09-17 22:19:08 +00:00
Craig Topper	ef3c8f8807	Mark unimplemented copy constructors and copy assignment operators as LLVM_DELETED_FUNCTION. llvm-svn: 164016	2012-09-17 06:59:23 +00:00
Jakob Stoklund Olesen	8f5a57191e	Add a bit of documentation to copyPhysReg. llvm-svn: 162879	2012-08-29 23:52:55 +00:00
Andrew Trick	2a785149f0	Simplify the computeOperandLatency API. The logic for recomputing latency based on a ScheduleDAG edge was shady. This bypasses the problem by requiring the client to provide operand indices. This ensures consistent use of the machine model's API. llvm-svn: 162420	2012-08-23 00:39:43 +00:00
Jakob Stoklund Olesen	babff4afdb	Add an MCID::Select flag and TII hooks for optimizing selects. Select instructions pick one of two virtual registers based on a condition, like x86 cmov. On targets like ARM that support predication, selects can sometimes be eliminated by predicating the instruction defining one of the operands. Teach PeepholeOptimizer to recognize select instructions, and ask the target to optimize them. llvm-svn: 162059	2012-08-16 23:11:47 +00:00
Jakob Stoklund Olesen	33e364a3df	Remove the TII::scheduleTwoAddrSource() hook. It never does anything when running 'make check', and it get's in the way of updating live intervals in 2-addr. The hook was originally added to help form IT blocks in Thumb2 code before register allocation, but the pass ordering has changed since then, and we run if-conversion after register allocation now. When the MI scheduler is enabled, there will be no less than two schedulers between 2-addr and Thumb2ITBlockPass, so this hook is unlikely to help anything. llvm-svn: 161794	2012-08-13 21:52:57 +00:00
Andrew Trick	79c6fbc97e	Minor cleanup of defaultDefLatency API llvm-svn: 161470	2012-08-08 02:44:11 +00:00
Manman Ren	2c236a30f6	X86 Peephole: fold loads to the source register operand if possible. Add more comments and use early returns to reduce nesting in isLoadFoldable. Also disable folding for V_SET0 to avoid introducing a const pool entry and a const pool load. rdar://10554090 and rdar://11873276 llvm-svn: 161207	2012-08-02 19:37:32 +00:00
Manman Ren	78b8d454cc	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. This patch is a rework of r160919 and was tested on clang self-host on my local machine. rdar://10554090 and rdar://11873276 llvm-svn: 161152	2012-08-02 00:56:42 +00:00
Manman Ren	ceef7c4d9b	Revert r160920 and r160919 due to dragonegg and clang selfhost failure llvm-svn: 160927	2012-07-29 02:44:09 +00:00
Manman Ren	ea77f9076b	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. rdar://10554090 and rdar://11873276 llvm-svn: 160919	2012-07-28 16:48:01 +00:00
Jakob Stoklund Olesen	db187a51eb	Add an experimental early if-conversion pass, off by default. This pass performs if-conversion on SSA form machine code by speculatively executing both sides of the branch and using a cmov instruction to select the result. This can help lower the number of branch mispredictions on architectures like x86 that don't have predicable instructions. The current implementation is very aggressive, and causes regressions on mosts tests. It needs good heuristics that have yet to be implemented. llvm-svn: 159694	2012-07-04 00:09:54 +00:00
Andrew Trick	baf8a62800	Reapply "Make NumMicroOps a variable in the subtarget's instruction itinerary." Reapplies r159406 with minor cleanup. The regressions appear to have been spurious. llvm-svn: 159541	2012-07-02 18:10:42 +00:00
Manman Ren	125c1ee4e9	Add SrcReg2 to analyzeCompare and optimizeCompareInstr to handle Compare instructions with two register operands. llvm-svn: 159465	2012-06-29 21:33:59 +00:00
Andrew Trick	251f64f946	Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." This reverts commit r159406. I noticed a performance regression so I'll back out for now. llvm-svn: 159411	2012-06-29 07:10:41 +00:00
Andrew Trick	52238a0ce5	Make NumMicroOps a variable in the subtarget's instruction itinerary. The TargetInstrInfo::getNumMicroOps API does not change, but soon it will be used by MachineScheduler. Now each subtarget can specify the number of micro-ops per itinerary class. For ARM, this is currently always dynamic (-1), because it is used for load/store multiple which depends on the number of register operands. Zero is now a valid number of micro-ops. This can be used for nop pseudo-instructions or instructions that the hardware can squash during dispatch. llvm-svn: 159406	2012-06-29 03:23:18 +00:00
Kay Tiong Khoo	b631f7fd59	*typo: Cyles changed to Cycles llvm-svn: 158404	2012-06-13 15:53:04 +00:00
Andrew Trick	a1df722f41	Removing strange "using" declarations form TargetInstrInfo. I can't imagine why these were added. Trial and error. llvm-svn: 158247	2012-06-08 23:56:26 +00:00

1 2 3 4 5 ...

382 Commits