llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Chandler Carruth	87f14b4eec	Re-sort all of the includes with ./utils/sort_includes.py so that subsequent changes are easier to review. About to fix some layering issues, and wanted to separate out the necessary churn. Also comment and sink the include of "Windows.h" in three .inc files to match the usage in Memory.inc. llvm-svn: 198685	2014-01-07 11:48:04 +00:00
Quentin Colombet	23080225fa	[RegAlloc] Make tryInstructionSplit less aggressive. The greedy register allocator tries to split a live-range around each instruction where it is used or defined to relax the constraints on the entire live-range (this is a last chance split before falling back to spill). The goal is to have a big live-range that is unconstrained (i.e., that can use the largest legal register class) and several small local live-range that carry the constraints implied by each instruction. E.g., Let csti be the constraints on operation i. V1= op1 V1(cst1) op2 V1(cst2) V1 live-range is constrained on the intersection of cst1 and cst2. tryInstructionSplit relaxes those constraints by aggressively splitting each def/use point: V1= V2 = V1 V3 = V2 op1 V3(cst1) V4 = V2 op2 V4(cst2) Because of how the coalescer infrastructure works, each new variable (V3, V4) that is alive at the same time as V1 (or its copy, here V2) interfere with V1. Thus, we end up with an uncoalescable copy for each split point. To make tryInstructionSplit less aggressive, we check if the split point actually relaxes the constraints on the whole live-range. If it does not, we do not insert it. Indeed, it will not help the global allocation problem: - V1 will have the same constraints. - V1 will have the same interference + possibly the newly added split variable VS. - VS will produce an uncoalesceable copy if alive at the same time as V1. <rdar://problem/15570057> llvm-svn: 198369	2014-01-02 22:47:22 +00:00
Andrew Trick	cbd3ee31c7	Move the PostRA scheduler's fixupKills function for reuse. llvm-svn: 198121	2013-12-28 21:56:55 +00:00
Andrew Trick	8f70ae7740	Stub out a PostMachineScheduler pass. Placeholder and boilerplate for a PostRA MachineScheduler pass. llvm-svn: 198120	2013-12-28 21:56:51 +00:00
Andrew Trick	c4eb826f6c	Factor MI-Sched in preparation for post-ra scheduling support. Factor the MachineFunctionPass into MachineSchedulerBase. Split the DAG class into ScheduleDAGMI and SchedulerDAGMILive. llvm-svn: 198119	2013-12-28 21:56:47 +00:00
Craig Topper	723841a8b1	Revert part of r198115 to see if it fixes a buildbot failure. llvm-svn: 198118	2013-12-28 18:44:58 +00:00
Craig Topper	04690a3406	Mark some Type and EVT methods as LLVM_READONLY. llvm-svn: 198115	2013-12-28 16:17:26 +00:00
Eric Christopher	e5adfbcda0	Remove AsmPrinter::needsRelocationsForDwarfStringPool() since it's just calling into MAI and is only abstracting for a single interface that we actually need to check in multiple places. llvm-svn: 198092	2013-12-28 01:39:17 +00:00
Andrea Di Biagio	86fc6e8bd5	Teach DAGCombiner how to fold a SIGN_EXTEND_INREG of a BUILD_VECTOR of ConstantSDNodes (or UNDEFs) into a simple BUILD_VECTOR. For example, given the following sequence of dag nodes: i32 C = Constant<1> v4i32 V = BUILD_VECTOR C, C, C, C v4i32 Result = SIGN_EXTEND_INREG V, ValueType:v4i1 The SIGN_EXTEND_INREG node can be folded into a build_vector since the vector in input is a BUILD_VECTOR of constants. The optimized sequence is: i32 C = Constant<-1> v4i32 Result = BUILD_VECTOR C, C, C, C llvm-svn: 198084	2013-12-27 20:20:28 +00:00
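The fold described in the commit above can be illustrated outside of LLVM with a minimal sketch. The helper names `signExtendInReg` and `foldSignExtendInReg` are hypothetical and are not SelectionDAG API; the vector is modelled as a plain `std::array` of four 32-bit lanes.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Sign-extend the low `bits` bits of `value` to the full 32-bit width.
// Hypothetical helper for illustration; requires 1 <= bits <= 32.
int32_t signExtendInReg(int32_t value, unsigned bits) {
  if (bits >= 32) return value;
  const uint32_t mask = (uint32_t{1} << bits) - 1;
  const uint32_t signBit = uint32_t{1} << (bits - 1);
  const uint32_t low = static_cast<uint32_t>(value) & mask;
  // (low ^ signBit) - signBit copies the sign bit into all upper bits.
  return static_cast<int32_t>(low ^ signBit) - static_cast<int32_t>(signBit);
}

// Apply the extension to every constant lane, which is what folding
// SIGN_EXTEND_INREG into a constant BUILD_VECTOR amounts to.
std::array<int32_t, 4> foldSignExtendInReg(const std::array<int32_t, 4> &lanes,
                                           unsigned bits) {
  std::array<int32_t, 4> result{};
  for (std::size_t i = 0; i < lanes.size(); ++i)
    result[i] = signExtendInReg(lanes[i], bits);
  return result;
}
```

With `bits = 1`, the constant 1 in each lane becomes -1, matching the `Constant<1>` to `Constant<-1>` rewrite in the commit message.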
Roman Divacky	83d7fda017	Fix a typo. llvm-svn: 198045	2013-12-26 14:43:33 +00:00
Josh Magee	86d29cffa7	[stackprotector] Use analysis from the StackProtector pass for stack layout in PEI and LocalStackSlot passes. This changes the MachineFrameInfo API to use the new SSPLayoutKind information produced by the StackProtector pass (instead of a boolean flag) and updates a few pass dependencies (to preserve the SSP analysis). The stack layout follows the same approach used prior to this change - i.e., only LargeArray stack objects will be placed near the canary and everything else will be laid out normally. After this change, structures containing large arrays will also be placed near the canary - a case previously missed by the old implementation. Out of tree targets will need to update their usage of MachineFrameInfo::CreateStackObject to remove the MayNeedSP argument. The next patch will implement the rules for sspstrong and sspreq. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D2158 llvm-svn: 197653	2013-12-19 03:17:11 +00:00
Juergen Ributzka	24894afa71	[Stackmap] Refactor operand parsing. llvm-svn: 197329	2013-12-14 23:06:19 +00:00
Juergen Ributzka	d7df87c066	[Stackmap] Liveness Analysis Pass This optional register liveness analysis pass can be enabled with either -enable-stackmap-liveness, -enable-patchpoint-liveness, or both. The pass traverses each basic block in a machine function. For each basic block the instructions are processed in reversed order and if a patchpoint or stackmap instruction is encountered the current live-out register set is encoded as a register mask and attached to the instruction. Later on during stackmap generation the live-out register mask is processed and also emitted as part of the stackmap. This information is optional and intended for optimization purposes only. This will enable a client of the stackmap to reason about the registers it can use and which registers need to be preserved. Reviewed by Andy llvm-svn: 197317	2013-12-14 06:53:06 +00:00
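The backward traversal described in the commit above can be sketched on a toy basic block. This is only an illustration under stated assumptions: the `Instr` struct, the integer register numbers, and the `annotateStackmaps` function are hypothetical, and a `std::set` stands in for the register mask that the real pass attaches.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Toy instruction: an opcode name plus the registers it defines and uses.
struct Instr {
  std::string opcode;
  std::vector<int> defs;
  std::vector<int> uses;
  std::set<int> liveOut; // filled in only for "stackmap" instructions
};

// Walk the block in reverse, starting from the block's live-out set.
// Before an instruction is processed, `live` holds the registers that are
// live immediately after it, so that set is recorded at each stackmap.
void annotateStackmaps(std::vector<Instr> &block, std::set<int> live) {
  for (auto it = block.rbegin(); it != block.rend(); ++it) {
    if (it->opcode == "stackmap")
      it->liveOut = live;
    for (int reg : it->defs) live.erase(reg);  // a def ends liveness above it
    for (int reg : it->uses) live.insert(reg); // a use makes the reg live above
  }
}
```

A client reading the recorded set can tell which registers must be preserved across the stackmap and which are free to use.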
Juergen Ributzka	34b3d4d2cb	Convert register liveness tracking to work on a sub-register level instead of just register units. Reviewed by Andy llvm-svn: 197315	2013-12-14 06:52:56 +00:00
Michael Gottesman	94f6f09b57	[block-freq] Rename getEntryFrequency() -> getEntryFreq() to match getBlockFreq() in all BlockFrequencyInfo. llvm-svn: 197304	2013-12-14 02:37:38 +00:00
Michael Gottesman	d274826780	[block-freq] Refactor LiveIntervals::getSpillWeight to use the new MachineBlockFrequencyInfo methods. This is slightly more interesting than the previous batch of changes. Specifically: 1. We refactor getSpillWeight to take a MachineBlockFrequencyInfo (MBFI) object. This enables us to completely encapsulate the actual manner we use the MachineBlockFrequencyInfo to get our spill weights. This yields cleaner code since one does not need to fetch the actual block frequency before getting the spill weight if all one wants is the spill weight. It also gives us access to entry frequency which we need for our computation. 2. Instead of having getSpillWeight take a MachineBasicBlock (as one might think) to look up the block frequency via the MBFI object, we instead take in a MachineInstr object. The reason for this is that the method is supposed to return the spill weight for an instruction according to the comments around the function. llvm-svn: 197296	2013-12-14 00:53:32 +00:00
Matt Arsenault	9b762a4c61	Print the address space of a MachineMemOperand llvm-svn: 197288	2013-12-14 00:24:02 +00:00
Michael Gottesman	6052e0bc85	[block-freq] Add the equivalent methods to MachineBlockFrequencyInfo and BlockFrequencyInfo that were added to BlockFrequencyImpl in r197285 and r197284. llvm-svn: 197287	2013-12-14 00:06:03 +00:00
Andrew Trick	95fe9491c9	Revert "Convert liveness tracking to work on a sub-register level instead of just register units." This reverts commit r197253. This was a great change, but Juergen should be the commit author. llvm-svn: 197262	2013-12-13 19:04:08 +00:00
Andrew Trick	1157632f3d	Revert "Liveness Analysis Pass" This reverts commit r197254. This was an accidental merge of Juergen's patch. It will be checked in shortly, but wasn't meant to go in quite yet. Conflicts: include/llvm/CodeGen/StackMaps.h lib/CodeGen/StackMaps.cpp test/CodeGen/X86/stackmap-liveness.ll llvm-svn: 197260	2013-12-13 18:57:20 +00:00
Andrew Trick	e726cc0278	Grow the stackmap/patchpoint format to hold 64-bit IDs. llvm-svn: 197255	2013-12-13 18:37:10 +00:00
Andrew Trick	3b62606852	Liveness Analysis Pass llvm-svn: 197254	2013-12-13 18:37:03 +00:00
Andrew Trick	c8f768f93e	Convert liveness tracking to work on a sub-register level instead of just register units. llvm-svn: 197253	2013-12-13 18:36:56 +00:00
Hal Finkel	6840daed2f	Add isBarrier to SDep SDep had is* functions for the other kinds of order dependencies (isMustAlias, isWeak, isArtificial, etc.), but not for barrier. Upcoming commits in the PowerPC backend will make use of this function. llvm-svn: 197098	2013-12-12 00:19:07 +00:00
Hal Finkel	37cf25d396	Add two additional hazard recognizer functions This adds two additional functions to the hazard recognizer interface. These are optional (in the sense that the default implementations preserve the current behavior), and used by the post-RA scheduler. Upcoming commits will use this functionality in order to improve dispatch-group formation on the POWER7 and related cores. Dispatch groups are an odd construct: sometimes we need to insert nops to force a new one to start (for performance reasons), and some instructions need to appear in certain positions within a group, but the groups are not fundamentally cycle based (they can contain instructions with data dependencies with non-trivial latencies). Motivation: unsigned PreEmitNoops(SUnit ) - Used to force the post-RA scheduler to insert nops to force a new dispatch group to begin. We already have a NoopHazard, and this is also still needed. However, NoopHazard only causes a nop to be inserted if there are no other available instructions, and so is not always sufficient. The number of nops to insert depends on state that only the hazard recognizer has, so a general callback is necessary. bool ShouldPreferAnother(SUnit ) - Used to avoid scheduling instructions that would start a new dispatch group when others are available that could be part of the current dispatch group. In this case, we don't want to issue nops, because the non-preferred instruction will implicitly start a new dispatch group regardless. Although the motivation for these functions is driven by the PowerPC backend, they are completely general. llvm-svn: 197084	2013-12-11 22:33:43 +00:00
Reid Kleckner	b6a72325f3	Reland "Fix miscompile of MS inline assembly with stack realignment" This re-lands commit r196876, which was reverted in r196879. The tests have been fixed to pass on platforms with a stack alignment larger than 4. Update to clang side tests will land shortly. llvm-svn: 196939	2013-12-10 18:27:32 +00:00
Reid Kleckner	cb3c239850	Revert "Fix miscompile of MS inline assembly with stack realignment" This reverts commit r196876. Its tests failed on the bots, so I'll figure it out tomorrow. llvm-svn: 196879	2013-12-10 05:31:27 +00:00
Reid Kleckner	26454793b1	Fix miscompile of MS inline assembly with stack realignment For stack frames requiring realignment, three pointers may be needed: - ebp to address incoming arguments - esi (could be any callee-saved register) to address locals - esp to address outgoing arguments We would use esi unconditionally without verifying that it did not conflict with inline assembly. This change doesn't do the verification, it simply emits a fatal error on functions that use stack realignment, dynamic SP adjustments, and inline assembly. Because stack realignment is common on Windows, we also no longer assume that MS inline assembly clobbers esp. Instead, we analyze the inline instructions for implicit definitions and check if esp is there. If so, we require the use of a base pointer and consider it in the condition above. Mostly fixes PR16830, but we could try harder to find a non-conflicting base pointer. Reviewers: sunfish Differential Revision: http://llvm-reviews.chandlerc.com/D1317 llvm-svn: 196876	2013-12-10 05:12:23 +00:00
Jakub Staszak	11e1c882f7	Don't #include heavy Dominators.h file in LoopInfo.h. This change reduces overall time of LLVM compilation by ~1%. llvm-svn: 196667	2013-12-07 21:20:17 +00:00
Andrew Trick	2ac3f8f326	Factor out the SchedRemainder/SchedBoundary from GenericScheduler strategy. These helper classes take care of the book-keeping that drives the GenericScheduler heuristics. It is likely that developers writing target-specific schedulers that work similarly to GenericScheduler will want to use these helpers too. The immediate goal is to develop a GenericPostScheduler that can run in place of the old PostRAScheduler, but will use the new machine model. No functionality change intended. llvm-svn: 196643	2013-12-07 05:59:44 +00:00
Andrew Trick	7eb7f7648b	MI-Sched: Model "reserved" processor resources. This allows a target to use MI-Sched as an in-order scheduler that will model strict resource conflicts without defining a processor itinerary. Instead, the target can now use the new per-operand machine model and define in-order resources with BufferSize=0. For example, this would allow restricting the type of operations that can be formed into a dispatch group. (Normally NumMicroOps is sufficient to enforce dispatch groups). If the intent is to model latency in in-order pipeline, as opposed to resource conflicts, then a resource with BufferSize=1 should be defined instead. This feature is only casually tested as there are no in-tree targets using it yet. However, Hal will be experimenting with POWER7. llvm-svn: 196517	2013-12-05 17:56:02 +00:00
Andrew Trick	192311ab9a	MI-Sched: handle latency of in-order operations with the new machine model. The per-operand machine model allows the target to define "unbuffered" processor resources. This change is a quick, cheap way to model stalls caused by the latency of operations that use such resources. This only applies when the processor's micro-op buffer size is non-zero (Out-of-Order). We can't precisely model in-order stalls during out-of-order execution, but this is an easy and effective heuristic. It benefits cortex-a9 scheduling when using the new machine model, which is not yet on by default. MI-Sched for armv7 was evaluated on Swift (and only not enabled because of a performance bug related to predication). However, we never evaluated Cortex-A9 performance on MI-Sched in its current form. This change adds MI-Sched functionality to reach performance goals on A9. The only remaining change is to allow MI-Sched to run as a PostRA pass. I evaluated performance using a set of options to estimate the performance impact once MI sched is default on armv7: -mcpu=cortex-a9 -disable-post-ra -misched-bench -scheditins=false For a simple saxpy loop I see a 1.7x speedup. Here are the llvm-testsuite results: (min run time over 2 runs, filtering tiny changes)
Speedups:
| Benchmarks/BenchmarkGame/recursive | 52.39% |
| Benchmarks/VersaBench/beamformer | 20.80% |
| Benchmarks/Misc/pi | 19.97% |
| Benchmarks/Misc/mandel-2 | 19.95% |
| SPEC/CFP2000/188.ammp | 18.72% |
| Benchmarks/McCat/08-main/main | 18.58% |
| Benchmarks/Misc-C++/Large/sphereflake | 18.46% |
| Benchmarks/Olden/power | 17.11% |
| Benchmarks/Misc-C++/mandel-text | 16.47% |
| Benchmarks/Misc/oourafft | 15.94% |
| Benchmarks/Misc/flops-7 | 14.99% |
| Benchmarks/FreeBench/distray | 14.26% |
| SPEC/CFP2006/470.lbm | 14.00% |
| mediabench/mpeg2/mpeg2dec/mpeg2decode | 12.28% |
| Benchmarks/SmallPT/smallpt | 10.36% |
| Benchmarks/Misc-C++/Large/ray | 8.97% |
| Benchmarks/Misc/fp-convert | 8.75% |
| Benchmarks/Olden/perimeter | 7.10% |
| Benchmarks/Bullet/bullet | 7.03% |
| Benchmarks/Misc/mandel | 6.75% |
| Benchmarks/Olden/voronoi | 6.26% |
| Benchmarks/Misc/flops-8 | 5.77% |
| Benchmarks/Misc/matmul_f64_4x4 | 5.19% |
| Benchmarks/MiBench/security-rijndael | 5.15% |
| Benchmarks/Misc/flops-6 | 5.10% |
| Benchmarks/Olden/tsp | 4.46% |
| Benchmarks/MiBench/consumer-lame | 4.28% |
| Benchmarks/Misc/flops-5 | 4.27% |
| Benchmarks/mafft/pairlocalalign | 4.19% |
| Benchmarks/Misc/himenobmtxpa | 4.07% |
| Benchmarks/Misc/lowercase | 4.06% |
| SPEC/CFP2006/433.milc | 3.99% |
| Benchmarks/tramp3d-v4 | 3.79% |
| Benchmarks/FreeBench/pifft | 3.66% |
| Benchmarks/Ptrdist/ks | 3.21% |
| Benchmarks/Adobe-C++/loop_unroll | 3.12% |
| SPEC/CINT2000/175.vpr | 3.12% |
| Benchmarks/nbench | 2.98% |
| SPEC/CFP2000/183.equake | 2.91% |
| Benchmarks/Misc/perlin | 2.85% |
| Benchmarks/Misc/flops-1 | 2.82% |
| Benchmarks/Misc-C++-EH/spirit | 2.80% |
| Benchmarks/Misc/flops-2 | 2.77% |
| Benchmarks/NPB-serial/is | 2.42% |
| Benchmarks/ASC_Sequoia/CrystalMk | 2.33% |
| Benchmarks/BenchmarkGame/n-body | 2.28% |
| Benchmarks/SciMark2-C/scimark2 | 2.27% |
| Benchmarks/Olden/bh | 2.03% |
| skidmarks10/skidmarks | 1.81% |
| Benchmarks/Misc/flops | 1.72% |
Slowdowns:
| Benchmarks/llubenchmark/llu | -14.14% |
| Benchmarks/Polybench/stencils/seidel-2d | -5.67% |
| Benchmarks/Adobe-C++/functionobjects | -5.25% |
| Benchmarks/Misc-C++/oopack_v1p8 | -5.00% |
| Benchmarks/Shootout/hash | -2.35% |
| Benchmarks/Prolangs-C++/ocean | -2.01% |
| Benchmarks/Polybench/medley/floyd-warshall | -1.98% |
| Polybench/linear-algebra/kernels/3mm | -1.95% |
| Benchmarks/McCat/09-vor/vor | -1.68% |
llvm-svn: 196516	2013-12-05 17:55:58 +00:00
Alp Toker	e845f8af67	Correct word hyphenations This patch tries to avoid unrelated changes other than fixing a few hyphen-related ambiguities and contractions in nearby lines. llvm-svn: 196471	2013-12-05 05:44:44 +00:00
Timur Iskhodzhanov	2340a0ee1c	Reland 196270 "Generalize debug info / EH emission in AsmPrinter" Addressing the existence of AMDGPUAsmPrinter and other subclasses of AsmPrinter llvm-svn: 196288	2013-12-03 15:10:23 +00:00
NAKAMURA Takumi	c0b01ff922	Revert r196270, "Generalize debug info / EH emission in AsmPrinter" It broke CodeGen/R600 tests with +Asserts. llvm-svn: 196272	2013-12-03 13:15:54 +00:00
Timur Iskhodzhanov	8ce8c7a5d7	Generalize debug info / EH emission in AsmPrinter llvm-svn: 196270	2013-12-03 12:05:18 +00:00
Michael Gottesman	576426afb7	Added MachineBlockFrequencyInfo::view for displaying the block frequency propagation graph via graphviz. This is useful for debugging issues in the BlockFrequency implementation since one can easily visualize where probability mass and other errors occur in the propagation. This is the MI version of r194654. llvm-svn: 196183	2013-12-03 00:49:33 +00:00
Rafael Espindola	c36d63a948	Move getSymbolWithGlobalValueBase to TargetLoweringObjectFile. This allows it to be used in TargetLoweringObjectFileImpl.cpp. llvm-svn: 196117	2013-12-02 16:25:47 +00:00
Lang Hames	067c025250	Refactor a lot of patchpoint/stackmap related code to simplify and make it target independent. Most of the x86 specific stackmap/patchpoint handling was necessitated by the use of the native address-mode format for frame index operands. PEI has now been modified to treat stackmap/patchpoint similarly to DEBUG_INFO, allowing us to use a simple, platform independent register/offset pair for frame indexes on stackmap/patchpoints. Notes: - Folding is now platform independent and automatically supported. - Emitting patchpoints with direct memory references now just involves calling the TargetLoweringBase::emitPatchPoint utility method from the target's XXXTargetLowering::EmitInstrWithCustomInserter method. (See X86TargetLowering for an example). - No more ugly platform-specific operand parsers. This patch shouldn't change the generated output for X86. llvm-svn: 195944	2013-11-29 03:07:54 +00:00
Rafael Espindola	7e7db10302	Remove an always true parameter. llvm-svn: 195931	2013-11-28 19:35:07 +00:00
Lang Hames	433095d3fe	Fix a typo where we were creating <def,kill> operands instead of <def,dead> ones. Add an assertion to make sure we catch this in the future. Fixes <rdar://problem/15464559>. llvm-svn: 195401	2013-11-22 00:46:32 +00:00
Tom Stellard	439debedd3	Split SETCC if VSELECT requires splitting too. This patch is a rewrite of the original patch committed in r194542. Instead of relying on the type legalizer to do the splitting for us, we now perform the splitting ourselves in the DAG combiner. This is necessary for the case where the vector mask is a legal type after promotion and still wouldn't require splitting. Patch by: Juergen Ributzka NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195397	2013-11-22 00:39:23 +00:00
Lang Hames	682d6cc95b	Dereference the node iterator when dumping the PBQP graph structure in DOT format. Thanks to Arnaud A. de Grandmaison for the patch! llvm-svn: 195316	2013-11-21 06:30:14 +00:00
Eric Christopher	3d1796838e	Remove capability for polymorphic destruction from LexicalScope and LexicalScopes, we're not using it. llvm-svn: 195182	2013-11-20 00:54:28 +00:00
Eric Christopher	2624bc8ba4	Formatting, 80-col, trailing whitespace. llvm-svn: 195180	2013-11-20 00:54:19 +00:00
Juergen Ributzka	8e480fdae5	[DAG] Refactor vector splitting code in SelectionDAG. No functional change intended. Reviewed by Tom llvm-svn: 195156	2013-11-19 21:20:17 +00:00
Andrew Trick	5b8040c957	Fix patchpoint comments. llvm-svn: 195103	2013-11-19 05:05:43 +00:00
Andrew Trick	15aac659a7	Add an abstraction to handle patchpoint operands. Hard-coded operand indices were scattered throughout lowering stages and layers. It was super bug prone. llvm-svn: 195093	2013-11-19 03:29:56 +00:00
Juergen Ributzka	5357a6d64b	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. The memory leaks in this version have been fixed. Thanks Alexey for pointing them out. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 195064	2013-11-19 00:57:56 +00:00
Alexey Samsonov	3bfef6bdb6	Revert r194865 and r194874. This change is incorrect. If you delete the virtual destructor of both a base class and a subclass, then the following code: Base *foo = new Child(); delete foo; will not run the destructors for the members of the Child class. As a result, I observe plenty of memory leaks. Notable examples I investigated are: ObjectBuffer and ObjectBufferStream, AttributeImpl and StringSAttributeImpl. llvm-svn: 194997	2013-11-18 09:31:53 +00:00
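The problem raised in the revert above can be shown from the safe side with a short sketch: when `Base` keeps a virtual destructor, deleting a `Child` through a `Base *` runs the destructors of `Child`'s members. The `Member` type and the `destroyedMembers` counter are hypothetical and exist only to make the effect observable. Without the `virtual`, the same `delete` has undefined behavior in C++, and in practice the member destructors are typically skipped, which is the leak the commit describes.

```cpp
#include <cassert>

// Counts how many Member objects have been destroyed (illustration only).
int destroyedMembers = 0;

struct Member {
  ~Member() { ++destroyedMembers; }
};

struct Base {
  // The virtual destructor makes `delete` dispatch to the dynamic type.
  virtual ~Base() = default;
};

struct Child : Base {
  Member m; // released only if Child's destructor actually runs
};

// Mirrors the snippet in the commit message.
void deleteThroughBase() {
  Base *foo = new Child();
  delete foo;
}
```

Calling `deleteThroughBase()` once increments `destroyedMembers` by one, confirming that `Child::m` was destroyed.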
Andrew Trick	bd486c29f4	Added a size field to the stack map record to handle subregister spills. Implementing this on bigendian platforms could get strange. I added a target hook, getStackSlotRange, per Jakob's recommendation to make this as explicit as possible. llvm-svn: 194942	2013-11-17 01:36:23 +00:00
Duncan P. N. Exon Smith	c331c75e8e	Fix filename in header comment llvm-svn: 194924	2013-11-16 15:40:54 +00:00
Juergen Ributzka	ee3af15269	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 194865	2013-11-15 22:34:48 +00:00
Bob Wilson	d433cf7463	Avoid illegal integer promotion in fastisel Stop folding constant adds into GEP when the type size doesn't match. Otherwise, the adds' operands are effectively being promoted, changing the conditions of an overflow. Results are different when: sext(a) + sext(b) != sext(a + b) Problem originally found on x86-64, but also fixed issues with ARM and PPC, which used similar code. <rdar://problem/15292280> Patch by Duncan Exon Smith! llvm-svn: 194840	2013-11-15 19:09:27 +00:00
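The inequality `sext(a) + sext(b) != sext(a + b)` from the commit above is easy to reproduce with 8-bit operands. The following is a small sketch, not the FastISel code; `sext8`, `wideAdd`, and `narrowAdd` are hypothetical names chosen for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Sign-extend the low 8 bits of `bits` to 32 bits.
int32_t sext8(uint32_t bits) {
  const uint32_t low = bits & 0xFFu;
  return static_cast<int32_t>(low ^ 0x80u) - 0x80;
}

// sext(a) + sext(b): extend first, then add in the wide type.
int32_t wideAdd(int8_t a, int8_t b) {
  return int32_t{a} + int32_t{b};
}

// sext(a + b): add with 8-bit wraparound, then extend the result.
int32_t narrowAdd(int8_t a, int8_t b) {
  return sext8(static_cast<uint32_t>(a) + static_cast<uint32_t>(b));
}
```

For `a = b = 100`, `wideAdd` yields 200 while `narrowAdd` wraps to -56, so folding the add into a wider address computation changes the result whenever the narrow add overflows. For small operands such as 1 and 2 the two agree.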
Daniel Sanders	0ebbe1d56c	Fix illegal DAG produced by SelectionDAG::getConstant() for v2i64 type Summary: When getConstant() is called for an expanded vector type, it is split into multiple scalar constants which are then combined using appropriate build_vector and bitcast operations. In addition to the usual big/little endian differences, the case where the element-order of the vector does not have the same endianness as the elements themselves is also accounted for. For example, for v4i32 on big-endian MIPS, the byte-order of the vector is <3210,7654,BA98,FEDC>. For little-endian, it is <0123,4567,89AB,CDEF>. Handling this case turns out to be a nop since getConstant() returns a splatted vector (so reversing the element order doesn't change the value) This fixes a number of cases in MIPS MSA where calling getConstant() during operation legalization introduces illegal types (e.g. to legalize v2i64 UNDEF into a v2i64 BUILD_VECTOR of illegal i64 zeros). It should also handle bigger differences between illegal and legal types such as legalizing v2i64 into v8i16. lowerMSASplatImm() in the MIPS backend no longer needs to avoid calling getConstant() so this function has been updated in the same patch. For the sake of transparency, the steps I've taken since the review are: * Added 'virtual' to isVectorEltOrderLittleEndian() as requested. This revealed that the MIPS tests were falsely passing because a polymorphic function was not actually polymorphic in the reviewed patch. * Fixed the tests that were now failing. This involved deleting the code to handle the MIPS MSA element-order (which was previously doing a byte-order swap instead of an element-order swap). This left isVectorEltOrderLittleEndian() unused and it was deleted. * Fixed build failures caused by rebasing beyond r194467-r194472. These build failures involved the bset, bneg, and bclr instructions added in these commits using lowerMSASplatImm() in a way that was no longer valid after this patch. 
Some of these were fixed by calling SelectionDAG::getConstant() instead, others were fixed by a new function getBuildVectorSplat() that provided the removed functionality of lowerMSASplatImm() in a more sensible way. Reviewers: bkramer Reviewed By: bkramer CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1973 llvm-svn: 194811	2013-11-15 12:56:49 +00:00
Matt Arsenault	9921608896	Add addrspacecast instruction. Patch by Michele Scandale! llvm-svn: 194760	2013-11-15 01:34:59 +00:00
Aaron Ballman	7c6e917033	Replacing HUGE_VALF with llvm::huge_valf in order to work around a warning triggered in MSVC 12. Patch reviewed by Reid Kleckner and Jim Grosbach. llvm-svn: 194533	2013-11-13 00:15:44 +00:00
Arnaud A. de Grandmaison	26d846f560	CalcSpillWeights: allow overriding the spill weight normalizing function This will enable the PBQP register allocator to provide its own normalizing function. No functional change. llvm-svn: 194417	2013-11-11 19:56:14 +00:00
Arnaud A. de Grandmaison	8c40e45072	CalcSpillWeights: give a more descriptive name to calculateSpillWeights Besides, this relates it more obviously to the VirtRegAuxInfo::calculateSpillWeightAndHint. No functional change. llvm-svn: 194404	2013-11-11 19:04:45 +00:00
Arnaud A. de Grandmaison	6b862708a7	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was thought that CalcSpillWeights does not need to be a pass. This change makes it possible to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functional change. llvm-svn: 194356	2013-11-10 17:46:31 +00:00
Chandler Carruth	f2e7a23acb	Move the old pass manager infrastructure into a legacy namespace and give the files a legacy prefix in the right directory. Use forwarding headers in the old locations to paper over the name change for most clients during the transitional period. No functionality changed here! This is just clearing some space to reduce renaming churn later on with a new system. Even when the new stuff starts to go in, it is going to be hidden behind a flag and off-by-default as it is still WIP and under development. This patch is specifically designed so that very little out-of-tree code has to change. I'm going to work as hard as I can to keep that the case. Only direct forward declarations of the PassManager class are impacted by this change. llvm-svn: 194324	2013-11-09 12:26:54 +00:00
NAKAMURA Takumi	ba34a4d189	include/llvm/CodeGen/PBQP: Update @param(s) in comments. [-Wdocumentation] llvm-svn: 194314	2013-11-09 03:54:05 +00:00
NAKAMURA Takumi	dc501161f9	Fix whitespace. llvm-svn: 194313	2013-11-09 03:53:55 +00:00
Lang Hames	be91a1d947	Re-apply r194300 with fixes for warnings. llvm-svn: 194311	2013-11-09 03:08:56 +00:00
Nick Lewycky	ebeaea0192	Revert r194300 which broke the build. llvm-svn: 194308	2013-11-09 02:01:25 +00:00
Lang Hames	e3c935f4ab	Rewrite the PBQP graph data structure. The new graph structure replaces the node and edge linked lists with vectors. Free lists (well, free vectors) are used for fast insertion/deletion. The ultimate aim is to make PBQP graphs cheap to clone. The motivation is that the PBQP solver destructively consumes input graphs while computing a solution, forcing the graph to be fully reconstructed for each round of PBQP. This imposes a high cost on large functions, which often require several rounds of solving/spilling to find a final register allocation. If we can cheaply clone the PBQP graph and incrementally update it between rounds then hopefully we can reduce this cost. Further, once we begin pooling matrix/vector values (future work), we can cache some PBQP solver metadata and share it between cloned graphs, allowing the PBQP solver to re-use some of the computation done in earlier rounds. For now this is just a data structure update. The allocator and solver still use the graph the same way as before, fully reconstructing it between each round. I expect no material change from this update, although it may change the iteration order of the nodes, causing ties in the solver to break in different directions, and this could perturb the generated allocations (hopefully in a completely benign way). Thanks very much to Arnaud Allard de Grandmaison for encouraging me to get back to work on this, and for a lot of discussion and many useful PBQP test cases. llvm-svn: 194300	2013-11-09 00:14:07 +00:00
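The data-structure idea in the commit above (vectors in place of linked lists, with a free list of reusable slots) can be sketched as follows. This is a simplified illustration, not the PBQP graph itself: `NodePool` is a hypothetical class, and a single `int` stands in for the cost vectors and edges that the real graph stores.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Node storage backed by a vector, with a free list of vacated slots.
class NodePool {
public:
  using NodeId = std::size_t;

  // Reuse a vacated slot if one exists; otherwise append a new slot.
  NodeId add(int cost) {
    if (!freeIds.empty()) {
      const NodeId id = freeIds.back();
      freeIds.pop_back();
      nodes[id] = cost;
      return id;
    }
    nodes.push_back(cost);
    return nodes.size() - 1;
  }

  // Removal only records the slot as free; no other element moves,
  // so the ids of the remaining nodes stay valid.
  void remove(NodeId id) { freeIds.push_back(id); }

  int cost(NodeId id) const { return nodes[id]; }
  std::size_t slots() const { return nodes.size(); }

private:
  std::vector<int> nodes;
  std::vector<NodeId> freeIds;
};
```

Because all state lives in two vectors, copying a `NodePool` is a pair of contiguous copies, which is the kind of cheap cloning the commit message aims for.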
Juergen Ributzka	f27436b708	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a particular order into a specified set of registers. Instead it is up to the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Arnaud A. de Grandmaison	12a0cc8ff5	Revert "CalculateSpillWeights does not need to be a pass" Temporarily revert my previous commit until I understand why it breaks 3 target tests. llvm-svn: 194272	2013-11-08 18:19:19 +00:00
Arnaud A. de Grandmaison	926576cff3	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was thought that CalcSpillWeights does not need to be a pass. This change makes it possible to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functional change. llvm-svn: 194269	2013-11-08 17:56:29 +00:00
Andrew Trick	75681a41c0	Add support for stack map generation in the X86 backend. Originally implemented by Lang Hames. llvm-svn: 193811	2013-10-31 22:11:56 +00:00
Rafael Espindola	bdb3c4f195	Produce .weak_def_can_be_hidden for some linkonce_odr values With this patch llvm produces a weak_def_can_be_hidden for linkonce_odr if they are also unnamed_addr or don't have their address taken. There is not a lot of documentation about .weak_def_can_be_hidden, but from the old discussion about linkonce_odr_auto_hide and the name of the directive this looks correct: these symbols can be hidden. Testing this with the ld64 in Xcode 5 linking clang reduces the number of exported symbols from 21053 to 19049. llvm-svn: 193718	2013-10-30 22:08:11 +00:00
Josh Magee	4b74743099	Reformat code with clang-format. Differential Revision: http://llvm-reviews.chandlerc.com/D2057 llvm-svn: 193672	2013-10-30 02:25:14 +00:00
NAKAMURA Takumi	579bb406bf	StackProtector.h: Fix trailing comments for doxygen. [-Wdocumentation] s!//<!///<! llvm-svn: 193669	2013-10-30 00:49:39 +00:00
NAKAMURA Takumi	7388ca6af3	Trailing whitespace in a comment line. llvm-svn: 193668	2013-10-30 00:49:33 +00:00
Josh Magee	5a6fad91a3	[stackprotector] Update the StackProtector pass to perform datalayout analysis. This modifies the pass to classify every SSP-triggering AllocaInst according to an SSPLayoutKind (LargeArray, SmallArray, AddrOf). This analysis is collected by the pass and made available for use, but no other pass uses it yet. The next patch will make use of this analysis in PEI and StackSlot passes. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1789 llvm-svn: 193653	2013-10-29 21:16:16 +00:00
Rafael Espindola	68ddc56344	Add a helper getSymbol to AsmPrinter. llvm-svn: 193627	2013-10-29 17:07:16 +00:00
Richard Sandiford	0ea0d286ba	Keep TBAA info when rewriting SelectionDAG loads and stores Most SelectionDAG code drops the TBAA info when creating a new form of a load and store (e.g. during legalization, or when converting a plain load to an extending one). This patch tries to catch all cases where the TBAA information can legitimately be carried over. The patch adds alternative forms of getLoad() and getExtLoad() that take a MachineMemOperand instead of individual fields. (The corresponding getTruncStore() already exists.) The idea is to use the MachineMemOperand forms when all fields are carried over (size, pointer info, isVolatile, isNonTemporal, alignment and TBAA info). If some adjustment is being made, e.g. to narrow the load, then we still pass the individual fields but also pass the TBAA info. llvm-svn: 193517	2013-10-28 11:17:59 +00:00
Tim Northover	ee00055f8f	LegalizeDAG: allow libcalls for max/min atomic operations ARM processors without ldrex/strex need to be able to make libcalls for all atomic operations, including the newer min/max versions. The alternative would probably be expanding these operations in terms of cmpxchg (as x86 does always), but in the configurations where this matters code-size tends to be paramount so the libcall is more desirable. llvm-svn: 193398	2013-10-25 09:30:20 +00:00
Eric Christopher	dd271b9204	Formatting and whitespace. llvm-svn: 193370	2013-10-24 21:04:51 +00:00
Benjamin Kramer	b0d0e9091e	Speling fixes. llvm-svn: 193165	2013-10-22 15:18:03 +00:00
Wan Xiaofei	dc0d564687	Using FoldingSet in SelectionDAG::getVTList. VTList has a long life cycle through the module and getVTList is frequently called. The current getVTList uses a sequential search over a std::vector, which is inefficient in big modules. This patch uses FoldingSet to implement a hashing mechanism for the search. Reviewer: Nadav Rotem Test : Pass unit tests & LNT test suite llvm-svn: 193150	2013-10-22 08:02:02 +00:00
David Majnemer	417ca129be	CodeGen: Emit a libcall if the target doesn't support 16-byte wide atomics There are targets that support i128 sized scalars but cannot emit instructions that modify them directly. The proper thing to do is to emit a libcall. This fixes PR17481. llvm-svn: 192957	2013-10-18 08:03:43 +00:00
Rafael Espindola	c17b7cf2ed	Add support for metadata representing .ident directives. llvm-svn: 192764	2013-10-16 01:49:05 +00:00
Chandler Carruth	ad23439747	Simplify formatting and sort these. No functionality changed. llvm-svn: 192668	2013-10-15 02:03:44 +00:00
Andrew Trick	08ea8ac39c	LiveRegUnits: Use *MBB for consistency and convenience. llvm-svn: 192634	2013-10-14 22:18:59 +00:00
Andrew Trick	eddf0de054	Use a SparseSet in LiveRegUnits. Some clients may add block live ins and may track liveness over a large scope. This guarantees an efficient implementation in all cases with no memory allocation/deallocation, independent of the number of target registers. It could be slightly less convenient but is fine in the expected case. llvm-svn: 192622	2013-10-14 20:45:17 +00:00
Andrew Trick	9098ea07a2	Move LiveRegUnits implementation into .cpp. Comment and format. llvm-svn: 192621	2013-10-14 20:45:14 +00:00
Andrew Trick	2b5bcc832c	Remove extra indentation in LiveRegUnits. llvm-svn: 192620	2013-10-14 20:45:11 +00:00
Andrew Trick	b5845be802	Convert LiveRegUnits methods to the current convention (it's new code). llvm-svn: 192619	2013-10-14 20:45:09 +00:00
Rafael Espindola	0ee3f8f595	Remove the now unused strong phi elimination pass. llvm-svn: 192604	2013-10-14 16:39:04 +00:00
Matthias Braun	47d0426b55	Introduce ad hoc liveness tracking utility: LiveRegUnits Contains a set of live register (units) and code to move forward and backward in the schedule while updating the live set. llvm-svn: 192481	2013-10-11 19:04:35 +00:00
Justin Holewinski	f7d6ae0d5b	Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom comments for implicit defs For NVPTX, this fixes a crash where the emitImplicitDef implementation was expecting physical registers, while NVPTX uses virtual registers (with a couple of exceptions). Now, the implicit def comment will be emitted as a true PTX register name. Other targets can use this to customize the output of implicit def comments. Fixes PR17519 llvm-svn: 192444	2013-10-11 12:39:36 +00:00
Matthias Braun	c7c3105acb	Print register in LiveInterval::print() llvm-svn: 192398	2013-10-10 21:29:05 +00:00
Matthias Braun	cf84f537f1	Represent RegUnit liveness with LiveRange instance Previously LiveInterval has been used, but having a spill weight and register number is unnecessary for a register unit. llvm-svn: 192397	2013-10-10 21:29:02 +00:00
Matthias Braun	ace00a09c8	Work on LiveRange instead of LiveInterval where possible Also change some pointer arguments to references at some places where 0-pointers are not allowed. llvm-svn: 192396	2013-10-10 21:28:57 +00:00
Matthias Braun	0fcc48eb53	Pass LiveQueryResult by value This makes the API a bit more natural to use and makes it easier to make LiveRanges implementation details private. llvm-svn: 192394	2013-10-10 21:28:52 +00:00
Matthias Braun	bcd7da9aa2	Refactor LiveInterval: introduce new LiveRange class LiveRange just manages a list of segments and a list of value numbers now as LiveInterval did previously, but without having details like spill weight or a fixed register number. LiveInterval is now a subclass of LiveRange and simply adds the spill weight and the register number. llvm-svn: 192393	2013-10-10 21:28:47 +00:00
Matthias Braun	63c2445abb	Rename LiveRange to LiveInterval::Segment The Segment struct contains a single interval; multiple instances of this struct are used to construct a live range, but the struct is not a live range by itself. llvm-svn: 192392	2013-10-10 21:28:43 +00:00
Matthias Braun	919868389c	Rename parameter: defined regs are not incoming. llvm-svn: 192391	2013-10-10 21:28:38 +00:00
Hao Liu	d0ab407a23	Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 192361	2013-10-10 17:00:52 +00:00
Rafael Espindola	bb93e39fe2	Revert "Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4)." This reverts commit r192352. It broke the build. llvm-svn: 192354	2013-10-10 15:15:17 +00:00
Hao Liu	0ff11c9c71	Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 192352	2013-10-10 15:01:24 +00:00
Reid Kleckner	0aaac9bca6	Explicitly request unsigned enum types when desired This fixes repeated -Wmicrosoft warnings when self-hosting clang on Windows, and gets us real unsigned enum types with MSVC. llvm-svn: 192227	2013-10-08 20:15:11 +00:00
Jiangning Liu	6d9b4a0e25	Implement aarch64 neon instruction set AdvSIMD (Across). llvm-svn: 192028	2013-10-05 08:22:10 +00:00
Craig Topper	4fe6b1993f	Add OPC_CheckChildSame0-3 to the DAG isel matcher. This replaces sequences of MoveChild, CheckSame, MoveParent. Saves 846 bytes from the X86 DAG isel matcher, ~300 from ARM, ~840 from Hexagon. llvm-svn: 192026	2013-10-05 05:38:16 +00:00
Craig Topper	67bc19ea4d	Revert r191940 to see if it fixes the build bots. llvm-svn: 191941	2013-10-04 05:52:17 +00:00
Craig Topper	fb07bce847	Add OPC_CheckChildSame0-3 to the DAG isel matcher. This replaces sequences of MoveChild, CheckSame, MoveParent. Saves 846 bytes from the X86 DAG isel matcher, ~300 from ARM, ~840 from Hexagon. llvm-svn: 191940	2013-10-04 05:22:20 +00:00
Pete Cooper	9b04f39eb6	Add v4f16 to supported value types. This is useful for some ARM intrinsics such as VCVTN which does a <4 x float> <-> <4 x half> conversion. llvm-svn: 191870	2013-10-03 03:29:21 +00:00
Arnold Schwaighofer	47322176be	IfConverter: Use TargetSchedule for instruction latencies For targets that have instruction itineraries this means no change. Targets that move over to the new schedule model will be able to use the new schedule model for instruction latencies in the if-converter (the logic is such that if there is no itinerary we will use the new sched model for the latencies). Before, we queried "TTI->getInstructionLatency()" for the instruction latency and the extra predication cost. Now, we query the TargetSchedule abstraction for the instruction latency and TargetInstrInfo for the extra predication cost. The TargetSchedule abstraction will internally call "TTI->getInstructionLatency" if an itinerary exists, otherwise it will use the new schedule model. ATTENTION: Out of tree targets! (I will also send out an email later to LLVMDev) This means, if your target implements unsigned getInstrLatency(const InstrItineraryData ItinData, const MachineInstr MI, unsigned PredCost); and returns a value for "PredCost", you now also need to implement unsigned getPredictationCost(const MachineInstr MI); (if your target uses the IfConversion.cpp pass) radar://15077010 llvm-svn: 191671	2013-09-30 15:28:56 +00:00
Benjamin Kramer	348df51da8	Remove an old workaround for a compiler that EOL'd years ago. llvm-svn: 191643	2013-09-29 19:39:02 +00:00
Benjamin Kramer	c03ffd11a9	Allocate AtomicSDNode operands in SelectionDAG's allocator to stop leakage. SDNode destructors are never called. As an optimization use AtomicSDNode's internal storage if we have a small number of operands. llvm-svn: 191636	2013-09-29 11:18:56 +00:00
Robert Wilhelm	198f21deb3	Even more spelling fixes for "instruction". llvm-svn: 191611	2013-09-28 13:42:22 +00:00
Josh Magee	f476a4f2e8	[stackprotector] Refactor the StackProtector pass from a single .cpp file into StackProtector.h and StackProtector.cpp. No functionality change. Future patches will add analysis which will be used in other passes (PEI, StackSlot). The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1521 llvm-svn: 191570	2013-09-27 21:58:43 +00:00
Amara Emerson	80d8b3db1e	[ARM] Use the load-acquire/store-release instructions optimally in AArch32. Patch by Artyom Skrobov. llvm-svn: 191428	2013-09-26 12:22:36 +00:00
Jiangning Liu	5867567c41	Initial support for Neon scalar instructions. Patch by Ana Pazos. 1.Added support for v1ix and v1fx types. 2.Added Scalar Pairwise Reduce instructions. 3.Added initial implementation of Scalar Arithmetic instructions. llvm-svn: 191263	2013-09-24 02:47:27 +00:00
Andrew Trick	439918c874	Allow subtarget selection of the default MachineScheduler and document the interface. The global registry is used to allow command line override of the scheduler selection, but does not work well as the normal selection API. For example, the same LLVM process should be able to target multiple targets or subtargets. llvm-svn: 191071	2013-09-20 05:14:41 +00:00
Adrian Prantl	1f4d59b4cc	simplify expression llvm-svn: 190826	2013-09-17 00:15:33 +00:00
Adrian Prantl	dfcc535bbe	Debug info: Fix PR16736 and rdar://problem/14990587. A DBG_VALUE is register-indirect iff the first operand is a register _and_ the second operand is an immediate. llvm-svn: 190821	2013-09-16 23:29:03 +00:00
Joey Gouly	fccb3bcae3	Add an instruction deprecation feature to TableGen. The 'Deprecated' class allows you to specify a SubtargetFeature that the instruction is deprecated on. The 'ComplexDeprecationPredicate' class allows you to define a custom predicate that is called to check for deprecation. For example: ComplexDeprecationPredicate<"MCR"> would mean you would have to define the following function: bool getMCRDeprecationInfo(MCInst &MI, MCSubtargetInfo &STI, std::string &Info) Which returns 'false' for not deprecated, and 'true' for deprecated and stores the warning message in 'Info'. The MCTargetAsmParser constructor was changed to take an extra argument of the MCInstrInfo class, so out-of-tree targets will need to be changed. llvm-svn: 190598	2013-09-12 10:28:05 +00:00
Jack Carter	5523d7ed5a	white spaces and long lines llvm-svn: 190358	2013-09-09 22:02:08 +00:00
Andrew Trick	0d8da69006	mi-sched: cleanup register pressure update, remove a FIXME. llvm-svn: 190181	2013-09-06 17:32:47 +00:00
Andrew Trick	bf56e1926d	Added MachineSchedPolicy. Allow subtargets to customize the generic scheduling strategy. This is convenient for targets that don't need to add new heuristics by specializing the strategy. llvm-svn: 190176	2013-09-06 17:32:34 +00:00
Matthias Braun	667850a4b2	avoid unnecessary direct access to LiveInterval::ranges llvm-svn: 190170	2013-09-06 16:44:32 +00:00
Matthias Braun	c8c6a4ff0d	remove unused argument from LiveRanges::join() llvm-svn: 190169	2013-09-06 16:44:29 +00:00
Matthias Braun	15fb34e896	fix typo in comment llvm-svn: 190165	2013-09-06 16:19:22 +00:00
Andrew Trick	c785e832b3	mi-sched: Suppress register pressure tracking when the scheduling window is too small. If the instruction window is < NumRegs/2, pressure tracking is not likely to be effective. The scheduler has to process a very large number of tiny blocks. We want this to be fast. llvm-svn: 189991	2013-09-04 21:00:11 +00:00
Andrew Trick	6c4257f01b	mi-sched: bypass heuristic checks when regpressure tracking is disabled. llvm-svn: 189988	2013-09-04 21:00:02 +00:00
Andrew Trick	cf575d4815	Added -misched-regpressure option. Register pressure tracking is half the complexity of the scheduler. It's useful to be able to turn it off for compile time and performance comparisons. llvm-svn: 189987	2013-09-04 20:59:59 +00:00
Matt Arsenault	88b7b3058d	Fix grammar llvm-svn: 189855	2013-09-03 21:05:51 +00:00
Benjamin Kramer	4921fdebc8	Free PressureDiffs instead of leaking. Found by valgrind. llvm-svn: 189725	2013-08-31 20:10:01 +00:00
Andrew Trick	0d5cff0ac7	Use LiveRangeQuery for instruction-level liveness queries. Remove redundant or bug-prone LiveInterval APIs. llvm-svn: 189685	2013-08-30 17:58:49 +00:00
Andrew Trick	eeff73274d	mi-sched: update PressureDiffs on-the-fly for liveness. This removes all expensive pressure tracking logic from the scheduling critical path of node comparison. llvm-svn: 189643	2013-08-30 04:36:57 +00:00
Andrew Trick	b4ac7bbefa	Replace LiveInterval::killedAt with isKilledAtInstr. Return true for LRGs that end at EarlyClobber or Register slots. llvm-svn: 189642	2013-08-30 04:31:01 +00:00
Andrew Trick	447dd8fc9e	mi-sched: improve the generic register pressure comparison. Only compare pressure within the same set. When multiple sets are affected, we prioritize the most constrained set. llvm-svn: 189641	2013-08-30 04:27:29 +00:00
Andrew Trick	3c849ec211	mi-sched: Precompute a PressureDiff for each instruction, adjust for liveness later. Created SUPressureDiffs array to hold the per node PDiff computed during DAG building. Added a getUpwardPressureDelta API that will soon replace the old one. Compute PressureDelta here from the precomputed PressureDiffs. Updating for liveness will come next. llvm-svn: 189640	2013-08-30 03:49:48 +00:00
Andrew Trick	4c02a97896	Comment and revise the cyclic critical path code. This should be much more clear now. It's still disabled pending testing. llvm-svn: 189597	2013-08-29 18:04:49 +00:00
Andrew Trick	7e77ff64a2	Add a convenient PSetIterator for visiting pressure sets affected by a register. llvm-svn: 189121	2013-08-23 17:48:46 +00:00
Andrew Trick	d56a8b8df8	Adds cyclic critical path computation and heuristics, temporarily disabled. Estimate the cyclic critical path within a single block loop. If the acyclic critical path is longer, then the loop will exhaust OOO resources after some number of iterations. If the lag between the acyclic critical path and cyclic critical path is longer than the time it takes to issue those loop iterations, then aggressively schedule for latency. llvm-svn: 189120	2013-08-23 17:48:43 +00:00
Andrew Trick	79efb910b7	MI Sched: record local vreg uses. This will be used to compute the cyclic critical path and to update precomputed per-node pressure differences. In the longer term, it could also be used to speed up LiveInterval update by avoiding visiting all global vreg users. llvm-svn: 189118	2013-08-23 17:48:39 +00:00
Andrew Trick	a44f3105fa	Remove unused field. llvm-svn: 189117	2013-08-23 17:48:36 +00:00
Andrew Trick	2656adf87e	mi-sched: Don't call MBB.size() in initSUnits. The driver already has instr count. This fixes a pathological compile time problem with very large blocks and lots of scheduling boundaries. llvm-svn: 189116	2013-08-23 17:48:33 +00:00
Michael Gottesman	d2ede9d0a0	[stackprotector] Refactor out the end of isInTailCallPosition into the function returnTypeIsEligibleForTailCall. This allows me to use returnTypeIsEligibleForTailCall in the stack protector pass. rdar://13935163 llvm-svn: 188765	2013-08-20 08:36:50 +00:00
Daniel Dunbar	657fb30f68	[typo] An LLVM. llvm-svn: 188589	2013-08-16 23:30:19 +00:00
Bill Wendling	4aae224206	Make a few more things const. llvm-svn: 188484	2013-08-15 20:25:44 +00:00
Joey Gouly	fda6af9c71	Drive-by fix for a doxygen comment in MachineInstr.h. llvm-svn: 188467	2013-08-15 16:02:44 +00:00
Mark Lacey	681f92310a	Auto-compute live intervals on demand. When new virtual registers are created during splitting/spilling, defer creation of the live interval until we need to use the live interval. Along with the recent commits to notify LiveRangeEdit when new virtual registers are created, this makes it possible for functions like TargetInstrInfo::loadRegFromStackSlot() and TargetInstrInfo::storeRegToStackSlot() to create multiple virtual registers as part of the process of generating loads/stores for different register classes, and then have the live intervals for those new registers computed when they are needed. llvm-svn: 188437	2013-08-14 23:50:16 +00:00
Mark Lacey	85f8607a10	Add the MachineInstrSpan class. MachineInstrSpan is initialized with a MachineBasicBlock::iterator, and is intended to track which instructions are inserted before/after that instruction from the time the MachineInstrSpan is created. It provides a begin()/end() interface to walk the range of instructions inserted around the initial instruction (including that initial instruction). It also provides a getInitial() interface to return the initial iterator. llvm-svn: 188436	2013-08-14 23:50:11 +00:00
Mark Lacey	97c418e9a9	Notify LiveRangeEdit of new virtual registers. Add a delegate class to MachineRegisterInfo with a single virtual function, MRI_NoteNewVirtualRegister(). Update LiveRangeEdit to inherit from this delegate class and override the definition of the callback with an implementation that tracks the newly created virtual registers. llvm-svn: 188435	2013-08-14 23:50:09 +00:00
Mark Lacey	6097c89b5a	Track new virtual registers by register number. Track new virtual registers by register number, rather than by the live interval created for them. This is the first step in separating the creation of new virtual registers and new live intervals. Eventually live intervals will be created and populated on demand after the virtual registers have been created and used in instructions. llvm-svn: 188434	2013-08-14 23:50:04 +00:00
Jakob Stoklund Olesen	18768bf820	Remove unnecessary parameter to RenumberValues. Patch by Matthias Braun! llvm-svn: 188393	2013-08-14 17:28:52 +00:00
Jakob Stoklund Olesen	ccf996bba7	Remove unused function. Patch by Matthias Braun! llvm-svn: 188392	2013-08-14 17:28:49 +00:00
Jakob Stoklund Olesen	6df1a2eeae	Remove unused struct/enum Patch by Matthias Braun! llvm-svn: 188388	2013-08-14 17:28:39 +00:00
Jack Carter	15a173ab4f	[Mips][msa] Value types for MSA support. Added v8f16 to ValueTypes.h, ValueTypes.cpp, ValueTypes.td, and CodeGenTarget.cpp Patch by Daniel Sanders llvm-svn: 188326	2013-08-13 22:34:26 +00:00
Carlo Kok	d10cf40934	For COFF only: dwarf debug info output a label reference as a section relative item only when it's one of dw_form strp, sec_offset, ref_addr or op_call_ref instead of going by size. llvm-svn: 188296	2013-08-13 17:45:53 +00:00
Michael Gottesman	4f8bae7094	Add editor C++ filetype declaration no functionality change. llvm-svn: 188205	2013-08-12 21:10:23 +00:00
Michael Gottesman	defd024179	[stackprotector] Add in the stackprotector libcall. We support this libcall on all platforms except for OpenBSD (See lib/Codegen/StackProtector.cpp). llvm-svn: 188193	2013-08-12 18:45:38 +00:00
Hal Finkel	bdc7aa32c1	Add ISD::FROUND for libm round() All libm floating-point rounding functions, except for round(), had their own ISD nodes. Recent PowerPC cores have an instruction for round(), and so here I'm adding ISD::FROUND so that round() can be custom lowered as well. For the most part, this is straightforward. I've added an intrinsic and a matching ISD node just like those for nearbyint() and friends. The SelectionDAG pattern I've named frnd (because ISD::FP_ROUND has already claimed fround). This will be used by the PowerPC backend in a follow-up commit. llvm-svn: 187926	2013-08-07 22:49:12 +00:00
Craig Topper	8ee8f7ec58	Put an llvm_unreachable at the end of getSplatIndex as its loop should never find all undef elements. llvm-svn: 187775	2013-08-06 05:41:22 +00:00
Craig Topper	e1c2f3da6c	Check against >= 0 instead of != -1 in getSplatIndex because it generally compiles to better code and is equivalent for shuffle indices. llvm-svn: 187774	2013-08-06 05:07:37 +00:00
Craig Topper	936da35b0d	Remove trailing whitespace and fix an 80-column violation. No functional change. llvm-svn: 187773	2013-08-06 05:01:21 +00:00
Benjamin Kramer	8730825a6d	Don't leak passes if added outside of the area determined by Started/Stopped flags. llvm-svn: 187722	2013-08-05 11:11:11 +00:00
Bill Wendling	e7b7059f1d	Use function attributes to indicate that we don't want to realign the stack. Function attributes are the future! So just query whether we want to realign the stack directly from the function instead of through a random target options structure. llvm-svn: 187618	2013-08-01 21:42:05 +00:00
Andrew Trick	412a9a4e32	Down-scale slot index distance to save bits. llvm-svn: 187438	2013-07-30 19:59:19 +00:00
Andrew Trick	05b1db7761	whitespace llvm-svn: 187437	2013-07-30 19:59:15 +00:00
Andrew Trick	bdb01cf4bd	MI Sched: Track live-thru registers. When registers must be live throughout the scheduling region, increase the limit for the register class. Once we exceed the original limit, they will be spilled, and there's no point further reducing pressure. This isn't a perfect heuristic but avoids a situation where the scheduler could become trapped by trying to achieve the impossible. llvm-svn: 187436	2013-07-30 19:59:12 +00:00
Elena Demikhovsky	505373db43	Added encoding prefixes for KNL instructions (EVEX). Added 512-bit operands printing. Added instruction formats for KNL instructions. llvm-svn: 187324	2013-07-28 08:28:38 +00:00
Andrew Trick	f3a0221ce0	MI Sched: Register pressure heuristics. Consider which set is being increased or decreased before comparing. llvm-svn: 187110	2013-07-25 07:26:35 +00:00
Bill Wendling	a61c3eb016	Replace the "NoFramePointerElimNonLeaf" target option with a function attribute. There's no need to specify a flag to omit frame pointer elimination on non-leaf nodes...(Honestly, I can't parse that option out.) Use the function attribute stuff instead. llvm-svn: 187093	2013-07-25 00:34:29 +00:00
Bill Wendling	63482ebdd5	Recommit r186217 with testcase fix: Use the function attributes to pass along the stack protector buffer size. Now that we have robust function attributes, don't use a command line option to specify the stack protector buffer size. llvm-svn: 186863	2013-07-22 20:15:21 +00:00
Craig Topper	58fa7a9b4a	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186274	2013-07-14 04:42:23 +00:00
Chandler Carruth	c0b3faf509	Revert commit r186217 -- this is breaking bots: http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RAincremental/builds/4328 Original commit log: Use the function attributes to pass along the stack protector buffer size. llvm-svn: 186234	2013-07-13 01:00:17 +00:00
Bill Wendling	c4d489b5ee	Use the function attributes to pass along the stack protector buffer size. Now that we have robust function attributes, don't use a command line option to specify the stack protector buffer size. llvm-svn: 186217	2013-07-12 22:25:20 +00:00
Craig Topper	74e16da8dc	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186098	2013-07-11 16:22:38 +00:00
Michael Gottesman	ebc9c4bf07	Changed "mode: c++" => "C++" at the suggestion of Nick Lewycky. llvm-svn: 186025	2013-07-10 18:40:49 +00:00
Adrian Prantl	a295f68201	Reapply an improved version of r180816/180817. Change the informal convention of DBG_VALUE machine instructions so that we can express a register-indirect address with an offset of 0. The old convention was that a DBG_VALUE is a register-indirect value if the offset (operand 1) is nonzero. The new convention is that a DBG_VALUE is register-indirect if the first operand is a register and the second operand is an immediate. For plain register values the combination reg, reg is used. MachineInstrBuilder::BuildMI knows how to build the new DBG_VALUES. rdar://problem/13658587 llvm-svn: 185966	2013-07-09 20:28:37 +00:00
Michael Gottesman	fa360444cb	Fixed up the comments in FastISel.h so that they conform to the LLVM style guide. llvm-svn: 185964	2013-07-09 20:08:46 +00:00
Michael Gottesman	23be560da3	Added "mode: c++" to FastISel.h header. llvm-svn: 185963	2013-07-09 20:00:25 +00:00
Jakob Stoklund Olesen	9d2c118af7	Remove dead function. llvm-svn: 185731	2013-07-05 23:04:55 +00:00
Jakob Stoklund Olesen	d428205e4a	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185625	2013-07-04 13:54:20 +00:00
Jakob Stoklund Olesen	26add346da	Simplify landing pad lowering. Stop using the ISD::EXCEPTIONADDR and ISD::EHSELECTION when lowering landing pad arguments. These nodes were previously legalized into CopyFromReg nodes, but that never worked properly because the CopyFromReg nodes weren't guaranteed to be scheduled at the top of the basic block. This meant the exception pointer and selector registers could be clobbered before being copied to a virtual register. This patch copies the two physical registers to virtual registers at the beginning of the basic block, and lowers the landingpad instruction directly to two CopyFromReg nodes reading the virtual registers. This is safe because virtual registers don't get clobbered. A future patch will remove the ISD::EXCEPTIONADDR and ISD::EHSELECTION nodes. llvm-svn: 185617	2013-07-04 04:53:45 +00:00
Jakob Stoklund Olesen	8099b21497	Revert r185595-185596 which broke buildbots. Revert "Simplify landing pad lowering." Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes." llvm-svn: 185600	2013-07-04 00:26:30 +00:00
Jakob Stoklund Olesen	8bc33424b2	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185596	2013-07-03 23:56:31 +00:00
Jakob Stoklund Olesen	62ea9eb287	Simplify landing pad lowering. Stop using the ISD::EXCEPTIONADDR and ISD::EHSELECTION when lowering landing pad arguments. These nodes were previously legalized into CopyFromReg nodes, but that never worked properly because the CopyFromReg nodes weren't guaranteed to be scheduled at the top of the basic block. This meant the exception pointer and selector registers could be clobbered before being copied to a virtual register. This patch copies the two physical registers to virtual registers at the beginning of the basic block, and lowers the landingpad instruction directly to two CopyFromReg nodes reading the virtual registers. This is safe because virtual registers don't get clobbered. A future patch will remove the ISD::EXCEPTIONADDR and ISD::EHSELECTION nodes. llvm-svn: 185595	2013-07-03 23:56:24 +00:00
Jakob Stoklund Olesen	131f26f3e2	Add MachineBasicBlock::addLiveIn(). This function adds a live-in physical register to an MBB and ensures that it is copied to a virtual register immediately. llvm-svn: 185594	2013-07-03 23:56:20 +00:00
Craig Topper	d9a1ef8f1e	Use SmallVectorImpl instead of SmallVector for iterators and references to avoid specifying the vector size unnecessarily. llvm-svn: 185512	2013-07-03 05:11:49 +00:00
Eric Christopher	cb5bf5d646	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185511	2013-07-03 05:01:24 +00:00
Craig Topper	20f64aee06	Return SmallVectorImpl& instead of SmallVector& in a couple places to avoid having to specify the vector size in multiple places. llvm-svn: 185507	2013-07-03 04:30:58 +00:00
Craig Topper	70aa354012	Add a space between type and variable name. No functional change. llvm-svn: 185506	2013-07-03 04:27:31 +00:00
Manman Ren	4245062b12	Debug Info: use module flag to set up Dwarf version. Correctly handles ref_addr depending on the Dwarf version. Emit Dwarf with version from module flag. TODO: turn on/off features depending on the Dwarf version. llvm-svn: 185484	2013-07-02 23:40:10 +00:00
Rafael Espindola	304ef43e7d	Remove address spaces from MC. This is dead code since PIC16 was removed in 2010. The result was an odd mix, where some parts would carefully pass it along and others would assert it was zero (most of the object streamer for example). llvm-svn: 185436	2013-07-02 15:49:13 +00:00
David Blaikie	b019759ec4	PR16493: DebugInfo with TLS on PPC crashing due to invalid relocation Restrict the current TLS support to X86 ELF for now. Test that we don't produce it on PPC & we can flesh that test case out with the right thing once someone implements it. llvm-svn: 185389	2013-07-01 21:45:25 +00:00
David Blaikie	5109456c75	DebugInfo: PR14728: TLS support Based on GCC's output for TLS variables (OP_constNu, x@dtpoff, OP_lo_user), this implements debug info support for TLS in ELF. Verified that this output is correct/sufficient on Linux (using gold - if you're using binutils-ld, you'll need something with the fix for http://sourceware.org/bugzilla/show_bug.cgi?id=15685 in it). Support on non-ELF is sort of "arbitrary" at the moment - if Apple folks want to discuss (or just go ahead & implement) how this should work in MachO, etc, I'm open. llvm-svn: 185203	2013-06-28 20:05:11 +00:00
Chad Rosier	1f622b5f73	Improve diagnostics when getSizeInBits is called on the Other type. llvm-svn: 184760	2013-06-24 17:29:51 +00:00
David Blaikie	2075b3d872	DebugInfo: PR14404: Avoid truncating 64 bit values into 32 bits for ULEB128/SLEB128 generation llvm-svn: 184669	2013-06-23 18:31:11 +00:00
Andrew Trick	54670161c3	Handle more cases in LiveRangeEdit::eliminateDeadDefs. Live intervals for dead physregs may be created during coalescing. We need to update these in the event that their instruction goes away. crash.ll is the unit test that catches it when MI sched is enabled on X86. llvm-svn: 184572	2013-06-21 18:33:20 +00:00
Andrew Trick	fdd89e475c	Refactor LiveRangeEdit::eliminateDeadDefs. I want to add logic to handle more cases. llvm-svn: 184571	2013-06-21 18:33:17 +00:00
Andrew Trick	2436517fbd	MI-Sched: Adjust regpressure limits for reserved regs. llvm-svn: 184564	2013-06-21 18:32:58 +00:00
David Blaikie	c3cec14be6	DebugInfo: PR14763/r183329 correct the location of indirect parameters. We had been papering over a problem with location info for non-trivial types passed by value by emitting their type as references (this caused the debugger to interpret the location information correctly, but broke the type of the function). r183329 corrected the type information but led to the debugger interpreting the pointer parameter as the value - the debug info describing the location needed an extra dereference. Use a new flag in DIVariable to add the extra indirection, either by promoting an existing DW_OP_reg (parameter passed in a register) to DW_OP_breg + 0 or by adding DW_OP_deref to an existing DW_OP_breg + n (parameter passed on the stack). llvm-svn: 184368	2013-06-19 21:55:13 +00:00
Bill Wendling	a9576dc938	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184360	2013-06-19 21:36:55 +00:00
Bill Wendling	4d82ecded8	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184352	2013-06-19 21:07:11 +00:00
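
The truncation issue named in commit 2075b3d872 (PR14404) can be illustrated with a minimal, standalone sketch of ULEB128 encoding. This is not LLVM's code: the function names `encodeULEB128` and `encodeULEB128Truncated` are hypothetical and chosen only for this example. It shows why narrowing a 64-bit value to 32 bits before encoding drops the high bits.

```cpp
#include <cstdint>
#include <vector>

// Illustrative sketch only; not taken from the LLVM sources.
// Encode a 64-bit unsigned value as ULEB128: each byte carries 7 payload
// bits, and the high bit is set on every byte except the last one.
std::vector<uint8_t> encodeULEB128(uint64_t Value) {
  std::vector<uint8_t> Bytes;
  do {
    uint8_t Byte = Value & 0x7f; // low 7 bits of the remaining value
    Value >>= 7;
    if (Value != 0)
      Byte |= 0x80;              // more bytes follow
    Bytes.push_back(Byte);
  } while (Value != 0);
  return Bytes;
}

// Hypothetical buggy variant: the value is narrowed to 32 bits before it is
// encoded, so any bits above bit 31 are silently lost.
std::vector<uint8_t> encodeULEB128Truncated(uint64_t Value) {
  return encodeULEB128(static_cast<uint32_t>(Value));
}
```

For a value such as `1ULL << 32`, the full-width encoder emits five bytes, while the narrowed variant encodes zero as a single byte.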

4567 Commits