llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Devang Patel	5ccc4e884c	Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic. llvm-svn: 124138	2011-01-24 19:24:37 +00:00
Andrew Trick	977497dbf1	Temporarily workaround JM/lencod miscompile (SIGSEGV). rdar://problem/8893967 llvm-svn: 124137	2011-01-24 19:08:15 +00:00
Ted Kremenek	880c19c032	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Andrew Trick	e0bccb5f87	Enable support for precise scheduling of the instruction selection DAG. Disable using "-disable-sched-cycles". For ARM, this enables a framework for modeling the cpu pipeline and counting stalls. It also activates several heuristics to drive scheduling based on the model. Scheduling is inherently imprecise at this stage, and until spilling is improved it may defeat attempts to schedule. However, this framework provides greater control over tuning codegen. Although the flag is not target-specific, it should have very little affect on the default scheduler used by x86. The only two changes that affect x86 are: - scheduling a high-latency operation bumps the current cycle so independent operations can have their latency covered. i.e. two independent 4 cycle operations can produce results in 4 cycles, not 8 cycles. - Two operations with equal register pressure impact and no latency-based stalls on their uses will be prioritized by depth before height (height is irrelevant if no stalls occur in the schedule below this point). llvm-svn: 123971	2011-01-21 06:19:05 +00:00
Andrew Trick	7155e98904	Convert -enable-sched-cycles and -enable-sched-hazard to -disable flags. They are still not enable in this revision. Added TargetInstrInfo::isZeroCost() to fix a fundamental problem with the scheduler's model of operand latency in the selection DAG. Generalized unit tests to work with sched-cycles. llvm-svn: 123969	2011-01-21 05:51:33 +00:00
Eric Christopher	d17b5b7988	My editor's indent went crazy. Fix. llvm-svn: 123909	2011-01-20 08:56:34 +00:00
Eric Christopher	f7579ff174	Expand invalid return values for umulo and smulo. Handle these similarly to add/sub by doing the normal operation and then checking for overflow afterwards. This generally relies on the DAG handling the later invalid operations as well. Fixes the 64-bit part of rdar://8622122 and rdar://8774702. llvm-svn: 123908	2011-01-20 08:54:28 +00:00
Andrew Trick	bf079d8831	Selection DAG scheduler register pressure heuristic fixes. Added a check for already live regs before claiming HighRegPressure. Fixed a few cases of checking the wrong number of successors. Added some tracing until these heuristics are better understood. llvm-svn: 123892	2011-01-20 06:21:59 +00:00
Eric Christopher	58f8058502	Use only one API at a time. llvm-svn: 123866	2011-01-20 01:29:23 +00:00
Eric Christopher	1b0e5debb4	If we can, lower the multiply part of a umulo/smulo call to a libcall with an invalid type then split the result and perform the overflow check normally. Fixes the 32-bit parts of rdar://8622122 and rdar://8774702. llvm-svn: 123864	2011-01-20 00:29:24 +00:00
Jeffrey Yasskin	5f5e1f5ef1	Remove unused variables found by gcc-4.6's -Wunused-but-set-variable. llvm-svn: 123707	2011-01-18 00:51:23 +00:00
Stuart Hastings	f5f8318eb6	Remove checking that prevented overlapping CALLSEQ_START/CALLSEQ_END ranges, add legalizer support for nested calls. Necessary for ARM byval support. Radar 7662569. llvm-svn: 123704	2011-01-18 00:09:27 +00:00
Benjamin Kramer	869dc645f1	Fix an off-by-one error in ctpop combining. llvm-svn: 123664	2011-01-17 18:00:28 +00:00
Benjamin Kramer	e9488ed8eb	Add a DAGCombine to turn (ctpop x) u< 2 into (x & x-1) == 0. This shaves off 4 popcounts from the hacked 186.crafty source. This is enabled even when a native popcount instruction is available. The combined code is one operation longer but it should be faster nevertheless. llvm-svn: 123621	2011-01-17 12:04:57 +00:00
Chris Lattner	c4d1d86d3e	reapply my fix for PR8961 with a tweak to properly handle multi-instruction sequences like calls. Many thanks to Jakob for finding a testcase. llvm-svn: 123559	2011-01-16 02:27:38 +00:00
Benjamin Kramer	2e7ead5bb5	Add an assert so we don't silently miscompile ctpop for bit widths > 128. llvm-svn: 123549	2011-01-15 21:19:37 +00:00
Benjamin Kramer	b48a048de6	Reimplement CTPOP legalization with the "best" algorithm from http://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel In a silly microbenchmark on a 65 nm core2 this is 1.5x faster than the old code in 32 bit mode and about 2x faster in 64 bit mode. It's also a lot shorter, especially when counting 64 bit population on a 32 bit target. I hope this is fast enough to replace Kernighan-style counting loops even when the input is rather sparse. llvm-svn: 123547	2011-01-15 20:30:30 +00:00
Dan Gohman	a4f2631ea9	Delete an assignment to ThisBB which isn't needed, and tidy up some comments. llvm-svn: 123479	2011-01-14 22:26:16 +00:00
Andrew Trick	a0e69757d1	Support for precise scheduling of the instruction selection DAG, disabled in this checkin. Sorry for the large diffs due to refactoring. New functionality is all guarded by EnableSchedCycles. Scheduling the isel DAG is inherently imprecise, but we give it a best effort: - Added MayReduceRegPressure to allow stalled nodes in the queue only if there is a regpressure need. - Added BUHasStall to allow checking for either dependence stalls due to latency or resource stalls due to pipeline hazards. - Added BUCompareLatency to encapsulate and standardize the heuristics for minimizing stall cycles (vs. reducing register pressure). - Modified the bottom-up heuristic (now in BUCompareLatency) to prioritize nodes by their depth rather than height. As long as it doesn't stall, height is irrelevant. Depth represents the critical path to the DAG root. - Added hybrid_ls_rr_sort::isReady to filter stalled nodes before adding them to the available queue. Related Cleanup: most of the register reduction routines do not need to be templates. llvm-svn: 123468	2011-01-14 21:11:41 +00:00
Chris Lattner	a0074ca5fc	Set the insertion point correctly for instructions generated by load folding: they should go before the new instruction not after it. llvm-svn: 123420	2011-01-14 01:33:40 +00:00
Dan Gohman	f4ec824435	Fix r123346 to handle scalar types too. llvm-svn: 123352	2011-01-13 01:06:51 +00:00
Dan Gohman	5bbd766a7b	Apply the patch from PR8958, which allows llc to get slightly further on the associated testcase before aborting. llvm-svn: 123346	2011-01-12 23:56:26 +00:00
Eric Christopher	5a4d64216f	Move ExpandAtomic into the integer expansion routines - it's only used there. llvm-svn: 123202	2011-01-11 00:36:08 +00:00
Dale Johannesen	cd78621861	Fix PR 8916 (qv for analysis), at least the immediate problem. There's an inherent tension in DAGCombine between assuming that things will be put in canonical form, and the Depth mechanism that disables transformations when recursion gets too deep. It would not surprise me if there's a lot of little bugs like this one waiting to be discovered. The mechanism seems fragile and I'd suggest looking at it from a design viewpoint. llvm-svn: 123191	2011-01-10 21:53:07 +00:00
Anton Korobeynikov	cf5967630b	Rename TargetFrameInfo into TargetFrameLowering. Also, put couple of FIXMEs and fixes here and there. llvm-svn: 123170	2011-01-10 12:39:04 +00:00
Jakob Stoklund Olesen	32f1783ca1	Simplify a bunch of isVirtualRegister() and isPhysicalRegister() logic. These functions not longer assert when passed 0, but simply return false instead. No functional change intended. llvm-svn: 123155	2011-01-10 02:58:51 +00:00
Jakob Stoklund Olesen	ed53ab1635	Replace TargetRegisterInfo::printReg with a PrintReg class that also works without a TRI instance. Print virtual registers numbered from 0 instead of the arbitrary FirstVirtualRegister. The first virtual register is printed as %vreg0. TRI::NoRegister is printed as %noreg. llvm-svn: 123107	2011-01-09 03:05:53 +00:00
Jakob Stoklund Olesen	b3820cdc22	Use an IndexedMap for LiveOutRegInfo to hide its dependence on TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 123096	2011-01-08 23:10:50 +00:00
Evan Cheng	aa16fd02ad	Do not model all INLINEASM instructions as having unmodelled side effects. Instead encode llvm IR level property "HasSideEffects" in an operand (shared with IsAlignStack). Added MachineInstrs::hasUnmodeledSideEffects() to check the operand when the instruction is an INLINEASM. This allows memory instructions to be moved around INLINEASM instructions. llvm-svn: 123044	2011-01-07 23:50:32 +00:00
Bob Wilson	22f18a7e94	Add ARM patterns to match EXTRACT_SUBVECTOR nodes. Also fix an off-by-one in SelectionDAGBuilder that was preventing shuffle vectors from being translated to EXTRACT_SUBVECTOR. Patch by Tim Northover. The test changes are needed to keep those spill-q tests from testing aligned spills and restores. If the only aligned stack objects are spill slots, we no longer realign the stack frame. Prior to this patch, an EXTRACT_SUBVECTOR was legalized by loading from the stack, which created an aligned frame index. Now, however, there is nothing except the spill slot in the stack frame, so I added an aligned alloca. llvm-svn: 122995	2011-01-07 04:59:04 +00:00
Bob Wilson	bcbb3375dd	Change EXTRACT_SUBVECTOR to require a constant index. We were never generating any of these nodes with variable indices, and there was one legalizer function asserting on a non-constant index. If we ever have a need to support variable indices, we can add this back again. llvm-svn: 122993	2011-01-07 04:58:56 +00:00
Duncan Sands	06444485ee	Fix the other problem reported in PR8582. Testcase and patch by Nadav Rotem. llvm-svn: 122983	2011-01-06 23:45:22 +00:00
Eric Christopher	16127008fd	Add some fairly duplicated code to let type legalization split illegal typed atomics. This will lower exclusively to libcalls at the moment. llvm-svn: 122979	2011-01-06 22:28:56 +00:00
Evan Cheng	cb39cc2164	Re-implement r122936 with proper target hooks. Now getMaxStoresPerMemcpy etc. takes an option OptSize. If OptSize is true, it would return the inline limit for functions with attribute OptSize. llvm-svn: 122952	2011-01-06 06:52:41 +00:00
Evan Cheng	70711ea54d	Revert r122936. I'll re-implement the change. llvm-svn: 122949	2011-01-06 06:17:53 +00:00
Evan Cheng	d425aa5d2a	r105228 reduced the memcpy / memset inline limit to 4 with -Os to avoid blowing up freebsd bootloader. However, this doesn't make much sense for Darwin, whose -Os is meant to optimize for size only if it doesn't hurt performance. rdar://8821501 llvm-svn: 122936	2011-01-06 01:04:47 +00:00
Evan Cheng	2af40ae781	Avoid zero extend bit test operands to pointer type if all the masks fit in the original type of the switch statement key. rdar://8781238 llvm-svn: 122935	2011-01-06 01:02:44 +00:00
Evan Cheng	bf92316fab	Optimize: r1025 = s/zext r1024, 4 r1026 = extract_subreg r1025, 4 to: r1026 = copy r1024 llvm-svn: 122925	2011-01-05 23:06:49 +00:00
Eric Christopher	651810d717	80-cols. llvm-svn: 122909	2011-01-05 21:45:56 +00:00
Eric Christopher	be2382f9a6	Remove TODO, these appear to be implemented. llvm-svn: 122849	2011-01-04 22:31:50 +00:00
Benjamin Kramer	a58b69aa9d	Try to reuse the value when lowering memset. This allows us to compile: void test(char *s, int a) { __builtin_memset(s, a, 15); } into 1 mul + 3 stores instead of 3 muls + 3 stores. llvm-svn: 122710	2011-01-02 19:57:05 +00:00
Benjamin Kramer	38491f47ce	Lower the i8 extension in memset to a multiply instead of a potentially long series of shifts and ors. We could implement a DAGCombine to turn x * 0x0101 back into logic operations on targets that doesn't support the multiply or it is slow (p4) if someone cares enough. Example code: void test(char *s, int a) { __builtin_memset(s, a, 4); } before: _test: ## @test movzbl 8(%esp), %eax movl %eax, %ecx shll $8, %ecx orl %eax, %ecx movl %ecx, %eax shll $16, %eax orl %ecx, %eax movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret after: _test: ## @test movzbl 8(%esp), %eax imull $16843009, %eax, %eax ## imm = 0x1010101 movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret llvm-svn: 122707	2011-01-02 19:44:58 +00:00
Andrew Trick	dfa31b1cf9	Minor cleanup related to my latest scheduler changes. llvm-svn: 122545	2010-12-24 07:10:19 +00:00
Andrew Trick	c926e98fc7	Fix a few cases where the scheduler is not checking for phys reg copies. The scheduling node may have a NULL DAG node, yuck. llvm-svn: 122544	2010-12-24 06:46:50 +00:00
Andrew Trick	134b2a5907	Various bits of framework needed for precise machine-level selection DAG scheduling during isel. Most new functionality is currently guarded by -enable-sched-cycles and -enable-sched-hazard. Added InstrItineraryData::IssueWidth field, currently derived from ARM itineraries, but could be initialized differently on other targets. Added ScheduleHazardRecognizer::MaxLookAhead to indicate whether it is active, and if so how many cycles of state it holds. Added SchedulingPriorityQueue::HasReadyFilter to allowing gating entry into the scheduler's available queue. ScoreboardHazardRecognizer now accesses the ScheduleDAG in order to get information about it's SUnits, provides RecedeCycle for bottom-up scheduling, correctly computes scoreboard depth, tracks IssueCount, and considers potential stall cycles when checking for hazards. ScheduleDAGRRList now models machine cycles and hazards (under flags). It tracks MinAvailableCycle, drives the hazard recognizer and priority queue's ready filter, manages a new PendingQueue, properly accounts for stall cycles, etc. llvm-svn: 122541	2010-12-24 05:03:26 +00:00
Andrew Trick	53f4556c64	whitespace llvm-svn: 122539	2010-12-24 04:28:06 +00:00
Chris Lattner	b607e7deda	flags -> glue for selectiondag llvm-svn: 122509	2010-12-23 17:24:32 +00:00
Chris Lattner	fb9ff7a4ff	sdisel flag -> glue. llvm-svn: 122507	2010-12-23 17:13:18 +00:00
Andrew Trick	ca2e267ddc	Reorganize ListScheduleBottomUp in preparation for modeling machine cycles and instruction issue. llvm-svn: 122491	2010-12-23 05:42:20 +00:00
Andrew Trick	c046a115d4	Converted LiveRegCycles to LiveRegGens. It's easier to work with and allows multiple nodes per cycle. llvm-svn: 122474	2010-12-23 04:16:14 +00:00
Andrew Trick	e48d5d8395	In CheckForLiveRegDef use TRI->getOverlaps. llvm-svn: 122473	2010-12-23 03:43:21 +00:00
Andrew Trick	cc701bcfdc	Fixes PR8823: add-with-overflow-128.ll In the bottom-up selection DAG scheduling, handle two-address instructions that read/write unspillable registers. Treat the entire chain of two-address nodes as a single live range. llvm-svn: 122472	2010-12-23 03:15:51 +00:00
Jeffrey Yasskin	a199652a3e	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Benjamin Kramer	49942a90b7	DAGCombine add (sext i1), X into sub X, (zext i1) if sext from i1 is illegal. The latter usually compiles into smaller code. example code: unsigned foo(unsigned x, unsigned y) { if (x != 0) y--; return y; } before: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] sbbl %eax, %eax ## encoding: [0x19,0xc0] notl %eax ## encoding: [0xf7,0xd0] addl 8(%esp), %eax ## encoding: [0x03,0x44,0x24,0x08] ret ## encoding: [0xc3] after: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] movl 8(%esp), %eax ## encoding: [0x8b,0x44,0x24,0x08] adcl $-1, %eax ## encoding: [0x83,0xd0,0xff] ret ## encoding: [0xc3] llvm-svn: 122455	2010-12-22 23:17:45 +00:00
Chris Lattner	04ef853e23	Fix a bug in ReduceLoadWidth that wasn't handling extending loads properly. We miscompiled the testcase into: _test: ## @test movl $128, (%rdi) movzbl 1(%rdi), %eax ret Now we get a proper: _test: ## @test movl $128, (%rdi) movsbl (%rdi), %eax movzbl %ah, %eax ret This fixes PR8757. llvm-svn: 122392	2010-12-22 08:02:57 +00:00
Chris Lattner	35fcc63498	more cleanups, move a check for "roundedness" earlier to reject unhanded cases faster and simplify code. llvm-svn: 122391	2010-12-22 08:01:44 +00:00
Chris Lattner	60dcb2b5c2	reduce indentation and improve comments, no functionality change. llvm-svn: 122389	2010-12-22 07:36:50 +00:00
Andrew Trick	afec190a28	In DelayForLiveRegsBottomUp, handle instructions that read and write the same physical register. Simplifies the fix from the previous checkin r122211. llvm-svn: 122370	2010-12-21 22:27:44 +00:00
Andrew Trick	1e3ad9f721	whitespace llvm-svn: 122368	2010-12-21 22:25:04 +00:00
Dale Johannesen	e0fb87c3d7	Reapply 122353-122355 with fixes. 122354 was wrong; the shift type was needed one place, the shift count type another. The transform in 123555 had the same problem. llvm-svn: 122366	2010-12-21 21:55:50 +00:00
Dale Johannesen	972aba543a	Revert 122353-122355 for the moment, they broke stuff. llvm-svn: 122360	2010-12-21 21:22:27 +00:00
Dale Johannesen	39186cfb0b	Add a new transform to DAGCombiner. llvm-svn: 122355	2010-12-21 20:10:51 +00:00
Dale Johannesen	5f3e7b08f6	Get the type of a shift from the shift, not from its shift count operand. These should be the same but apparently are not always, and this is cleaner anyway. This improves the code in an existing test. llvm-svn: 122354	2010-12-21 20:06:19 +00:00
Dale Johannesen	bad19334ee	Shift by the word size is invalid IR; don't create it. llvm-svn: 122353	2010-12-21 20:00:06 +00:00
Chris Lattner	8a3058137a	fix some typos llvm-svn: 122349	2010-12-21 18:05:22 +00:00
Stuart Hastings	fedc21e594	Fix indentation, add comment. llvm-svn: 122345	2010-12-21 17:16:58 +00:00
Stuart Hastings	a1f786efa9	Missing logic for nested CALLSEQ_START/END. llvm-svn: 122342	2010-12-21 17:07:24 +00:00
Chris Lattner	65c5243bd6	rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310	2010-12-21 02:38:05 +00:00
Chris Lattner	b37e697277	improve "cannot yet select" errors a trivial amount: now they are just as useless, but at least a bit more gramatical llvm-svn: 122305	2010-12-21 02:07:03 +00:00
Dale Johannesen	036c3da142	Cosmetic changes. llvm-svn: 122259	2010-12-20 20:10:50 +00:00
Chris Lattner	249e131f39	implement type legalization promotion support for SMULO and UMULO, giving ARM (and other 32-bit-only) targets support for i8 and i16 overflow multiplies. The generated code isn't great, but this at least fixes CodeGen/Generic/overflow.ll when running on ARM hosts. llvm-svn: 122221	2010-12-20 02:05:39 +00:00
Chris Lattner	0f801998bf	Fix a bug in the scheduler's handling of "unspillable" vregs. Imagine we see: EFLAGS = inst1 EFLAGS = inst2 FLAGS gpr = inst3 EFLAGS Previously, we would refuse to schedule inst2 because it clobbers the EFLAGS of the predecessor. However, it also uses the EFLAGS of the predecessor, so it is safe to emit. SDep edges ensure that the right order happens already anyway. This fixes 2 testsuite crashes with the X86 patch I'm going to commit next. llvm-svn: 122211	2010-12-20 00:55:43 +00:00
Chris Lattner	85875bf06b	the result of CheckForLiveRegDef is dead, remove it. llvm-svn: 122209	2010-12-20 00:51:56 +00:00
Chris Lattner	ee7fa0d706	reduce indentation, no functionality change. llvm-svn: 122208	2010-12-20 00:50:16 +00:00
Nick Lewycky	c85935836b	Add missing standard headers. Patch by Joerg Sonnenberger! llvm-svn: 122193	2010-12-19 20:43:38 +00:00
Chris Lattner	92dcd2af36	teach MaskedValueIsZero how to analyze ADDE. This is enough to teach it that ADDE(0,0) is known 0 except the low bit, for example. llvm-svn: 122191	2010-12-19 20:38:28 +00:00
Chris Lattner	ac82ea26da	fix PR8642: if a critical edge has a PHI value that can trap, isel is required to split the edge. PHI values get evaluated on the edge, not in their predecessor block. llvm-svn: 122170	2010-12-19 04:58:57 +00:00
Bob Wilson	c57c2d755b	Fix a DAGCombiner crash when folding binary vector operations with constant BUILD_VECTOR operands where the element type is not legal. I had previously changed this code to insert TRUNCATE operations, but that was just wrong. llvm-svn: 122102	2010-12-17 23:06:49 +00:00
Dale Johannesen	c2c6ebd82a	Add a transform to DAG Combiner. This improves the code for the case where 32-bit divide by constant is turned into 64-bit multiply by constant. 8771012. llvm-svn: 122090	2010-12-17 21:45:49 +00:00
Bob Wilson	e06f6eabe7	Fix crash compiling a QQQQ REG_SEQUENCE for a Neon vld3_lane operation. Radar 8776599 llvm-svn: 122018	2010-12-17 01:21:12 +00:00
Chris Lattner	81815cd4db	take care of some todos, transforming [us]mul_lohi into a wider mul if the wider mul is legal. llvm-svn: 121848	2010-12-15 06:04:19 +00:00
Chris Lattner	746d6a1f60	when transforming a MULHS into a wider MUL, there is no need to SRA the result, the top bits are truncated off anyway, just use SRL. llvm-svn: 121846	2010-12-15 05:51:39 +00:00
Chris Lattner	14810c808b	Add a couple dag combines to transform mulhi/mullo into a wider multiply when the wider type is legal. This allows us to compile: define zeroext i16 @test1(i16 zeroext %x) nounwind { entry: %div = udiv i16 %x, 33 ret i16 %div } into: test1: # @test1 movzwl 4(%esp), %eax imull $63551, %eax, %eax # imm = 0xF83F shrl $21, %eax ret instead of: test1: # @test1 movw $-1985, %ax # imm = 0xFFFFFFFFFFFFF83F mulw 4(%esp) andl $65504, %edx # imm = 0xFFE0 movl %edx, %eax shrl $5, %eax ret Implementing rdar://8760399 and example #4 from: http://blog.regehr.org/archives/320 We should implement the same thing for [su]mul_hilo, but I don't have immediate plans to do this. llvm-svn: 121696	2010-12-13 08:39:01 +00:00
Chris Lattner	6df4d5d88e	reduce indentation by using continue, no functionality change. llvm-svn: 121662	2010-12-13 01:11:17 +00:00
Duncan Sands	47a4bbd31d	Catch attempts to remove a deleted node from the CSE maps. Better to catch this here rather than later after accessing uninitialized memory etc. Fires when compiling the testcase in PR8237. llvm-svn: 121635	2010-12-12 13:22:50 +00:00
Stuart Hastings	f7bba0cfe3	Initial support for nested CALLSEQ_START/CALLSEQ_END constructs in LegalizeDAG. Necessary for byval support on ARM. Radar 7662569. llvm-svn: 121412	2010-12-09 21:25:20 +00:00
Eric Christopher	ebd7ab9857	80-col fixups. llvm-svn: 121356	2010-12-09 04:48:06 +00:00
Eric Christopher	d492f798d1	Reword comment slightly. llvm-svn: 121293	2010-12-08 22:21:42 +00:00
Jay Foad	79e18ed269	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Devang Patel	6fe7fe8dd4	If dbg_declare() or dbg_value() is not lowered by isel then emit DEBUG message instead of creating DBG_VALUE for undefined value in reg0. llvm-svn: 121059	2010-12-06 22:39:26 +00:00
Benjamin Kramer	009451fddc	Remove unneeded zero arrays. llvm-svn: 120910	2010-12-04 15:28:22 +00:00
Jay Foad	789e8ac4f7	PR5207: Rename overloaded APInt methods set(), clear(), flip() to setAllBits(), setBit(unsigned), etc. llvm-svn: 120564	2010-12-01 08:53:58 +00:00
Evan Cheng	f7e586d749	Enable sibling call optimization of libcalls which are expanded during legalization time. Since at legalization time there is no mapping from SDNode back to the corresponding LLVM instruction and the return SDNode is target specific, this requires a target hook to check for eligibility. Only x86 and ARM support this form of sibcall optimization right now. rdar://8707777 llvm-svn: 120501	2010-11-30 23:55:39 +00:00
Chris Lattner	f541629573	add TLI support indicating that jumps are more expensive than logical operations and use this to disable a specific optimization. Patch by Micah Villmow! llvm-svn: 120435	2010-11-30 18:12:52 +00:00
Jay Foad	c663d7e4f6	PR5207: Make APInt::set(), APInt::clear() and APInt::flip() return void. llvm-svn: 120413	2010-11-30 09:02:01 +00:00
Michael J. Spencer	d5ec932c3a	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Bob Wilson	3f245c3b4f	Fix a comment typo. llvm-svn: 120235	2010-11-28 06:51:19 +00:00
Wesley Peck	d589353ad0	Renaming ISD::BIT_CONVERT to ISD::BITCAST to better reflect the LLVM IR concept. llvm-svn: 119990	2010-11-23 03:31:01 +00:00
Benjamin Kramer	632a91cba5	Implement the "if (X == 6 \|\| X == 4)" -> "if ((X\|2) == 6)" optimization. This currently only catches the most basic case, a two-case switch, but can be extended later. llvm-svn: 119964	2010-11-22 09:45:38 +00:00
Benjamin Kramer	c99bab43c9	Silence Release build warnings about unused functions. llvm-svn: 119903	2010-11-20 15:53:24 +00:00
Duncan Sands	028cf0619e	On X86, MEMBARRIER, MFENCE, SFENCE, LFENCE are not target memory intrinsics, so don't claim they are. They are allocated using DAG.getNode, so attempts to access MemSDNode fields results in reading off the end of the allocated memory. This fixes crashes with "llc -debug" due to debug code trying to print MemSDNode fields for these barrier nodes (since the crashes are not deterministic, use valgrind to see this). Add some nasty checking to try to catch this kind of thing in the future. llvm-svn: 119901	2010-11-20 11:25:00 +00:00
Andrew Trick	3166f72d7a	Removing the useless test that I added recently. It was meant as an example, but not complicated enough to merit another test. llvm-svn: 119898	2010-11-20 07:26:51 +00:00
Bill Wendling	55c499847a	Check for _setjmp too, because it's also used. llvm-svn: 119875	2010-11-20 00:03:09 +00:00
Mon P Wang	4965983b22	Make isScalarToVector to return false if the node is a scalar. This will prevent DAGCombine from making an illegal transformation of bitcast of a scalar to a vector into a scalar_to_vector. llvm-svn: 119819	2010-11-19 19:08:12 +00:00
Duncan Sands	9327aedbea	Fix thinko: we must turn select(anyext, sext) into sext(select) not anyext(select). Spotted by Frits van Bommel. llvm-svn: 119739	2010-11-18 21:16:28 +00:00
Duncan Sands	a61bc1a41a	The DAGCombiner was threading select over pairs of extending loads even if the extension types were not the same. The result was that if you fed a select with sext and zext loads, as in the testcase, then it would get turned into a zext (or sext) of the select, which is wrong in the cases when it should have been an sext (resp. zext). Reported and diagnosed by Sebastien Deldon. llvm-svn: 119728	2010-11-18 20:05:18 +00:00
Dale Johannesen	06f479d543	Do not throw away alignment when generating the DAG for memset; we may need it to decide between MOVAPS and MOVUPS later. Adjust a test that was looking for wrong code. PR 3866 / 8675131. llvm-svn: 119605	2010-11-18 01:35:23 +00:00
John Thompson	b33f935bc3	Bug 8621 fix - pointer cast stripped from inline asm constraint argument. llvm-svn: 119590	2010-11-17 23:58:47 +00:00
Dan Gohman	52a761760d	Split pseudo-instruction expansion into a separate pass, to make it easier to debug, and to avoid complications when the CFG changes in the middle of the instruction selection process. llvm-svn: 119382	2010-11-16 21:02:37 +00:00
Andrew Trick	68d533cca4	typo (4th checkin for one fix) llvm-svn: 118913	2010-11-12 18:36:03 +00:00
Andrew Trick	d0c919dcad	Fixes PR8287: SD scheduling time. The fix is a failsafe that prevents catastrophic compilation time in the event of unreasonable LLVM IR. Code quality is a separate issue--someone upstream needs to do a better job of reducing to llvm.memcpy. If the situation can be reproduced with any supported frontend, then it will be a separate bug. llvm-svn: 118904	2010-11-12 17:50:46 +00:00
Chris Lattner	bcdb696f6c	tidy up. llvm-svn: 118896	2010-11-12 17:24:29 +00:00
Dan Gohman	8e986f9c2f	Remove the memmove->memcpy optimization from CodeGen. MemCpyOpt does this. llvm-svn: 118789	2010-11-11 16:24:49 +00:00
Dan Gohman	903935bf3e	Fix DAGCombiner to avoid folding a sext-in-reg or similar through a shl in order to fold it into a load. llvm-svn: 118471	2010-11-09 01:54:35 +00:00
Dale Johannesen	88f85df7f7	Fix an inline asm pasto from 117667; was preventing {i64, i64} from matching i128. llvm-svn: 118465	2010-11-09 01:15:07 +00:00
Duncan Sands	96b03ec2ce	When passing a parameter using the 'byval' mechanism, inline code needs to be used to perform the copy, which may be of lots of memory []. It would be good if the fall-back code generated something reasonable, i.e. did the copy in a loop, rather than vast numbers of loads and stores. Add a note about this. Currently target specific code seems to always kick in so this is more of a theoretical issue rather than a practical one now that X86 has been fixed. [] It's amazing how often people pass mega-byte long arrays by copy... llvm-svn: 118275	2010-11-05 15:20:29 +00:00
Eric Christopher	3f1ac311ff	Just return undef for invalid masks or elts, and since we're doing that, just do it earlier too. llvm-svn: 118195	2010-11-03 20:44:42 +00:00
Duncan Sands	41edf30895	Simplify uses of MVT and EVT. An MVT can be compared directly with a SimpleValueType, while an EVT supports equality and inequality comparisons with SimpleValueType. llvm-svn: 118169	2010-11-03 12:17:33 +00:00
Duncan Sands	f6e5e02c9b	Inside the calling convention logic LocVT is always a simple value type, so there is no point in passing it around using an EVT. Use the simpler MVT everywhere. Rather than trying to propagate this information maximally in all the code that using the calling convention stuff, I chose to do a mainly low impact change instead. llvm-svn: 118167	2010-11-03 11:35:31 +00:00
Eric Christopher	719d9d324b	If we have an undef mask our Elt will be -1 for our access, handle this by using an undef as a pointer. Fixes rdar://8625016 llvm-svn: 118164	2010-11-03 09:36:40 +00:00
Dan Gohman	8071a75d31	Fix DAGCombiner to avoid going into an infinite loop when it encounters (and:i64 (shl:i64 (load:i64), 1), 0xffffffff). This fixes rdar://8606584. llvm-svn: 118143	2010-11-03 01:47:46 +00:00
Evan Cheng	67db408634	Two sets of changes. Sorry they are intermingled. 1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to "optimize for latency". Call instructions don't have the right latency and this is more likely to use introduce spills. 2. Fix if-converter cost function. For ARM, it should use instruction latencies, not # of micro-ops since multi-latency instructions is completely executed even when the predicate is false. Also, some instruction will be "slower" when they are predicated due to the register def becoming implicit input. rdar://8598427 llvm-svn: 118135	2010-11-03 00:45:17 +00:00
Devang Patel	e3575ef54f	If value map does not have register for an argument then try to find frame index before giving up. llvm-svn: 118022	2010-11-02 17:19:03 +00:00
Devang Patel	efd9ac540a	Use frameindex, if available, as a last resort to emit debug info for a parameter. llvm-svn: 118020	2010-11-02 17:01:30 +00:00
Bob Wilson	d67dddb134	Remove DAG combiner patch to fold vector splats. Instcombiner does it now. llvm-svn: 117720	2010-10-29 22:03:02 +00:00
Evan Cheng	392d2cbdcc	Avoiding overly aggressive latency scheduling. If the two nodes share an operand and one of them has a single use that is a live out copy, favor the one that is live out. Otherwise it will be difficult to eliminate the copy if the instruction is a loop induction variable update. e.g. BB: sub r1, r3, #1 str r0, [r2, r3] mov r3, r1 cmp bne BB => BB: str r0, [r2, r3] sub r3, r3, #1 cmp bne BB This fixed the recent 256.bzip2 regression. llvm-svn: 117675	2010-10-29 18:09:28 +00:00
John Thompson	6115a7f1d4	Inline asm multiple alternative constraints development phase 2 - improved basic logic, added initial platform support. llvm-svn: 117667	2010-10-29 17:29:13 +00:00
Bob Wilson	65124cd7c7	Teach the DAG combiner to fold a splat of a splat. Radar 8597790. Also do some minor refactoring to reduce indentation. llvm-svn: 117558	2010-10-28 17:06:14 +00:00
Evan Cheng	bc4588c439	Re-commit 117518 and 117519 now that ARM MC test failures are out of the way. llvm-svn: 117531	2010-10-28 06:47:08 +00:00
Evan Cheng	fdc80a0316	Revert 117518 and 117519 for now. They changed scheduling and cause MC tests to fail. Ugh. llvm-svn: 117520	2010-10-28 02:00:25 +00:00
Evan Cheng	a07cd58978	Fix a major bug in operand latency computation. The use index must be adjusted by the number of defs first for it to match the instruction itinerary. llvm-svn: 117518	2010-10-28 01:46:29 +00:00
Dale Johannesen	e7f07349e4	Use a MemIntrinsicSDNode for ISD::PREFETCH, which touches memory, so a MachineMemOperand is useful (not propagated into the MachineInstr yet). No functional change except for dump output. llvm-svn: 117413	2010-10-26 23:11:10 +00:00
Devang Patel	7796146304	Assign source ordering to nodes created for StoreInst. llvm-svn: 117404	2010-10-26 22:14:52 +00:00
Nick Lewycky	f7dddd658f	For statistics that are only used in functions declared in !NDEBUG, wrap the declarations in !NDEBUG to avoid -Wunused-variable warnings. Patch by Matt Beaumont-Gay! llvm-svn: 117345	2010-10-26 00:51:57 +00:00
Devang Patel	fa145a94d1	Simplify. Do not count use of sdisel for single call instruction. llvm-svn: 117316	2010-10-25 21:31:46 +00:00
Devang Patel	3779ad5d59	Add counters to count basic blocks and machine basic blocks with out of order line number info. Add counters to count how many basic blocks are entirely selected by fastisel. llvm-svn: 117310	2010-10-25 20:55:43 +00:00
Chandler Carruth	7dd652736f	Move the remaining attribute macros to systematic names based on the attribute name and prefixed with 'LLVM_'. llvm-svn: 117203	2010-10-23 08:40:19 +00:00
Michael J. Spencer	08602f9803	X86: Base _fltused on the FunctionType of the called value instead of the potentially null "CalledFunction". Thanks Duncan! This is needed for indirect calls. llvm-svn: 117061	2010-10-21 20:49:23 +00:00
Michael J. Spencer	b9cffadc06	CodeGen-Windows: Only emit _fltused if a VarArg function is called with floating point args. This should be the minimum set of functions that could possibly need it. llvm-svn: 116978	2010-10-21 00:08:21 +00:00
Dale Johannesen	58fe3193a6	Remove Synthesizable from the Type system; as MMX vector types are no longer Legal on X86, we don't need it. No functional change. 8499854. llvm-svn: 116947	2010-10-20 21:32:10 +00:00
Dan Gohman	c781a28a1d	Make CodeGen TBAA-aware. llvm-svn: 116890	2010-10-20 00:31:05 +00:00
Jim Grosbach	a8c0be5343	Add a pre-dispatch SjLj EH hook on the unwind edge for targets to do any setup they require. Use this for ARM/Darwin to rematerialize the base pointer from the frame pointer when required. rdar://8564268 llvm-svn: 116879	2010-10-19 23:27:08 +00:00
Owen Anderson	46990c17f7	Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize the pass's dependencies. Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h before parsing commandline arguments. I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass registration/creation, please send the testcase to me directly. llvm-svn: 116820	2010-10-19 17:21:58 +00:00
Michael J. Spencer	e57b670425	X86-Windows: Emit an undefined global __fltused symbol when targeting Windows if any floating point arguments are passed to an external function. llvm-svn: 116665	2010-10-16 08:25:41 +00:00
Michael J. Spencer	16ad2c129c	Whitespace! llvm-svn: 116664	2010-10-16 08:25:21 +00:00
Chris Lattner	b63869a25b	fix the default va_arg expansion (in the realignment case) to not implicitly truncate the stack pointer to 32-bits on a 64-bit machine. llvm-svn: 116169	2010-10-10 18:36:26 +00:00
Dan Gohman	57f707c6a7	ComputeLinearIndex doesn't need its TLI argument. llvm-svn: 115792	2010-10-06 16:18:29 +00:00
Evan Cheng	6fbb6dea7c	- Add TargetInstrInfo::getOperandLatency() to compute operand latencies. This allow target to correctly compute latency for cases where static scheduling itineraries isn't sufficient. e.g. variable_ops instructions such as ARM::ldm. This also allows target without scheduling itineraries to compute operand latencies. e.g. X86 can return (approximated) latencies for high latency instructions such as division. - Compute operand latencies for those defined by load multiple instructions, e.g. ldm and those used by store multiple instructions, e.g. stm. llvm-svn: 115755	2010-10-06 06:27:31 +00:00
Owen Anderson	20b48697cd	Use a more efficient lowering of uint64_t --> float that can take advantage of hardware signed integer conversion without having to do a double cast (uint64_t --> double --> float). This is based on the algorithm from compiler_rt's __floatundisf for X86-64. llvm-svn: 115634	2010-10-05 17:24:05 +00:00
Evan Cheng	7218d81721	This DAG combine BRCOND transformation can look pass truncate of the operand: // %a = ... // %b = and i32 %a, 2 // %c = srl i32 %b, 1 // brcond i32 %c ... // // into // // %a = ... // %b = and i32 %a, 2 // %c = setcc eq %b, 0 // brcond %c ... Make sure it restores local variable N1, which corresponds to the condition operand if it fails to match. This apparently breaks TCE but since that backend isn't in the tree I don't have a test for it. llvm-svn: 115571	2010-10-04 22:41:01 +00:00
Devang Patel	1f2abb3af0	Fix code gen crash reported in PR 8235. We still lose debug info for the unused argument here. This is a known limitation recorded debuginfo-tests/trunk/dbg-declare2.ll function 'f6' test case. llvm-svn: 115323	2010-10-01 19:00:44 +00:00
Gabor Greif	7d9b24e1ef	typo llvm-svn: 115310	2010-10-01 10:32:19 +00:00
Chris Lattner	97f71e71c9	fix typo llvm-svn: 115300	2010-10-01 06:54:02 +00:00
Chris Lattner	01c6e93ea4	fix rdar://8494845 + PR8244 - a miscompile exposed by my patch in r101350 llvm-svn: 115294	2010-10-01 05:36:09 +00:00
Dale Johannesen	c14a1eda84	Massive rewrite of MMX: The x86_mmx type is used for MMX intrinsics, parameters and return values where these use MMX registers, and is also supported in load, store, and bitcast. Only the above operations generate MMX instructions, and optimizations do not operate on or produce MMX intrinsics. MMX-sized vectors <2 x i32> etc. are lowered to XMM or split into smaller pieces. Optimizations may occur on these forms and the result casted back to x86_mmx, provided the result feeds into a previous existing x86_mmx operation. The point of all this is prevent optimizations from introducing MMX operations, which is unsafe due to the EMMS problem. llvm-svn: 115243	2010-09-30 23:57:10 +00:00
Jakob Stoklund Olesen	8f4d623f9a	When isel is emitting instructions for an x86 target without CMOV, the CFG is edited during emission. If the basic block ends in a switch that gets lowered to a jump table, any phis at the default edge were getting updated wrong. The jump table data structure keeps a pointer to the header blocks that wasn't getting updated after the MBB is split. This bug was exposed on 32-bit Linux when disabling critical edge splitting in codegen prepare. The fix is to uipdate stale MBB pointers whenever a block is split during emission. llvm-svn: 115191	2010-09-30 19:44:31 +00:00
Evan Cheng	b44d480808	Model Cortex-a9 load to SUB, RSB, ADD, ADC, SBC, RSC, CMN, MVN, or CMP pipeline forwarding path. llvm-svn: 115098	2010-09-29 22:42:35 +00:00
Oscar Fuentes	eb27a44982	Removed a bunch of unnecessary target_link_libraries. llvm-svn: 114999	2010-09-28 22:39:14 +00:00
Dale Johannesen	6807dd822b	Don't try to make a vector of x86mmx; this won't work, and asserts. llvm-svn: 114843	2010-09-27 17:29:14 +00:00
John Thompson	83fab695f4	Fix for test/CodeGen/PowerPC/2008-10-17-AsmMatchingOperands.ll crash. llvm-svn: 114767	2010-09-24 22:24:05 +00:00
Michael J. Spencer	87fea6690f	Get rid of pop_macro warnings on MSVC. llvm-svn: 114750	2010-09-24 19:48:47 +00:00
Evan Cheng	d24bf49867	Revert 114634 for now since buildbot claim it broke Clang self-hosting. I doubt it but it's possible it's exposing another bug somewhere. llvm-svn: 114681	2010-09-23 18:32:19 +00:00
Oscar Fuentes	b58181793d	Fix VS 2010 build. Patch by Nathan Jeffords! llvm-svn: 114661	2010-09-23 16:59:36 +00:00
Evan Cheng	8266c28b96	Follow up to r114630. Do not optimize away unconditional branch following a conditional one. llvm-svn: 114634	2010-09-23 07:18:35 +00:00
Evan Cheng	3866797d9a	SDISel should not optimize a unconditional branch following a conditional branch when the unconditional branch destination is the fallthrough block. The canonicalization makes it easier to allow optimizations on DAGs to invert conditional branches. The branch folding pass (and AnalyzeBranch) will clean up the unnecessary unconditional branches later. This is one of the patches leading up to disabling codegen prepare critical edge splitting. llvm-svn: 114630	2010-09-23 06:51:55 +00:00
Owen Anderson	7d6373ea9d	A select between a constant and zero, when fed by a bit test, can be efficiently lowered using a series of shifts. Fixes <rdar://problem/8285015>. llvm-svn: 114599	2010-09-22 22:58:22 +00:00
John Thompson	5c363e0ebf	Fixed pr20314-2.c failure, added E, F, p constraint letters. llvm-svn: 114490	2010-09-21 22:04:54 +00:00
Chris Lattner	6543dacfac	Rework passing parent pointers into complexpatterns, I forgot that complex patterns are matched after the entire pattern has a structural match, therefore the NodeStack isn't in a useful state when the actual call to the matcher happens. llvm-svn: 114489	2010-09-21 22:00:25 +00:00
Devang Patel	a6b9c79c24	If only user of a vreg is an copy instruction to export copy of vreg out of current basic block then insert DBG_VALUE so that debug value of the variable is also transfered to new vreg. Testcase is in r114476. This fixes radar 8412415. llvm-svn: 114478	2010-09-21 20:56:33 +00:00
Chris Lattner	7798a97041	correct this logic. llvm-svn: 114474	2010-09-21 20:46:40 +00:00
Owen Anderson	f6dd8e7f5c	Reimplement r114460 in target-independent DAGCombine rather than target-dependent, by using the predicate to discover the number of sign bits. Enhance X86's target lowering to provide a useful response to this query. llvm-svn: 114473	2010-09-21 20:42:50 +00:00
Chris Lattner	a911c9ed3a	just like they can opt into getting the root of the pattern being matched, allow ComplexPatterns to opt into getting the parent node of the operand being matched. llvm-svn: 114472	2010-09-21 20:37:12 +00:00
Chris Lattner	32ec32b690	finish pushing MachinePointerInfo through selectiondags. At this point, I think I've audited all uses, so it should be dependable for address spaces, and the pointer+offset info should also be accurate when there. llvm-svn: 114464	2010-09-21 18:58:22 +00:00
Chris Lattner	8cdc5e75f7	update a bunch of code to use the MachinePointerInfo version of getStore. llvm-svn: 114461	2010-09-21 18:41:36 +00:00
Bob Wilson	c4345abcc0	Define the TargetLowering::getTgtMemIntrinsic hook for ARM so that NEON load and store intrinsics are represented with MemIntrinsicSDNodes. llvm-svn: 114454	2010-09-21 17:56:22 +00:00
Chris Lattner	86b3f287ce	eliminate an old SelectionDAG::getTruncStore method, propagating MachinePointerInfo around more. llvm-svn: 114452	2010-09-21 17:42:31 +00:00
Chris Lattner	bf98f86fed	eliminate last SelectionDAG::getLoad old entrypoint, on to stores. llvm-svn: 114450	2010-09-21 17:28:52 +00:00
Chris Lattner	8af4fb7aed	fix the code that infers SV info to be correct when dealing with an indexed load/store that has an offset in the index. llvm-svn: 114449	2010-09-21 17:24:05 +00:00
Chris Lattner	cdfd993df0	propagate MachinePointerInfo through various uses of the old SelectionDAG::getExtLoad overload, and eliminate it. llvm-svn: 114446	2010-09-21 17:04:51 +00:00
Chris Lattner	0d430648ae	continue MachinePointerInfo'izing, eliminating use of one of the old getLoad overloads. llvm-svn: 114443	2010-09-21 16:36:31 +00:00
Chris Lattner	5c0bdf4543	convert dagcombine off the old form of getLoad. This fixes several bugs with SVOffset computation. llvm-svn: 114442	2010-09-21 16:08:50 +00:00
Chris Lattner	637762e6d1	simplify DAGCombiner::SimplifySelectOps step #2/2. llvm-svn: 114437	2010-09-21 15:58:55 +00:00
Chris Lattner	e3616071c5	substantially reduce indentation and simplify DAGCombiner::SimplifySelectOps. no functionality change (step #1) llvm-svn: 114436	2010-09-21 15:46:59 +00:00
Chris Lattner	b68bd70665	a few more trivial updates. This fixes PerformInsertVectorEltInMemory to not pass a completely incorrect SrcValue, which would result in a miscompile with combiner-aa. llvm-svn: 114411	2010-09-21 07:32:19 +00:00
Chris Lattner	f94de5bf46	reimplement memcpy/memmove/memset lowering to use MachinePointerInfo instead of srcvalue/offset pairs. This corrects SV info for mem operations whose size is > 32-bits. llvm-svn: 114401	2010-09-21 05:40:29 +00:00
Chris Lattner	dbe51ad1b8	add overloads for SelectionDAG::getLoad, getStore, getTruncStore that take a MachinePointerInfo. Among other virtues, this doesn't silently truncate the svoffset to 32-bits. llvm-svn: 114399	2010-09-21 05:10:45 +00:00
Chris Lattner	e1fc671030	simplify interface to SelectionDAG::getMemIntrinsicNode, making it take a MachinePointerInfo llvm-svn: 114397	2010-09-21 04:57:15 +00:00
Chris Lattner	e4db4cad3b	chagne interface to SelectionDAG::getAtomic to take a MachinePointerInfo, eliminating some weird "infer a frame address" logic which was dead. llvm-svn: 114396	2010-09-21 04:53:42 +00:00
Chris Lattner	5de5fada20	don't implicitly drop the offset of a machinememoperand when legalizing atomics. llvm-svn: 114395	2010-09-21 04:51:11 +00:00
Chris Lattner	af01f8d142	force clients of MachineFunction::getMachineMemOperand to provide a MachinePointerInfo, propagating the type out a level of API. Remove the old MachineFunction::getMachineMemOperand impl. llvm-svn: 114393	2010-09-21 04:46:39 +00:00
Owen Anderson	fc94b337eb	When TCO is turned on, it is possible to end up with aliasing FrameIndex's. Therefore, CombinerAA cannot assume that different FrameIndex's never alias, but can instead use MachineFrameInfo to get the actual offsets of these slots and check for actual aliasing. This fixes CodeGen/X86/2010-02-19-TailCallRetAddrBug.ll and CodeGen/X86/tailcallstack64.ll when CombinerAA is enabled, modulo a different register allocation sequence. llvm-svn: 114348	2010-09-20 20:39:59 +00:00
Owen Anderson	9594834ec1	Revert r114312 while I sort out some issues. llvm-svn: 114313	2010-09-19 21:01:26 +00:00
Owen Anderson	65f23dcf7e	Tentatively enabled DAGCombiner Alias Analysis by default. As far as I know, r114268 fixed the last of the blockers to enabling it. I will be monitoring for failures. llvm-svn: 114312	2010-09-19 19:51:55 +00:00
Owen Anderson	015641f659	Invert the logic of reachesChainWithoutSideEffects(). What we want to check is that there is NO path to the destination containing side effects, not that SOME path contains no side effects. In practice, this only manifests with CombinerAA enabled, because otherwise the chain has little to no branching, so "any" is effectively equivalent to "all". llvm-svn: 114268	2010-09-18 04:45:14 +00:00
Devang Patel	e15f83013d	Check bb to ensure that alloca is in separate basic block. This fixes funcargs.exp regression reported by gdb testsuite. llvm-svn: 113992	2010-09-15 18:13:55 +00:00
Devang Patel	fbc9fca6ac	If dbg.declare from non-entry block is using alloca from entry block then use offset available in StaticAllocaMap to emit DBG_VALUE. Right now, this has no material impact because varible info also collected using offset table maintained in machine module info. llvm-svn: 113967	2010-09-15 14:48:53 +00:00
Devang Patel	632686ac96	Use frame index, if available for byval argument while lowering dbg_declare. Otherwise let getRegForValue() find register for this argument. llvm-svn: 113843	2010-09-14 20:29:31 +00:00
Michael J. Spencer	90f807fda5	Revert "CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally." This reverts commit r113632 Conflicts: cmake/modules/AddLLVM.cmake llvm-svn: 113819	2010-09-13 23:59:48 +00:00
Eric Christopher	02df36a2fb	Silence more warnings. Two more unused variables. llvm-svn: 113771	2010-09-13 18:30:57 +00:00
John Thompson	ae3a86d6de	Added skeleton for inline asm multiple alternative constraint support. llvm-svn: 113766	2010-09-13 18:15:37 +00:00

... 2 3 4 5 6 ...

4901 Commits