llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Andrew Trick	dd4a20e7d7	Fix for -sched-high-latency-cycles in sched=list-ilp mode. llvm-svn: 127071	2011-03-05 09:18:16 +00:00
Cameron Zwarich	7de156da4a	Fix PR9398 - 10% of llc compile time is spent in Value::getNumUses. This reduces the percentage of time spent in CodeGenPrepare when llcing 403.gcc from 12.6% to 1.8% of total llc time. llvm-svn: 127069	2011-03-05 08:12:26 +00:00
Andrew Trick	d267891a21	Missing comment. llvm-svn: 127068	2011-03-05 08:04:11 +00:00
Andrew Trick	7db197d209	Increased the register pressure limit on x86_64 from 8 to 12 regs. This is the only change in this checkin that may affects the default scheduler. With better register tracking and heuristics, it doesn't make sense to artificially lower the register limit so much. Added -sched-high-latency-cycles and X86InstrInfo::isHighLatencyDef to give the scheduler a way to account for div and sqrt on targets that don't have an itinerary. It is currently defaults to 10 (the actual number doesn't matter much), but only takes effect on non-default schedulers: list-hybrid and list-ilp. Added several heuristics that can be individually disabled for the non-default sched=list-ilp mode. This helps us determine how much better we can do on a given benchmark than the default scheduler. Certain compute intensive loops run much faster in this mode with the right set of heuristics, and it doesn't seem to have much negative impact elsewhere. Not all of the heuristics are needed, but we still need to experiment to decide which should be disabled by default for sched=list-ilp. llvm-svn: 127067	2011-03-05 08:00:22 +00:00
Andrew Trick	5023de34d3	whitespace llvm-svn: 127065	2011-03-05 06:31:54 +00:00
Nick Lewycky	a2cb87f86d	Thread comparisons over udiv/sdiv/ashr/lshr exact and lshr nuw/nsw whenever possible. This goes into instcombine and instsimplify because instsimplify doesn't need to check hasOneUse since it returns (almost exclusively) constants. This fixes PR9343 #4 #5 and #8! llvm-svn: 127064	2011-03-05 05:19:11 +00:00
Nick Lewycky	b2557b7cf1	Try once again to optimize "icmp (srem X, Y), Y" by turning the comparison into true/false or "icmp slt/sge Y, 0". llvm-svn: 127063	2011-03-05 04:28:48 +00:00
Jakob Stoklund Olesen	52420a0a1e	Rework the global split cost calculation. The global cost is the sum of block frequencies for spill code that must be inserted because preferences weren't met. llvm-svn: 127062	2011-03-05 03:28:51 +00:00
Jakob Stoklund Olesen	79944a508a	Compute the constraints for global live range splitting from an interference pattern. This simplifies the code and makes it faster too. The interference patterns are saved for each candidate register. It will be reused for actually executing the split. Work in progress. llvm-svn: 127054	2011-03-05 01:10:31 +00:00
Jim Grosbach	372394916e	Teach the register scavenger to take subregs into account when finding a free register. llvm-svn: 127049	2011-03-05 00:20:19 +00:00
Eric Christopher	d779c4197b	Support unregistering exception frames of functions when they are removed. Patch by Johannes Schaub! Fixes PR8548 llvm-svn: 127047	2011-03-04 23:37:39 +00:00
Eric Christopher	94626fc599	Improve readability with some whitespace! llvm-svn: 127043	2011-03-04 22:47:12 +00:00
Jakob Stoklund Olesen	87b598dd85	Extract a method. No functional change. llvm-svn: 127040	2011-03-04 22:11:11 +00:00
Bill Wendling	73b20a325f	Initialize variable. llvm-svn: 127038	2011-03-04 21:38:47 +00:00
Jakob Stoklund Olesen	e3b9a00584	Go back to comparing spill weights when deciding if interference can be evicted. It gives better results. Sometimes, a live range can be large and still have high spill weight. Such a range should not be spilled. llvm-svn: 127036	2011-03-04 21:32:50 +00:00
Bruno Cardoso Lopes	52c64a2eb3	Improve div/rem node handling on mips. Patch by Akira Hatanaka llvm-svn: 127034	2011-03-04 21:03:24 +00:00
Bruno Cardoso Lopes	e697a8bcb0	Expands register/immediate pairs when the immediate is too large to fit in 16-bit field. Patch by Akira Hatanaka llvm-svn: 127032	2011-03-04 20:48:08 +00:00
Dan Gohman	a8389213a0	When decling to reuse existing expressions that involve casts, ignore bitcasts, which are really no-ops here. This fixes slowdowns on MultiSource/Applications/aha and others. llvm-svn: 127031	2011-03-04 20:46:46 +00:00
Bruno Cardoso Lopes	0545e7fe48	Rewrite and simplify o32 vaarg passing, no functional changes. Patch by Sasa Stankovic llvm-svn: 127029	2011-03-04 20:27:44 +00:00
Joerg Sonnenberger	5f2f5fa638	Be nice to Xcore and the XMOS assembler and avoid quoting section names that contain only letters, digits and the characters "_" and ".". llvm-svn: 127028	2011-03-04 20:03:14 +00:00
Bruno Cardoso Lopes	99619e5bef	Lowers block address. Currently asserts when relocation model is not PIC. Patch by Akira Hatanaka llvm-svn: 127027	2011-03-04 20:01:52 +00:00
Benjamin Kramer	49043825b4	raw_ostream: while it is generally desirable to do larger writes, it can lead to inefficient file system buffering if the writes are not a multiple of the desired buffer size. Avoid this by limiting the large write to a multiple of the buffer size and copying the remainder into the buffer. Thanks to Dan for pointing this out. llvm-svn: 127026	2011-03-04 19:49:30 +00:00
Jakob Stoklund Olesen	8b66caf12a	Renumber slot indexes locally when possible. Initially, slot indexes are quad-spaced. There is room for inserting up to 3 new instructions between the original instructions. When we run out of indexes between two instructions, renumber locally using double-spaced indexes. The original quad-spacing means that we catch up quickly, and we only have to renumber a handful of instructions to get a monotonic sequence. This is much faster than renumbering the whole function as we did before. llvm-svn: 127023	2011-03-04 19:43:38 +00:00
Nick Lewycky	75a3dd996d	Revert broken srem logic from r126991. llvm-svn: 127021	2011-03-04 19:26:08 +00:00
Bruno Cardoso Lopes	5347fd6512	Fix an old copy-n-paste llvm-svn: 127020	2011-03-04 19:20:24 +00:00
Devang Patel	23ee9fdba3	Disable ARMGlobalMerge on darwin. The debugger is not yet able to extract individual variable's info from merged global. llvm-svn: 127019	2011-03-04 19:11:05 +00:00
Bruno Cardoso Lopes	9390dc6271	Expands FCOS and FSIN nodes when type is f64. llvm-svn: 127017	2011-03-04 18:54:14 +00:00
Jakob Stoklund Olesen	d99d8cfdf0	Number SlotIndexes uniformly without looking at the number of defs on each instruction. You can't really predict how many indexes will be needed from the number of defs, so let's keep it simple. Also remove an extra empty index that was inserted after each basic block. It was intended for live-out ranges, but it was never used that way. llvm-svn: 127014	2011-03-04 18:51:09 +00:00
Benjamin Kramer	0376064f8f	raw_ostream: If writing a string that is larger than the buffer, write it directly instead of doing many buffer-sized writes. This caps the number of write(2) calls per string to a maximum of 2. llvm-svn: 127010	2011-03-04 18:18:16 +00:00
Jakob Stoklund Olesen	dfc9b0b15f	Add SlotIndex statistics. llvm-svn: 127007	2011-03-04 18:08:29 +00:00
Jakob Stoklund Olesen	c3193f4aa8	Tweak debug output. No functional changes. llvm-svn: 127006	2011-03-04 18:08:26 +00:00
Bruno Cardoso Lopes	a49a596acf	Fixes addc pattern when immediate cannot be represented with 16-bit. Patch by Akira Hatanaka llvm-svn: 127005	2011-03-04 17:59:18 +00:00
Bruno Cardoso Lopes	5400401372	Remove (hopefully) all trailing whitespaces from the mips backend. Patch by Hatanaka, Akira llvm-svn: 127003	2011-03-04 17:51:39 +00:00
Duncan Sands	f6c9de94f8	Revert commit 126684 "Use the correct shift amount type". It is only the correct type after type legalization has completed. Before then it may simply not be big enough to hold the shift amount, particularly on x86 which uses a very small type for shifts (this issue broke stuff in the past which is why LegalizeTypes carefully uses a large type for shift amounts). llvm-svn: 127000	2011-03-04 14:28:59 +00:00
Kalle Raiskila	6e33c92ffb	Allow vector shifts (shl,lshr,ashr) on SPU. There was a previous implementation with patterns that would have matched e.g. shl <v4i32> <i32>, but this is not valid LLVM IR so they never were selected. llvm-svn: 126998	2011-03-04 13:19:18 +00:00
Kalle Raiskila	72cfda1a29	Allow load from constant on SPU. A 'load <4 x i32>* null' crashes llc before this fix. llvm-svn: 126995	2011-03-04 12:00:11 +00:00
Nick Lewycky	3bc3a84ba8	Fold "icmp pred (srem X, Y), Y" like we do for urem. Handle signed comparisons in the urem case, though not the other way around. This is enough to get #3 from PR9343! llvm-svn: 126991	2011-03-04 10:06:52 +00:00
Nick Lewycky	a5f309e983	Teach instruction simplify to use constant ranges to solve problems of the form "icmp pred %X, CI" and a number of examples where "%X = binop %Y, CI2". Some of these cases (div and rem) used to make it through opt -O2, but the others are probably now making code elsewhere redundant (probably instcombine). llvm-svn: 126988	2011-03-04 07:00:57 +00:00
Jakob Stoklund Olesen	c7bf5eaafb	DenseMap<uintptr_t,...> doesn't allow all values as keys. Avoid colliding with the sentinels, hopefully unbreaking llvm-gcc-x86_64-linux-selfhost. llvm-svn: 126982	2011-03-04 02:48:56 +00:00
Andrew Trick	1478921dc0	Minor pre-RA-sched fixes and cleanup. Fix the PendingQueue, then disable it because it's not required for the current schedulers' heuristics. Fix the logic for the unused list-ilp scheduler. llvm-svn: 126981	2011-03-04 02:03:45 +00:00
Devang Patel	9dbcb14cd8	Add ArrayRef variant. llvm-svn: 126978	2011-03-04 01:20:33 +00:00
Jakob Stoklund Olesen	3c64ea5ad2	Precompute block frequencies, pow() isn't free. llvm-svn: 126975	2011-03-04 00:58:40 +00:00
Jakob Stoklund Olesen	5bc7a96cca	Use an IndexedMap instead of a DenseMap for the live-out cache. This speeds up updateSSA() so it only accounts for 5% of the live range splitting time. llvm-svn: 126972	2011-03-04 00:15:36 +00:00
Eli Friedman	be07c34368	PR9377: Handle x86 str with register operand in a way consistent with gas. llvm-svn: 126970	2011-03-04 00:10:17 +00:00
Bill Wendling	6e6a0422eb	There are times when the landing pad won't have a call to 'eh.selector' in it. It's been assumed up til now that it would be in its immediate successor. However, this isn't necessarily the case. It could be in one of its successor's successors. Modify the code to more thoroughly check for an 'eh.selector' call in successors. It only looks at a successor if we get there as a result of an unconditional branch. Testcase ObjC/exceptions-4.m in r126968. llvm-svn: 126969	2011-03-03 23:14:05 +00:00
Bob Wilson	c48ba54186	PR8053: Fix encoding of S bit in some ARM instructions. Patch by Zonr Chang! llvm-svn: 126967	2011-03-03 23:07:15 +00:00
Eli Friedman	26f5c96de3	Revert r123908; the code in question is completely untested and wrong. llvm-svn: 126964	2011-03-03 22:33:23 +00:00
Joerg Sonnenberger	bb93506f95	Bug#9033: For the ELF assembler output, always quote the section name. llvm-svn: 126963	2011-03-03 22:31:08 +00:00
Devang Patel	a7b5a00033	Fix typo. llvm-svn: 126962	2011-03-03 21:49:41 +00:00
Devang Patel	3c2bd93f9e	Fix thinko in previous check-in. Add comment. llvm-svn: 126959	2011-03-03 20:08:10 +00:00
Devang Patel	4adbb350be	llvm::Function argument count is not a good indicator of how many arugments does the function have at source level. If we need more space, just resize vector conservatively. This vector is only used once per function. llvm-svn: 126957	2011-03-03 20:02:02 +00:00
Jim Grosbach	e4ef37b298	Allow a target to choose whether to prefer the scavenger emergency spill slot be next to the frame pointer or the stack pointer. llvm-svn: 126956	2011-03-03 20:01:52 +00:00
Jan Sjödin	3c1fa9a68b	Split MCEELFStreamer and ELFObjectWriter into .h and .cpp files, so that other components can use them. llvm-svn: 126942	2011-03-03 14:52:12 +00:00
Richard Osborne	2186cd4e47	Fix typo in comment. llvm-svn: 126941	2011-03-03 14:21:22 +00:00
Richard Osborne	88d0d840f2	Optimize fprintf -> iprintf if there are no floating point arguments and siprintf is available on the target. llvm-svn: 126940	2011-03-03 14:20:22 +00:00
Justin Holewinski	396fdaee6d	PTX: Fix Emacs renaming a symbol llvm-svn: 126938	2011-03-03 14:09:40 +00:00
Richard Osborne	021b589253	Optimize sprintf -> siprintf if there are no floating point arguments and siprintf is available on the target. llvm-svn: 126937	2011-03-03 14:09:28 +00:00
Justin Holewinski	86e749c620	PTX: Fix a couple of lint violations llvm-svn: 126936	2011-03-03 13:34:29 +00:00
Richard Osborne	df829ddcb7	Optimize printf -> iprintf if there are no floating point arguments and iprintf is available on the target. Currently iprintf is only marked as being available on the XCore. llvm-svn: 126935	2011-03-03 13:17:51 +00:00
Tilmann Scheller	ab6450ec63	Use X86_thiscall calling convention for Win64 as well. llvm-svn: 126934	2011-03-03 07:49:07 +00:00
Eli Friedman	f7eb4cce13	PR9352: Always emit a relocation for weak symbols. Not emitting relocations for calls to weak symbols with a definition has the appearance of working with LLVM-generated code because weak symbol definitions are put in their own sections. llvm-svn: 126933	2011-03-03 07:24:36 +00:00
Bob Wilson	72ccdfe148	Add a readme entry for the redundant movw issue for pr9370. llvm-svn: 126930	2011-03-03 06:39:09 +00:00
Jakob Stoklund Olesen	3511ad1023	Renumber slot indexes uniformly instead of spacing according to the number of defs. There are probably much larger speedups to be had by renumbering locally instead of looping over the whole function. For now, the greedy register allocator is 25% faster. llvm-svn: 126926	2011-03-03 06:29:01 +00:00
Jakob Stoklund Olesen	d0930e03c1	Represent sentinel slot indexes with a null pointer. This is much faster than using a pointer to a ManagedStatic object accessed with a function call. The greedy register allocator is 5% faster overall just from the SlotIndex default constructor savings. llvm-svn: 126925	2011-03-03 05:40:04 +00:00
Jakob Stoklund Olesen	2e6407e9aa	Avoid comparing invalid slot indexes, and assert that it doesn't happen. The SlotIndex created by the default construction does not represent a position in the function, and it doesn't make sense to compare it to other indexes. llvm-svn: 126924	2011-03-03 05:18:19 +00:00
Jakob Stoklund Olesen	92d724fb69	Avoid comparing invalid slot indexes. llvm-svn: 126922	2011-03-03 04:23:52 +00:00
Jakob Stoklund Olesen	2759a169ea	Cache basic block bounds instead of asking SlotIndexes::getMBBRange all the time. This speeds up the greedy register allocator by 15%. DenseMap is not as fast as one might hope. llvm-svn: 126921	2011-03-03 03:41:29 +00:00
Bob Wilson	42f80596ca	pr9367: Add missing predicated BLX instructions. Patch by Jyun-Yan You, with some minor adjustments and a testcase from me. llvm-svn: 126915	2011-03-03 01:41:01 +00:00
Jakob Stoklund Olesen	09a1939164	Change the SplitEditor interface to a single instance can be shared for multiple splits. llvm-svn: 126912	2011-03-03 01:29:13 +00:00
Jakob Stoklund Olesen	df789f852e	Only run the updateSSA loop when we have actually seen multiple values. When only a single value has been seen, new PHIDefs are never needed. llvm-svn: 126911	2011-03-03 01:29:10 +00:00
Jakob Stoklund Olesen	31949ff3f8	Fix PHI handling in LiveIntervals::shrinkToUses(). We need to wait until we meet a PHIDef in its defining block before resurrecting PHIKills in the predecessors. This should unbreak the llvm-gcc-build-x86_64-darwin10-x-mingw32-x-armeabi bot. llvm-svn: 126905	2011-03-03 00:20:51 +00:00
Bob Wilson	42e5d02b10	Avoid exponential blow-up when printing DAGs. David Greene changed CannotYetSelect() to print the full DAG including multiple copies of operands reached through different paths in the DAG. Unfortunately this blows up exponentially in some cases. The depth limit of 100 is way too high to prevent this -- I'm seeing a message string of 150MB with a depth of only 40 in one particularly bad case, even though the DAG has less than 200 nodes. Part of the problem is that the printing code is following chain operands, so if you fail to select an operation with a chain, the printer will follow all the chained operations back to the entry node. llvm-svn: 126899	2011-03-02 23:38:06 +00:00
Jakob Stoklund Olesen	c5dc2c61fc	Turn the Edit member into a pointer so it can change dynamically. No functional change. llvm-svn: 126898	2011-03-02 23:31:50 +00:00
Kevin Enderby	58cc960338	Fixes an assertion failure while disassembling ARM rsbs reg/reg form. Patch by Ted Kremenek! llvm-svn: 126895	2011-03-02 23:08:33 +00:00
Jakob Stoklund Olesen	9697ef837c	Transfer simply defined values directly without recomputing liveness and SSA. Values that map to a single new value in a new interval after splitting don't need new PHIDefs, and if the parent value was never rematerialized the live range will be the same. llvm-svn: 126894	2011-03-02 23:05:19 +00:00
Jakob Stoklund Olesen	22469169cd	Extract a method. No functional change. llvm-svn: 126893	2011-03-02 23:05:16 +00:00
Renato Golin	967b93c6e3	Fixing a bug when printing fpu text to object file. Patch by Mans Rullgard. llvm-svn: 126882	2011-03-02 21:20:09 +00:00
Duncan Sands	a7c3ebafe4	Remove DIFactory. Patch by Devang. llvm-svn: 126871	2011-03-02 20:30:37 +00:00
Stuart Hastings	a40e563e4e	Can't introduce floating-point immediate constants after legalization. Radar 9056407. llvm-svn: 126864	2011-03-02 19:36:30 +00:00
Tilmann Scheller	c557d1eeb4	Add Win64 thiscall calling convention. llvm-svn: 126862	2011-03-02 19:29:22 +00:00
David Greene	2fd6d03bc9	[AVX] Fix mask predicates for 256-bit UNPCKLPS/D and implement missing patterns for them. Add a SIMD test subdirectory to hold tests for SIMD instruction selection correctness and quality. ' llvm-svn: 126845	2011-03-02 17:23:43 +00:00
Che-Liang Chiou	8ab0f86f1b	ptx: fix lint and compiler warnings llvm-svn: 126838	2011-03-02 07:58:46 +00:00
Che-Liang Chiou	3529b49230	Add 64-bit addressing to PTX backend - Add '64bit' sub-target option. - Select 32-bit/64-bit loads/stores based on '64bit' option. - Fix function parameter order. Patch by Justin Holewinski llvm-svn: 126837	2011-03-02 07:36:48 +00:00
Rafael Espindola	7f301833ee	Add a special streamer to libLTO that just records symbols definitions and uses. The result produced by the streamer is used to give the linker more accurate information and to add to llvm.compiler.used. The second improvement removes the need for the user to add __attribute__((used)) to functions only used in inline asm. The first one lets us build firefox with LTO on Darwin :-) llvm-svn: 126830	2011-03-02 04:14:42 +00:00
Cameron Zwarich	bb1d033824	Fix some typos. llvm-svn: 126829	2011-03-02 04:03:46 +00:00
Cameron Zwarich	bd98c1011d	Remove some more unused code that I missed. llvm-svn: 126826	2011-03-02 03:48:29 +00:00
Cameron Zwarich	6a4612ba06	Eliminate the unused CodeGenPrepare option to split critical edges. llvm-svn: 126825	2011-03-02 03:31:46 +00:00
Che-Liang Chiou	2e7bb6da4c	Extend initial support for primitive types in PTX backend - Allow i16, i32, i64, float, and double types, using the native .u16, .u32, .u64, .f32, and .f64 PTX types. - Allow loading/storing of all primitive types. - Allow primitive types to be passed as parameters. - Allow selection of PTX Version and Shader Model as sub-target attributes. - Merge integer/floating-point test cases for load/store. - Use .u32 instead of .s32 to conform to output from NVidia nvcc compiler. Patch by Justin Holewinski llvm-svn: 126824	2011-03-02 03:20:28 +00:00
Jakob Stoklund Olesen	6751d65fd6	Move extendRange() into SplitEditor and delete the LiveRangeMap class. Extract the updateSSA() method from the too long extendRange(). LiveOutCache can be shared among all the new intervals since there is at most one of the new ranges live out from each basic block. llvm-svn: 126818	2011-03-02 01:59:34 +00:00
Nick Lewycky	6ad594dd20	Quiet a compiler warning about unused variable 'ExtVNI'. llvm-svn: 126815	2011-03-02 01:43:30 +00:00
Dan Gohman	0823ebc79b	Don't re-use existing addrec expansions if they contain casts. This fixes PR9259. llvm-svn: 126812	2011-03-02 01:34:10 +00:00
Evan Cheng	5275ba7f98	Catch more cases where 2-address pass should 3-addressify instructions. rdar://9002648. llvm-svn: 126811	2011-03-02 01:08:17 +00:00
Jakob Stoklund Olesen	a6fd3b66b0	Rename mapValue to extendRange because that is its function now. Simplify the signature - The return value and ParentVNI are no longer needed. llvm-svn: 126809	2011-03-02 00:49:28 +00:00
Jakob Stoklund Olesen	200a74580c	Simplify LiveIntervals::shrinkToUses() a bit by using the new extendInBlock(). llvm-svn: 126806	2011-03-02 00:33:03 +00:00
Jakob Stoklund Olesen	be281bb02c	Fix typo. llvm-svn: 126805	2011-03-02 00:33:01 +00:00
Jakob Stoklund Olesen	4d3c996555	Move LiveIntervalMap::extendTo into LiveInterval itself. This method could probably be used by LiveIntervalAnalysis::shrinkToUses, and now it can use extendIntervalEndTo() which coalesces ranges. llvm-svn: 126803	2011-03-02 00:06:15 +00:00
Jakob Stoklund Olesen	4f3da0eab4	Delete dead code. llvm-svn: 126801	2011-03-01 23:24:19 +00:00
Jakob Stoklund Olesen	15d9bf0424	Move the value map from LiveIntervalMap to SplitEditor. The value map is currently not used, all values are 'complex mapped' and LiveIntervalMap::mapValue is used to dig them out. This is the first step in a series changes leading to the removal of LiveIntervalMap. Its data structures can be shared among all the live intervals created by a split, so it is wasteful to create a copy for each. llvm-svn: 126800	2011-03-01 23:14:53 +00:00
Jakob Stoklund Olesen	13b552560b	Delete dead code. Local live range splitting is better driven by interference. This code was just guessing. llvm-svn: 126799	2011-03-01 23:14:50 +00:00
Jakob Stoklund Olesen	36e4edd585	Drop RAGreedy::trySpillInterferences(). This is a waste of time since we already know how to evict all interferences which is a better approach anyway. llvm-svn: 126798	2011-03-01 23:14:48 +00:00
Oscar Fuentes	15a668f50b	Fixes warnings emitted by Visual Studio 2010 compiler. Patch by Erik Olofsson! llvm-svn: 126796	2011-03-01 23:11:57 +00:00
Devang Patel	8233c58f5d	If argument numbering is encoded in metadata then emit arguments' debug info in that order. llvm-svn: 126794	2011-03-01 22:58:55 +00:00
Devang Patel	5c7b3c4228	Today, the language front ends produces llvm.dbg.* intrinsics, used to encode arguments' debug info, in order any way, most of the times. However, if a front end mix-n-matches llvm.dbg.declare and llvm.dbg.value intrinsics to encode debug info for arguments then code generator needs a way to find argument order. Use 8 bits from line number field to keep track of argument ordering while encoding debug info for an argument. That leaves 24 bit for line no, DebugLoc also allocates 24 bit for line numbers. If a function has more than 255 arguments then rest of the arguments will be ordered by llvm.dbg.* intrinsics' ordering in IR. llvm-svn: 126793	2011-03-01 22:58:13 +00:00
Cameron Zwarich	0287e8ad22	Stop computing the number of uses twice per value in CodeGenPrepare's sinking of addressing code. On 403.gcc this almost halves CodeGenPrepare time and reduces total llc time by 9.5%. Unfortunately, getNumUses() is still the hottest function in llc. llvm-svn: 126782	2011-03-01 21:13:53 +00:00
Jakob Stoklund Olesen	ef6bec1ff5	Keep track of which stage produced a live range, and bypass earlier stages when revisiting. This effectively disables the 'turbo' functionality of the greedy register allocator where all new live ranges created by splitting would be reconsidered as if they were originals. There are two reasons for doing this, 1. It guarantees that the algorithm terminates. Early versions were prone to infinite looping in certain corner cases. 2. It is a 2x speedup. We can skip a lot of unnecessary interference checks that won't lead to good splitting anyway. The problem is that region splitting only gets one shot, so it should probably be changed to target multiple physical registers at once. Local live range splitting is still 'turbo' enabled. It only accounts for a small fraction of compile time, so it is probably not necessary to do anything about that. llvm-svn: 126781	2011-03-01 21:10:07 +00:00
Duncan Sands	859a335e92	Add datalayout information for the IEEE quad precision fp128 type. llvm-svn: 126780	2011-03-01 20:56:50 +00:00
Dan Gohman	8626e387c3	Revert PathV2 changes, as sys::fs::unique_file is not finished yet. llvm-svn: 126773	2011-03-01 19:50:55 +00:00
Talin	1d7928c794	Added missing va_end(). llvm-svn: 126759	2011-03-01 18:00:49 +00:00
Duncan Sands	b0cd2d9a1e	Add a few missed unary cases when legalizing vector results. Put some cases in alphabetical order. llvm-svn: 126745	2011-03-01 15:15:43 +00:00
Anders Carlsson	1eb388e6c3	Make InstCombiner::FoldAndOfICmps create a ConstantRange that's the intersection of the LHS and RHS ConstantRanges and return "false" when the range is empty. This simplifies some code and catches some extra cases. llvm-svn: 126744	2011-03-01 15:05:01 +00:00
Nick Lewycky	e9d448e997	Optimize "icmp pred (urem X, Y), Y" --> true/false depending on pred. There's more work to do here, "icmp ult (urem X, 10), 11" doesn't optimize away yet. Fixes example 3 from PR9343! llvm-svn: 126741	2011-03-01 08:15:50 +00:00
Jim Grosbach	3b72823981	trailing whitespace. llvm-svn: 126733	2011-03-01 01:39:05 +00:00
Jim Grosbach	fbdcd70f4b	Generalize the register matching code in DAGISel a bit. llvm-svn: 126731	2011-03-01 01:37:19 +00:00
Bill Wendling	304dda7810	Narrow right shifts need to encode their immediates differently from a normal shift. 16-bit: imm6<5:3> = '001', 8 - <imm> is encded in imm6<2:0> 32-bit: imm6<5:4> = '01',16 - <imm> is encded in imm6<3:0> 64-bit: imm6<5> = '1', 32 - <imm> is encded in imm6<4:0> llvm-svn: 126723	2011-03-01 01:00:59 +00:00
Eli Friedman	c00dc3f262	Add an obvious missing safety check to DAE::RemoveDeadArgumentsFromCallers. llvm-svn: 126720	2011-03-01 00:33:47 +00:00
Chris Lattner	871d62dc5b	add a note llvm-svn: 126719	2011-03-01 00:24:51 +00:00
Ted Kremenek	8c2e50117d	Unbreak CMake build. llvm-svn: 126717	2011-03-01 00:02:51 +00:00
Ted Kremenek	7b6189c848	Unbreak CMake build. llvm-svn: 126715	2011-02-28 23:56:33 +00:00
Talin	753b7ce425	Add an END_WITH_NULL accessor for ConstantStruct. llvm-svn: 126714	2011-02-28 23:53:27 +00:00
Chris Lattner	77d6f9b20e	update cmake llvm-svn: 126694	2011-02-28 22:45:25 +00:00
Renato Golin	986151bc09	Fix .fpu printing in ARM assembly, regarding bug http://llvm.org/bugs/show_bug.cgi?id=8931 llvm-svn: 126689	2011-02-28 22:04:27 +00:00
Kevin Enderby	da76779962	Add missing whitespace in the formatting. llvm-svn: 126687	2011-02-28 21:45:12 +00:00
Jan Sjödin	6bc6d6ab39	Make all static functions become static class methods. Move shared (duplicated) functions to new MCELF class. llvm-svn: 126686	2011-02-28 21:45:04 +00:00
Owen Anderson	22d34c260e	Use the correct shift amount type. llvm-svn: 126684	2011-02-28 21:10:10 +00:00
Owen Anderson	1614a324d9	Clean whitespace. llvm-svn: 126683	2011-02-28 20:57:56 +00:00
Chris Lattner	355d573721	fix a signed comparison warning. llvm-svn: 126682	2011-02-28 20:50:35 +00:00
Dan Gohman	91de965338	Delete the GEPSplitter experiment. llvm-svn: 126671	2011-02-28 19:47:47 +00:00
Dan Gohman	db646bfdad	Delete the SimplifyHalfPowrLibCalls pass, which was unused, and only existed as the result of a misunderstanding. llvm-svn: 126669	2011-02-28 19:41:14 +00:00
Dan Gohman	1ee6941aca	Delete the LiveValues pass. I won't get get back to the project it was started for in the foreseeable future. llvm-svn: 126668	2011-02-28 19:37:59 +00:00
David Greene	3bc73b0ae9	[AVX] Add decode support for VUNPCKLPS/D instructions, both 128-bit and 256-bit forms. Because the number of elements in a vector does not determine the vector type (4 elements could be v4f32 or v4f64), pass the full type of the vector to decode routines. llvm-svn: 126664	2011-02-28 19:06:56 +00:00
Kevin Enderby	a1c2ea4ba0	Fix the arm's disassembler for blx that was building an MCInst without the needed two predicate operands before the imm operand. llvm-svn: 126662	2011-02-28 18:46:31 +00:00
Evan Cheng	4e6d375744	Fix a typo which cause dag combine crash. rdar://9059537. llvm-svn: 126661	2011-02-28 18:45:27 +00:00
Stuart Hastings	539d4e1460	Support for byval parameters on ARM. Will be enabled by a forthcoming patch to the front-end. Radar 7662569. llvm-svn: 126655	2011-02-28 17:17:53 +00:00
Kalle Raiskila	cc5b703c81	Add branch hinting for SPU. The implemented algorithm is overly simplistic (just speculate all branches are taken)- this is work in progress. llvm-svn: 126651	2011-02-28 14:08:24 +00:00
Frits van Bommel	4a2b658705	Teach SimplifyCFG that (switch (select cond, X, Y)) is better expressed as a branch. Based on a patch by Alistair Lynn. llvm-svn: 126647	2011-02-28 09:44:07 +00:00
Nick Lewycky	fe02856d37	Fix comment. llvm-svn: 126645	2011-02-28 09:18:11 +00:00
Nick Lewycky	dcc97b5f44	srem doesn't actually have the same resulting sign as its numerator, you could also have a zero when numerator = denominator. Reverts parts of r126635 and r126637. llvm-svn: 126644	2011-02-28 09:17:39 +00:00
Nick Lewycky	28f01da48e	Teach InstCombine to fold "(shr exact X, Y) == 0" --> X == 0, fixing #1 from PR9343. llvm-svn: 126643	2011-02-28 08:31:40 +00:00
Nick Lewycky	dd5df7ef0a	Teach value tracking to make use of flags in more situations. llvm-svn: 126642	2011-02-28 08:02:21 +00:00
Nick Lewycky	aa03b76c53	Teach ValueTracking to look at the dividend when determining the sign bit of an srem instruction. llvm-svn: 126637	2011-02-28 06:52:12 +00:00
Che-Liang Chiou	4026d01040	Add preliminary support for .f32 in the PTX backend. - Add appropriate TableGen patterns for fadd, fsub, fmul. - Add .f32 as the PTX type for the LLVM float type. - Allow parameters, return values, and global variable declarations to accept the float type. - Add appropriate test cases. Patch by Justin Holewinski llvm-svn: 126636	2011-02-28 06:34:09 +00:00
Nick Lewycky	e0f44d0aba	The sign of an srem instruction is the sign of its dividend (the first argument), regardless of the divisor. Teach instcombine about this and fix test7 in PR9343! llvm-svn: 126635	2011-02-28 06:20:05 +00:00
Benjamin Kramer	0bdf517525	Silence enum conversion warnings. llvm-svn: 126578	2011-02-27 18:13:53 +00:00
Duncan Sands	b6f7dcb996	Legalize support for fpextend of vector. PR9309. llvm-svn: 126574	2011-02-27 14:41:27 +00:00
NAKAMURA Takumi	b35d45a714	Target/X86: Always emit "push/pop GPRs" in prologue/epilogue and emit "spill/reload frames" for XMMs. It improves Win64's prologue/epilogue but it would not affect ia32 and amd64 (lack of nonvolatile XMMs). llvm-svn: 126568	2011-02-27 08:47:19 +00:00
Nadav Rotem	961627db07	Fix typos in the comments. llvm-svn: 126565	2011-02-27 07:40:43 +00:00
Tobias Grosser	a9cc8bda71	RegionPrinter: Ignore back edges when layouting the graph llvm-svn: 126564	2011-02-27 04:11:07 +00:00
Tobias Grosser	789fab6b30	Pass the graph to the DOTGraphTraits.getEdgeAttributes(). This follows the interface of getNodeAttributes. llvm-svn: 126562	2011-02-27 04:11:03 +00:00
Daniel Dunbar	d4a7704765	Support: Add llvm::AreStatisticsEnabled(). llvm-svn: 126558	2011-02-26 23:17:12 +00:00
Benjamin Kramer	412ffed4f0	Add some DAGCombines for (adde 0, 0, glue), which are useful to optimize legalized code for large integer arithmetic. 1. Inform users of ADDEs with two 0 operands that it never sets carry 2. Fold other ADDs or ADDCs into the ADDE if possible It would be neat if we could do the same thing for SETCC+ADD eventually, but we can't do that in target independent code. llvm-svn: 126557	2011-02-26 22:48:07 +00:00
Jim Grosbach	d3d3996a4a	Trailing whitespace. llvm-svn: 126526	2011-02-25 22:53:20 +00:00
Owen Anderson	bd26993873	Allow targets to specify a the type of the RHS of a shift parameterized on the type of the LHS. llvm-svn: 126518	2011-02-25 21:41:48 +00:00
Cameron Zwarich	974208a607	Roll out r126425 and r126450 to see if it fixes the failures on the buildbots. llvm-svn: 126488	2011-02-25 16:30:32 +00:00
Benjamin Kramer	39a5d8596c	Revert "SimplifyCFG: GEPs with just one non-constant index are also cheap." Yes, there are other types than i8* and GEPs on them can produce an add+multiply. We don't consider that cheap enough to be speculatively executed. llvm-svn: 126481	2011-02-25 10:33:33 +00:00
Bob Wilson	6bbffe19e9	Add patterns to use post-increment addressing for Neon VST1-lane instructions. llvm-svn: 126477	2011-02-25 06:42:42 +00:00
Jim Grosbach	61c746f927	Fix formatting of debug helper string. llvm-svn: 126471	2011-02-25 03:59:03 +00:00
Evan Cheng	56354c17d9	Fix typo. llvm-svn: 126467	2011-02-25 01:29:29 +00:00
Cameron Zwarich	1a975ff395	Set NumSignBits to 1 if KnownZero/KnownOne are being zero extended. In theory it is possible to do better if the high bit is set in either KnownZero/KnownOne, but in practice NumSignBits is always 1 when we are zero extending because nothing is known about that register. llvm-svn: 126465	2011-02-25 01:11:01 +00:00
Cameron Zwarich	aa61fbeebb	We only want to zero extend the existing information if the bit width is actually larger. llvm-svn: 126464	2011-02-25 01:10:55 +00:00
Jakob Stoklund Olesen	9d328484ce	Try harder to get the hint by preferring to evict hint interference. llvm-svn: 126463	2011-02-25 01:04:22 +00:00
Evan Cheng	fbdcea4b2e	Each prologue may have multiple vpush instructions to store callee-saved D registers since the vpush list may not have gaps. Make sure the stack adjustment instruction isn't moved between them. Ditto for vpop in epilogues. Sorry, can't reduce a small test case. rdar://9043312 llvm-svn: 126457	2011-02-25 00:24:46 +00:00
Benjamin Kramer	44b43a85db	SimplifyCFG: GEPs with just one non-constant index are also cheap. llvm-svn: 126452	2011-02-24 23:26:09 +00:00
Jakob Stoklund Olesen	4790d4c74b	Tweak the register allocator priority queue some more. New live ranges are assigned in long -> short order, but live ranges that have been evicted at least once are deferred and assigned in short -> long order. Also disable splitting and spilling for live ranges seen for the first time. The intention is to create a realistic interference pattern from the heavy live ranges before starting splitting and spilling around it. llvm-svn: 126451	2011-02-24 23:21:36 +00:00
Nick Lewycky	35539486f0	Remove dead variable. llvm-svn: 126450	2011-02-24 23:15:43 +00:00
Benjamin Kramer	b5996b08b7	SimplifyCFG: GEPs with constant indices are cheap enough to be executed unconditionally. llvm-svn: 126445	2011-02-24 22:46:11 +00:00
Joerg Sonnenberger	9c30c492df	Restore r125595 (reverted in r126336) with modifications: Introduce a variable in the AsmParserExtension whether [] is valid in an expression. If it is true, parse them like (). Enable this for ELF only. llvm-svn: 126443	2011-02-24 21:59:22 +00:00
Chris Lattner	55119c81aa	remove command line option debugging hook. llvm-svn: 126441	2011-02-24 21:53:03 +00:00
Devang Patel	f2b2417c2c	Enable DebugInfo support for COFF object files. Patch by Nathan Jeffords! llvm-svn: 126425	2011-02-24 21:04:00 +00:00
Nadav Rotem	ab7cf630f4	Enable support for vector sext and trunc: Limit the folding of any_ext and sext into the load operation to scalars. Limit the active-bits trunc optimization to scalars. Document vector trunc and vector sext in LangRef. Similar to commit 126080 (for enabling zext). llvm-svn: 126424	2011-02-24 21:01:34 +00:00
Rafael Espindola	3a1fd70006	Fix llvm-gcc bootstrap with gnu ld. The problem was codegen guessing the wrong values and printing .section .eh_frame,"aMS",@progbits,4 It is not clear at all if Codegen should try to guess, MC is the one that should know the default flags. llvm-svn: 126421	2011-02-24 20:18:01 +00:00
Devang Patel	29b946c0bd	Do not use DIFactory. Use DIBuilder. llvm-svn: 126398	2011-02-24 18:49:55 +00:00
Devang Patel	e0f113c206	Do not use DIFactory. llvm-svn: 126397	2011-02-24 18:49:30 +00:00
Richard Osborne	a8df984a31	Add XCore intrinsic for eeu instruction. llvm-svn: 126384	2011-02-24 13:39:18 +00:00
Benjamin Kramer	feccb33180	Plug some leaks in edis. - Don't leak parsed operands during tokenization. - Don't leak printed insts in llvm-mc. llvm-svn: 126381	2011-02-24 11:03:19 +00:00
Cameron Zwarich	724eb8706a	Merge information about the number of zero, one, and sign bits of live-out registers at phis. This enables us to eliminate a lot of pointless zexts during the DAGCombine phase. This fixes <rdar://problem/8760114>. llvm-svn: 126380	2011-02-24 10:00:25 +00:00
Cameron Zwarich	af4779907f	Add a getNumSignBits() method to APInt. llvm-svn: 126379	2011-02-24 10:00:20 +00:00
Cameron Zwarich	e79a75febe	Add a mechanism for invalidating the LiveOutInfo of a PHI, and use it whenever a block is visited before all of its predecessors. llvm-svn: 126378	2011-02-24 10:00:16 +00:00
Cameron Zwarich	5c9384705f	Track blocks visited in reverse postorder. llvm-svn: 126377	2011-02-24 10:00:13 +00:00
Cameron Zwarich	3d2f99227a	Refactor the LiveOutInfo interface into a few methods on FunctionLoweringInfo and make the actual map private. llvm-svn: 126376	2011-02-24 10:00:08 +00:00
Cameron Zwarich	e49bf60392	Have isel visit blocks in reverse postorder rather than an undefined order. This allows for the information propagated across basic blocks to be merged at phis. llvm-svn: 126375	2011-02-24 10:00:04 +00:00
Chris Lattner	e64e9c6f73	wire TargetLibraryInfo into simplify libcalls and use it in a couple of trivial places. This pass needs a lot of work. llvm-svn: 126367	2011-02-24 07:16:14 +00:00
Chris Lattner	d1bb8c0277	move a massive amount of code out into its own helper function to reduce nesting. This needs to be turned into a table. llvm-svn: 126366	2011-02-24 07:12:12 +00:00
Chris Lattner	72a2ebab6c	change instcombine to not turn a call to non-varargs bitcast of function prototype into a call to a varargs prototype. We do allow the xform if we have a definition, but otherwise we don't want to risk that we're changing the abi in a subtle way. On X86-64, for example, varargs require passing stuff in %al. llvm-svn: 126363	2011-02-24 05:10:56 +00:00
Evan Cheng	9db7b1367d	Fix bug in X86 folding / unfolding table. Int_CMPSDrm and Int_CMPSSrm memory operands starts at index 2, not 1. rdar://9045024 PR9305 llvm-svn: 126359	2011-02-24 02:36:52 +00:00
Jakob Stoklund Olesen	0f1be641db	Use the same spill slot for all live ranges that descend form the same original register. This avoids some silly stack slot shuffling when both sides of a copy get spilled. llvm-svn: 126353	2011-02-24 01:07:55 +00:00
Devang Patel	e8ade74a52	Use DW_FORM_data2 for DW_AT_language and let users use DW_LANG_lo_user=0x8000 to DW_LANG_hi_user=0xffff range. llvm-svn: 126339	2011-02-23 22:37:04 +00:00
Jim Grosbach	e7be1f2990	Revert r125595, which is an X86-only undocumented assembly syntax extension enabled for all targets. Non-X86 targets should not have this behavior enabled by default. Joerg, if you would like to resubmit with the behavior conditionalized to be X86-ELF only, that's fine. llvm-svn: 126336	2011-02-23 21:26:51 +00:00
Rafael Espindola	993a4ef35c	Put in the symbol table symbols only used in a .globl statement. Fixes PR9292. llvm-svn: 126330	2011-02-23 20:22:07 +00:00
Richard Osborne	d9564589f6	Add XCore intrinsic for clre instruction. llvm-svn: 126322	2011-02-23 18:52:05 +00:00
Richard Osborne	4a55817288	Add llvm.xcore.waitevent intrinsic. The effect of this intrinsic is to enable events on the thread and wait until a resource is ready to event. The vector of the resource that is ready is returned. llvm-svn: 126320	2011-02-23 18:35:59 +00:00
Jakob Stoklund Olesen	5edc1f287b	It is safe to ignore LastSplitPoint when the variable is not live out. No code will be inserted after the split point anyway. llvm-svn: 126319	2011-02-23 18:26:31 +00:00
Richard Osborne	aaac1b01fd	Add XCore intrinsic for the setv instruction. llvm-svn: 126315	2011-02-23 16:46:37 +00:00
Richard Osborne	2374e9683e	Fix format for setc instruction. llvm-svn: 126314	2011-02-23 15:20:16 +00:00
Richard Osborne	aa39bf94b4	Add XCore intrinsic for settw instruction. llvm-svn: 126313	2011-02-23 14:45:03 +00:00
Sean Callanan	e1308394f1	Fixed a bug in the enhanced disassembler that caused it to ignore valid uses of FS and GS as additional base registers in address computations. Added a test case for this. llvm-svn: 126302	2011-02-23 03:31:28 +00:00
Evan Cheng	98e040ea71	Change VFPNeonA8 definition to make the code easier to read. llvm-svn: 126298	2011-02-23 02:35:33 +00:00
Stuart Hastings	c0c38e8673	Omit private_extern declarations of extern symbols; followup to r124468. Patch by Rafael Avila de Espindola! llvm-svn: 126297	2011-02-23 02:27:05 +00:00
Evan Cheng	da40bcab44	More fcopysign correctness and performance fix. The previous codegen for the slow path (when values are in VFP / NEON registers) was incorrect if the source is NaN. The new codegen uses NEON vbsl instruction to copy the sign bit. e.g. vmov.i32 d1, #0x80000000 vbsl d1, d2, d0 If NEON is not available, it uses integer instructions to copy the sign bit. rdar://9034702 llvm-svn: 126295	2011-02-23 02:24:55 +00:00
Jakob Stoklund Olesen	17add01160	Keep track of how many times a live range has been dequeued, and prioritize new ranges. When a large live range is evicted, it will usually be split when it comes around again. By deferring evicted live ranges, the splitting happens at a time when the interference pattern is more realistic. This prevents repeated splitting and evictions. llvm-svn: 126282	2011-02-23 00:56:56 +00:00
Jakob Stoklund Olesen	18a19b665c	Fix a bug in determining if there is only a single interfering register. llvm-svn: 126277	2011-02-23 00:29:55 +00:00
Jakob Stoklund Olesen	58efee5c3e	Be more aggressive about evicting interference. Use interval sizes instead of spill weights to determine if it is legal to evict interference. A smaller interval can evict interference if all interfering live ranges are larger. Allow multiple interferences to be evicted as along as they are all larger than the live range being allocated. Spill weights are still used to select the preferred eviction candidate. llvm-svn: 126276	2011-02-23 00:29:52 +00:00
David Greene	7b0539174a	[AVX] General VUNPCKL codegen support. llvm-svn: 126264	2011-02-22 23:31:46 +00:00
Jakob Stoklund Olesen	505c6fac89	Change the RAGreedy register assignment order so large live ranges are allocated first. This is based on the observation that long live ranges are more difficult to allocate, so there is a better chance of solving the puzzle by handling the big pieces first. The allocator will evict and split long alive ranges when they get in the way. RABasic is still using spill weights for its priority queue, so the interface to the queue has been virtualized. llvm-svn: 126259	2011-02-22 23:01:52 +00:00
Jakob Stoklund Olesen	5b9699abdd	80 Col. llvm-svn: 126258	2011-02-22 23:01:49 +00:00
Cameron Zwarich	c5fa112a70	Make LoopDeletion work on loops with multiple edges, as long as the incoming values from all of the loop's exiting blocks are equal. Patch by Andrew Clinton. llvm-svn: 126253	2011-02-22 22:25:39 +00:00
Joerg Sonnenberger	67e0eb235d	Use the same (%dx) hack for in[bwl] as for out[bwl]. llvm-svn: 126244	2011-02-22 20:40:09 +00:00
Evan Cheng	f540b0e0f6	VFP single precision arith instructions can go down to NEON pipeline, but on Cortex-A8 only. llvm-svn: 126238	2011-02-22 19:53:14 +00:00
Devang Patel	74d085657f	Follow LLVM coding style. clang uses DBuilder, so it requries corresponding change. llvm-svn: 126231	2011-02-22 18:56:12 +00:00
Roman Divacky	f028b1614b	Stack alignment is 16 bytes on FreeBSD/i386 too. llvm-svn: 126226	2011-02-22 17:30:05 +00:00
Joerg Sonnenberger	cbdd830a75	Bug#9172: Don't use static in file scope, use an attribute on the parser. llvm-svn: 126225	2011-02-22 16:53:11 +00:00
Cameron Zwarich	bde7e8b3e0	MachineConstantPoolValues are not uniqued, so they need to be freed if they share entries. Add a DenseSet to MachineConstantPool for the MachineCPVs that it owns. This will hopefully fix the MC/ARM/elf-reloc-01.ll failure on the leaks bots. llvm-svn: 126218	2011-02-22 08:54:30 +00:00
Evan Cheng	f7c6f8580b	Guard against de-referencing MBB.end(). llvm-svn: 126192	2011-02-22 07:07:59 +00:00
Evan Cheng	6e3d087477	available_externally (hidden or not) GVs are always accessed via stubs. rdar://9027648. llvm-svn: 126191	2011-02-22 06:58:34 +00:00
Andrew Trick	ec08eae0aa	VirtRegRewriter assertion fix. Apparently it's ok for multiple operands to "kill" the same register. Fixes PR9237. llvm-svn: 126190	2011-02-22 06:52:56 +00:00
Cameron Zwarich	c942ffcae4	Roll out r126169 and r126170 in an attempt to fix the selfhost bot. llvm-svn: 126185	2011-02-22 03:24:52 +00:00
Eric Christopher	58b95654bc	Only use blx for external function calls on thumb, these could be fixed up by the dynamic linker, but it's better to use the correct instruction to begin with. Fixes rdar://9011034 llvm-svn: 126176	2011-02-22 01:37:10 +00:00
Cameron Zwarich	63ed1f4c67	Merge information about the number of zero, one, and sign bits of live-out registers at phis. This enables us to eliminate a lot of pointless zexts during the DAGCombine phase. This fixes <rdar://problem/8760114>. llvm-svn: 126170	2011-02-22 00:46:27 +00:00
Cameron Zwarich	939fceef5b	Have isel visit blocks in reverse postorder rather than an undefined order. This allows for the information propagated across basic blocks to be merged at phis. llvm-svn: 126169	2011-02-22 00:46:22 +00:00
Joerg Sonnenberger	9dceff5417	Recognize loopz and loopnz as aliases for loope and loopne. From Dimitry Andric. llvm-svn: 126168	2011-02-22 00:43:07 +00:00
Rafael Espindola	e4a04cce2b	Implement xgetbv and xsetbv. Patch by Jai Menon. llvm-svn: 126165	2011-02-22 00:35:18 +00:00
Eric Christopher	de9e3eaf5f	Revert r125960, it's breaking darwin10 bootstrap. llvm-svn: 126163	2011-02-21 23:52:19 +00:00
Evan Cheng	aaa5bd52f4	Skipping over debugvalue instructions to determine whether the split spot is in a IT block. rdar://9030770 llvm-svn: 126159	2011-02-21 23:40:47 +00:00
Evan Cheng	f5f2a92f8f	Add more debugging output. llvm-svn: 126158	2011-02-21 23:39:48 +00:00
Joerg Sonnenberger	dfe33572b5	Handle FK_PCRel_1 and add a test case for this and FK_PCRel_4. llvm-svn: 126157	2011-02-21 23:25:41 +00:00
Devang Patel	d5c4589795	Revert r124611 - "Keep track of incoming argument's location while emitting LiveIns." In other words, do not keep track of argument's location. The debugger (gdb) is not prepared to see line table entries for arguments. For the debugger, "second" line table entry marks beginning of function body. This requires some coordination with debugger to get this working. - The debugger needs to be aware of prolog_end attribute attached with line table entries. - The compiler needs to accurately mark prolog_end in line table entries (at -O0 and at -O1+) llvm-svn: 126155	2011-02-21 23:21:26 +00:00
Jakob Stoklund Olesen	508829f1d1	Add SplitKit::isOriginalEndpoint and use it to force live range splitting to terminate. An original endpoint is an instruction that killed or defined the original live range before any live ranges were split. When splitting global live ranges, avoid creating local live ranges without any original endpoints. We may still create global live ranges without original endpoints, but such a range won't be split again, and live range splitting still terminates. llvm-svn: 126151	2011-02-21 23:09:46 +00:00
Sean Callanan	8aaf83f2b8	Fixed a bug in the X86 disassembler where a member of the X86 instruction decode structure was being interpreted as being in units of bits, although it is actually stored in units of bytes. llvm-svn: 126147	2011-02-21 21:55:05 +00:00
Stuart Hastings	8181f2d5a3	End the line if we return early. Radar 9012638. llvm-svn: 126141	2011-02-21 21:07:07 +00:00
Richard Osborne	bd0e21b5ca	Add XCore intrinsics for various instructions on ports. llvm-svn: 126132	2011-02-21 18:23:30 +00:00
Duncan Sands	77c058dc70	The stack should be 16 byte aligned on 32 bit solaris. Patch by Yuri. llvm-svn: 126130	2011-02-21 17:37:17 +00:00
Duncan Sands	17a277da6d	If the phi node was used by an unreachable instruction that ends up using itself without going via a phi node then we could return false here in spite of making a change. Also, tweak the comment because this method can (and always could) return true without deleting the original phi node. For example, if the phi node was used by a read-only invoke instruction which is used by another phi node phi2 which is only used by and only uses the invoke, then phi2 would be deleted but not the invoke instruction and not the original phi node. llvm-svn: 126129	2011-02-21 17:32:05 +00:00
Stuart Hastings	d7ca2e5b61	Fix to correctly support attribute((section("__DATA, __common"))). Radar 9012638. llvm-svn: 126127	2011-02-21 17:27:17 +00:00
Chris Lattner	e7dc7e1e5b	a serious "compare CSE" issue that is nontrivial to get right, but which is responsible for us doing really bad things to 256.bzip2. llvm-svn: 126126	2011-02-21 17:03:47 +00:00
Chris Lattner	83c60ae907	fix a crasher in disabled code (on variable stride loops) llvm-svn: 126125	2011-02-21 17:02:55 +00:00
Duncan Sands	702e8b136e	Simplify RecursivelyDeleteDeadPHINode. The only functionality change should be that if the phi is used by a side-effect free instruction with no uses then the phi and the instruction now get zapped (checked by the unittest). llvm-svn: 126124	2011-02-21 16:27:36 +00:00
NAKAMURA Takumi	a03e9f0267	Target/X86/X86FastISel: [PR6275] Fix Win32's dllimport function with fastisel. "dllimport" function must not be GlobalVariable, but Function. It is enough to check with GlobalValue. test/CodeGen/X86/dll-linkage.ll is updated to check llc -O0. llvm-svn: 126110	2011-02-21 04:50:06 +00:00
Venkatraman Govindaraju	1a5bbc7f1e	Generate correct Sparc32 ABI compliant code for functions that return a struct. llvm-svn: 126108	2011-02-21 03:42:44 +00:00
Chris Lattner	c373140c8b	add a missed loop deletion case. llvm-svn: 126103	2011-02-21 02:13:39 +00:00
Chris Lattner	9d2899115f	Add some (disabled code) to print out negative strides. llvm-svn: 126102	2011-02-21 02:08:54 +00:00
Chris Lattner	8760c28fe1	add an idiom that loop idiom could theoretically catch. llvm-svn: 126101	2011-02-21 01:33:38 +00:00
Cameron Zwarich	3384d8f317	A lo/hi mul has higher latency than an imul r,ri, e.g. 5 cycles compared to 3 on Core 2 and Nehalem, so the code we generate is better than GCC's here. llvm-svn: 126100	2011-02-21 01:29:32 +00:00
Joerg Sonnenberger	2204260330	Use a vector of pairs to implement the section stack, not two independent vectors. llvm-svn: 126099	2011-02-21 01:07:42 +00:00
Cameron Zwarich	b7e676db6c	The signed version of our "magic number" computation for the integer approximation of a constant had a minor typo introduced when copying it from the book, which caused it to favor negative approximations over positive approximations in many cases. Positive approximations require fewer operations beyond the multiplication. In the case of division by 3, we still generate code that is a single instruction larger than GCC's code. llvm-svn: 126097	2011-02-21 00:22:02 +00:00
Rafael Espindola	5fd4a261e7	Add some limited support for labels in org directives. Hopefully enough to fix PR9245. llvm-svn: 126091	2011-02-20 20:20:07 +00:00
Nick Lewycky	ecae3aec02	Make RecursivelyDeleteDeadPHINode delete a phi node that has no users and add a test for that. With this change, test/CodeGen/X86/codegen-dce.ll no longer finds any instructions to DCE, so delete the test. Also renamed J and JP to I and IP in RecursivelyDeleteDeadPHINode. llvm-svn: 126088	2011-02-20 18:05:56 +00:00
Benjamin Kramer	85011c0273	Move "A \| ~(A & ?) -> -1" from InstCombine to InstructionSimplify. llvm-svn: 126082	2011-02-20 15:20:01 +00:00
Benjamin Kramer	50cd35c25e	InstCombine: Add a bunch of combines of the form x \| (y ^ z). We usually catch this kind of optimization through InstSimplify's distributive magic, but or doesn't distribute over xor in general. "A \| ~(A \| B) -> A \| ~B" hits 24 times on gcc.c. llvm-svn: 126081	2011-02-20 13:23:43 +00:00
Nadav Rotem	1660c0bc25	Fix 9267; Add vector zext support. The DAGCombiner folds the zext into complex load instructions. This patch prevents this optimization on vectors since none of the supported targets knows how to perform load+vector_zext in one instruction. llvm-svn: 126080	2011-02-20 12:37:50 +00:00
Nick Lewycky	4d7bb906df	Teach RecursivelyDeleteDeadPHINodes to handle multiple self-references. Patch by Andrew Clinton! llvm-svn: 126077	2011-02-20 08:38:20 +00:00
Nick Lewycky	cd05f2a659	Instead of keeping two Value*->id# mappings, keep one Value->Value mapping and one Value set. This is faster because we only need to use the set when there isn't already an entry in the map. No functionality change! llvm-svn: 126076	2011-02-20 08:11:03 +00:00

... 3 4 5 6 7 ...

45920 Commits