llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 22:12:57 +02:00

Author	SHA1	Message	Date
Jay Foad	5827394cd0	Don't generate redundant casts of constant values when lowering calls to memcpy, memmove and memset. llvm-svn: 71427	2009-05-11 11:32:25 +00:00
Bill Wendling	e0a4e2af03	This is a large rewrite of how Dwarf info for inlined functions is handled. The DwarfWriter expects DbgScopes and DIEs to behave themselves according to DwarfWriter's rules. However, inlined functions violate these rules. There are two different types of DIEs associated with an inlined function: an abstract instance, which has information about the original source code for the function being inlined; and concrete instances, which are created for each place the function was inlined and point back to the abstract instance. This patch tries to stay true to this schema. It bypasses how regular DbgScopes and DIEs are created and used when necessary. It provides special handling for DIEs of abstract and concrete instances. This doesn't take care of all of the problems with debug info for inlined functions, but it's a step in the right direction. For one thing, llvm-gcc generates wrong IR (it's missing some llvm.dbg intrinsics at the point where the function's inlined) for this example: #include <stdio.h> static __inline__ __attribute__((always_inline)) int bar(int x) { return 4; } void foo() { long long b = 1; int Y = bar(4); printf("%d\n", Y); } while clang generates correct IR. llvm-svn: 71410	2009-05-10 23:14:38 +00:00
Bill Wendling	c4ffa72fc4	--- Reverse-merging r71370 into '.': U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp Revert r71370. llvm-svn: 71373	2009-05-10 00:10:50 +00:00
Bill Wendling	bea75516f2	A debug function start was not being recorded when the optimization level wasn't None. However, we were always recording the region end. There's no longer a good reason for this code to be separated out between the different opt levels, as it was doing pretty much the same thing anyway. llvm-svn: 71370	2009-05-09 23:51:35 +00:00
Evan Cheng	581641b664	Oops. Don't forget to align single bb loops. llvm-svn: 71363	2009-05-09 19:18:01 +00:00
Duncan Sands	f7af13b2d4	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Evan Cheng	06b0d3879e	Enable loop bb placement optimization. llvm-svn: 71291	2009-05-08 23:35:49 +00:00
Mike Stump	9779d47f7a	Avoid warning in release-asserts build. llvm-svn: 71275	2009-05-08 22:53:06 +00:00
Bill Wendling	1dccca6452	Mirror how Fast ISel determines if a region.end intrinsic is the end of an inlined function or the end of a function. Before, this was never executing the "inlined" version of the Record method. This will become important once the inlined Dwarf writer patch lands. llvm-svn: 71268	2009-05-08 21:14:49 +00:00
Bill Wendling	6ad9e22d42	Compute the offsets of the compile units. We need this so that when we emit a concrete instance of an inlined function, we can get the actual address of the abstract instance inside of the compile unit. This isn't currently used, but will be by a future check-in. llvm-svn: 71263	2009-05-08 21:03:15 +00:00
Bill Wendling	d7428b0d9c	Minor clean ups. No functionality change. llvm-svn: 71256	2009-05-08 20:38:02 +00:00
Evan Cheng	aadb8051c0	Don't align loop header unless the loop back edge is below the header. llvm-svn: 71242	2009-05-08 19:01:44 +00:00
Anton Korobeynikov	b3dc881070	Factor out cycle-finder code and make it generic. llvm-svn: 71241	2009-05-08 18:51:58 +00:00
Anton Korobeynikov	026d2328a6	Do not emit bit tests if the target does not natively support left shift. llvm-svn: 71240	2009-05-08 18:51:34 +00:00
Anton Korobeynikov	353d4609cf	Properly expand libcalls for urem / srem. Also make code more straightforward. llvm-svn: 71238	2009-05-08 18:51:08 +00:00
Anton Korobeynikov	aa7f982935	Typo llvm-svn: 71237	2009-05-08 18:50:54 +00:00
Evan Cheng	10038ab095	Reverse branch condition only when there is a conditional branch. llvm-svn: 71214	2009-05-08 09:35:53 +00:00
Nick Lewycky	a8f179d44b	Add explicit braces to disambiguate nested if/else. Removes a warning. llvm-svn: 71211	2009-05-08 06:57:41 +00:00
Evan Cheng	2a1d20b0fb	Optimize code placement in loop to eliminate unconditional branches or move unconditional branch to the outside of the loop. e.g. /// A: /// ... /// <fallthrough to B> /// /// B: --> loop header /// ... /// jcc <cond> C, [exit] /// /// C: /// ... /// jmp B /// /// ==> /// /// A: /// ... /// jmp B /// /// C: --> new loop header /// ... /// <fallthrough to B> /// /// B: /// ... /// jcc <cond> C, [exit] llvm-svn: 71209	2009-05-08 06:34:09 +00:00
Bob Wilson	d61f4e70d8	Fix pr4100. Do not remove no-op copies when they are dead. The register scavenger gets confused about register liveness if it doesn't see them. I'm not thrilled with this solution, but it only comes up when there are dead copies in the code, which is something that hopefully doesn't happen much. Here is what happens in pr4100: As shown in the following excerpt from the debug output of llc, the source of a move gets reloaded from the stack, inserting a new load instruction before the move. Since that source operand is a kill, the physical register is free to be reused for the destination of the move. The move ends up being a no-op, copying R3 to R3, so it is deleted. But, it leaves behind the load to reload %reg1028 into R3, and that load is not updated to show that its destination operand (R3) is dead. The scavenger gets confused by that load because it thinks that R3 is live. Starting RegAlloc of: %reg1025<def,dead> = MOVr %reg1028<kill>, 14, %reg0, %reg0 Regs have values: Reloading %reg1028 into R3 Last use of R3[%reg1028], removing it from live set Assigning R3 to %reg1025 Register R3 [%reg1025] is never used, removing it from live set Alternative solutions might be either marking the load as dead, or zapping the load along with the no-op copy. I couldn't see an easy way to do either of those, though. llvm-svn: 71196	2009-05-07 23:47:03 +00:00
Bob Wilson	a46384485b	Fix a comment (again). llvm-svn: 71180	2009-05-07 21:20:42 +00:00
Bob Wilson	8028930294	Fix a comment. llvm-svn: 71179	2009-05-07 21:19:45 +00:00
Dan Gohman	ebacd61d7d	Revert 71165. It did more than just revert 71158 and it introduced several regressions. The problem due to 71158 is now fixed. llvm-svn: 71176	2009-05-07 19:46:24 +00:00
Bill Wendling	9f97e4a3dc	Temporarily revert r71158. It was causing a failure during a full bootstrap: checking for bcopy... no checking for getc_unlocked... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decUtility.c:360: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: *** [decUtility.o] Error 1 make[4]: *** Waiting for unfinished jobs.... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decNumber.c:5591: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: *** [decNumber.o] Error 1 make[3]: *** [all-stage2-libdecnumber] Error 2 make[3]: *** Waiting for unfinished jobs.... llvm-svn: 71165	2009-05-07 17:26:14 +00:00
Argyrios Kyrtzidis	bd72fc132d	Move the tablegen-produced DebugLoc handling into an AsmWriter::processDebugLoc function. No functionality change. llvm-svn: 71156	2009-05-07 13:55:51 +00:00
Evan Cheng	ef5794cb99	Code refactoring. llvm-svn: 71151	2009-05-07 05:49:39 +00:00
Evan Cheng	1b99da6e30	Rename "loop aligner" pass to "code placement optimization" pass. llvm-svn: 71150	2009-05-07 05:42:24 +00:00
Bill Wendling	6edd6ef74f	Just turn aggressive stack coloring off at -O3. llvm-svn: 71140	2009-05-07 01:33:38 +00:00
Bill Wendling	7c50dcd02e	Temporarily revert r71010. It was causing massive failures during self-hosting. llvm-svn: 71138	2009-05-07 01:27:25 +00:00
Argyrios Kyrtzidis	0f60e636c0	Make DwarfWriter::RecordInlinedFnStart more like the other DwarfWriter's methods: -Have it return a label ID -Remove the unused Instruction parameter No functionality change. llvm-svn: 71132	2009-05-07 00:16:31 +00:00
Bill Wendling	6e1b018958	- Move some debug fields to coincide with how GCC emits them. No functionality change. - Reformatting. llvm-svn: 71118	2009-05-06 21:21:34 +00:00
Evan Cheng	0ee6696fd8	Do not use register as base ptr of pre- and post- inc/dec load / store nodes. llvm-svn: 71098	2009-05-06 18:25:01 +00:00
Oscar Fuentes	24167db5ad	CMake: Updated lib/CodeGen/CMakeLists.txt. llvm-svn: 71085	2009-05-06 14:56:40 +00:00
Duncan Sands	938fde7e43	Add generic expansion of SUB when ADD and XOR are legal. Based on a patch by Micah Villmow. llvm-svn: 71078	2009-05-06 11:29:50 +00:00
Lang Hames	fcc5ebb1d4	Renamed Spiller classes (plus uses and related files) to VirtRegRewriter. llvm-svn: 71057	2009-05-06 02:36:21 +00:00
Dan Gohman	5e839321f2	If a MachineBasicBlock has multiple ways of reaching another block, allow it to have multiple CFG edges to that block. This is needed to allow MachineBasicBlock::isOnlyReachableByFallthrough to work correctly. This fixes PR4126. llvm-svn: 71018	2009-05-05 21:10:19 +00:00
Evan Cheng	984da04cd0	Enable stack coloring with regs at -O3. llvm-svn: 71010	2009-05-05 20:30:36 +00:00
Chris Lattner	a96ef42a06	Do not require variable debug info nodes to have a compile unit. For implicit decls like "self" and "_cmd" in ObjC, these decls should not have a location. llvm-svn: 70964	2009-05-05 04:55:56 +00:00
Evan Cheng	77e14276e0	Do not substitute if the new register isn't in the register class of the operand being updated. llvm-svn: 70953	2009-05-05 00:46:16 +00:00
Evan Cheng	95ce4ffb36	Move getInstrOperandRegClass from the scheduler to TargetInstrInfo. llvm-svn: 70950	2009-05-05 00:30:09 +00:00
Evan Cheng	ecfc8e8464	Do forward and backward substitution to eliminate loads and stores when possible. llvm-svn: 70937	2009-05-04 23:13:13 +00:00
Chris Lattner	7e3c94b55e	Make DBG_STOPPOINT nodes, and therefore DBG_LABEL labels, get a DebugLoc, so that it shows up in -print-machineinstrs. This doesn't appear to affect anything, but it was weird for some DBG_LABELs to have DebugLocs but not all of them. llvm-svn: 70921	2009-05-04 22:10:05 +00:00
Argyrios Kyrtzidis	f82d02a6ca	Restore a comment. llvm-svn: 70900	2009-05-04 19:23:45 +00:00
Argyrios Kyrtzidis	fb958c2b09	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. (Comes with Regression-Be-Gone(tm)) llvm-svn: 70871	2009-05-04 16:23:49 +00:00
Evan Cheng	9df9768ee5	Make sure to color with only allocatable registers for the specific register class. llvm-svn: 70821	2009-05-04 03:30:11 +00:00
Evan Cheng	bb12bac53b	The stack slots which share the same stack slot after coloring can, but do not have to, use the same register. In fact, they each may have different register class requirements. llvm-svn: 70815	2009-05-04 00:24:50 +00:00
Argyrios Kyrtzidis	e68261749e	Revert r70803 for now, it causes a regression. llvm-svn: 70811	2009-05-03 23:27:19 +00:00
Argyrios Kyrtzidis	bb6e4d027c	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. llvm-svn: 70803	2009-05-03 22:03:35 +00:00
Evan Cheng	a64d6b8822	Typo. llvm-svn: 70792	2009-05-03 19:10:11 +00:00
Evan Cheng	28aa6c41d1	In some rare cases, the register allocator can spill registers but end up not utilizing registers at all. The fundamental problem is that linearscan's backtracking can end up freeing more than one allocated register. However, reloads and restores might be folded into uses / defs and freed registers might not be used at all. VirtRegMap keeps track of allocations so it knows what's not used. As a horrible hack, the stack coloring can color spill slots with free registers. That is, it replaces reloads and spills with copies from and to the free register. It unfolds instructions that load and store the spill slot and replaces them with register-using variants. Not yet enabled. This is part 1. More coming. llvm-svn: 70787	2009-05-03 18:32:42 +00:00
Anton Korobeynikov	15587901c3	Fix typo llvm-svn: 70770	2009-05-03 13:19:57 +00:00
Anton Korobeynikov	34d22f34a8	Properly handle sdiv / udiv / srem / urem libcalls llvm-svn: 70764	2009-05-03 13:18:16 +00:00
Anton Korobeynikov	7f560f113d	Properly name 16-bit libcalls llvm-svn: 70750	2009-05-03 13:14:08 +00:00
Anton Korobeynikov	b4da45ecd8	Add libcall expansion for 16 and 128 bit muls llvm-svn: 70749	2009-05-03 13:13:51 +00:00
Argyrios Kyrtzidis	a034549d67	-Move the DwarfWriter::ValidDebugInfo check to a static DIDescriptor::ValidDebugInfo -Create DebugLocs without the need to have a DwarfWriter around llvm-svn: 70682	2009-05-03 08:50:41 +00:00
Bob Wilson	da90bf9e40	Allow CONCAT_VECTORS nodes to be legal or have custom lowering for some targets. Changes to take advantage of this will come later. llvm-svn: 70560	2009-05-01 17:55:32 +00:00
Bill Wendling	2da6a65b62	Simplify more code and add timer stuff. llvm-svn: 70539	2009-05-01 08:40:06 +00:00
Bill Wendling	0bab670012	Simplify more code. llvm-svn: 70537	2009-05-01 08:35:12 +00:00
Bill Wendling	662dfea32e	Simplify some code. llvm-svn: 70534	2009-05-01 08:32:14 +00:00
Bill Wendling	2f01fd9bf1	Fix whitespace. It was confusing me. llvm-svn: 70533	2009-05-01 08:25:13 +00:00
Evan Cheng	d6a780a181	Code clean up. Bye bye PhysRegTracker. llvm-svn: 70524	2009-05-01 01:03:49 +00:00
Argyrios Kyrtzidis	9956976b76	Make DebugLoc independent of DwarfWriter. -Replace DebugLocTuple's Source ID with CompileUnit's GlobalVariable* -Remove DwarfWriter::getOrCreateSourceID -Make necessary changes for the above (fix callsites, etc.) llvm-svn: 70520	2009-04-30 23:22:31 +00:00
Jakob Stoklund Olesen	17d292db73	Join cross class copies using getCommonSubClass() llvm-svn: 70513	2009-04-30 21:24:03 +00:00
Evan Cheng	a4c868f1d4	Add a smarter heuristic to determine when to coalesce a virtual register with a physical one. More specifically, it avoids tying a virtual register in the loop with a physical register defined / used outside the loop. When it determines it's not profitable, it will use the physical register as the allocation preference instead. This is not turned on by default. Testing indicates this is just as likely to pessimize code. The main issue seems to be that allocation preference doesn't work effectively. That will change once I've taught the register allocator "swapping". llvm-svn: 70503	2009-04-30 18:39:57 +00:00
Jay Foad	9768cabf4a	Move helper functions for optimizing division by constant into the APInt class. llvm-svn: 70488	2009-04-30 10:15:35 +00:00
Chris Lattner	794fb5b4b3	fix a regression handling indirect results: these need to be considered memory operands otherwise the writebacks get lost when the inline asm doesn't otherwise have side effects. This fixes rdar://6839427, though clang really shouldn't generate these anymore. llvm-svn: 70455	2009-04-30 00:48:50 +00:00
Bill Wendling	40a162f75f	Instead of passing in an unsigned value for the optimization level, use an enum, which better identifies what the optimization is doing. And is more flexible for future uses. llvm-svn: 70440	2009-04-29 23:29:43 +00:00
Nate Begeman	b407809122	Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. llvm-svn: 70425	2009-04-29 22:47:44 +00:00
Jakob Stoklund Olesen	0bfaaea2a4	MachineInstr::isRegTiedTo{Use,Def}Operand can safely be made const. llvm-svn: 70408	2009-04-29 20:57:16 +00:00
Nate Begeman	e4dd5a96ba	Update comment, replace theoretically impossible check with an assert. llvm-svn: 70391	2009-04-29 18:13:31 +00:00
Evan Cheng	62fdc300dd	spillPhysRegAroundRegDefsUses() may have invalidated iterators stored in fixed_ IntervalPtrs. Reset them. llvm-svn: 70378	2009-04-29 07:16:34 +00:00
Nate Begeman	414534b3eb	Implement review feedback for vector shuffle work. llvm-svn: 70372	2009-04-29 05:20:52 +00:00
Sanjiv Gupta	b1c777e865	Add a public method called getAddressSpace() to the GlobalAddressSDNode. llvm-svn: 70366	2009-04-29 04:43:24 +00:00
Chris Lattner	e1eefefdc3	Disable the load-shrinking optimization from looking at anything larger than 64-bits, avoiding a crash. This should really be fixed to use APInts, though type legalization happens to help us out and we get good code on the attached testcase at least. This fixes rdar://6836460 llvm-svn: 70360	2009-04-29 03:45:07 +00:00
Evan Cheng	51727d25b2	Determine allocation 'preference' with right register class. I haven't seen this changing codegen so no test case. llvm-svn: 70351	2009-04-29 00:42:27 +00:00
Bill Wendling	7546bed590	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Evan Cheng	46e0ff09e5	Move getMatchingSuperReg() out of coalescer and into TargetRegisterInfo. llvm-svn: 70309	2009-04-28 18:29:27 +00:00
Jakob Stoklund Olesen	6a489c6d7e	Don't coalesce a physical register with an incompatible virtual register. If the physical register does not belong to the virtual register's regclass, don't coalesce. The physical register could be an invalid operand for an instruction using the vreg. The regclass matching is done after determining the actual subregisters being copied. llvm-svn: 70298	2009-04-28 16:34:35 +00:00
Sanjiv Gupta	51c8ca2ca6	Initialized arrays can be in any address space. llvm-svn: 70297	2009-04-28 16:34:20 +00:00
Jakob Stoklund Olesen	26d9b11da5	Move getSubRegisterRegClass from ScheduleDagSDNodesEmit.cpp to a TargetRegisterClass method. Also make the method non-asserting. It will return NULL when given an invalid subreg index. The method is needed by an upcoming patch. llvm-svn: 70296	2009-04-28 16:34:09 +00:00
Evan Cheng	754a0d2f9e	Fix PR4034. Bug in LiveInterval::join when it's compacting new valno's. llvm-svn: 70291	2009-04-28 06:24:09 +00:00
Evan Cheng	8a9736a26c	Fix for PR4051. When the 2address pass deletes an instruction, update kill info when necessary. llvm-svn: 70279	2009-04-28 02:12:36 +00:00
Bill Wendling	ef47ace92f	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	2799e916c3	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Evan Cheng	c315cf24e3	Fix PR4076. Correctly create live interval of physical register with two-address update. llvm-svn: 70245	2009-04-27 20:42:46 +00:00
Owen Anderson	b313524b03	Don't skip the CopyMI when removing kill markers. This should have no effect on generated code, but makes the intermediate state of the coalescer more sane. llvm-svn: 70238	2009-04-27 19:55:47 +00:00
Duncan Sands	ae8fb3fbcb	Now that PR2957 is resolved, remove a bunch of no-longer needed workarounds. llvm-svn: 70234	2009-04-27 19:33:03 +00:00
Nate Begeman	9d121924fd	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Evan Cheng	43fc90ae59	Fix PR4056. It's possible a physical register def is dead if its implicit use is deleted by two-address pass. llvm-svn: 70213	2009-04-27 17:36:47 +00:00
Evan Cheng	c346651d6f	Also delete last unused val#. llvm-svn: 70212	2009-04-27 17:35:19 +00:00
Dan Gohman	744f455d55	When transforming sext(trunc(load(x))) into sext(smaller load(x)), the trunc is directly replaced with the smaller load, so don't try to create a new sext node. This fixes PR4050. llvm-svn: 70179	2009-04-27 02:00:55 +00:00
Evan Cheng	cd5a58a3e4	Reuse unused val#'s to avoid running out of memory in extreme cases. llvm-svn: 70069	2009-04-25 20:20:15 +00:00
Dan Gohman	1031d09a38	Refactor the code to grab the low and high parts of a value using EXTRACT_ELEMENT into a utility function. llvm-svn: 70056	2009-04-25 17:55:53 +00:00
Dan Gohman	60349bb21e	Add a top-level comment about DAGCombiner's role in the compiler. llvm-svn: 70052	2009-04-25 17:09:45 +00:00
Evan Cheng	696a04eba2	Do not share a single unknown val# for all the live ranges merged into a physical sub-register live interval. When the coalescer is merging a clobbered virtual register live interval into a physical register live interval, give each virtual register val# a separate val# in the physical register live interval. Otherwise, the coalescer would have lost track of the definitions information it needs to make correct coalescing decisions. llvm-svn: 70026	2009-04-25 09:25:19 +00:00
Dale Johannesen	493c3bcdc0	Fix PR 4057, a crash doing float->char const folding. This particular one is undefined behavior (although this isn't related to the crash), so it will no longer do it at compile time, which seems better. llvm-svn: 69990	2009-04-24 21:34:13 +00:00
Rafael Espindola	0b1037ad26	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	c1a09c7dfa	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Dan Gohman	63dab69b36	Instead of requiring TLI.LowerCallTo to return an ISD::BUILD_PAIR, use ISD::EXTRACT_ELEMENT. SelectionDAG has a special fast-path for the cast of an EXTRACT_ELEMENT with a BUILD_PAIR operand, for the common case. llvm-svn: 69948	2009-04-24 02:40:23 +00:00
Dan Gohman	9d13b714f5	Factor out a bit of code that appears in several places into a utility function. llvm-svn: 69937	2009-04-23 23:13:24 +00:00
Dan Gohman	313018c1d3	Handle Void types in ComputeValueVTs. This doesn't currently occur, but this change makes the code more general and easier to adapt for new purposes. llvm-svn: 69935	2009-04-23 22:50:03 +00:00
Evan Cheng	ff776c8193	Update comments. llvm-svn: 69919	2009-04-23 20:39:31 +00:00
Evan Cheng	f20fb20367	Fix an obvious type. llvm-svn: 69918	2009-04-23 20:18:13 +00:00
Evan Cheng	a36c6c6819	It has finally happened. Spiller is now using live interval info. This fixes a very subtle bug. vr defined by an implicit_def is allowed to overlap with any register since it doesn't actually modify anything. However, if it's used as a two-address use, its live range can be extended and it can be spilled. The spiller must take care not to emit a reload for the vn number that's defined by the implicit_def. This is both a correctness and performance issue. llvm-svn: 69743	2009-04-21 22:46:52 +00:00
Devang Patel	d679dbbacc	Fix Visual Studio 2008 build failure. Patch by Marius Wachtler llvm-svn: 69637	2009-04-21 00:08:56 +00:00
Dan Gohman	de72d5129b	Make X86's copyRegToReg able to handle copies to and from subclasses. This makes the extra copyRegToReg calls in ScheduleDAGSDNodesEmit.cpp unnecessary. Derived from a patch by Jakob Stoklund Olesen. llvm-svn: 69635	2009-04-20 22:54:34 +00:00
Dan Gohman	69fa329052	Simplify this code. getConstant knows how to make broadcasted vector constants. llvm-svn: 69634	2009-04-20 22:51:43 +00:00
Bob Wilson	f7e9ff1d28	Move duplicated AddLiveIn function from X86 and ARM backends to be a method in the MachineFunction class, renaming it to addLiveIn for consistency with the same method in MachineBasicBlock. Thanks to Anton for suggesting this. llvm-svn: 69615	2009-04-20 18:36:57 +00:00
Bob Wilson	840cf4fa18	Revise my previous change 68996 as suggested by Duncan. llvm-svn: 69607	2009-04-20 17:27:09 +00:00
Evan Cheng	e6a6c3a70c	- Remove an arbitrary spill weight tweak that should not have been there. - Find more reloads from SS. llvm-svn: 69606	2009-04-20 17:23:48 +00:00
Evan Cheng	c248188b46	Added a linearscan register allocation optimization. When the register allocator spills an interval with multiple uses in the same basic block, it creates a different virtual register for each of the reloads. e.g. %reg1498<def> = MOV32rm %reg1024, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg1024, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg1024, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] => %reg1498<def> = MOV32rm %reg2036, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg2037, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg2038, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] From linearscan's point of view, each of reg2036, 2037, and 2038 is a separate register, and each is "killed" after a single use. The reloaded register is available and it's often clobbered right away. e.g. In this case reg1498 is allocated EAX while reg2036 is allocated RAX. This means we end up with multiple reloads from the same stack slot in the same basic block. Now linearscan recognizes there are other reloads from the same SS in the same BB. So it'll "downgrade" RAX (and its aliases) after reg2036 is allocated until the next reload (reg2037) is done. This greatly increases the likelihood that reloads from SS are reused. This speeds up sha1 from OpenSSL by 5.8%. It is also an across the board win for SPEC2000 and 2006. llvm-svn: 69585	2009-04-20 08:01:12 +00:00
Duncan Sands	870c1f4240	Now that BUILD_VECTOR operands are allowed to be bigger than the vector element type, turn checking of the operand type back on again, appropriately adjusted. llvm-svn: 69516	2009-04-19 06:40:30 +00:00
Chris Lattner	9b80af1a86	Fix PR3898, which manifests as failures on Xcore, patch by Jakob Stoklund Olesen! llvm-svn: 69472	2009-04-18 20:48:07 +00:00
Duncan Sands	d2ba02aa87	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Evan Cheng	1b6d7dc766	Add a new LiveInterval::overlaps(). It checks if the live interval overlaps a range specified by [Start, End). llvm-svn: 69434	2009-04-18 08:52:15 +00:00
Dale Johannesen	f957aa4768	Inline asm's were still introducing bogus dependencies; my earlier patch to this code only fixed half of it. llvm-svn: 69408	2009-04-18 00:09:40 +00:00
Evan Cheng	2d5be54315	Teach spiller to unfold instructions which modref spill slot when a scratch register is available and when it's profitable. e.g. xorq %r12<kill>, %r13 addq %rax, -184(%rbp) addq %r13, -184(%rbp) ==> xorq %r12<kill>, %r13 movq -184(%rbp), %r12 addq %rax, %r12 addq %r13, %r12 movq %r12, -184(%rbp) Two more instructions, but fewer memory accesses. It can also open up opportunities for more optimizations. llvm-svn: 69341	2009-04-17 01:29:40 +00:00
Dan Gohman	397a4671f0	In the list-burr's pseudo two-addr dependency heuristics, don't add dependencies on nodes with exactly one successor which is a COPY_TO_REGCLASS node. In the case that the copy is coalesced away, the dependence should be on the user of the copy, rather than the copy itself. llvm-svn: 69309	2009-04-16 20:59:02 +00:00
Dan Gohman	2165510df6	Handle SUBREG_TO_REG instructions with the same heuristics as INSERT_SUBREG instructions in the list-burr scheduler. llvm-svn: 69308	2009-04-16 20:57:10 +00:00
Devang Patel	d324ddbd7b	Do not treat beginning of inlined scope as beginning of normal function scope if the location info is missing. Instead of doing ... if (inlined_subroutine && known_location) DW_TAG_inlined_subroutine else DW_TAG_subprogram do if (inlined_subroutine) { if (known_location) DW_TAG_inlined_subroutine } else { DW_TAG_subprogram } llvm-svn: 69300	2009-04-16 17:55:30 +00:00
Devang Patel	6052d698f5	Record line number at the beginning of a func.start. This line was accidentally lost yesterday. llvm-svn: 69286	2009-04-16 15:07:09 +00:00
Devang Patel	87488c2a88	In -fast mode do what FastISel does. This code could use some refactoring help! llvm-svn: 69254	2009-04-16 02:33:41 +00:00
Devang Patel	d9a9d5bdbb	If FastISel is run and it has known DebugLoc then use it. llvm-svn: 69253	2009-04-16 01:33:10 +00:00
Devang Patel	c6c6759194	If the location where the function was inlined is not known then do not emit debug info describing the inlined region. llvm-svn: 69252	2009-04-16 01:31:54 +00:00
Devang Patel	4a18de4193	s/RootDbgScope/FunctionDbgScope/g llvm-svn: 69216	2009-04-15 20:41:31 +00:00
Devang Patel	21463f4a25	Add DISubprogram is not null check. This fixes test/CodeGen//2009-01-21-invalid-debug-info.m test case. llvm-svn: 69210	2009-04-15 20:11:08 +00:00
Dan Gohman	5555c4538b	Generalize one of the SelectionDAG::ReplaceAllUsesWith overloads to support replacing a node with another that has a superset of the result types. Use this instead of calling ReplaceAllUsesOfValueWith for each value. llvm-svn: 69209	2009-04-15 20:06:30 +00:00
Devang Patel	d9d88a827d	Check isInlinedSubroutine() before creating DW_TAG_inlined_subroutine. llvm-svn: 69202	2009-04-15 19:42:57 +00:00
Dan Gohman	420019a18a	Fix MachineInstr::getNumExplicitOperands to count variadic operands correctly. Patch by Jakob Stoklund Olesen! llvm-svn: 69190	2009-04-15 17:59:11 +00:00
Dan Gohman	e1e53b379b	Move MachineRegisterInfo::setRegClass out of line. llvm-svn: 69126	2009-04-15 01:19:35 +00:00
Dan Gohman	a885b48029	Move MachineJumpTableInfo::ReplaceMBBInJumpTables out of line. llvm-svn: 69125	2009-04-15 01:18:49 +00:00
Dan Gohman	c9b68844d6	Give RemoveRegOperandFromRegInfo a comment and move the code out of line. llvm-svn: 69124	2009-04-15 01:17:37 +00:00
Devang Patel	f2b9c22687	Construct and emit DW_TAG_inlined_subroutine DIEs for inlined subroutine scopes (only in FastISel mode). llvm-svn: 69116	2009-04-15 00:10:26 +00:00
Dan Gohman	3c19cf07d9	When the result of an EXTRACT_SUBREG, INSERT_SUBREG, or SUBREG_TO_REG operator is used by a CopyToReg to export the value to a different block, don't reuse the CopyToReg's register for the subreg operation result if the register isn't precisely the right class for the subreg operation. Also, rename the h-registers.ll test, now that there are more than one. llvm-svn: 69087	2009-04-14 22:17:14 +00:00
Dale Johannesen	b423c8f205	Do not force asm's to be chained if they don't touch memory and aren't volatile. This was interfering with good scheduling. llvm-svn: 69008	2009-04-14 00:56:56 +00:00
Evan Cheng	9f44d3148c	Fix PR3934 part 2. findOnlyInterestingUse() was not setting IsCopy and IsDstPhys which are returned by value and used by callee. This happened to work on the earlier test cases because of a logic error in the caller side. llvm-svn: 69006	2009-04-14 00:32:25 +00:00
Daniel Dunbar	e2a54c13b3	Make these errors more noticeable in build logs. llvm-svn: 68998	2009-04-13 22:26:09 +00:00
Bob Wilson	355508e70f	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger than the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Dan Gohman	8393d29bc8	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Bob Wilson	58529085d2	Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. llvm-svn: 68981	2009-04-13 20:20:30 +00:00
Evan Cheng	fa48d5c8d0	PR3934: Fix a bogus two-address pass assertion. llvm-svn: 68979	2009-04-13 20:04:24 +00:00
Devang Patel	92d79ef835	Right now, Debugging information to encode scopes (DW_TAG_lexical_block) relies on DBG_LABEL. Unfortunately this interferes with the quality of optimized code. This patch updates dwarf writer to encode scoping information in DWARF only in FastISel mode. llvm-svn: 68973	2009-04-13 18:13:16 +00:00
Devang Patel	ad7f61c279	Reapply 68847. Now debug_inlined section is covered by TAI->doesDwarfUsesInlineInfoSection(), which is false by default. llvm-svn: 68964	2009-04-13 17:02:03 +00:00
Dan Gohman	3873cb7a36	Add a new TargetInstrInfo MachineInstr opcode, COPY_TO_SUBCLASS. This will be used to replace things like X86's MOV32to32_. Enhance ScheduleDAGSDNodesEmit to be more flexible and robust in the presence of subregister superclasses and subclasses. It can now cope with the definition of a virtual register being in a subclass of a use. Re-introduce the code for recording register superreg classes and subreg classes. This is needed because when subreg extracts and inserts get coalesced away, the virtual registers are left in the correct subclass. llvm-svn: 68961	2009-04-13 15:38:05 +00:00
Dan Gohman	1070168ae0	Don't abort on an aliasing physical register that does not have a live interval. This is needed for some upcoming subreg changes. llvm-svn: 68956	2009-04-13 15:22:29 +00:00
Dan Gohman	29d211684c	When assigning a physical register to a MachineOperand, set the subreg field to 0, since the subreg field is only used for virtual register subregs. This doesn't change current functionality; it just eliminates bogus noise from debug output. llvm-svn: 68955	2009-04-13 15:21:32 +00:00
Dan Gohman	15fa207d12	Add an assertion to verify that a copy was actually emitted. llvm-svn: 68953	2009-04-13 15:16:56 +00:00
Chris Lattner	c1bfdc9bb2	Add a new "available_externally" linkage type. This is intended to support C99 inline, GNU extern inline, etc. Related bugzilla's include PR3517, PR3100, & PR2933. Nothing uses this yet, but it appears to work. llvm-svn: 68940	2009-04-13 05:44:34 +00:00
Chris Lattner	d1b365fc66	make UpdateValueMap handle the possibility that we could be copying into the right register, avoiding a copy. llvm-svn: 68889	2009-04-12 07:46:30 +00:00
Chris Lattner	ed355b8551	optimize FastISel::UpdateValueMap to avoid duplicate map lookups, and make it return the assigned register. llvm-svn: 68888	2009-04-12 07:45:01 +00:00
Dan Gohman	ac11c8d30f	Revert r68847. It breaks the build on non-Darwin targets, with this message from the assembler: Error: unknown pseudo-op: `.debug_inlined' llvm-svn: 68863	2009-04-11 15:57:04 +00:00
Devang Patel	6f907173e0	Keep track of inlined functions and their locations. This information is collected when nested llvm.dbg.func.start intrinsics are seen. (Right now, inliner removes nested llvm.dbg.func.start intrinsics during inlining.) Create debug_inlined dwarf section using this information. This info is used by gdb, at least on Darwin, to enable better experience debugging inlined functions. See DwarfWriter.cpp for more information on structure of debug_inlined section. llvm-svn: 68847	2009-04-11 00:16:47 +00:00
Devang Patel	92ff43635b	DebugLabelFolder ruthlessly deletes redundant labels. However, sometimes a redundant label is referenced by debug info somewhere else. This patch provides a way so that dwarf writer can mark labels as used. llvm-svn: 68813	2009-04-10 18:58:59 +00:00
Bob Wilson	3b4991e76d	Clean up a bunch of whitespace issues and fix a comment typo. No functional changes. llvm-svn: 68808	2009-04-10 18:48:47 +00:00
Chris Lattner	0577b8e2ef	fix two problems with machine sinking: 1. Sinking would crash when the first instruction of a block was sunk due to iterator problems. 2. Instructions could be sunk to their current block, causing an infinite loop. This fixes PR3968 llvm-svn: 68787	2009-04-10 16:38:36 +00:00
Dan Gohman	b85fd685f8	Now that register classes have names, include the name in debug output. llvm-svn: 68786	2009-04-10 15:59:38 +00:00
Bill Wendling	5d9538852f	Pass in the std::string parameter instead of returning it by value. llvm-svn: 68747	2009-04-10 00:12:49 +00:00
Bill Wendling	ab0a8487ca	Constify getter methods. llvm-svn: 68745	2009-04-10 00:00:25 +00:00
Dan Gohman	8121b3f88d	Remove the obsolete SelectionDAG::getNodeValueTypes and simplify code that uses it by using SelectionDAG::getVTList instead. llvm-svn: 68744	2009-04-09 23:54:40 +00:00
Bill Wendling	62c4c8bc44	StringMap<DIE>::iterator::first() returns a pointer to the first character of the key. This will cause it to create a new std::string, which isn't wanted. Instead, pass back the "const char". Modify the EmitString() method to take a "const char*". llvm-svn: 68741	2009-04-09 23:51:31 +00:00
Devang Patel	500dc5157e	Silence unused variable warning. llvm-svn: 68735	2009-04-09 23:45:17 +00:00
Chris Lattner	4c5310e25e	ignore register zero in isRegTiedToUseOperand, following the example of isRegTiedToDefOperand. Thanks to Bob for pointing this out! llvm-svn: 68734	2009-04-09 23:33:34 +00:00
Bill Wendling	e1dd8c815b	Use a StringMap instead of std::map for storing std::string->DIE* maps. This gives a micro speedup in the Dwarf writer. llvm-svn: 68728	2009-04-09 21:49:15 +00:00
Devang Patel	78fd57741b	llvm.dbg.func_start also defines beginning of function scope. llvm-svn: 68727	2009-04-09 21:42:11 +00:00
Bob Wilson	c53238dff1	Fix pr3954. The register scavenger asserts for inline assembly with register destinations that are tied to source operands. The TargetInstrDescr::findTiedToSrcOperand method silently fails for inline assembly. The existing MachineInstr::isRegReDefinedByTwoAddr was very close to doing what is needed, so this revision makes a few changes to that method and also renames it to isRegTiedToUseOperand (for consistency with the very similar isRegTiedToDefOperand and because it handles both two-address instructions and inline assembly with tied registers). llvm-svn: 68714	2009-04-09 17:16:43 +00:00
Chris Lattner	301c4f39a0	reg0 references are not real registers. This fixes a crash on the attached testcase. llvm-svn: 68712	2009-04-09 16:50:43 +00:00
Dan Gohman	68de98eef3	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it finds users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Dan Gohman	6479d0cbb7	Don't copy the operand of a SwitchInst into virtual registers as eagerly. This helps avoid CopyToReg nodes in some cases where they aren't needed, and also helps subsequent optimizer heuristics in cases where the extra nodes would cause the node to appear to have multiple results. This doesn't have a significant impact currently; it'll help an upcoming change. llvm-svn: 68667	2009-04-09 02:33:36 +00:00
Devang Patel	187a4c385a	If subprogram type is not tagged as DW_TAG_subroutine_type then use it directly as a return value type. llvm-svn: 68647	2009-04-08 22:18:45 +00:00
Duncan Sands	d0e186d90f	Soft float support for FREM. llvm-svn: 68614	2009-04-08 16:20:57 +00:00
Duncan Sands	ee34b0d05d	Soft float support for undef. Reported by Xerxes Rånby. llvm-svn: 68607	2009-04-08 13:33:37 +00:00
Chris Lattner	80cbd85a29	change printStringChar to emit characters as unsigned char instead of char, avoiding sign extension for the top octet. For "negative" chars, we'd print stuff like: .asciz "\702... now we print: .asciz "\302... llvm-svn: 68577	2009-04-08 00:28:38 +00:00
Dan Gohman	c9ce27d6b7	Implement support for modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Devang Patel	74fc1ab23d	Revert prev. patch for now. llvm-svn: 68569	2009-04-07 23:00:04 +00:00
Devang Patel	6fce3ad779	Right now DBG_LABEL are required for llvm.dbg.region_start and llvm.dbg.region_end in non-fast mode also. llvm-svn: 68559	2009-04-07 22:27:56 +00:00
Dan Gohman	e98c3b1ea1	Don't attempt to handle aggregate argument values in FastISel; let SelectionDAG do those. This fixes PR3955. llvm-svn: 68546	2009-04-07 20:40:11 +00:00
Dan Gohman	ea48adc739	Fix a TargetLowering optimization so that it doesn't duplicate loads when an input node has multiple uses. llvm-svn: 68398	2009-04-03 20:11:30 +00:00
Dan Gohman	f431de416e	Delete ISD::INSERT_SUBREG and ISD::EXTRACT_SUBREG, which are unused. Note that these are distinct from TargetInstrInfo::INSERT_SUBREG and TargetInstrInfo::EXTRACT_SUBREG, which are used. llvm-svn: 68355	2009-04-03 00:25:26 +00:00
Sanjiv Gupta	01fb90085c	To convert the StopPoint insn into an assembler directive by ISel, we need to have access to the line number field. So we convert that info as an operand by custom handling DBG_STOPPOINT in legalize. llvm-svn: 68329	2009-04-02 18:03:10 +00:00
Evan Cheng	0674a512bf	Fully general expansion of integer shift of any size. llvm-svn: 68134	2009-03-31 19:39:24 +00:00
Dan Gohman	86e4d0130c	Reapply 68073, with fixes. EH Landing-pad basic blocks are not entered via fall-through. Don't miss fallthroughs from blocks terminated by conditional branches. Also, move isOnlyReachableByFallthrough out of line. llvm-svn: 68129	2009-03-31 18:39:13 +00:00
Dan Gohman	5f64e10d8d	Minor top-level comment fix. llvm-svn: 68113	2009-03-31 16:51:18 +00:00
Bill Wendling	1c40c8c242	Oy! When reverting r68073, I added in experimental code. Sorry... llvm-svn: 68099	2009-03-31 08:41:31 +00:00
Owen Anderson	59cff6919d	Remove the "fast" cases for spill and restore point determination, as these were subtly wrong in obscure cases. Patch the testcase to account for this change. llvm-svn: 68093	2009-03-31 08:27:09 +00:00
Bill Wendling	4706abded2	Revert r68073. It's causing a failure in the Apple-style builds. llvm-svn: 68092	2009-03-31 08:26:26 +00:00
Dan Gohman	cba99ee717	Fix live-out reg logic to not insert over-aggressive AssertZExt instructions. This fixes lua. llvm-svn: 68083	2009-03-31 01:38:29 +00:00
Evan Cheng	d7824e208a	Turn a 2-address instruction into a 3-address one when it's profitable even if the two-address operand is killed. e.g. %reg1024<def> = MOV r1 %reg1025<def> = ADD %reg1024, %reg1026 r0 = MOV %reg1025 If it's not possible / profitable to commute ADD, then turning ADD into a LEA saves a copy. llvm-svn: 68065	2009-03-30 21:34:07 +00:00
Bill Wendling	76042faa52	Balance out quote in debug output. llvm-svn: 68059	2009-03-30 20:32:22 +00:00
Bill Wendling	3b2cea6ef5	Fix grammar-o in comment. llvm-svn: 68057	2009-03-30 20:30:02 +00:00
Dan Gohman	abcfb30fc2	Constify arguments in isSuccessor and isLayoutSuccessor. llvm-svn: 68054	2009-03-30 20:06:29 +00:00
Duncan Sands	602234cdf3	Fix PR3899: add support for extracting floats from vectors when using -soft-float. Based on a patch by Jakob Stoklund Olesen. llvm-svn: 67996	2009-03-29 13:51:06 +00:00
Arnold Schwaighofer	76188bc8a1	Make check in CheckTailCallReturnConstraints for ignorable instructions between a CALL and a RET node more generic. Add a test for tail calls with a void return. llvm-svn: 67943	2009-03-28 12:36:29 +00:00
Arnold Schwaighofer	636127325b	Enable tail call optimization for functions that return a struct (bug 3664) and for functions that return types that need extending (e.g i1). llvm-svn: 67934	2009-03-28 08:33:27 +00:00
Evan Cheng	a15fdaa292	Optimize some 64-bit multiplication by constants into two lea's or one lea + shl since imulq is slow (latency 5). e.g. x * 40 => shlq $3, %rdi leaq (%rdi,%rdi,4), %rax This has the added benefit of allowing more multiply to be folded into addressing mode. e.g. a * 24 + b => leaq (%rdi,%rdi,2), %rax leaq (%rsi,%rax,8), %rax llvm-svn: 67917	2009-03-28 05:57:29 +00:00
Dan Gohman	b360b0390a	Fix what surely must be a copy+pasto. llvm-svn: 67881	2009-03-27 23:55:04 +00:00
Dan Gohman	401df03e2d	Initialize LiveOutInfo's APInt members to zero, as APInt's default constructor produces an uninitialized APInt. This fixes PR3896. llvm-svn: 67879	2009-03-27 23:51:02 +00:00
John Mosby	67ffa789e8	Shrink wrapping in PEI: initial release. Finishing development, enable with --shrink-wrap. llvm-svn: 67828	2009-03-27 06:09:40 +00:00
Owen Anderson	2312267684	Don't assign a new stack slot if the pre-alloc splitter already assigned one. llvm-svn: 67764	2009-03-26 18:53:38 +00:00
Bill Wendling	f4247ff478	Pull transform from target-dependent code into target-independent code. llvm-svn: 67742	2009-03-26 06:14:09 +00:00
Evan Cheng	7e4217176a	Revert 67132. This is breaking some objective-c apps. Also fixes SDISel so it does not force promote return value if the function is not marked signext / zeroext. llvm-svn: 67701	2009-03-25 20:20:11 +00:00
Dale Johannesen	9fb1dcd077	When optimizing with debug info, don't keep the stoppoint nodes around until Legalize; doing this imposed an ordering on a sequence of loads that came from different lines, interfering with scheduling. llvm-svn: 67692	2009-03-25 17:36:08 +00:00
Evan Cheng	3a7489a4cc	CodeGen still defaults to non-verbose asm, but llc now overrides it and defaults to verbose. llvm-svn: 67668	2009-03-25 01:47:28 +00:00
Devang Patel	225831fc9e	Do not ignore DW_TAG_class_type! llvm-svn: 67661	2009-03-25 00:28:40 +00:00
Evan Cheng	758c95f07a	Fix PR3845: Avoid stale MachineInstruction pointer reference. llvm-svn: 67649	2009-03-24 20:33:17 +00:00
Chris Lattner	135eeefe66	more tidying: name the components of PhysReg in the case when the target constraint specifies a specific physreg. llvm-svn: 67618	2009-03-24 15:27:37 +00:00
Chris Lattner	8793e812ef	Tidy a bit more. llvm-svn: 67617	2009-03-24 15:25:07 +00:00
Chris Lattner	a60dd19c3e	simplify this code a bit now that "allocation to a vreg class" can never fail. llvm-svn: 67616	2009-03-24 15:22:11 +00:00
Dan Gohman	547cfc882e	Minor compile-time optimization; don't bother checking canClobberPhysRegDefs if the successor node doesn't clobber any physical registers. llvm-svn: 67587	2009-03-24 00:50:07 +00:00
Dan Gohman	e6d7478bc1	Add a pre-pass to the burr-list scheduler which makes adjustments to help out the register pressure reduction heuristics in the case of nodes with multiple uses. Currently this uses very conservative heuristics, so it doesn't have a broad impact, but in cases where it does help it can make a big difference. llvm-svn: 67586	2009-03-24 00:49:12 +00:00
Evan Cheng	b3196f1298	Do not emit comments unless -asm-verbose. llvm-svn: 67580	2009-03-24 00:17:40 +00:00
Evan Cheng	702a8b4399	Fix a bug in spill weight computation. If the alias is a super-register, and the super-register is in the register class we are trying to allocate, then add the weight to all sub-registers of the super-register even if they are not aliases. e.g. allocating for GR32, bh is not used, updating bl spill weight. bl should get the same spill weight otherwise it will be chosen as a spill candidate since spilling bh doesn't make ebx available. This fixes PR2866. llvm-svn: 67574	2009-03-23 22:57:19 +00:00
Dale Johannesen	34123aba43	Fix internal representation of fp80 to be the same as a normal i80 {low64, high16} rather than its own {high64, low16}. A depressing number of places know about this; I think I got them all. Bitcode readers and writers convert back to the old form to avoid breaking compatibility. llvm-svn: 67562	2009-03-23 21:16:53 +00:00
Dan Gohman	60c652de57	When unfolding a load during scheduling, the new operator node has a data dependency on the load node, so it really needs a data-dependence edge to the load node, even if the load previously existed. And add a few comments. llvm-svn: 67554	2009-03-23 20:20:43 +00:00
Evan Cheng	7e4a6972d6	Fix PR3391 and PR3864. Reg allocator infinite looping. llvm-svn: 67544	2009-03-23 18:24:37 +00:00
Dan Gohman	a6842708e6	Don't set SUnit::hasPhysRegDefs to true unless the defs actually have uses, which reflects the way it's used. llvm-svn: 67540	2009-03-23 17:39:36 +00:00
Dan Gohman	b6b24c5fc1	Fix canClobberPhysRegDefs to check all SDNodes grouped together in an SUnit, instead of just the first one. This fix is needed by some upcoming scheduler changes. llvm-svn: 67531	2009-03-23 16:23:01 +00:00
Dan Gohman	78a1698ac0	Add a new bit to SUnit to record whether a node has implicit physreg defs, regardless of whether they are actually used. llvm-svn: 67528	2009-03-23 16:10:52 +00:00
Dan Gohman	18daca0895	Now that errs() is properly non-buffered, there's no need to explicitly flush it. llvm-svn: 67526	2009-03-23 15:57:19 +00:00
Evan Cheng	2ec94dd447	Model inline asm constraint which ties an input to an output register as machine operand TIED_TO constraint. This eliminates the need to pre-allocate registers for these. This also allows the register allocator to eliminate the unneeded copies. llvm-svn: 67512	2009-03-23 08:01:15 +00:00
Evan Cheng	4b11d96b62	Do not fold away subreg_to_reg if the source register has a sub-register index. That means the source register is taking a sub-register of a larger register. e.g. On x86 %RAX<def> = ... %RAX<def> = SUBREG_TO_REG 0, %EAX:3<kill>, 3 The first def is defining RAX, not EAX so the top bits were not zero-extended. llvm-svn: 67511	2009-03-23 07:19:58 +00:00
Dan Gohman	60cfa194d4	Simplify this code; use a while instead of an if and a do-while. llvm-svn: 67400	2009-03-20 20:42:23 +00:00
Evan Cheng	0c629db2aa	For an inline asm output operand that matches an input, encode the input operand index in the high bits. llvm-svn: 67387	2009-03-20 18:03:34 +00:00
Sanjiv Gupta	15d6146126	Fixed build warnings for unused variables. llvm-svn: 67372	2009-03-20 13:49:20 +00:00
Sanjiv Gupta	5957bb99ba	Fixed the comment. No functionality change. llvm-svn: 67370	2009-03-20 09:38:50 +00:00
Chris Lattner	b18e8617be	Apply the patch requested in PR3846. llvm-svn: 67364	2009-03-20 05:08:24 +00:00
Sebastian Redl	e4e5b1c2f2	Fix the Win32 VS2008 build: - Make type declarations match the struct/class keyword of the definition. - Move AddSignalHandler into the namespace where it belongs. - Correctly call functions from template base. - Some other small changes. With this patch, LLVM and Clang should build properly and with far less noise under VS2008. llvm-svn: 67347	2009-03-19 23:26:52 +00:00
Evan Cheng	f47c144bff	Added MachineInstr::isRegTiedToDefOperand to check for two-addressness. llvm-svn: 67335	2009-03-19 20:30:06 +00:00
Chris Lattner	3181caa065	Fix PEI to not walk off the start of a block when an updated instruction is the first in its block. This is PR3842. llvm-svn: 67304	2009-03-19 17:15:43 +00:00
Mon P Wang	3d7fb6738a	Added missing support for widening when splitting an unary op (PR3683) and expanding a bit convert (PR3711). In both cases, we extract the valid part of the widen vector and then do the conversion. llvm-svn: 67175	2009-03-18 06:24:04 +00:00
Rafael Espindola	6a6d9e48dd	Don't force promotion of return arguments on the callee. Some architectures (like x86) don't require it. This fixes bug 3779. llvm-svn: 67132	2009-03-17 23:43:59 +00:00
Chris Lattner	e3c442050d	Fix codegen to compute the size of an allocation by multiplying the size by the array amount as an i32 value instead of promoting from i32 to i64 then doing the multiply. Not doing this broke wrap-around assumptions that the optimizers (validly) made. The ultimate real fix for this is to introduce i64 version of alloca and remove mallocinst. This fixes PR3829 llvm-svn: 67093	2009-03-17 19:36:00 +00:00
Sanjiv Gupta	390dd214db	r66870 missed this out. llvm-svn: 67082	2009-03-17 15:46:15 +00:00
Duncan Sands	34e7f207ee	Reapply r67049, with the test adjusted for darwin (which produces "call L_f$stub" rather than "call f"). llvm-svn: 67079	2009-03-17 09:46:22 +00:00
Mon P Wang	7184e30bdb	Fix a problem with DAGCombine where we were building an illegal build vector shuffle mask. Forced the mask to be built using i32. Note: this will be irrelevant once vector_shuffle no longer takes a build vector for the shuffle mask. llvm-svn: 67076	2009-03-17 06:33:10 +00:00
Evan Cheng	0dccb325d1	Spiller may unfold load / mod / store instructions as an optimization when the would be loaded value is available in a register. It needs to check if it's legal to clobber the register. Also, the register can contain values of multiple spill slots, make sure to check all instead of just the one being unfolded. llvm-svn: 67068	2009-03-17 01:23:09 +00:00
Bill Wendling	51bfef84e1	--- Reverse-merging (from foreign repository) r67049 into '.': U test/CodeGen/X86/2009-03-13-PHIElimBug.ll D test/CodeGen/X86/2009-03-16-PHIElimInLPad.ll U lib/CodeGen/PHIElimination.cpp r67049 was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/X86/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/X86/2009-03-13-PHIElimBug.ll for PR3784 Failed with exit(1) at line 1 while running: llvm-as < /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/X86/2009-03-13-PHIElimBug.ll \| llc -march=x86 \| /usr/bin/grep -A 2 {call f} \| /usr/bin/grep movl child process exited abnormally llvm-svn: 67051	2009-03-16 20:27:20 +00:00
Duncan Sands	8572084279	Tweak the fix for PR3784: be less sensitive about just how invokes are set up. The fix could be disturbed by register copies coming after the EH_LABEL, and also didn't behave quite right when it was the invoke result that was used in a phi node. Also (see new testcase) fix another phi elimination bug while there: register copies in the landing pad need to come after the EH_LABEL, because that's where execution branches to when unwinding. If they come before the EH_LABEL then they will never be executed... Also tweak the original testcase so it doesn't use a no-longer existing counter. The accumulated phi elimination changes fix two of seven Ada testsuite failures that turned up after landing pad critical edge splitting was turned off. So there's probably more to come. llvm-svn: 67049	2009-03-16 19:58:38 +00:00
Owen Anderson	2648d3fad8	Give the pre-alloc splitter access to the VirtRegMap. It doesn't do anything useful with it at the moment, but it will in the future. llvm-svn: 67012	2009-03-14 21:40:05 +00:00
Daniel Dunbar	2cdac55ad0	Add newlines at end of file (this can annoy gcov) llvm-svn: 67000	2009-03-14 01:53:05 +00:00
Mon P Wang	1cd6172342	Avoid doing the transformation c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4 if FPConstant is legal because if the FPConstant doesn't need to be stored in a constant pool, the transformation is unlikely to be profitable. llvm-svn: 66994	2009-03-14 00:25:19 +00:00
Dan Gohman	fa0a3504ba	Improve FastISel's handling of truncates to i1, and implement ptrtoint and inttoptr in X86FastISel. These casts aren't always handled in the generic FastISel code because X86 sometimes needs custom code to do truncation and zero-extension. llvm-svn: 66988	2009-03-13 23:53:06 +00:00
Evan Cheng	cda58e565f	Fix PR3784: If the source of a phi comes from a bb ended with an invoke, make sure the copy is inserted before the try range (unless it's used as an input to the invoke, then insert it after the last use), not at the end of the bb. Also re-apply r66140 which was disabled as a workaround. llvm-svn: 66976	2009-03-13 22:59:14 +00:00
Dan Gohman	790659c0d6	Fix FastISel's assumption that i1 values are always zero-extended by inserting explicit zero extensions where necessary. Included is a testcase where SelectionDAG produces a virtual register holding an i1 value which FastISel previously mistakenly assumed to be zero-extended. llvm-svn: 66941	2009-03-13 20:42:20 +00:00
Evan Cheng	f9951d1557	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Owen Anderson	dd7e4f8c43	Convert VirtRegMap to a MachineFunctionPass. llvm-svn: 66870	2009-03-13 05:55:11 +00:00
Bill Wendling	5499163a0a	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	02a239b837	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Dan Gohman	71245ea439	Fix a typo in a comment. llvm-svn: 66843	2009-03-12 23:55:10 +00:00
Owen Anderson	4060191b0e	Reorganize some #include's. llvm-svn: 66780	2009-03-12 06:58:19 +00:00
Chris Lattner	26a971c4ec	Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" related transformations out of target-specific dag combine into the ARM backend. These were added by Evan in r37685 with no testcases and only seems to help ARM (e.g. test/CodeGen/ARM/select_xform.ll). Add some simple X86-specific (for now) DAG combines that turn things like cond ? 8 : 0 -> (zext(cond) << 3). This happens frequently with the recently added cp constant select optimization, but is a very general xform. For example, we now compile the second example in const-select.ll to: _test: movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 seta %al movzbl %al, %eax movl 4(%esp), %ecx movsbl (%ecx,%eax,4), %eax ret instead of: _test: movl 4(%esp), %eax leal 4(%eax), %ecx movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 cmovbe %eax, %ecx movsbl (%ecx), %eax ret This passes multisource and dejagnu. llvm-svn: 66779	2009-03-12 06:52:53 +00:00
Evan Cheng	47778669f5	Enable Chris' value propagation change. It makes available known sign, zero, one bits information for values that are live out of basic blocks. The goal is to eliminate unnecessary sext, zext, truncate of values that are live-in to blocks. This does not handle PHI nodes yet. llvm-svn: 66777	2009-03-12 06:29:49 +00:00
Gabor Greif	04dfd9fef9	update llvm-svn: 66733	2009-03-11 22:52:25 +00:00
Owen Anderson	fb5980b6ab	Reorganization: Move the Spiller out of VirtRegMap.cpp into its own files. No (intended) functionality change. llvm-svn: 66720	2009-03-11 22:31:21 +00:00
Evan Cheng	042e05cb31	My last coalescer fix introduced a subtler bug: it aborted a commuting optimization too late and left the live intervals out of sync with the instructions. This fixes 8b10b. llvm-svn: 66715	2009-03-11 22:18:44 +00:00
Duncan Sands	b27c523449	It makes no sense to have an ODR version of common linkage, so remove it. llvm-svn: 66690	2009-03-11 20:14:15 +00:00
Duncan Sands	87527ab284	Add parentheses to pacify gcc-4.3. llvm-svn: 66653	2009-03-11 09:04:34 +00:00
Chris Lattner	a0c4a99ec2	reapply my previous patch (r66358) with a tweak to set the alignment of the generated constant pool entry to the desired alignment of a type. If we don't do this, we end up trying to do movsd from 4-byte alignment memory. This fixes 450.soplex and 456.hmmer. llvm-svn: 66641	2009-03-11 05:08:08 +00:00
Bill Wendling	e22e99addb	Put the assignment back at the top of this method. llvm-svn: 66611	2009-03-11 00:03:50 +00:00
Evan Cheng	264173da40	Two coalescer fixes in one. 1. Use the same value# to represent unknown values being merged into sub-registers. 2. When the coalescer commutes an instruction and the destination is a physical register, update its sub-registers by merging in the extended ranges. llvm-svn: 66610	2009-03-11 00:03:21 +00:00
Bill Wendling	5f1261bf32	Make ivars private. Other cleanup. No functionality change. llvm-svn: 66607	2009-03-10 23:57:09 +00:00
Bill Wendling	45ffb6a251	Just make the Dwarf timer group static inside of the getter function. No need to alloc/dealloc. llvm-svn: 66591	2009-03-10 22:58:53 +00:00
Bill Wendling	3ebcbacf4e	Don't put static functions in anonymous namespace. llvm-svn: 66589	2009-03-10 22:36:31 +00:00
Bill Wendling	6a7265aeaa	These should stop the timer, not start it again. llvm-svn: 66586	2009-03-10 22:02:13 +00:00
Bill Wendling	1c78143120	- Fix misspelled method name. - Remove unused method. llvm-svn: 66585	2009-03-10 21:59:25 +00:00
Bill Wendling	30a55f9386	- Create GetOrCreateSourceID from getOrCreateSourceID. GetOrCreateSourceID is the untimed version of getOrCreateSourceID. getOrCreateSourceID calls GetOrCreateSourceID, of course. - Move some methods into the "private" section. Constify at least one method. - General clean-ups. llvm-svn: 66582	2009-03-10 21:47:45 +00:00
Bill Wendling	c0ffcaacf8	Refine the Dwarf writer timers so that they measure exception writing and debug writing individually. llvm-svn: 66577	2009-03-10 21:23:25 +00:00
Evan Cheng	535dbf4ffd	Revert 66358 for now. It's breaking povray, 450.soplex, and 456.hmmer on x86 / Darwin. llvm-svn: 66574	2009-03-10 20:47:18 +00:00
Bill Wendling	4609f8d7c1	Add a timer to the DwarfWriter pass that measures the total time it takes to emit exception and debug Dwarf info. llvm-svn: 66571	2009-03-10 20:41:52 +00:00
Dan Gohman	9c937ebae6	Fix a post-RA scheduling liveness bug. When a basic block is being scheduled in multiple regions, liveness data used by the anti-dependence breaker is carried from one region to the next, however the information reflects the state of the instructions before scheduling. After scheduling, there may be new live range overlaps. Handle this by pessimizing the liveness data carried between regions to the point where it will be conservatively correct no matter how the earlier region is scheduled. This fixes a miscompilation in 176.gcc with the post-RA scheduler enabled. llvm-svn: 66558	2009-03-10 18:10:43 +00:00
Chris Lattner	03060f6d50	wire up support for emitting "special" values from inline asm format strings with the standard ${:foo} syntax. llvm-svn: 66527	2009-03-10 05:37:13 +00:00
Chris Lattner	93662edfb8	Fix PR3763 by using proper APInt methods instead of uint64_t's. llvm-svn: 66434	2009-03-09 20:22:18 +00:00
Evan Cheng	8b47b553f9	Yet another case where the spiller marked two uses of the same register on the same instruction as kill. This fixes PR3706. llvm-svn: 66428	2009-03-09 19:00:05 +00:00
Chris Lattner	7917d8fd89	just remove the use_empty() check entirely, the only reason it existed was for llvm-gcc 3.4 (which used the __main hack) which is really really long dead. llvm-svn: 66417	2009-03-09 08:18:48 +00:00
Chris Lattner	6b52b88467	Make the code generator rip off dead constant expr uses before deciding whether a global is dead or not. This should fix PR3749 - linker adds spurious use to appending globals. I can't reasonably add a testcase for this, because the bc writer/reader strip dead constant users. llvm-svn: 66404	2009-03-09 05:52:15 +00:00
Bill Wendling	13fcab1ef3	Pass in a std::string when getting the names of debugging things. This cuts down on the number of times a std::string is created and copied. llvm-svn: 66396	2009-03-09 05:04:40 +00:00
Evan Cheng	483ece4d2e	If a MI uses the same register more than once, only mark one of them as 'kill'. llvm-svn: 66363	2009-03-08 03:58:35 +00:00
Chris Lattner	578b634f56	implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. For 2009-03-07-FPConstSelect.ll we now produce: _f: xorl %eax, %eax testl %edi, %edi movl $4, %ecx cmovne %rax, %rcx leaq LCPI1_0(%rip), %rax movss (%rcx,%rax), %xmm0 ret previously we produced: _f: subl $4, %esp cmpl $0, 8(%esp) movss LCPI1_0, %xmm0 je LBB1_2 ## entry LBB1_1: ## entry movss LCPI1_1, %xmm0 LBB1_2: ## entry movss %xmm0, (%esp) flds (%esp) addl $4, %esp ret on PPC the code also improves to: _f: cntlzw r2, r3 srwi r2, r2, 5 li r3, lo16(LCPI1_0) slwi r2, r2, 2 addis r3, r3, ha16(LCPI1_0) lfsx f1, r3, r2 blr from: _f: li r2, lo16(LCPI1_1) cmplwi cr0, r3, 0 addis r2, r2, ha16(LCPI1_1) beq cr0, LBB1_2 ; entry LBB1_1: ; entry li r2, lo16(LCPI1_0) addis r2, r2, ha16(LCPI1_0) LBB1_2: ; entry lfs f1, 0(r2) blr This also improves the existing pic-cpool case from: foo: subl $12, %esp call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax cmpl $0, 16(%esp) movsd .LCPI1_0@GOTOFF(%eax), %xmm0 je .LBB1_2 # entry .LBB1_1: # entry movsd .LCPI1_1@GOTOFF(%eax), %xmm0 .LBB1_2: # entry movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret to: foo: call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax xorl %ecx, %ecx cmpl $0, 4(%esp) movl $8, %edx cmovne %ecx, %edx fldl .LCPI1_0@GOTOFF(%eax,%edx) ret This triggers a few dozen times in spec FP 2000. llvm-svn: 66358	2009-03-08 01:51:30 +00:00
Chris Lattner	f89a9c9d74	random cleanups. llvm-svn: 66357	2009-03-08 01:47:41 +00:00
Duncan Sands	5ab54d488f	Introduce new linkage types linkonce_odr, weak_odr, common_odr and extern_weak_odr. These are the same as the non-odr versions, except that they indicate that the global will only be overridden by an equivalent global. In C, a function with weak linkage can be overridden by a function which behaves completely differently. This means that IP passes have to skip weak functions, since any deductions made from the function definition might be wrong, since the definition could be replaced by something completely different at link time. This is not allowed in C++, thanks to the ODR (One-Definition-Rule): if a function is replaced by another at link-time, then the new function must be the same as the original function. If a language knows that a function or other global can only be overridden by an equivalent global, it can give it the weak_odr linkage type, and the optimizers will understand that it is alright to make deductions based on the function body. The code generators on the other hand map weak and weak_odr linkage to the same thing. llvm-svn: 66339	2009-03-07 15:45:40 +00:00
Dan Gohman	42049dafd7	Fix ScheduleDAGRRList::CopyAndMoveSuccessors' handling of nodes with multiple chain operands. This can occur when the scheduler has added chain operands to a node that already has a chain operand, in order to handle physical register dependencies. This fixes an llvm-gcc bootstrap failure on x86-64 introduced in r66058. llvm-svn: 66240	2009-03-06 02:23:01 +00:00
Bill Wendling	5ddde1a24c	When we split a basic block, there's a default branch to the newly created BB. Delete this default branch, because we're going to generate our own. llvm-svn: 66234	2009-03-06 01:41:15 +00:00
Owen Anderson	39e5f053c4	(Hopefully) silence a warning. llvm-svn: 66158	2009-03-05 08:23:20 +00:00
Owen Anderson	744e33c12b	Be more careful about choosing restore points when doing restore folding. This fixes some subtle miscompilations. llvm-svn: 66147	2009-03-05 07:19:18 +00:00
Evan Cheng	90d25a92c2	Fix how livein live intervals are handled. Previously it could end at MBB start. Sorry, no small test case possible. llvm-svn: 66129	2009-03-05 03:34:26 +00:00
Bob Wilson	b75225556d	Fix BuildVectorSDNode::isConstantSplat to handle one-element vectors. It is an error to call APInt::zext with a size that is equal to the value's current size, so use zextOrTrunc instead. llvm-svn: 66039	2009-03-04 17:47:01 +00:00
Owen Anderson	fb7f64ea0d	Add a restore folder, which shaves a dozen or so machineinstrs off oggenc. Update a testcase to check this. llvm-svn: 66029	2009-03-04 08:52:31 +00:00
Eli Friedman	1bed40b86a	PR3686: make the legalizer handle bitcast from i80 to x86 long double. llvm-svn: 66021	2009-03-04 06:23:34 +00:00
Evan Cheng	db402a7a49	Fix PR3701. 1. X86 target renamed eflags register to flags. This matches what llvm-gcc generates so codegen knows flags register is being clobbered by inline asm. 2. BURR scheduler should also check if inline asm nodes can clobber "live" physical registers. Previously it was only checking target nodes with implicit defs. llvm-svn: 65996	2009-03-04 01:41:49 +00:00
Bill Wendling	93eeea0493	The DAG combiner was performing a BT combine. The BT combine had a value of -1, so it changed it into a 31 via the TLO.ShrinkDemandedConstant() call. Then it would go through the DAG combiner again. This time it had a value of 31, which was turned into a -1 by TLI.SimplifyDemandedBits(). This would ping pong forever. Teach the TLO.ShrinkDemandedConstant() call not to lower a value if the demanded value is an XOR of all ones. llvm-svn: 65985	2009-03-04 00:18:06 +00:00
Bob Wilson	5b276b6bc9	Generalize BuildVectorSDNode::isConstantSplat to use APInts and handle arbitrary vector sizes. Add an optional MinSplatBits parameter to specify a minimum for the splat element size. Update the PPC target to use the revised interface. llvm-svn: 65899	2009-03-02 23:24:16 +00:00
Nate Begeman	6b41d33726	Fix a problem with DAGCombine on 64b targets where folding extracts + build_vector into a shuffle would fail, because the type of the new build_vector would not be legal. Try harder to create a legal build_vector type. Note: this will be totally irrelevant once vector_shuffle no longer takes a build_vector for shuffle mask. New: _foo: xorps %xmm0, %xmm0 xorps %xmm1, %xmm1 subps %xmm1, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 Old: _foo: xorps %xmm0, %xmm0 movss %xmm0, %xmm1 xorps %xmm2, %xmm2 unpcklps %xmm1, %xmm2 pshufd $80, %xmm1, %xmm1 unpcklps %xmm1, %xmm2 pslldq $16, %xmm2 pshufd $57, %xmm2, %xmm1 subps %xmm0, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 llvm-svn: 65791	2009-03-01 23:44:07 +00:00
Evan Cheng	276f9b02c5	Minor optimization: Look for situations like this: %reg1024<def> = MOV r1 %reg1025<def> = MOV r0 %reg1026<def> = ADD %reg1024, %reg1025 r0 = MOV %reg1026 Commute the ADD to hopefully eliminate an otherwise unavoidable copy. llvm-svn: 65752	2009-03-01 02:03:43 +00:00
Bob Wilson	7482f84ae6	Combine PPC's GetConstantBuildVectorBits and isConstantSplat functions to a new method in a BuildVectorSDNode "pseudo-class". llvm-svn: 65747	2009-03-01 01:13:55 +00:00
Evan Cheng	9c3ce7905e	Last commit accidentally deleted this code. llvm-svn: 65679	2009-02-28 06:02:14 +00:00
Devang Patel	c908eb0dba	It is possible that a subprogram definition encodes only the return value directly, instead of a DIArray of all argument types. llvm-svn: 65643	2009-02-27 18:05:21 +00:00
Rafael Espindola	880e63bf01	Refactor TLS code and add some tests. The tests and expected results are: pic \| declaration \| linkage \| visibility \| !pic \| declaration \| external \| default \| tls1.ll tls2.ll \| local exec pic \| declaration \| external \| default \| tls1-pic.ll tls2-pic.ll \| general dynamic !pic \| !declaration \| external \| default \| tls3.ll tls4.ll \| initial exec pic \| !declaration \| external \| default \| tls3-pic.ll tls4-pic.ll \| general dynamic !pic \| declaration \| external \| hidden \| tls7.ll tls8.ll \| local exec pic \| declaration \| external \| hidden \| X \| local dynamic !pic \| !declaration \| external \| hidden \| tls9.ll tls10.ll \| local exec pic \| !declaration \| external \| hidden \| X \| local dynamic !pic \| declaration \| internal \| default \| tls5.ll tls6.ll \| local exec pic \| declaration \| internal \| default \| X \| local dynamic The ones marked with an X have not been implemented since local dynamic is not implemented. llvm-svn: 65632	2009-02-27 13:37:18 +00:00
Evan Cheng	0ca6e3dba5	MachineLICM CSE should match destination register classes; avoid hoisting implicit_def's. llvm-svn: 65592	2009-02-27 00:02:22 +00:00
Owen Anderson	32d31bc5cc	Enable stack slot coloring DCE. Evan's spiller fixes were needed before this could happen. llvm-svn: 65501	2009-02-26 04:47:57 +00:00
Evan Cheng	86fc9440db	The last commit was overly conservative. It's ok to reuse value that's already marked livein. llvm-svn: 65498	2009-02-26 03:02:21 +00:00
Evan Cheng	435dbd4665	If an available register falls through to a succ block, unset the last kill. Sorry, it's impossible to reduce a sensible test case. It basically requires the moon and stars to align in order to cause a failure. llvm-svn: 65497	2009-02-26 02:30:42 +00:00
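
Two of the commits listed above (r66779 and r66358) describe branch-free rewrites of a select between constants: `cond ? 8 : 0` becomes `zext(cond) << 3`, and `c ? 1.0 : 2.0` becomes a load from the two-entry table `{ 2.0, 1.0 }` indexed by `c`. The following is a minimal source-level sketch of those two identities in plain C++. It is not LLVM code, and the function and variable names are hypothetical, chosen only for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: "cond ? 8 : 0" expressed as zext(cond) << 3.
// Converting bool to an unsigned integer yields 0 or 1 (the zero-extension),
// and shifting left by 3 scales that to 0 or 8.
std::uint32_t selectEightOrZero(bool cond) {
  return static_cast<std::uint32_t>(cond) << 3;
}

// Hypothetical table mirroring the "{ 2.0, 1.0 }" constant pool entry:
// index 0 holds the value for a false condition, index 1 for a true one.
constexpr float kSelectTable[2] = {2.0f, 1.0f};

// Hypothetical helper: "c ? 1.0f : 2.0f" expressed as an indexed load.
float selectOneOrTwo(bool c) {
  return kSelectTable[c ? 1 : 0];
}

// Compares each rewritten form against the original ternary expression
// for both possible condition values.
void checkSelectRewrites() {
  for (int i = 0; i < 2; ++i) {
    const bool c = (i != 0);
    assert(selectEightOrZero(c) == (c ? 8u : 0u));
    assert(selectOneOrTwo(c) == (c ? 1.0f : 2.0f));
  }
}
```

Calling `checkSelectRewrites()` exercises both condition values. In the commit messages the same idea is applied during instruction selection, where the index is additionally scaled by the element size (the `c*4` for a 4-byte float).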

7195 Commits