llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 05:52:53 +02:00

Author	SHA1	Message	Date
Lang Hames	11ccc79191	Tighten physical register invariants: Allocatable physical registers can only be live in to a block if it is the function entry point or a landing pad. llvm-svn: 150494	2012-02-14 18:51:53 +00:00
Andrew Trick	c3cc8fa604	RegAlloc superpass: includes phi elimination, coalescing, and scheduling. Creates a configurable regalloc pipeline. Ensure specific llc options do what they say and nothing more: -reglloc=... has no effect other than selecting the allocator pass itself. This patch introduces a new umbrella flag, "-optimize-regalloc", to enable/disable the optimizing regalloc "superpass". This allows for example testing coalscing and scheduling under -O0 or vice-versa. When a CodeGen pass requires the MachineFunction to have a particular property, we need to explicitly define that property so it can be directly queried rather than naming a specific Pass. For example, to check for SSA, use MRI->isSSA, not addRequired<PHIElimination>. CodeGen transformation passes are never "required" as an analysis ProcessImplicitDefs does not require LiveVariables. We have a plan to massively simplify some of the early passes within the regalloc superpass. llvm-svn: 150226	2012-02-10 04:10:36 +00:00
Chad Rosier	b70d1dfae6	[fast-isel] Add support for SUBs with non-legal types. llvm-svn: 150047	2012-02-08 02:45:44 +00:00
Chad Rosier	66b35d7220	Add comment to test case. llvm-svn: 150046	2012-02-08 02:30:12 +00:00
Chad Rosier	1ef78d6989	[fast-isel] Add support for ORs with non-legal types. llvm-svn: 150045	2012-02-08 02:29:21 +00:00
Chad Rosier	26610906f0	[fast-isel] Add support for indirect branches. llvm-svn: 150014	2012-02-07 23:56:08 +00:00
Chad Rosier	945ab43c4f	[fast-isel] Add support for ADDs with non-legal types. llvm-svn: 149934	2012-02-06 23:50:07 +00:00
Chad Rosier	ec3053c33c	[fast-isel] HandlePHINodesInSuccessorBlocks() can promite i8 and i16 types too. llvm-svn: 149730	2012-02-04 00:39:19 +00:00
Chad Rosier	cff3c98417	[fast-isel] Add support for FPToUI. Also add test cases for FPToSI. llvm-svn: 149706	2012-02-03 20:27:51 +00:00
Chad Rosier	40b3e74387	[fast-isel] Add support for selecting UIToFP. llvm-svn: 149704	2012-02-03 19:42:52 +00:00
NAKAMURA Takumi	a7f8fe6300	Move test/CodeGen/Generic/2012-02-01-CoalescerBug.ll to CodeGen/ARM, for now. It requires TARGETS=arm. I cannot reproduce a fixed issue with other targets. llvm-svn: 149604	2012-02-02 11:44:58 +00:00
Lang Hames	5b641086b2	Rewrite instruction operands in AdjustCopiesBackFrom. Fixes PR11861. llvm-svn: 149097	2012-01-27 00:05:42 +00:00
Chad Rosier	2bc98784a8	Replace the use of isPredicable() with isPredicated() in MachineBasicBlock::canFallThrough(). We're interested in the state of the instruction (i.e., is this a barrier or not?), not if the instruction is predicable or not. rdar://10501092 llvm-svn: 149070	2012-01-26 18:24:25 +00:00
Jakob Stoklund Olesen	9e8bb24748	Clear kill flags before propagating a copy. The live range of the source register may be extended when a redundant copy is eliminated. Make sure any kill flags between the two copies are cleared. This fixes PR11765. llvm-svn: 149069	2012-01-26 17:52:15 +00:00
Jakob Stoklund Olesen	519e259532	Improve sub-register def handling in ProcessImplicitDefs. This boils down to using MachineOperand::readsReg() more. This fixes PR11829 where a use ended up after the first def when lowering REG_SEQUENCE instructions involving IMPLICIT_DEFs. llvm-svn: 148996	2012-01-25 23:36:27 +00:00
Anton Korobeynikov	682b2821ce	Properly emit ctors / dtors with priorities into desired sections and let linker handle the rest. This finally fixes PR5329 llvm-svn: 148990	2012-01-25 22:24:19 +00:00
Jakob Stoklund Olesen	fe157f5fc1	Set correct <def,undef> flags when lowering REG_SEQUENCE. A REG_SEQUENCE instruction is lowered into a sequence of partial defs: %vreg7:ssub_0<def,undef> = COPY %vreg20:ssub_0 %vreg7:ssub_1<def> = COPY %vreg2 %vreg7:ssub_2<def> = COPY %vreg2 %vreg7:ssub_3<def> = COPY %vreg2 The first def needs an <undef> flag to indicate it is the beginning of the live range, while the other defs are read-modify-write. Previously, we depended on LiveIntervalAnalysis to notice and fix the missing <def,undef>, but that solution was never robust, it was causing problems with ProcessImplicitDefs and the lowering of chained REG_SEQUENCE instructions. This fixes PR11841. llvm-svn: 148879	2012-01-24 23:28:42 +00:00
Evgeniy Stepanov	a0474f4619	An option to selectively enable part of ARM EHABI support. This change adds an new option --arm-enable-ehabi-descriptors that enables emitting unwinding descriptors. This provides a mode with a working backtrace() without the (currently broken) exception support. llvm-svn: 148800	2012-01-24 13:05:33 +00:00
Chandler Carruth	55876621c9	Revert r148686 (and r148694, a fix to it) due to a serious layering violation -- MC cannot depend on CodeGen. Specifically, the MCTargetDesc component of each target is actually a subcomponent of the MC library. As such, it cannot depend on the target-independent code generator, because MC itself cannot depend on the target-independent code generator. This change moved a flag from the ARM MCTargetDesc file ARMMCAsmInfo.cpp to the CodeGen layer in ARMException.cpp, leaving behind an 'extern' to refer back to it. That layering order isn't viable givin the constraints outlined above. Commandline flags are designed to be static specifically to avoid these types of bugs. Fixing this is likely going to require some non-trivial refactoring. llvm-svn: 148759	2012-01-24 00:30:17 +00:00
Jakob Stoklund Olesen	dd6ae694d6	Fix PR11829. PostRA LICM was too aggressive. This fixes a typo in r148589. llvm-svn: 148724	2012-01-23 21:01:15 +00:00
Evgeniy Stepanov	bffa428d01	An option to selectively enable parts of ARM EHABI support. This change adds an new value to the --arm-enable-ehabi option that disables emitting unwinding descriptors. This mode gives a working backtrace() without the (currently broken) exception support. llvm-svn: 148686	2012-01-23 07:57:39 +00:00
Anton Korobeynikov	76b0745f6c	Add fused multiple+add instructions from VFPv4. Patch by Ana Pazos! llvm-svn: 148658	2012-01-22 12:07:33 +00:00
Bob Wilson	793fa215cb	ARM vector any_extends need to be selected to vmovl. <rdar://problem/10723651> We have patterns for vector sext and zext operations but were missing anyext. Without those patterns, codegen will fail when the selection DAG has any_extend nodes. llvm-svn: 148568	2012-01-20 20:59:56 +00:00
Jim Grosbach	b6e177bd4e	VST2 four-register w/ update pseudos for fixed/register update. rdar://10724489 llvm-svn: 148560	2012-01-20 19:16:00 +00:00
Evgeniy Stepanov	f329e9ee4d	Emit ARM EHABI unwinding instructions for 3 more Thumb instructions. llvm-svn: 148473	2012-01-19 12:53:06 +00:00
NAKAMURA Takumi	4282fcb731	test/CodeGen/ARM/test-sharedidx.ll: Fix for -Asserts. llvm-svn: 148107	2012-01-13 07:03:55 +00:00
Evan Cheng	e31c929c2d	DAGCombine's logic for forming pre- and post- indexed loads / stores were being overly conservative. It was concerned about cases where it would prohibit folding simple [r, c] addressing modes. e.g. ldr r0, [r2] ldr r1, [r2, #4] => ldr r0, [r2], #4 ldr r1, [r2] Change the logic to look for such cases which allows it to form indexed memory ops more aggressively. rdar://10674430 llvm-svn: 148086	2012-01-13 01:37:24 +00:00
Andrew Trick	393dc735f6	ARM Ld/St Optimizer fix. Allow LDRD to be formed from pairs with different LDR encodings. This was the original intention of the pass. Somewhere along the way, the LDR opcodes were refined which broke the optimization. We really don't care what the original opcodes are as long as they both map to the same LDRD and the immediate still fits. Fixes rdar://10435045 ARMLoadStoreOptimization cannot handle mixed LDRi8/LDRi12 llvm-svn: 147922	2012-01-11 03:56:08 +00:00
Jim Grosbach	59537e1ce3	ARM updating VST2 pseudo-lowering fixed vs. register update. rdar://10663487 llvm-svn: 147876	2012-01-10 21:11:12 +00:00
Evan Cheng	7855c5d08f	Allow machine-cse to look across MBB boundary when cse'ing instructions that define physical registers. It's currently very restrictive, only catching cases where the CE is in an immediate (and only) predecessor. But it catches a surprising large number of cases. rdar://10660865 llvm-svn: 147827	2012-01-10 02:02:58 +00:00
Rafael Espindola	7618aa1c64	Don't print an unused label before .cfi_endproc. llvm-svn: 147763	2012-01-09 00:17:29 +00:00
Rafael Espindola	19a13321f8	Don't print a label before .cfi_startproc when we don't need to. This makes the produce assembly when using CFI just a bit more readable. llvm-svn: 147743	2012-01-07 22:42:19 +00:00
Jakob Stoklund Olesen	71f92061aa	Use getRegForValue() to materialize the address of ARM globals. This enables basic local CSE, giving us 20% smaller code for consumer-typeset in -O0 builds. <rdar://problem/10658692> llvm-svn: 147720	2012-01-07 04:07:22 +00:00
Evan Cheng	8af07ba749	Added a late machine instruction copy propagation pass. This catches opportunities that only present themselves after late optimizations such as tail duplication .e.g. ## BB#1: movl %eax, %ecx movl %ecx, %eax ret The register allocator also leaves some of them around (due to false dep between copies from phi-elimination, etc.) This required some changes in codegen passes. Post-ra scheduler and the pseudo-instruction expansion passes have been moved after branch folding and tail merging. They were before branch folding before because it did not always update block livein's. That's fixed now. The pass change makes independently since we want to properly schedule instructions after branch folding / tail duplication. rdar://10428165 rdar://10640363 llvm-svn: 147716	2012-01-07 03:02:36 +00:00
Jakob Stoklund Olesen	536b4e24d8	Use movw+movt in ARMFastISel::ARMMaterializeGV. This eliminates a lot of constant pool entries for -O0 builds of code with many global variable accesses. This speeds up -O0 codegen of consumer-typeset by 2x because the constant island pass no longer has to look at thousands of constant pool entries. <rdar://problem/10629774> llvm-svn: 147712	2012-01-07 01:47:05 +00:00
Jakob Stoklund Olesen	39a3fa2c29	Enable aligned NEON spilling by default. Experiments show this to be a small speedup for modern ARM cores. llvm-svn: 147689	2012-01-06 22:19:37 +00:00
Jakob Stoklund Olesen	23eeb1f7b5	Reapply r146997, "Heed spill slot alignment on ARM." Now that canRealignStack() understands frozen reserved registers, it is safe to use it for aligned spill instructions. It will only return true if the registers reserved at the beginning of register allocation allow for dynamic stack realignment. <rdar://problem/10625436> llvm-svn: 147579	2012-01-05 00:26:57 +00:00
Evan Cheng	4e60b65bc6	Fix more places which should be checking for iOS, not darwin. llvm-svn: 147513	2012-01-04 01:55:04 +00:00
Jakob Stoklund Olesen	993997b659	Revert r146997, "Heed spill slot alignment on ARM." This patch caused a miscompilation of oggenc because a frame pointer was suddenly needed halfway through register allocation. <rdar://problem/10625436> llvm-svn: 147487	2012-01-03 22:34:35 +00:00
Eli Friedman	064187912e	Make sure DAGCombiner doesn't introduce multiple loads from the same memory location. PR10747, part 2. llvm-svn: 147283	2011-12-26 22:49:32 +00:00
Chad Rosier	c2f31859cc	Fix a couple of copy-n-paste bugs. Noticed by George Russell! llvm-svn: 147064	2011-12-21 18:56:22 +00:00
Evan Cheng	fb22f64814	Fix a couple of copy-n-paste bugs. Noticed by George Russell. llvm-svn: 147032	2011-12-21 03:04:10 +00:00
Jakob Stoklund Olesen	2b24e1eac4	Heed spill slot alignment on ARM. Use the spill slot alignment as well as the local variable alignment to determine when the stack needs to be realigned. This works now that the ARM target can always realign the stack by using a base pointer. Still respect the ARMBaseRegisterInfo::canRealignStack() function vetoing a realigned stack. Don't use aligned spill code in that case. llvm-svn: 146997	2011-12-20 22:15:04 +00:00
Evan Cheng	46b085721a	ARM target code clean up. Check for iOS, not Darwin where it makes sense. llvm-svn: 146981	2011-12-20 18:26:50 +00:00
Bob Wilson	8439df9506	Mark ARM eh_sjlj_dispatchsetup as clobbering all registers. Radar 10567930. We used to rely on the *eh_sjlj_setjmp instructions to mark that a function with setjmp/longjmp exception handling clobbers all the registers. But with the recent reorganization of ARM EH, those eh_sjlj_setjmp instructions are expanded away earlier, before PEI can see them to determine what registers to save and restore. Mark the dispatchsetup instruction in the same way, since that instruction cannot be expanded early. This also more accurately reflects when the registers are clobbered. llvm-svn: 146949	2011-12-20 01:29:27 +00:00
Evan Cheng	9362ee62bc	Move tests to FileCheck. llvm-svn: 146923	2011-12-19 23:26:44 +00:00
Devang Patel	0db1ed1a48	Do not sink instruction, if it is not profitable. On ARM, peephole optimization for ABS creates a trivial cfg triangle which tempts machine sink to sink instructions in code which is really straight line code. Sometimes this sinking may alter register allocator input such that use and def of a reg is divided by a branch in between, which may result in extra spills. Now mahine sink avoids sinking if final sink destination is post dominator. Radar 10266272. llvm-svn: 146604	2011-12-14 23:20:38 +00:00
Evan Cheng	68ba5536f3	- Add MachineInstrBundle.h and MachineInstrBundle.cpp. This includes a function to finalize MI bundles (i.e. add BUNDLE instruction and computing register def and use lists of the BUNDLE instruction) and a pass to unpack bundles. - Teach more of MachineBasic and MachineInstr methods to be bundle aware. - Switch Thumb2 IT block to MI bundles and delete the hazard recognizer hack to prevent IT blocks from being broken apart. llvm-svn: 146542	2011-12-14 02:11:42 +00:00
Chad Rosier	33f40b2c25	Add newline at EOF. llvm-svn: 146538	2011-12-14 01:34:39 +00:00
Chad Rosier	8af97606a9	[fast-isel] Unaligned loads of floats are not supported. Therefore, convert to a regular load and then move the result from a GPR to a FPR. llvm-svn: 146502	2011-12-13 19:22:14 +00:00

1 2 3 4 5 ...

1196 Commits