llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-27 22:12:47 +01:00

Author	SHA1	Message	Date
Bob Wilson	dc396388cb	Add a command line option "-arm-strict-align" to disallow unaligned memory accesses for ARM targets that would otherwise allow it. Radar 8465431. llvm-svn: 114941	2010-09-28 04:09:35 +00:00
Jakob Stoklund Olesen	cfed90fe40	Revert "Disable codegen prepare critical edge splitting. Machine instruction passes now" This reverts revision 114633. It was breaking llvm-gcc-i386-linux-selfhost. It seems there is a downstream bug that is exposed by -cgp-critical-edge-splitting=0. When that bug is fixed, this patch can go back in. Note that the changes to tailcallfp2.ll are not reverted. They were good and are required. llvm-svn: 114859	2010-09-27 18:43:48 +00:00
Jakob Stoklund Olesen	51362ba70e	Explicitly disable CGP critical edge splitting for this test so it won't break by reenabling it temporarily. llvm-svn: 114858	2010-09-27 18:43:43 +00:00
Jakob Stoklund Olesen	d1779cf19c	Don't depend on basic block numbering. llvm-svn: 114857	2010-09-27 18:43:40 +00:00
Chris Lattner	0ebcc18dec	the latest assembler that runs on powerpc 10.4 machines doesn't support aligned comm. Detect when compiling for 10.4 and don't emit an alignment for comm. This will hopefully fix PR8198. llvm-svn: 114817	2010-09-27 06:44:54 +00:00
Che-Liang Chiou	aeccb0793b	Add test case for PTX ret instruction llvm-svn: 114789	2010-09-25 07:49:54 +00:00
Che-Liang Chiou	0eaf890a31	Add ret instruction to PTX backend llvm-svn: 114788	2010-09-25 07:46:17 +00:00
Evan Cheng	1d50dccdc5	Enable code placement optimization pass for ARM. llvm-svn: 114746	2010-09-24 19:07:23 +00:00
Bob Wilson	ff8139baa0	Set alignment operand for NEON VST instructions. llvm-svn: 114709	2010-09-23 23:42:37 +00:00
Bob Wilson	026ef4b7f8	Set alignment operand for NEON VLD instructions. llvm-svn: 114696	2010-09-23 21:43:54 +00:00
Evan Cheng	1493b1799e	Disable codegen prepare critical edge splitting. Machine instruction passes now break critical edges on demand. llvm-svn: 114633	2010-09-23 06:55:34 +00:00
Owen Anderson	7d6373ea9d	A select between a constant and zero, when fed by a bit test, can be efficiently lowered using a series of shifts. Fixes <rdar://problem/8285015>. llvm-svn: 114599	2010-09-22 22:58:22 +00:00
Cameron Esfahani	662193bddc	Fix PR8201: Update the code to call via X86::CALL64pcrel32 in the 64-bit case. llvm-svn: 114597	2010-09-22 22:35:21 +00:00
Chris Lattner	1864d6728d	Fix an inconsistency in the x86 backend that led it to reject "calll foo" on x86-32: 32-bit calls were named "call" not "calll". 64-bit calls were correctly named "callq", so this only impacted x86-32. This fixes rdar://8456370 - llvm-mc rejects 'calll' This also exposes that mingw/64 is generating a 32-bit call instead of a 64-bit call, I will file a bugzilla. llvm-svn: 114534	2010-09-22 05:49:14 +00:00
Chris Lattner	26d11d7501	reimplement elf TLS support in terms of addressing modes, eliminating SegmentBaseAddress. llvm-svn: 114529	2010-09-22 04:39:11 +00:00
Chris Lattner	d42791ad4a	linux has a different stack alignment than the mac, relax this a bit. llvm-svn: 114519	2010-09-22 00:46:26 +00:00
Chris Lattner	e52da86fab	give VZEXT_LOAD a memory operand, it now works with segment registers. llvm-svn: 114515	2010-09-22 00:34:38 +00:00
Chris Lattner	706b9206da	revert r114386 now that address modes work correctly, we get a nice call through gs-relative memory now. llvm-svn: 114510	2010-09-22 00:11:31 +00:00
Chris Lattner	f9861312cb	give LCMPXCHG_DAG[8] a memory operand, allowing it to work with addrspace 256/257 llvm-svn: 114508	2010-09-21 23:59:42 +00:00
Chris Lattner	08b4ce2b31	filecheckize llvm-svn: 114507	2010-09-21 23:57:27 +00:00
Evan Cheng	1d58965067	OptimizeCompareInstr should avoid iterating past the beginning of the MBB when the 'and' instruction is after the comparison. llvm-svn: 114506	2010-09-21 23:49:07 +00:00
Owen Anderson	d9fd152c3a	Enable target-specific mul-lowering on ARM, even at -Os. Remove a test that this makes irrelevant, but add a new test for the new, improved functionality. llvm-svn: 114494	2010-09-21 22:51:46 +00:00
Devang Patel	53b709a85c	Use FileCheck llvm-svn: 114475	2010-09-21 20:50:32 +00:00
Owen Anderson	97a8fdc19c	When adding the carry bit to another value on X86, exploit the fact that the carry-materialization (sbbl x, x) sets the registers to 0 or ~0. Combined with two's complement arithmetic, we can fold the intermediate AND and the ADD into a single SUB. This fixes <rdar://problem/8449754>. llvm-svn: 114460	2010-09-21 18:41:19 +00:00
Chris Lattner	ecdba24738	fix rdar://8453210, a crash handling a call through a GS relative load. For now, just disable folding the load into the call. llvm-svn: 114386	2010-09-21 03:37:00 +00:00
Evan Cheng	1ce02d180e	Enable machine sinking critical edge splitting. e.g. define double @foo(double %x, double %y, i1 %c) nounwind { %a = fdiv double %x, 3.2 %z = select i1 %c, double %a, double %y ret double %z } Was: _foo: divsd LCPI0_0(%rip), %xmm0 testb $1, %dil jne LBB0_2 movaps %xmm1, %xmm0 LBB0_2: ret Now: _foo: testb $1, %dil je LBB0_2 divsd LCPI0_0(%rip), %xmm0 ret LBB0_2: movaps %xmm1, %xmm0 ret This avoids the divsd when early exit is taken. rdar://8454886 llvm-svn: 114372	2010-09-20 22:52:00 +00:00
Owen Anderson	b8811b9ed9	CombinerAA is now reordering these stores. llvm-svn: 114354	2010-09-20 20:56:29 +00:00
Owen Anderson	fc94b337eb	When TCO is turned on, it is possible to end up with aliasing FrameIndex's. Therefore, CombinerAA cannot assume that different FrameIndex's never alias, but can instead use MachineFrameInfo to get the actual offsets of these slots and check for actual aliasing. This fixes CodeGen/X86/2010-02-19-TailCallRetAddrBug.ll and CodeGen/X86/tailcallstack64.ll when CombinerAA is enabled, modulo a different register allocation sequence. llvm-svn: 114348	2010-09-20 20:39:59 +00:00
Jim Grosbach	cf90f8beb1	Simplify ARM callee-saved register handling by removing the distinction between the high and low registers for prologue/epilogue code. This was a Darwin-only thing that wasn't providing a realistic benefit anymore. Combining the save areas simplifies the compiler code and results in better ARM/Thumb2 codegen. For example, previously we would generate code like: push {r4, r5, r6, r7, lr} add r7, sp, #12 stmdb sp!, {r8, r10, r11} With this change, we combine the register saves and generate: push {r4, r5, r6, r7, r8, r10, r11, lr} add r7, sp, #12 rdar://8445635 llvm-svn: 114340	2010-09-20 19:32:20 +00:00
NAKAMURA Takumi	a8d8b5f3c3	test/CodeGen/X86: Add explicit triple -mtriple=i686-linux to 3 tests incompatible with Win32 codegen. r114297 raises 3 failures. They might also fail on mingw. llvm-svn: 114317	2010-09-19 21:58:55 +00:00
Eric Christopher	2901b19344	Add the exit instruction to the PTX target. Patch by Che-Liang Chiou <clchiou@gmail.com>! llvm-svn: 114294	2010-09-18 18:52:28 +00:00
Owen Anderson	015641f659	Invert the logic of reachesChainWithoutSideEffects(). What we want to check is that there is NO path to the destination containing side effects, not that SOME path contains no side effects. In practice, this only manifests with CombinerAA enabled, because otherwise the chain has little to no branching, so "any" is effectively equivalent to "all". llvm-svn: 114268	2010-09-18 04:45:14 +00:00
Bob Wilson	670e1915c0	Add target-specific DAG combiner for BUILD_VECTOR and VMOVRRD. An i64 value should be in GPRs when it's going to be used as a scalar, and we use VMOVRRD to make that happen, but if the value is converted back to a vector we need to fold to a simple bit_convert. Radar 8407927. llvm-svn: 114233	2010-09-17 22:59:05 +00:00
Jim Grosbach	8ae5cfffdd	Teach the (non-MC) instruction printer to use the canonical names for push/pop, and shift instructions on ARM. Update the tests to match. llvm-svn: 114230	2010-09-17 22:36:38 +00:00
Evan Cheng	8c2bde65f0	Teach machine sink to 1) Do forward copy propagation. This makes it easier to estimate the cost of the instruction being sunk. 2) Break critical edges on demand, including cases where the value is used by PHI nodes. Critical edge splitting is not yet enabled by default. llvm-svn: 114227	2010-09-17 22:28:18 +00:00
Jim Grosbach	d2cf0742a4	Update tests to handle MC-inst instruction printing of shift operations. The legacy asm printer uses instructions of the form, "mov r0, r0, lsl #3", while the MC-instruction printer uses the form "lsl r0, r0, #3". The latter mnemonic is correct and preferred according to the ARM documentation (A8.6.98). The former are pseudo-instructions for the latter. llvm-svn: 114221	2010-09-17 21:58:46 +00:00
Jim Grosbach	b86aebe2b7	FileCheck-ize llvm-svn: 114218	2010-09-17 21:46:16 +00:00
Jim Grosbach	b21c19d666	Move thumb2 tests to the thumb2 directory llvm-svn: 114206	2010-09-17 20:34:09 +00:00
Jim Grosbach	026811c86f	tweak test to check instructions rather than relying on the comment string llvm-svn: 114204	2010-09-17 20:27:26 +00:00
Dan Gohman	aaed2c137f	Avoid emitting a PIC base register if no PIC addresses are needed. This fixes rdar://8396318. llvm-svn: 114201	2010-09-17 20:24:24 +00:00
Jim Grosbach	fd21b4bb15	tweak test to check instructions rather than relying on the comment string llvm-svn: 114200	2010-09-17 20:21:03 +00:00
Jim Grosbach	5ca55bd98b	tweak test to check instructions rather than relying on the comment string llvm-svn: 114199	2010-09-17 20:17:41 +00:00
Dale Johannesen	cf9dc14249	When substituting sunkaddrs into indirect arguments of an asm, we were walking the asm arguments once and stashing their Values. This is wrong because the same memory location can be in the list twice, and if the first one has a sunkaddr substituted, the stashed value for the second one will be wrong (use-after-free). PR 8154. llvm-svn: 114104	2010-09-16 18:30:55 +00:00
Kalle Raiskila	68e2c15954	Change SPU register re-interpretations from OR to COPY_TO_REGCLASS instruction. This cleans up after the mess r108567 left in the CellSPU backend. ORCvt-instructions were used to reinterpret registers, and the ORs were then removed by isMoveInstr(). This patch now removes 350 instructions of format: or $3, $3, $3 (from the 52 testcases in CodeGen/CellSPU). One case of a nonexistent or is checked for. Some moves of the form 'ori $., $., 0' and 'ai $., $., 0' still remain. llvm-svn: 114074	2010-09-16 12:29:33 +00:00
Bob Wilson	e7e2f983e5	Reapply Gabor's 113839, 113840, and 113876 with a fix for a problem encountered while building llvm-gcc for arm. This is probably the same issue that the ppc buildbot hit. llvm::prior works on a MachineBasicBlock::iterator, not a plain MachineInstr. llvm-svn: 113983	2010-09-15 17:12:08 +00:00
Gabor Greif	f7635897c8	the darwin9-powerpc buildbot keeps consistently crashing, backing out following to get it back to green, so I can investigate in peace: svn merge -c -113840 llvm/test/CodeGen/ARM/arm-and-tst-peephole.ll svn merge -c -113876 -c -113839 llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp llvm-svn: 113980	2010-09-15 16:53:07 +00:00
Gabor Greif	a37b447ba6	forgot the testcase change for r113839 llvm-svn: 113840	2010-09-14 09:30:17 +00:00
Gabor Greif	40a8053a15	test for and-tst peephole optimization documents the status-quo with its opportunities llvm-svn: 113838	2010-09-14 08:50:43 +00:00
Owen Anderson	9c34a7831d	Re-apply r113679, which was reverted in r113720, which added a pair of new instcombine transforms to expose greater opportunities for store narrowing in codegen. This patch fixes a potential infinite loop in instcombine caused by one of the introduced transforms being overly aggressive. llvm-svn: 113763	2010-09-13 17:59:27 +00:00
Eric Christopher	d4aaabfa74	Revert 113679, it was causing an infinite loop in a testcase that I've sent on to Owen. llvm-svn: 113720	2010-09-12 06:09:23 +00:00

3638 Commits