llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Devang Patel	a5de669faa	Remove dead code. llvm-svn: 127923	2011-03-18 23:33:58 +00:00
Devang Patel	bafc4fec54	Consider debug info intrinsics pointing to null value as dead instructions. llvm-svn: 127922	2011-03-18 23:28:02 +00:00
Jim Grosbach	89580295e7	Silence a warning. llvm-svn: 127918	2011-03-18 22:50:49 +00:00
Owen Anderson	16fce7d4af	Add support to the ARM asm parser for the register-shifted-register forms of basic instructions like ADD. More work left to be done to support other instances of shifter ops in the ISA. llvm-svn: 127917	2011-03-18 22:50:18 +00:00
Jim Grosbach	75deb766b9	Beginnings of MC-JIT code generation. Proof-of-concept code that code-gens a module to an in-memory MachO object. This will be hooked up to a run-time dynamic linker library (see: llvm-rtdyld for similarly conceptual work for that part) which will take the compiled object and link it together with the rest of the system, providing back to the JIT a table of available symbols which will be used to respond to the getPointerTo*() queries. llvm-svn: 127916	2011-03-18 22:48:41 +00:00
Evan Cheng	93d04c1c00	Match a few more obvious patterns to revsh. rdar://9147637. llvm-svn: 127913	2011-03-18 21:52:42 +00:00
Jakob Stoklund Olesen	9e6d3b0a42	Extend live debug values down the dominator tree by following copies. The llvm.dbg.value intrinsic refers to SSA values, not virtual registers, so we should be able to extend the range of a value by tracking that value through register copies. This greatly improves the debug value tracking for function arguments that for some reason are copied to a second virtual register at the end of the entry block. We only extend the debug value range where its register is killed. All original llvm.dbg.value locations are still respected. Copies from physical registers are ignored. That should not be a problem since the entry block already adds DBG_VALUE instructions for the virtual registers holding the function arguments. llvm-svn: 127912	2011-03-18 21:42:19 +00:00
Eli Friedman	8d903449c3	Revert r127852; it's apparently causing an ICE on mingw. llvm-svn: 127909	2011-03-18 21:12:29 +00:00
Owen Anderson	25ab3f714f	Clean whitespace. llvm-svn: 127900	2011-03-18 19:47:14 +00:00
Owen Anderson	acda6f77a8	Reduce code duplication. llvm-svn: 127899	2011-03-18 19:46:58 +00:00
Justin Holewinski	d9c382441b	PTX: Fix various codegen issues - Emit mad instead of mad.rn for shader model 1.0 - Emit explicit mov.u32 instructions for reading global variables - (most PTX instructions cannot take global variable immediates) llvm-svn: 127895	2011-03-18 19:24:28 +00:00
Jim Grosbach	b89c5053bf	setExecutable() should default to success if there's nothing custom for it. llvm-svn: 127891	2011-03-18 18:51:03 +00:00
Owen Anderson	c23c6e0c1a	Thumb2 PC-relative loads require a fixup rather than just an immediate. llvm-svn: 127888	2011-03-18 17:42:55 +00:00
Andrew Trick	dd6faad20a	Avoid creating canonical induction variables for non-native types. For example, on 32-bit architecture, don't promote all uses of the IV to 64-bits just because one use is a 64-bit cast. Alternate implementation of the patch by Arnaud de Grandmaison. llvm-svn: 127884	2011-03-18 16:50:32 +00:00
Joerg Sonnenberger	aa8ac259e9	Support explicit argument forms for the X86 string instructions. For now, only the default segments are supported. llvm-svn: 127875	2011-03-18 11:59:40 +00:00
Che-Liang Chiou	2b173c0443	ptx: fix parameter order that is reversed llvm-svn: 127874	2011-03-18 11:23:56 +00:00
Che-Liang Chiou	f4a2c17cf5	ptx: add unconditional and conditional branch llvm-svn: 127873	2011-03-18 11:08:52 +00:00
NAKAMURA Takumi	cdd69c874f	raw_ostream: [PR6745] Tweak formatting (double)%e for Windows hosts. On MSVCRT and compatible, output of %e is incompatible to Posix by default. Number of exponent digits should be at least 2. "%+03d" FIXME: Implement our formatter in future! llvm-svn: 127872	2011-03-18 09:30:10 +00:00
Bill Wendling	8a983f8c3f	Initialize the only-used-with-PPC-double-double parts of the APFloat class. This makes valgrind stop complaining about uninitialized variables being read when it accesses a bitfield (category) that shares its bits with these variables. llvm-svn: 127871	2011-03-18 09:09:44 +00:00
Jakob Stoklund Olesen	dbc283787d	Hoist spills when the same value is known to be in less loopy sibling registers. Stack slot real estate is virtually free compared to registers, so it is advantageous to spill earlier even though the same value is now kept in both a register and a stack slot. Also eliminate redundant spills by extending the stack slot live range underneath reloaded registers. This can trigger a dead code elimination, removing copies and even reloads that were only feeding spills. llvm-svn: 127868	2011-03-18 04:23:06 +00:00
Jakob Stoklund Olesen	2956265983	Accept instructions that read undefined values. This is not supposed to happen, but I have seen the x86 rematter getting confused when rematerializing partial redefs. llvm-svn: 127857	2011-03-18 03:06:04 +00:00
Jakob Stoklund Olesen	08bfed9973	Be more accurate about the slot index reading a register when dealing with defs and early clobbers. Assert when trying to find an undefined value. llvm-svn: 127856	2011-03-18 03:06:02 +00:00
Rafael Espindola	33eddc52d3	Check RequiresNullTerminator first, or we might read from an invalid address. llvm-svn: 127853	2011-03-18 02:55:51 +00:00
Eli Friedman	64a2b7e4f2	Add a target-specific branchless method for double-width relational comparisons on x86. Essentially, the way this works is that SUB+SBB sets the relevant flags the same way a double-width CMP would. This is a substantial improvement over the generic lowering in LLVM. The output is also shorter than the gcc-generated output; I haven't done any detailed benchmarking, though. llvm-svn: 127852	2011-03-18 02:34:11 +00:00
Ted Kremenek	1a2fa05e5f	Augment CrashRecoveryContext to have registered "cleanup" objects that can be used to release resources during a crash. llvm-svn: 127849	2011-03-18 02:05:11 +00:00
Johnny Chen	14f091b6ab	The disassembler for Thumb was wrongly adding 4 to the computed imm32 offset. Remove the offending logic and update the test cases. llvm-svn: 127843	2011-03-18 00:38:03 +00:00
Andrew Trick	a4b86e96b1	Remove TargetData and ValueTracking includes. I didn't mean for them to sneak in my last checkin. llvm-svn: 127842	2011-03-18 00:36:39 +00:00
Owen Anderson	0753b79795	There are two pseudos in this case that are Thumb mode, not one. llvm-svn: 127840	2011-03-17 23:52:05 +00:00
Andrew Trick	07887af00c	Added isValidRewrite() to check the result of ScalarEvolutionExpander. SCEV may generate expressions composed of multiple pointers, which can lead to invalid GEP expansion. Until we can teach SCEV to follow strict pointer rules, make sure no bad GEPs creep into IR. Fixes rdar://problem/9038671. llvm-svn: 127839	2011-03-17 23:51:11 +00:00
Andrew Trick	17df72f60b	whitespace llvm-svn: 127837	2011-03-17 23:46:48 +00:00
Rafael Espindola	3ecf930d14	Use RequiresNullTerminator to create buffers without a null terminator instead of copying. llvm-svn: 127835	2011-03-17 22:18:42 +00:00
Devang Patel	f8c3eb7368	Try to not lose variable's debug info during instcombine. This is done by lowering dbg.declare intrinsic into dbg.value intrinsic. Radar 9143931. llvm-svn: 127834	2011-03-17 22:18:16 +00:00
Johnny Chen	41abb5b0f7	It used to be that t_addrmode_s4 was used for both: o A8.6.195 STR (register) -- Encoding T1 o A8.6.193 STR (immediate, Thumb) -- Encoding T1 It has been changed so that now they use different addressing modes and thus different MC representation (Operand Infos). Modify the disassembler to reflect the change, and add relevant tests. llvm-svn: 127833	2011-03-17 22:04:05 +00:00
Devang Patel	3506f02e33	Refactor into a separate utility function. llvm-svn: 127832	2011-03-17 21:58:19 +00:00
Benjamin Kramer	52ffb6ea96	BuildUDIV: If the divisor is even we can simplify the fixup of the multiplied value by introducing an early shift. This allows us to compile "unsigned foo(unsigned x) { return x/28; }" into shrl $2, %edi imulq $613566757, %rdi, %rax shrq $32, %rax ret instead of movl %edi, %eax imulq $613566757, %rax, %rcx shrq $32, %rcx subl %ecx, %eax shrl %eax addl %ecx, %eax shrl $4, %eax on x86_64 llvm-svn: 127829	2011-03-17 20:39:14 +00:00
Benjamin Kramer	a85996c235	Add an argument to APInt's magic udiv calculation to specify the number of bits that are known zero in the divided number. This will come in handy soon. llvm-svn: 127828	2011-03-17 20:39:06 +00:00
Jakob Stoklund Olesen	047a25b0b0	Dead code elimination may separate the live interval into multiple connected components. I have convinced myself that it can only happen when a phi value dies. When it happens, allocate new virtual registers for the components. llvm-svn: 127827	2011-03-17 20:37:07 +00:00
Richard Osborne	6bad79b514	Add XCore intrinsic for setpsc. llvm-svn: 127821	2011-03-17 18:42:05 +00:00
Daniel Dunbar	0e9d7aeb1f	MC/Mach-O: Fix regression introduced in r126127, this assignment shouldn't have been removed. llvm-svn: 127812	2011-03-17 16:25:24 +00:00
Cameron Zwarich	cea63dc052	Move more logic into getTypeForExtArgOrReturn. llvm-svn: 127809	2011-03-17 14:53:37 +00:00
Cameron Zwarich	a5746339cc	Rename getTypeForExtendedInteger() to getTypeForExtArgOrReturn(). llvm-svn: 127807	2011-03-17 14:21:56 +00:00
Nick Lewycky	50afb5a262	Add comments for the demanglings. Correct mangled form of operator delete! llvm-svn: 127801	2011-03-17 05:20:12 +00:00
Nick Lewycky	9dac4ea71f	Add "swi" which is an obsolete mnemonic for "svc". llvm-svn: 127788	2011-03-17 01:46:14 +00:00
Eli Friedman	dcc256df41	A couple new README entries. llvm-svn: 127786	2011-03-17 01:22:09 +00:00
Joerg Sonnenberger	e37bdf4386	Fix handling of @IDNTPOFF relocations, they need to get STT_TLS. While here, add VK_ARM_TPOFF and VK_ARM_GOTTPOFF, too. llvm-svn: 127780	2011-03-17 00:35:10 +00:00
Jakob Stoklund Olesen	2786187b43	Rewrite instructions as part of ConnectedVNInfoEqClasses::Distribute. llvm-svn: 127779	2011-03-17 00:23:45 +00:00
Jakob Stoklund Olesen	5c0d2aecc5	Add a LiveRangeEdit delegate callback before shrinking a live range. The register allocator needs to adjust its live interval unions when that happens. llvm-svn: 127774	2011-03-16 22:56:16 +00:00
Jakob Stoklund Olesen	8751b4e276	Erase virtual registers that are unused after DCE. llvm-svn: 127773	2011-03-16 22:56:13 +00:00
Jakob Stoklund Olesen	940b7d46d3	Tag cached interference with a user-provided tag instead of the virtual register number. The live range of a virtual register may change which invalidates the cached interference information. llvm-svn: 127772	2011-03-16 22:56:11 +00:00
Jakob Stoklund Olesen	7b60f4161a	Clarify debugging output. llvm-svn: 127771	2011-03-16 22:56:08 +00:00
Cameron Zwarich	2bb1e45ea3	The x86-64 ABI says that a bool is only guaranteed to be sign-extended to a byte rather than an int. Thankfully, this only causes LLVM to miss optimizations, not generate incorrect code. This just fixes the zext at the return. We still insert an i32 ZextAssert when reading a function's arguments, but it is followed by a truncate and another i8 ZextAssert so it is not optimized. llvm-svn: 127766	2011-03-16 22:20:18 +00:00
Cameron Zwarich	860d06739b	Don't recompute something that we already have in a local variable. llvm-svn: 127764	2011-03-16 22:20:07 +00:00
Daniel Dunbar	8757b8c000	Revert r127757, "Patch to a fix dwarf relocation problem on ARM. One-line fix plus the test where it used to break.", which broke Clang self-host of a Debug+Asserts compiler, on OS X. llvm-svn: 127763	2011-03-16 22:16:39 +00:00
Richard Osborne	8b90369d96	Add XCore intrinsics for setclk, setrdy. llvm-svn: 127761	2011-03-16 21:56:00 +00:00
Renato Golin	bf788a5626	Patch to a fix dwarf relocation problem on ARM. One-line fix plus the test where it used to break. llvm-svn: 127757	2011-03-16 21:05:52 +00:00
Richard Osborne	318e25c620	Add checkevent intrinsic to check if any resources owned by the current thread can event. llvm-svn: 127741	2011-03-16 18:34:00 +00:00
Cameron Zwarich	c60b47a7e2	Fix a comment. llvm-svn: 127728	2011-03-16 08:13:42 +00:00
NAKAMURA Takumi	341bf54557	lib/Support/raw_ostream.cpp: On mingw, report_fatal_error() should not be called at dtor context. report_fatal_error() invokes exit(). We know report_fatal_error() might not write messages to stderr when any errors were detected on FD == 2. llvm-svn: 127726	2011-03-16 02:53:39 +00:00
NAKAMURA Takumi	bcbcf099b4	Windows/PathV2.inc: [PR8520] Recognize "NUL" as special (character) file. FIXME: It is a temporal hack. We should detect as many "special file name" as possible. llvm-svn: 127724	2011-03-16 02:53:32 +00:00
NAKAMURA Takumi	042801b7d2	Windows/Path.inc: [PR6270] PathV1::makeUnique(): Give arbitrary initial seed for workaround. FIXME: We should use sys::fs::unique_file() in future. llvm-svn: 127723	2011-03-16 02:53:24 +00:00
Jim Grosbach	c68c99f640	Tidy up. Whitespace and 80 column. llvm-svn: 127721	2011-03-16 01:21:55 +00:00
Devang Patel	f0148aca2e	Do not accidently initialize NumDbgValueLost and NumDbgLineLost counts. llvm-svn: 127720	2011-03-16 00:27:57 +00:00
Cameron Zwarich	7fd94ea393	Only convert allocas to scalars if it is profitable. The profitability metric I chose is having a non-memcpy/memset use and being larger than any native integer type. Originally I chose having an access of a size smaller than the total size of the alloca, but this caused some minor issues on the spirit benchmark where SRoA runs again after some inlining. This fixes <rdar://problem/8613163>. llvm-svn: 127718	2011-03-16 00:13:44 +00:00
Cameron Zwarich	88790f3d4d	Better use initializer lists. llvm-svn: 127716	2011-03-16 00:13:37 +00:00
Cameron Zwarich	f09bb5f2f5	Add a clarifying comment. llvm-svn: 127715	2011-03-16 00:13:35 +00:00
Johnny Chen	e88573849d	There were two issues fixed: 1. The ARM Darwin *r9 call instructions were pseudo-ized recently. Modify the ARMDisassemblerCore.cpp file to accomodate the change. 2. The disassembler was unnecessarily adding 8 to the sign-extended imm24: imm32 = SignExtend(imm24:'00', 32); // A8.6.23 BL, BLX (immediate) // Encoding A1 It has no business doing such. Removed the offending logic. Add test cases to arm-tests.txt. llvm-svn: 127707	2011-03-15 22:27:33 +00:00
John Thompson	da294e31da	Add scei vendor llvm-svn: 127705	2011-03-15 21:51:56 +00:00
Bill Wendling	c12aadb9b6	The VTBL (and VTBX) instructions are rather permissive concerning the masks they accept. If a value in the mask is out of range, it uses the value 0, for VTBL, or leaves the value unchanged, for VTBX. llvm-svn: 127700	2011-03-15 21:15:20 +00:00
Jakob Stoklund Olesen	26ac368165	Trace back through sibling copies to hoist spills and find rematerializable defs. After live range splitting, an original value may be available in multiple registers. Tracing back through the registers containing the same value, find the best place to insert a spill, determine if the value has already been spilled, or discover a reaching def that may be rematerialized. This is only the analysis part. The information is not used for anything yet. llvm-svn: 127698	2011-03-15 21:13:25 +00:00
Jakob Stoklund Olesen	992adc7152	Preserve both isPHIDef and isDefByCopy bits when copying parent values. llvm-svn: 127697	2011-03-15 21:13:22 +00:00
Bill Wendling	388dad6d62	Some minor cleanups based on feedback. llvm-svn: 127694	2011-03-15 20:47:26 +00:00
Jim Grosbach	611d473405	Trailing whitespae. llvm-svn: 127691	2011-03-15 20:25:54 +00:00
Cameron Zwarich	333ed540e7	Clean up something noticed by Fritz. llvm-svn: 127684	2011-03-15 18:42:33 +00:00
Evan Cheng	59ba6777c3	Do not form thumb2 ldrd / strd if the offset is by multiple of 4. rdar://9133587 llvm-svn: 127683	2011-03-15 18:41:52 +00:00
Richard Osborne	601c8a703b	Don't indent cases in a switch, no functionality change. llvm-svn: 127681	2011-03-15 15:55:30 +00:00
Richard Osborne	af1b66c427	On the XCore the scavenging slot should be closest to the SP. llvm-svn: 127680	2011-03-15 15:10:11 +00:00
Richard Osborne	70204c1c29	Add XCore intrinsics for getps, setps, setsr and clrsr. llvm-svn: 127678	2011-03-15 13:45:47 +00:00
Justin Holewinski	8948485aa7	PTX: Set PTX 2.0 as the minimum supported version - Remove PTX 1.4 code generation - Change type of intrinsics to .v4.i32 instead of .v4.i16 - Add and/or/xor integer instructions llvm-svn: 127677	2011-03-15 13:24:15 +00:00
Duncan Sands	e91289191a	Silence compiler warning about case values not being in the enumerated type MCFixupKind. This is the same technique that is used elsewhere in MC. llvm-svn: 127676	2011-03-15 08:54:51 +00:00
Duncan Sands	fc3e4d63e1	Avoid a compiler warning about reg possibly being used uninitialized when building with assertions disabled. llvm-svn: 127675	2011-03-15 08:41:24 +00:00
Cameron Zwarich	7947e73536	Do not add PHIs with no users when creating LCSSA form. Patch by Andrew Clinton. llvm-svn: 127674	2011-03-15 07:41:25 +00:00
Nick Lewycky	e30c07ab2b	Add C++ global operator {new,new[],delete,delete[]}(unsigned {int,long}) to the memory builtins as equivalent to malloc/free. This is different from any attribute we have. For example, you can delete the allocators when their result is unused, but you can't collapse two calls to the same function, even if no global/memory state has changed in between. The noalias return states that the result does not alias any other pointer, but instcombine optimizes malloc() as though the result is non-null for the purpose of eliminating unused pointers. llvm-svn: 127673	2011-03-15 07:31:32 +00:00
Evan Cheng	29faaebae9	Add a peephole optimization to optimize pairs of bitcasts. e.g. v2 = bitcast v1 ... v3 = bitcast v2 ... = v3 => v2 = bitcast v1 ... = v1 if v1 and v3 are of in the same register class. bitcast between i32 and fp (and others) are often not nops since they are in different register classes. These bitcast instructions are often left because they are in different basic blocks and cannot be eliminated by dag combine. rdar://9104514 llvm-svn: 127668	2011-03-15 05:13:13 +00:00
Eli Friedman	c0bfbd0610	PR9450: Make switch optimization in SimplifyCFG not dependent on the ordering of pointers in an std::map. llvm-svn: 127650	2011-03-15 02:23:35 +00:00
Evan Cheng	bac3e87eaa	sext(undef) = 0, because the top bits will all be the same. zext(undef) = 0, because the top bits will be zero. llvm-svn: 127649	2011-03-15 02:22:10 +00:00
Sean Callanan	a38db2eeda	Enabled disassembler support for AVX instructions in the instruction tables and fixed a few bugs that were causing decode conflicts. Rudimentary tests are coming up in the next patch. llvm-svn: 127646	2011-03-15 01:28:15 +00:00
Sean Callanan	5a51ccdc0f	X86 table-generator and disassembler support for the AVX instruction set. This code adds support for the VEX prefix and for the YMM registers accessible on AVX-enabled architectures. Instruction table support that enables AVX instructions for the disassembler is in an upcoming patch. llvm-svn: 127644	2011-03-15 01:23:15 +00:00
Andrew Trick	09d2dcd9ef	Remove getMinusSCEVForExitTest(). This function performed acrobatics to prove no-self-wrap, which we now have for free. llvm-svn: 127643	2011-03-15 01:16:14 +00:00
Johnny Chen	a86399b8e6	Fixed an ARM disassembler bug where it does not handle STRi12 correctly because an extra register operand was erroneously added. Remove an incorrect assert which triggers the bug. rdar://problem/9131529 llvm-svn: 127642	2011-03-15 01:13:17 +00:00
Bill Wendling	af19decfc9	There are some situations which can cause the URoR hack to infinitely recurse and then go kablooie. The problem was that it was tracking the PHI nodes anew each time into this function. But it didn't need to. And because the recursion didn't know that a PHINode was visited before, it would go ahead and call itself. There is a testcase, but unfortunately it's too big to add. This problem will go away with the EH rewrite. <rdar://problem/8856298> llvm-svn: 127640	2011-03-15 01:03:17 +00:00
Andrew Trick	5c8b815e5f	Propagate SCEV no-wrap flags whenever possible. This needs review. llvm-svn: 127638	2011-03-15 00:37:00 +00:00
Jim Grosbach	3de97c6e32	Clean up ARM tail calls a bit. They're pseudo-instructions for normal branches. Also more cleanly separate the ARM vs. Thumb functionality. Previously, the encoding would be incorrect for some Thumb instructions (the indirect calls). llvm-svn: 127637	2011-03-15 00:30:40 +00:00
Eric Christopher	7f724c8079	If we don't know how long a string is we can't fold an _chk version to the normal version. Fixes rdar://9123638 llvm-svn: 127636	2011-03-15 00:25:41 +00:00
Bill Wendling	da1364d669	Generate a VTBL instruction instead of a series of loads and stores when we can. As Nate pointed out, VTBL isn't super performant, but it has to be better than this: _shuf: @ BB#0: @ %entry push {r4, r7, lr} add r7, sp, #4 sub sp, #12 mov r4, sp bic r4, r4, #7 mov sp, r4 mov r2, sp vmov d16, r0, r1 orr r0, r2, #6 orr r3, r2, #7 vst1.8 {d16[0]}, [r3] vst1.8 {d16[5]}, [r0] subs r4, r7, #4 orr r0, r2, #5 vst1.8 {d16[4]}, [r0] orr r0, r2, #4 vst1.8 {d16[4]}, [r0] orr r0, r2, #3 vst1.8 {d16[0]}, [r0] orr r0, r2, #2 vst1.8 {d16[2]}, [r0] orr r0, r2, #1 vst1.8 {d16[1]}, [r0] vst1.8 {d16[3]}, [r2] vldr.64 d16, [sp] vmov r0, r1, d16 mov sp, r4 pop {r4, r7, pc} The "illegal" testcase in vext.ll is no longer illegal. <rdar://problem/9078775> llvm-svn: 127630	2011-03-14 23:02:38 +00:00
Jakob Stoklund Olesen	29a9539e7f	Place context in member variables instead of passing around pointers. Use the opportunity to get rid of the trailing underscore variable names. llvm-svn: 127618	2011-03-14 20:57:14 +00:00
Jakob Stoklund Olesen	da1afc2d80	Rename members to match LLVM naming conventions more closely. Remove the unused reserved_ bit vector, no functional change intended. This doesn't break 'svn blame', this file really is all my fault. llvm-svn: 127607	2011-03-14 19:56:43 +00:00
Jim Grosbach	6ee5aef028	Remove some dead patterns. llvm-svn: 127601	2011-03-14 18:34:35 +00:00
Evan Cheng	50f2d406ec	BIT_CONVERT has been renamed to BITCAST. llvm-svn: 127600	2011-03-14 18:19:52 +00:00
Evan Cheng	cb70b9e80b	Minor optimization. sign-ext/anyext of undef is still undef. llvm-svn: 127598	2011-03-14 18:15:55 +00:00
Evan Cheng	fbb846289a	Indentation. llvm-svn: 127595	2011-03-14 18:02:30 +00:00
Andrew Trick	da253e79f0	Negating a recurrence preserves no-self-wrap. llvm-svn: 127593	2011-03-14 17:38:54 +00:00
Andrew Trick	dab71254b6	HowFarToZero can compute a trip count as long as the recurrence has no-self-wrap. llvm-svn: 127591	2011-03-14 17:28:02 +00:00
Andrew Trick	5d45b563c5	Added SCEV::NoWrapFlags to manage unsigned, signed, and self wrap properties. Added the self-wrap flag for SCEV::AddRecExpr. A slew of temporary FIXMEs indicate the intention of the no-self-wrap flag without changing behavior in this revision. llvm-svn: 127590	2011-03-14 16:50:06 +00:00
Andrew Trick	e0442babf1	whitespace llvm-svn: 127589	2011-03-14 16:48:10 +00:00
Justin Holewinski	a2f7c8557c	PTX: Emit global arrays with proper sizes - Emit all arrays as type .b8 and proper sizes in bytes to conform to the output of nvcc llvm-svn: 127584	2011-03-14 15:40:11 +00:00
Justin Holewinski	995d10cfea	PTX: Add support for sqrt/sin/cos intrinsics llvm-svn: 127578	2011-03-14 14:09:33 +00:00
Che-Liang Chiou	6ff0aa8ab3	ptx: add set.p instruction and related changes to predicate execution llvm-svn: 127577	2011-03-14 11:26:01 +00:00
Jin-Gu Kang	9d52ff5473	This case is solved by Scalar Replacement of Aggregates (DT) and Early CSE pass so this patch reverts it to original source code. llvm-svn: 127574	2011-03-14 01:21:00 +00:00
Che-Liang Chiou	962612fc5c	ptx: add basic support of predicate execution llvm-svn: 127569	2011-03-13 17:26:00 +00:00
Jin-Gu Kang	5000ba8961	Add comment as following: load and store reference same memory location, the memory location is represented by getelementptr with two uses (load and store) and the getelementptr's base is alloca with single use. At this point, instructions from alloca to store can be removed. (this pattern is generated when bitfield is accessed.) For example, %u = alloca %struct.test, align 4 ; [#uses=1] %0 = getelementptr inbounds %struct.test* %u, i32 0, i32 0;[#uses=2] %1 = load i8* %0, align 4 ; [#uses=1] %2 = and i8 %1, -16 ; [#uses=1] %3 = or i8 %2, 5 ; [#uses=1] store i8 %3, i8* %0, align 4 llvm-svn: 127565	2011-03-13 14:05:51 +00:00
Jakob Stoklund Olesen	7d23be25ab	Now that we are deleting unused live intervals during allocation, pointers may be reused. Use the virtual register number as a cache tag instead. They are not reused. llvm-svn: 127561	2011-03-13 01:29:32 +00:00
Jakob Stoklund Olesen	2d87d5139b	Tell the register allocator about new unused virtual registers. This allows the allocator to free any resources used by the virtual register, including physical register assignments. llvm-svn: 127560	2011-03-13 01:23:11 +00:00
Oscar Fuentes	208de1fcc4	Build CompilerDriver library. llvm-svn: 127554	2011-03-12 22:01:42 +00:00
Benjamin Kramer	5986a24bae	Teach ComputeMaskedBits about sub nsw. llvm-svn: 127548	2011-03-12 17:18:11 +00:00
Duncan Sands	0514e10276	Speculatively revert commit 127478 (jsjodin) in an attempt to fix the llvm-gcc-i386-linux-selfhost and llvm-x86_64-linux-checks buildbots. The original log entry: Remove optimization emitting a reference insted of label difference, since it can create more relocations. Removed isBaseAddressKnownZero method, because it is no longer used. llvm-svn: 127540	2011-03-12 13:07:37 +00:00
Jin-Gu Kang	5e537a9449	This patch removes some of useless instructions generated by bitfield access. llvm-svn: 127539	2011-03-12 12:18:44 +00:00
Jakob Stoklund Olesen	6d02ddbbc3	Include snippets in the live stack interval. llvm-svn: 127530	2011-03-12 04:25:36 +00:00
Jakob Stoklund Olesen	1f9f236b8a	Spill multiple registers at once. Live range splitting can create a number of small live ranges containing only a single real use. Spill these small live ranges along with the large range they are connected to with copies. This enables memory operand folding and maximizes the spill to fill distance. Work in progress with known bugs. llvm-svn: 127529	2011-03-12 04:17:20 +00:00
Sean Callanan	4f6e58ff09	Fixed the comparison operator for the enhanced disassembler's disassembler map. llvm-svn: 127527	2011-03-12 03:27:54 +00:00
Jakob Stoklund Olesen	925b25d53d	That's it, I am declaring this a failure of the C++03 STL. There are too many compatibility problems with using mixed types in std::upper_bound, and I don't want to spend 110 lines of boilerplate setting up a call to a 10-line function. Binary search is not /that/ hard to implement correctly. I tried terminating the binary search with a linear search, but that actually made the algorithm slower against my expectation. Most live intervals have less than 4 segments. The early test against endIndex() does pay, and this version is 25% faster than plain std::upper_bound(). llvm-svn: 127522	2011-03-12 01:50:35 +00:00
Eric Christopher	80a45901e0	Sometimes isPredicable lies to us and tells us we don't need the operands. Go ahead and add them on when we might want to use them and let later passes remove them. Fixes rdar://9118569 llvm-svn: 127518	2011-03-12 01:09:29 +00:00
Jim Grosbach	f7531e7697	Add FIXME. llvm-svn: 127516	2011-03-12 00:51:00 +00:00
Jim Grosbach	555d910477	Pseudo-ize the ARM Darwin *r9 call instruction definitions. They're the same actual instruction as the non-Darwin defs, but have different call-clobber semantics and so need separate patterns. They don't need to duplicate the encoding information, however. llvm-svn: 127515	2011-03-12 00:45:26 +00:00
Jim Grosbach	923c731f15	Add a FIXME. llvm-svn: 127511	2011-03-11 23:25:21 +00:00
Jim Grosbach	daffeb06fb	Pseudo-ize the ARM 'B' instruction. llvm-svn: 127510	2011-03-11 23:24:15 +00:00
Jim Grosbach	2226dfbea2	Remove dead code. These ARM instruction definitions no longer exist. llvm-svn: 127509	2011-03-11 23:15:02 +00:00
Jim Grosbach	009af69d6d	Pseudo-ize VMOVDcc and VMOVScc. llvm-svn: 127506	2011-03-11 23:09:50 +00:00
Jim Grosbach	61ff87cd2d	80 columns llvm-svn: 127505	2011-03-11 23:00:16 +00:00
Jim Grosbach	27eaca3e0d	Properly pseudo-ize the ARM LDMIA_RET instruction. This has the nice side- effect that we get proper instruction printing using the "pop" mnemonic for it. llvm-svn: 127502	2011-03-11 22:51:41 +00:00
Cameron Zwarich	bf5c9cd119	Roll r127459 back in: Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127498	2011-03-11 21:52:04 +00:00
Cameron Zwarich	39a49276db	Fix the GCC test suite issue exposed by r127477, which was caused by stack protector insertion not working correctly with unreachable code. Since that revision was rolled out, this test doesn't actual fail before this fix. llvm-svn: 127497	2011-03-11 21:51:56 +00:00
Owen Anderson	78afadfa5d	Teach FastISel to support register-immediate-immediate instructions. llvm-svn: 127496	2011-03-11 21:33:55 +00:00
Jim Grosbach	ee6075cda5	ARM VDUPfd and VDUPfq can just be patterns. The instruction is the same as for VDUP32d and VDUP32q, respectively. llvm-svn: 127489	2011-03-11 20:44:08 +00:00
Jim Grosbach	3329263352	ARM VDUPLNfq and VDUPLNfd definitions can just be Pat<>s for VDUPLN32q and VDUPLN32d, respectively. llvm-svn: 127486	2011-03-11 20:31:17 +00:00
Jim Grosbach	431682981d	ARM VREV64df and VREV64qf can just be patterns. The instruction is the same as for VREV64d32 and VREV64q32, respectively. llvm-svn: 127485	2011-03-11 20:18:05 +00:00
Jim Grosbach	fff6ff502b	This FIXME has been fixed. llvm-svn: 127483	2011-03-11 20:07:37 +00:00
Jim Grosbach	2ecded3a94	Properly pseudo-ize ARM MVNCCi. llvm-svn: 127482	2011-03-11 19:55:55 +00:00
Jan Sjödin	b58b9618ce	Remove optimization emitting a reference insted of label difference, since it can create more relocations. Removed isBaseAddressKnownZero method, because it is no longer used. llvm-svn: 127478	2011-03-11 19:37:02 +00:00
Daniel Dunbar	a02706c889	Revert r127459, "Optimize trivial branches in CodeGenPrepare, which often get created from the", it broke some GCC test suite tests. llvm-svn: 127477	2011-03-11 19:30:30 +00:00
Jim Grosbach	39804c0b44	Fix MOVCCi32imm to be have ARM-mode Requires and a proper size (8 bytes, was 4). llvm-svn: 127469	2011-03-11 18:00:42 +00:00
Andrew Trick	6aa37a4a2b	Replace -dag-chain-limit flag with constant. It has survived a release cycle without being touched, so no longer needs to pollute the hidden-help text. llvm-svn: 127468	2011-03-11 17:46:59 +00:00
Benjamin Kramer	d4ea449e7e	ComputeMaskedBits: sub falls through to add, and sub doesn't have the same overflow semantics as add. Should fix the selfhost failures that started with r127463. llvm-svn: 127465	2011-03-11 14:46:49 +00:00
Benjamin Kramer	666407939f	InstCombine: Fix a thinko where transform an icmp under the assumption that it's a zero comparison when it's not. Fixes PR9454. llvm-svn: 127464	2011-03-11 11:37:40 +00:00
Nick Lewycky	cf0e3e88df	Teach ComputeMaskedBits about nsw on add. I don't think there's anything we can do with nuw here, but sub and mul should be given similar treatment. Fixes PR9343 #15! llvm-svn: 127463	2011-03-11 09:00:19 +00:00
John Wiegley	e1168a569b	Fix use of CompEnd predicate to be standards conforming The existing CompEnd predicate does not define a strict weak order as required by the C++03 standard; therefore, its use as a predicate to std::upper_bound is invalid. For a discussion of this issue, see http://www.open-std.org/jtc1/sc22/wg21/docs/lwg-defects.html#270 This patch replaces the asymmetrical comparison with an iterator adaptor that achieves the same effect while being strictly standard-conforming by ensuring an apples-to-apples comparison. llvm-svn: 127462	2011-03-11 08:54:34 +00:00
Cameron Zwarich	9ed726c151	Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127459	2011-03-11 04:54:27 +00:00
Chris Lattner	2cd24b852f	silence a conditional assignment -Wuninitialized warning. llvm-svn: 127453	2011-03-11 02:12:51 +00:00
Jim Grosbach	ed45ac390c	Properly pseudo-ize ARM MOVCCi and MOVCCi16. llvm-svn: 127442	2011-03-11 01:09:28 +00:00
Eric Christopher	46f43c9cce	Change the x86 32-bit scheduler to register pressure and fix up the corresponding testcases back to the previous versions. Fixes some performance regressions only seen on 32-bit. llvm-svn: 127441	2011-03-11 01:05:58 +00:00
Evan Cheng	d5d2d4a158	Avoid replacing the value of a directly stored load with the stored value if the load is indexed. rdar://9117613. llvm-svn: 127440	2011-03-11 00:48:56 +00:00
Jim Grosbach	1986d9ac8f	Properly pseudo-ize MOVCCr and MOVCCs. llvm-svn: 127434	2011-03-10 23:56:09 +00:00
Dan Gohman	051cc738c3	RecursivelyDeleteTriviallyDeadInstructions only needs a Value, not an Instruction, so casting is not necessary. Also, it's theoretically possible that the Value is not an Instruction, since WeakVH follows RAUWs. llvm-svn: 127427	2011-03-10 20:57:44 +00:00
Rafael Espindola	222ccb2bdd	Don't compute the file size if we don't need to. llvm-svn: 127426	2011-03-10 20:54:07 +00:00
Dan Gohman	bb9c77f5bd	Fix reassociate to postpone certain instruction deletions until after it has finished all of its reassociations, because its habit of unlinking operands and holding them in a datastructure while working means that it's not easy to determine when an instruction is really dead until after all its regular work is done. rdar://9096268. llvm-svn: 127424	2011-03-10 19:51:54 +00:00
Jim Grosbach	5891b1323a	DMB can just be a pat referencing MCR. llvm-svn: 127423	2011-03-10 19:27:17 +00:00
Jim Grosbach	4b74ef6ca9	Reorganize a bit. No functional change, just moving patterns up. llvm-svn: 127422	2011-03-10 19:21:08 +00:00
Jim Grosbach	db549a7f6c	Pseudo-instructions are codegenonly by definition. llvm-svn: 127420	2011-03-10 19:06:39 +00:00
Benjamin Kramer	52a44b9c80	InstCombine: Turn umul_with_overflow into mul nuw if we can prove that it cannot overflow. This happens a lot in clang-compiled C++ code because it adds overflow checks to operator new[]: unsigned foo(unsigned n) { return new unsigned[n]; } We can optimize away the overflow check on 64 bit targets because (uint64_t)n4 cannot overflow. llvm-svn: 127418	2011-03-10 18:40:14 +00:00
Rafael Espindola	a271db1a12	Add r127409 back now that the windows file was updated. llvm-svn: 127417	2011-03-10 18:33:29 +00:00
Rafael Espindola	11ae298b4f	Try to fix the windows build. llvm-svn: 127416	2011-03-10 18:30:48 +00:00
Jakob Stoklund Olesen	891bfab351	Revert r127409 which broke all the Windows bots. llvm-svn: 127413	2011-03-10 18:01:43 +00:00
Justin Holewinski	a26d2f782e	PTX: Add preliminary support for floating-point divide and multiply-and-add llvm-svn: 127410	2011-03-10 16:57:18 +00:00
Rafael Espindola	b7a2d86ef5	Add support for MemoryBuffers that are not null terminated and add support for creating buffers that cover only a part of a file. llvm-svn: 127409	2011-03-10 16:10:30 +00:00
Cameron Zwarich	206503113e	Add an option to disable critical edge splitting in PHIElimination. llvm-svn: 127398	2011-03-10 05:59:17 +00:00
Che-Liang Chiou	fc6c7ba9d5	ptx: add the rest of special registers of ISA version 2.0 llvm-svn: 127397	2011-03-10 04:05:57 +00:00
Jakob Stoklund Olesen	92652a803f	Change the Spiller interface to take a LiveRangeEdit reference. This makes it possible to register delegates and get callbacks when the spiller edits live ranges. llvm-svn: 127389	2011-03-10 01:51:42 +00:00
Jakob Stoklund Olesen	70541686bf	Make SpillIs an optional pointer. Avoid creating a bunch of temporary SmallVectors. llvm-svn: 127388	2011-03-10 01:21:58 +00:00
Francois Pichet	348d41b59b	Unbreak the CMake build. llvm-svn: 127383	2011-03-10 00:51:01 +00:00
Stuart Hastings	fd42046d56	Revert 127359; it broke lencod. llvm-svn: 127382	2011-03-10 00:25:53 +00:00
Devang Patel	73d68195ce	Introduce DebugInfoProbe. This is used to monitor how llvm optimizer is treating debugging information. It generates output that lools like 8 times line number info lost by Scalar Replacement of Aggregates (SSAUp) 1 times line number info lost by Simplify well-known library calls 12 times variable info lost by Jump Threading llvm-svn: 127381	2011-03-10 00:21:25 +00:00
Evan Cheng	a3a7a7e364	Re-commit 127368 and 127371. They are exonerated. llvm-svn: 127380	2011-03-10 00:16:32 +00:00
Evan Cheng	d7a2008a55	Revert 127368 and 127371 for now. llvm-svn: 127376	2011-03-09 23:53:17 +00:00
Evan Cheng	b717770dfe	Change the definition of TargetRegisterInfo::getCrossCopyRegClass to be more flexible. If it returns a register class that's different from the input, then that's the register class used for cross-register class copies. If it returns a register class that's the same as the input, then no cross- register class copies are needed (normal copies would do). If it returns null, then it's not at all possible to copy registers of the specified register class. llvm-svn: 127368	2011-03-09 22:47:38 +00:00
Benjamin Kramer	f1c1220d8f	Fix a pasto that broke all x86_64-elf targets. llvm-svn: 127365	2011-03-09 22:07:13 +00:00
Devang Patel	7a4edc6463	Preserve line number information while simplifying libcalls. llvm-svn: 127362	2011-03-09 21:27:52 +00:00
Stuart Hastings	61f9a3dab2	X86 byval copies no longer always_inline. <rdar://problem/8706628> llvm-svn: 127359	2011-03-09 21:10:30 +00:00
Johnny Chen	6bf5d7a170	LLVM combines the offset mode of A8.6.199 A1 & A2 into STRBT. The insufficient encoding information of the combined instruction confuses the decoder wrt UQADD16. Add extra logic to recover from that. Fixed an assert reported by Sean Callanan llvm-svn: 127354	2011-03-09 20:01:14 +00:00
Eric Christopher	f99de77b19	Make these options hidden to reduce the amount of text -help puts on the command line, they'll still be seen with -help-hidden. llvm-svn: 127353	2011-03-09 19:46:51 +00:00
Devang Patel	c969bd2ff1	These llvm.dbg.* constants are not used anymore. llvm-svn: 127352	2011-03-09 19:41:33 +00:00
Jakob Stoklund Olesen	4d0c9d0af7	Make physreg coalescing independent on the number of uses of the virtual register. The damage done by physreg coalescing only depends on the number of instructions the extended physreg live range covers. This fixes PR9438. The heuristic is still luck-based, and physreg coalescing really should be disabled completely. We need a register allocator with better hinting support before that is possible. Convert a test to FileCheck and force spilling by inserting an extra call. The previous spilling behavior was dependent on misguided physreg coalescing decisions. llvm-svn: 127351	2011-03-09 19:27:06 +00:00
Bruno Cardoso Lopes	88bef593d8	Improve varags handling, with testcases. Patch by Sasa Stankovic llvm-svn: 127349	2011-03-09 19:22:22 +00:00
Andrew Trick	e529ddb2d7	Improve pre-RA-sched register pressure tracking for duplicate operands. This helps cases like 2008-07-19-movups-spills.ll, but doesn't have an obvious impact on benchmarks llvm-svn: 127347	2011-03-09 19:12:43 +00:00
Jan Sjödin	c7c66d9f88	Add createELFObjectTargetWriter method to TargetAsmBackend, which enables construction of non-standard ELFObjectWriters that can be used in MCJIT. llvm-svn: 127346	2011-03-09 18:44:41 +00:00
Jan Sjödin	6791bc64a2	Add constructors to MCElfStreamer and MCObjectStreamer to take an extra MCAssembler * argument. llvm-svn: 127343	2011-03-09 17:33:05 +00:00
Andrew Trick	c4703f6ea1	When SCEV can determine the loop test is X < X, set ExactBECount=0. When ExactBECount is a constant, use it for MaxBECount. When MaxBECount cannot be computed, replace it with ExactBECount. Fixes PR9424. llvm-svn: 127342	2011-03-09 17:29:58 +00:00
Andrew Trick	de565b0456	whitespace llvm-svn: 127340	2011-03-09 17:23:39 +00:00
Benjamin Kramer	4cf03850a2	Fix typo, make helper static. llvm-svn: 127335	2011-03-09 16:19:12 +00:00
Benjamin Kramer	782cb6d68d	Remove unused virtual dtor. llvm-svn: 127331	2011-03-09 14:20:28 +00:00
NAKAMURA Takumi	fe84f8672a	Target/X86: Tweak va_arg for Win64 not to miss taking va_start when number of fixed args > 4. llvm-svn: 127328	2011-03-09 11:33:15 +00:00
Nick Lewycky	c2b564b36f	Fix two cases I forgot to update when doing a mental "getSwappedPredicate". Thanks Duncan Sands! llvm-svn: 127323	2011-03-09 08:20:06 +00:00
Cameron Zwarich	90efa03e8b	Fix a crasher introduced by r127317 that is seen on the bots when using an alloca as both integer and floating-point vectors of the same size. Bugpoint is not cooperating with me, but I'll try to find a manual testcase tomorrow. llvm-svn: 127320	2011-03-09 07:34:11 +00:00
Nick Lewycky	485af203fc	Add another micro-optimization. Apologies for the lack of refactoring, but I gave up when I realized I couldn't come up with a good name for what the refactored function would be, to describe what it does. This is PR9343 test12, which is test3 with arguments reordered. Whoops! llvm-svn: 127318	2011-03-09 06:26:03 +00:00
Cameron Zwarich	e0b4705b03	Add support to scalar replacement for partial vector accesses of an alloca, e.g. a union of a float, <2 x float>, and <4 x float>. This mostly comes up with the use of vector intrinsics, especially in NEON when programmers know the layout of the register file. This enables codegen to eliminate a lot of the subregister traffic it would otherwise generate. This commit only enables this for a small number of floating-point cases, but a lot more integer cases. I assume this is okay for all ports, but I did not do extensive testing of the quality of code involving i512 vectors and the like. If there is a use case where this generates worse code than before, let me know and we can scale it back. This fixes <rdar://problem/9036264>. llvm-svn: 127317	2011-03-09 05:43:05 +00:00
Cameron Zwarich	e80c47f295	Move vector type merging to a separate function in preparation for it getting more complicated. llvm-svn: 127316	2011-03-09 05:43:01 +00:00
Matt Beaumont-Gay	3e3b6cc819	Add a virtual dtor to Delegate to silence -Wnon-virtual-dtor llvm-svn: 127311	2011-03-09 04:02:15 +00:00
Eli Friedman	50311331a7	PR9346: Prevent SimplifyDemandedBits from incorrectly introducing INT_MIN % -1. llvm-svn: 127306	2011-03-09 01:28:35 +00:00
Jakob Stoklund Olesen	eec325fc2f	Add a LiveRangeEdit::Delegate protocol. This will we used for keeping register allocator data structures up to date while LiveRangeEdit is trimming live intervals. llvm-svn: 127300	2011-03-09 00:57:29 +00:00
Eli Friedman	d1e9ff0d6c	PR9420; an instruction before an unreachable is guaranteed not to have any reachable uses, but there still might be uses in dead blocks. Use the standard solution of replacing all the uses with undef. This is a rare case because it's very sensitive to phase ordering in SimplifyCFG. llvm-svn: 127299	2011-03-09 00:48:33 +00:00
Bill Wendling	68934338ab	* Correct encoding for VSRI. * Add tests for VSRI and VSLI. llvm-svn: 127297	2011-03-09 00:33:17 +00:00
Jakob Stoklund Olesen	7905ed6549	Delete dead code. llvm-svn: 127295	2011-03-09 00:07:39 +00:00

... 2 3 4 5 6 ...

46118 Commits