llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Chandler Carruth	7d68167eb0	This just in, it is a bad idea to use 'udiv' on an offset of a pointer. A very bad idea. Let's not do that. Fixes PR14105. Note that this wasn't that glaring of an oversight. Originally, these routines were only called on offsets within an alloca, which are intrinsically positive. But over the evolution of the pass, they ended up being called for arbitrary offsets, and things went downhill... llvm-svn: 166095	2012-10-17 09:23:48 +00:00
Bill Wendling	4314405e52	Marked this variable as 'used' so that LTO doesn't get rid of it. llvm-svn: 166092	2012-10-17 08:08:06 +00:00
Chandler Carruth	2d0f74e722	Fix a really annoying "bug" introduced in r165941. The change from that revision makes no sense. We cannot use the address space of the post indexed type to conclude anything about a pre indexed pointer type's size. More importantly, this index can never be over a pointer. We are indexing over arrays and vectors here. Of course, I have no test case here. Neither did the original patch. =/ llvm-svn: 166091	2012-10-17 07:22:16 +00:00
Craig Topper	7e08ec038e	Remove LLVM_DELETED_FUNCTION from destructors that override non-deleted base class destructors. This isn't legal by the C++11 standard and clang now checks for it. Curiously gcc didn't catch this, possibly because of the template usage. llvm-svn: 166089	2012-10-17 05:15:58 +00:00
Michael Liao	f5ea791113	Check SSSE3 instead of SSE4.1 - All shuffle insns required, especially PSHUB, are added in SSSE3. llvm-svn: 166086	2012-10-17 03:59:18 +00:00
Michael Liao	db8bc2e5dc	Fix setjmp on models with non-Small code model nor non-Static relocation model - MBB address is only valid as an immediate value in Small & Static code/relocation models. On other models, LEA is needed to load IP address of the restore MBB. - A minor fix of MBB in MC lowering is added as well to enable target relocation flag being propagated into MC. llvm-svn: 166084	2012-10-17 02:22:27 +00:00
Jakob Stoklund Olesen	bd6db6eeb9	Use a SparseSet instead of a BitVector for UsedInInstr in RAFast. This is just as fast, and it makes it possible to avoid leaking the UsedPhysRegs BitVector implementation through MachineRegisterInfo::addPhysRegsUsed(). llvm-svn: 166083	2012-10-17 01:37:59 +00:00
Eric Christopher	98ed8efd11	Use a typedef to reduce some typing and reformat code accordingly. llvm-svn: 166077	2012-10-16 23:46:25 +00:00
Eric Christopher	e411070efb	Variable name cleanup. llvm-svn: 166076	2012-10-16 23:46:23 +00:00
Eric Christopher	ae3c2b9adc	Formatting and 80-col. llvm-svn: 166075	2012-10-16 23:46:21 +00:00
Eric Christopher	7eb11f8437	Spacing. llvm-svn: 166074	2012-10-16 23:46:19 +00:00
Jakob Stoklund Olesen	96ecdfd8dd	Avoid rematerializing a redef immediately after the old def. PR14098 contains an example where we would rematerialize a MOV8ri immediately after the original instruction: %vreg7:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7 %vreg22:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7 Besides being pointless, it is also wrong since the original instruction only redefines part of the register, and the value read by the new instruction is wrong. The problem was the LiveRangeEdit::allUsesAvailableAt() didn't special-case OrigIdx == UseIdx and found the wrong SSA value. llvm-svn: 166068	2012-10-16 22:51:58 +00:00
Jakob Stoklund Olesen	1cfbe5c549	Revert r166046 "Switch back to the old coalescer for now to fix the 32 bit bit" A fix for PR14098, including the test case is in the next commit. llvm-svn: 166067	2012-10-16 22:51:55 +00:00
Michael Gottesman	b9205da3c6	[InstCombine] Teach InstCombine how to handle an obfuscated splat. An obfuscated splat is where the frontend poorly generates code for a splat using several different shuffles to create the splat, i.e., %A = load <4 x float>* %in_ptr, align 16 %B = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef> %C = shufflevector <4 x float> %B, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 4, i32 undef> %D = shufflevector <4 x float> %C, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 2, i32 4> llvm-svn: 166061	2012-10-16 21:29:38 +00:00
Chad Rosier	d87fb99e77	[ms-inline asm] Add the helper function, isParseringInlineAsm(). To be used in a future commit. llvm-svn: 166054	2012-10-16 20:16:20 +00:00
Jakub Staszak	0b6558a5cf	Simplify code. No functionality change. llvm-svn: 166053	2012-10-16 19:52:32 +00:00
Michael Liao	05f5d27eb5	Check .rela instead of ELF64 for the compensation vaue resetting llvm-svn: 166051	2012-10-16 19:49:51 +00:00
Jakub Staszak	d3382d44bb	80-col fixup. llvm-svn: 166050	2012-10-16 19:39:40 +00:00
Michael Liao	ee2ce36cda	Teach DAG combine to fold (trunc (fptoXi x)) to (fptoXi x) llvm-svn: 166049	2012-10-16 19:38:35 +00:00
Rafael Espindola	2f08719190	Switch back to the old coalescer for now to fix the 32 bit bit llvm+clang+compiler-rt bootstrap. llvm-svn: 166046	2012-10-16 19:34:06 +00:00
Jakub Staszak	3eeb051e40	Simplify potentially quadratic behavior while erasing elements from std::vector. llvm-svn: 166045	2012-10-16 19:32:31 +00:00
Bill Wendling	235be85221	And now we can call the other 'get' method from this one and not duplicate the code. llvm-svn: 166037	2012-10-16 18:20:09 +00:00
Michael Liao	a5c31648c0	Support v8f32 to v8i8/vi816 conversion through custom lowering - Add custom FP_TO_SINT on v8i16 (and v8i8 which is legalized as v8i16 due to vector element-wise widening) to reduce DAG combiner and its overhead added in X86 backend. llvm-svn: 166036	2012-10-16 18:14:11 +00:00
Bill Wendling	da7701b08b	Use the appropriate Attributes::get method to create an Attributes object. llvm-svn: 166035	2012-10-16 18:06:06 +00:00
Owen Anderson	38e04ae4a0	Speculative fix the mask constants to be of type uintptr_t. I don't know of any case where the old form was incorrect, but I'm more confident that such cases don't exist in this version. llvm-svn: 166031	2012-10-16 17:10:33 +00:00
Dmitri Gribenko	bb0cd8940f	Fix function parameter spelling in comments. Caught by -Wdocumentation. llvm-svn: 166024	2012-10-16 15:37:50 +00:00
Bill Schmidt	ad04de0c32	This patch addresses PR13949. For the PowerPC 64-bit ELF Linux ABI, aggregates of size less than 8 bytes are to be passed in the low-order bits ("right-adjusted") of the doubleword register or memory slot assigned to them. A previous patch addressed this for aggregates passed in registers. However, small aggregates passed in the overflow portion of the parameter save area are still being passed left-adjusted. The fix is made in PPCTargetLowering::LowerCall_Darwin_Or_64SVR4 on the caller side, and in PPCTargetLowering::LowerFormalArguments_64SVR4 on the callee side. The main fix on the callee side simply extends existing logic for 1- and 2-byte objects to 1- through 7-byte objects, and correcting a constant left over from 32-bit code. There is also a fix to a bogus calculation of the offset to the following argument in the parameter save area. On the caller side, again a constant left over from 32-bit code is fixed. Additionally, some code for 1, 2, and 4-byte objects is duplicated to handle the 3, 5, 6, and 7-byte objects for SVR4 only. The LowerCall_Darwin_Or_64SVR4 logic is getting fairly convoluted trying to handle both ABIs, and I propose to separate this into two functions in a future patch, at which time the duplication can be removed. The patch adds a new test (structsinmem.ll) to demonstrate correct passing of structures of all seven sizes. Eight dummy parameters are used to force these structures to be in the overflow portion of the parameter save area. As a side effect, this corrects the case when aggregates passed in registers are saved into the first eight doublewords of the parameter save area: Previously they were stored left-justified, and now are properly stored right-justified. This requires changing the expected output of existing test case structsinregs.ll. llvm-svn: 166022	2012-10-16 13:30:53 +00:00
Stepan Dyatkovskiy	09c6b0a273	Issue: Stack is formed improperly for long structures passed as byval arguments for EABI mode. If we took AAPCS reference, we can found the next statements: A: "If the argument requires double-word alignment (8-byte), the NCRN (Next Core Register Number) is rounded up to the next even register number." (5.5 Parameter Passing, Stage C, C.3). B: "The alignment of an aggregate shall be the alignment of its most-aligned component." (4.3 Composite Types, 4.3.1 Aggregates). So if we have structure with doubles (9 double fields) and 3 Core unused registers (r1, r2, r3): caller should use r2 and r3 registers only. Currently r1,r2,r3 set is used, but it is invalid. Callee VA routine should also use r2 and r3 regs only. All is ok here. This behaviour is guessed by rounding up SP address with ADD+BFC operations. Fix: Main fix is in ARMTargetLowering::HandleByVal. If we detected AAPCS mode and 8 byte alignment, we waste odd registers then. P.S.: I also improved LDRB_POST_IMM regression test. Since ldrb instruction will not generated by current regression test after this patch. llvm-svn: 166018	2012-10-16 07:16:47 +00:00
NAKAMURA Takumi	83458d4d01	Reapply r165661, Patch by Shuxin Yang <shuxin.llvm@gmail.com>. Original message: The attached is the fix to radar://11663049. The optimization can be outlined by following rules: (select (x != c), e, c) -> select (x != c), e, x), (select (x == c), c, e) -> select (x == c), x, e) where the <c> is an integer constant. The reason for this change is that : on x86, conditional-move-from-constant needs two instructions; however, conditional-move-from-register need only one instruction. While the LowerSELECT() sounds to be the most convenient place for this optimization, it turns out to be a bad place. The reason is that by replacing the constant <c> with a symbolic value, it obscure some instruction-combining opportunities which would otherwise be very easy to spot. For that reason, I have to postpone the change to last instruction-combining phase. The change passes the test of "make check-all -C <build-root/test" and "make -C project/test-suite/SingleSource". Original message since r165661: My previous change has a bug: I negated the condition code of a CMOV, and go ahead creating a new CMOV using the ORIGINAL condition code. llvm-svn: 166017	2012-10-16 06:28:34 +00:00
Bill Wendling	b8253baeba	Cleanup whitespace. llvm-svn: 166016	2012-10-16 06:10:45 +00:00
Owen Anderson	cb8d1f6815	Fix a bug in the set(I,E)/reset(I,E) methods that I recently added. The boundary condition for checking if I and E were in the same word were incorrect, and, beyond that, the mask computation was not using a wide enough constant. llvm-svn: 166015	2012-10-16 06:04:27 +00:00
Craig Topper	3d4f7d96ea	Move X86MCInstLower class definition into implementation file. It's not needed outside. llvm-svn: 166014	2012-10-16 06:01:50 +00:00
Bill Wendling	86c5a69349	Cleanup whitespace. llvm-svn: 166013	2012-10-16 06:01:44 +00:00
Bill Wendling	2433b08890	Have AttributesImpl defriend the Attributes class. llvm-svn: 166012	2012-10-16 05:57:28 +00:00
Bill Wendling	bd68badfdd	Have AttrBuilder defriend the Attributes class. llvm-svn: 166011	2012-10-16 05:55:09 +00:00
Bill Wendling	17275364ed	Use the Attributes::get method which takes an AttrVal value directly to simplify the code a bit. No functionality change. llvm-svn: 166009	2012-10-16 05:23:31 +00:00
Bill Wendling	86964736b6	Put simple c'tors inline. llvm-svn: 166008	2012-10-16 05:22:28 +00:00
Bill Wendling	d4406cf3d4	Pass in the context to the Attributes::get method. llvm-svn: 166007	2012-10-16 05:20:51 +00:00
Craig Topper	ffe418869a	Fix filename in file header. llvm-svn: 166004	2012-10-16 02:21:30 +00:00
Rafael Espindola	af4181923d	Fix the cpu name and add -verify-machineinstrs. llvm-svn: 166003	2012-10-16 01:13:06 +00:00
Andrew Trick	af9fb59623	misched: Added handleMove support for updating all kill flags, not just for allocatable regs. This is a medium term workaround until we have a more robust solution in the form of a register liveness utility for postRA passes. llvm-svn: 166001	2012-10-16 00:22:51 +00:00
Jakob Stoklund Olesen	bf5a17c340	Remove unused BitVectors from getAllocatableSet(). llvm-svn: 165999	2012-10-16 00:05:06 +00:00
Nadav Rotem	b6fb0afb47	LTO also needs to initialize the TargetTransform infrastructure. llvm-svn: 165997	2012-10-15 22:50:02 +00:00
Jakob Stoklund Olesen	c808be0c56	Remove RegisterClassInfo::isReserved() and isAllocatable(). Clients can use the equivalent functions in MRI. llvm-svn: 165990	2012-10-15 22:41:03 +00:00
Michael Liao	a7e5913fde	Add __builtin_setjmp/_longjmp supprt in X86 backend - Besides used in SjLj exception handling, __builtin_setjmp/__longjmp is also used as a light-weight replacement of setjmp/longjmp which are used to implementation continuation, user-level threading, and etc. The support added in this patch ONLY addresses this usage and is NOT intended to support SjLj exception handling as zero-cost DWARF exception handling is used by default in X86. llvm-svn: 165989	2012-10-15 22:39:43 +00:00
Jakob Stoklund Olesen	bde4d183c1	Remove LIS::isAllocatable() and isReserved() helpers. All callers can simply use the corresponding MRI functions. llvm-svn: 165985	2012-10-15 22:14:34 +00:00
Owen Anderson	e678a60cc8	Add range-based set()/reset() to BitVector. These allow fast setting/resetting of ranges of bits, particularly useful when dealing with very large BitVector's. llvm-svn: 165984	2012-10-15 22:05:27 +00:00
Jakob Stoklund Olesen	56bb584754	Switch most getReservedRegs() clients to the MRI equivalent. Using the cached bit vector in MRI avoids comstantly allocating and recomputing the reserved register bit vector. llvm-svn: 165983	2012-10-15 21:57:41 +00:00
Jakob Stoklund Olesen	677503ea4e	Freeze the reserved registers as soon as isel is complete. Also provide an MRI::getReservedRegs() function to access the frozen register set, and isReserved() and isAllocatable() methods to test individual registers. The various implementations of TRI::getReservedRegs() are quite complicated, and many passes need to look at the reserved register set. This patch makes it possible for these passes to use the cached copy in MRI, avoiding a lot of malloc traffic and repeated calculations. llvm-svn: 165982	2012-10-15 21:33:06 +00:00
Jim Grosbach	8df1c73056	ARM: v1i64 and v2i64 VBSL intrinsic support. rdar://12502028 llvm-svn: 165981	2012-10-15 21:23:40 +00:00

... 3 4 5 6 7 ...

85963 Commits