llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Dmitri Gribenko	bb0cd8940f	Fix function parameter spelling in comments. Caught by -Wdocumentation. llvm-svn: 166024	2012-10-16 15:37:50 +00:00
Bill Schmidt	ad04de0c32	This patch addresses PR13949. For the PowerPC 64-bit ELF Linux ABI, aggregates of size less than 8 bytes are to be passed in the low-order bits ("right-adjusted") of the doubleword register or memory slot assigned to them. A previous patch addressed this for aggregates passed in registers. However, small aggregates passed in the overflow portion of the parameter save area are still being passed left-adjusted. The fix is made in PPCTargetLowering::LowerCall_Darwin_Or_64SVR4 on the caller side, and in PPCTargetLowering::LowerFormalArguments_64SVR4 on the callee side. The main fix on the callee side simply extends existing logic for 1- and 2-byte objects to 1- through 7-byte objects and corrects a constant left over from 32-bit code. There is also a fix to a bogus calculation of the offset to the following argument in the parameter save area. On the caller side, again a constant left over from 32-bit code is fixed. Additionally, some code for 1-, 2-, and 4-byte objects is duplicated to handle the 3-, 5-, 6-, and 7-byte objects for SVR4 only. The LowerCall_Darwin_Or_64SVR4 logic is getting fairly convoluted trying to handle both ABIs, and I propose to separate this into two functions in a future patch, at which time the duplication can be removed. The patch adds a new test (structsinmem.ll) to demonstrate correct passing of structures of all seven sizes. Eight dummy parameters are used to force these structures to be in the overflow portion of the parameter save area. As a side effect, this corrects the case when aggregates passed in registers are saved into the first eight doublewords of the parameter save area: previously they were stored left-justified, and now they are properly stored right-justified. This requires changing the expected output of existing test case structsinregs.ll. llvm-svn: 166022	2012-10-16 13:30:53 +00:00
Stepan Dyatkovskiy	09c6b0a273	Issue: The stack is formed improperly for long structures passed as byval arguments in EABI mode. The AAPCS reference contains the following statements: A: "If the argument requires double-word alignment (8-byte), the NCRN (Next Core Register Number) is rounded up to the next even register number." (5.5 Parameter Passing, Stage C, C.3). B: "The alignment of an aggregate shall be the alignment of its most-aligned component." (4.3 Composite Types, 4.3.1 Aggregates). So if we have a structure with doubles (9 double fields) and 3 unused core registers (r1, r2, r3), the caller should use the r2 and r3 registers only. Currently the r1, r2, r3 set is used, which is invalid. The callee VA routine should also use the r2 and r3 registers only. All is OK here. This behaviour is guessed by rounding up the SP address with ADD+BFC operations. Fix: The main fix is in ARMTargetLowering::HandleByVal. If we detect AAPCS mode and 8-byte alignment, we waste the odd registers. P.S.: I also improved the LDRB_POST_IMM regression test, since the ldrb instruction is no longer generated by the current regression test after this patch. llvm-svn: 166018	2012-10-16 07:16:47 +00:00
NAKAMURA Takumi	83458d4d01	Reapply r165661, Patch by Shuxin Yang <shuxin.llvm@gmail.com>. Original message: The attached is the fix to radar://11663049. The optimization can be outlined by the following rules: (select (x != c), e, c) -> (select (x != c), e, x), (select (x == c), c, e) -> (select (x == c), x, e), where <c> is an integer constant. The reason for this change is that on x86, conditional-move-from-constant needs two instructions; however, conditional-move-from-register needs only one instruction. While LowerSELECT() sounds like the most convenient place for this optimization, it turns out to be a bad place. The reason is that by replacing the constant <c> with a symbolic value, it obscures some instruction-combining opportunities which would otherwise be very easy to spot. For that reason, I have to postpone the change to the last instruction-combining phase. The change passes the tests of "make check-all -C <build-root>/test" and "make -C project/test-suite/SingleSource". Original message since r165661: My previous change has a bug: I negated the condition code of a CMOV, and went ahead and created a new CMOV using the ORIGINAL condition code. llvm-svn: 166017	2012-10-16 06:28:34 +00:00
Bill Wendling	b8253baeba	Cleanup whitespace. llvm-svn: 166016	2012-10-16 06:10:45 +00:00
Owen Anderson	cb8d1f6815	Fix a bug in the set(I,E)/reset(I,E) methods that I recently added. The boundary condition for checking if I and E were in the same word was incorrect, and, beyond that, the mask computation was not using a wide enough constant. llvm-svn: 166015	2012-10-16 06:04:27 +00:00
Craig Topper	3d4f7d96ea	Move X86MCInstLower class definition into implementation file. It's not needed outside. llvm-svn: 166014	2012-10-16 06:01:50 +00:00
Bill Wendling	86c5a69349	Cleanup whitespace. llvm-svn: 166013	2012-10-16 06:01:44 +00:00
Bill Wendling	2433b08890	Have AttributesImpl defriend the Attributes class. llvm-svn: 166012	2012-10-16 05:57:28 +00:00
Bill Wendling	bd68badfdd	Have AttrBuilder defriend the Attributes class. llvm-svn: 166011	2012-10-16 05:55:09 +00:00
Bill Wendling	17275364ed	Use the Attributes::get method which takes an AttrVal value directly to simplify the code a bit. No functionality change. llvm-svn: 166009	2012-10-16 05:23:31 +00:00
Bill Wendling	86964736b6	Put simple c'tors inline. llvm-svn: 166008	2012-10-16 05:22:28 +00:00
Bill Wendling	d4406cf3d4	Pass in the context to the Attributes::get method. llvm-svn: 166007	2012-10-16 05:20:51 +00:00
Craig Topper	ffe418869a	Fix filename in file header. llvm-svn: 166004	2012-10-16 02:21:30 +00:00
Rafael Espindola	af4181923d	Fix the cpu name and add -verify-machineinstrs. llvm-svn: 166003	2012-10-16 01:13:06 +00:00
Andrew Trick	af9fb59623	misched: Added handleMove support for updating all kill flags, not just for allocatable regs. This is a medium term workaround until we have a more robust solution in the form of a register liveness utility for postRA passes. llvm-svn: 166001	2012-10-16 00:22:51 +00:00
Jakob Stoklund Olesen	bf5a17c340	Remove unused BitVectors from getAllocatableSet(). llvm-svn: 165999	2012-10-16 00:05:06 +00:00
Nadav Rotem	b6fb0afb47	LTO also needs to initialize the TargetTransform infrastructure. llvm-svn: 165997	2012-10-15 22:50:02 +00:00
Jakob Stoklund Olesen	c808be0c56	Remove RegisterClassInfo::isReserved() and isAllocatable(). Clients can use the equivalent functions in MRI. llvm-svn: 165990	2012-10-15 22:41:03 +00:00
Michael Liao	a7e5913fde	Add __builtin_setjmp/_longjmp support in X86 backend - Besides being used in SjLj exception handling, __builtin_setjmp/__longjmp is also used as a light-weight replacement for setjmp/longjmp, which are used to implement continuations, user-level threading, etc. The support added in this patch ONLY addresses this usage and is NOT intended to support SjLj exception handling, as zero-cost DWARF exception handling is used by default in X86. llvm-svn: 165989	2012-10-15 22:39:43 +00:00
Jakob Stoklund Olesen	bde4d183c1	Remove LIS::isAllocatable() and isReserved() helpers. All callers can simply use the corresponding MRI functions. llvm-svn: 165985	2012-10-15 22:14:34 +00:00
Owen Anderson	e678a60cc8	Add range-based set()/reset() to BitVector. These allow fast setting/resetting of ranges of bits, particularly useful when dealing with very large BitVectors. llvm-svn: 165984	2012-10-15 22:05:27 +00:00
Jakob Stoklund Olesen	56bb584754	Switch most getReservedRegs() clients to the MRI equivalent. Using the cached bit vector in MRI avoids constantly allocating and recomputing the reserved register bit vector. llvm-svn: 165983	2012-10-15 21:57:41 +00:00
Jakob Stoklund Olesen	677503ea4e	Freeze the reserved registers as soon as isel is complete. Also provide an MRI::getReservedRegs() function to access the frozen register set, and isReserved() and isAllocatable() methods to test individual registers. The various implementations of TRI::getReservedRegs() are quite complicated, and many passes need to look at the reserved register set. This patch makes it possible for these passes to use the cached copy in MRI, avoiding a lot of malloc traffic and repeated calculations. llvm-svn: 165982	2012-10-15 21:33:06 +00:00
Jim Grosbach	8df1c73056	ARM: v1i64 and v2i64 VBSL intrinsic support. rdar://12502028 llvm-svn: 165981	2012-10-15 21:23:40 +00:00
David Blaikie	5066b7681f	Add dependency on llvm-bcanalyzer from tests to the CMake build. This fixes a CMake build break introduced by r165739. Thanks Jan Voung for the quick suggestion/fix. llvm-svn: 165978	2012-10-15 21:11:46 +00:00
Bill Wendling	7a89835ee4	Move the Attributes::Builder outside of the Attributes class and into its own class named AttrBuilder. No functionality change. llvm-svn: 165960	2012-10-15 20:35:56 +00:00
Andrew Trick	a3ebac4349	Check output of the misched unit tests llvm-svn: 165959	2012-10-15 20:33:14 +00:00
Bill Wendling	ab9bf1a83b	Add comments. llvm-svn: 165958	2012-10-15 19:58:25 +00:00
Rafael Espindola	e152eedf59	Add a cpu to try to fix the atom builder. llvm-svn: 165956	2012-10-15 19:25:43 +00:00
Chad Rosier	860153bf6d	[ms-inline asm] If we parsed a statement and the opcode is valid, then it's an instruction. llvm-svn: 165955	2012-10-15 19:08:18 +00:00
Rafael Espindola	e554bb55b6	Add testcase for pr14088. llvm-svn: 165954	2012-10-15 19:00:10 +00:00
Andrew Trick	252355ac40	misched tests: add a triple to speculatively fix windows builders. llvm-svn: 165952	2012-10-15 18:21:08 +00:00
Rafael Espindola	03c701aad3	Make sure we iterate over newly created instructions. Fixes pr13625. Testcase to follow in one sec. llvm-svn: 165951	2012-10-15 18:21:07 +00:00
Andrew Trick	a5e2aeb12b	misched: ILP scheduler for experimental heuristics. llvm-svn: 165950	2012-10-15 18:02:27 +00:00
Chad Rosier	d95d1262b9	[ms-inline asm] Update the end loc for ParseIntelMemOperand. llvm-svn: 165947	2012-10-15 17:26:38 +00:00
Chad Rosier	77614bbd0e	[ms-inline asm] Add a few new APIs to the AsmParser class in support of MS-Style inline assembly. For the time being, these will be called directly by clang. However, in the near future I expect these to be sunk back into the MC layer and more basic APIs (e.g., getClobbers(), getConstraints(), etc.) will be called by clang. llvm-svn: 165946	2012-10-15 17:19:13 +00:00
Chad Rosier	aeec5a1023	[ms-inline asm] Use incoming argument rather than hard coding to false. llvm-svn: 165945	2012-10-15 16:50:34 +00:00
Jan Wen Voung	b1bdc84698	Fix a typo in bitcode docs, from 165814. llvm-svn: 165944	2012-10-15 16:47:58 +00:00
Micah Villmow	272663afc2	Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis. llvm-svn: 165941	2012-10-15 16:24:29 +00:00
Adhemerval Zanella	ad392b0f93	PowerPC: add EmitTCEntry class for TOC creation. This patch replaces EmitRawText with an EmitTCEntry class (specialized for each Streamer) in PowerPC64 TOC entry creation. llvm-svn: 165940	2012-10-15 15:43:14 +00:00
Kostya Serebryany	21702ac519	[asan] fix a test llvm-svn: 165938	2012-10-15 14:30:30 +00:00
Kostya Serebryany	a6cd7ad8f2	[asan] make AddressSanitizer a FunctionPass instead of a ModulePass. This will simplify chaining other FunctionPasses with asan. Also some minor cleanup. llvm-svn: 165936	2012-10-15 14:20:06 +00:00
Chandler Carruth	0659e10e8a	Update the memcpy rewriting to fully support widened int rewriting. This includes extracting ints for copying elsewhere and inserting ints when copying into the alloca. This should fix the CanSROA assertion coming out of Clang's regression test suite. llvm-svn: 165931	2012-10-15 10:24:43 +00:00
Chandler Carruth	7755041393	Follow-up fix to r165928: handle memset rewriting for widened integers, and generally clean up the memset handling. It had rotted a bit as the other rewriting logic got polished more. llvm-svn: 165930	2012-10-15 10:24:40 +00:00
Silviu Baranga	e3e0e84559	Fixed PR13938: the ARM backend was crashing because it couldn't select a VDUPLANE node with the vector input size different from the output size. This was because the BUILD_VECTOR lowering code didn't check that the size of the input vector was correct for using VDUPLANE. llvm-svn: 165929	2012-10-15 09:41:32 +00:00
Chandler Carruth	65613836e9	First major step toward addressing PR14059. This teaches SROA to handle cases where we have partial integer loads and stores to an otherwise promotable alloca to widen[1] those loads and stores to cover the entire alloca and bitcast them into the appropriate type such that promotion can proceed. These partial loads and stores stem from an annoying confluence of ARM's calling convention and ABI lowering and the FCA pre-splitting which takes place in SROA. Clang lowers a { double, double } in-register function argument as a [4 x i32] function argument to ensure it is placed into integer 32-bit registers (a really unnerving implicit contract between Clang and the ARM backend I would add). This results in a FCA load of [4 x i32]* from the { double, double } alloca, and SROA decomposes this into a sequence of i32 loads and stores. Inlining proceeds, code gets folded, but at the end of the day, we still have i32 stores to the low and high halves of a double alloca. Widening these to be i64 operations, and bitcasting them to double prior to loading or storing allows promotion to proceed for these allocas. I looked quite a bit at changing the IR which Clang produces for this case to be more friendly, but small changes seem unlikely to help. I think the best representation we could use currently would be to pass 4 i32 arguments thereby avoiding any FCAs, but that would still require this fix. It seems like it might eventually be nice to somehow encode the ABI register selection choices outside of the parameter type system so that the parameter can be a { double, double }, but the CC register annotations indicate that this should be passed via 4 integer registers. This patch does not address the second problem in PR14059, which is the reverse: when a struct alloca is loaded as a larger single integer. This patch also does not address some of the code quality issues with the FCA-splitting. Those don't actually impede any optimizations really, but they're on my list to clean up. [1]: Pedantic footnote: for those concerned about memory model issues here, this is safe. For the alloca to be promotable, it cannot escape or have any use of its address that could allow these loads or stores to be racing. Thus, widening is always safe. llvm-svn: 165928	2012-10-15 08:40:30 +00:00
Chandler Carruth	8b8a99cfa9	Hoist the canConvertValue predicate and the convertValue transform out into static helper functions. They're really quite generic and are going to be needed elsewhere shortly. llvm-svn: 165927	2012-10-15 08:40:22 +00:00
Bill Wendling	f2fff93263	Add an enum for the return and function indexes into the AttrListPtr object. This gets rid of some magic numbers. llvm-svn: 165924	2012-10-15 07:29:08 +00:00
Bill Wendling	5fb56f15bd	Use a ::get method to create the attribute from Attributes::AttrVals instead of a constructor. llvm-svn: 165923	2012-10-15 06:53:28 +00:00
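
Illustration for r165984 / r166015 (range-based set): the following is a minimal sketch of setting a range of bits in a word-based bit vector. It is not the llvm::BitVector implementation. TinyBitVector and maskRange are hypothetical names, and the half-open range [I, E) and the 64-bit word size are assumptions made for the example. The two pitfalls named in r166015 correspond to the same-word check (which looks at the word holding bit E - 1, not bit E) and to building masks from a 64-bit constant rather than a plain int.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a word-based bit vector (illustration only).
struct TinyBitVector {
  static constexpr unsigned kWordBits = 64;
  std::vector<uint64_t> Words;

  explicit TinyBitVector(unsigned NumBits)
      : Words((NumBits + kWordBits - 1) / kWordBits, uint64_t(0)) {}

  // Mask with bits [Lo, Hi) set inside one word; requires Lo < Hi <= 64.
  // uint64_t(1) keeps the shift 64 bits wide; a plain int 1 would not be.
  static uint64_t maskRange(unsigned Lo, unsigned Hi) {
    assert(Lo < kWordBits && Lo < Hi && Hi <= kWordBits);
    uint64_t Below = (Hi == kWordBits) ? ~uint64_t(0)
                                       : (uint64_t(1) << Hi) - 1;
    return Below & ~((uint64_t(1) << Lo) - 1);
  }

  // Set every bit in the half-open range [I, E).
  void set(unsigned I, unsigned E) {
    assert(I <= E && E <= Words.size() * kWordBits);
    if (I == E)
      return;
    unsigned FirstWord = I / kWordBits;
    unsigned LastWord = (E - 1) / kWordBits; // word holding the last bit
    unsigned LastHi = (E - 1) % kWordBits + 1;
    if (FirstWord == LastWord) {
      Words[FirstWord] |= maskRange(I % kWordBits, LastHi);
      return;
    }
    Words[FirstWord] |= maskRange(I % kWordBits, kWordBits);
    for (unsigned W = FirstWord + 1; W < LastWord; ++W)
      Words[W] = ~uint64_t(0); // whole words in the middle
    Words[LastWord] |= maskRange(0, LastHi);
  }

  bool test(unsigned Idx) const {
    return (Words[Idx / kWordBits] >> (Idx % kWordBits)) & 1;
  }
};
```

Filling whole middle words with a single store is what makes a range operation cheaper than setting bits one at a time on a very large vector.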

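Illustration for r166018 (AAPCS byval alignment): the rule quoted in that commit message, rounding the NCRN up to the next even register number for 8-byte-aligned arguments, can be sketched as below. This is not the code in ARMTargetLowering::HandleByVal. roundUpNCRN, regsAvailable and kNumArgRegs are hypothetical names, and register numbers 0 to 3 stand for r0 to r3 (a result of 4 means no core register is left).

```cpp
#include <cassert>

// Assumption for this sketch: four core argument registers, r0..r3.
constexpr unsigned kNumArgRegs = 4;

// Round the Next Core Register Number up to an even register when the
// argument needs 8-byte alignment; otherwise leave it unchanged.
constexpr unsigned roundUpNCRN(unsigned NCRN, unsigned AlignBytes) {
  return (AlignBytes == 8 && NCRN % 2 != 0) ? NCRN + 1 : NCRN;
}

// Number of core registers still usable for the argument (NCRN <= 4).
constexpr unsigned regsAvailable(unsigned NCRN, unsigned AlignBytes) {
  return kNumArgRegs - roundUpNCRN(NCRN, AlignBytes);
}
```

With r1, r2 and r3 unused (NCRN = 1) and a structure of doubles (8-byte alignment), the sketch skips r1 and leaves r2 and r3, matching the example in the commit message.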

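Illustration for r166017 (select rewrite): the first rule from that commit message, (select (x != c), e, c) -> (select (x != c), e, x), is value-preserving because the third operand is only chosen when x == c. The sketch below checks this with plain C++ conditionals. It models the rule only, not the X86 lowering, and selectBefore, selectAfter and rewriteIsEquivalent are hypothetical names.

```cpp
#include <cassert>

// (select (x != c), e, c): the false arm is the constant c.
inline int selectBefore(int X, int C, int E) { return X != C ? E : C; }

// (select (x != c), e, x): the false arm reuses x, which equals c there.
inline int selectAfter(int X, int C, int E) { return X != C ? E : X; }

// Compare both forms for every x in [Lo, Hi].
inline bool rewriteIsEquivalent(int Lo, int Hi, int C, int E) {
  for (int X = Lo; X <= Hi; ++X)
    if (selectBefore(X, C, E) != selectAfter(X, C, E))
      return false;
  return true;
}
```

The second rule, (select (x == c), c, e) -> (select (x == c), x, e), follows by the same argument with the arms swapped.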
85738 Commits