llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 21:42:54 +02:00

Author	SHA1	Message	Date
Sanjay Patel	3effcd99a5	Don't prevent a vselect of constants from becoming a single load (PR20648). Fix for PR20648 - http://llvm.org/bugs/show_bug.cgi?id=20648 This patch checks the operands of a vselect to see if all values are constants. If yes, bail out of any further attempts to create a blend or shuffle because SelectionDAGLegalize knows how to turn this kind of vselect into a single load. This already happens for machines without SSE4.1, so the added checks just send more targets down that path. Differential Revision: http://reviews.llvm.org/D4934 llvm-svn: 216121	2014-08-20 20:34:56 +00:00
Duncan P. N. Exon Smith	456b7de124	X86: Add missing triples from r216119 llvm-svn: 216120	2014-08-20 19:58:59 +00:00
Duncan P. N. Exon Smith	305e4c4ae6	X86: Align the stack on word boundaries in LowerFormalArguments() The goal of the patch is to implement section 3.2.3 of the AMD64 ABI correctly. The controlling sentence is, "The size of each argument gets rounded up to eightbytes. Therefore the stack will always be eightbyte aligned." The equivalent sentence in the i386 ABI page 37 says, "At all times, the stack pointer should point to a word-aligned area." For both architectures, the stack pointer is not being rounded up to the nearest eightbyte or word between the last normal argument and the first variadic argument. Patch by Thomas Jablin! llvm-svn: 216119	2014-08-20 19:40:59 +00:00
Alexey Samsonov	d47abf2d7c	Fix null reference creation in ScheduleDAGInstrs constructor call. Both MachineLoopInfo and MachineDominatorTree may be null in ScheduleDAGMI constructor call. It is undefined behavior to take references to these values. This bug is reported by UBSan. llvm-svn: 216118	2014-08-20 19:36:05 +00:00
Keno Fischer	1dbe2693fc	Do not insert a tail call when returning multiple values on X86 Summary: This fixes http://llvm.org/bugs/show_bug.cgi?id=19530. The problem is that X86ISelLowering erroneously thought the third call was eligible for tail call elimination. It would have been if it's return value was actually the one returned by the calling function, but here that is not the case and additional values are being returned. Test Plan: Test case from the original bug report is included. Reviewers: rafael Reviewed By: rafael Subscribers: rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D4968 llvm-svn: 216117	2014-08-20 19:00:37 +00:00
Alexey Samsonov	73266a6708	Fix undefined behavior (left shift by 64 bits) in ScaledNumber::toString(). This bug is reported by UBSan. llvm-svn: 216116	2014-08-20 18:30:07 +00:00
Sanjay Patel	55357e1fa3	critical-anti-dependency breaker: don't use reg def info from kill insts (PR20308) In PR20308 ( http://llvm.org/bugs/show_bug.cgi?id=20308 ), the critical-anti-dependency breaker caused a miscompile because it broke a WAR hazard using a register that it thinks is available based on info from a kill inst. Until PR18663 is solved, we shouldn't use any def/use info from a kill because they are really just nops. This patch adds guard checks for kills around calls to ScanInstruction() where the DefIndices array is set. For good measure, add an assert in ScanInstruction() so we don't hit this bug again. The test case is a reduced version of the code from the bug report. Differential Revision: http://reviews.llvm.org/D4977 llvm-svn: 216114	2014-08-20 18:03:00 +00:00
Quentin Colombet	5404c5510b	[PeepholeOptimizer] Refactor the advanced copy optimization to take advantage of the isRegSequence property. This is a follow-up of r215394 and r215404, which respectively introduces the isRegSequence property and uses it for ARM. Thanks to the property introduced by the previous commits, this patch is able to optimize the following sequence: vmov d0, r2, r3 vmov d1, r0, r1 vmov r0, s0 vmov r1, s2 udiv r0, r1, r0 vmov r1, s1 vmov r2, s3 udiv r1, r2, r1 vmov.32 d16[0], r0 vmov.32 d16[1], r1 vmov r0, r1, d16 bx lr into: udiv r0, r0, r2 udiv r1, r1, r3 vmov.32 d16[0], r0 vmov.32 d16[1], r1 vmov r0, r1, d16 bx lr This patch refactors how the copy optimizations are done in the peephole optimizer. Prior to this patch, we had one copy-related optimization that replaced a copy or bitcast by a generic, more suitable (in terms of register file), copy. With this patch, the peephole optimizer features two copy-related optimizations: 1. One for rewriting generic copies to generic copies: PeepholeOptimizer::optimizeCoalescableCopy. 2. One for replacing non-generic copies with generic copies: PeepholeOptimizer::optimizeUncoalescableCopy. The goals of these two optimizations are slightly different: one rewrite the operand of the instruction (#1), the other kills off the non-generic instruction and replace it by a (sequence of) generic instruction(s). Both optimizations rely on the ValueTracker introduced in r212100. The ValueTracker has been refactored to use the information from the TargetInstrInfo for non-generic instruction. As part of the refactoring, we switched the tracking from the index of the definition to the actual register (virtual or physical). This one change is to provide better consistency with register related APIs and to ease the use of the TargetInstrInfo. Moreover, this patch introduces a new helper class CopyRewriter used to ease the rewriting of generic copies (i.e., #1). Finally, this patch adds a dead code elimination pass right after the peephole optimizer to get rid of dead code that may appear after rewriting. This is related to <rdar://problem/12702965>. Review: http://reviews.llvm.org/D4874 llvm-svn: 216088	2014-08-20 17:41:48 +00:00
Andrew Trick	fadfd8041e	Tweak CFGPrinter to wrap very long names. I added wrapping to the CFGPrinter a while back so the -view-cfg output is actually viewable. I've since enountered very long mangled names with the same problem, so I'm slightly tweaking this code to work in that case. llvm-svn: 216087	2014-08-20 17:38:12 +00:00
Rafael Espindola	016a21994a	Remove unused field. llvm-svn: 216086	2014-08-20 17:33:44 +00:00
Juergen Ributzka	7c8f6aa104	[FastISel][AArch64] Don't fold the sign-/zero-extend from i1 into the compare. This fixes a bug I introduced in a previous commit (r216033). Sign-/Zero- extension from i1 cannot be folded into the ADDS/SUBS instructions. Instead both operands have to be sign-/zero-extended with separate instructions. Related to <rdar://problem/17913111>. llvm-svn: 216073	2014-08-20 16:34:15 +00:00
Rafael Espindola	d31735b568	Quick fix for an use after free. llvm-svn: 216071	2014-08-20 15:19:37 +00:00
Dan Liew	6339b014b2	Add note to LangRef about how function arguments can be unnamed and how this affects the numbering of unnamed temporaries. llvm-svn: 216070	2014-08-20 15:06:30 +00:00
Aaron Ballman	1a252af32a	Silencing a -Wcast-qual warning. NFC. llvm-svn: 216068	2014-08-20 12:54:13 +00:00
Aaron Ballman	85f7f5057f	Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)). NFC. llvm-svn: 216067	2014-08-20 12:14:35 +00:00
Jiangning Liu	c3dd378a9e	Optimize ZERO_EXTEND and SIGN_EXTEND in both SelectionDAG Builder and type legalization stage. With those two optimizations, fewer signed/zero extension instructions can be inserted, and then we can expose more opportunities to Machine CSE pass in back-end. llvm-svn: 216066	2014-08-20 12:05:15 +00:00
Pavel Chupin	77b41f178f	[x32] Fix FrameIndex check in SelectLEA64_32Addr Summary: Fixes http://llvm.org/bugs/show_bug.cgi?id=20016 reproducible on new lea-5.ll case. Also use RSP/RBP for x32 lea to save 1 byte used for 0x67 prefix in ESP/EBP case. Test Plan: lea tests modified to include x32/nacl and new test added Reviewers: nadav, dschuff, t.p.northover Subscribers: llvm-commits, zinovy.nis Differential Revision: http://reviews.llvm.org/D4929 llvm-svn: 216065	2014-08-20 11:59:22 +00:00
Yi Kong	57329b1cc2	ARM: Fix codegen for rbit intrinsic LLVM generates illegal `rbit r0, #352` instruction for rbit intrinsic. According to ARM ARM, rbit only takes register as argument, not immediate. The correct instruction should be rbit <Rd>, <Rm>. The bug was originally introduced in r211057. Differential Revision: http://reviews.llvm.org/D4980 llvm-svn: 216064	2014-08-20 10:40:20 +00:00
Bill Wendling	18a9d9229f	Update projects lists. llvm-svn: 216048	2014-08-20 07:32:09 +00:00
Bill Wendling	b502632839	Add libcxxabi to the projects. llvm-svn: 216047	2014-08-20 07:30:08 +00:00
David Majnemer	03fa77d0ce	InstCombine: Annotate sub with nuw when we prove it's safe We can prove that a 'sub' can be a 'sub nuw' if the left-hand side is negative and the right-hand side is non-negative. llvm-svn: 216045	2014-08-20 07:17:31 +00:00
Craig Topper	baef4556f6	Fix an off by 1 bug that prevented SmallPtrSet from using all of its 'small' capacity. Then fix the early return in the move constructor that prevented 'small' moves from clearing the NumElements in the moved from object. The directed test missed this because it was always testing large moves due to the off by 1 bug. llvm-svn: 216044	2014-08-20 04:41:36 +00:00
NAKAMURA Takumi	266b262fd1	Constants.h: Fix possible typo in r216015. [-Wdocumentation] llvm-svn: 216043	2014-08-20 04:22:47 +00:00
Peter Collingbourne	e94d272f0d	[dfsan] Treat vararg custom functions like unimplemented functions. Because declarations of these functions can appear in places like autoconf checks, they have to be handled somehow, even though we do not support vararg custom functions. We do so by printing a warning and calling the uninstrumented function, as we do for unimplemented functions. llvm-svn: 216042	2014-08-20 01:40:23 +00:00
Juergen Ributzka	ce5953230a	[FastISel][AArch64] Use the proper FMOV instruction to materialize a +0.0. Use FMOVWSr/FMOVXDr instead of FMOVSr/FMOVDr, which have the proper register class to be used with the zero register. This makes the MachineInstruction verifier happy again. This is related to <rdar://problem/18027157>. llvm-svn: 216040	2014-08-20 01:10:36 +00:00
David Majnemer	b02f8f16bc	InstCombine: Annotate sub with nsw when we prove it's safe We can prove that a 'sub' can be a 'sub nsw' under certain conditions: - The sign bits of the operands is the same. - Both operands have more than 1 sign bit. The subtraction cannot be a signed overflow in either case. llvm-svn: 216037	2014-08-19 23:36:30 +00:00
Hans Wennborg	00cdc777eb	BumpPtrAllocator: don't accept 0 for the alignment parameter It seems unnecessary to have to use an extra branch to check for this special case. http://reviews.llvm.org/D4945 llvm-svn: 216036	2014-08-19 23:35:33 +00:00
Juergen Ributzka	21b19be38f	[FastISel][AArch64] Factor out ADDS/SUBS instruction emission and add support for extensions and shift folding. Factor out the ADDS/SUBS instruction emission code into helper functions and make the helper functions more clever to support most of the different ADDS/SUBS instructions the architecture support. This includes better immedediate support, shift folding, and sign-/zero-extend folding. This fixes <rdar://problem/17913111>. llvm-svn: 216033	2014-08-19 22:29:55 +00:00
Rafael Espindola	ee775d5673	Split parseAssembly into parseAssembly and parseAssemblyInto. This should restore the functionality of parsing new code into an existing module without the confusing interface. llvm-svn: 216031	2014-08-19 22:05:47 +00:00
Alexey Samsonov	8dee78d45c	Delete unused argument in AArch64MCInstLower constructor: it doesn't use Mangler, and Mangler is in fact not even created when AArch64MCInstLower is constructed. This bug is reported by UBSan. llvm-svn: 216030	2014-08-19 21:51:08 +00:00
Duncan P. N. Exon Smith	714ca5e31c	LangRef: Move example of function-scope uselistorder to a function Should make the example added in r216025 a little more clear. llvm-svn: 216027	2014-08-19 21:48:04 +00:00
Duncan P. N. Exon Smith	56ea569496	IR: Implement uselistorder assembly directives Implement `uselistorder` and `uselistorder_bb` assembly directives, which allow the use-list order to be recovered when round-tripping to assembly. This is the bulk of PR20515. llvm-svn: 216025	2014-08-19 21:30:15 +00:00
Lang Hames	e8f7bb2a7a	[MCJIT] Add an i386 RuntimeDyldMachO test case. llvm-svn: 216024	2014-08-19 21:26:36 +00:00
Duncan P. N. Exon Smith	10fa76d576	IR: Fix a missed case when threading OnlyIfReduced through ConstantExpr In r216015 I missed propagating `OnlyIfReduced` through the inline versions of `getGetElementPtr()` (I was relying on compile failures on mismatches between the header and source signatures to get them all). llvm-svn: 216023	2014-08-19 21:18:21 +00:00
Duncan P. N. Exon Smith	986b92fd8e	verify-uselistorder: Force -preserve-bc-use-list-order llvm-svn: 216022	2014-08-19 21:08:27 +00:00
Juergen Ributzka	233cb7bf1a	[FastISel][AArch64] Extend floating-point materialization test. This adds the missing test that I promised for r215753 to test the materialization of the floating-point value +0.0. Related to <rdar://problem/18027157>. llvm-svn: 216019	2014-08-19 20:35:07 +00:00
Rafael Espindola	851ae8ba7d	fix the gcc build llvm-svn: 216018	2014-08-19 20:06:25 +00:00
Lang Hames	a4e0584cbc	[MCJIT] Allow '$' characters in symbol names in RuntimeDyldChecker. llvm-svn: 216017	2014-08-19 20:04:45 +00:00
Duncan P. N. Exon Smith	6eb8c34f72	IR: Fix ConstantExpr::replaceUsesOfWithOnConstant() Change `ConstantExpr` to follow the model the other constants are using: only malloc a replacement if it's going to be used. This fixes a subtle bug where if an API user had used `ConstantExpr::get()` already to create the replacement but hadn't given it any users, we'd delete the replacement. This relies on r216015 to thread `OnlyIfReduced` through `ConstantExpr::getWithOperands()`. llvm-svn: 216016	2014-08-19 20:03:35 +00:00
Duncan P. N. Exon Smith	319f7343bb	IR: Thread OnlyIfReduced through ConstantExpr::getWithOperands() In order to change `ConstantExpr::replaceUsesOfWithOnConstant()` to work like other constants (e.g., using `ConstantArray::getImpl()`), thread `OnlyIfReduced` through as necessary. When `OnlyIfReduced` is false, there's no functionality change. When it's true, if there's no constant folding or type changes `nullptr` is returned instead of the new constant. `ConstantExpr::replaceUsesOfWithOnConstant()` will be updated to use the "true" version in a follow-up commit. llvm-svn: 216015	2014-08-19 19:45:37 +00:00
Rafael Espindola	0f7753400d	Fix the MSVC build. llvm-svn: 216014	2014-08-19 19:45:15 +00:00
Juergen Ributzka	f39a032c8b	Reapply [FastISel][AArch64] Add support for more addressing modes (r215597). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: FastISel didn't take much advantage of the different addressing modes available to it on AArch64. This commit allows the ComputeAddress method to recognize more addressing modes that allows shifts and sign-/zero-extensions to be folded into the memory operation itself. For Example: lsl x1, x1, #3 --> ldr x0, [x0, x1, lsl #3] ldr x0, [x0, x1] sxtw x1, w1 lsl x1, x1, #3 --> ldr x0, [x0, x1, sxtw #3] ldr x0, [x0, x1] llvm-svn: 216013	2014-08-19 19:44:17 +00:00
Juergen Ributzka	9c8880d176	Reapply [FastISel][X86] Add large code model support for materializing floating-point constants (r215595). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: In the large code model for X86 floating-point constants are placed in the constant pool and materialized by loading from it. Since the constant pool could be far away, a PC relative load might not work. Therefore we first materialize the address of the constant pool with a movabsq and then load from there the floating-point value. Fixes <rdar://problem/17674628>. llvm-svn: 216012	2014-08-19 19:44:13 +00:00
Juergen Ributzka	0d6f36970b	Reapply [FastISel][X86] Use XOR to materialize the "0" value (r215594). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. llvm-svn: 216011	2014-08-19 19:44:10 +00:00
Juergen Ributzka	496a8f883b	Reapply [FastISel][X86] Emit more efficient instructions for integer constant materialization (r215593). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: This mostly affects the i64 value type, which always resulted in an 15byte mobavsq instruction to materialize any constant. The custom code checks the value of the immediate and tries to use a different and smaller mov instruction when possible. This fixes <rdar://problem/17420988>. llvm-svn: 216010	2014-08-19 19:44:06 +00:00
Juergen Ributzka	1cb2d0a61e	Reapply [FastISel][AArch64] Make use of the zero register when possible (r215591). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: This change materializes now the value "0" from the zero register. The zero register can be folded by several instruction, so no materialization is need at all. Fixes <rdar://problem/17924413>. llvm-svn: 216009	2014-08-19 19:44:02 +00:00
Duncan P. N. Exon Smith	119044acf4	ADT: Unit test for ArrayRef::equals change in r215986 llvm-svn: 216008	2014-08-19 19:18:46 +00:00
Duncan P. N. Exon Smith	62fccf853a	IR: De-duplicate code for replacing operands in place This is non-trivial and sits in three places. Move it to ConstantUniqueMap. llvm-svn: 216007	2014-08-19 19:13:30 +00:00
Juergen Ributzka	15f8549d05	Reapply [FastISel] Let the target decide first if it wants to materialize a constant (215588). Note: This was originally reverted to track down a buildbot error. This commit exposed a latent bug that was fixed in r215753. Therefore it is reapplied without any modifications. I run it through SPEC2k and SPEC2k6 for AArch64 and it didn't introduce any new regeressions. Original commit message: This changes the order in which FastISel tries to materialize a constant. Originally it would try to use a simple target-independent approach, which can lead to the generation of inefficient code. On X86 this would result in the use of movabsq to materialize any 64bit integer constant - even for simple and small values such as 0 and 1. Also some very funny floating-point materialization could be observed too. On AArch64 it would materialize the constant 0 in a register even the architecture has an actual "zero" register. On ARM it would generate unnecessary mov instructions or not use mvn. This change simply changes the order and always asks the target first if it likes to materialize the constant. This doesn't fix all the issues mentioned above, but it enables the targets to implement such optimizations. Related to <rdar://problem/17420988>. llvm-svn: 216006	2014-08-19 19:05:24 +00:00
Rafael Espindola	7462d3e78d	Fix a pair of use after free. Should bring the bots back. llvm-svn: 216005	2014-08-19 18:59:14 +00:00

1 2 3 4 5 ...

106998 Commits