llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Matthias Braun	45ead6c770	AArch64: Relax assert about large shift sizes. The reason why these large shift sizes happen is because OpaqueConstants currently inhibit alot of DAG combining, but that has to be addressed in another commit (like the proposal in D6946). Differential Revision: http://reviews.llvm.org/D6940 llvm-svn: 230355	2015-02-24 18:52:04 +00:00
Matthias Braun	2c6ac76701	DAGCombiner: Move variable definitions closer to use; NFC llvm-svn: 230354	2015-02-24 18:52:01 +00:00
Matthias Braun	69fb836050	DAGCombiner: Move variable declaration closer to definiion; NFC llvm-svn: 230353	2015-02-24 18:51:59 +00:00
Tom Stellard	266ed1f528	R600/SI: Remove isel mubuf legalization We legalize mubuf instructions post-instruction selection, so this code is no longer needed. llvm-svn: 230352	2015-02-24 17:59:19 +00:00
Tim Northover	afcf85da25	ARM: treat [N x i32] and [N x i64] as AAPCS composite types The logic is almost there already, with our special homogeneous aggregate handling. Tweaking it like this allows front-ends to emit AAPCS compliant code without ever having to count registers or add discarded padding arguments. Only arrays of i32 and i64 are needed to model AAPCS rules, but I decided to apply the logic to all integer arrays for more consistency. llvm-svn: 230348	2015-02-24 17:22:34 +00:00
Tobias Grosser	3022fe4913	Revert "Raising minimum required CMake version to 2.8.12.2." This reverts commit r230062. Debian stable (wheezy) ships still with cmake 2.8.9. The commit broke my LLVM/Polly buildbot, to my knowledge our only Linux+cmake buildbot. llvm-svn: 230343	2015-02-24 16:39:46 +00:00
Sanjay Patel	2d353d537e	simplify control flow; NFC llvm-svn: 230342	2015-02-24 16:26:02 +00:00
Hans Wennborg	fb5af71543	Revert r230280: "Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap" This caused PR22674, failing this assert: Instructions.h:2281: llvm::Value* llvm::PHINode::getOperand(unsigned int) const: Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. llvm-svn: 230341	2015-02-24 16:19:29 +00:00
Michael Kuperstein	d658f2fd10	[x32] Mark RBX as reserved when EBX is the base pointer. This should have gone into r230334. llvm-svn: 230339	2015-02-24 16:13:16 +00:00
Sanjay Patel	c5eee89f6f	fix typo in comment; NFC llvm-svn: 230338	2015-02-24 16:11:05 +00:00
Michael Kuperstein	0bcb785609	[x32] x32 should use ebx as the base pointer. This fixes the original issue in PR22655, but not the secondary one. llvm-svn: 230334	2015-02-24 15:27:13 +00:00
Hal Finkel	5b5de90baa	[SDAG] Handle LowerOperation returning its input consistently For almost all node types, if the target requested custom lowering, and LowerOperation returned its input, we'd treat the original node as legal. This did not work, however, for many loads and stores, because they follow slightly different code paths, and we did not account for the possibility of LowerOperation returning its input at those call sites. I think that we now handle this consistently everywhere. At the call sites in LegalizeDAG, we used to assert in this case, so there's no functional change for any existing code there. For the call sites in LegalizeVectorOps, this really only affects whether or not we set Changed = true, but I think makes the semantics clearer. No test case here, but it will be covered by an upcoming PowerPC commit adding QPX support. llvm-svn: 230332	2015-02-24 12:59:47 +00:00
Toma Tabacu	6dd4831b6f	[mips] Reformat some TableGen definitions. NFC. Summary: Separated some instruction and pseudo-instruction definitions from InstAlias definitions, added banner for pseudo-instructions and removed a redundant whitespace from a pseudo-instruction definition. No functional change. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7552 llvm-svn: 230327	2015-02-24 11:52:19 +00:00
Kuba Brecka	fda2cdf0bc	Fix alloca_instruments_all_paddings.cc test to work under higher -O levels (llvm part) When AddressSanitizer only a single dynamic alloca and no static allocas, due to an early exit from FunctionStackPoisoner::poisonStack we forget to unpoison the dynamic alloca. This patch fixes that. Reviewed at http://reviews.llvm.org/D7810 llvm-svn: 230316	2015-02-24 09:47:05 +00:00
Craig Topper	554fe54ab0	[X86] Remove the AbsMem32 type from the assembly parser. Only really need the 16-bit version which will automatically get prioritized over AbsMem. llvm-svn: 230313	2015-02-24 08:02:13 +00:00
Reed Kotler	05834965d0	Beginning of alloca implementation for Mips fast-isel Summary: Begin to add various address modes; including alloca. Test Plan: Make sure there are no regressions in test-suite at O0/02 in mips32r1/r2 Reviewers: dsanders Reviewed By: dsanders Subscribers: echristo, rfuhler, llvm-commits Differential Revision: http://reviews.llvm.org/D6426 llvm-svn: 230300	2015-02-24 02:36:45 +00:00
Bob Wilson	be0e2eb89d	Fix handling of negative offsets for AddrModeT2_i8s4 in rewriteT2FrameIndex. This is a follow up to r230233 to fix something that I noticed by inspection. The AddrModeT2_i8s4 addressing mode does not support negative offsets. I spent a good chunk of the day trying to come up with a testcase for this but was not successful. This addressing mode is used to spill and restore GPRPair registers in Thumb2 code and that does not happen often. We also make very limited used of negative offsets when lowering frame indexes. I am going ahead with the change anyway, because I am pretty confident that it is correct. I also added a missing assertion to check that the low bits of the scaled offset are zero. llvm-svn: 230297	2015-02-24 01:37:31 +00:00
Sanjoy Das	5b35fffa29	Fix bug 22641 The bug was a result of getPreStartForExtend interpreting nsw/nuw flags on an add recurrence more strongly than is legal. {S,+,X}<nsw> implies S+X is nsw only if the backedge of the loop is taken at least once. NOTE: I had accidentally committed an unrelated change with the commit message of this change in r230275 (r230275 was reverted in r230279). This is the correct change for this commit message. Differential Revision: http://reviews.llvm.org/D7808 llvm-svn: 230291	2015-02-24 01:02:42 +00:00
Manman Ren	87a073ae1a	[LTO API] add lto_codegen_set_module to set the destination module. When debugging LTO issues with ld64, we use -save-temps to save the merged optimized bitcode file, then invoke ld64 again on the single bitcode file to speed up debugging code generation passes and ld64 stuff after code generation. llvm linking a single bitcode file via lto_codegen_add_module will generate a different bitcode file from the single input. With the newly-added lto_codegen_set_module, we can make sure the destination module is the same as the input. lto_codegen_set_module will transfer the ownship of the module to code generator. rdar://19024554 llvm-svn: 230290	2015-02-24 00:45:56 +00:00
Adam Nemet	0b7049f21a	[LoopAccesses] LAA::getInfo to use const reference for stride parameter And other required const-correctness fixes to make this work. llvm-svn: 230289	2015-02-24 00:41:59 +00:00
David Majnemer	f29778c502	X86: Only use 'lea' in Win64 epilogues if a frame pointer exists We can only use 'add' in epilogues, 'lea' is not permitted unless we've established a frame pointer in the prologue. llvm-svn: 230286	2015-02-24 00:11:32 +00:00
Sanjoy Das	de271b53ef	New instcombine rule: max(~a,~b) -> ~min(a, b) This case is interesting because ScalarEvolutionExpander lowers min(a, b) as ~max(~a,~b). I think the profitability heuristics can be made more clever/aggressive, but this is a start. Differential Revision: http://reviews.llvm.org/D7821 llvm-svn: 230285	2015-02-24 00:08:41 +00:00
Sanjoy Das	9dddcf7f33	Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap When emitting the increment operation, SCEVExpander marks the operation as nuw or nsw based on the flags on the preincrement SCEV. This is incorrect because, for instance, it is possible that {-6,+,1} is <nuw> while {-6,+,1}+1 = {-5,+,1} is not. This change teaches SCEV to mark the increment as nuw/nsw only if it can explicitly prove that the increment operation won't overflow. Apart from the attached test case, another (more realistic) manifestation of the bug can be seen in Transforms/IndVarSimplify/pr20680.ll. NOTE: this change was landed with an incorrect commit message in rL230275 and was reverted for that reason in rL230279. This commit message is the correct one. Differential Revision: http://reviews.llvm.org/D7778 llvm-svn: 230280	2015-02-23 23:22:58 +00:00
Sanjoy Das	e5ca05754e	Revert 230275. 230275 got committed with an incorrect commit message due to a mixup on my side. Will re-land in a few moments with the correct commit message. llvm-svn: 230279	2015-02-23 23:13:22 +00:00
Simon Pilgrim	ec80e98714	Fix based on post-commit comment on D7816 & rL230177 - BUILD_VECTOR operand truncation was using the the BV's output scalar type instead of the input type. llvm-svn: 230278	2015-02-23 23:04:28 +00:00
Andrea Di Biagio	fc5bcff188	[X86] Teach how to custom lower double-to-half conversions under fast-math. This patch teaches the backend how to expand a double-half conversion into a double-float conversion immediately followed by a float-half conversion. We do this only under fast-math, and if float-half conversions are legal for the target. Added test CodeGen/X86/fastmath-float-half-conversion.ll Differential Revision: http://reviews.llvm.org/D7832 llvm-svn: 230276	2015-02-23 22:59:02 +00:00
Sanjoy Das	421baaa45f	Fix bug 22641 The bug was a result of getPreStartForExtend interpreting nsw/nuw flags on an add recurrence more strongly than is legal. {S,+,X}<nsw> implies S+X is nsw only if the backedge of the loop is taken at least once. Differential Revision: http://reviews.llvm.org/D7808 llvm-svn: 230275	2015-02-23 22:55:13 +00:00
Rafael Espindola	3792f3c232	Fix invalid cast. Fixes PR22525. Patch by Ben Longbons with testcase by me. llvm-svn: 230271	2015-02-23 21:51:06 +00:00
David Majnemer	b6ac6360c4	X86: Use a smaller 'mov' instruction for stack probe calls Prologue emission, in some cases, requires calls to a stack probe helper function. The amount of stack to probe is passed as a register argument in the Win64 ABI but the instruction sequence used is pessimistic: it assumes that the number of bytes to probe is greater than 4 GB. Instead, select a more appropriate opcode depending on the number of bytes we are going to probe. llvm-svn: 230270	2015-02-23 21:50:30 +00:00
David Majnemer	5bffee412d	X86: Use 'mov' instead of 'lea' in Win64 SEH prologues when possible 'mov' and 'lea' are equivalent when the displacement applied with 'lea' is zero. However, 'mov' should encode smaller. llvm-svn: 230269	2015-02-23 21:50:27 +00:00
David Majnemer	672ed0abe3	X86: Explain why we cannot use a 'mov' in a Win64 epilogue llvm-svn: 230268	2015-02-23 21:50:25 +00:00
David Majnemer	9bb032eee6	X86: Consistently use 'epilogue' instead of 'epilog' llvm-svn: 230267	2015-02-23 21:50:18 +00:00
Sanjay Patel	28567e334a	add newline for easier reading; NFC llvm-svn: 230265	2015-02-23 21:32:09 +00:00
Bruno Cardoso Lopes	d2335534c5	[AsmPrinter] Access pointers to globals via pcrel GOT entries Front-ends could use global unnamed_addr to hold pointers to other symbols, like @gotequivalent below: @foo = global i32 42 @gotequivalent = private unnamed_addr constant i32* @foo @delta = global i32 trunc (i64 sub (i64 ptrtoint (i32** @gotequivalent to i64), i64 ptrtoint (i32* @delta to i64)) to i32) The global @delta holds a data "PC"-relative offset to @gotequivalent, an unnamed pointer to @foo. The darwin/x86-64 assembly output for this follows: .globl _foo _foo: .long 42 .globl _gotequivalent _gotequivalent: .quad _foo .globl _delta _delta: .long _gotequivalent-_delta Since unnamed_addr indicates that the address is not significant, only the content, we can optimize the case above by replacing pc-relative accesses to "GOT equivalent" globals, by a PC relative access to the GOT entry of the final symbol instead. Therefore, "delta" can contain a pc relative relocation to foo's GOT entry and we avoid the emission of "gotequivalent", yielding the assembly code below: .globl _foo _foo: .long 42 .globl _delta _delta: .long _foo@GOTPCREL+4 There are a couple of advantages of doing this: (1) Front-ends that need to emit a great deal of data to store pointers to external symbols could save space by not emitting such "got equivalent" globals and (2) IR constructs combined with this opt opens a way to represent GOT pcrel relocations by using the LLVM IR, which is something we previously had no way to express. Differential Revision: http://reviews.llvm.org/D6922 rdar://problem/18534217 llvm-svn: 230264	2015-02-23 21:26:18 +00:00
Justin Bogner	40b338033b	InstrProf: Teach llvm-cov to show the max count instead of the last When multiple regions start on the same line, llvm-cov was just showing the count of the last one as the line count. This can be confusing and misleading for things like one-liner loops, where the count at the end isn't very interesting, or even "if" statements with an opening brace at the end of the line. Instead, use the maximum of all of the region start counts. llvm-svn: 230263	2015-02-23 21:21:34 +00:00
Andrew Kaylor	e73260c648	Removing unused private field. llvm-svn: 230259	2015-02-23 21:03:30 +00:00
Bruno Cardoso Lopes	a8c6f58e91	[X86][MMX] Fix test to reflect current codegen This test failed in several buildbots, a bit unclear how that happen since this was the previous behavior before r230248. llvm-svn: 230258	2015-02-23 20:57:46 +00:00
Andrew Kaylor	8eb14460aa	Second attempt to fix WinEHCatchDirector build failures. llvm-svn: 230257	2015-02-23 20:44:34 +00:00
Andrew Kaylor	695cbcb2a1	Attempting to fix WinEHCatchDirector destructor related build failures. llvm-svn: 230252	2015-02-23 20:19:15 +00:00
Andrew Kaylor	a496164a47	Adding test for Windows EH frame variable remapping. llvm-svn: 230250	2015-02-23 20:04:51 +00:00
Andrew Kaylor	4e424eb088	Remap frame variables for native Windows exception handling. Differential Revision: http://reviews.llvm.org/D7770 llvm-svn: 230249	2015-02-23 20:01:56 +00:00
Bruno Cardoso Lopes	390bb4572c	Revert "[X86][MMX] Add MMX instructions to foldable tables" This reverts commit r230226 since it breaks win buildbots. llvm-svn: 230248	2015-02-23 19:53:37 +00:00
Chad Rosier	96f24682d4	Revert "Revert "Raising minimum required CMake version to 2.8.12.2."" This reverts commit r230240, which was an accidental commit. llvm-svn: 230246	2015-02-23 19:34:04 +00:00
Eric Christopher	673f20f883	Rewrite the global merge pass to be subprogram agnostic for now. It was previously using the subtarget to get values for the global offset without actually checking each function as it was generating code. Go ahead and solidify the current behavior and make the existing FIXMEs more prominent. As a note the ARM backend previously had a thumb1 and non-thumb1 set of defaults. Only the former was tested so I've changed the behavior to only use that for now. llvm-svn: 230245	2015-02-23 19:28:45 +00:00
Chad Rosier	a95dc74f57	Prevent hoisting fmul from THEN/ELSE to IF if there is fmsub/fmadd opportunity. This patch adds the isProfitableToHoist API. For AArch64, we want to prevent a fmul from being hoisted in cases where it is more profitable to form a fmsub/fmadd. Phabricator Review: http://reviews.llvm.org/D7299 Patch by Lawrence Hu <lawrence@codeaurora.org> llvm-svn: 230241	2015-02-23 19:15:16 +00:00
Chad Rosier	5f625fb96e	Revert "Raising minimum required CMake version to 2.8.12.2." This reverts commit 247aed4710e8befde76da42b27313661dea7cf66. llvm-svn: 230240	2015-02-23 19:15:08 +00:00
Reid Kleckner	f0735a01d1	cmake: Don't do the libstdc++ version check when clang simulates MSVC If we're using clang-cl, that's a pretty good indication that we're going to use MSVC's STL. This simplifies the clang-cl ninja self-host configuration down to: CC=clang-cl CXX=clang-cl cmake .. -GNinja Modified version of zturner's patch: Differential Revision: http://reviews.llvm.org/D7824 llvm-svn: 230239	2015-02-23 19:07:25 +00:00
Mehdi Amini	1b3cca0d56	InstSimplify: simplify 0 / X if nnan and nsz From: Fiona Glaser <fglaser@apple.com> llvm-svn: 230238	2015-02-23 18:30:25 +00:00
Daniel Sanders	0eca0162a3	[mips] Honour -mno-odd-spreg for vector insert/extract when MSA is enabled. Summary: -mno-odd-spreg prohibits the use of odd-numbered single-precision floating point registers. However, vector insert/extract was still using them when manipulating the subregisters of an MSA register. Fixed this by ensuring that insertion/extraction is only performed on even-numbered vector registers when -mno-odd-spreg is given. Reviewers: vmedic, sstankovic Reviewed By: sstankovic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7672 llvm-svn: 230235	2015-02-23 17:22:16 +00:00
Bob Wilson	4d72fc7a63	Fix incorrect immediate size for AddrModeT2_i8s4 in rewriteT2FrameIndex. The natural way to handle this addressing mode would be to say that it has 8 bits and gets scaled by 4, but since the MC layer is expecting the scaling to be already reflected in the immediate value, we have been setting the Scale to 1. That's fine, but then NumBits needs to be adjusted to reflect the effective increase in the range of the immediate. That adjustment was missing. The consequence is that the register scavenger can fail. The estimateRSStackSizeLimit() function in ARMFrameLowering.cpp correctly assumes that the AddrModeT2_i8s4 address mode can handle scaled offsets up to 1020. Under just the right circumstances, we fail to reserve space for the scavenger because it thinks that nothing will be needed. However, the overly pessimistic behavior in rewriteT2FrameIndex causes some frame indexes to be out of range and require scavenged registers, and so the scavenger asserts. Unfortunately I have not been able to come up with a testcase for this. I can only reproduce it on an internal branch where the frame layout and register allocation is slightly different than trunk. We really need a way to serialize MachineInstr-level IR to write reasonable tests for things like this. rdar://problem/19909005 llvm-svn: 230233	2015-02-23 16:57:19 +00:00

... 2 3 4 5 6 ...

114046 Commits