llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Zlatko Buljan	92f1550331	[mips][microMIPS] Add CodeGen support for SUBU16, SUB, SUBU, DSUB and DSUBU instructions Differential Revision: http://reviews.llvm.org/D16676 llvm-svn: 267694	2016-04-27 11:31:44 +00:00
Zlatko Buljan	a2323fb2af	[mips][microMIPS] Add CodeGen support for SLL16, SRL16, SLL, SLLV, SRA, SRAV, SRL and SRLV instructions Differential Revision: http://reviews.llvm.org/D17989 llvm-svn: 267693	2016-04-27 11:02:23 +00:00
Artur Pilipenko	df0b222a6e	isSafeToLoadUnconditionally support queries without a context This is required to use this function from isSafeToSpeculativelyExecute Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D16231 llvm-svn: 267692	2016-04-27 11:00:48 +00:00
Artur Pilipenko	e4f3081483	Use DL preferred alignment for alloca in Value::getPointerAlignment Teach Value::getPointerAlignment that allocas with no explicit alignment are aligned to preferred alignment of the allocated type. Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D17569 llvm-svn: 267689	2016-04-27 10:42:29 +00:00
Simon Pilgrim	b8826cf36d	[InstCombine][SSE] Added DemandedBits tests for MOVMSK instructions MOVMSK zeros the upper bits of the gpr - we should be able to use this. llvm-svn: 267686	2016-04-27 09:53:09 +00:00
Adam Nemet	083815b3cc	Fixed sphinx warning from r267672 llvm-svn: 267675	2016-04-27 05:59:51 +00:00
Adam Nemet	ededcfa020	[LoopDist] Add llvm.loop.distribute.enable loop metadata Summary: D19403 adds a new pragma for loop distribution. This change adds support for the corresponding metadata that the pragma is translated to by the FE. As part of this I had to rethink the flag -enable-loop-distribute. My goal was to be backward compatible with the existing behavior: A1. pass is off by default from the optimization pipeline unless -enable-loop-distribute is specified A2. pass is on when invoked directly from opt (e.g. for unit-testing) The new pragma/metadata overrides these defaults so the new behavior is: B1. A1 + enable distribution for individual loop with the pragma/metadata B2. A2 + disable distribution for individual loop with the pragma/metadata The default value whether the pass is on or off comes from the initiator of the pass. From the PassManagerBuilder the default is off, from opt it's on. I moved -enable-loop-distribute under the pass. If the flag is specified it overrides the default from above. Then the pragma/metadata can further modifies this per loop. As a side-effect, we can now also use -enable-loop-distribute=0 from opt to emulate the default from the optimization pipeline. So to be precise this is the new behavior: C1. pass is off by default from the optimization pipeline unless -enable-loop-distribute or the pragma/metadata enables it C2. pass is on when invoked directly from opt unless -enable-loop-distribute=0 or the pragma/metadata disables it Reviewers: hfinkel Subscribers: joker.eph, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D19431 llvm-svn: 267672	2016-04-27 05:28:18 +00:00
Vaivaswatha Nagaraj	4cd68defaf	[Cloning] cloneLoopWithPreheader(): add assert to ensure no sub-loops Summary: cloneLoopWithPreheader() does not update LoopInfo for sub-loop of the original loop being cloned. Add assert to ensure no sub-loops for loop being cloned. Reviewers: anemet, ashutosh.nema, hfinkel Subscribers: mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D15922 llvm-svn: 267671	2016-04-27 05:25:09 +00:00
Craig Topper	02b21a3953	[Support][X86] Add a few more Intel model numbers to getHostCPUName for airmont and knl. llvm-svn: 267670	2016-04-27 05:17:00 +00:00
Craig Topper	33f93d918d	[Support][X86] Change the case values in the Intel family 6 code to hex so its easier to compare with Intel's docs. NFC llvm-svn: 267669	2016-04-27 05:16:58 +00:00
Mehdi Amini	cc7d938331	Revert "Support "preserving" the summary information when using setModule() API in LTOCodeGenerator" This reverts commit r267665. ASAN shows that there is a use of undefined value. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267668	2016-04-27 05:11:44 +00:00
Craig Topper	56463903be	[Support][X86] Add a couple more Broadwell CPU models numbers to getHostCPUName. llvm-svn: 267666	2016-04-27 04:40:03 +00:00
Mehdi Amini	6e6426ce69	Support "preserving" the summary information when using setModule() API in LTOCodeGenerator Another attempt at r267655... From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267665	2016-04-27 04:24:10 +00:00
Mehdi Amini	4c94cd20e1	Revert "Support "preserving" the summary information when using setModule() API in LTOCodeGenerator" This reverts commit r267657, r267656, and r267655. The test does not pass on multiple bots, I'm unsure why yet but let's unbreak them. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267664	2016-04-27 03:34:28 +00:00
Evgeny Stupachenko	9d35c64dc2	The patch fixes PR27392. Summary: It is incorrect to compare TripCount (which is BECount + 1) with extraiters (or Count) to check if we should enter unrolled loop or not, because TripCount can potentially overflow (when BECount is max unsigned integer). While comparing BECount with (Count - 1) is overflow safe and therefore correct. Reviewer: hfinkel Differential Revision: http://reviews.llvm.org/D19256 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 267662	2016-04-27 03:04:54 +00:00
Philip Reames	60e443db3d	[LVI] Delete stale and misleading comment. llvm-svn: 267661	2016-04-27 03:03:15 +00:00
Chuang-Yu Cheng	d389efdf8e	[ppc64] fix bug in prologue that mfocrf's cr operand should be explict state instead of implicit This fixes PR27414 Reviewers: kbarton mgrang tjablin http://reviews.llvm.org/D19255 llvm-svn: 267660	2016-04-27 02:59:28 +00:00
Ahmed Bougacha	1b19a8307b	[X86] Set AddPristinesAndCSRs to FixupBW LivePhysRegs. NFC. We run after PEI, so we need to AddPristinesAndCSRs. In practice, that makes no difference here, because we only ask about liveness of super-registers of defined GR8/GR16 registers, so they can't be pristine. Still, it's the correct thing to do. Thanks to Quentin for noticing! Follow-up to r267495. llvm-svn: 267658	2016-04-27 01:51:38 +00:00
Mehdi Amini	156bdb42b3	Fix the test from r267656: Support "preserving" the summary information when using setModule() API in LTOCodeGenerator From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267657	2016-04-27 01:49:11 +00:00
Mehdi Amini	c9e9acff2a	Add a test for r267655: Support "preserving" the summary information when using setModule() API in LTOCodeGenerator From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267656	2016-04-27 01:47:46 +00:00
Mehdi Amini	a75f9ab43e	Support "preserving" the summary information when using setModule() API in LTOCodeGenerator From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267655	2016-04-27 01:46:48 +00:00
Sanjoy Das	9c0d8f07ef	Fix typo in comment; NFC llvm-svn: 267653	2016-04-27 01:44:31 +00:00
Ahmed Bougacha	991d42e979	[X86] Don't assume that MMX extractelts are from index 0. It's probably the case for all 3 MMX users out there, but with hand-crafted IR, you can trigger selection failures. Fix that. llvm-svn: 267652	2016-04-27 01:35:29 +00:00
Ahmed Bougacha	208a5db302	[X86] Re-enable MMX i32 extractelt combine. This effectively adds back the extractelt combine removed by r262358: the direct case can still occur (because x86_mmx is special, see r262446), but it's the indirect case that's now superseded by the generic combine. llvm-svn: 267651	2016-04-27 01:35:25 +00:00
Cong Hou	3dea148bfe	Detects the SAD pattern on X86 so that much better code will be emitted once the pattern is matched. Differential revision: http://reviews.llvm.org/D14840 llvm-svn: 267649	2016-04-27 01:29:18 +00:00
Philip Reames	691e9bf845	[LVI] Add a comment explaining a subtle piece of code Or at least, I didn't understand the implications the first several times I read it it. llvm-svn: 267648	2016-04-27 01:02:25 +00:00
Adam Nemet	9d448b66bc	[Docs] Try to clarify the concept of domains for noalias scope Summary: This tries to anchor down the concept of domains a bit better. I had trouble initially relating this to anything. Also talking to David Majnemer on IRC suggested that I wasn't the only one. Reviewers: hfinkel Subscribers: llvm-commits, majnemer Differential Revision: http://reviews.llvm.org/D18799 llvm-svn: 267647	2016-04-27 00:52:48 +00:00
Mehdi Amini	ab55b5059b	ThinLTO: do not promote GlobalVariable that have a specific section. Differential Revision: http://reviews.llvm.org/D18298 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267646	2016-04-27 00:32:13 +00:00
Matt Arsenault	7c561c141c	SLSR: Use UnknownAddressSpace instead of 0 for pure arithmetic. In the case where isLegalAddressingMode is used for cases not related to addressing modes, such as pure adds and muls, it should not be using address space 0. LSR already passes -1 as the address space in these cases. llvm-svn: 267645	2016-04-27 00:32:09 +00:00
Mehdi Amini	01648c823a	LTOCodeGenerator: turns linkonce(_odr) into weak_(odr) when present "MustPreserve" set Summary: If the linker requested to preserve a linkonce function, we should honor this even if we drop all uses. Reviewers: dexonsmith Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19527 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267644	2016-04-27 00:32:02 +00:00
Adam Nemet	d2f4975b05	[LoopDist] Split main class. NFC This splits out the per-loop functionality from the Pass class. With this the fact whether the loop is forced-distribute with the new metadata/pragma can be cached in the per-loop class rather than passed around. llvm-svn: 267643	2016-04-27 00:31:03 +00:00
Philip Reames	0926f356af	[LVI] Reduce compile time by lazily scanning blocks if needed When encountering a non-local pointer, LVI would eagerly scan the block for dereferences of the given object to prove the pointer to be non null. That's all well and good, but then we'd go recurse through our input blocks. As a result, we could end up scanning each and every block we traverse, even if the final definition was obviously non null or we found a constant value somewhere up the chain. The previous code papered over this by using the isKnownNonNull routine from value tracking. This made the duplication less painful in the common case. Instead, we know do the block scan only after we've gotten the recursive results back. This lets us stop scanning individual blocks as soon as we've determined it to be non-null in any predecessor block and use our usual merge rules to propagate that information cheaply through successor blocks. For a pointer which can be found non-null, this does strictly less work and sometimes substaintially so. Note that the case where we can't prove something non-null is still the really expensive case. We end up scanning each and every block looking for a dereference and never end up finding one. llvm-svn: 267642	2016-04-27 00:30:55 +00:00
Quentin Colombet	1c43af45df	[MachineInstrBundle] Actually set the PartialDeadDef flag only when the register is defined! The users were checking the proper thing (Defined + PartialDeadDef), but the information may have been wrong for other use cases, so fix that. llvm-svn: 267641	2016-04-27 00:16:29 +00:00
Quentin Colombet	e5b08e124c	[MachineInstrBundle] Update the comment for PhysRegInfo::DeadDef. I missed read the comment when I commited r267621 and thought the comment did not need update. Matthias kindly proved me wrong. Fixing that. llvm-svn: 267638	2016-04-26 23:55:41 +00:00
Andrew Kaylor	322b6b2a32	Add optimization bisect opt-in calls for SystemZ passes Differential Revision: http://reviews.llvm.org/D19562 llvm-svn: 267636	2016-04-26 23:49:41 +00:00
Andrew Kaylor	9680f122b8	Add optimization bisect opt-in calls for NVPTX passes Differential Revision: http://reviews.llvm.org/D19518 llvm-svn: 267635	2016-04-26 23:44:31 +00:00
Quentin Colombet	c01de3fc6a	[X86] Make sure it is safe to clobber EFLAGS, if need be, when choosing the prologue. Do not use basic blocks that have EFLAGS live-in as prologue if we need to realign the stack. Realigning the stack uses AND instruction and this clobbers EFLAGS. An other alternative would have been to save and restore EFLAGS around the stack realignment code, but this is likely inefficient. Fixes PR27531. llvm-svn: 267634	2016-04-26 23:44:14 +00:00
Justin Bogner	1caa3a1a82	PM: Port Reassociate to the new pass manager llvm-svn: 267631	2016-04-26 23:39:29 +00:00
Mitch Bodart	d1778e20f3	[X86] Replace -mcpu with -mattr in several tests Differential Revision: http://reviews.llvm.org/D19568 llvm-svn: 267629	2016-04-26 23:36:38 +00:00
Justin Bogner	3b17932497	Reassociate: Convert another functor into a lambda. NFC Also move the explanatory comment with it. llvm-svn: 267628	2016-04-26 23:32:00 +00:00
Philip Reames	cb75ab3a5d	[LVI] Cut short search if we know we can't return a useful result Previously we were recursing on our operands for unary and binary operators regardless of whether we knew how to reason about the operator in question. This has the effect of doing a potentially large amount of work, only to throw it away. By checking whether the operation is one LVI can handle, we can cut short the search and return the (overdefined) answer more quickly. The quality of the results produced should not change. llvm-svn: 267626	2016-04-26 23:27:33 +00:00
Sanjay Patel	51a5a5649b	[SimplifyCFG] propagate branch metadata when creating select llvm-svn: 267624	2016-04-26 23:15:48 +00:00
Quentin Colombet	ce78de67e5	[X86] Teach the expansion of copy instructions how to do proper liveness. When the simple analysis provided by MachineBasicBlock::computeRegisterLiveness fails, fall back on the LivePhysReg utility. llvm-svn: 267623	2016-04-26 23:14:32 +00:00
Quentin Colombet	67573257d1	[MachineBasicBlock] Take advantage of the partially dead information. Thanks to that information we wouldn't lie on a register being live whereas it is not. llvm-svn: 267622	2016-04-26 23:14:29 +00:00
Quentin Colombet	c2937566b8	[MachineInstrBundle] Improvement the recognition of dead definitions. Now, it is possible to know that partial definitions are dead definitions and recognize that clobbered registers are also dead. llvm-svn: 267621	2016-04-26 23:14:24 +00:00
Philip Reames	d559635ec4	[LVI] Apply transfer rule for overdefine inputs for binary operators As pointed out by John Regehr over in http://reviews.llvm.org/D19485, LVI was being incredibly stupid about applying its transfer rules. Rather than gathering local facts from the expression itself, it was simply giving up entirely if one of the inputs was overdefined. This greatly impacts the precision of the overall analysis and makes it far more fragile as well. This patch builds on 267609 which did the same thing for unary casts. llvm-svn: 267620	2016-04-26 23:10:35 +00:00
Jingyue Wu	db7e23c040	[NVPTX] Fix some usages of CodeGenOpt::None. NVPTXLowerKernelArgs is required for correctness, so it should not be guarded by CodeGenOpt::None. NVPTXPeephole is optimization only, so it should be skipped when CodeGenOpt::None. llvm-svn: 267619	2016-04-26 22:59:25 +00:00
Philip Reames	368d1c8275	[LVI] A better fix for the assertion error introduced by 267609 Essentially, I was using the wrong size function. For types which were sized, but not primitive, I wasn't getting a useful size for the operand and failed an assert. I fixed this, and also added a guard that the input is a sized type. Test case is for the original mistake. I'm not sure how to actually exercise the sized type check. llvm-svn: 267618	2016-04-26 22:52:30 +00:00
Philip Reames	b43dddfd09	[LVI] Speculative fix for assertion seen in clang bots I'll clean this up and add a test case shortly. I want to make sure this does actually fix the bots; if not, I'll revert. llvm-svn: 267617	2016-04-26 22:31:53 +00:00
Sanjay Patel	65b8c3537e	[LowerExpectIntrinsic] make default likely/unlikely ratio bigger We need the default ratio to be sufficiently large that it triggers transforms based on block frequency info (BFI) and plays well with the recently introduced BranchProbability used by CGP. Differential Revision: http://reviews.llvm.org/D19435 llvm-svn: 267615	2016-04-26 22:23:38 +00:00

1 2 3 4 5 ...

130730 Commits