llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 21:13:02 +02:00

Author	SHA1	Message	Date
Sjoerd Meijer	b088b3e65b	This implements a more optimal algorithm for selecting a base constant in constant hoisting. It not only takes into account the number of uses and the cost of expressions in which constants appear, but now also the resulting integer range of the offsets. Thus, the algorithm maximizes the number of uses within an integer range that will enable more efficient code generation. On ARM, for example, this will enable code size optimisations because less negative offsets will be created. Negative offsets/immediates are not supported by Thumb1 thus preventing more compact instruction encoding. Differential Revision: http://reviews.llvm.org/D21183 llvm-svn: 275382	2016-07-14 07:44:20 +00:00
David Majnemer	a2e9622df1	[InstCombine] Masked loads with undef masks can fold to normal loads We were able to fold masked loads with an all-ones mask to a normal load. However, we couldn't turn a masked load with a mask with mixed ones and undefs into a normal load. llvm-svn: 275380	2016-07-14 06:58:42 +00:00
David Majnemer	1e74a4ec26	Simplify llvm.masked.load w/ undef masks We can always pick the passthru value if the mask is undef: we are permitted to treat the mask as-if it were filled with zeros. llvm-svn: 275379	2016-07-14 06:58:37 +00:00
Craig Topper	d3e63ad239	[AVX512] Implement EXTLOAD lowering with patterns to select existing VPMOVZX instructions instead of creating CodeGenOnly instructions. llvm-svn: 275378	2016-07-14 06:41:34 +00:00
Eli Friedman	f453c38f88	[X86] Fix stupid typo in isel lowering. Apparently someone miscounted the number of zeros in the immediate. Fixes https://llvm.org/bugs/show_bug.cgi?id=28544 . llvm-svn: 275376	2016-07-14 05:48:25 +00:00
Matt Arsenault	ce4eceea10	AMDGPU/R600: Delete/rename intrinsics no longer used by mesa Use the replacement pass to update the tests, and delete old names. llvm-svn: 275375	2016-07-14 05:47:17 +00:00
Matt Arsenault	a42057703b	AMDGPU/R600: Remove intrinsics with no tests and no users Mesa removed this path, so nothing is using these anymore. llvm-svn: 275372	2016-07-14 05:23:23 +00:00
Matt Arsenault	cfd2c39f42	AMDGPU: Remove unused intrinsics llvm-svn: 275371	2016-07-14 05:23:19 +00:00
Matt Arsenault	6622b93a6f	AMDGPU: Fix test not actually testing anything It wasn't actually running the pass, and since it is missing the llvm prefix, the eh intrinsic was not really an IntrinsicInst. Also add missing test for lifetime markers. llvm-svn: 275370	2016-07-14 05:23:15 +00:00
Matt Arsenault	a21ee6967c	AMDGPU: Remove dead code llvm-svn: 275369	2016-07-14 05:23:08 +00:00
Dean Michael Berris	b3cb9bd89d	XRay: Add entry and exit sleds Summary: In this patch we implement the following parts of XRay: - Supporting a function attribute named 'function-instrument' which currently only supports 'xray-always'. We should be able to use this attribute for other instrumentation approaches. - Supporting a function attribute named 'xray-instruction-threshold' used to determine whether a function is instrumented with a minimum number of instructions (IR instruction counts). - X86-specific nop sleds as described in the white paper. - A machine function pass that adds the different instrumentation marker instructions at a very late stage. - A way of identifying which return opcode is considered "normal" for each architecture. There are some caveats here: 1) We don't handle PATCHABLE_RET in platforms other than x86_64 yet -- this means if IR used PATCHABLE_RET directly instead of a normal ret, instruction lowering for that platform might do the wrong thing. We think this should be handled at instruction selection time to by default be unpacked for platforms where XRay is not availble yet. 2) The generated section for X86 is different from what is described from the white paper for the sole reason that LLVM allows us to do this neatly. We're taking the opportunity to deviate from the white paper from this perspective to allow us to get richer information from the runtime library. Reviewers: sanjoy, eugenis, kcc, pcc, echristo, rnk Subscribers: niravd, majnemer, atrick, rnk, emaste, bmakam, mcrosier, mehdi_amini, llvm-commits Differential Revision: http://reviews.llvm.org/D19904 llvm-svn: 275367	2016-07-14 04:06:33 +00:00
Davide Italiano	7dbaefb75a	[SCCP] Pass a Value * instead of templating this function. NFC. Thanks to Eli for the suggestion! llvm-svn: 275366	2016-07-14 03:02:34 +00:00
Chris Lattner	b9db25ac49	clarify a bit. llvm-svn: 275364	2016-07-14 02:52:04 +00:00
Davide Italiano	8b06b72045	[IPSCCP] Constant fold struct argument/instructions when all the lattice values are constant. This now should also work with the interprocedural variant of the pass. Slightly easier now that the yak is shaved. Differential Revision: http://reviews.llvm.org/D22329 llvm-svn: 275363	2016-07-14 02:51:41 +00:00
Lang Hames	928a47733c	[Object] Re-apply r275316 now that I have the corresponding LLD patch ready. llvm-svn: 275361	2016-07-14 02:24:01 +00:00
Nico Weber	d4fcb8d00e	Teach fast isel about thiscall (and callee-pop) calls. http://reviews.llvm.org/D22315 llvm-svn: 275360	2016-07-14 01:52:51 +00:00
Mehdi Amini	eabbdab4f5	[Scalarizer] PR28108: Skip over nullptr rather than crashing on it. Summary: In Scalarizer::gather we see if we already have a scattered form of Op, and in that case use the new form. In the particular case of PR28108, the found ValueVector SV has size 2, where the first Value is nullptr, and the second is indeed a proper Value. The nullptr then caused an assert to blow when we tried to do cast<Instruction>(SV[I]). With this patch we check SV[I] before doing the cast, and if it's nullptr we just skip over it. I don't know the Scalarizer well enough to know if this is the best fix or if something should be done else where to prevent the nullptr from being in the ValueVector at all, but at least this avoids the crash and looking at the test case output it looks reasonable. Reviewers: hfinkel, frasercrmck, wala, mehdi_amini Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21518 llvm-svn: 275359	2016-07-14 01:31:25 +00:00
Mehdi Amini	df0d548af9	Add missing test for r275347 "[IPRA] Set callee saved registers to none for local function when IPRA is enabled." llvm-svn: 275358	2016-07-14 01:31:20 +00:00
Davide Italiano	2a3337612a	[SCCP] Generalize tryToReplaceInstWithConstant to work also with arguments. llvm-svn: 275357	2016-07-14 01:27:29 +00:00
Matthias Braun	9fb93ea49d	MIRParser: Fix MIRParser not reporting nullptr on error. While some code paths in MIRParserImpl::parse() already returned nullptr in case of error one of the important ones did not. llvm-svn: 275355	2016-07-14 00:42:37 +00:00
Adrian Prantl	0973b9cea7	Synchronize LLVM and clang's ObjCDeclSpec::ObjCPropertyAttributeKind. This adds Clang-specific DWARF constants for nullability and ObjC class properties that are already generated by clang. This patch adds dwarfdump support and a more comprehensive testcase. <rdar://problem/27335745> llvm-svn: 275354	2016-07-14 00:41:18 +00:00
Lang Hames	40892552d6	[Object] Revert r275316, Archive::child_iterator changes, while I update lld. Should fix the bots broken by r275316. llvm-svn: 275353	2016-07-14 00:37:04 +00:00
David Majnemer	774e517b3f	[ConstantFolding] Fold masked loads We can constant fold a masked load if the operands are appropriately constant. Differential Revision: http://reviews.llvm.org/D22324 llvm-svn: 275352	2016-07-14 00:29:50 +00:00
Justin Lebar	2ce62bc5b9	Force a semicolon at the end of the LLVM_ENABLE_BITMASK_ENUMS_IN_NAMESPACE() macro. This silences a warning about an extra semicolon on gcc. llvm-svn: 275349	2016-07-13 23:52:19 +00:00
Mehdi Amini	86d853e8c7	Add EnableIPRA to TargetOptions, and move the cl::opt -enable-ipra to TargetMachine.cpp Avoid exposing a cl::opt in a public header and instead promote this option in the API. Alternatively, we could land the cl::opt in CommandFlags.h so that it is available to every tool, but we would still have to find an option for clang. llvm-svn: 275348	2016-07-13 23:39:46 +00:00
Mehdi Amini	b1cf766b75	[IPRA] Set callee saved registers to none for local function when IPRA is enabled. IPRA try to optimize caller saved register by propagating register usage information from callee to caller so it is beneficial to have caller saved registers compare to callee saved registers when IPRA is enabled. Please find more detailed explanation here https://groups.google.com/d/msg/llvm-dev/XRzGhJ9wtZg/tjAJqb0eEgAJ. This change makes local function do not have any callee preserved register when IPRA is enabled. A simple test case is also added to verify this change. Patch by Vivek Pandya <vivekvpandya@gmail.com> Differential Revision: http://reviews.llvm.org/D21561 llvm-svn: 275347	2016-07-13 23:39:34 +00:00
Sanjoy Das	2064e9f3b5	[JumpThreading] Delete commented out debug code; NFC llvm-svn: 275346	2016-07-13 23:33:20 +00:00
David Majnemer	fbdab24d99	[ConstantFolding] Extend FoldReinterpretLoadFromConstPtr to handle negative offsets Treat loads which clip before the start of a global initializer the same way we treat clipping beyond the end of the initializer: use zeros. llvm-svn: 275345	2016-07-13 23:33:07 +00:00
David Majnemer	daa2aa3ab7	Move a transform from InstCombine to InstSimplify. This transform doesn't require any new instructions, it can safely live in InstSimplify. llvm-svn: 275344	2016-07-13 23:32:53 +00:00
Michael Kuperstein	6651aa114a	Fix copy/paste bug in r275340. llvm-svn: 275343	2016-07-13 23:28:00 +00:00
Matthias Braun	245961151a	MIRParser: Move SlotMapping and SourceMgr refs to PFS; NFC Code cleanup: Move references to SlotMapping and SourceMgr into the PerFunctionMIParsingState to avoid unnecessary passing around in parameters. llvm-svn: 275342	2016-07-13 23:27:50 +00:00
Michael Kuperstein	2c81cfa081	[DAG] Correctly chain masked loads If a masked loads is not added to the chain, it should not reset the chain's root. This fixes the remaining part of PR28515. llvm-svn: 275340	2016-07-13 23:23:40 +00:00
Davide Italiano	4f8e022a6f	[SCCP] Have the logic for replacing insts with constant in a single place. The code was pretty much copy-pasted between SCCP and IPSCCP. The situation became clearly worse after I introduced the support for folding structs in SCCP. This commit is NFC as we currently (still) skip the replacement step in IPSCCP, but I'll change this soon. llvm-svn: 275339	2016-07-13 23:20:04 +00:00
Vedant Kumar	8860ce57c8	[Coverage] Return an ArrayRef to avoid copies (NFC) llvm-svn: 275338	2016-07-13 23:12:26 +00:00
Vedant Kumar	8c4c293a47	[Coverage] Mark a few methods const (NFC) llvm-svn: 275337	2016-07-13 23:12:23 +00:00
Adam Nemet	6f46258567	[LAA] Don't hold on to DominatorTree in the analysis result llvm-svn: 275335	2016-07-13 22:36:35 +00:00
Adam Nemet	81d02fda32	[LAA] Don't hold on to TargetLibraryInfo in the analysis result llvm-svn: 275334	2016-07-13 22:36:27 +00:00
Quentin Colombet	fbb869fff0	[MIR] Fix one GlobalISel test case that I missed in r275314. llvm-svn: 275333	2016-07-13 22:35:33 +00:00
Justin Lebar	77030ff94e	[MI] Clean up some loops over MachineInstr::memoperands(). NFC Use range-based for loops and llvm::any_of instead of explicit iterators. llvm-svn: 275332	2016-07-13 22:35:19 +00:00
Justin Lebar	aa40b7778e	[MI] Fix MachineInstr::isInvariantLoad. Summary: Previously it would say we had an invariant load if any of the memory operands were invariant. But the load should be invariant only if all the memory operands are invariant. No testcase because this has proven to be very difficult to tickle in practice. As just one example, ARM's ldrd instruction, which loads 64 bits into two 32-bit regs, is theoretically affected by this. But when it's produced, it loses its memoperands' invariance bits! Reviewers: jfb Subscribers: llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D22318 llvm-svn: 275331	2016-07-13 22:34:50 +00:00
Matthias Braun	7b1359c14c	MIRParser: Move MachineFunction reference into PFS; NFC Code cleanup: The PerFunctionMIParsingState is per function, moving a reference into PFS we can avoid passing around the MachineFunction in an extra parameter most of the time. Also change most signatures to consistently pass PFS reference first. llvm-svn: 275329	2016-07-13 22:23:23 +00:00
Matthias Braun	e72900785a	MIRYamlMapping: Update stale comment llvm-svn: 275328	2016-07-13 22:23:19 +00:00
Nico Weber	d3d545c30b	Add a triple to fix test on bots after 275320. llvm-svn: 275327	2016-07-13 22:19:40 +00:00
Adam Nemet	459477ff4f	[LAA] Don't hold on to DataLayout in the analysis result In fact, don't even pass this to the ctor since we can get it from the module. llvm-svn: 275326	2016-07-13 22:18:51 +00:00
Adam Nemet	328f634357	[LAA] Don't hold on to LoopInfo in the analysis result llvm-svn: 275325	2016-07-13 22:18:48 +00:00
Adam Nemet	0606c4dcdf	[LAA] Don't hold on to AliasAnalysis in the analysis result llvm-svn: 275322	2016-07-13 21:39:09 +00:00
Vedant Kumar	4b0b398802	[llvm-cov] Use a thread pool to speed up report generation (NFC) It's safe to print out source coverage views using multiple threads when using the -output-dir mode of the `llvm-cov show` sub-command. While testing this on my development machine, I observed that the speed up is roughly linear with the number of available cores. Avg. time for `llvm-cov show ./llvm-as -show-line-counts-or-regions`: 1 thread: 7.79s user 0.33s system 98% cpu 8.228 total 4 threads: 7.82s user 0.34s system 283% cpu 2.880 total llvm-svn: 275321	2016-07-13 21:38:36 +00:00
Nico Weber	7706680906	Fix a TODO in X86CallFrameOptimization to not rely on a codegen artifact. This happens to make X86CallFrameOptimization in -O0 / FastISel builds as well, but it's not clear if the pass should run in that setup. http://reviews.llvm.org/D22314 llvm-svn: 275320	2016-07-13 21:38:27 +00:00
Teresa Johnson	426a2a4730	Mark the textual headers in the module map for ProfileData Follow on to r275312. llvm-svn: 275319	2016-07-13 21:27:51 +00:00
Alina Sbirlea	d8cf49126f	Extended LoadStoreVectorizer to vectorize subchains. Summary: LSV used to abort vectorizing a chain for interleaved load/store accesses that alias. Allow a valid prefix of the chain to be vectorized, mark just the prefix and retry vectorizing the remaining chain. Reviewers: llvm-commits, jlebar, arsenm Subscribers: mzolotukhin Differential Revision: http://reviews.llvm.org/D22119 llvm-svn: 275317	2016-07-13 21:20:01 +00:00

1 2 3 4 5 ...

134974 Commits