llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Dan Gohman	8f3e512189	[WebAssembly] Update cfg-stackify.ll to remove the workaround added in r318288. Remove -switch-peel-threshold=100 and update the expected results in test10 in cfg-stackify.ll. llvm-svn: 318338	2017-11-15 21:38:33 +00:00
Evandro Menezes	b81553263f	[AArch64] Refactor the loads and stores optimizer Move remaining inline matching of instructions of some optimizations into separate functions, like in the other optimizations. Otherwise, NFC. Differential revision: https://reviews.llvm.org/D40090 llvm-svn: 318335	2017-11-15 21:06:22 +00:00
Rafael Espindola	df81e5d550	Simplify file handling in dsymutil. This moves the file handling out of DwarfLinker.cpp. This fixes what is at least an oddity if not a bug. DwarfLinker.cpp was using ToolOutputFile, which uses RemoveFileOnSignal. The issue is that dsymutil.cpp uses that too. It is now clear from the interface that only dsymutil.cpp is responsible for creating and deleting files. llvm-svn: 318334	2017-11-15 20:55:53 +00:00
Craig Topper	95d3c686a5	[X86] Add some explanatory comments to the ProcessorFeatures enum in Host.cpp. llvm-svn: 318331	2017-11-15 20:42:49 +00:00
Craig Topper	b6b01e9936	[X86] Add a return to the end of a switch to prevent an accidental fallthrough in the future. llvm-svn: 318330	2017-11-15 20:42:47 +00:00
Jake Ehrlich	20e541caa9	[llvm-objcopy] Change -O binary to respect section removal and behave like GNU objcopy The original -O binary implementation just copied segment data from the object and dumped it into a file. This doesn't take into account any operations performed on objects such as section removal. GNU objcopy has some specific behavior that we'd also like to respect. For instance using -O binary and -j <some_section> will dump <some_section> to a file. This change implements GNU objcopy style -O binary to as close of an approximation as I can determine. Differential Revision: https://reviews.llvm.org/D39713 llvm-svn: 318324	2017-11-15 19:13:31 +00:00
Sanjay Patel	d0c3452c1c	[InstCombine] trunc (binop X, C) --> binop (trunc X, C') Note that one-use and shouldChangeType() are checked ahead of the switch. Without the narrowing folds, we can produce inferior vector code as shown in PR35299: https://bugs.llvm.org/show_bug.cgi?id=35299 llvm-svn: 318323	2017-11-15 19:12:01 +00:00
Rafael Espindola	80b0b1837b	Use TempFile in lto caching. This requires a small change to TempFile: allowing a discard after a failed keep. With this the cache now handles signals and reuses a fd instead of reopening the file. llvm-svn: 318322	2017-11-15 19:09:22 +00:00
Sean Fertile	88680a74ac	[PowerPC] Implement mayBeEmittedAsTailCall for PPC Implements TargetLowering callback 'mayBeEmittedAsTailCall' that enables CodeGenPrepare to duplicate returns when they might enable a tail-call. Differential Revision: https://reviews.llvm.org/D39777 llvm-svn: 318321	2017-11-15 18:58:27 +00:00
Reid Kleckner	b527562cd4	[InstCombine] Salvage debug info during initial DCE InstCombine salvages debug info for every instruction it erases from its worklist, but it wasn't doing it during its initial DCE when populating its worklist. This fixes that. This should help improve availability of 'this' in optimized debug info when casts are necessary. llvm-svn: 318320	2017-11-15 18:51:12 +00:00
Sanjay Patel	1d82d52db7	[InstCombine] add tests for missing trunc folds; NFC As noted in PR35299: https://bugs.llvm.org/show_bug.cgi?id=35299 ...this is likely the root cause for a mis-vectorization transform. llvm-svn: 318319	2017-11-15 18:09:43 +00:00
Vedant Kumar	969fd3b155	[docs] Mention opt -metarenamer in the bugpoint docs Thanks to arsenm and davide for the suggestion! llvm-svn: 318318	2017-11-15 18:05:19 +00:00
Evandro Menezes	afc0e3dbbc	[AArch64] Adjust the cost model for Exynos M1 and M2 Fix the modeling of loads and stores using the pre or post indexed addressing modes. llvm-svn: 318312	2017-11-15 17:39:37 +00:00
Simon Pilgrim	04a7c8e081	[X86] Add CBW/CDQ/CDQE/CQO/CWD/CWDE to WriteALU schedule class Some CPUs are already overriding these sign extension instructions but we should be able to use the WriteALU schedule class by default. Differential Revision: https://reviews.llvm.org/D39899 llvm-svn: 318308	2017-11-15 17:11:24 +00:00
Adam Nemet	01d159b6a1	[SLP] Added more missed optimization remarks Summary: Added more remarks to SLP pass, in particular "missed" optimization remarks. Also proposed several tests for new functionality. Patch by Vladimir Miloserdov! For reference you may look at: https://reviews.llvm.org/rL302811 Reviewers: anemet, fhahn Reviewed By: anemet Subscribers: javed.absar, lattner, petecoup, yakush, llvm-commits Differential Revision: https://reviews.llvm.org/D38367 llvm-svn: 318307	2017-11-15 17:04:53 +00:00
Sean Fertile	71ec02692c	[PowerPC] Split out the tailcall calling convention checks. NFC. Move the calling convention checks for tail-call eligibility for the 64-bit SysV ABI into a separate function. This is so that it can be shared with 'mayBeEmittedAsTailCall' in a subsequent change. llvm-svn: 318305	2017-11-15 16:53:41 +00:00
Sanjay Patel	852b9d1ff8	[PassManager, SimplifyCFG] add test for PR34603 / D38566; NFC This is a recommit of r316908 which was reverted by r317444. llvm-svn: 318300	2017-11-15 16:37:30 +00:00
Sanjay Patel	dc0444504e	[(new) Pass Manager] instantiate SimplifyCFG with the same options as the old PM This is a recommit of r316869 which was speculatively reverted with r317444 and subsequently shown to not be the cause of PR35210. That crash should be fixed after r318237. Original commit message: The old PM sets the options of what used to be known as "latesimplifycfg" on the instantiation after the vectorizers have run, so that's what we'redoing here. FWIW, there's a later SimplifyCFGPass instantiation in both PMs where we do not set the "late" options. I'm not sure if that's intentional or not. Differential Revision: https://reviews.llvm.org/D39407 llvm-svn: 318299	2017-11-15 16:33:11 +00:00
Sanjay Patel	7b98bb7dd7	[Reassociate] simplify code; NFCI llvm-svn: 318298	2017-11-15 16:19:17 +00:00
Sander de Smalen	ffa5657a2b	[AArch64][SVE] Asm: Report SVE parsing diagnostics only once Summary: Prevent an issue where a diagnostic is reported multiple times by bailing out with a ParseFail if an invalid SVE register element qualifier/suffix is specified, for example: <stdin>:10:18: error: invalid sve vector kind qualifier add z20.h, z2.h, z31.x ^ <stdin>:10:18: error: invalid sve vector kind qualifier add z20.h, z2.h, z31.x ... <stdin>:10:18: error: invalid sve vector kind qualifier add z20.h, z2.h, z31.x ^ Reviewers: fhahn, rengolin Reviewed By: rengolin Subscribers: aemerson, javed.absar, tschuett, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D39894 llvm-svn: 318297	2017-11-15 15:44:43 +00:00
Petar Jovanovic	0557171790	[mips] Improve genConstMult() to work with arbitrary precision APInt is now used instead of uint64_t in function genConstMult() allowing multiplication optimizations with constants of arbitrary length. Patch by Milos Stojanovic. Differential Revision: https://reviews.llvm.org/D38130 llvm-svn: 318296	2017-11-15 15:24:04 +00:00
Igor Laevsky	865a89cfe5	[llvm-opt-fuzzer] Add opt fuzzer to the test-depends list. This should help with the buildbot failures after rL318293. llvm-svn: 318295	2017-11-15 15:07:37 +00:00
Igor Laevsky	06ce871e09	[llvm-opt-fuzzer] Only run tests for the x86 target. This fixes build bot failures after rL318293. llvm-svn: 318294	2017-11-15 13:35:42 +00:00
Igor Laevsky	1025a5098b	[llvm-opt-fuzzer] NFC. Add sanity tests. llvm-svn: 318293	2017-11-15 12:36:57 +00:00
Momchil Velikov	653fa328bc	[ARM] Split Arm jump table branch into i12 and rs suffixed versions This is a refactoring/cleanup of Arm `addrmode2` operand class. The patch removes it completely. Differential Revision: https://reviews.llvm.org/D39832 llvm-svn: 318291	2017-11-15 12:02:55 +00:00
Jonas Devlieghere	982b8cd3d9	[DebugInfo] Fix potential CU mismatch for SubprogramScopeDIEs. In constructAbstractSubprogramScopeDIE there can be a potential mismatch between `this` and the CU of ContextDIE when a scope is shared between two DISubprograms belonging to a different CU. In that case, `this` is the CU that was specified in the IR, but the CU of ContextDIE is that of the first subprogram that was emitted. This patch fixes the mismatch by looking up the CU of ContextDIE, and switching to use that. This fixes PR35212 (https://bugs.llvm.org/show_bug.cgi?id=35212) Patch by Philip Craig! Differential revision: https://reviews.llvm.org/D39981 llvm-svn: 318289	2017-11-15 10:57:05 +00:00
Ilya Biryukov	87bda90599	Workaround CodeGen/WebAssembly/cfg-stackify.ll failure after r318202 By disabling the introduced optimization. llvm-svn: 318288	2017-11-15 10:50:43 +00:00
Mikael Holmen	37eaf65cb9	[Lint] Don't warn about passing alloca'd value to tail call if using byval Summary: This fixes PR35241. When using byval, the data is effectively copied as part of the call anyway, so the pointer returned by the alloca will not be leaked to the callee and thus there is no reason to issue a warning. Reviewers: rnk Reviewed By: rnk Subscribers: Ka-Ka, llvm-commits Differential Revision: https://reviews.llvm.org/D40009 llvm-svn: 318279	2017-11-15 07:46:48 +00:00
Craig Topper	4b336f3aac	[X86] Redefine the 128-bit version of VPGATHERQD and VGATHERQPS to use a VK2 mask instead of a VK4 mask. This allows us to remove extra extend creation during lowering and more accurately reflects the semantics of the instruction. While there add an extra output VT to X86 masked gather node to better match the isel pattern predicate. Currently we're exploiting the fact that the isel table doesn't count how many output results a node actually has if the result type of any can be inferred from the first result and the type constraints defined in tablegen. I think we might ultimately want to lower all MGATHER/MSCATTER to an X86ISD node with the extra mask result and stop relying on this hole in the isel checking. llvm-svn: 318278	2017-11-15 07:46:43 +00:00
NAKAMURA Takumi	74b15a31b3	GISelWorkList.h: Fix -fmodules build in rL318210. llvm-svn: 318275	2017-11-15 07:34:35 +00:00
NAKAMURA Takumi	b7162520e4	Fix llvm/test/Transforms/LoopRotate/pr35210.ll in rL318237, it uses debug options. llvm-svn: 318273	2017-11-15 06:46:58 +00:00
Fangrui Song	6567b563fc	NFC Remove default argument of DataLayout::getPointerABIAlignment Differential Revision: https://reviews.llvm.org/D40005 llvm-svn: 318272	2017-11-15 06:17:32 +00:00
Craig Topper	368d3224be	[X86] Add getHostCPUName support for the Gemini Lake model number which also uses Goldmont. llvm-svn: 318271	2017-11-15 06:02:43 +00:00
Craig Topper	cd296f22b1	[X86] Add getHostCPUName support for cannonlake. This adds an explicit model number check and fallback path to the unknown family 6 detection. llvm-svn: 318270	2017-11-15 06:02:42 +00:00
Craig Topper	dcd7058011	[InstCombine] Simplify binops that are only used by a select and are fed by a select with the same condition. Summary: This patch optimizes a binop sandwiched between 2 selects with the same condition. Since we know its only used by the select we can propagate the appropriate input value from the earlier select. As I'm writing this I realize I may need to avoid doing this for division in case the select was protecting a divide by zero? Reviewers: spatel, majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39999 llvm-svn: 318267	2017-11-15 05:23:02 +00:00
Hiroshi Inoue	4ee809496b	[PowerPC] fix up in redundant compare elimination This patch fixes a potential problem in my previous commit (https://reviews.llvm.org/rL312514) by introducing an additional check. llvm-svn: 318266	2017-11-15 04:23:26 +00:00
Vedant Kumar	2aae3ca7a2	[docs] Document a way to simplify names in bugpoint output llvm-svn: 318257	2017-11-15 02:58:45 +00:00
Matt Arsenault	141501039d	AMDGPU: Add separate definitions for DS insts without m0 use llvm-svn: 318246	2017-11-15 01:34:06 +00:00
Craig Topper	c7a40b93be	[X86] Correct the spelling of pentiumpro in X86TargetParser.def Thanks to Erich Keane for spotting this. llvm-svn: 318243	2017-11-15 01:01:50 +00:00
Matt Arsenault	f225ab4cc5	AMDGPU: Don't use MUBUF vaddr if address may overflow Effectively revert r263964. Before we would not allow this if vaddr was not known to be positive. llvm-svn: 318240	2017-11-15 00:45:43 +00:00
Hans Wennborg	4937b695da	Revert r318193 "[SLPVectorizer] Failure to beneficially vectorize 'copyable' elements in integer binary ops." It crashes building sqlite; see reply on the llvm-commits thread. > [SLPVectorizer] Failure to beneficially vectorize 'copyable' elements in integer binary ops. > > Patch tries to improve vectorization of the following code: > > void add1(int * __restrict dst, const int * __restrict src) { > dst++ = src++; > dst++ = src++ + 1; > dst++ = src++ + 2; > dst++ = src++ + 3; > } > Allows to vectorize even if the very first operation is not a binary add, but just a load. > > Fixed issues related to previous commit. > > Reviewers: spatel, mzolotukhin, mkuper, hfinkel, RKSimon, filcab, ABataev > > Reviewed By: ABataev, RKSimon > > Subscribers: llvm-commits, RKSimon > > Differential Revision: https://reviews.llvm.org/D28907 llvm-svn: 318239	2017-11-15 00:38:13 +00:00
Mitch Phillips	1bb0b7dfdf	[cfi-verify] Validate there are no register clobbers between CFI-check and instruction execution. Summary: This patch adds another failure mode for `validateCFIProtection(..)`, wherein any register that affects the indirect control flow instruction is clobbered to between the CFI-check and the instruction's execution. Also includes a modification to make MCInstrDesc::hasDefOfPhysReg public. Reviewers: vlad.tsyrklevich Reviewed By: vlad.tsyrklevich Subscribers: llvm-commits, pcc, kcc Differential Revision: https://reviews.llvm.org/D39820 llvm-svn: 318238	2017-11-15 00:35:26 +00:00
Craig Topper	4e0cd496e5	[LoopRotate] processLoop should return true even if it just simplified the loop latch without making any other changes Simplifying a loop latch changes the IR and we need to make sure the pass manager knows to invalidate analysis passes if that happened. PR35210 discovered a case where we failed to invalidate the post dominator tree after this simplification because we no changes other than simplifying the loop latch. Fixes PR35210. Differential Revision: https://reviews.llvm.org/D40035 llvm-svn: 318237	2017-11-15 00:22:42 +00:00
Evgeniy Stepanov	0f4fe8b8dd	[asan] Prevent rematerialization of &__asan_shadow. Summary: In the mode when ASan shadow base is computed as the address of an external global (__asan_shadow, currently on android/arm32 only), regalloc prefers to rematerialize this value to save register spills. Even in -Os. On arm32 it is rather expensive (2 loads + 1 constant pool entry). This changes adds an inline asm in the function prologue to suppress this behavior. It reduces AsanTest binary size by 7%. Reviewers: pcc, vitalybuka Subscribers: aemerson, kristof.beyls, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40048 llvm-svn: 318235	2017-11-15 00:11:51 +00:00
Vedant Kumar	4d07842a77	[PGO] Bump the indexed profile format version Differential Revision: https://reviews.llvm.org/D39447 llvm-svn: 318228	2017-11-14 23:56:48 +00:00
Petr Hosek	39a6141afc	[CMake][runtimes] Don't process common options in runtimes build This is no longer needed for any of the runtimes build and it breaks in case we don't have the working compiler yet, e.g. when building a compiler that uses compiler-rt and libc++ as a default runtime, because these common options check whether these are available. Differential Revision: https://reviews.llvm.org/D39932 llvm-svn: 318227	2017-11-14 23:56:05 +00:00
Craig Topper	6f595ad139	[X86] Fix the parameter order in the default implementation of X86_VENDOR macro in X86TargetParser.def The default implementation doesn't do anything so the order doesn't matter, but good for cleanliness. llvm-svn: 318226	2017-11-14 23:54:28 +00:00
Petr Hosek	c5c523800e	[CMake][runtimes] Set compiler as working even for default target Even when building builtins and runtimes for the default target we shouldn't assume that the just built compiler is already useable. When the compiler uses compiler-rt and libc++ as the default runtime and C++ library, it won't be usable until we finish building runtimes. Differential Revision: https://reviews.llvm.org/D39715 llvm-svn: 318224	2017-11-14 23:47:20 +00:00
Matt Arsenault	66ee956d8d	AMDGPU: Handle or in multi-use shl ptr combine llvm-svn: 318223	2017-11-14 23:46:42 +00:00
Hans Wennborg	3cc4cef33b	Fix switch-lower-peel-top-case.ll isel pass is not registered error The test was doing -stop-after=isel, but that pass is actually the AMDGPUDAGToDAGISel pass, which might not be built when targeting x86_64. This changes the test to -stop-after=expand-isel-pseudos instead. Follow-up to r318202. llvm-svn: 318220	2017-11-14 23:30:28 +00:00

1 2 3 4 5 ...

156689 Commits