llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Craig Topper	9e447ff84f	[X86] Custom legalize v2i32 gathers via widening rather than promoting. The default legalization for v2i32 is promotion to v2i64. This results in a gather that reads 64-bit elements rather than 32. If one of the elements is near a page boundary this can cause an illegal access that can fault. We also miscalculate the scale for the gather which is an even worse problem, but we probably could have found a separate way to fix that. llvm-svn: 319521	2017-12-01 06:02:02 +00:00
Craig Topper	8c871357e5	[X86][SelectionDAG] Make sure we explicitly sign extend the index when type promoting the index of scatter and gather. Type promotion makes no guarantee about the contents of the promoted bits. Since the gather/scatter instruction will use the bits to calculate addresses, we need to ensure they aren't garbage. llvm-svn: 319520	2017-12-01 06:02:00 +00:00
Craig Topper	be27f97ee7	[X86] Add another v2i32 gather test case with v2i64 index that wasn't sign extended. llvm-svn: 319519	2017-12-01 06:01:59 +00:00
Craig Topper	dfead84969	[X86] Add a DAG combine to simplify masks for AVX2 gather instructions. AVX2 gathers only use the upper bit of the mask allowing us to simplify sign_extend_inreg to a shift left. llvm-svn: 319514	2017-12-01 02:49:07 +00:00
Adam Nemet	e923ea8387	[cmake] Expose opt-viewer availability This will be used in https://github.com/apple/swift/pull/12938 llvm-svn: 319511	2017-12-01 01:44:26 +00:00
Sam Clegg	d4848399d4	[WebAssembly] Update MC tests now that hidden attr is supported Summary: Support was added in rL319488 but these tests were not updated. Subscribers: jfb, dschuff, jgravelle-google, aheejin, sunfish Differential Revision: https://reviews.llvm.org/D40693 llvm-svn: 319510	2017-12-01 01:18:47 +00:00
Jake Ehrlich	cf73d03a1b	Add flag to ArchiveWriter to test GNU64 format more efficiently Even with the sparse file optimizations the SYM64 test can still be painfully slow. This unnecessarily slows down devs. It's critical that we test that the switch to the SYM64 format occurs at 4GB but there isn't any better of a way to fake the size of the file than sparse files. This change introduces a flag that allows the cutoff to be arbitrarily set to whatever power of two is desired. The flag is hidden as it really isn't meant to be used outside this one test. This is unfortunate but appears necessary, at least until the average hard drive is much faster. The changes to the test require some explanation. Prior to this change we knew that the SYM64 format was being used because the file was simply too large to have validly handled this case if the SYM64 format were not used. To ensure that the SYM64 format is still being used I am grepping the file for "SYM64". Without changing the filename however this would be pointless because "SYM64" would occur in the file either way. So the filename of the test is also changed in order to avoid this issue. Differential Revision: https://reviews.llvm.org/D40632 llvm-svn: 319507	2017-12-01 00:54:28 +00:00
Zachary Turner	dec9bd8187	Mark all library options as hidden. These command line options are not intended for public use, and often don't even make sense in the context of a particular tool anyway. About 90% of them are already hidden, but when people add new options they forget to hide them, so if you were to make a brand new tool today, link against one of LLVM's libraries, and run tool -help you would get a bunch of junk that doesn't make sense for the tool you're writing. This patch hides these options. The real solution is to not have libraries defining command line options, but that's a much larger effort and not something I'm prepared to take on. Differential Revision: https://reviews.llvm.org/D40674 llvm-svn: 319505	2017-12-01 00:53:10 +00:00
Hans Wennborg	098be60f25	docs/GettingStarted.rst: Update the list of release versions and tags llvm-svn: 319502	2017-11-30 23:47:30 +00:00
Matt Arsenault	556bc5681a	AMDGPU: Use carry-less adds in FI elimination llvm-svn: 319501	2017-11-30 23:42:30 +00:00
Peter Collingbourne	6a66a26be3	ThinLTOBitcodeWriter: Try harder to discard unused references to the merged module. If the thin module has no references to an internal global in the merged module, we need to make sure to preserve that property if the global is a member of a comdat group, as otherwise promotion can end up adding global symbols to the comdat, which is not allowed. This situation can arise if the external global in the thin module has dead constant users, which would cause use_empty() to return false and would cause us to try to promote it. To prevent this from happening, discard the dead constant users before asking whether a global is empty. Differential Revision: https://reviews.llvm.org/D40593 llvm-svn: 319494	2017-11-30 23:05:52 +00:00
Zachary Turner	78c986c998	Simplify the DenseSet used for hashing CodeView records. This was storing the hash alongside the key so that the hash doesn't need to be re-computed every time, but in doing so it was allocating a structure to keep the key size small in the DenseMap. This is a noble goal, but it also leads to a pointer indirection on every probe, and the cost of this pointer indirection ends up being higher than the cost of having a slightly larger entry in the hash table. Removing this not only simplifies the code, but yields a small but noticeable performance improvement in the type merging algorithm. llvm-svn: 319493	2017-11-30 23:00:30 +00:00
Matt Arsenault	515e95accf	AMDGPU: Use gfx9 carry-less add/sub instructions llvm-svn: 319491	2017-11-30 22:51:26 +00:00
Reid Kleckner	caef969e5d	XOR the frame pointer with the stack cookie when protecting the stack Summary: This strengthens the guard and matches MSVC. Reviewers: hans, etienneb Subscribers: hiraditya, JDevlieghere, vlad.tsyrklevich, llvm-commits Differential Revision: https://reviews.llvm.org/D40622 llvm-svn: 319490	2017-11-30 22:41:21 +00:00
Sam Clegg	34662923fe	Add visibility flag to Wasm symbol flags The LLVM "hidden" flag needs to be passed through the Wasm intermediate objects in order for the linker to apply it to the final Wasm object. The corresponding change in LLD is here: https://github.com/WebAssembly/lld/pull/14 Patch by Nicholas Wilson Differential Revision: https://reviews.llvm.org/D40442 llvm-svn: 319488	2017-11-30 22:34:58 +00:00
Dan Gohman	1f05adbb2a	[memcpyopt] Commit file missed in r319482. This change was meant to be included with r319482 but was accidentally omitted. llvm-svn: 319483	2017-11-30 22:13:13 +00:00
Dan Gohman	41a3f0d702	[memcpyopt] Teach memcpyopt to optimize across basic blocks This teaches memcpyopt to make a non-local memdep query when a local query indicates that the dependency is non-local. This notably allows it to eliminate many more llvm.memcpy calls in common Rust code, often by 20-30%. Fixes PR28958. Differential Revision: https://reviews.llvm.org/D38374 llvm-svn: 319482	2017-11-30 22:10:53 +00:00
Davide Italiano	695d68f1d2	[InlineCost] Prefer getFunction() to two calls to getParent(). Improves clarity, also slightly cheaper. NFCI. llvm-svn: 319481	2017-11-30 22:10:35 +00:00
Shoaib Meenai	bb1b6ae244	[llvm] Add stripped installation targets CMake's generated installation scripts support `CMAKE_INSTALL_DO_STRIP` to enable stripping the installed binaries. LLVM's build system doesn't expose this option to the `install-` targets, but it's useful in conjunction with `install-distribution`. Add a new function to create the install targets, which creates both the regular install target and a second install target that strips during installation. Change the creation of all installation targets to use this new function. Stripping doesn't make a whole lot of sense for some installation targets (e.g. the LLVM headers), but consistency doesn't hurt. I'll make other repositories (e.g. clang, compiler-rt) use this in a follow-up, and then add an `install-distribution-stripped` target to actually accomplish the end goal of creating a stripped distribution. I don't want to do that step yet because the creation of that target would depend on the presence of the `install-*-stripped` target for each distribution component, and the distribution components from other repositories will be missing that target right now. Differential Revision: https://reviews.llvm.org/D40620 llvm-svn: 319480	2017-11-30 21:48:26 +00:00
Krzysztof Parzyszek	977257a5c0	[Hexagon] Implement HexagonSubtarget::useAA() llvm-svn: 319477	2017-11-30 21:25:28 +00:00
Krzysztof Parzyszek	5df605e346	[Hexagon] Fix wrong check in test/CodeGen/Hexagon/newvaluejump-solo.mir llvm-svn: 319476	2017-11-30 21:23:19 +00:00
Daniel Sanders	f43958d13f	[globalisel][tablegen] Add support for relative AtomicOrderings No test yet because the relevant rules are blocked on the atomic_load, and atomic_store nodes. llvm-svn: 319475	2017-11-30 21:05:59 +00:00
Krzysztof Parzyszek	aed6ab89e9	[Hexagon] Fix wrong pass in testcase llvm-svn: 319471	2017-11-30 20:39:15 +00:00
Krzysztof Parzyszek	aafdd52306	[Hexagon] Solo instructions cannot be used with new value jumps llvm-svn: 319470	2017-11-30 20:32:54 +00:00
Yaxun Liu	b450e44d98	[AMDGPU] Convert test/tools/llvm-objdump/AMDGPU/source-lines.ll to amdgiz Differential Revision: https://reviews.llvm.org/D40653 llvm-svn: 319469	2017-11-30 20:27:56 +00:00
Craig Topper	096e040fb2	[X86] Promote i8 CTPOP to i32 instead of i16 when we have the POPCNT instruction. The 32-bit version is shorter to encode and the zext we emit for the promotion is likely going to be a 32-bit zero extend anyway. llvm-svn: 319468	2017-11-30 20:15:31 +00:00
Jake Ehrlich	ba962c5889	[llvm-objcopy] Add support for --only-keep/-j and --keep This change adds support for the --only-keep option and the -j alias as well. A common use case for these being used together is to dump a specific section's data. Additionally the --keep option is added (GNU objcopy doesn't have this) to avoid removing a bunch of things. This allows people to err on the side of stripping aggressively and then to keep the specific bits that they need for their application. Differential Revision: https://reviews.llvm.org/D39021 llvm-svn: 319467	2017-11-30 20:14:53 +00:00
Daniel Sanders	fe50afca94	[aarch64][globalisel] Legalize G_ATOMIC_CMPXCHG_WITH_SUCCESS and G_ATOMICRMW_* G_ATOMICRMW_* is generally legal on AArch64. The exception is G_ATOMICRMW_NAND. G_ATOMIC_CMPXCHG_WITH_SUCCESS needs to be lowered to G_ATOMIC_CMPXCHG with an external comparison. Note that IRTranslator doesn't generate these instructions yet. llvm-svn: 319466	2017-11-30 20:11:42 +00:00
Amara Emerson	2e2a00eadf	[GlobalISel][IRTranslator] Fix crash during translation of zero sized loads/stores/args/returns. This fixes PR35358. rdar://35619533 Differential Revision: https://reviews.llvm.org/D40604 llvm-svn: 319465	2017-11-30 20:06:02 +00:00
Xinliang David Li	c9f71e7ca6	[PGO] Skip counter promotion for infinite loops Differential Revision: http://reviews.llvm.org/D40662 llvm-svn: 319462	2017-11-30 19:16:25 +00:00
Michal Gorny	f0ba71f375	[cmake] Include project name in Sphinx doctree dir to fix race conditions Modify add_sphinx_target() to include the project name alongside builder in Sphinx doctree directory. This aims to avoid crashes due to race conditions between multiple Sphinx instances running in parallel that attempt to create or read that directory simultaneously. This problem has originally been addressed in r283188. However, that commit presumed that there will be only one target per builder being run. However, r314863 introduced a second manpage target, reintroducing the race condition. Differential Revision: https://reviews.llvm.org/D40656 llvm-svn: 319461	2017-11-30 19:09:22 +00:00
Daniel Sanders	dd6b6d4d27	[globalisel][tablegen] Add support for specific immediates in the match pattern This enables a few rules such as ARM's uxtb instruction. llvm-svn: 319457	2017-11-30 18:48:35 +00:00
Zachary Turner	62abbe245b	Split TypeTableBuilder into two classes. llvm-svn: 319456	2017-11-30 18:39:50 +00:00
Zachary Turner	df13ea0927	[llvm-readobj] Fix mismatched line endings llvm-svn: 319453	2017-11-30 18:33:34 +00:00
Dan Gohman	258c56b7e6	[WebAssembly] Revert r319186 "Support bitcasted function addresses with varargs." The patch broke Emscripten's EM_ASM macros, which utilize unprototyped functions. See https://bugs.llvm.org/show_bug.cgi?id=35385 for details. llvm-svn: 319452	2017-11-30 18:16:49 +00:00
Francis Visoiu Mistrih	7990c4a3be	[MIR] Fix DebugInfo tests after r319445 llvm-svn: 319447	2017-11-30 16:48:53 +00:00
Francis Visoiu Mistrih	cd4ff3e8fc	[CodeGen] Always use `printReg` to print registers in both MIR and debug output As part of the unification of the debug format and the MIR format, always use `printReg` to print all kinds of registers. Updated the tests using '_' instead of '%noreg' until we decide which one we want to be the default one. Differential Revision: https://reviews.llvm.org/D40421 llvm-svn: 319445	2017-11-30 16:12:24 +00:00
Igor Laevsky	68cbe780e4	[FuzzMutate] Bail out from injecting into empty basic blocks. In rare cases we can receive a request to inject into a completely empty basic block. In the normal case all basic blocks contain at least a terminator instruction, but it is possible that the only instruction is a catchpad instruction, which is not part of the instruction iterator. This case seems rare enough to not care about it. Submitting without review, since it seems almost NFC. I couldn't come up with any reasonable way to test this. llvm-svn: 319444	2017-11-30 15:41:58 +00:00
Igor Laevsky	0cdade5391	[FuzzMutate] Correctly handle vector types in the insertvalue operation Differential Revision: https://reviews.llvm.org/D40397 llvm-svn: 319442	2017-11-30 15:31:13 +00:00
Igor Laevsky	d529c1dd96	[FuzzMutate] Don't use index operands as sinks Differential Revision: https://reviews.llvm.org/D40396 llvm-svn: 319441	2017-11-30 15:29:16 +00:00
Igor Laevsky	c9d7c56f40	[FuzzMutate] Pick correct index for the insertvalue instruction Differential Revision: https://reviews.llvm.org/D40395 llvm-svn: 319440	2017-11-30 15:26:48 +00:00
Igor Laevsky	5feb1b9cc3	[FuzzMutate] Don't create load as a new source if it doesn't match with the descriptor Differential Revision: https://reviews.llvm.org/D40394 llvm-svn: 319439	2017-11-30 15:24:41 +00:00
Igor Laevsky	0061865148	[FuzzMutate] Don't crash when we can't remove instruction from empty function Differential Revision: https://reviews.llvm.org/D40393 llvm-svn: 319438	2017-11-30 15:07:38 +00:00
Sanjay Patel	a05b38588b	[LangRef] clarify semantics of the frem instruction As noted in D40594, the frem instruction corresponds to fmod() except that it can't set errno. I modified the text that we currently use for intrinsics that map to libm functions and applied it to frem. Differential Revision: https://reviews.llvm.org/D40629 llvm-svn: 319437	2017-11-30 14:59:03 +00:00
Alexey Bataev	0f33074407	[InstCombine] Additional test for PR35354, NFC. llvm-svn: 319436	2017-11-30 14:33:58 +00:00
Nemanja Ivanovic	3f9ad6b478	[PowerPC] Recommit r314244 with refactoring and off by default This re-commits everything that was pulled in r314244. The transformation is off by default (patch to enable it to follow). The code is refactored to have a single entry-point and provide fine-grained control over patterns that it selects. This patch also fixes the bugs in the original code. Everything that failed with the original patch has been re-tested with this patch (with the transformation turned on). So the patch to turn this on is soon to follow. Differential Revision: https://reviews.llvm.org/D38575 llvm-svn: 319434	2017-11-30 13:39:10 +00:00
Simon Pilgrim	57159f9108	[X86][AVX512] Tag fcmp/ptest/ternlog instructions scheduler classes llvm-svn: 319433	2017-11-30 13:18:06 +00:00
Simon Pilgrim	e892896417	[X86][AVX512] Regenerate avx512 schedule tests llvm-svn: 319432	2017-11-30 13:09:21 +00:00
Sean Eveson	d3fdef109a	[MC] Function stack size section. Re applying after fixing issues in the diff, sorry for any painful conflicts/merges! Original RFC: http://lists.llvm.org/pipermail/llvm-dev/2017-August/117028.html This change adds a '.stack-size' section containing metadata on function stack sizes to output ELF files behind the new -stack-size-section flag. The section contains pairs of function symbol references (8 byte) and stack sizes (unsigned LEB128). The contents of this section can be used to measure changes to stack sizes between different versions of the compiler or a source base. The advantage of having a section is that we can extract this information when examining binaries that we didn't build, and it allows users and tools easy access to that information just by referencing the binary. There is a follow up change to add an option to clang. Thanks. Reviewers: hfinkel, MatzeB Reviewed By: MatzeB Subscribers: thegameg, asb, llvm-commits Differential Revision: https://reviews.llvm.org/D39788 llvm-svn: 319430	2017-11-30 13:05:14 +00:00
Sean Eveson	b9a62958c9	Revert r319423: [MC] Function stack size section. I messed up the diff. llvm-svn: 319429	2017-11-30 12:43:25 +00:00
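
The `.stack-size` layout described in d3fdef109a (pairs of an 8-byte function symbol reference and an unsigned LEB128 stack size) can be illustrated with a small decoder. This is a minimal sketch, not LLVM's code: the names `decode_uleb128` and `parse_stack_sizes` are hypothetical, the 8-byte field is assumed to be little-endian and is treated as a plain, already-resolved address, and the sample bytes are made up for illustration.

```python
import struct


def decode_uleb128(data, offset):
    """Decode one unsigned LEB128 value; return (value, next_offset)."""
    result = 0
    shift = 0
    while True:
        byte = data[offset]
        offset += 1
        # The low 7 bits carry payload; the high bit marks continuation.
        result |= (byte & 0x7F) << shift
        if byte & 0x80 == 0:
            return result, offset
        shift += 7


def parse_stack_sizes(section):
    """Split raw section bytes into (address, stack_size) pairs.

    Assumption: each entry is an 8-byte little-endian address followed
    by a ULEB128-encoded stack size, with no padding between entries.
    """
    entries = []
    offset = 0
    while offset < len(section):
        (address,) = struct.unpack_from("<Q", section, offset)
        offset += 8
        size, offset = decode_uleb128(section, offset)
        entries.append((address, size))
    return entries


# Made-up sample: two functions with stack sizes 24 and 300 bytes.
# 300 needs two LEB128 bytes: 0xAC (low 7 bits + continuation), 0x02.
sample = (
    struct.pack("<Q", 0x401000) + b"\x18"
    + struct.pack("<Q", 0x401040) + b"\xac\x02"
)

for address, size in parse_stack_sizes(sample):
    print(f"{address:#x}: {size} bytes")
```

Because the size is variable-length, entries cannot be indexed directly; a reader has to walk the section from the start, as the loop above does.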


157379 Commits