llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	e9a179168b	[X86][SSE41] Removed pblendw intrinsics tests - they are auto-upgraded Equivalent tests included in sse41-intrinsics-x86-upgrade.ll - the i8/i32 immediate diff doesn't matter anymore llvm-svn: 270767	2016-05-25 21:27:58 +00:00
Peter Collingbourne	2576e701e4	Move whole-program virtual call optimization pass after function attribute inference in LTO pipeline. As a result of D18634 we no longer infer certain attributes on linkonce_odr functions at compile time, and may only infer them at LTO time. The readnone attribute in particular is required for virtual constant propagation (part of whole-program virtual call optimization) to work correctly. This change moves the whole-program virtual call optimization pass after the function attribute inference passes, and enables the attribute inference passes at opt level 1, so that virtual constant propagation has a chance to work correctly for linkonce_odr functions. Differential Revision: http://reviews.llvm.org/D20643 llvm-svn: 270765	2016-05-25 21:26:14 +00:00
Simon Pilgrim	0da4a2c2e6	[X86][SSE41] Regenerated intrinsics tests llvm-svn: 270764	2016-05-25 21:21:51 +00:00
Ahmed Bougacha	b3c9ba99bf	[TLI] Also cover Linux 64 libfunc (stat64, ...) prototype checking. My script missed those in r270750. llvm-svn: 270763	2016-05-25 21:16:33 +00:00
Simon Pilgrim	5142359484	[X86][SSE41] Removed blendpd/blendps intrinsics tests - they are auto-upgraded Equivalent tests included in sse41-intrinsics-x86-upgrade.ll llvm-svn: 270761	2016-05-25 21:06:36 +00:00
Sanjay Patel	2d03d11d93	fix typo; NFC llvm-svn: 270760	2016-05-25 21:03:31 +00:00
Mehdi Amini	4f44ce6392	ValueMaterializer: rename materializeDeclFor() to materialize() It may materialize a declaration, or a definition. The name could be misleading. This is following a merge of materializeInitFor() into materializeDeclFor(). Differential Revision: http://reviews.llvm.org/D20593 llvm-svn: 270759	2016-05-25 21:03:21 +00:00
Mehdi Amini	ad6c68d47a	ValueMaterializer: fuse materializeDeclFor and materializeInitFor (NFC) They were originally separated to handle the co-recursion between the ValueMapper and the ValueMaterializer. This recursion does not exist anymore: the ValueMapper now uses a Worklist and the ValueMaterializer is scheduling job on the Worklist. Differential Revision: http://reviews.llvm.org/D20593 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 270758	2016-05-25 21:01:51 +00:00
Mehdi Amini	638f8c1e34	IRLinker: fix double scheduling of mapping a global value because of an alias This test was hitting an assertion in the value mapper because the IRLinker was trying to map two times @A while materializing the initializer for @C. Fix http://llvm.org/PR27850 Differential Revision: http://reviews.llvm.org/D20586 llvm-svn: 270757	2016-05-25 21:00:44 +00:00
Simon Pilgrim	0792d32b20	[X86][AVX2] Regenerate avx2 vector shift tests llvm-svn: 270756	2016-05-25 21:00:40 +00:00
Mike Aizatsky	a54a714ed7	[libfuzzer] replacing unittest for truncate_units with functional test. Differential Revision: http://reviews.llvm.org/D20641 llvm-svn: 270755	2016-05-25 21:00:17 +00:00
Simon Pilgrim	2acf33db0b	Simplify std::all_of/any_of predicates by using llvm::all_of/any_of. NFCI. llvm-svn: 270753	2016-05-25 20:41:11 +00:00
Zachary Turner	493dc32ae8	[codeview] Move StreamInterface and StreamReader to libcodeview. We have need to reuse this functionality, including making additional generic stream types that are smarter about how and when they copy memory versus referencing the original memory. So all of these structures belong in the common library rather than being pdb specific. llvm-svn: 270751	2016-05-25 20:37:03 +00:00
Ahmed Bougacha	15451fb9fc	[TLI] Fix NumParams==0 prototype checking typo. There was a typo in r267758. It caused invalid accesses when given something like "void @free(...)", as NumParams == 0, and we then try to look at the 0th parameter. Turns out, most of these were untested; add both attribute and missing-prototype checks for all libc libfuncs. Differential Revision: http://reviews.llvm.org/D20543 llvm-svn: 270750	2016-05-25 20:22:45 +00:00
Simon Pilgrim	0af11671d9	Simplify std::all_of predicate (to one line) by using llvm::all_of. NFCI. llvm-svn: 270749	2016-05-25 20:17:39 +00:00
Simon Pilgrim	7248b1ae50	Simplify std::all_of predicate (to one line) by using llvm::all_of. NFCI. llvm-svn: 270747	2016-05-25 20:13:39 +00:00
Rafael Espindola	2224908028	Fix shouldAssumeDSOLocal for private linkage. llvm-svn: 270746	2016-05-25 19:55:16 +00:00
Kostya Serebryany	d0ab64cb17	[libFuzzer] document the proposed FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION llvm-svn: 270744	2016-05-25 18:41:53 +00:00
Reid Kleckner	31bb5b2278	[IR] Copy comdats in GlobalObject::copyAttributesFrom This is probably correct for all uses except cross-module IR linking, where we need to move the comdat from the source module to the destination module. Fixes PR27870. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D20631 llvm-svn: 270743	2016-05-25 18:36:22 +00:00
Zachary Turner	8c7c869714	[llvm-pdbdump] Dump raw stream contents as binary block. Dumping it as ASCII makes it fairly useless. llvm-svn: 270742	2016-05-25 18:32:07 +00:00
Matt Arsenault	4a9e67f4ef	TableGen: Use StringRef instead of std::string llvm-svn: 270741	2016-05-25 18:07:40 +00:00
Matt Arsenault	1f6cee6a4f	AMDGPU: Fix v2i64/v2f64 bitcasts These operations tend to get promoted away to v4i32 so this doesn't happen often. llvm-svn: 270740	2016-05-25 18:07:36 +00:00
Matt Arsenault	ea952af911	AMDGPU: Fix missing br_cc i1 test coverage Also un xfail a test. llvm-svn: 270739	2016-05-25 17:58:27 +00:00
Chad Rosier	9f1f557fac	[SelectionDAG] Add smarts for BSWAP in computeKnownBits. llvm-svn: 270738	2016-05-25 17:52:38 +00:00
Matt Arsenault	d17f030a97	AMDGPU: Make vectorization defeating test changes Simplifies test updates in the future. llvm-svn: 270736	2016-05-25 17:42:39 +00:00
Davide Italiano	8471a2a052	[PM] CorrelatedValuePropagation: pass state to function. NFCI. While here, convert the logic of the pass to use static function(s). This is in preparation for porting this pass to the new PM. llvm-svn: 270734	2016-05-25 17:39:54 +00:00
Matt Arsenault	e21e61958d	AMDGPU: Fix inconsistent lowering of select of vectors f32 vectors would use a sequence of BFI instructions instead of unrolled cmp + select. This was better in the case of a VALU select with SGPR inputs, but we don't have a way of dealing with that in the DAG. llvm-svn: 270731	2016-05-25 17:34:58 +00:00
Sanjay Patel	e582594538	[x86] avoid code explosion from LoopVectorizer for gather loop (PR27826) By making pointer extraction from a vector more expensive in the cost model, we avoid the vectorization of a loop that is very likely to be memory-bound: https://llvm.org/bugs/show_bug.cgi?id=27826 There are still bugs related to this, so we may need a more general solution to avoid vectorizing obviously memory-bound loops when we don't have HW gather support. Differential Revision: http://reviews.llvm.org/D20601 llvm-svn: 270729	2016-05-25 17:27:54 +00:00
Xinliang David Li	e9df93e237	Use new triple API to check if comdat is supported llvm-svn: 270727	2016-05-25 17:17:51 +00:00
Xinliang David Li	e9cf4364da	Add a new helper API in triple /NFC llvm-svn: 270726	2016-05-25 17:11:31 +00:00
Chris Bieneman	d8d2780ae0	[obj2yaml] [yaml2obj] MachO support for rebase opcodes This is the first bit of support for MachO __LINKEDIT segment data. llvm-svn: 270724	2016-05-25 17:09:07 +00:00
Chris Bieneman	2b1c02aa22	[CMake] LINK_LIBS need to be public for Darwin dylib targets This should actually address PR27855. This results in adding references to the system libs inside generated dylibs so that they get correctly pulled in when linking against the dylib. llvm-svn: 270723	2016-05-25 17:08:43 +00:00
Tim Shen	198e5cb8a0	Move and add comments to the top for tailcall-string-rvo.ll Differential Revision: http://reviews.llvm.org/D20311 llvm-svn: 270722	2016-05-25 17:01:09 +00:00
Hal Finkel	c1f6823ee0	[SDAG] Add a fallback multiplication expansion LegalizeIntegerTypes does not have a way to expand multiplications for large integer types (i.e. larger than twice the native bit width). There's no standard runtime call to use in that case, and so we'd just assert. Unfortunately, as it turns out, it is possible to hit this case from standard-ish C code in rare cases. A particular case a user ran into yesterday involved an __int128 induction variable and a loop with a quadratic (not linear) recurrence which triggered some backend logic using SCEVExpander. In this case, the BinomialCoefficient code in SCEV generates some i129 variables, which get widened to i256. At a high level, this is not actually good (i.e. the underlying optimization, PPCLoopPreIncPrep, should not be transforming the loop in question for performance reasons), but regardless, the backend shouldn't crash because of cost-modeling issues in the optimizer. This is a straightforward implementation of the multiplication expansion, based on the algorithm in Hacker's Delight. I validated it against the code for the mul256b function from http://locklessinc.com/articles/256bit_arithmetic/ using random inputs. There should be no functional change for previously-working code (the new expansion code only replaces an assert). Fixes PR19797. llvm-svn: 270720	2016-05-25 16:50:22 +00:00
Teresa Johnson	de4624ad69	[ThinLTO] Fix test check prefix so that intended prefix tested There aren't any checks with prefix PROMOTE, should be PROMOTE_MOD1 which wasn't being tested (but works as expected). llvm-svn: 270719	2016-05-25 16:45:08 +00:00
Sanjay Patel	289425eb9f	[x86, AVX] allow explicit calls to VZERO* to modify state in VZeroUpperInserter pass (PR27823) As noted in the review, there are still problems, so this doesn't the bug completely. Differential Revision: http://reviews.llvm.org/D20529 llvm-svn: 270718	2016-05-25 16:39:47 +00:00
Lang Hames	0a4d39f9bf	[RuntimeDyld] Call the SymbolResolver::findSymbolInLogicalDylib method when searching for external symbols, and fall back to the SymbolResolver::findSymbol method if the former returns null. This makes RuntimeDyld behave more like a static linker: Symbol definitions from within the current module's "logical dylib" will be preferred to external definitions. We can build on this behavior in the future to properly support weak symbol handling. Custom symbol resolvers that override the findSymbolInLogicalDylib method may notice changes due to this patch. Clients who have not overridden this method should generally be unaffected, however users of the OrcMCJITReplacement class may notice changes. llvm-svn: 270716	2016-05-25 16:23:59 +00:00
Chad Rosier	8544c01533	Clarify that we match BSwap in InstCombine and BitReverse in CGP. NFC. Also, rename recognizeBitReverseOrBSwapIdiom to recognizeBSwapOrBitReverseIdiom, so the ordering of the MatchBSwaps and MatchBitReversals arguments are consistent with the function name. llvm-svn: 270715	2016-05-25 16:22:14 +00:00
Simon Pilgrim	a16bac494b	[X86][AVX] Sync with clang/test/CodeGen/avx2-builtins.c Only tests for the gather intrinsic are still to be added llvm-svn: 270710	2016-05-25 15:30:08 +00:00
Teresa Johnson	655b4d9f20	[ThinLTO] Refactor ODR resolution and internalization (NFC) Move the now index-based ODR resolution and internalization routines out of ThinLTOCodeGenerator.cpp and into either LTO.cpp (index-based analysis) or FunctionImport.cpp (index-driven optimizations). This is to enable usage by other linkers. llvm-svn: 270698	2016-05-25 14:03:11 +00:00
Oleg Ranevskyy	34bf60ca68	[SCEV] No-wrap flags are not propagated when folding "{S,+,X}+T ==> {S+T,+,X}" Summary: Description This makes `WidenIV::widenIVUse` (IndVarSimplify.cpp) fail to widen narrow IV uses in some cases. The latter affects IndVarSimplify which may not eliminate narrow IV's when there actually exists such a possibility, thereby producing ineffective code. When `WidenIV::widenIVUse` gets a NarrowUse such as `{(-2 + %inc.lcssa),+,1}<nsw><%for.body3>`, it first tries to get a wide recurrence for it via the `getWideRecurrence` call. `getWideRecurrence` returns recurrence like this: `{(sext i32 (-2 + %inc.lcssa) to i64),+,1}<nsw><%for.body3>`. Then a wide use operation is generated by `cloneIVUser`. The generated wide use is evaluated to `{(-2 + (sext i32 %inc.lcssa to i64))<nsw>,+,1}<nsw><%for.body3>`, which is different from the `getWideRecurrence` result. `cloneIVUser` sees the difference and returns nullptr. This patch also fixes the broken LLVM tests by adding missing <nsw> entries introduced by the correction. Minimal reproducer: ``` int foo(int a, int b, int c); int baz(); void bar() { int arr[20]; int i = 0; for (i = 0; i < 4; ++i) arr[i] = baz(); for (; i < 20; ++i) arr[i] = foo(arr[i - 4], arr[i - 3], arr[i - 2]); } ``` Clang command line: ``` clang++ -mllvm -debug -S -emit-llvm -O3 --target=aarch64-linux-elf test.cpp -o test.ir ``` Expected result: The ` -mllvm -debug` log shows that all the IV's for the second `for` loop have been eliminated. Reviewers: sanjoy Subscribers: atrick, asl, aemerson, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D20058 llvm-svn: 270695	2016-05-25 13:01:33 +00:00
Renato Golin	bde30e2034	[AArch64] Adding a TargetParser for AArch64 There's already a ARMTargetParser,now adding a similar one for aarch64. so we can use it to do ARCH/CPU/FPU parsing in clang and llvm, instead of string comparison. Patch by Jojo Ma. llvm-svn: 270687	2016-05-25 12:02:33 +00:00
Simon Pilgrim	e806c3471c	[X86][AVX2] Added more fast-isel tests to match clang/test/CodeGen/avx2-builtins.c llvm-svn: 270685	2016-05-25 10:56:23 +00:00
Simon Pilgrim	eb7d07a957	[X86][AVX2] Begun adding fast-isel tests to match clang/test/CodeGen/avx2-builtins.c llvm-svn: 270683	2016-05-25 10:15:06 +00:00
Simon Pilgrim	b4a440d8b8	[X86][SSE2] Use storeu intrinsics for _mm_storeu_pd/_mm_storeu_pd tests Also fixed name of _mm_store1_pd test llvm-svn: 270681	2016-05-25 09:42:29 +00:00
Simon Pilgrim	4950d6bb8d	[X86][SSE] Use storeu intrinsics for _mm_storeu_ps test llvm-svn: 270680	2016-05-25 09:28:06 +00:00
Simon Pilgrim	1a1ddc32da	[X86][SSE] Replace (V)CVTDQ2PD(Y) and (V)CVTPS2PD(Y) lossless conversion intrinsics with generic IR Followup to D20528 clang patch, this removes the (V)CVTDQ2PD(Y) and (V)CVTPS2PD(Y) llvm intrinsics and auto-upgrades to sitofp/fpext instead. Differential Revision: http://reviews.llvm.org/D20568 llvm-svn: 270678	2016-05-25 08:59:18 +00:00
Craig Topper	4710ab1424	[X86] Remove the llvm.x86.sse2.storel.dq intrinsic. It hasn't been used in a long time. llvm-svn: 270677	2016-05-25 06:56:32 +00:00
Gerolf Hoflehner	1352c2ef4f	[Support] Reapply cleanup r270643 llvm-svn: 270674	2016-05-25 06:23:45 +00:00
David Majnemer	3c41824d16	[FunctionAttrs] Volatile loads should disable readonly A volatile load has side effects beyond what callers expect readonly to signify. For example, it is not safe to reorder two function calls which each perform a volatile load to the same memory location. llvm-svn: 270671	2016-05-25 05:53:04 +00:00

1 2 3 4 5 ...

132348 Commits