llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Pavel Labath	306bb4c72a	DWARFDebugLoclists: Make it possible to read relocated addresses Summary: Handling relocations was not needed when the loclists section was a DWO-only thing. But since DWARF5, it is possible to use it in regular objects too, and the standard permits embedding addresses into the section directly. These addresses need to be relocated in unlinked files. Reviewers: JDevlieghere, dblaikie, probinson Subscribers: aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68271	2019-11-05 10:21:39 +01:00
Sjoerd Meijer	20d14524c6	Recommit "[HardwareLoops] Optimisation remarks" With a few things fixed: - initialisaiton of the optimisation remark pass (this was causing the buildbot failures on PPC), - a test case. Differential Revision: https://reviews.llvm.org/D69660	2019-11-05 09:06:22 +00:00
David Green	7fddf6fd7d	[AArch64] Update test checks on merge-store-dependency.ll. NFC	2019-11-05 09:01:47 +00:00
aqjune	974677b856	[IR] Remove switch's default block that causes clang 8 raise error	2019-11-05 16:31:51 +09:00
Craig Topper	8db440d712	[X86] Lower the cost of avx512 horizontal bool and/or reductions to 2*log2(bitwidth)+1 for legal types. This better represents the kshift+binop we'd get for each stage before the final extract. Its likely we'll do even better by doing a kmov and a cmp with a GPR, but this is a good start. The default handling was costing a worst case single source permute shuffle of the vector before the binop. This worst case assumes the shuffle might have to be emulated with extracts and inserts. But since we know we're doing a reduction we can assume we'll get kshift lowering. There's still some room for improvement here, but this is much better than it was.	2019-11-04 22:58:04 -08:00
aqjune	37bbfa1895	[IR] Add Freeze instruction Summary: - Define Instruction::Freeze, let it be UnaryOperator - Add support for freeze to LLLexer/LLParser/BitcodeReader/BitcodeWriter The format is `%x = freeze <ty> %v` - Add support for freeze instruction to llvm-c interface. - Add m_Freeze in PatternMatch. - Erase freeze when lowering IR to SelDag. Reviewers: deadalnix, hfinkel, efriedma, lebedev.ri, nlopes, jdoerfert, regehr, filcab, delcypher, whitequark Reviewed By: lebedev.ri, jdoerfert Subscribers: jfb, kristof.beyls, hiraditya, lebedev.ri, steven_wu, dexonsmith, xbolva00, delcypher, spatel, regehr, trentxintong, vsk, filcab, nlopes, mehdi_amini, deadalnix, llvm-commits Differential Revision: https://reviews.llvm.org/D29011	2019-11-05 15:54:56 +09:00
Yonghong Song	4e15497cad	[BPF] fix a use after free bug Commit fff2721286e1 ("[BPF] Fix CO-RE bugs with bitfields") fixed CO-RE handling bitfield issues. But the implementation introduced a use after free bug. The "Base" of the intrinsic might be freed so later on accessing the Type of "Base" might access the freed memory. The failed test case, CodeGen/BPF/CORE/offset-reloc-middle-chain.ll is exactly used to test such a case. Similarly to previous attempt to remember Metadata etc, remember "Base" pointee Alignment in advance to avoid such use after free bug.	2019-11-04 22:20:23 -08:00
Craig Topper	74c9f1897a	[X86] Teach X86MCInstLower to swap operands of commutable instructions to enable 2-byte VEX encoding. Summary: The 2 source operands commutable instructions are encoded in the VEX.VVVV field and the r/m field of the MODRM byte plus the VEX.B field. The VEX.B field is missing from the 2-byte VEX encoding. If the VEX.VVVV source is 0-7 and the other register is 8-15 we can swap them to avoid needing the VEX.B field. This works as long as the VEX.W, VEX.mmmmm, and VEX.X fields are also not needed. Fixes PR36706. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68550	2019-11-04 22:07:46 -08:00
aqjune	2c7365ed77	Fix clone_constant_impl to correctly deal with null pointers Summary: This patch resolves llvm-c-test's following error ``` LLVM ERROR: LLVMGetValueKind returned incorrect type ``` which arises when the input bitcode contains a null pointer. Reviewers: jdoerfert, CodaFi, deadalnix Reviewed By: jdoerfert Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68928	2019-11-05 13:53:52 +09:00
Devin Coughlin	902d726973	[analyzer] Add test directory for scan-build. The static analyzer's scan-build script is critical infrastructure but is not well tested. To start to address this, add a new test directory under tests/Analysis for scan-build lit tests and seed it with several tests. The goal is that future scan-build changes will be accompanied by corresponding tests. Differential Revision: https://reviews.llvm.org/D69781	2019-11-04 20:26:35 -08:00
Yonghong Song	1b8e92c220	[BPF] Fix CO-RE bugs with bitfields bitfield handling is not robust with current implementation. I have seen two issues as described below. Issue 1: struct s { long long f1; char f2; char b1:1; } *p; The current approach will generate an access bit size 56 (from b1 to the end of structure) which will be rejected as it is not power of 2. Issue 2: struct s { char f1; char b1:3; char b2:5; char b3:6: char b4:2; char f2; }; The LLVM will group 4 bitfields together with 2 bytes. But loading 2 bytes is not correct as it violates alignment requirement. Note that sometimes, LLVM breaks a large bitfield groups into multiple groups, but not in this case. To resolve the above two issues, this patch takes a different approach. The alignment for the structure is used to construct the offset of the bitfield access. The bitfield incurred memory access is an aligned memory access with alignment/size equal to the alignment of the structure. This also simplified the code. This may not be the optimal memory access in terms of memory access width. But this should be okay since extracting the bitfield value will have the same amount of work regardless of what kind of memory access width. Differential Revision: https://reviews.llvm.org/D69837	2019-11-04 20:08:05 -08:00
Vedant Kumar	73d1a7c36c	[CGDebugInfo] Emit subprograms for decls when AT_tail_call is understood Currently, clang emits subprograms for declared functions when the target debugger or DWARF standard is known to support entry values (DW_OP_entry_value & the GNU equivalent). Treat DW_AT_tail_call the same way to allow debuggers to follow cross-TU tail calls. Pre-patch debug session with a cross-TU tail call: ``` * frame #0: 0x0000000100000fa4 main`target at b.c:4:3 [opt] frame #1: 0x0000000100000f99 main`main at a.c:8:10 [opt] ``` Post-patch (note that the tail-calling frame, "helper", is visible): ``` * frame #0: 0x0000000100000fa4 main`target at b.c:4:3 [opt] frame #1: 0x0000000100000f80 main`helper [opt] [artificial] frame #2: 0x0000000100000f99 main`main at a.c:8:10 [opt] ``` rdar://46577651 Differential Revision: https://reviews.llvm.org/D69743	2019-11-04 15:14:24 -08:00
Evandro Menezes	912bf78b87	[AArch64] Update for Exynos Fix the costs of integer division.	2019-11-04 16:21:28 -06:00
Sam Clegg	e3f7791629	Add more binutils tools to LLVM_INSTALL_TOOLCHAIN_ONLY target Also add the aliases for these tools so that LLVM_INSTALL_BINUTILS_SYMLINKS and LLVM_INSTALL_TOOLCHAIN_ONLY can work together. Differential Revision: https://reviews.llvm.org/D69635	2019-11-04 14:11:23 -08:00
Stanislav Mekhanoshin	c9ad88cc28	[AMDGPU] Added assert in SIFoldOperands before ptr use. NFC.	2019-11-04 13:31:21 -08:00
Stanislav Mekhanoshin	7b89eceb5c	[AMDGPU] deduplicate tablegen predicates We are duplicating predicates if several parts of the combined predicate list contain the same condition. Added code to deduplicate the list. We have AssemblerPredicates and AssemblerPredicate in the PredicateControl, but we never use AssemblerPredicates with an actual list, so this one is dropped. This addresses the first part of the llvm bug 43886: https://bugs.llvm.org/show_bug.cgi?id=43886 Differential Revision: https://reviews.llvm.org/D69815	2019-11-04 12:19:17 -08:00
Erik Pilkington	93e350e524	[demangle] NFC: get rid of NodeOrString This class was a bit overengineered, and was triggering some PVS warnings. Instead, put strings into a NameType and let clients unconditionally treat it as a Node.	2019-11-04 12:17:12 -08:00
Craig Topper	7555ade6d7	[X86] Add support for -mvzeroupper and -mno-vzeroupper to match gcc -mvzeroupper will force the vzeroupper insertion pass to run on CPUs that normally wouldn't. -mno-vzeroupper disables it on CPUs where it normally runs. To support this with the default feature handling in clang, we need a vzeroupper feature flag in X86.td. Since this flag has the opposite polarity of the fast-partial-ymm-or-zmm-write we used to use to disable the pass, we now need to add this new flag to every CPU except KNL/KNM and BTVER2 to keep identical behavior. Remove -fast-partial-ymm-or-zmm-write which is no longer used. Differential Revision: https://reviews.llvm.org/D69786	2019-11-04 11:03:54 -08:00
Philip Reames	e6018c48b2	[SimplifyCFG] Use a (trivially) dominanting widenable branch to remove later slow path blocks This transformation is a variation on the GuardWidening transformation we have checked in as it's own pass. Instead of focusing on merge (i.e. hoisting and simplifying) two widenable branches, this transform makes the observation that simply removing a second slowpath block (by reusing an existing one) is often a very useful canonicalization. This may lead to later merging, or may not. This is a useful generalization when the intermediate block has loads whose dereferenceability is hard to establish. As noted in the patch, this can be generalized further, and will be. Differential Revision: https://reviews.llvm.org/D69689	2019-11-04 11:03:28 -08:00
Sanjay Patel	cd1ba33289	[DAGCombine][MSP430] use shift amount threshold in DAGCombine (2/2) Continuation of: D69116 Contributes to a fix for PR43559: https://bugs.llvm.org/show_bug.cgi?id=43559 See also D69099 and D69116 Use the TLI hook in DAGCombine.cpp to guard against creating shift nodes that are not optimal for a target. Patch by: @joanlluch (Joan LLuch) Differential Revision: https://reviews.llvm.org/D69120	2019-11-04 13:41:41 -05:00
Julian Lettner	b26b6f0aa3	[lit] Move measurement of testing time out of Run.execute	2019-11-04 10:16:24 -08:00
Julian Lettner	726e2ac362	[lit] Better/earlier errors when no tests are executed Fail early, when we discover no tests at all, or filter out all of them.	2019-11-04 10:16:24 -08:00
Simon Pilgrim	4865d2bec1	[X86] Fix uninitialized variable warnings. NFCI.	2019-11-04 17:24:35 +00:00
Simon Pilgrim	ab7cb1bbdc	VirtualFileSystem - fix uninitialized variable warnings. NFCI.	2019-11-04 17:24:35 +00:00
Simon Pilgrim	5f01a34336	createMCObjectStreamer - fix uninitialized variable warning. NFCI.	2019-11-04 17:24:34 +00:00
Simon Pilgrim	6eb000e647	MCDwarfFile::DirIndex - fix uninitialized variable warning. NFCI.	2019-11-04 17:24:34 +00:00
LLVM GN Syncbot	23303b3019	gn build: Merge 40d0d4e2335	2019-11-04 17:20:23 +00:00
Oliver Stannard	8a93903a08	Fix static analysis warnings in ARM calling convention lowering Fixes https://bugs.llvm.org/show_bug.cgi?id=43891	2019-11-04 17:17:55 +00:00
Jinsong Ji	90c7b69420	Lower generic MASSV entries to PowerPC subtarget-specific entries This patch (second of two patches) lowers the generic PowerPC vector entries to PowerPC subtarget-specific entries. For instance, the PowerPC generic entry 'cbrtd2_massv' is lowered to 'cbrtd2_P9' or Power9 subtarget. The first patch enables the vectorizer to recognize the IBM MASS vector library routines. This patch specifically adds support for recognizing the '-vector-library=MASSV' option, and defines mappings from IEEE standard scalar math functions to generic PowerPC MASS vector counterparts. For instance, the generic PowerPC MASS vector entry for double-precision 'cbrt' function is '__cbrtd2_massv' The overall support for MASS vector library is presented as such in two patches for ease of review. Patch by pjeeva01 (Jeeva P.) Differential Revision: https://reviews.llvm.org/D59883	2019-11-04 17:17:24 +00:00
Amy Huang	68fcd6e209	Recommit "[CodeView] Add option to disable inline line tables." This reverts commit 004ed2b0d1b86d424643ffc88fce20ad8bab6804. Original commit hash 6d03890384517919a3ba7fe4c35535425f278f89 Summary: This adds a clang option to disable inline line tables. When it is used, the inliner uses the call site as the location of the inlined function instead of marking it as an inline location with the function location. https://reviews.llvm.org/D67723	2019-11-04 09:15:26 -08:00
Ulrich Weigand	2676807e81	[FPEnv][SelectionDAG] Refactor strict FP node construction Small refactoring in visitConstrainedFPIntrinsic that should make it easier to create DAG nodes requiring extra arguments. That is the case currently only for STRICT_FP_ROUND, but may be the case for additional nodes (in particular compares) in the future. Extracted from the patch for D69281. NFC.	2019-11-04 17:45:54 +01:00
Jonas Paulsson	fcb5e0f825	Fix buildbots troubled by b7b170c. Add '# REQUIRES: systemz-registered-target' in the new tests.	2019-11-04 16:54:33 +01:00
Alexey Bataev	4fe440ba6d	[SLP]Fix PR43799: Crash on different sizes of GEP indices. Summary: If the GEP instructions are going to be vectorized, the indices in those GEP instructions must be of the same type. Otherwise, the compiler may crash when trying to build the vector constant. Reviewers: RKSimon, spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69627	2019-11-04 10:36:26 -05:00
Simon Pilgrim	5f3402b593	[X86] Convert ShrinkMode to scoped enum class. NFCI.	2019-11-04 15:35:20 +00:00
Simon Pilgrim	7c495ec4d4	AliasSetTracker - fix uninitialized variable warnings. NFCI.	2019-11-04 15:35:20 +00:00
Thomas Preud'homme	da638ff6f7	[test] Use system locale for mri-utf8.test Summary: llvm-ar's mri-utf8.test test relies on the en_US.UTF-8 locale to be installed for its last RUN line to work. If not installed, the unicode string gets encoded (interpreted) as ascii which fails since the most significant byte is non zero. This commit changes the test to only rely on the system being able to encode the pound sign in its default encoding (e.g. UTF-16 for Microsoft Windows) by always opening the file via input/output redirection. This avoids forcing a given locale to be present and supported. A Byte Order Mark is also added to help recognizing the encoding of the file and its endianness. Finally the XFAIL on system-darwin is removed since the test actually passes fine on Mac OS X and XFAIL was only added because it failed before. Reviewers: gbreynoo, MaskRay, rupprecht, JamesNagurne, jfb Subscribers: dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68472	2019-11-04 15:25:45 +00:00
Jonas Paulsson	707c981178	[SystemZ] Use LivePhysRegs instead of isCCLiveOut() in SystemZElimCompare.cpp Review: Ulrich Weigand https://reviews.llvm.org/D68267	2019-11-04 16:22:00 +01:00
Jonas Paulsson	1fa33bad39	[MachineVerifier] Improve verification of live-in lists. MachineVerifier::visitMachineFunctionAfter() is extended to check the live-through case for live-in lists. This is only done for registers without aliases and that are neither allocatable or reserved, such as the SystemZ::CC register. The MachineVerifier earlier only catched the case of a live-in use without an entry in the live-in list (as "using an undefined physical register"). A comment in LivePhysRegs.h has been added stating a guarantee that addLiveOuts() can be trusted for a full register both before and after register allocation. Review: Quentin Colombet https://reviews.llvm.org/D68267	2019-11-04 16:22:00 +01:00
Simon Pilgrim	ca9d43c062	[X86] Regenerate known-signbits-vector.ll tests. Use X86 instead of X32 and add a common CHECK prefix	2019-11-04 15:12:01 +00:00
David Green	08a8645125	[ARM] Use isFMAFasterThanFMulAndFAdd for MVE The Arm backend will usually return false for isFMAFasterThanFMulAndFAdd, where both the fused VFMA.f32 and a non-fused VMLA.f32 are usually available for scalar code. For MVE we don't have the non-fused version though. It makes more sense for isFMAFasterThanFMulAndFAdd to return true, allowing us to simplify some of the existing ISel patterns. The tests here are that non of the existing tests failed, and so we are still selecting VFMA and VFMS. The one test that changed shows we can now select from fast math flags, as opposed to just relying on the isFMADLegalForFAddFSub option. Differential Revision: https://reviews.llvm.org/D69115	2019-11-04 15:05:41 +00:00
Sanjay Patel	fd0e0f7dae	[IR] adjust assert when replacing undef elements in vector constant As noted in follow-up to: rGa1e8ad4f2fa7 It's not safe to assume that an element of the constant is always non-null. It's definitely not an expected case for the current instcombine user, but that may not hold if this function is eventually called from arbitrary places.	2019-11-04 10:05:24 -05:00
Ulrich Weigand	3f50b12768	[SystemZ] Fix typo Typo in comment. NFC.	2019-11-04 16:01:14 +01:00
Nico Weber	148929db9f	gn build: (manually) merge 51b4b17eb Also reverts r353980 since that duplicated the GenAsmMatcher target for AArch64. Instead use visiblity.	2019-11-04 09:51:41 -05:00
Nico Weber	8912b9c60d	gn build: run "gn format"	2019-11-04 09:50:16 -05:00
Nico Weber	c83c0227fd	gn build: add deps, see discussion on D69130	2019-11-04 09:22:12 -05:00
Benjamin Kramer	600b43131a	Revert "[LV] Apply sink-after & interleave-groups as VPlan transformations (NFC)" This reverts commit 2be17087f8c38934b7fc9208ae6cf4e9b4d44f4b. Fails ASAN.	2019-11-04 15:04:42 +01:00
David Green	93da66bf1e	[ARM] More MVE shuffle tests for sequences that can be converted to VMOVS. NFC.	2019-11-04 13:38:32 +00:00
David Green	1941ea029d	[ARM] Add vrev32 NEON fp16 patterns Fill in the gaps for vrev32.16 f16 patterns, extending the existing i16 patterns. Differential Revision: https://reviews.llvm.org/D69508	2019-11-04 13:37:01 +00:00
Sanjay Patel	5f211f835c	[InstSimplify] use FMF to improve fcmp+select fold This is part of a series of patches needed to solve PR39535: https://bugs.llvm.org/show_bug.cgi?id=39535	2019-11-04 08:29:56 -05:00
Sanjay Patel	ed1d652509	[InstSimplify] add more tests for fcmp+select; NFC The easy code fix won't catch non-canonical mismatched constant patterns, so adding extra coverage for those in case we decide that's important (but seems unlikely).	2019-11-04 08:23:08 -05:00

1 2 3 4 5 ...

187325 Commits