llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-21 18:22:53 +01:00

Author	SHA1	Message	Date
Sriraman Tallam	ecfaff26fe	Test commit.	2020-03-14 18:08:26 -07:00
Lang Hames	80d993d0f0	Revert "[JITLink][MachO] Treat linker private symbols as hidden rather than private." This reverts commit b64afadf306f284a684ee656c6eefbd43c192c8d. Reverting while I investigate bot failures.	2020-03-14 16:52:25 -07:00
Craig Topper	fa8604e589	[X86] Add avx512f only command lines to the vector add/sub saturation tests. NFC Gives us coverage of splitting the v32i16/v64i8 when we have avx512f and not avx512bw. Considering making v32i16/v64i8 a legal type on avx512f which needs this test coverage.	2020-03-14 16:50:44 -07:00
Lang Hames	16ceeef4af	[JITLink][MachO] Treat linker private symbols as hidden rather than private. Linker-private symbols should be resolvable across object file boundaries.	2020-03-14 16:33:15 -07:00
Lang Hames	da59eff7c0	[llvm-jitlink] Add -show-init-es option to dump initial ExecutionSession state. Inspecting this state can be helpful when debugging jit-linking testcases.	2020-03-14 16:07:46 -07:00
Lang Hames	601e0cfef0	[Orc][examples] Actually return MainResult from main	2020-03-14 15:11:23 -07:00
LLVM GN Syncbot	7e25c8f52a	[gn build] Port 633ea07200e	2020-03-14 21:50:50 +00:00
Lang Hames	4ce58a4d70	[Orc] Add basic OrcV2 C bindings and example. Renames the llvm/examples/LLJITExamples directory to llvm/examples/OrcV2Examples since it is becoming a home for all OrcV2 examples, not just LLJIT. See http://llvm.org/PR31103.	2020-03-14 14:41:22 -07:00
Florian Hahn	9304245f10	[ValueLattice] Go to overdefined in getRange() for full ranges. This was split off from 4878aa36d4aa27df644430139fab2734fde4a000, as it can go in separately.	2020-03-14 19:50:15 +00:00
Krzysztof Parzyszek	b8e8eb7800	[Hexagon] Only allow single HVX vector loads/stores in lowering This will prevent store widening from forming vector pair stores, which eventually end up broken up into single stores.	2020-03-14 14:26:01 -05:00
Simon Pilgrim	709977f244	Fix signed/unsigned comparison warning.	2020-03-14 18:42:27 +00:00
Simon Pilgrim	214f8753e7	[X86] getFauxShuffleMask - pull out repeated byte sizes variables. NFC.	2020-03-14 17:36:17 +00:00
Florian Hahn	4271526a2f	[ValueLattice] Add new state for undef constants. This patch adds a new undef lattice state, which is used to represent UndefValue constants or instructions producing undef. The main difference to the unknown state is that merging undef values with constants (or single element constant ranges) produces the constant/constant range, assuming all uses of the merge result will be replaced by the found constant. Conversely, merging non-single element ranges with undef needs to go to overdefined. Using unknown for UndefValues currently causes mis-compiles in CVP/LVI (PR44949) and will become problematic once we use ValueLatticeElement for SCCP. Reviewers: efriedma, reames, davide, nikic Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D75120	2020-03-14 17:19:59 +00:00
Georgii Rymar	89e0b77636	[yaml2obj] - Set a default value for `PAddr` property of a program header to a value of `VAddr` `PAddr` corresponds to `p_paddr` of a program header, which is the segment's physical address for systems in which physical addressing is relevant. `p_paddr` is often equal to `p_vaddr`, which is the virtual address of a segment. This patch changes the default for `PAddr` from 0 to a value of `VAddr`. Differential revision: https://reviews.llvm.org/D76131	2020-03-14 17:44:57 +03:00
Simon Pilgrim	a2623b33c2	[X86] getFauxShuffleMask - merge insertelement paths Merge the INSERT_VECTOR_ELT/SCALAR_TO_VECTOR and PINSRW/PINSRB shuffle mask paths - they both do the same thing (find source vector + handle implicit zero extension). The PINSRW/PINSRB path also handled the insertion of zero case, which needed to be added to the general case as well.	2020-03-14 13:11:03 +00:00
Martin Storsjö	e7213f8e05	[llvm-dlltool] Add a testcase to show the kind of weak external used for import library aliases. NFC.	2020-03-14 14:00:36 +02:00
Shengchen Kan	4b036bee4a	[X86] Disable nop padding before instruction following a prefix Reviewers: reames, MaskRay, craig.topper, LuoYuanke, jyknight Reviewed By: LuoYuanke Subscribers: hiraditya, llvm-commits, annita.zhang Tags: #llvm Differential Revision: https://reviews.llvm.org/D76052	2020-03-14 13:15:30 +08:00
Diogo Sampaio	f65b040dae	[AArch64][Fix] LdSt optimization generates premature stack-popping Summary: When moving add and sub to memory operand instructions, aarch64-ldst-opt would prematurely pop the stack pointer, before memory instructions that do access the stack using indirect loads. e.g. ``` int foo(int offset){ int local[4] = {0}; return local[offset]; } ``` would generate: ``` sub sp, sp, #16 ; Push the stack mov x8, sp ; Save stack in register stp xzr, xzr, [sp], #16 ; Zero initialize stack, and post-increment, making it invalid ------ If an exception goes here, the stack value might be corrupted ldr w0, [x8, w0, sxtw #2] ; Access correct position, but it is not guarded by SP ``` Reviewers: fhahn, foad, thegameg, eli.friedman, efriedma Reviewed By: efriedma Subscribers: efriedma, kristof.beyls, hiraditya, danielkiss, llvm-commits, simon_tatham Tags: #llvm Differential Revision: https://reviews.llvm.org/D75755	2020-03-14 02:03:10 +00:00
Craig Topper	c09e8a1c6c	[X86] Remove isel patterns for X86VBroadcast+trunc+extload. Replace with DAG combines. This is a little more complicated than I'd like it to be. We have to manually match a trunc+srl+load pattern that generic DAG combine won't do for us due to isTypeDesirableForOp.	2020-03-13 18:12:16 -07:00
Michael Liao	ac398f6993	Fix `-Wunused-variable`. NFC.	2020-03-13 20:54:22 -04:00
Whitney Tsang	797eaf4fbd	[NFC][LoopUnrollAndJam] clang-format. I am currently working on this file.	2020-03-14 00:04:10 +00:00
Philip Reames	987e9d3a05	Adjust debug output for MCRelaxableFragment to include the size so that sanity checking relaxation offsets from -debug output is easier	2020-03-13 16:22:46 -07:00
Eli Friedman	5c72beb2cf	[SCEV] Add support for GEPs over scalable vectors. Because we have to use a ConstantExpr at some point, the canonical form isn't set in stone, but this seems reasonable. The pretty sizeof(<vscale x 4 x i32>) dumping is a relic of ancient LLVM; I didn't have to touch that code. :) Differential Revision: https://reviews.llvm.org/D75887	2020-03-13 16:12:45 -07:00
Brian Cain	fd389a668b	Initialize IsFast* values We must initialize these values in case some targets do not assign to them in allowsMemoryAccess().	2020-03-13 17:46:32 -05:00
Jan Korous	0fd46fd26a	[LLJIT] Add std::move() as a workaround for older compilers Clang 3.8 isn't able to bind the variable to rvalue-ref which breaks the build.	2020-03-13 15:25:25 -07:00
Craig Topper	8a39768ac7	[SelectionDAGBuilder] Simplify the struct type handling in getUniformBase.	2020-03-13 14:00:21 -07:00
Craig Topper	68f278abc5	[IR] Fix formatting. NFC	2020-03-13 14:00:20 -07:00
Lang Hames	c3e7231c1f	[MCJIT] Check for RuntimeDyld errors in MCJIT::finalizeLoadedModules. Patch based on https://reviews.llvm.org/D75912 by Alexander Shishkin. Thanks Alexander! To minimize disruption to existing clients, who may be relying on the fact that unused references to unresolved symbols do not generate an error, this patch makes error checking opt-in: Clients can call ExecutionEngine::hasError or LLVMExecutionEngineGetError to check whether an error has occurred. Differential revision: https://reviews.llvm.org/D75912	2020-03-13 13:58:41 -07:00
Richard Smith	3f9596e9a9	Fix "unused variable" warning in NDEBUG builds.	2020-03-13 13:56:57 -07:00
Amy Huang	24d4829906	CMake: Turn LLVM_ENABLE_ZLIB into a tri-state option Summary: Add FORCE_ON option to LLVM_ENABLE_ZLIB, which causes a configuration error if zlib is not found. Similar to https://reviews.llvm.org/D40050. Reviewers: hans, thakis, rnk Subscribers: mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76110	2020-03-13 13:52:46 -07:00
Akira Hatanaka	a30499dbf2	[ObjC][ARC] Don't remove autoreleaseRV/retainRV pairs if the call isn't a tail call This reapplies the patch in https://reviews.llvm.org/rG1f5b471b8bf4, which was reverted because it was causing crashes. https://bugs.chromium.org/p/chromium/issues/detail?id=1061289#c2 Check that HasSafePathToCall is true before checking the call is a tail call. Original commit message: Previously ARC optimizer removed the autoreleaseRV/retainRV pair in the following code, which caused the object returned by @something to be placed in the autorelease pool because the call to @something isn't a tail call: ``` %call = call i8* @something(...) %2 = call i8* @objc_retainAutoreleasedReturnValue(i8* %call) %3 = call i8* @objc_autoreleaseReturnValue(i8* %2) ret i8* %3 ``` Fix the bug by checking whether @something is a tail call. rdar://problem/59275894	2020-03-13 13:52:14 -07:00
Stanislav Mekhanoshin	79c06fdf27	[AMDGPU] Fix endcf collapse Only collapse inner endcf if the outer one belongs to SI_IF. If it does belong to SI_ELSE then the mask being restored is in fact a partial inverse of what we need. Differential Revision: https://reviews.llvm.org/D76154	2020-03-13 13:50:21 -07:00
Martin Storsjö	87135ac747	[COFF] Assign unique names to autogenerated .weak.<name>.default symbols These symbols need to be external (MSVC tools error out if a weak external points at a symbol that isn't external; this was tried before but had to be reverted in bc5b7217dceecd3eec69593026a9e38dfbfd6908, and this was originally explicitly fixed in 732eeaf2a930ad2755cb4eb5d99a3deae0de4a72). If multiple object files have weak symbols with defaults, their defaults could cause linker errors due to duplicate definitions, unless the names of the defaults are unique. GNU binutils handles this by appending the name of another symbol from the same object file to the name of the default symbol. Try to implement something similar; before writing the object file, locate a symbol that should have a unique name and use the name of that one for making the weak defaults unique. Differential Revision: https://reviews.llvm.org/D75989	2020-03-13 22:44:55 +02:00
Matt Arsenault	9ffd832926	AMDGPU: Add flag to use fixed function ABI Pass all arguments to every function, rather than only passing the minimum set of inputs needed for the call graph.	2020-03-13 13:27:05 -07:00
Alexey Zhikhartsev	5c7dd4eee0	[LoopInterchange] Fix interchanging contents of preheader BBs Summary: Previously LCSSA was getting broken by placing instructions into the (newly) inner header instead of the preheader. Fixes PR43474 Reviewers: fhahn Reviewed By: fhahn Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75943	2020-03-13 15:59:37 -04:00
Matt Arsenault	ed76b808e1	AMDGPU: Don't handle kernarg.segment.ptr in functions Just lower this to null. Pass implicitarg.ptr in its place in the argument list.	2020-03-13 12:51:12 -07:00
Nico Weber	55ba7badfd	Revert "Reland "[DebugInfo] Enable the debug entry values feature by default"" This reverts commit 5aa5c943f7da155b95564058cd5d50a93eabfc89. Causes clang to assert, see https://bugs.chromium.org/p/chromium/issues/detail?id=1061533#c4 for a repro.	2020-03-13 15:37:44 -04:00
Stanislav Mekhanoshin	4a41912af9	[AMDGPU] Disable endcf collapse There are some functional regressions and I suspect our scopes are not as perfectly enclosed as I expected. Disable it for now. Differential Revision: https://reviews.llvm.org/D76148	2020-03-13 12:33:22 -07:00
Reid Kleckner	206ae5e2a7	Revert "[ObjC][ARC] Check the basic block size before calling DominatorTree::dominate" This reverts commit 5c3117b0a98dd11717eaffd7fb583985d39544b2 This should not be necessary after 7593a480dbce4e26f7dda4aa8f15bffd03acbfdb, and Florian Hahn has confirmed that the problem no longer reproduces with this patch. I happened to notice this code because the FIXME talks about OrderedBasicBlock. Reviewed By: fhahn, dexonsmith Differential Revision: https://reviews.llvm.org/D76075	2020-03-13 11:57:55 -07:00
Simon Pilgrim	dd904ff7d4	[X86][SSE] Prefer trunc(movd(x)) to pextrb(x,0) If we're extracting the 0'th index of a v16i8 vector we're better off using MOVD than PEXTRB, unless we're storing the value or we require the implicit zero extension of PEXTRB. The biggest perf diff is on SLM targets where MOVD (uops=1, lat=3 tp=1) is notably faster than PEXTRB (uops=2, lat=5, tp=4). This matches what we already do for PEXTRW. Differential Revision: https://reviews.llvm.org/D76138	2020-03-13 18:43:04 +00:00
Sanjay Patel	6180cf61cf	[SimplifyCFG] add test for chain of empty block conditional branches; NFC	2020-03-13 14:39:31 -04:00
Huihui Zhang	b15e0eb9ad	[SLPVectorizer][SVE] Bail out early for scalable vector. Summary: SLPVectorizer tries to vectorize a list of scalar instructions of the same type; instructions already vectorized are rejected through isValidElementType(). Without this patch, tryToVectorizeList() will first try to determine the vectorization factor of a list of Instructions before checking whether each instruction has an unsupported type or not. For instructions already vectorized for SVE, it will crash at getVectorElementSize(), where it tries to return a fixed size. This patch makes sure invalid element types are rejected before trying to get the vectorization factor. This makes sure we are not trying to vectorize instructions already vectorized. Reviewers: sdesmalen, efriedma, spatel, RKSimon, ABataev, apazos, rengolin Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76017	2020-03-13 11:23:31 -07:00
Sanjay Patel	57850e1de0	[SimplifyCFG] regenerate complete test checks; NFC	2020-03-13 14:12:28 -04:00
Sanjay Patel	4c46383ca2	[SimplifyCFG] regenerate test checks; NFC	2020-03-13 14:12:28 -04:00
Sanjay Patel	7c26e7a03f	[SimplifyCFG] fix formatting; NFC	2020-03-13 14:12:28 -04:00
Sanjay Patel	201f7ba5a1	[SimplifyCFG] fix debug print formatting; NFC	2020-03-13 14:12:28 -04:00
Florian Hahn	b0b8f8a192	[CVP,SCCP] Precommit test for D75055. Test case for PR44949.	2020-03-13 17:53:39 +00:00
Philip Reames	8b8ad6d63c	Use 15 byte long nops on modern Intel processors Back in D42616, we switched our default nop length from 15 to 10 bytes because some platforms have painful decode stalls when encountering multiple instruction prefixes. (10 byte long nops come from the fact that prefixes are used to pad after 8 bytes, and some platforms have issues w/more than two prefixes.) Based on Agner's guides, it appears to be the case that modern Intel (SandyBridge and later) can decode an arbitrary number of prefixes without issue. Intel's guide only provides up to 9 bytes; I read that as providing a safe default for all their chips. Older chips and Atom series have serious decode stalls. I can't find a conclusive reference beyond those two. Differential Revision: https://reviews.llvm.org/D75945	2020-03-13 10:51:09 -07:00
Simon Cook	bae1c75f0d	[TableGen] Support combining AssemblerPredicates with ORs For context, the proposed RISC-V bit manipulation extension has a subset of instructions which require one of two SubtargetFeatures to be enabled, 'zbb' or 'zbp', and there is no defined feature which both of these can imply to use as a constraint either (see comments in D65649). AssemblerPredicates allow multiple SubtargetFeatures to be declared in the "AssemblerCondString" field, separated by commas, and this means that the two features must both be enabled. There is no equivalent to say that _either_ feature X or feature Y must be enabled, short of creating a dummy SubtargetFeature for this purpose and having features X and Y imply the new feature. To solve the case where X or Y is needed without adding a new feature, and to better match a typical TableGen style, this replaces the existing "AssemblerCondString" with a dag "AssemblerCondDag" which represents the same information. Two operators are defined for use with AssemblerCondDag, "all_of", which matches the current behaviour, and "any_of", which adds the new proposed ORing features functionality. This was originally proposed in the RFC at http://lists.llvm.org/pipermail/llvm-dev/2020-February/139138.html Changes to all current backends are mechanical to support the replaced functionality, and are NFCI. At this stage, it is illegal to combine features with ands and ors in a single AssemblerCondDag. I suspect this case is sufficiently rare that adding more complex changes to support it is unnecessary. Differential Revision: https://reviews.llvm.org/D74338	2020-03-13 17:13:51 +00:00
Florian Hahn	8286c97bd8	Recommit "[SCCP] Use ValueLatticeElement instead of LatticeVal (NFCI)" This patch should fix the cause of the stage2 failures and PR45185. This reverts the revert commit c52f839e723ee288db2a3e21860b011f6a9d707e.	2020-03-13 17:03:22 +00:00

193317 Commits