llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Yuanfang Chen	2dbac841f9	Revert "Revert "Revert "Reland "[Support] make report_fatal_error `abort` instead of `exit`"""" This reverts commit bb51d243308dbcc9a8c73180ae7b9e47b98e68fb.	2020-02-13 10:08:05 -08:00
Yuanfang Chen	93e82c22ef	Revert "Revert "Reland "[Support] make report_fatal_error `abort` instead of `exit`""" This reverts commit 80a34ae31125aa46dcad47162ba45b152aed968d with fixes. On bots llvm-clang-x86_64-expensive-checks-ubuntu and llvm-clang-x86_64-expensive-checks-debian only, llc returns 0 for these two tests unexpectedly. I tweaked the RUN line a little bit in the hope that LIT is the culprit since this change is not in the codepath these tests are testing. llvm\test\CodeGen\X86\inline-asm-avx-v-constraint-32bit.ll llvm\test\CodeGen\X86\inline-asm-avx512vl-v-constraint-32bit.ll	2020-02-13 10:02:53 -08:00
Nikita Popov	99acf1bd20	[MemorySSA] Don't verify MemorySSA unless VerifyMemorySSA enabled MemorySSA is often taking up an unreasonable fraction of runtime in assertion enabled builds. Turns out that there is one code-path that runs verifyMemorySSA() even if VerifyMemorySSA is not enabled. This patch makes it conditional as well. Differential Revision: https://reviews.llvm.org/D74505	2020-02-13 18:46:58 +01:00
Matt Arsenault	f95863ff55	AMDGPU: Use v_perm_b32 to implement bswap Also greatly improve i64 lowering. LegalizeIntegerTypes does the correct narrowing if i64 isn't legal. Just workaround this for SelectionDAG by making i64 legal and splitting in the patterns.	2020-02-13 09:45:31 -08:00
John Brawn	ad62d2e7de	[ARM] Fix infinite loop when lowering STRICT_FP_EXTEND If the target has FP64 but not FP16 then we have custom lowering for FP_EXTEND and STRICT_FP_EXTEND with type f64. However if the extend is from f32 to f64 the current implementation will cause in infinite loop for STRICT_FP_EXTEND due to emitting a merge_values of the original node which after replacement becomes a merge_values of itself. Fix this by not doing anything for f32 to f64 extend when we have FP64, though for STRICT_FP_EXTEND we have to do the strict-to-nonstrict mutation as that doesn't happen automatically for opcodes with custom lowering. Differential Revision: https://reviews.llvm.org/D74559	2020-02-13 16:12:50 +00:00
Sanjay Patel	946d17fa5b	[VectorCombine] adjust tests for extract-binop; NFC We want the extra-use tests to be consistent with the earlier single-use tests and be as cheap as possible in vector form to show cost model edge cases. So use i8 and extract from element 0 since that should be cheap for all x86 targets.	2020-02-13 10:51:01 -05:00
Sanjay Patel	7dfed75fcc	[VectorCombine] add more extract-binop tests; NFC See D74495.	2020-02-13 10:07:20 -05:00
Francesco Petrogalli	0612ebecbb	[llvm][lldb] Update links to ABI for the Arm Architecture. [NFC]	2020-02-13 14:57:53 +00:00
Sean Fertile	0d0a2b6d3c	[PowerPC][NFC] Small cleanup to restore CR field code in PPCFrameLowering. Skip the loop over the CalleSavedInfos in 'restoreCalleeSavedRegisters' when the register is a CR field and we are not targeting 32-bit ELF. This is safe because: 1) The helper function 'restoreCRs' returns if the target is not 32-bit ELF, making all the code in the loop related to CR fields dead for every other subtarget. This code is only called on ELF right now, but the patch to extend it for AIX also needs to skip 'restoreCRs'. 2) The loop will not otherwise modify the iterator, so the iterator manipulations at the bottom of the loop end up setting 'I' to its current value. This simplifciation allows us to remove one argument from 'restoreCRs'. Also add a helper function to determine if a register is one of the callee saved condition register fields.	2020-02-13 09:50:28 -05:00
Simon Pilgrim	d851cf8ef2	Move FIXME to start of comment so visual studio actually tags it. NFC.	2020-02-13 14:28:50 +00:00
Simon Pilgrim	ea74bfe847	[X86][SSE] Add i686-SSE2 bswap vector tests	2020-02-13 14:28:49 +00:00
Nico Weber	10c7916487	[gn build] Fix sync script on renames like "Foo.cpp" -> "LLVMFoo.cpp" Before, the script used `git log -SFoo.cpp` to find a commit where the number of occurrences of "Foo.cpp" changed -- but since a patch with + LLVMFoo.cpp - Foo.cpp contains the same number of instances of "Foo.cpp", the script incorrectly skipped this type of rename. As fix, look for '\bFoo\.cpp\b' instead and pass --pickaxe-regex so that we can grep for word boundaries. To test, check out 7531a5039fd (which renamed in llvm/lib/IR RemarkStreamer.cpp to LLVMRemarkStreamer.cpp) and look at the output of the script. Before this change, it correctly assigned the addition of LLVMRemarkStreamer.cpp to 7531a5039fd but incorrectly assigned the removal of RemarkStreamer.cpp to b8a847c. With this, it correctly assigns both to 7531a5039fd.	2020-02-13 09:26:47 -05:00
serge-sans-paille	dd5b542685	Fix integration of pass plugins with llvm dylib Call llvm_process_pass_plugin from clang when in standalone mode. Differential Revision: https://reviews.llvm.org/D74464	2020-02-13 14:18:08 +01:00
serge-sans-paille	eff8b15a75	Rework go bindings so that validation works fine Basically change the layout to please `go build` and remove references to `llvm-go`. Update llvm/test/Bindings/Go/ to use the system go compiler Differential Revision: https://reviews.llvm.org/D74540	2020-02-13 14:13:03 +01:00
Qiu Chaofan	a735dedfe5	[PowerPC] Exploit VSX rounding instrs for rint Exploit native VSX rounding instruction, x(v\|s)r(d\|s)pic, which does rounding using current rounding mode. According to C standard library, rint may raise INEXACT exception while nearbyint won't. Reviewed By: lkail Differential Revision: https://reviews.llvm.org/D72685	2020-02-13 20:59:50 +08:00
stozer	b6fc689526	Re-revert: Recover debug intrinsics when killing duplicated/empty blocks This reverts commit 61b35e4111160fe834a00c33d040e01150b576ac. This commit causes a timeout in chromium builds; likely to have a similar cause to the previous timeout issue caused by this commit (see 6ded69f294a9 for more details). It is possible that there is no way to fix this bug that will not cause this issue; further investigations as to the efficiency of handling large amounts of debug info will be necessary.	2020-02-13 11:48:19 +00:00
Daniel Kiss	245ebfa635	[AArch64] Fix BTI landing pad generation. In some cases BTI landing pad is inserted even compatible instruction was there already. Meta instruction does not count in this case therefore skip them in the check for first instructions in the function. Differential revision: https://reviews.llvm.org/D74492	2020-02-13 10:44:34 +00:00
Kerry McLaughlin	b7041a91bb	[AArch64][SVE] Add mul/mla/mls lane & dup intrinsics Summary: Implements the following intrinsics: - @llvm.aarch64.sve.dup - @llvm.aarch64.sve.mul.lane - @llvm.aarch64.sve.mla.lane - @llvm.aarch64.sve.mls.lane Reviewers: c-rhodes, sdesmalen, dancgr, efriedma, rengolin Reviewed By: sdesmalen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74222	2020-02-13 10:32:59 +00:00
David Green	61eab1d530	[ARM] Fix ReconstructShuffle for bigendian Simon pointed out that this function is doing a bitcast, which can be incorrect for big endian. That makes the lowering of VMOVN in MVE wrong, but the function is shared between Neon and MVE so both can be incorrect. This attempts to fix things by using the newly added VECTOR_REG_CAST instead of the BITCAST. As it may now be used on Neon, I've added the relevant patterns for it there too. I've also added a quick dag combine for it to remove them where possible. Differential Revision: https://reviews.llvm.org/D74485	2020-02-13 09:56:46 +00:00
David Green	aa89daf1c2	[ARM] Extra vmovn tests to show BE differences. NFC	2020-02-13 09:56:46 +00:00
Roman Lebedev	01b56f3a2c	[NFC][llvm-exegesis] Docs/help: opcode-index=-1 means measure everything	2020-02-13 12:46:12 +03:00
Igor Kudrin	2da0ac3ddb	[DebugInfo] Fix dumping CIE ID in .eh_frame sections. We do not keep the actual value of the CIE ID field, because it is predefined, and use a constant when dumping a CIE record. The issue was that the predefined value is different for .debug_frame and .eh_frame sections, but we always printed the one which corresponds to .debug_frame. The patch fixes that by choosing an appropriate constant to print. See the following for more information about .eh_frame sections: https://refspecs.linuxfoundation.org/LSB_5.0.0/LSB-Core-generic/LSB-Core-generic/ehframechpt.html Differential Revision: https://reviews.llvm.org/D73627	2020-02-13 15:42:14 +07:00
Johannes Doerfert	a084e15b3b	[OpenMP][FIX] Collect blocks to be outlined after finalization Finalization can introduce new blocks we need to outline as well so it makes sense to identify the blocks that need to be outlined after finalization happened. There was also a minor unit test adjustment to account for the fact that we have a single outlined exit block now.	2020-02-13 00:42:22 -06:00
Vladimir Vereschaka	4bc72f1985	Revert "Replace std::foo with std::foo_t in LLVM." This reverts commit a4384c756bd8a819051009b5b273b2a34be8261b. These changes break LLVM build on Windows builders. See https://reviews.llvm.org/rGa4384c756bd8a819051009b5b273b2a34be8261b for details.	2020-02-12 20:54:21 -08:00
Craig Topper	752053399d	[X86] Add test RUN lines to show cases where we use 512-bit vcmppd/ps with garbage upper bits for 128/256-bit strict_fsetcc On KNL targets, we widen 128/256-bit strict_fsetcc nodes to 512-bits without forcing the upper bits to zero. This can cause spurious exceptions due to garbage upper bits. This behavior was inherited from the non-strict case where the spurious exception isn't a problem.	2020-02-12 20:51:52 -08:00
Yonghong Song	cf18c2c0f8	[BPF] explicit warning of not supporting dynamic stack allocation Currently, BPF does not support dynamic static allocation. For a program like below: extern void bar(int *); void foo(int n) { int a[n]; bar(a); } The current error message looks like: unimplemented operand UNREACHABLE executed at /.../llvm/lib/Target/BPF/BPFISelLowering.cpp:199! Let us make error message explicit so it will be clear to the user what is the problem. With this patch, the error message looks like: fatal error: error in backend: Unsupported dynamic stack allocation ... Differential Revision: https://reviews.llvm.org/D74521	2020-02-12 20:43:06 -08:00
Johannes Doerfert	17f538d282	Reapply "[OpenMP][IRBuilder] Perform finalization (incl. outlining) late" Reapply 8a56d64d7620b3764f10f03f3a1e307fcdd72c2f with minor fixes. The problem was that cancellation can cause new edges to the parallel region exit block which is not outlined. The CodeExtractor will encode the information which "exit" was taken as a return value. The fix is to ensure we do not return any value from the outlined function, to prevent control to value conversion we ensure a single exit block for the outlined region. This reverts commit 3aac953afa34885a72df96f2b703b65f85cbb149.	2020-02-12 22:29:07 -06:00
Serguei Katkov	ca1a0c2e8f	[Statepoint] Remove redundant clear of call target on register Patchable statepoint is lowered into sequence of nops, so zeroed call target should not be on register. It is better to use getTargetConstant instead of getConstant to select zero constant for call target. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D74465	2020-02-13 10:25:50 +07:00
Austin Kerbow	305dffc7b7	[AMDGPU][GlobalISel] Handle 64byte EltSIze in getRegSplitParts Reviewers: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74518	2020-02-12 19:11:52 -08:00
Nico Weber	7332bd4dc7	Fix ReST syntax on link to "Bisecting LLVM code" page Patch from nicolas17 (Nicolás Alvarez)! Differential Revision: https://reviews.llvm.org/D74422	2020-02-12 21:18:25 -05:00
Fangrui Song	a9c1f8f10e	[AsmPrinter][ELF] Emit local alias for ExternalLinkage dso_local GlobalAlias	2020-02-12 17:08:22 -08:00
Amy Huang	3f8a544115	Revert "[X86][SSE] lowerShuffleAsBitRotate - lower to vXi8 shuffles to ROTL on pre-SSSE3 targets" This reverts commit 11c16e71598d51f15b4cfd0f719c4dabcc0bebf7 because it causes a crash in chromium code. See https://reviews.llvm.org/rG11c16e71598d51f15b4cfd0f719c4dabcc0bebf7.	2020-02-12 17:00:37 -08:00
Johannes Doerfert	b34872013b	Revert "[OpenMP][IRBuilder] Perform finalization (incl. outlining) late" This reverts commit 8a56d64d7620b3764f10f03f3a1e307fcdd72c2f. Will be recommitted once the clang test problem is addressed.	2020-02-12 18:50:43 -06:00
Matt Arsenault	6fdf8e6b26	AMDGPU/GlobalISel: Select G_CTTZ_ZERO_UNDEF Directly select this rather than going through the intermediate instruction, which may provide some combine value in the future.	2020-02-12 16:19:46 -08:00
Matt Arsenault	3eb9775837	AMDGPU/GlobalISel: Select G_CTLZ_ZERO_UNDEF Directly select this rather than going through the intermediate instruction, which may provide some combine value in the future.	2020-02-12 16:19:45 -08:00
Matt Arsenault	ab1b814c33	AMDGPU/GlobalISel: Fix mapping G_ICMP with constrained result When SI_IF is inserted, it constrains the source register with a register class, which was quite likely a G_ICMP. This was incorrectly treating it as a scalar, and then applyMappingImpl would end up producing invalid MIR since this was unexpected. Also fix not using all VGPR sources for vcc outputs.	2020-02-12 16:19:45 -08:00
Matt Arsenault	3a9f8d3df6	PPC: Prepare tests for switch of default denormal-fp-math These tests fail when the default is switched to assume IEEE denormal handling. I'm not sure if PPC really has a way to control the denormal input handling.	2020-02-12 16:19:45 -08:00
Caroline Lebar	7f726c91e2	Replace std::foo with std::foo_t in LLVM. This patch is replacements missed in my last change doing this across LLVM. No functional change, although I think there was a missing typename in struct conjunction that is now fixed.	2020-02-12 16:14:36 -08:00
Johannes Doerfert	64f169d63d	[OpenMP][IRBuilder] Perform finalization (incl. outlining) late In order to fix PR44560 and to prepare for loop transformations we now finalize a function late, which will also do the outlining late. The logic is as before but the actual outlining step happens now after the function was fully constructed. Once we have loop transformations we can apply them in the finalize step before the outlining. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D74372	2020-02-12 17:55:01 -06:00
Johannes Doerfert	2f7f5ebd66	[Attributor] Use fine-grained liveness in all helpers We used coarse-grained liveness before, thus we looked if the instruction was executed, but we did not use fine-grained liveness, hence if the instruction was needed or could be deleted even if the surrounding ones are live. This patches introduces this level of liveness checks together with other liveness queries, e.g., for uses. For more control we enforce that all liveness queries go through the Attributor. Test have been adjusted to reflect the changes or augmented to prevent deletion of the parts we want to check. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D73313	2020-02-12 17:36:38 -06:00
Johannes Doerfert	0ac5032e49	[Attributor] Ignore uses if a value is simplified If we have a replacement for a value, via AAValueSimplify, the original value will lose all its uses. Thus, as long as a value is simplified we can skip the uses in checkForAllUses, given that these uses are transitive uses for the simplified version and will therefore affect the simplified version as necessary. Since this allowed us to remove calls without side-effects and a known return value, we need to make sure not to eliminate `musttail` calls. Those we keep around, or later remove the entire `musttail` call chain.	2020-02-12 17:36:38 -06:00
Johannes Doerfert	c561bed82b	[Attributor] Use assumed information to determine side-effects We relied on wouldInstructionBeTriviallyDead before but that functions does not take assumed information, especially for calls, into account. The replacement, AAIsDead::isAssumeSideEffectFree, does. This change makes AAIsDeadCallSiteReturn more complex as we can have a dead call or only dead users. The test have been modified to include a side effect where there was none in order to keep the coverage. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D73311	2020-02-12 17:36:38 -06:00
Guozhi Wei	af1f058dcb	[MBP] Partial tail duplication into hot predecessors Current tail duplication embedded in MBP duplicates a BB into all or none of its predecessors without too much cost analysis. So sometimes it is duplicated into cold predecessors, and in other cases it may miss the duplication into hot predecessors. This patch improves tail duplication in 3 aspects: A successor can be duplicated into part of its predecessors. A more fine-grained benefit analysis, combined with 1, now a successor is duplicated into hot predecessors only. If a successor can't be duplicated into one predecessor, it doesn't impact the duplication into other predecessors. Differential Revision: https://reviews.llvm.org/D73387	2020-02-12 15:22:33 -08:00
Stanislav Mekhanoshin	563a9ce27a	[TBLGEN] Fix subreg value overflow in DAGISelMatcher Tablegen's DAGISelMatcher emits integers in a VBR format, so if an integer is below 128 it can fit into a single byte, otherwise high bit is set, next byte is used etc. MatcherTable is essentially an unsigned char table. When SelectionDAGISel parses the table it does a reverse translation. In a situation when numeric value of an integer to emit is unknown it can be emitted not as OPC_EmitInteger but as OPC_EmitStringInteger using a symbolic name of the value. In this situation the value should not exceed 127. One of the situations when OPC_EmitStringInteger is used is if we need to emit a subreg into a matcher table. However, number of subregs can exceed 127. Currently last defined subreg for AMDGPU is 192. That results in a silent bug in the ISel with matcher reading from an invalid offset. Fixed this bug to emit actual VBR encoded value for a subregs which value exceeds 127. Differential Revision: https://reviews.llvm.org/D74368	2020-02-12 13:29:57 -08:00
Jinsong Ji	bbd2bc0129	[docs] Minor updates to DeveloperPolicy due to svn to git Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D73971	2020-02-12 21:08:15 +00:00
Ehud Katz	5b4188d47a	[LoopExtractor] Fix legacy pass dependencies Fixes a memory leak of allocating `LoopInfoWrapperPass` and `DominatorTreeWrapperPass`.	2020-02-12 22:39:21 +02:00
Roman Lebedev	acc25033f2	[llvm-exegesis] CombinationGenerator: don't store function_ref function_ref is non-owning, so if we get it as a parameter in constructor, our reference goes out-of-scope as soon as constructor returns. Instead, let's just take it as a parameter to the actual `generate()` call	2020-02-12 23:33:23 +03:00
Vedant Kumar	2b93b8f767	[AddressSanitizer] Ensure only AllocaInst is passed to dbg.declare Various parts of the LLVM code generator assume that the address argument of a dbg.declare is not a `ptrtoint`-of-alloca. ASan breaks this assumption, and this results in local variables sometimes being unavailable at -O0. GlobalISel, SelectionDAG, and FastISel all do not appear to expect dbg.declares to have a `ptrtoint` as an operand. This means that they do not place entry block allocas in the usual side table reserved for local variables available in the whole function scope. This isn't always a problem, as LLVM can try to lower the dbg.declare to a DBG_VALUE, but those DBG_VALUEs can get dropped for all the usual reasons DBG_VALUEs get dropped. In the ObjC test case I'm looking at, the cause happens to be that `replaceDbgDeclare` has hoisted dbg.declares into the entry block, causing LiveDebugValues to "kill" the DBG_VALUEs because the lexical dominance check fails. To address this, I propose: 1) Have ASan (always) pass an alloca to dbg.declares (this patch). This is a narrow bugfix for -O0 debugging. 2) Make replaceDbgDeclare not move dbg.declares around. This should be a generic improvement for optimized debug info, as it would prevent the lexical dominance check in LiveDebugValues from killing as many variables. This means reverting llvm/r227544, which fixed an assertion failure (llvm.org/PR22386) but no longer seems to be necessary. I was able to complete a stage2 build with the revert in place. rdar://54688991 Differential Revision: https://reviews.llvm.org/D74369	2020-02-12 11:24:02 -08:00
Jay Foad	cb7a62d110	[KnownBits] Introduce anyext instead of passing a flag into zext Summary: This was a very odd API, where you had to pass a flag into a zext function to say whether the extended bits really were zero or not. All callers passed in a literal true or false. I think it's much clearer to make the function name reflect the operation being performed on the value we're tracking (rather than on the KnownBits Zero and One fields), so zext means the value is being zero extended and new function anyext means the value is being extended with unknown bits. NFC. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74482	2020-02-12 19:06:53 +00:00
LLVM GN Syncbot	7208526d5c	[gn build] Port 6030fe01f4e	2020-02-12 18:34:39 +00:00

1 2 3 4 5 ...

191840 Commits