llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Sean Fertile	0974dd862a	[PowerPC][NFC] Remove Darwin specific logic in frame finalization. Remove some cumbersome Darwin specific logic for updating the frame offsets of the condition-register spill slots. The containing function has an early return if the subtarget is not ELF based which makes the Darwin logic dead.	2020-02-21 09:32:24 -05:00
Pavel Labath	ed1f03baaf	[Error/unittests] Add a FailedWithMessage gtest matcher Summary: We already have a "Failed" matcher, which can be used to check any property of the Error object. However, most frequently one just wants to check the error message, and while this is possible with the "Failed" matcher, it is also very convoluted (Failed<ErrorInfoBase>(testing::Property(&ErrorInfoBase::message, "the message"))). Now, one can just write: FailedWithMessage("the message"). I expect that most of the usages will remain this simple, but the argument of the matcher is not limited to simple strings -- the argument of the matcher can be any other matcher, so one can write more complicated assertions if needed (FailedWithMessage(ContainsRegex("foo\|bar"))). If one wants to match multiple error messages, he can pass multiple arguments to the matcher. If one wants to match the message list as a whole (perhaps to check the message count), I've also included a FailedWithMessageArray matcher, which takes a single matcher receiving a vector of error message strings. Reviewers: sammccall, dblaikie, jhenderson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74898	2020-02-21 15:29:48 +01:00
Simon Pilgrim	9776683c03	[X86] Regenerate hi reg tests	2020-02-21 14:23:54 +00:00
Simon Pilgrim	b7aa20de0a	[TargetLowering] SimplifyDemandedBits - use getValidShiftAmountConstant helper. Use the SelectionDAG::getValidShiftAmountConstant helper to get const/constsplat shift amounts, which allows us to drop the out of range shift amount early-out. First step towards better non-uniform shift amount support in SimplifyDemandedBits.	2020-02-21 14:23:53 +00:00
Krzysztof Parzyszek	952f45fdef	[Hexagon] Introduce noop intrinsic to cast between vector predicate types The (overloaded) intrinsic is llvm.hexagon.V6.pred.typecast[.128B]. The types of the operand and the return value are HVX boolean vector types. For each cast, there needs to be a corresponding intrinsic declared, with different suffixes appended to the name, e.g. ; cast <128 x i1> to <32 x i1> declare <32 x i1> @llvm.hexagon.V6.pred.typecast.128B.s1(<128 x i1>) ; cast <32 x i1> to <64 x i1> declare <64 x i1> @llvm.hexagon.V6.pred.typecast.128B.s2(<32 x i1>) etc.	2020-02-21 07:37:59 -06:00
Evgeniy Brevnov	836e38e46a	[DependenceAnalysis] Memory dependence analysis internal caching mechanism is broken in presence of TBAA (PR42733). Summary: There is a flaw in memory dependence analysis caching mechanism when memory accesses with TBAA are involved. Assume we first analysed and cached results for access with TBAA. Later we request dependence for the same memory but without TBAA (or different TBAA). By design these two queries should share one entry in the internal cache which corresponds to a general access (without TBAA). Thus upon second request internal cached is cleared and we continue analysis for access as if there is no TBAA. The problem is that even though internal cache is cleared the set of visited nodes is not. That means we won't traverse visited nodes again and populate internal cache with the corresponding dependence results. So we end up with internal cache in an incomplete state. Current implementation tries to signal that situation by resetting CacheInfo->Pair at line 1104. But that doesn't actually help since later code ignores this invalidation and relies on 'Cache->empty()' property to decide on cache completeness. Reviewers: reames, hfinkel, chandlerc, fedor.sergeev, asbirlea, fhahn, john.brawn, Prazek, sunfish Reviewed By: john.brawn Subscribers: DaniilSuchkov, kosarev, jfb, dantrushin, hiraditya, bmahjour, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73032	2020-02-21 20:20:36 +07:00
Sanjay Patel	6fe2359aa7	[ConstantFold] fold fsub -0.0, undef to undef rather than NaN A question about this behavior came up on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2020-February/139003.html ...and as part of backend improvements in D73978, but this is an IR change first because we already have fairly thorough tests in place here. We decided not to implement a more general change that would have folded any FP binop with nearly arbitrary constant + undef operand to undef because that is not theoretically correct (even if it is practically correct). Differential Revision: https://reviews.llvm.org/D74713	2020-02-21 08:03:19 -05:00
Swiftfuchs	007cdae506	[NFC] Corrected a minor typo in a comment	2020-02-21 13:56:44 +01:00
Nicolai Hähnle	db41577cd5	test/CodeGen/AMDGPU: Add a test case that shows a miscompilation Related to https://reviews.llvm.org/D74908 Change-Id: I6ebf3b5c7a32493016994f30d6796c41e95aecde	2020-02-21 13:38:24 +01:00
Sebastian Neubauer	ca62d6a9b1	Make unittests include path relative This change is relevant when embedding the llvm cmake project into another project. It should not change the build behavior of a normal llvm build. In the case where llvm is embedded as a cmake subproject, CMAKE_SOURCE_DIR does not point to the expected directory and building the tests fails. Using CMAKE_CURRENT_SOURCE_DIR fixes this problem, as it will always point to the same directory. Differential Revision: https://reviews.llvm.org/D73466	2020-02-21 10:19:11 +01:00
Craig Topper	294fe257f5	[X86] Don't bother avoiding illegal FCMOVs if we don't have the cmov subtarget feature. We'll be forced to emit branches so we might as well use the most direct condition.	2020-02-21 00:34:15 -08:00
Craig Topper	5b18676c88	[X86] Make combineCMov not create unsupported FCMOVs when f32/f64 are using X87. This makes the behavior consistent with what's in LowerSELECT.	2020-02-21 00:34:15 -08:00
Craig Topper	bee5d24788	[X86] Autogenerate complete checks. NFC	2020-02-21 00:34:14 -08:00
Sam Clegg	e2ce588bc1	[WebAssembly] Remove unneeded getWasmKindForNamedSection function I believe this was carried over from getELFKindForNamedSection since the wasm backend originally used ELF object writing as a template. Differential Revision: https://reviews.llvm.org/D74565	2020-02-20 22:49:08 -08:00
Craig Topper	f724f13454	[X86] Remove unnecessary isNullConstant in LowerSelect. NFC At this point in the code we know that Op1 or Op2 is all ones. Y points to the other operand. In the case that Op2 is zero, Op1 must be all ones and Y is Op2. The OR ORs Y into Res. But if Y is 0 the OR will be folded away by getNode so we don't need to check for it.	2020-02-20 21:41:13 -08:00
Craig Topper	a41ca2b24d	[X86] Add CMOV_VR64 pseudo instruction for MMX. Remove mmx handling from combineSelect. The combineSelect code was casting to i64 without any check that i64 was legal. This can break after type legalization. It also required splitting the mmx register on 32-bit targets. It's not clear that this makes sense. Instead switch to using a cmov pseudo like we do for XMM/YMM/ZMM.	2020-02-20 20:30:56 -08:00
Jim Lin	4d3738a380	[XCore] Add instruction pattern for bitrev Summary: Add support for lowering bitreverse to the bitrev instruction. Fix https://bugs.llvm.org/show_bug.cgi?id=34628. Reviewers: RKSimon, rtrieu, robertlytton Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74748	2020-02-21 09:28:49 +08:00
Vedant Kumar	1fb47dd1e8	[Dominators] Use Instruction::comesBefore for block-local queries, NFC Use the lazy instruction ordering facility for block-local dominance queries. Differential Revision: https://reviews.llvm.org/D74931	2020-02-20 16:41:51 -08:00
Bill Wendling	370e769899	Filter callbr insts from critical edge splitting Similarly to how splitting predecessors with an indirectbr isn't handled in the generic way, we also shouldn't split callbrs, for similar reasons.	2020-02-20 16:24:42 -08:00
Craig Topper	e5382d4140	[X86] Add CMOV_VK1 pseudo so we don't crash on v1i1 ISD::SELECT	2020-02-20 15:13:48 -08:00
Craig Topper	82b0f843d7	[X86] Expand vselect of v1i1 under avx512. We already do this for v2i1, v4i1, etc.	2020-02-20 15:13:47 -08:00
Craig Topper	3164857f56	[X86] Custom legalize v1i1 UADDSAT/USUBSAT/SADDSAT/UADDSAT to match v2i1/v4i1/v8i1 etc.	2020-02-20 15:13:46 -08:00
Craig Topper	b969000b87	[X86] Fix a couple copy mistakes in v4i1 or/and/xor isel patterns. VK1 was being used as the output of the copy to regclass, but it should be VK2/VK4. Shouldn't matter in practice though since VK1/VK2/VK4/VK8/VK16 are all identicaly and just have different VTs.	2020-02-20 15:13:45 -08:00
Craig Topper	94227b21b3	[X86] Custom legalize v1i1 add/sub/mul to xor/xor/and with avx512. We already did this for v2i1, v4i1, v8i1, etc.	2020-02-20 15:13:44 -08:00
Florian Hahn	65ce9e19e8	[SCCP] Do not mark unknown loads as overdefined. For tracked globals that are unknown after solving, we expect all non-store uses to be replaced. This is a follow-up to f8045b250d80, which removed forcedconstant. We should not mark unknown loads as overdefined, as they either load from an unknown pointer or an undef global. Restore the original logic for loads.	2020-02-20 22:48:58 +01:00
Eli Friedman	82be9993cd	[SVE] Add support for lowering GEPs involving scalable vectors. This includes both GEPs where the indexed type is a scalable vector, and GEPs where the result type is a scalable vector. Differential Revision: https://reviews.llvm.org/D73602	2020-02-20 13:45:41 -08:00
David Tenty	94bdd8542e	[AIX] Improve 32/64-bit build configuration Summary: AIX supports both 32-bit and 64-bit environments (with 32-bit being the default). This patch improves support for building LLVM on AIX in both 32-bit and 64-bit mode. - Change host detection to return correct 32/64-bit triple as config_guess does not return the correct version on 64-bit. This can confuse JIT tests and other things that care about what the host triple is. - Remove manual setting of 64-bit flags on AIX. AIX provides OBJECT_MODE environment variable to enable the user to obtain a 64-bit development environment. CMake will properly set these flags provided the user sets the correct OBJECT_MODE before configuring and setting them manually will interfere with 32-bit builds. - Don't present the LLVM_BUILD_32_BITS option on AIX, users should use OBJECT_MODE when running CMake instead. Reviewers: hubert.reinterpretcast, DiggerLin, stevewan Reviewed By: DiggerLin, stevewan Subscribers: mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74256	2020-02-20 15:41:00 -05:00
Craig Topper	c62fbc0139	Recommit "[X86] Replace a bad use of MVT::getVectorVT with EVT::getVectorVT"" With the correct author this time	2020-02-20 12:28:54 -08:00
Craig Topper	c55a492217	Revert 714265dabb606bfef2f85694234f152edbfa91ac "[X86] Replace a bad use of MVT::getVectorVT with EVT::getVectorVT" I accidentally messed up the author on the previous commit somehow.	2020-02-20 12:28:33 -08:00
Quentin Colombet	040440d537	[X86] Replace a bad use of MVT::getVectorVT with EVT::getVectorVT The type here isn't guaranteed to be a simple type. Fixes PR44976	2020-02-20 12:25:37 -08:00
Nico Weber	d35d5f2e63	Revert "[AArch64][SVE] Add intrinsics for SVE2 bitwise ternary operations" This reverts commit ce70e2899879e092b153a4078b993833b6696713. It broke MC/AArch64/SVE2/bsl-diagnostics.s everywhere.	2020-02-20 15:11:13 -05:00
Sanjay Patel	1dc06667d0	[ConstantFold] add/move tests for FP with undef operand; NFC	2020-02-20 15:07:11 -05:00
Sourabh Singh Tomar	b4f51213cf	Revert "[NFCI][DebugInfo]: Corrected a Typo." This reverts commit 3e1090922a0b808f424ff424b744752b0d53a3ee as per Paul Robinson's suggestion.	2020-02-21 01:15:09 +05:30
Francesco Petrogalli	b7293ff05c	[llvm][build] Fix shared lib builds. [NFC] The code at https://reviews.llvm.org/D74808 has broken builds that are configured with -DBUILD_SHARED_LIBS=On. This patch adds the correct library dependencies.	2020-02-20 19:42:53 +00:00
Quentin Colombet	b8760a31e4	[GISel][KnownBits] Give up on PHI analysis as soon as we don't know anything When analyzing PHIs, we gather the known bits for every operand and merge them together to get the known bits of the result of the PHI. It is not unusual that merging the information leads to know nothing on the result (e.g., phi a: i8 3, b: i8 unknown, ..., after looking at the second argument we know we will know nothing on the result), thus, as soon as we reach that state, stop analyzing the following operand (i.e., on the previous example, we won't process anything after looking at `b`). This improves compile time in particular with PHIs with a large number of operands. NFC.	2020-02-20 11:34:01 -08:00
David Tenty	dc211a8b72	[AIX] Pack BasicBlockBits Summary: D51664 introduces a new structure BasicBlockBits which it expects to be packed a certain way. This change is very similar to D60164, and we apply the same fix: "On AIX, the canonical layout of bit-fields would cause these ... to span four bytes. Applying the pack pragma for compilers that employ the AIX canonical layout allows these ... to fit within the expected two bytes. In the future, the pragma would also likely need to be applied when building with Clang on AIX." Reviewers: jasonliu, hubert.reinterpretcast, sfertile, xingxue Reviewed By: sfertile Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74911	2020-02-20 14:28:48 -05:00
Simon Pilgrim	997562c67a	[X86] Fix NSW/NUW typo in avg test (PR44973) The not_avg_v16i8_wide_constants test shouldn't assume NSW/NUW for the addition of -1 - copy + paste typo from other avg tests	2020-02-20 19:22:37 +00:00
Lang Hames	14da40b10d	[ORC] Add a convenience method for setting the ExecutionSession to LLJITBuilder. Can be used to set a custom pre-configured ExecutionSession for the LLJIT or LLLazyJIT instance being constructed.	2020-02-20 11:15:08 -08:00
Lang Hames	871e8efa78	[ORC][examples] Fix ThinLtoJIT example after changes in 85fb997659b.	2020-02-20 11:15:08 -08:00
Sanjay Patel	8d3b95bafd	[x86] allow peeking through an extract_subvector to find a splatted operand The motivating case is seen in "splat4_v8f32_load_store" and based on code in PR42024: https://bugs.llvm.org/show_bug.cgi?id=42024 (I haven't stepped through the v8i32 sibling test yet to see why that diverged.) There are other potential improvements visible like allowing scalarization or vector narrowing. Differential Revision: https://reviews.llvm.org/D74909	2020-02-20 13:59:59 -05:00
Sean Fertile	e054548325	[PowerPC][NFC] Cleanup some of the Darwin mentions in the README.txt.	2020-02-20 13:57:13 -05:00
Francis Visoiu Mistrih	e96a7a4c4c	Revert "[macho][NFC] Extract all CPU_(SUB_)TYPE logic to libObject" This reverts commit 726c342ce27ada28efe90cb04ffb69c75065710a. This breaks the windows bots with linker errors.	2020-02-20 10:51:25 -08:00
Francis Visoiu Mistrih	13b4fc1781	[macho][NFC] Extract all CPU_(SUB_)TYPE logic to libObject This moves all the logic of converting LLVM Triples to MachO::CPU_(SUB_)TYPE from the specific target (Target)AsmBackend to more convenient functions in libObject. This also gets rid of the separate two X86AsmBackend classes. Differential Revision: https://reviews.llvm.org/D74808	2020-02-20 10:28:07 -08:00
Simon Pilgrim	61ba4efa56	[DAGCombiner] Use SDValue::getConstantOperandAPInt helper where possible. NFC.	2020-02-20 18:23:05 +00:00
Craig Topper	817b15d7ab	[X86] Fix a -Wparentheses warning. NFC	2020-02-20 09:32:03 -08:00
Craig Topper	e08fad5a77	[X86] Rewrite LowerBRCOND to remove dead code and handle ISD::SETCC and overflow ops directly. There's a lot of old leftover code in LowerBRCOND. Especially the detecting or AND or OR of X86ISD::SETCC nodes. Those were needed before LegalizeDAG was changed to visit nodes before their operands. It also relied on reversing the output of LowerSETCC to find the flags producing node to use for the X86ISD::BRCOND node. Rather than using LowerSETCC this patch uses emitFlagsForSetcc to handle the integer ISD::SETCC case. This gives the flag producer and the comparison code to use directly. I've removed the addTest flag and just produce a X86ISD::BRCOND and return immediately. Floating point ISD::SETCC case is just an X86ISD::FCMP with special care for OEQ and UNE derived from the previous code. I've left f128 out so it will emit a test. And LowerSETCC will be called later to produce a libcall and X86ISD::SETCC. We have combines that can merge the test and X86ISD::SETCC. We need to handle two cases for overflow ops. Either they are used directly or they have a seteq 0 or setne 1 to invert the overflow. The old code did not handle the setne 1 case, but I think some other combines were making up for it. If we fail to find a condition, we'll wrap an AND with 1 on the original condition and tell emitFlagsForSetcc to emit a compare with 0. This will pickup the LowerAndToBT and or the EmitTest case. I kept the isTruncWithZeroHighBitsInput call, but we might be able to fold that in to emitFlagsForSetcc. Differential Revision: https://reviews.llvm.org/D74750	2020-02-20 08:50:18 -08:00
Craig Topper	8c6da7f0b6	[AArch64] Move isOverflowIntrOpRes help function to the ISD namespace in SelectionDAG.h. NFC Enables sharing with an upcoming X86 change.	2020-02-20 08:50:17 -08:00
Sanjay Patel	208855c4bd	[x86] add vector tests for splatted memory ops; NFC These correspond to patterns seen in PR42024: https://bugs.llvm.org/show_bug.cgi?id=42024	2020-02-20 11:35:05 -05:00
Danilo Carvalho Grael	8bc53af309	[AArch64][SVE] Add intrinsics for SVE2 bitwise ternary operations Summary: Add intrinsics for the following operations: - eor3, bcax - bsl, bsl1n, bsl2n, nbsl Reviewers: kmclaughlin, c-rhodes, sdesmalen, efriedma, rengolin Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74785	2020-02-20 11:36:48 -05:00
Craig Topper	941f4562bb	[X86] Add DAG combines to form CVTPH2PS/CVTPS2PH from vXf16->vXf32/vXf64 fp_extends and vXf32->vXf16 fp_round. Only handle power of 2 element count for simplicity. Not sure what to do with vXf64->vXf16 fp_round to avoid double rounding Differential Revision: https://reviews.llvm.org/D74886	2020-02-20 08:26:17 -08:00

1 2 3 4 5 ...

192396 Commits