llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Matt Arsenault	08da10f080	GlobalISel: Correct result type for G_FCMP in lowerFPTOUI Using the final result type doesn't make any sense. Use the natural default boolean type for the select condition.	2020-01-06 17:21:51 -05:00
Matt Arsenault	546a3ee122	GlobalISel: Start adding computeNumSignBits to GISelKnownBits	2020-01-06 17:21:51 -05:00
Matt Arsenault	5365e52310	AMDGPU: Fix legalizing f16 fpow The existing test only covered one case for r600. The use of mul_legacy also looks suspicious to me, but leave it for now. The patterns are also not making use of source modifiers.	2020-01-06 17:21:51 -05:00
Matt Arsenault	d483a5285e	AMDGPU: Use ImmLeaf This solves one GlobalISel importer error, but the pattern still fails for another reason.	2020-01-06 17:21:51 -05:00
Matt Arsenault	e299148752	AMDGPU: Use ImmLeaf for inline immediate predicates	2020-01-06 17:21:51 -05:00
Matt Arsenault	51e4379d7b	llc/MIR: Fix setFunctionAttributes for MIR functions A random set of attributes are implemented by llc/opt forcing the string attributes on the IR functions before processing anything. This would not happen for MIR functions, which have not yet been created at this point. Use a callback in the MIR parser, purely to avoid dealing with the ugliness that the command line flags are in a .inc file, and would require allowing access to these flags from multiple places (either from the MIR parser directly, or a new utility pass to implement these flags). It would probably be better to cleanup the flag handling into a separate library. This is in preparation for treating more command line flags with a corresponding function attribute in a more uniform way. The fast math flags in particular have a messy system where the command line flag sets the behavior from a function attribute if present, and otherwise the command line flag. This means if any other pass tries to inspect the function attributes directly, it will be inconsistent with the intended behavior. This is also inconsistent with the current behavior of -mcpu and -mattr, which overwrites any pre-existing function attributes. I would like to move this to consistenly have the command line flags not overwrite any pre-existing attributes, and to always ensure the command line flags are consistent with the function attributes.	2020-01-06 17:21:51 -05:00
Craig Topper	da61391439	[X86] Improve v4i32->v4f64 uint_to_fp for AVX1/AVX2 targets. Use zext+or+fsub to do the conversion. Similar to D71971. Differential Revision: https://reviews.llvm.org/D71971	2020-01-06 14:07:35 -08:00
Craig Topper	e8b3a53224	[LegalizeTypes] Add widening support for STRICT_FSETCC/FSETCCS This patch adds widening which really just scalarizes because we don't have a strategy for the extra elements we would need to pad with. Differential Revision: https://reviews.llvm.org/D72193	2020-01-06 13:45:55 -08:00
Craig Topper	b9ec974e54	[X86] Rename vec-strict--cmp.ll to vec-strict-cmp-.ll to match other strict files wich have the size at the end. NFC	2020-01-06 13:45:55 -08:00
Sanjay Patel	f58158f235	[x86] add tests for concat self + shuffle; NFC	2020-01-06 15:28:33 -05:00
Evgenii Stepanov	70efb3f71b	Lower TAGPstack with negative offset to SUBG. Summary: This never really occurs in the current codegen, so only a MIR test is possible. Reviewers: ostannard, pcc Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72123	2020-01-06 11:48:35 -08:00
Amara Emerson	65d30d2d20	[X86] Fix an 8 bit testb being selected when folding a volatile i32 load pattern. Differential Revision: https://reviews.llvm.org/D71581	2020-01-06 11:46:42 -08:00
Jinsong Ji	3b935c2612	[PowerPC][LoopVectorize] Extend getRegisterClassForType to consider double and other floating point type In https://reviews.llvm.org/D67148, we use isFloatTy to test floating point type, otherwise we return GPRRC. So 'double' will be classified as GPRRC, which is not accurate. This patch covers other floating point types. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D71946	2020-01-06 18:44:59 +00:00
Nico Weber	8b63012d97	Make check-llvm run 50% faster on macOS, 18% faster on Windows. While looking at cycle time graphs of some of my bots, I noticed that 327894859cc made check-llvm noticeably slower on macOS and Windows. As it turns out, the 5 substitutions added in that change were enough to cause lit to thrash the build-in cache in re.compile() (re.sub() is implemented as re.compile().sub()), and apparently applySubstitutions() is on the cricital path and slow when all regexes need to compile all the time. (See `_MAXCACHE = 512` in cpython/Lib/re.py) Supporting full regexes for lit substitutions seems a bit like overkill, but for now add a simple unbounded cache to recover the lost performance. No intended behavior change.	2020-01-06 12:57:42 -05:00
Fangrui Song	6817d16707	[llvm-readelf] Print EI_ABIVERSION as decimal instead of hexadecimal This matches GNU readelf and llvm-readobj. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D72234	2020-01-06 09:25:45 -08:00
LLVM GN Syncbot	bc9205ef6b	[gn build] Port 350da402ef6	2020-01-06 17:22:13 +00:00
diggerlin	799cbce7e6	[AIX] Use csect reference for function address constants SUMMARY: We currently emit a reference for function address constants as labels; for example: foo_ptr: .long foo however, there may be no such label in the case where the function is undefined. Although the label exists when the function is defined, we will (to be consistent) also use a csect reference in that case. Address one comment https://reviews.llvm.org/D71144#inline-653255 Reviewers: daltenty,hubert.reinterpretcast,jasonliu,Xiangling_L Subscribers: cebowleratibm, wuzish, nemanjai Differential Revision: https://reviews.llvm.org/D71144	2020-01-06 11:45:00 -05:00
David Green	abc5e9dd7d	[ARM] Use the correct opcodes for Thumb2 segmented stack frame lowering The segmented stack lowering code appears to be using ARM opcodes under Thumb2. The MRC opcode will be the same for Thumb and ARM, but t2LDR seems wrong. Either way, using the correct thumb vs arm opcodes is more correct. Differential Revision: https://reviews.llvm.org/D72074	2020-01-06 16:38:49 +00:00
David Green	c2667b0918	[ARM] Use correct TRAP opcode for thumb in FastISel We were previously unconditionally using the ARM::TRAP opcode, even under Thumb. My understanding is that these are essentially the same thing (they both result in a trap under Thumb), but the ARM::TRAP opcode is marked as requiring IsARM, so it is more correct to use ARM::tTRAP. Differential Revision: https://reviews.llvm.org/D72075	2020-01-06 16:38:49 +00:00
diggerlin	6c72101f26	[AIX] Use csect reference for function address constants SUMMARY: We currently emit a reference for function address constants as labels; for example: foo_ptr: .long foo however, there may be no such label in the case where the function is undefined. Although the label exists when the function is defined, we will (to be consistent) also use a csect reference in that case. Reviewers: daltenty,hubert.reinterpretcast,jasonliu,Xiangling_L Subscribers: cebowleratibm, wuzish, nemanjai Differential Revision: https://reviews.llvm.org/D71144	2020-01-06 11:38:22 -05:00
Simon Pilgrim	9ec7402b13	[AMDGPU] Fix "use of uninitialized variable" static analyzer warning. NFCI. Add "unreachable" default case to AMDGPUTargetStreamer::getArchNameFromElfMach	2020-01-06 16:36:56 +00:00
Simon Pilgrim	db77f4ab04	Fix "use of uninitialized variable" static analyzer warnings. NFCI. Add "unreachable" default cases like we do for the other switch()s in X86MCInstLower::Lower	2020-01-06 16:36:56 +00:00
Simon Pilgrim	28d04b5fa9	Fix "use of uninitialized variable" static analyzer warning. NFCI.	2020-01-06 16:36:56 +00:00
Simon Tatham	60e89f6b5d	[ARM,MVE] Fix many signedness errors in MVE intrinsics. Summary: Running an end-to-end test last week I noticed that a lot of the ACLE intrinsics that operate differently on vectors of signed and unsigned integers were ending up generating the signed version of the instruction unconditionally. This is because the IR intrinsics had no way to distinguish signed from unsigned: the LLVM type system just calls them both `v8i16` (or whatever), so you need either separate intrinsics for signed and unsigned, or a flag parameter that tells ISel which one to choose. This patch fixes all the problems of that kind that I've noticed, by adding an i32 flag parameter to many of the IR intrinsics which is set to 1 for unsigned (matching the existing practice in cases where we got it right), and conditioning all the isel patterns on that flag. So the fundamental change is in `IntrinsicsARM.td`, changing the low-level IR intrinsics API; there are knock-on changes in `arm_mve.td` (adjusting code gen for the ACLE intrinsics to use the modified API) and in `ARMInstrMVE.td` (adjusting isel to expect the new unsigned flags). The rest of this patch is boringly updating tests. Reviewers: dmgreen, miyuki, MarkMurrayARM Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72270	2020-01-06 16:33:16 +00:00
Simon Tatham	8539ab95a4	[ARM,MVE] Support -ve offsets in gather-load intrinsics. Summary: The ACLE intrinsics with `gather_base` or `scatter_base` in the name are wrappers on the MVE load/store instructions that take a vector of base addresses and an immediate offset. The immediate offset can be up to 127 times the alignment unit, and it can be positive or negative. At the MC layer, we got that right. But in the Sema error checking for the wrapping intrinsics, the offset was erroneously constrained to be positive. To fix this I've adjusted the `imm_mem7bit` class in the Tablegen that defines the intrinsics. But that causes integer literals like `0xfffffffffffffe04` to appear in the autogenerated calls to `SemaBuiltinConstantArgRange`, which provokes a compiler warning because that's out of the non-overflowing range of an `int64_t`. So I've also tweaked `MveEmitter` to emit that as `-0x1fc` instead. Updated the tests of the Sema checks themselves, and also adjusted a random sample of the CodeGen tests to actually use negative offsets and prove they get all the way through code generation without causing a crash. Reviewers: dmgreen, miyuki, MarkMurrayARM Reviewed By: dmgreen Subscribers: kristof.beyls, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72268	2020-01-06 16:33:07 +00:00
Simon Tatham	83d15c1840	[ARM,MVE] Generate the right instruction for vmaxnmq_m_f16. Summary: Due to a copy-paste error in the isel patterns, the predicated version of this intrinsic was expanding to the `VMAXNMT.F32` instruction instead of `VMAXNMT.F16`. Similarly for vminnm. Reviewers: dmgreen, miyuki, MarkMurrayARM Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72269	2020-01-06 16:28:20 +00:00
Matt Arsenault	3006dd38dd	AMDGPU/GlobalISel: Select scalar v2s16 G_BUILD_VECTOR	2020-01-06 11:19:33 -05:00
Matt Arsenault	63e136c627	AMDGPU/GlobalISel: Select more G_EXTRACTs correctly This assumed a 32-bit extract size, which would produce invalid copies with 64-bit extracts. Handle the easy case. Ideally we would have a way to get the proper subreg index for any 32-bit offset, but there should probably be a tablegenerated way of getting the subreg index for any size and offset.	2020-01-06 11:10:13 -05:00
Simon Pilgrim	159c5f3b0c	[X86] Add extra PR43971 test case mentioned in D70267	2020-01-06 13:44:55 +00:00
Simon Pilgrim	df5719cf52	[CostModel][X86] Add missing scalar i64->f32 uitofp costs	2020-01-06 13:17:02 +00:00
Simon Pilgrim	91eba037c5	[DAG] DAGCombiner::XformToShuffleWithZero - use APInt::extractBits helper. NFCI.	2020-01-06 13:17:02 +00:00
James Henderson	5e309984b7	[test][DebugInfo][NFC] Rename method for clarity The checkGetOrParseLineTableEmitsError function could end up generating both recoverable and unrecoverable errors, but it is only intended for handling the latter. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D72156	2020-01-06 11:30:26 +00:00
James Henderson	91705af363	[NFC] Fix trivial typos in comments Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D72143 Patch by Kazuaki Ishizaki.	2020-01-06 10:50:26 +00:00
Sjoerd Meijer	53b50e4755	[ARM][MVE] More MVETailPredication debug messages. NFC. I've added a few more debug messages to MVETailPredication because I wanted to trace better which instructions are added/removed. And while I was at it, I factored out one function which I thought was clearer, and have added some comments to describe better the flow between MVETailPredication and ARMLowOverheadLoops. Differential Revision: https://reviews.llvm.org/D71549	2020-01-06 09:56:02 +00:00
Shengchen Kan	87257b3ef4	Add interface emitPrefix for MCCodeEmitter Differential Revision: https://reviews.llvm.org/D72047	2020-01-06 17:51:14 +08:00
Ehud Katz	75c38fdddc	[APFloat] Fix compilation warnings	2020-01-06 11:30:40 +02:00
Neil Henning	fdbe9ceae2	Add ExternalAAWrapperPass to createLegacyPMAAResults. Our out-of-tree custom aliasing solution for the HPC# Burst compiler here at Unity makes use of the `ExternalAAwrapperPass` infrastructure to insert our custom aliasing resolution into the core of LLVM. This is great for all cases except for function inlining, where because `createLegacyPMAAResults` does not make use of `ExternalAAWrapperPass`, when we have a definite no-alias result within a function it won't be propagated to the calling function during inlining. This commit just rectifies this oversight by adding the missing dependency. Differential Revision: https://reviews.llvm.org/D71348	2020-01-06 08:50:18 +00:00
Ehud Katz	c279732bd6	[APFloat] Add recoverable string parsing errors to APFloat Implementing the APFloat part in PR4745. Differential Revision: https://reviews.llvm.org/D69770	2020-01-06 10:09:01 +02:00
Anton Afanasyev	7d50198c39	[Metadata] Add TBAA struct metadata to `AAMDNode` Summary: Make `AAMDNodes`' `getAAMetadata()` and `setAAMetadata()` to take `!tbaa.struct` into account as well as `!tbaa`. This impacts llvm.org/pr42022. This is a temprorary fix needed to keep `!tbaa.struct` tag by SROA pass. New field `TBAAStruct` should be deleted when `!tbaa` tag replaces `!tbaa.struct`. Merging two `!tbaa.struct`'s to one is conservatively considered to be `nullptr` (giving `MayAlias`) -- this could be enhanced, but relying on the said future replacement. Reviewers: RKSimon, spatel, vporpo Subscribers: hiraditya, kosarev, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70924	2020-01-06 11:05:15 +03:00
Craig Topper	ef76ff13d3	[TargetLowering] Use SETCC input type to call getBooleanContents instead of the setcc result type. This isn't a functonal change since we also check the bit width is the same and the input type is integer. This guarantees the input and output type are the same. But passing the input type makes the code more readable.	2020-01-05 23:15:49 -08:00
Fangrui Song	20e1b9e897	[MC] Reorder members of MCFragment's subclasses to decrease padding On a 64-bit platform: sizeof(MCBoundaryAlignFragment): 64 -> 56 sizeof(MCOrgFragment): 72 -> 64 sizeof(MCFillFragment): 80 -> 72 sizeof(MCLEBFragment): 88 -> 80	2020-01-05 20:22:16 -08:00
Fangrui Song	2e7fc6eeed	[MC] Reorder MCFragment members to decrease padding sizeof(MCFragment) does not change, but some if its subclasses do, e.g. on a 64-bit platform, sizeof(MCEncodedFragment) decreases from 64 to 56, sizeof(MCDataFragment) decreases from 224 to 216.	2020-01-05 19:09:40 -08:00
QingShan Zhang	7dfabff224	[DAGCombine] Don't check the legality of type when combine the SIGN_EXTEND_INREG This is the DAG node for SIGN_EXTEND_INREG : t21: v4i32 = sign_extend_inreg t18, ValueType:ch:v4i16 It has two operands. The first one is the value it want to extend, and the second one is the type to specify how to extend the value. For this example, it means that, it is signed extend the t18(v4i32) from v4i16 to v4i32. That is the semantics of c code: vector int foo(vector int m) { return m << 16 >> 16; } And it could be any vector type that hardware support the operation, though the type 'v4i16' is NOT legal for the target. When we are trying to combine the srl + sra, what we did now is calling the TLI.isOperationLegal(), which will also check the legality of the type. That doesn't make sense. Differential Revision: https://reviews.llvm.org/D70230	2020-01-06 03:00:58 +00:00
Fangrui Song	241548a521	[MC] Delete MCFragment::isDummy. NFC isa<...>, dyn_cast<...> and cast<...> are used by other fragments. Don't make MCDummyFragment special.	2020-01-05 18:49:47 -08:00
Craig Topper	19d6875129	[X86] Improve v2i64->v2f32 and v4i64->v4f32 uint_to_fp on avx and avx2 targets. Summary: Based on Simon's D52965, but improved to handle strict fp and improve some of the shuffling. Rather than use v2i1/v4i1 and let type legalization continue, just generate all the code with legal types and use an explicit shuffle. I also added an explicit setcc to the v4i64 code to match the semantics of vselect which doesn't just use the sign bit. I'm also using a v4i64->v4i32 truncate instead of the shuffle in Simon's original code. With the setcc this will become a pack. Future work can look into using X86ISD::BLENDV and a different shuffle that only moves the sign bit. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71956	2020-01-05 17:44:08 -08:00
Liu, Chen3	6ec5dfb8d7	[NFC] Modify the format: Drop the else since we alerady returned in the if.	2020-01-06 09:35:19 +08:00
Brian Gesiak	5837e6239c	[Coroutines] Remove corresponding phi values when apply simplifyTerminatorLeadingToRet Summary: In addMustTailToCoroResumes, we set musttail on those resume instructions that are followed by a ret instruction. This is done by simplifyTerminatorLeadingToRet which replace a sequence of branches leading to a ret with a clone of the ret. However it forgets to remove corresponding PHI values that come from basic block of replaced branch, and may cause jumpthreading pass hangs (https://bugs.llvm.org/show_bug.cgi?id=43720) This patch fix this issue Test Plan: cppcoro library with O3+flto check-llvm Reviewers: modocache, GorNishanov, lewissbaker Reviewed By: modocache Subscribers: mehdi_amini, EricWF, hiraditya, dexonsmith, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71826 Patch by junparser (JunMa)!	2020-01-05 18:26:30 -05:00
Fangrui Song	f4801ca87c	[MC][ARM] Delete MCSection::HasData and move SHF_ARM_PURECODE logic to ARMELFObjectWriter::addTargetSectionFlags This simplifies the generic interface and also makes SHF_ARM_PURECODE more robust (fixes a TODO). Inspecting MCDataFragment contents covers more cases than MCObjectStreamer::EmitBytes.	2020-01-05 14:20:34 -08:00
Fangrui Song	b05126b279	[MC] Delete MCSection::{rbegin,rend}	2020-01-05 12:51:15 -08:00
Fangrui Song	45f2208643	[MC] Merge MCSymbol::getSectionPtr into getSection and simplify	2020-01-05 12:03:40 -08:00

1 2 3 4 5 ...

189648 Commits