llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Stanislav Mekhanoshin	3dc15fa04e	[AMDGPU] Fix VC warning about singed/unsigned comparison. NFC. This is the warning reported in https://reviews.llvm.org/D89599	2020-10-26 11:55:57 -07:00
Joe Ellis	50136e3679	[SVE] Fix TypeSize warning in llvm::getGEPInductionOperand We do not need to use the implicit cast here. We can instead can rely on a comparison between two TypeSize objects instead. This algorithm will work fine with scalable vectors. Reviewed By: DavidTruby Differential Revision: https://reviews.llvm.org/D90146	2020-10-26 17:40:32 +00:00
Joe Ellis	fa3637aa5e	[SVE][InstCombine] Fix TypeSize warning in canReplaceGEPIdxWithZero The warning would fire when calling canReplaceGEPIdxWithZero on a GEP whose source element type is a scalable vector. The size of scalable vector types is not known, so this optimization cannot be performed. This patch fixes the issue by: - bailing out early in this routine if the GEP instruction's source element type is a scalable vector. - making use of getFixedSize -- this removes the dependency on the deprecated interface. Reviewed By: fpetrogalli Differential Revision: https://reviews.llvm.org/D89968	2020-10-26 17:40:26 +00:00
Joe Ellis	18290a4a32	[SVE][AArch64] Fix TypeSize warning in loop vectorization legality The warning would fire when calling isDereferenceableAndAlignedInLoop with a scalable load. Calling isDereferenceableAndAlignedInLoop with a scalable load would result in the use of the now deprecated implicit cast of TypeSize to uint64_t through the overloaded operator. This patch fixes this issue by: - no longer considering vector loads as candidates in canVectorizeWithIfConvert. This doesn't make sense in the context of identifying scalar loads to vectorize. - making use of getFixedSize inside isDereferenceableAndAlignedInLoop -- this removes the dependency on the deprecated interface, and will trigger an assertion error if the function is ever called with a scalable type. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D89798	2020-10-26 17:40:04 +00:00
Evgeny Leviant	2575bcd8cb	[ARM][SchedModels] Convert IsLdstsoScaledNotOptimalPred to MCSchedPredicate Differential revision: https://reviews.llvm.org/D90150	2020-10-26 20:22:41 +03:00
Evgeny Leviant	2ff9958f77	Fix issue in cortex-a57 sched model Differential revision: https://reviews.llvm.org/D90152	2020-10-26 20:16:40 +03:00
Benjamin Kramer	6892b335d9	[AMDGPU] Avoid unused variable warning in Release builds. NFC. SIRegisterInfo.cpp:480:19: error: unused variable 'SOffset'	2020-10-26 18:11:57 +01:00
Peter Waller	ff2b8ca717	[SVE][CodeGen][DAGCombiner] Fix TypeSize warning in redundant store elimination The modified code in visitSTORE was missing a scalable vector check, and still using the now deprecated implicit cast of TypeSize to uint64_t through the overloaded operator. This patch fixes these issues. This brings the logic in line with the comment on the context line immediately above the added precondition. Add a test in sve-redundant-store.ll that the warning is not triggered. Differential Revision: https://reviews.llvm.org/D89701	2020-10-26 16:37:48 +00:00
Peter Waller	752c121e75	Revert "[SVE][CodeGen][DAGCombiner] Fix TypeSize warning in redundant store elimination" This reverts commit 4604441386dc5fcd3165f4b39f5fa2e2c600f1bc. Reverting because it was not the intended version of the patch, which follows this patch.	2020-10-26 16:37:00 +00:00
Peter Waller	dec44ead4c	[SVE][CodeGen][DAGCombiner] Fix TypeSize warning in redundant store elimination The modified code in visitSTORE was missing a scalable vector check, and still using the now deprecated implicit cast of TypeSize to uint64_t through the overloaded operator. This patch fixes these issues. This brings the logic in line with the comment on the context line immediately above the added precondition. Add a test in Redundantstores.ll that the warning is not triggered.	2020-10-26 16:23:42 +00:00
Kazushi (Jam) Marukawa	4f90b31115	[VE] Add vector shift instructions Add VSLL/VSLD/VSRL/VSLA/VSLAX/VSRA/VSRAX/VSFA instructionss. Add additonal AsmParser for VSLD special operand. Also add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90143	2020-10-27 00:30:27 +09:00
Kazushi (Jam) Marukawa	f2d171e4f2	[VE] Add vector logical instructions Add VAND/VOR/VXOE/VEQV/VLDZ/VPCNT/VBRV/VSEQ instrucitons and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90141	2020-10-27 00:29:33 +09:00
Kazushi (Jam) Marukawa	d1e71a6f1c	[VE] Support atomic store Support atomic store instructions and add a regression test. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90137	2020-10-27 00:28:11 +09:00
Simon Pilgrim	dfaaccca5a	[InstCombine] collectBitParts - add bitreverse intrinsic support.	2020-10-26 14:36:36 +00:00
Jay Foad	7ac5d3d4c1	[AMDGPU] Make more use of printNamedBit in AMDGPUInstPrinter. NFC.	2020-10-26 14:03:35 +00:00
Andrew Ng	168234fec6	[llvm-ar][Object] Fix detection of need for 64-bit archive symbol tables The code to detect the requirement for 64-bit offsets in the archive symbol table was not correctly accounting for the archive file signature and the size of all the contents of the symbol table itself, e.g. the symbol table's header and string table. Also was not considering the variation in symbol table formats. This could result in the creation of large archives with a corrupt symbol table. Change the testing environment variable SYM64_THRESHOLD to be an absolute value rather than a power of 2 in order to enable precise testing of this detection code. Differential Revision: https://reviews.llvm.org/D89891	2020-10-26 12:29:28 +00:00
Simon Pilgrim	ef2af915dd	[InstCombine] Add support for zext(and(neg(amt),width-1)) rotate shift amount patterns Alive2: https://alive2.llvm.org/ce/z/bCvvHd	2020-10-26 11:22:41 +00:00
Djordje Todorovic	cee1ab5a48	[NFC][InstrRefLDV] Fix a typo	2020-10-26 04:04:16 -07:00
Kazushi (Jam) Marukawa	ce3aa09426	[VE] Add vector comparison and min/max Add VCMP/VCPS/VCPX/VCMS/VCMX vector instructions. Also add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89643	2020-10-26 18:32:04 +09:00
Kazushi (Jam) Marukawa	5f7c58423d	[VE] Add integer arithmetic vector instructions Add VADD/VADS/VADX/VSUB/VSBS/VSBX/VMPY/VMPS/VMPX/VMPD/VDIV/VDVS/VDVX instructions. Also add regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89642	2020-10-26 18:30:11 +09:00
Florian Hahn	3355197eef	[AsmPrinter] Add per BB instruction mix remark. This patch adds a remarks that provides counts for each opcode per basic block. An snippet of the generated information can be seen below. The current implementation uses the target specific opcode for the counts. For example, on AArch64 this means we currently get 2 entries for `add` instructions if the block contains 32 and 64 bit adds. Similarly, immediate version are treated differently. Unfortunately there seems to be no convenient way to get only the mnemonic part of the instruction as a string AFAIK. This could be improved in the future. ``` --- !Analysis Pass: asm-printer Name: InstructionMix DebugLoc: { File: arm64-instruction-mix-remarks.ll, Line: 30, Column: 30 } Function: foo Args: - String: 'BasicBlock: ' - BasicBlock: else - String: "\n" - String: INST_MADDWrrr - String: ': ' - INST_MADDWrrr: '2' - String: "\n" - String: INST_MOVZWi - String: ': ' - INST_MOVZWi: '1' ``` Reviewed By: anemet, thegameg, paquette Differential Revision: https://reviews.llvm.org/D89892	2020-10-26 09:25:45 +00:00
Sebastian Neubauer	d177eafb58	[AMDGPU] Emit new pal metadata by default If no pal metadata is given, default to the msgpack format instead of the legacy metadata. This makes tests better readable. Differential Revision: https://reviews.llvm.org/D90035	2020-10-26 10:16:17 +01:00
Evgeny Leviant	cc3a38e45c	[ARM][SchedModels] Rename and generalize predicate. NFC	2020-10-26 12:14:55 +03:00
Kazushi (Jam) Marukawa	a3eb3522be	[VE] Support atomic load Support atomic load instruction and add a regression test. VE uses release consitency, so need to insert fence around atomic instructions. This patch enable AtomicExpandPass and use emitLeadingFence and emitTrailingFence mechanism for such purpose. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90135	2020-10-26 18:02:45 +09:00
Evgeny Leviant	6f6abdc5f2	[ARM][SchedModels] Get rid of IsLdrAm2ScaledPred Differential revision: https://reviews.llvm.org/D90024	2020-10-26 12:01:39 +03:00
Evgeny Leviant	be3fc14b59	[ARM][SchedModels] Convert IsLdstsoMinusRegPred to MCSchedPredicate Differential revision: https://reviews.llvm.org/D90029	2020-10-26 11:54:08 +03:00
Evgeny Leviant	80a867a3d9	[ARM][SchedModels] Convert IsLdrAm3NegRegOffPred to MCSchedPredicate Differential revision: https://reviews.llvm.org/D90045	2020-10-26 11:43:02 +03:00
David Green	c8c5cd6011	[Schedule] Add a MultiHazardRecognizer This adds a MultiHazardRecognizer and starts to make use of it in the ARM backend. The idea of the class is to allow multiple independent hazard recognizers to be added to a single base MultiHazardRecognizer, allowing them to all work in parallel without requiring them to be chained into subclasses. They can then be added or not based on cpu or subtarget features, which will become useful in the ARM backend once more hazard recognizers are being used for various things. This also renames ARMHazardRecognizer to ARMHazardRecognizerFPMLx in the process, to more clearly explain what that recognizer is designed for. Differential Revision: https://reviews.llvm.org/D72939	2020-10-26 08:06:17 +00:00
Kazushi (Jam) Marukawa	75e202d847	[VE] Support atomic fence Support atomic fence instruction and add a regression test. Add MEMBARRIER pseudo insturction also to use it as a barrier against to the compiler optimizations. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90112	2020-10-26 17:03:09 +09:00
Max Kazantsev	fba008b8ac	Fix broken build after previous commit	2020-10-26 14:55:46 +07:00
Max Kazantsev	05aa5af1d6	[NFC] Remove unused funciton param	2020-10-26 14:53:22 +07:00
Max Kazantsev	655138fd3f	[NFC] Factor out common code into lambda for further improvement	2020-10-26 14:50:45 +07:00
Max Kazantsev	99fbcc22a1	[IndVars] Use contextual knowledge when proving trivial conds No exact example where it would help, but it's a generally a more powerful way to prove predicates.	2020-10-26 13:48:32 +07:00
Christudasan Devadasan	53166b8424	[AMDGPU] Avoid offset register in MUBUF for direct stack object accesses We use an absolute address for stack objects and it would be necessary to have a constant 0 for soffset field. Fixes: SWDEV-228562 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D89234	2020-10-26 11:08:37 +05:30
Craig Topper	ed9b546976	[X86] Don't disassemble wbinvd with 0xf2 or 0x66 prefix. The 0xf3 prefix has been defined as wbnoinvd on Icelake Server. So the prefix isn't ignored by the CPU. AMD documentation suggests that wbnoinvd is treated as wbinvd on older processors. Intel documentation is not clear. Perhaps 0xf2 and 0x66 are treated the same, but its not documented. This patch changes TB to PS in the td file so 0xf2 and 0x66 will be treated as errors. This matches versions of objdump after wbnoinvd was added.	2020-10-25 20:56:01 -07:00
Liu, Chen3	9e9f59e512	[X86] VEX/EVEX prefix doesn't work for inline assembly. For now, we lost the encoding information if we using inline assembly. The encoding for the inline assembly will keep default even if we add the vex/evex prefix. Differential Revision: https://reviews.llvm.org/D90009	2020-10-26 08:37:45 +08:00
Craig Topper	73bb226dd9	[X86] Use TargetConstant for immediates for VASTART_SAVE_XMM_REGS.	2020-10-25 12:52:56 -07:00
Craig Topper	89b0e93980	[X86] Use TargetConstant instead of Constant for operands to X86vaarg64.	2020-10-25 12:24:59 -07:00
Nikita Popov	049b2e39fb	[SCEV] Strenthen nowrap flags after constant folding for mul exprs Same change as 0dda6333175c1749f12be660456ecedade3bcf21, but for mul expressions. We want to first fold any constant operans and then strengthen the nowrap flags, as we can compute more precise flags at that point.	2020-10-25 19:43:58 +01:00
Nikita Popov	1b8fa86304	[SCEV] Always constant fold mul expression operands Establish parity with the handling of add expressions, by always constant folding mul expression operands before checking the depth limit (this is a non-recursive simplification). The code was already unconditionally constant folding the case where all operands were constants, but was not folding multiple constant operands together if there were also non-constant operands. This requires picking out a different demonstration for depth-based folding differences in the limit-depth.ll test.	2020-10-25 18:50:06 +01:00
Nikita Popov	d01a625d45	[SCEV] Separate out constant folding in mul expr creation Separate out the code handling constant folding into a separate block, that is independent of other folds that need a constant first operand. Also make some minor adjustments to make the constant folding look nearly identical to the same code in getAddExpr(). The only reason this change is not strictly NFC is that the C1(C2+V) fold is moved below the constant folding, which means that it now also applies to C1C2*(C3+V), as it should.	2020-10-25 18:46:50 +01:00
Nikita Popov	fa8894df7b	[SCEV] Strength nowrap flags after constant folding We should first try to constant fold the add expression and only strengthen nowrap flags afterwards. This allows us to determine stronger flags if e.g. only two operands are left after constant folding (and thus "guaranteed no wrap region" code applies) or the resulting operands are non-negative and thus nsw->nuw strengthening applies.	2020-10-25 18:00:22 +01:00
Sanjay Patel	aa7efec9b3	[InstSimplify] peek through 'not' operand in logic-of-icmps fold This extends D78430 to solve cases like: https://llvm.org/PR47858 There are still missed opportunities shown in the tests, and as noted in the earlier patches, we have related functionality in InstCombine, so we may want to extend other folds in a similar way. A semi-random sampling of test diff proofs in this patch: https://rise4fun.com/Alive/sS4C	2020-10-25 11:13:30 -04:00
Simon Pilgrim	766ee2bbfc	Remove superfluous whitespace around if(). NFC.	2020-10-25 14:38:16 +00:00
Simon Pilgrim	c5969773c7	[InstCombine] matchBSwapOrBitReversem - recognise or(fshl(),fshl()) bswap patterns. I'm not certain InstCombinerImpl::matchBSwapOrBitReverse needs to filter the or(op0(),op1()) ops - there are just too many cases that recognizeBSwapOrBitReverseIdiom/collectBitParts handle now (and quickly).	2020-10-25 10:17:45 +00:00
Craig Topper	79ebcd9370	[X86] Use TargetConstant for FPDiff with X86::TC_RETURN. It's required to be a constant and can never be in a register so make it explicit.	2020-10-25 00:29:11 -07:00
Arthur Eubanks	8eb1142964	Fix typo SSC -> SCC	2020-10-24 16:26:48 -07:00
Fangrui Song	123fab6226	[X86] Produce R_X86_64_GOTPCRELX for test/binop instructions (MOV32rm/TEST32rm/...) when -Wa,-mrelax-relocations=yes is enabled We have been producing R_X86_64_REX_GOTPCRELX (MOV64rm/TEST64rm/...) and R_X86_64_GOTPCRELX for CALL64m/JMP64m without the REX prefix since 2016 (to be consistent with GNU as), but not for MOV32rm/TEST32rm/...	2020-10-24 15:14:17 -07:00
Fangrui Song	bf4d4552bd	[AArch64][GlobalISel] Fix -Wunused-variable. NFC	2020-10-24 12:47:11 -07:00
Benjamin Kramer	1f13ddec12	[X86] Add a stub for Intel's alderlake. No scheduling, no autodetection.	2020-10-24 19:01:22 +02:00

1 2 3 4 5 ...

140456 Commits