llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
David Tenty	c0fffb92d9	[DebugInfo] Fix LineTest byteswap for cross-targeting builds Summary: The byte swap fix for big endian hosts in 9782c922cb21 (for D81570) swaps based on the host endianness, but for cross-targeting builds (i.e. big endian host targeting little endian) the host endianness won't necessarily match the generated DWARF. This change updates the test to use symmetrical constants so the results aren't endian dependent. Reviewers: jhenderson, hubert.reinterpretcast, stevewan, ikudrin Reviewed By: ikudrin Subscribers: ikudrin, aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82827	2020-07-02 14:39:42 -04:00
Arthur Eubanks	abe0d64460	[NewPM] Add -basic-aa to pr33196.ll The legacy pass manager implicitly adds BasicAA, but the new PM does not. This causes pr33196.ll to fail under NPM. There are almost certainly lots of other failures like this, wanted to get some input on if adding -basic-aa to tests makes sense at scale. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D82915	2020-07-02 11:27:52 -07:00
Nico Weber	6d6c8377f5	[gn build] make building on an arm mac work Currently requires `llvm_targets_to_build = [ "X86", "AArch64" ]`: building just the host arch (i.e. aarch64) causes some linker errors.	2020-07-02 14:22:33 -04:00
Arthur Eubanks	c38ff9ab39	[NewPM][LSR] Rename strength-reduce -> loop-reduce The legacy pass was called "loop-reduce". This lowers the number of check-llvm failures under NPM by 83. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D82925	2020-07-02 11:15:29 -07:00
sstefan1	cf13fc2dbd	[OpenMPOpt][Fix] Remove double initialization of omp::types.	2020-07-02 19:51:54 +02:00
Nemanja Ivanovic	84b83b6cc1	[PowerPC] Remove undefs from splat input when changing shuffle mask As of 1fed131660b2c5d3ea7007e273a7a5da80699445, we have code that changes shuffle masks so that we can put the shuffle in a canonical form that can be matched to a single instruction. However, it does not properly account for undef elements in the BUILD_VECTOR that is the RHS splat so we can end up with undefs where they shouldn't be. This patch converts the splat input with undefs to one without.	2020-07-02 12:26:56 -05:00
Simon Pilgrim	50de1da1b9	[InstCombine] Add some sext/trunc tests to show missing support for non-uniform vectors	2020-07-02 17:11:56 +01:00
Sander de Smalen	3254840625	[AArch64][SVE] NFC: Rename isOrig -> isReverseInstr This is a non-functional change to clarify some of the terminology in the AArch64SVEInstrInfo/SVEInstrFormats.td files around the tables for mapping an instruction to its reverse instruction counterpart, and vice versa, e.g. DIV -> DIVR and DIVR -> DIV. Reviewers: paulwalker-arm, cameron.mcinally, rengolin, efriedma Reviewed By: paulwalker-arm, efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D82979	2020-07-02 17:01:15 +01:00
Simon Pilgrim	d6a5eff9f1	[InstCombine] Add (vXi1 trunc(lshr(x,c))) -> icmp_eq(and(x,c')) support for non-uniform vectors As noted on PR46531, we were only performing this transform on uniform vectors as we were using the m_APInt pattern matcher to extract the shift amount. Differential Revision: https://reviews.llvm.org/D83035	2020-07-02 16:56:33 +01:00
Simon Pilgrim	16b26b11ea	[InstCombine] Add some (vXi1 trunc(lshr(x,c))) -> icmp_eq(and(x,c')) tests for vectors with undef elements Suggested on D83035	2020-07-02 16:04:30 +01:00
Ryan Santhiraraja	b3878897c6	Preserve GlobalsAA analysis result in LowerConstantIntrinsics LowerConstantIntrinsics fails to preserve the analysis result of GlobalsAA. Not preserving the analysis might affect benchmark performance. This change fixes this issue. Patch by Ryan Santhiraraja <rsanthir@quicinc.com> Reviewers: fpetrogalli, joerg, fhahn Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D82342	2020-07-02 15:40:41 +01:00
Dmitry Preobrazhensky	9877ca8a82	[AMDGPU][CODEGEN] Added support of new inline assembler constraints Added support for constraints 'I', 'J', 'B', 'C', 'DA', 'DB'. See https://gcc.gnu.org/onlinedocs/gcc/Machine-Constraints.html#Machine-Constraints. Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D81651	2020-07-02 17:20:15 +03:00
Jon Roelofs	76a7da04bb	Fix missing build dependencies on omp_gen Differential Revision: https://reviews.llvm.org/D83003	2020-07-02 07:55:20 -06:00
Nathan James	5b06ff50e1	[ASTMatchers] Enhanced support for matchers taking Regex arguments Added new Macros `AST(_POLYMORPHIC)_MATCHER_REGEX(_OVERLOAD)` that define a matchers that take a regular expression string and optionally regular expression flags. This lets users match against nodes while ignoring the case without having to manually use `[Aa]` or `[A-Fa-f]` in their regex. The other point this addresses is in the current state, matchers that use regular expressions have to compile them for each node they try to match on, Now the regular expression is compiled once when you define the matcher and used for every node that it tries to match against. If there is an error while compiling the regular expression an error will be logged to stderr showing the bad regex string and the reason it couldn't be compiled. The old behaviour of this was down to the Matcher implementation and some would assert, whereas others just would never match. Support for this has been added to the documentation script as well. Support for this has been added to dynamic matchers ensuring functionality is the same between the 2 use cases. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D82706	2020-07-02 14:52:25 +01:00
Nathan James	888ea87b16	call ::pthread_detach on llvm_execute_on_thread_impl Fixes all TSAN bugs in clangd Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D83039	2020-07-02 14:41:05 +01:00
Simon Pilgrim	5d2157065e	Regenerate sext/trunc tests and replace %tmp variable names to silence update_test_checks warnings	2020-07-02 14:37:21 +01:00
Sander de Smalen	67ab949978	[AArch64][SVE] Put zeroing pseudos and patterns under flag. This patch puts the _ZERO pseudos and corresponding patterns under the predicate 'UseExperimentalZeroingPseudos', so that they can be enabled/disabled through compile flags. This is done because the zeroing pseudos use MOVPRFX to do merging of the inactive lanes, but it depends on the uarch whether this operation is actually merged with the destructive operation. If not, it may be more profitable to use a SELECT and to give the compiler the freedom to schedule these instructions as normal, rather than keeping them bundled together. Additionally, this feature is not yet fully implemented and there are still known bugs (see D80410) that need to be resolved before the 'experimental' can be dropped from the name. Reviewers: paulwalker-arm, cameron.mcinally, efriedma Reviewed By: paulwalker-arm Tags: #llvm Differential Revision: https://reviews.llvm.org/D82780	2020-07-02 14:24:33 +01:00
David Green	cee41879a1	[BasicAA] Fix recursive phi MustAlias calculations With the option -basic-aa-recphi we can detect recursive phis that loop through constant geps, which allows us to detect more no-alias case for pointer IV's. If the other phi operand and the other alias value are MustAlias though, we cannot presume that every element in the loop is also MustAlias. We need to instead be conservative and return MayAlias. Differential Revision: https://reviews.llvm.org/D82987	2020-07-02 14:01:38 +01:00
Guillaume Chatelet	132b11f5e0	[Alignment][NFC] Transition and simplify calls to DL::getABITypeAlignment This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Differential Revision: https://reviews.llvm.org/D82977	2020-07-02 11:28:02 +00:00
Guillaume Chatelet	1a720eec07	[Alignment][NFC] VectorLayout now uses Align internally By rewriting `ScalarizerVisitor::getVectorLayout` so that it returns `VectorLayout` (or `None`), it becomes obvious that `VectorLayout::VecAlign` cannot be `0`. This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Differential Revision: https://reviews.llvm.org/D82981	2020-07-02 11:25:55 +00:00
Kerry McLaughlin	07d934b054	[AArch64][SVE] Add reg+imm addressing mode for unpredicated stores Reviewers: sdesmalen, efriedma, david-arm Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82985	2020-07-02 12:00:01 +01:00
Simon Pilgrim	4e895924ce	[InstCombine] Add some (vXi1 trunc(lshr(x,c))) -> icmp_eq(and(x,c')) tests for non-uniform vectors As noticed on PR46531	2020-07-02 11:56:51 +01:00
Simon Pilgrim	b350b4ff82	Regenerate apint-shift tests and replace %tmp variable names to silence update_test_checks warnings	2020-07-02 11:56:51 +01:00
Anna Welker	18bf72a2b7	[LV] Enable the LoopVectorizer to create pointer inductions This patch enables the LoopVectorizer to build a phi of pointer type and provide the vector loads and stores with vector type getelementptrs built from the pointer induction variable, which produces much less instructions than the previous approach of creating scalar getelementpointers and glue them together to a vector. Differential Revision: https://reviews.llvm.org/D81267	2020-07-02 11:39:28 +01:00
Roman Lebedev	9bb300da96	Regenerate llvm/test/CodeGen/X86/optimize-max-0.ll It surprisingly appears to be affected by the last SCEV patch	2020-07-02 13:35:30 +03:00
Roman Lebedev	bac1e3b4d7	[ScalarEvolution] createSCEV(): recognize `udiv`/`urem` disguised as an `sdiv`/`srem` Summary: While InstCombine trivially converts that `srem` into a `urem`, it might happen later than wanted, in particular i'd like for that to happen on https://godbolt.org/z/bwuEmJ test case early in pipeline, before first instcombine run, just before `-mem2reg`. SCEV should recognize this case natively. Reviewers: mkazantsev, efriedma, nikic, reames Reviewed By: efriedma Subscribers: clementval, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82721	2020-07-02 13:22:12 +03:00
David Green	c309732838	[BasicAA] New basic-aa-recphi test. NFC	2020-07-02 10:54:01 +01:00
LLVM GN Syncbot	294e5820f1	[gn build] Port 804d9687443	2020-07-02 09:46:36 +00:00
David Sherwood	4591bf7068	[SVE] Add warnings checks in four more LLVM SVE tests I have added CHECK lines to the following tests: llvm/test/CodeGen/AArch64/sve-breakdown-scalable-vectortype.ll llvm/test/CodeGen/AArch64/sve-calling-convention-tuple-types.ll llvm/test/CodeGen/AArch64/sve-intrinsics-create-tuple.ll llvm/test/CodeGen/AArch64/sve-intrinsics-loads.ll since they are now free of warnings related to invalid use of EVT::getVectorNumElements() and VectorType::getNumElements(). Differential Revision: https://reviews.llvm.org/D82957	2020-07-02 10:43:17 +01:00
Ben Dunbobbin	d762921b07	[Support][Windows] Prevent 2s delay when renaming a file that does not exist Differential Revision: https://reviews.llvm.org/D82542	2020-07-02 10:41:17 +01:00
Nuno Lopes	fcb721b853	DSE: fix builtin function recognition to take decl into account	2020-07-02 10:28:47 +01:00
Jay Foad	079643288c	[AMDGPU] Fix formatting in MIR tests	2020-07-02 10:27:34 +01:00
Sander de Smalen	58e4673769	[CodeGen][SVE] Don't drop scalable flag in DAGCombiner::visitEXTRACT_SUBVECTOR There was a rogue 'assert' in AArch64ISelLowering for the tuple.get intrinsics, that shouldn't really have been there (I suspect this was a remnant from when we expected the wider vector always to have come from a vector CONCAT). When I tried to create a more minimal reproducer, I found a bug in DAGCombiner where it drops the scalable flag when trying to fold: extract_subv (bitcast X), Index --> bitcast (extract_subv X, Index') This patch fixes both issues. Reviewers: david-arm, efriedma, spatel Reviewed By: efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D82910	2020-07-02 10:16:43 +01:00
Sander de Smalen	b6c6989aee	[AArch64][SVE] Add unpred load/store patterns for bf16 types Reviewers: kmclaughlin, c-rhodes, efriedma Reviewed By: efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D82909	2020-07-02 10:01:24 +01:00
Qiu Chaofan	5777c725bf	[NFC] Fix typo in triples from unkown to unknown	2020-07-02 16:21:54 +08:00
Nicholas Guy	f4899fdead	[ARM] Rearrange SizeReduction when using -Oz Move the Thumb2SizeReduce pass to before IfConversion when optimising for minimal code size. Running the Thumb2SizeReduction pass before IfConversion allows T1 instructions to propagate to the final output, rather than the ifConverter modifying T2 instructions and preventing them from being reduced later. This change does introduce a regression regarding execution time, so it's only applied when optimising for size. Running the LLVM Test Suite with this change produces a geomean difference of -0.1% for the size..text metric. Differential Revision: https://reviews.llvm.org/D82439	2020-07-02 09:19:38 +01:00
David Sherwood	f46a2e5105	[CodeGen] Fix warnings in getCopyToPartsVector Whilst trying to assemble the following test: clang/test/CodeGen/aarch64-sve-intrinsics/acle_sve_set2.c I discovered we were hitting some warnings about possible invalid calls to getVectorNumElements() in getCopyToPartsVector(). I've tried to fix these by using ElementCount types where possible and I've made the assumption that we don't support using a fixed width vector to copy parts of a scalable vector, and vice versa. Looking at how the copy is implemented I think that's the right thing for now. Differential Revision: https://reviews.llvm.org/D82744	2020-07-02 09:08:20 +01:00
Craig Topper	e3a77ca3c8	[X86] Enable multibyte NOPs in 64-bit mode for padding/alignment. The default CPU used by llvm-mc doesn't have the NOPL feature, but if we know we're compiling in 64-bit mode we should be able to use nopl.	2020-07-01 23:59:01 -07:00
Krzysztof Pszeniczny	bc78f19428	This patch adds basic debug info support with basic block sections. This patch uses ranges for debug information when a function contains basic block sections rather than using [lowpc, highpc]. This is also the first in a series of patches for debug info and does not contain the support for linker relaxation. That will be done as a follow up patch. Differential Revision: https://reviews.llvm.org/D78851	2020-07-01 23:53:00 -07:00
Pushpinder Singh	ddad2e078e	[AMDGPU] Control num waves per EU for implicit work-group size Summary: If amdgpu-flat-work-group-size is not specified in LLVM IR, the backend uses default value of 1024. For this, minimum waves per EU should be 4. However, backend is still setting minimum value to 1 instead of calculated value. This is not observed normally as frontend always provide amdgpu-flat-work-group-size attribute. Reviewers: rampitec, b-sumner, sameerds, msearles Reviewed By: rampitec Subscribers: qcolombet, arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81991	2020-07-01 22:53:52 -04:00
Biplob Mishra	30db684f8c	[PowerPC]Implement Vector Shift Double Bit Immediate Builtins Implement Vector Shift Double Bit Immediate Builtins in LLVM/Clang. * vec_sldb (); * vec_srdb (); Differential Revision: https://reviews.llvm.org/D82440	2020-07-01 20:34:53 -05:00
Valentin Clement	a027dcd60c	[flang][openmp] Use common Directive and Clause enum from llvm/Frontend Summary: This patch is removing the custom enumeration for OpenMP Directives and Clauses and replace them with the newly tablegen generated one from llvm/Frontend. This is a first patch and some will follow to share the same infrastructure where possible. The next patch should use the clauses allowance defined in the tablegen file. Reviewers: jdoerfert, DavidTruby, sscalpone, kiranchandramohan, ichoyjx Reviewed By: DavidTruby, ichoyjx Subscribers: jholewinski, cfe-commits, dblaikie, MaskRay, ymandel, ichoyjx, mgorny, yaxunl, guansong, jfb, sstefan1, aaron.ballman, llvm-commits Tags: #llvm, #flang, #clang Differential Revision: https://reviews.llvm.org/D82906	2020-07-01 20:58:11 -04:00
Xiang1 Zhang	2a61d4a241	[X86-64] Support Intel AMX instructions Summary: INTEL ADVANCED MATRIX EXTENSIONS (AMX). AMX is a new programming paradigm, it has a set of 2-dimensional registers (TILES) representing sub-arrays from a larger 2-dimensional memory image and operate on TILES. Spec can be found in Chapter 3 here https://software.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html Reviewers: LuoYuanke, annita.zhang, pengfei, RKSimon, xiangzhangllvm Reviewed By: xiangzhangllvm Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82705	2020-07-02 08:57:04 +08:00
Lei Huang	006e66f9bb	[PowerPC][NFC] Update doc for FeatureISA3_1/FeatureISA3_0 definitions	2020-07-01 19:36:19 -05:00
Anil Mahmud	e9e3ccc44e	[PowerPC] Exploit xxspltiw and xxspltidp instructions Exploits the VSX Vector Splat Immediate Word and VSX Vector Splat Immediate Double Precision instructions: xxspltiw XT,IMM32 xxspltidp XT,IMM32 Differential Revision: https://reviews.llvm.org/D82911	2020-07-01 19:18:29 -05:00
Roman Lebedev	3b3568202e	[NFCI] Actually provide correct check lines in sdiv.ll	2020-07-02 02:00:02 +03:00
Matt Arsenault	577c01f0b2	AMDGPU: Set more mov flags on V_ACCVGPR_{READ|WRITE}_B32 This fixes extra copies when materializing constants in AGPRs. This made it a lot harder to trigger the spilling in spill-agpr.ll	2020-07-01 18:58:59 -04:00
Matt Arsenault	6975946134	RegAllocGreedy: Use TargetInstrInfo already in the class	2020-07-01 18:58:59 -04:00
Matt Arsenault	2f0742a09c	AMDGPU: Fix missing tracksRegLiveness in tests I have no idea why this is considered optional, or why it's not the default. Also add uses of the copied registers for more useful liveness testing.	2020-07-01 18:58:59 -04:00
Stanislav Mekhanoshin	e91645ec9c	[AMDGPU] Limit promote alloca to vector with VGPR budget Allow only up to 1/4 of available VGPRs for the vectorization of any given alloca. Differential Revision: https://reviews.llvm.org/D82990	2020-07-01 15:57:24 -07:00

199454 Commits