llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 18:54:02 +01:00

Author	SHA1	Message	Date
Anna Welker	7da989b1b5	[ARM][MVE] Enable tail predication for loops containing MVE gather/scatters Widen the scope of memory operations that are allowed to be tail predicated to include gathers and scatters, such that loops that are auto-vectorized with the option -enable-arm-maskedgatscat (and actually end up containing an MVE gather or scatter) can be tail predicated. Differential Revision: https://reviews.llvm.org/D85138	2020-08-12 15:32:37 +01:00
Matt Arsenault	a7f874dd31	AMDGPU/GlobalISel: Select llvm.amdgcn.global.atomic.fadd Remove the intermediate transform in the DAG path. I believe this is the last non-deprecated intrinsic that needs handling.	2020-08-12 10:04:53 -04:00
Matt Arsenault	e479314184	AMDGPU: Handle intrinsics in performMemSDNodeCombine This avoids a possible regression in a future patch	2020-08-12 10:04:53 -04:00
Xing GUO	748a235862	[DWARFYAML] Make the address size of compilation units optional. This patch makes the 'AddrSize' field optional. If the address size is missing, yaml2obj will infer it from the object file. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85805	2020-08-12 21:47:32 +08:00
Xing GUO	d15e24388b	[MachOYAML] Simplify the section data emitting function. NFC. This patch helps simplify some code in the writeSectionData() function. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D85821	2020-08-12 21:46:43 +08:00
Sanjay Patel	00ebcf5eea	[VectorCombine] early exit if target has no vector registers Based on post-commit discussion in: D81766 Other vectorization passes (SLP and Loop) use this TTI API similarly.	2020-08-12 09:22:31 -04:00
Sanjay Patel	575e40e45d	[VectorCombine] add test for x86 target with SSE disabled; NFC	2020-08-12 09:22:31 -04:00
David Green	40e668f84c	[ARM] Add additional predicated VFMA tests. NFC	2020-08-12 14:20:20 +01:00
Sanjay Patel	8a2dab8cbf	[InstCombine] eliminate a pointer cast around insertelement I'm not sure if this solves PR46839 completely, but reducing the casting should help: https://bugs.llvm.org/show_bug.cgi?id=46839 Differential Revision: https://reviews.llvm.org/D85647	2020-08-12 09:08:17 -04:00
Sanjay Patel	fe63c251a7	[VectorCombine] add test for Hexagon that would crash; NFC This test verifies the code change from: rGb0b95dab1ce2 (although that would not be true if PR47128 is fixed)	2020-08-12 08:38:20 -04:00
Kai Nacke	c6e90fc4cb	[SystemZ/ZOS] Implement computeHostNumPhysicalCores On z/OS, the information is stored in the Common System Data Area (CSD). It is the number of CPs allocated to the current LPAR. Reviewers: aganea, hubert.reinterpertcast, MaskRay Reviewed By: hubert.reinterpertcast Differential Revision: https://reviews.llvm.org/D85531	2020-08-12 08:31:33 -04:00
Sam Parker	0906f80f16	[LoopUnroll] Adjust CostKind query When TTI was updated to use an explicit cost, TCK_CodeSize was used although the default implicit cost would have been the hand-wavey cost of size and latency. So, revert back to this behaviour. This is not expected to have (much) impact on targets since most (all?) of them return the same value for SizeAndLatency and CodeSize. When optimising for size, the logic has been changed to query CodeSize costs instead of SizeAndLatency. This patch also adds a testing option in the unroller so that OptSize thresholds can be specified. Differential Revision: https://reviews.llvm.org/D85723	2020-08-12 12:56:09 +01:00
David Green	f1a40c93c6	[ARM] Commutative vmin/maxnma tests. NFC	2020-08-12 12:50:18 +01:00
Simon Pilgrim	4c91fdf3fa	[X86][SSE] Fold HOP(SHUFFLE(X),SHUFFLE(Y)) --> SHUFFLE(HOP(X,Y)) This is beginning to look like a canonicalization stage that could be performed as part of shuffle combining Another step towards PR41813	2020-08-12 12:16:36 +01:00
Georgii Rymar	88b9fc954b	[llvm-readobj] - Refine logic of the symbol table locating in printRelocationsHelper(). This removes the last `unwrapOrError` call from the `printRelocationsHelper`. There is a little additional complexity because of `SHT_RELR/SHT_ANDROID_RELR` sections. Such sections contain only relative relocations and do not have a symbol table associated with them, hence we should not try to treat their `sh_link` field as a reference to a symbol table. Differential revision: https://reviews.llvm.org/D85430	2020-08-12 14:03:56 +03:00
Simon Pilgrim	33f8351d13	[X86][AVX] Fold CONCAT(HOP(X,Y),HOP(Z,W)) -> HOP(CONCAT(X,Z),CONCAT(Y,W)) for float types Only do this for AVX2+ targets as we still get some regressions on AVX1 without PERMPD/PERMQ	2020-08-12 11:31:05 +01:00
Cullen Rhodes	72fa7ea564	[Transforms][SROA] Skip uses of allocas where the type is scalable When visiting load and store instructions in SROA skip scalable vectors. This is relevant in the implementation of the 'arm_sve_vector_bits' attribute that is used to define VLS types, where an alloca of a fixed-length vector could be bitcasted to scalable. See D85128 for more information. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D85725	2020-08-12 09:35:48 +00:00
Florian Hahn	371c1e57df	[SCEV] Look through single value PHIs. Now that SCEVExpander can preserve LCSSA form, we do not have to worry about LCSSA form when trying to look through PHIs. SCEVExpander will take care of inserting LCSSA PHI nodes as required. This increases precision of the analysis in some cases. Reviewed By: mkazantsev, bmahjour Differential Revision: https://reviews.llvm.org/D71539	2020-08-12 10:03:42 +01:00
Igor Kudrin	cde37ff605	[llvm-dwarfdump] Avoid crashing if an abbreviation offset is invalid. Note that DWARFUnit::getAbbreviations() returns nullptr if the abbreviations could not be read, but callers used the returned pointer without checking. Differential Revision: https://reviews.llvm.org/D85738	2020-08-12 16:01:53 +07:00
Sjoerd Meijer	25b37a74a0	[ARM][MVE] tail-predication: overflow checks for backedge taken count. This picks up the work on the overflow checks for get.active.lane.mask, which ensure that it is safe to insert the VCTP intrinsic that enables tail-predication. For a 2d auto-correlation kernel and its inner loop j: M = Size - i; for (j = 0; j < M; j++) Sum += Input[j] * Input[j+i]; For this inner loop, the SCEV backedge taken count (BTC) expression is: {(-1 + (sext i16 %Size to i32)),+,-1}<nw><%for.body> and LoopUtil cannotBeMaxInLoop couldn't calculate a bound on this, thus "BTC cannot be max" could not be determined. So overflow behaviour had to be assumed in the loop tripcount expression that uses the BTC. As a result tail-predication had to be forced (with an option) for this case. This change solves that by using ScalarEvolution's helper getConstantMaxBackedgeTakenCount, which is able to determine the range of BTC and thus can determine that it is safe, so that we no longer need to force tail-predication, as reflected in the changed test cases. Differential Revision: https://reviews.llvm.org/D85737	2020-08-12 09:32:26 +01:00
David Sherwood	96c98d5a46	[SVE][CodeGen] Fix issues with EXTRACT_SUBVECTOR when using scalable FP vectors In this patch I have fixed two issues: 1. Our SVE tuple get/set intrinsics were using the wrong constant type for the index passed to EXTRACT_SUBVECTOR. I have fixed this by using the function SelectionDAG::getVectorIdxConstant to create the value. Also, I have updated the documentation for EXTRACT_SUBVECTOR describing what type the constant index should be and we now enforce this when creating the node. 2. The AArch64 backend was missing the appropriate patterns for extracting certain subvectors (nxv4f16 and nxv2f32) from legal SVE types. I have added them as part of this patch. The only way that I could find to test the new patterns was to use the SVE tuple get intrinsics, although I realise it looks a bit unusual. Tests added here: test/CodeGen/AArch64/sve-extract-subvector.ll Differential Revision: https://reviews.llvm.org/D85516	2020-08-12 08:35:46 +01:00
Kazushi (Jam) Marukawa	bb4ad38d95	[VE] Change to promote i32 AND/OR/XOR operations VE has only 64-bit AND/OR/XOR instructions. We pretended that VE also has 32-bit instructions, but doing so increases the number of generated instructions. Therefore, we decided to promote 32-bit operations and use only 64-bit instructions in the back end. We also avoid pretending that VE has a 32-bit LEA instruction. Update regression tests also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D85726	2020-08-12 16:23:50 +09:00
Kiran Chandramohan	f33b4004b5	[MLIR,OpenMP] Lowering of parallel operation: proc_bind clause 2/n This patch adds the translation of the proc_bind clause in a parallel operation. The values that can be specified for the proc_bind clause are specified in the OMP.td tablegen file in the llvm/Frontend/OpenMP directory. From this single source of truth enumeration for proc_bind is generated in llvm and mlir (used in specification of the parallel Operation in the OpenMP dialect). A function to return the enum value from the string representation is also generated. A new header file (DirectiveEmitter.h) containing definitions of classes directive, clause, clauseval etc is created so that it can be used in mlir as well. Reviewers: clementval, jdoerfert, DavidTruby Differential Revision: https://reviews.llvm.org/D84347	2020-08-12 08:03:13 +01:00
Craig Topper	acc3d72e97	[X86][GlobalISel] Replace a misuse of SUBREG_TO_REG with INSERT_SUBREG. SUBREG_TO_REG is supposed to be used when we know the producing instruction already zeroed the bits we're extending. But that's not the case here. So INSERT_SUBREG with an IMPLICIT_DEF is the correct thing to use.	2020-08-11 23:51:02 -07:00
Kyungwoo Lee	2beab8ee7a	[NFC] Factor out hasForceAttributes This is a preparation for https://reviews.llvm.org/D85586. Differential Revision: https://reviews.llvm.org/D85793	2020-08-12 02:16:57 -04:00
Johannes Doerfert	8755dd12ec	[Attributor][NFC] Reformat tests after D85099 Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D85700	2020-08-12 01:04:19 -05:00
Johannes Doerfert	bd4a02e66e	[UpdateTestChecks] Match unnamed values like "@[0-9]+" and "![0-9]+" With this patch we will match most uses of "temporary" named things in the IR via regular expressions, not their name at creation time. The new "values" we match are: - "unnamed" globals: `@[0-9]+` - debug metadata: `!dbg ![0-9]+` - loop metadata: `!loop ![0-9]+` - tbaa metadata: `!tbaa ![0-9]+` - range metadata: `!range ![0-9]+` - generic metadata: `metadata ![0-9]+` - attributes groups: `#[0-9]` We still don't match the declarations but that can be done later. This patch can introduce churn when existing check lines contain the old hardcoded versions of the above "values". We can add a flag to opt-out, or opt-in, if necessary. Reviewed By: arichardson, MaskRay Differential Revision: https://reviews.llvm.org/D85099	2020-08-12 01:04:16 -05:00
Petr Hosek	6abbc429da	[CMake] Simplify CMake handling for zlib Rather than handling zlib manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-11 20:22:11 -07:00
Jordan Rupprecht	7059d43168	[NFC] Inline variable only used in debug builds	2020-08-11 19:38:01 -07:00
Sanjay Patel	b6c5255819	[VectorCombine] add safety check for 0-width register Based on post-commit discussion in D81766, Hexagon sets this to "0". I'll see if I can come up with a test, but making the obvious code fix first to unblock that target.	2020-08-11 20:30:02 -04:00
Thomas Lively	77de82e267	[WebAssembly][AsmParser] Name missing features in error message Rather than just saying that some feature is missing, report the exact features to make the error message more useful and actionable. Differential Revision: https://reviews.llvm.org/D85795	2020-08-11 17:26:14 -07:00
Vedant Kumar	60a6ae1b59	Revert "[Instruction] Add updateLocationAfterHoist helper" This reverts commit 4a646ca9e2caf70d6312714770f516fb83b7e3cb. This is causing some bots to fail with "!dbg attachment points at wrong subprogram for function", like: http://lab.llvm.org:8011/builders/sanitizer-windows/builds/67958/steps/stage%201%20check/logs/stdio	2020-08-11 14:54:09 -07:00
Amy Huang	ae7dfdd3ba	[globalopt] Change so that emitting fragments doesn't use the type size of DIVariables When turning on -debug-info-kind=constructor we ran into a "fragment covers entire variable" error during thinlto. The fragment is currently always emitted if there is no type size, but sometimes the variable has a forward declared struct type which doesn't have a size. This changes the code to get the type size from the GlobalVariable instead. Differential Revision: https://reviews.llvm.org/D85572	2020-08-11 14:50:56 -07:00
Kazu Hirata	88e8651c48	[Instcombine] Fix uses of undef (PR46940) Without this patch, we attempt to distribute And over Xor even in unsafe circumstances like so: undef & (true ^ true) ==> (undef & true) ^ (undef & true) and evaluate it to undef instead of false. Note that "true ^ true" may show up implicitly with one true being part of a PHI node. This patch fixes the problem by teaching SimplifyUsingDistributiveLaws to not use undef as part of simplifications. Reviewers: spatel, aqjune, nikic, lebedev.ri, fhahn, jdoerfert Differential Revision: https://reviews.llvm.org/D85687	2020-08-11 14:13:32 -07:00
Vedant Kumar	62d3804379	[Instruction] Add updateLocationAfterHoist helper Introduce a helper on Instruction which can be used to update the debug location after hoisting. Use this in GVN and LICM, where we were mistakenly introducing new line 0 locations after hoisting (the docs recommend dropping the location in this case). For more context, see the discussion in https://reviews.llvm.org/D60913. Differential Revision: https://reviews.llvm.org/D85670	2020-08-11 14:05:20 -07:00
Jian Cai	6985687166	[AARCH64] [MC] add memtag as an alias of mte architecture extension Add memtag as an alias of the mte architecture extension to be consistent with GNU as. LINK:https://sourceware.org/bugzilla/show_bug.cgi?id=26339 Reviewed By: nickdesaulniers, MaskRay Differential Revision: https://reviews.llvm.org/D85620	2020-08-11 13:28:47 -07:00
Nikita Popov	587bdc1d95	[InstSimplify] Respect CanUseUndef in more places Similar to what we do in IIQ, add an isUndefValue() helper that checks for undef values while respecting CanUseUndef. This makes it much easier to search for places that don't respect the flag yet.	2020-08-11 21:53:33 +02:00
Roman Lebedev	415270824a	[NFC][InstCombine] Add tests for PHI merging/aggregate reconstruction (PR47060) We should be able to see that the new aggregate we have produced is identical to the source aggregate from which we've extracted the elements that we used to form a new aggregate. This happens (a lot) in clang C++ exception code on unwind branch.	2020-08-11 22:40:29 +03:00
Arthur Eubanks	38b62da11d	[gn build] Fix CodeGenTests Missing dependency required by https://reviews.llvm.org/D67687. Fixes lld-link: error: undefined symbol: public: __cdecl llvm::PipelineTuningOptions::PipelineTuningOptions(void) >>> referenced by C:\src\llvm-project\llvm\unittests\CodeGen\PassManagerTest.cpp:215 >>> obj/llvm/unittests/CodeGen/CodeGenTests.PassManagerTest.obj:(private: virtual void __cdecl `anonymous namespace'::PassManagerTest_Basic_Test::TestBody(void)) Differential Revision: https://reviews.llvm.org/D85682	2020-08-11 12:31:42 -07:00
Thomas Lively	3f29056047	[WebAssembly][NFC] Replace WASM with standard Wasm The officially specified abbreviation for WebAssembly is Wasm and the spec explicitly calls out WASM as being an incorrect spelling. This patch fixes a few comments and error messages to use the spec-compliant abbreviation. Differential Revision: https://reviews.llvm.org/D85764	2020-08-11 12:27:59 -07:00
diggerlin	b434098ac8	[AIX][XCOFF] change the operand of branch instruction from symbol name to qualified symbol name for function declarations SUMMARY: 1. In this patch, remove setting the storage class in the function getXCOFFSection and in the constructor of class MCSectionXCOFF. The MCSectionXCOFF class has the attributes XCOFF::StorageMappingClass MappingClass; XCOFF::SymbolType Type; XCOFF::StorageClass StorageClass; and these attributes are only used in the XCOFFObjectWriter (the asm path does not need the StorageClass). We need to get the values of StorageClass, Type, and MappingClass every time before we invoke getXCOFFSection. Actually, we can get the StorageClass of the MCSectionXCOFF from its delegated symbol. 2. We also change the operand of the branch instruction from symbol name to qualified symbol name. For example, change bl .foo extern .foo to bl .foo[PR] extern .foo[PR] 3. If there is a reference for an indirect call to a function bar, we also add extern .bar[PR] Reviewers: Jason liu, Xiangling Liao Differential Revision: https://reviews.llvm.org/D84765	2020-08-11 15:26:19 -04:00
Thomas Lively	8ddb9f38f8	Reland "[SLPVectorizer] Pre-commit a test for D85759" This reverts commit 52b71aa8b1a019403b0ecc184744b2f8ca2f7cba. The problem was a missing lit.local.cfg file, which was causing the test to be incorrectly run on bots that had not built the WebAssembly target.	2020-08-11 12:18:33 -07:00
Thomas Lively	1fbdc03295	Revert "[SLPVectorizer] Pre-commit a test for D85759" This reverts commit 94791970de109eb9a6b296825ddb0fc2a196b366. The test is failing on multiple bots, even though it passes for me locally. Reverting while I investigate further.	2020-08-11 12:11:24 -07:00
Yuanfang Chen	a7a84877c6	NFC. Constify MachineVerifier::verify parameter	2020-08-11 11:59:45 -07:00
Dávid Bolvanský	ee8d84179f	[BPI] Teach BPI about bcmp function bcmp is similar to memcmp	2020-08-11 20:44:53 +02:00
Jessica Paquette	9e2fede73a	Update AMDGPU testcases after bebe6a6449811e877f7eba3f1798ddd1fa83e440 I didn't build AMDGPU locally so I didn't see this. ``` (logic_op (op x...), (op y...)) -> (op (logic_op x, y)) ``` kicks in here. Differential Revision: https://reviews.llvm.org/D85761	2020-08-11 11:32:36 -07:00
Thomas Lively	5b54f97dfb	[SLPVectorizer] Pre-commit a test for D85759 8cc911fa5b06 refactored the `getIntrinsicInstrCost` function and was meant to be a nonfunctional change, but it accidentally changed how costs were calculated in the SLP vectorizer, which regressed WebAssembly codegen and resulted in a downstream bug report at https://github.com/emscripten-core/emscripten/issues/11449. The fix for this regression is in D85759, and this patch just pre-commits the test from that patch to demonstrate the regressed behavior first.	2020-08-11 11:30:09 -07:00
Jessica Paquette	8e321c8c5a	[GlobalISel] Combine (logic_op (op x...), (op y...)) -> (op (logic_op x, y)) This implements ``` (logic_op (op x...), (op y...)) -> (op (logic_op x, y)) ``` when `op` is an extend, a shift, or an and. This is similar to `DAGCombiner::hoistLogicOpWithSameOpcodeHands` (with a bunch of missing cases, e.g. G_TRUNC, G_BITCAST, etc.) This is implemented so it works both pre and post-legalization. This also adds a general way to add a series of instructions in a combine. (`applyBuildInstructionSteps`). Differential Revision: https://reviews.llvm.org/D85050	2020-08-11 10:40:06 -07:00
Jordan Rupprecht	97e61228e0	[NFC][llvm-objcopy] Fix redundant config checks. The check for unsupported options checks some configs twice in a \|\| expression. Fixes PR47107.	2020-08-11 10:37:19 -07:00
jasonliu	5a22f2e49a	[XCOFF][llvm-readobj] Move XCOFF test to XCOFF directory Summary: COFF and XCOFF in llvm are very different and serve different platforms. Since we have different Dumper.cpp files in llvm-readobj's implementation, we should have separate testing directories for them too. Reviewed By: jhenderson, DiggerLin Differential Revision: https://reviews.llvm.org/D85675	2020-08-11 17:31:59 +00:00
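
Note on 88e8651c48 ([Instcombine] Fix uses of undef, PR46940): the unsafe distribution described in that message can be checked by brute force. The sketch below is not LLVM code. It models undef in Python under the assumption that each use of undef may independently take either boolean value; the names original, distributed, orig_results and dist_results are hypothetical and exist only for this illustration.

```python
from itertools import product

# Assumption: every *use* of undef may independently be False or True.
UNDEF_CHOICES = (False, True)


def original(u):
    """undef & (true ^ true): a single use of undef."""
    return u & (True ^ True)


def distributed(u1, u2):
    """(undef & true) ^ (undef & true): two independent uses of undef."""
    return (u1 & True) ^ (u2 & True)


# Collect every result each form can produce.
orig_results = {original(u) for u in UNDEF_CHOICES}
dist_results = {distributed(a, b) for a, b in product(UNDEF_CHOICES, repeat=2)}

print("original form can yield:   ", sorted(orig_results))
print("distributed form can yield:", sorted(dist_results))
```

Under this model the original expression can only be false, while the distributed form can also be true when the two uses of undef take different values. That is why distributing And over Xor is unsafe when an operand is undef.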

201817 Commits