llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Sanjay Patel	a486f56de3	[SLP] reduce code duplication while matching reductions; NFC	2021-01-12 16:03:57 -05:00
Philip Reames	42d8098f91	[LV] Weaken spuriously strong assert in LoopVersioning LoopVectorize uses some utilities on LoopVersioning, but doesn't actually use it for, you know, versioning. As a result, the precondition LoopVersioning expects is too strong for this user. At the moment, LoopVectorize supports any loop with a unique exit block, so check the same precondition here. Really, the whole class structure here is a mess. We should separate the actual versioning from the metadata updates, but that's a bigger problem.	2021-01-12 12:57:13 -08:00
Nikita Popov	302c879f6c	[InstCombine] Duplicate tests for logical and/or (NFC) This replicates existing and/or tests to also test variants using select. This should help us get a more accurate view on which optimizations we're missing if we disable the select -> and/or fold.	2021-01-12 21:50:41 +01:00
Philip Reames	e62e2effb0	[LV] Relax assumption that LCSSA implies single entry This relates to the ongoing effort to support vectorization of multiple exit loops (see D93317). The previous code assumed that LCSSA phis were always single entry before the vectorizer ran. This was correct, but only because the vectorizer allowed only a single exiting edge. There's nothing in the definition of LCSSA which requires single entry phis. A common case where this comes up is with a loop with multiple exiting blocks which all reach a common exit block. (e.g. see the test updates) Differential Revision: https://reviews.llvm.org/D93725	2021-01-12 12:34:52 -08:00
Nikita Popov	b14fd31fef	[InstCombine] Regenerate test checks (NFC)	2021-01-12 21:26:42 +01:00
Florian Hahn	89fc038707	[FunctionAttrs] Derive willreturn for fns with `readonly` & `mustprogress`. Similar to D94125, derive `willreturn` for functions that are `readonly` and `mustprogress` in FunctionAttrs. To quote the reasoning from D94125: Since D86233 we have `mustprogress` which, in combination with `readonly`, implies `willreturn`. The idea is that every side-effect has to be modeled as a "write". Consequently, `readonly` means there is no side-effect, and `mustprogress` guarantees that we cannot "loop" forever without side-effect. Reviewed By: jdoerfert, nikic Differential Revision: https://reviews.llvm.org/D94502	2021-01-12 20:02:34 +00:00
David Truby	7a27e1780d	[clang][aarch64] Precondition isHomogeneousAggregate on isCXX14Aggregate MSVC on WoA64 includes isCXX14Aggregate in its definition. This is de-facto specification on that platform, so match msvc's behaviour. Fixes: https://bugs.llvm.org/show_bug.cgi?id=47611 Co-authored-by: Peter Waller <peter.waller@arm.com> Differential Revision: https://reviews.llvm.org/D92751	2021-01-12 19:44:01 +00:00
Nikita Popov	2f31020aa4	[InstSimplify] Don't fold gep p, -p to null This is a partial fix for https://bugs.llvm.org/show_bug.cgi?id=44403. Folding gep p, q-p to q is only legal if p and q have the same provenance. This fold should probably be guarded by something like getUnderlyingObject(p) == getUnderlyingObject(q). This patch is a partial fix that removes the special handling for gep p, 0-p, which will fold to a null pointer, which would certainly not pass an underlying object check (unless p is also null, in which case this would fold trivially anyway). Folding to a null pointer is particularly problematic due to the special handling it receives in many places, making end-to-end miscompiles more likely. Differential Revision: https://reviews.llvm.org/D93820	2021-01-12 20:24:23 +01:00
Florian Hahn	60e7732238	[FunctionAttrs] Precommit tests for willreturn inference. Tests for D94502.	2021-01-12 19:16:50 +00:00
Craig Topper	fc77e995ec	[RISCV] Use vmerge.vim for llvm.riscv.vfmerge with a 0.0 scalar operand. We can use a 0 immediate to avoid needing to materialize 0 into an FPR first. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D94459	2021-01-12 11:08:26 -08:00
Arthur Eubanks	d28c8be642	[NewPM] Run non-trivial loop unswitching under -O2/3/s/z Fixes https://bugs.llvm.org/show_bug.cgi?id=48715. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D94448	2021-01-12 11:04:40 -08:00
Craig Topper	6008e3ca8a	[LegalizeDAG][RISCV][PowerPC][AMDGPU][WebAssembly] Improve expansion of SETONE/SETUEQ on targets without SETO/SETUO. If SETO/SETUO aren't legal, they'll be expanded and we'll end up with 3 comparisons. SETONE is equivalent to (SETOGT || SETOLT) so if one of those operations is supported use that expansion. We don't need both since we can commute the operands to make the other. SETUEQ can be implemented with !(SETOGT || SETOLT) or (SETULE && SETUGE). I've only implemented the first because it didn't look like most of the affected targets had legal SETULE/SETUGE. Reviewed By: frasercrmck, tlively, nemanjai Differential Revision: https://reviews.llvm.org/D94450	2021-01-12 10:45:03 -08:00
Dávid Bolvanský	58595a7835	[instCombine] Add (A ^ B) | ~(A | B) -> ~(A & B) define i32 @src(i32 %x, i32 %y) { %0: %xor = xor i32 %y, %x %or = or i32 %y, %x %neg = xor i32 %or, 4294967295 %or1 = or i32 %xor, %neg ret i32 %or1 } => define i32 @tgt(i32 %x, i32 %y) { %0: %and = and i32 %x, %y %neg = xor i32 %and, 4294967295 ret i32 %neg } Transformation seems to be correct! https://alive2.llvm.org/ce/z/Cvca4a	2021-01-12 19:29:17 +01:00
Dávid Bolvanský	f16a8fbbac	[Tests] Add tests for new InstCombine OR transformation, NFC	2021-01-12 19:29:17 +01:00
Michał Górny	69e423efd5	[llvm] [cmake] Remove obsolete /usr/local hack for *BSD Remove the hack adding /usr/local paths on FreeBSD and DragonFlyBSD. It does not seem to be necessary today, and it breaks cross builds. Differential Revision: https://reviews.llvm.org/D94491	2021-01-12 19:26:04 +01:00
Cullen Rhodes	b30ad48824	[SVE][NFC] Regenerate a few CodeGen tests Regenerated using llvm/utils/update_llc_test_checks.py as part of D94504, committing separately to reduce the diff for D94504.	2021-01-12 18:10:36 +00:00
Simon Pilgrim	ce76c6de45	[AMDGPU] Regenerate umax crash test	2021-01-12 18:02:15 +00:00
Simon Pilgrim	873ac18cc1	[X86] Regenerate sdiv_fix_sat.ll + udiv_fix_sat.ll tests Adding missing libcall PLT qualifiers	2021-01-12 17:25:30 +00:00
Jinsong Ji	1fa0f655fd	[PowerPC][NFCI] PassSubtarget to ASMWriter Subtarget feature bits are needed to change instprinter's behavior based on feature bits. Most of the other popular targets were updated back in 2015, in https://reviews.llvm.org/rGb46d0234a6969 we should update it too. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D94449	2021-01-12 16:25:35 +00:00
Nemanja Ivanovic	dac23cf4a3	[PowerPC] Add support for embedded devices with EFPU2 PowerPC cores like e200z759n3 [1] using an efpu2 only support single precision hardware floating point instructions. The single precision instructions efs* and evfs* are identical to the spe float instructions while efd* and evfd* instructions trigger a not implemented exception. This patch introduces a new command line option -mefpu2 which leads to single-hardware / double-software code generation. [1] Core reference: https://www.nxp.com/files-static/32bit/doc/ref_manual/e200z759CRM.pdf Differential revision: https://reviews.llvm.org/D92935	2021-01-12 09:47:00 -06:00
Bjorn Pettersson	ffd34706f2	[SLP] Add test case showing a bug when dealing with padded types We shouldn't vectorize stores of non-packed types (i.e. types that has padding between consecutive variables in a scalar layout, but being packed in a vector layout). The problem was detected as a miscompile in a downstream test case. This is a pre-commit of a test case for the fix in D94446.	2021-01-12 16:35:33 +01:00
Kazushi (Jam) Marukawa	c563794bd1	[VE] Update VELIntrinsic tests Update comment and style of regression tests for VELIntrinsic Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D94490	2021-01-13 00:12:50 +09:00
Bevin Hansson	0c7bd7a98b	[X86] Improved lowering for saturating float to int. Adapted from D54696 by @nikic. This patch improves lowering of saturating float to int conversions, FP_TO_[SU]INT_SAT, for X86. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D86079	2021-01-12 15:44:41 +01:00
Valentin Clement	1f72844930	[mlir][openacc] Use TableGen information for default enum Use TableGen and information in ACC.td for the Default enum in the OpenACC dialect. This patch generalize what was done for OpenMP for directives. Follow up patch after D93576 Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D93710	2021-01-12 09:42:42 -05:00
Paul C. Anagnostopoulos	e29674d7e7	[TableGen] Improve error message for semicolon after braced body. Add a test for this message. Differential Revision: https://reviews.llvm.org/D94412	2021-01-12 09:38:05 -05:00
Simon Pilgrim	5d734c97c3	[X86][SSE] getFauxShuffleMask - handle PACKSS(SRAI(),SRAI()) shuffle patterns. We can't easily treat ASHR a faux shuffle, but if it was just feeding a PACKSS then it was likely being used as sign-extension for a truncation, so just peek through and adjust the mask accordingly.	2021-01-12 14:07:53 +00:00
Simon Pilgrim	a5ff4dc7cb	[X86][SSE] combineSubToSubus - add v16i32 handling on pre-AVX512BW targets. v16i32 -> v16i16/v8i16 truncation is now good enough using PACKSS/PACKUS + shuffle combining that its no longer necessary to early-out on pre-AVX512BW targets. This was noticed while looking at completing PR40111 and moving combineSubToSubus to DAGCombine entirely.	2021-01-12 13:44:11 +00:00
Bevin Hansson	db993cf4ce	[Fixed Point] Add codegen for conversion between fixed-point and floating point. The patch adds the required methods to FixedPointBuilder for converting between fixed-point and floating point, and uses them from Clang. This depends on D54749. Reviewed By: leonardchan Differential Revision: https://reviews.llvm.org/D86632	2021-01-12 13:53:01 +01:00
Simon Pilgrim	8b69d643b8	[X86][SSE] combineSubToSubus - remove SSE2 early-out. SSE2 truncation codegen has improved over the past few years (mainly due to better shuffle lowering/combining and computeKnownBits) - its no longer necessary to early-out from v8i32/v8i64 truncations. This was noticed while looking at completing PR40111 and moving combineSubToSubus to DAGCombine entirely.	2021-01-12 12:52:11 +00:00
Fraser Cormack	3cf82c317a	[RISCV] Improve scalable-vector shift tests (NFC) All i8/i16 and several i32 tests were testing immediate shift amounts which exceeded the bits in the vector elements, creating poison values. Amend the tests to test well-behaved shift amounts.	2021-01-12 11:40:21 +00:00
Christian Sigg	b72cc5450e	Change the LLVM_ATTRIBUTE_DEPRECATED macro to use C++14 attribute. C++14 attributes are superior because they can be applied to functions with inline definition and the syntax is cleaner. I intend to convert all uses and then remove the macro. One issue that might hold back switching uses to C++14 attributes is that clang-format does not put long attributes on separate lines and formatted code will look like: ``` template <typename T> [[deprecated("blah blah")]] void foooooooooooooooooooooooooooo() { ... } ``` Putting long attributes on a separate line would be prettier. See https://stackoverflow.com/questions/45740466/clang-format-setting-to-control-c-attributes AttributeMacros probably won't help because it can't match the custom message. https://clang.llvm.org/docs/ClangFormatStyleOptions.html Reviewed By: rriddle, MaskRay Differential Revision: https://reviews.llvm.org/D94219	2021-01-12 12:41:00 +01:00
Nico Weber	87b210e9ac	Revert "[Test] Add failing test for PR48725" This reverts commit e8287cb2b2923af9da72fd953e2ec5495c33861a. Test unexpectedly passes on mac, see comment 2 on PR48725.	2021-01-12 06:30:32 -05:00
Georgii Rymar	65aa867260	[obj2yaml] - Don't crash when an object has an empty symbol table. Currently we crash when we have an object with SHT_SYMTAB/SHT_DYNSYM sections of size 0. With this patch instead of the crash we start to dump them properly. Differential revision: https://reviews.llvm.org/D93697	2021-01-12 14:08:59 +03:00
Georgii Rymar	10f626b307	[obj2yaml,yaml2obj] - Fix issues with creating/dumping group sections. We have the following issues related to group sections: 1) yaml2obj is unable to set the custom `sh_entsize` value, because the `EntSize` key is currently ignored. 2) obj2yaml is unable to dump the group section which `sh_entsize != 4`. 3) obj2yaml always dumps the "EntSize" for group sections, though usually we are trying to omit dumping default values when dumping keys. I.e. we should not print the "EntSize" key when `sh_entsize` == 4. This patch fixes (1),(3) and adds the test case to document the behavior of (2). Differential revision: https://reviews.llvm.org/D93854	2021-01-12 14:07:42 +03:00
Jay Foad	9b4838079a	[AMDGPU][GlobalISel] Remove some duplicate RUN lines Differential Revision: https://reviews.llvm.org/D86618	2021-01-12 11:02:16 +00:00
Jay Foad	f1bab75f10	[SlotIndexes] Fix and simplify basic block splitting Remove the InsertionPoint argument from SlotIndexes::insertMBBInMaps because it was confusing: what does it mean to insert a new block between two instructions, in the middle of an existing block? Instead, support the case that MachineBasicBlock::splitAt really needs, where the new block contains some instructions that are already in the maps because they have been moved there from the tail of the previous block. In all other use cases the new block is empty. Based on work by Carl Ritson! Differential Revision: https://reviews.llvm.org/D94311	2021-01-12 10:50:14 +00:00
Georgii Rymar	09dcbafb1d	[llvm-readobj] - One more attempt to fix BB. Add `this->` for `W`, which is the member of `ObjDumper` An example of error: readobj/ELFDumper.cpp:738:13: error: use of undeclared identifier 'W' assert(&W.getOStream() == &llvm::fouts());	2021-01-12 13:17:59 +03:00
Georgii Rymar	b88a437339	[llvm-readobj] - An attempt to fix BB. This adds the `template` keyword for 'getAsArrayRef' calls. An example of error: /b/1/openmp-gcc-x86_64-linux-debian/llvm.src/llvm/tools/llvm-readobj/ELFDumper.cpp:4491:50: error: use 'template' keyword to treat 'getAsArrayRef' as a dependent template name for (const Elf_Rel &Rel : this->DynRelRegion.getAsArrayRef<Elf_Rel>())	2021-01-12 13:09:49 +03:00
Georgii Rymar	861dc1490e	[llvm-readobj] - Add 'override' to fix build bots. This should fix bots after landing D93900. An example of error is: /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/llvm/tools/llvm-readobj/ELFDumper.cpp:883:8: warning: 'printSectionMapping' overrides a member function but is not marked 'override' [-Winconsistent-missing-override] void printSectionMapping() {}	2021-01-12 13:01:15 +03:00
Georgii Rymar	519478e0e8	[llvm-readef/obj] - Change the design structure of ELF dumper. NFCI. This is a refactoring for design of stuff in `ELFDumper.cpp`. The current design of ELF dumper is far from ideal. Currently most overridden functions (inherited from `ObjDumper`) in `ELFDumper` just forward to the functions of `ELFDumperStyle` (which can be either `GNUStyle` or `LLVMStyle`). A concrete implementation may be in any of `ELFDumper`/`DumperStyle`/`GNUStyle`/`LLVMStyle`. This patch reorganizes the classes by introducing `GNUStyleELFDumper`/`LLVMStyleELFDumper` which inherit from `ELFDumper`. The implementations are moved: `DumperStyle` -> `ELFDumper` `GNUStyle` -> `GNUStyleELFDumper` `LLVMStyle` -> `LLVMStyleELFDumper` With that we can avoid having a lot of redirection calls and helper methods. The number of code lines changes from 7142 to 6922 (reduced by ~3%) and the code overall looks cleaner. Differential revision: https://reviews.llvm.org/D93900	2021-01-12 12:36:17 +03:00
Heejin Ahn	98877ee977	[WebAssembly] Remove more unnecessary brs in CFGStackify After placing markers, we removed some unnecessary branches, but it only handled the simplest case. This makes more unnecessary branches to be removed. Reviewed By: dschuff, tlively Differential Revision: https://reviews.llvm.org/D94047	2021-01-12 01:18:10 -08:00
Max Kazantsev	80a0b769c3	[Test] Add failing test for PR48725	2021-01-12 16:06:34 +07:00
Sebastian Neubauer	94e83a8359	[AMDGPU] Fix failing assert with scratch ST mode In ST mode, flat scratch instructions have neither an sgpr nor a vgpr for the address. This led to an assertion when inserting hard clauses. Differential Revision: https://reviews.llvm.org/D94406	2021-01-12 09:54:02 +01:00
Sander de Smalen	f46619d424	[LiveDebugValues] Fix comparison operator in VarLocBasedImpl The issue was introduced in commit rG84a1120943a651184bae507fed5d648fee381ae4 and would cause a VarLoc's StackOffset to be compared with its own, instead of the StackOffset from the other VarLoc. This patch fixes that.	2021-01-12 08:44:58 +00:00
Heejin Ahn	9c958940cd	[WebAssembly] Misc. refactoring in CFGStackify (NFC) Updating `ScopeTops` is something we frequently do in CFGStackify, so this factors it out as a function. This also makes a few utility functions templated so that they are not dependent on input vector types and simplifies function parameters. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D94046	2021-01-12 00:36:27 -08:00
Craig Topper	741910103d	[DAGCombiner] Replace static helper function isConstantFPBuildVectorOrConstantFP with the identical version in SelectionDAG. NFC	2021-01-11 23:41:40 -08:00
Petr Hosek	e0d03e7b34	[CMake] Split the target side of runtimes build Previously, llvm/runtimes/CMakeLists.txt played two different roles: 1. host side which could be used to set up the build of runtimes for different targets in the right order; 2. target side to build the runtimes for the specified target. This change splits llvm/runtimes/CMakeLists.txt and moves the target side to runtimes/CMakeLists laying down the foundation for the "A vision for building the runtimes" proposal. From the user perspective, there shouldn't be any visible difference at the moment. Differential Revision: https://reviews.llvm.org/D93408	2021-01-11 23:39:36 -08:00
Craig Topper	f1b751d17b	[SelectionDAG] Make isConstantIntBuildVectorOrConstantInt and isConstantFPBuildVectorOrConstantFP methods const.	2021-01-11 23:26:53 -08:00
Craig Topper	aecf5b1559	[CodeGen] Try to make the print of memory operand alignment a little more user friendly. Memory operands store a base alignment that does not factor in the effect of the offset on the alignment. Previously the printing code only printed the base alignment if it was different than the size. If there is an offset, the reader would need to figure out the effective alignment themselves. This has confused me before and someone else was recently confused on IRC. This patch prints the possibly offset adjusted alignment if it is different than the size. And prints the base alignment if it is different than the alignment. The MIR parser has been updated to read basealign in addition to align. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D94344	2021-01-11 19:58:47 -08:00
Kazu Hirata	0452f12eb6	[llvm] Simplify string comparisons (NFC) Identified with readability-string-compare.	2021-01-11 18:48:09 -08:00
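
The InstCombine fold in 58595a7835, `(A ^ B) | ~(A | B) -> ~(A & B)`, can be sanity-checked by brute force. What follows is a minimal Python sketch and not part of the commit: it assumes 8-bit values so the check can be exhaustive (the commit itself relies on the linked Alive2 result for i32), and the names `MASK`, `src`, `tgt`, and `fold_holds` are illustrative, chosen to mirror the `@src`/`@tgt` functions quoted in the commit message.

```python
# Assumption: 8-bit operands stand in for i32 so every input pair can be tried.
MASK = 0xFF


def src(x, y):
    """Original pattern: (x ^ y) | ~(x | y), truncated to 8 bits."""
    xor = y ^ x
    or_ = y | x
    neg = or_ ^ MASK  # bitwise NOT expressed as xor with all-ones
    return (xor | neg) & MASK


def tgt(x, y):
    """Folded pattern: ~(x & y), truncated to 8 bits."""
    return ((x & y) ^ MASK) & MASK


def fold_holds():
    """Return True if src and tgt agree on every pair of 8-bit inputs."""
    return all(src(x, y) == tgt(x, y) for x in range(256) for y in range(256))


print(fold_holds())
```

Because the identity is purely bitwise, each bit position behaves independently, so agreement at 8 bits is a reasonable indication for wider types; it is not a substitute for the Alive2 proof.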

209514 Commits
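
The comparison expansion described in 6008e3ca8a can be illustrated outside LLVM. The following hedged Python sketch models the ordered and unordered predicates on Python floats and checks that SETONE agrees with `SETOGT || SETOLT` and that SETUEQ agrees with `!(SETOGT || SETOLT)`, including NaN operands. All helper names (`is_unordered`, `setone_reference`, `setueq_expanded`, and so on) are hypothetical and are not LLVM APIs; the sample values are an arbitrary choice.

```python
import itertools
import math


def is_unordered(a, b):
    """True when at least one operand is NaN."""
    return math.isnan(a) or math.isnan(b)


def setone_reference(a, b):
    """Ordered and not equal."""
    return (not is_unordered(a, b)) and a != b


def setueq_reference(a, b):
    """Unordered or equal."""
    return is_unordered(a, b) or a == b


def setogt(a, b):
    # Python's > is False when either operand is NaN, matching an ordered compare.
    return a > b


def setolt(a, b):
    # Python's < is False when either operand is NaN, matching an ordered compare.
    return a < b


def setone_expanded(a, b):
    return setogt(a, b) or setolt(a, b)


def setueq_expanded(a, b):
    return not (setogt(a, b) or setolt(a, b))


# Assumed sample set: NaN, infinities, signed zeros, and ordinary values.
SAMPLES = [float("nan"), float("-inf"), -1.5, -0.0, 0.0, 2.0, float("inf")]


def expansions_agree():
    """Compare reference and expanded predicates on every sample pair."""
    return all(
        setone_reference(a, b) == setone_expanded(a, b)
        and setueq_reference(a, b) == setueq_expanded(a, b)
        for a, b in itertools.product(SAMPLES, repeat=2)
    )


print(expansions_agree())
```

The sketch uses two comparisons for clarity; as the commit message notes, a target needs only one of SETOGT or SETOLT, since the other can be obtained by commuting the operands.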