llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	4d7e8eba5b	[CostModel][X86] Update comment describing source of costs - we now use llvm-mca more than IACA	2021-07-02 14:29:32 +01:00
Simon Pilgrim	5e6cee0948	[CostModel][X86] Drop some hard coded fp<->int scalarization costs Scalarization costs handling is a lot better now, and the hard coded costs were higher than the worse case numbers from the script in D103695	2021-07-02 14:29:32 +01:00
LLVM GN Syncbot	62f1d32559	[gn build] Port a92964779cb5	2021-07-02 13:05:44 +00:00
Nico Weber	415469ea29	Revert "[InstrProfiling] Use external weak reference for bias variable" This reverts commit 33a7b4d9d8e6a113108aa71ed78ca32a83c68523. Breaks check-profile on macOS, see comments on https://reviews.llvm.org/D105176	2021-07-02 09:05:12 -04:00
Florian Hahn	3eec7a94c7	[Matrix] Fix crash during fusion if the same load is re-used. This patch fixes a crash when the same load is used for both operands of a fuseable multiply.	2021-07-02 14:00:17 +01:00
Simon Pilgrim	1a043ee5ed	[CostModel][X86] Find AVX conversion costs using legalized types if custom types didn't match Building on rG2a1ef8784ad9a, fallback to attempting to match against legalized types like we do for SSE targets.	2021-07-02 13:49:31 +01:00
Simon Pilgrim	c54e15d18e	[CostModel][X86] Adjust uitofp(vXi64) SSE/AVX legalized costs based on llvm-mca reports. Update v4i64 -> v4f32/v4f64 uitofp costs based on the worst case costs from the script in D103695. Fixes a few regressions before we start adding AVX costs for legalized types.	2021-07-02 13:09:00 +01:00
Alexey Bataev	2e4610cfa1	[SLP]Fix gathering of the scalars by not ignoring UndefValues. The compiler should not ignore UndefValue when gathering the scalars, otherwise the resulting code may be less defined than the original one. Also, grouped scalars to insert them at first to reduce the analysis in further passes. Differential Revision: https://reviews.llvm.org/D105275	2021-07-02 04:46:48 -07:00
Alexandru Octavian Butiu	9690f39948	[MachineCopyPropagation] Fix differences in code gen when compiling with -g Fixes bugs [[ https://bugs.llvm.org/show_bug.cgi?id=50580 \| 50580 ]] and [[ https://bugs.llvm.org/show_bug.cgi?id=49446 \| 49446 ]] When compiling with -g "DBG_VALUE <reg>" instructions are added in the MIR, if such a instruction is inserted between instructions that use <reg> then MachineCopyPropagation invalidates <reg> , this causes some copies to not be propagated and causes differences in code generation (ex bugs 50580 and 49446 ). DBG_VALUE instructions should be ignored since they don't actually modify the register. Reviewed By: lkail Differential Revision: https://reviews.llvm.org/D104394	2021-07-02 19:27:06 +08:00
Alex Richardson	a73a5b4199	Place the BlockAddress type in the address space of the containing function While this should not matter for most architectures (where the program address space is 0), it is important for CHERI (and therefore Arm Morello). We use address space 200 for all of our code pointers and without this change we assert in the SelectionDAG handling of BlockAddress nodes. It is also useful for AVR: previously programs targeting AVR that attempt to read their own machine code via a pointer to a label would instead read from RAM using a pointer relative to the the start of program flash. Reviewed By: dylanmckay, theraven Differential Revision: https://reviews.llvm.org/D48803	2021-07-02 12:17:55 +01:00
Florian Hahn	6aab8b237f	[AArch64] Use custom lowering for fp16 vector copysign. The custom copysign lowering already supports fp16. Use it. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D105277	2021-07-02 11:15:30 +01:00
Roman Lebedev	0c04b3c95c	[NFC][SimplifyCFG] Autogenerate checklines in trapping-load-unreachable.ll test	2021-07-02 12:59:14 +03:00
Florian Hahn	d7c2069f15	[Matrix] Hoist address computation before multiply to enable fusion. If the store address does not dominate the matrix multiply, try to hoist address computation instructions without side-effects and/or memory reads before the multiply, to allow fusion. Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D105193	2021-07-02 09:52:11 +01:00
Roman Lebedev	5bd901b404	Revert "[WebAssembly] Implementation of global.get/set for reftypes in LLVM IR" This reverts commit 4facbf213c51e4add2e8c19b08d5e58ad71c72de. ``` ****************** FAIL: LLVM :: CodeGen/WebAssembly/funcref-call.ll (44466 of 44468) **************** TEST 'LLVM :: CodeGen/WebAssembly/funcref-call.ll' FAILED ****************** Script: -- : 'RUN: at line 1'; /builddirs/llvm-project/build-Clang12/bin/llc < /repositories/llvm-project/llvm/test/CodeGen/WebAssembly/funcref-call.ll --mtriple=wasm32-unknown-unknown -asm-verbose=false -mattr=+reference-types \| /builddirs/llvm-project/build-Clang12/bin/FileCheck /repositories/llvm-project/llvm/test/CodeGen/WebAssembly/funcref-call.ll -- Exit Code: 2 Command Output (stderr): -- llc: /repositories/llvm-project/llvm/include/llvm/Support/LowLevelTypeImpl.h:44: static llvm::LLT llvm::LLT::scalar(unsigned int): Assertion `SizeInBits > 0 && "invalid scalar size"' failed. ```	2021-07-02 11:49:51 +03:00
Paulo Matos	e346ccc104	[WebAssembly] Implementation of global.get/set for reftypes in LLVM IR Reland of 31859f896. This change implements new DAG notes GLOBAL_GET/GLOBAL_SET, and lowering methods for load and stores of reference types from IR globals. Once the lowering creates the new nodes, tablegen pattern matches those and converts them to Wasm global.get/set. Differential Revision: https://reviews.llvm.org/D104797	2021-07-02 09:46:28 +02:00
Douglas Yung	81b1336b45	Relax newly added opcode check to check only for a number instead of a specific opcode.	2021-07-01 23:09:25 -07:00
Evgeniy Brevnov	de9e734de3	[NFC][DSE]Change 'do-while' to 'for' loop to simplify code structure With 'for' loop there is is a single place where 'Current' is adjusted. It helps to avoid copy paste and makes a bit easy to understand overall loop controll flow. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D101044	2021-07-02 10:00:47 +07:00
Fangrui Song	50b7df4325	[llvm-symbolizer] Move setGroupedShortOptions and don't ignore case setGroupedShortOptions in the ctor seems more popular.	2021-07-01 19:43:49 -07:00
Lang Hames	85a8d3c7b3	[ORC] Rename SPSTargetAddress to SPSExecutorAddress. Also removes SPSTagTargetAddress, which was accidentally introduced at some point (and never used).	2021-07-02 12:40:14 +10:00
Craig Topper	ee95ce8f4b	[ScalarizeMaskedMemIntrin][SelectionDAGBuilder] Use the element type to calculate alignment for gather/scatter when alignment operand is 0. Previously we used the vector type, but we're loading/storing invididual elements so I think only element alignment should matter. Noticed while looking at the code for something else so I don't have a test case. Differential Revision: https://reviews.llvm.org/D105220	2021-07-01 19:08:47 -07:00
Matt Arsenault	fb5d867be7	Mips/GlobalISel: Use accurate memory LLTs	2021-07-01 20:08:14 -04:00
Akira Hatanaka	1ee120aa43	Precommit test cases in https://reviews.llvm.org/D104953	2021-07-01 17:03:07 -07:00
Jessica Paquette	b028312fa0	[GlobalISel] Translate <1 x N> getelementptrs to scalar G_PTR_ADDs In `IRTranslator::translateGetElementPtr`, when we run into a vector gep with some scalar operands, we try to normalize those operands using `buildSplatVector`. This is fine except for when the getelementptr has a <1 x N> type. In that case it is treated as a scalar. If we run into one of these then every call to ``` // With VectorWidth = 1 LLT::fixed_vector(VectorWidth, PtrTy) ``` will assert. Here's an example (equivalent to the added testcase): https://godbolt.org/z/hGsTnMYdW To get around this, this patch adds a variable, `WantSplatVector`, which is true when our vector type ought to actually be represented using a vector. When it's false, we'll translate as a scalar. This checks if `VectorWidth > 1`. This fixes this bug: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=35496 Differential Revision: https://reviews.llvm.org/D105316	2021-07-01 16:38:47 -07:00
Eli Friedman	0c7ff0674a	[AArch64] Optimize SVE bitcasts of unpacked types. Target-independent code only knows how to spill to the stack; instead, use AArch64ISD::REINTERPRET_CAST. Differential Revision: https://reviews.llvm.org/D104573	2021-07-01 15:35:48 -07:00
LLVM GN Syncbot	a3a9d61236	[gn build] Port 33a7b4d9d8e6	2021-07-01 22:26:09 +00:00
Petr Hosek	c555365b5c	[InstrProfiling] Use external weak reference for bias variable We need the compiler generated variable to override the weak symbol of the same name inside the profile runtime, but using LinkOnceODRLinkage results in weak symbol being emitted which leads to an issue where the linker might choose either of the weak symbols potentially disabling the runtime counter relocation. This change replaces the use of weak definition inside the runtime with an external weak reference to address the issue. We also place the compiler generated symbol inside a COMDAT group so dead definition can be garbage collected by the linker. Differential Revision: https://reviews.llvm.org/D105176	2021-07-01 15:25:31 -07:00
Jon Roelofs	a9ef1d9c4a	[GISel] Print better error messages for missing Combiner Observer calls Differential revision: https://reviews.llvm.org/D105290	2021-07-01 15:18:18 -07:00
Sanjay Patel	1cbf22d059	[InstSimplify] do not propagate poison from select arm to icmp user This is the cause of the miscompile in: https://llvm.org/PR50944 The problem has likely existed for some time, but it was made visible with: 5af8bacc94024 ( D104661 ) handleOtherCmpSelSimplifications() assumed it can convert select of constants to bool logic ops, but that does not work with poison. We had a very similar construct in InstCombine, so the fix here mimics the fix there. The bug is in instsimplify, but I'm not sure how to reproduce it outside of instcombine. The reason this is visible in instcombine is because we have a hack (FIXME) to bypass simplification of a select when it has an icmp user: `955f125899/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp (L2632)` So we get to an unusual case where we are trying to simplify an instruction that has an operand that would have already simplified if we had processed it in normal order. Differential Revision: https://reviews.llvm.org/D105298	2021-07-01 17:40:07 -04:00
Sanjay Patel	01dbad81cc	[InstCombine][test] add tests for icmp simplify miscompile (PR50944); NFC	2021-07-01 17:40:07 -04:00
Daniel Rodríguez Troitiño	c26c170592	[cmake] Invoke strip without -l and with non-grouped flags. `llvm-strip` does not support `-l`. Apple's `strip` supports `-l`, but it is not documented, and the latest code doesn't seem to do anything meaningful. From the old source code drops it seems that `-l` was added around version 795 of cctools and removed before 898. The code around the flag usage in 795 talks about problems with kext and forcing the execution of `ld -r`, which seems a behaviour that is not enforceable in latest versions of cctools. The `-l` flag was added in https://reviews.llvm.org/D15133 without a lot of explanation. Since the flag is not active, removing it should not modify the behaviour for most people (except if someone is trying to compile LLVM with a really old version of `strip`). Additionally, break the invocation into two different flags, since `llvm-strip` doesn't at the moment support grouped flags, and other `strip` implementations should work the same no matter if grouped or not. Test Plan: Using `strip` from Xcode 12.5 in Big Sur to strip the same binary (a simple Hello World), using both `-Sxl` and `-Sx` produces exactly the same binary. Repeating the same process with `clang` results also in the same binary. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D105243	2021-07-01 13:37:58 -07:00
Daniel Rodríguez Troitiño	67647eca64	[llvm-strip] Support grouped options in llvm-strip GNU and Apple `strip` implementations seems to support grouped options. Enable the support for grouped options introduced in https://reviews.llvm.org/D83639 for `llvm-strip` invocations. Includes test that checks that both the grouped and non grouped invocations produces the same result. Reviewed By: alexander-shaposhnikov, MaskRay Differential Revision: https://reviews.llvm.org/D105249	2021-07-01 13:36:45 -07:00
David Green	ee40bb98a6	[ARM] Reassociate BFI D104868 removed an (incorrect) fold for distributing BFI instructions in a chain, combining them into a single instruction. BFIs like that are hard to test, as the patterns are often destroyed before they become BFIs. But it can come up in places, with chains of BFIs that can be combined. This patch adds a replacement, which reassociates BFI instructions with non-overlapping insertion masks so that low bits are inserted first. This can end up sorting the nodes so that adjacent inserts are next to one another, allowing the existing folds to combine into a single BFI. Differential Revision: https://reviews.llvm.org/D105096	2021-07-01 21:08:13 +01:00
Valentin Churavy	0b1b7443f1	[Orc] At CBindings for LazyRexports At C bindings and an example for LLJIT with lazy reexports Differential Revision: https://reviews.llvm.org/D104672	2021-07-01 21:52:05 +02:00
Joel E. Denny	4a1757d821	[lit] Extend --xfail/LIT_XFAIL to take full test name The new documentation entry gives an example use case from libomptarget. Reviewed By: yln, jhenderson, davezarzycki Differential Revision: https://reviews.llvm.org/D105208	2021-07-01 15:46:37 -04:00
Tobias Gysi	d1d5a9b638	[CMake][MLIR][Linalg] Adding variable to specify tablegen file dependencies. Synchronizing multiple custom targets requires not only target but also file dependencies. Building Linalg involves running yaml-gen followed by tablegen. Currently, these custom targets are only synchronized using a target dependency resulting in issues in specific incremental build setups (https://llvm.discourse.group/t/missing-build-cmake-tblgen-dependency/3727/10). This patch introduces a novel LLVM_TARGET_DEPENDS variable to the TableGen.cmake file to provide a way to specify file dependencies. Additionally, it adapts the Linalg CMakeLists.txt to introduce the necessary file dependency between yaml-gen and tablegen. Differential Revision: https://reviews.llvm.org/D105272	2021-07-01 18:54:30 +00:00
Nico Weber	856c1b08d5	[gn build] Port 050b064f15ee	2021-07-01 14:52:21 -04:00
Nikita Popov	460a9b9357	[OpaquePtr] Support VecOfAnyPtrsToElt intrinsics In this case the pointer type is part of the mangled name, so we can allow on opaque pointer outside --force-opaque-pointers mode as well.	2021-07-01 20:35:33 +02:00
Scott Linder	62d5b36f28	[ADT] Follow up to fix bug in "Add makeVisitor to STLExtras.h" Address mistakenly comparing the pointer values of two C-style strings rather than comparing their contents in the unit tests for makeVisitor, added in 6d6f35eb7b92c6dd4478834497752f4e963db16d	2021-07-01 18:24:49 +00:00
Philip Reames	ade0ec7fb0	[instcombine] Fold overflow check using overflow intrinsic to comparison This follows up to D104665 (which added umulo handling alongside the existing uaddo case), and generalizes for the remaining overflow intrinsics. I went to add analogous handling to LVI, and discovered that LVI already had a more general implementation. Instead, we can port was LVI does to instcombine. (For context, LVI uses makeExactNoWrapRegion to constrain the value 'x' in blocks reached after a branch on the condition `op.with.overflow(x, C).overflow`.) Differential Revision: https://reviews.llvm.org/D104932	2021-07-01 09:41:55 -07:00
Nikita Popov	6203a57d08	[OpaquePtr] Support opaque pointers in intrinsic type check This adds support for opaque pointers in intrinsic type checks of IIT kind Pointer and PtrToElt. This is less straight-forward than it might initially seem, because we should only accept opaque pointers here in --force-opaque-pointers mode. Otherwise, there would be more than one valid type signature for a given intrinsic name. Differential Revision: https://reviews.llvm.org/D105155	2021-07-01 18:26:41 +02:00
Matt Arsenault	20d89b9242	GlobalISel: Use LLT in call lowering callbacks This preserves the memory type so the lowerings can rely on them.	2021-07-01 12:15:54 -04:00
Bradley Smith	f53921a516	[SelectionDAG] Implement PromoteIntRes_INSERT_SUBVECTOR Inserting into a smaller-than-legal scalable vector would result in an internal compiler error. For example, inserting a <vscale x 4 x i8> into a <vscale x 8 x i8> (both illegal vector types for SVE) would cause a crash. This crash was happening because there was no code to promote (legalise) the result of an INSERT_SUBVECTOR node. This patch implements PromoteIntRes_INSERT_SUBVECTOR, which legalises the ISD node. This is currently done by going through memory. This is necessary because of the requirement that the SubVec parameter of the INSERT_SUBVECTOR node must be smaller than the Vec parameter, which means that INSERT_SUBVECTOR cannot always have a legal result/operand types. Co-Authored-by: Joe Ellis <joe.ellis@arm.com> Differential Revision: https://reviews.llvm.org/D102766	2021-07-01 17:05:53 +01:00
Stanislav Mekhanoshin	bb5a67dc29	[AMDGPU] Fix immediate sign during V_MOV_B64_PSEUDO expansion Creating a V_MOV_B32 with zero extended immediate source prevented conversion to V_BFREV_B32. Differential Revision: https://reviews.llvm.org/D105235	2021-07-01 09:00:29 -07:00
David Green	878abea094	[ARM] Extra BFI codegen tests. NFC	2021-07-01 16:56:23 +01:00
Irina Dobrescu	b31f0d013b	[AArch64][GlobalISel]Legalise some vector types for min/max Differential Revision: https://reviews.llvm.org/D105200	2021-07-01 16:29:38 +01:00
Nico Weber	394ac62585	[clangd] Unbreak mac build differently 0c96a92d8666b8 This reverts b56e5f8a10c1 (and follow-up f6db88535cb) and instead restores the state we had before 0c96a92d8666b8: ClangdMain.cpp includes Features.inc before including Transport.h. This is a bit ugly, but it matches the former state and making Transport.h include Features.h means that xpc/ needs to be able to find the generated Features.inc, wich is also a bit ugly.	2021-07-01 10:51:27 -04:00
Simon Pilgrim	adc36a317c	[LoopVectorize][X86] Regenerate conversion-cost.ll tests	2021-07-01 15:34:20 +01:00
Simon Pilgrim	f0a2f7f0f9	[CostModel][X86] Adjust fp<->int vXi32 SSE legalized costs based on llvm-mca reports. Building on rG2a1ef8784ad9a, adjust the SSE cost tables to use the legalized types based on the worst case costs from the script in D103695. To account for different numbers of src/dst legalized type registers we must scale the cost by maximum of the src/dst, not just use src	2021-07-01 15:34:20 +01:00
Arnold Schwaighofer	10d39b09ad	[coro async] Add support for specifying which parameter is swiftself in async resume functions Differential Revision: https://reviews.llvm.org/D104147	2021-07-01 07:33:15 -07:00
Sjoerd Meijer	db79ea7a1b	[AArch64] Add some more tests to CodeGen/AArch64/aarch64-load-ext.ll. NFC.	2021-07-01 15:15:21 +01:00

1 2 3 4 5 ...

217986 Commits