llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Simon Cook	01e56fec1b	[RISCV] Fix name of Zba extension (NFC)	2021-01-24 21:02:34 +00:00
Kazu Hirata	0ec2908ce9	[llvm] Use pop_back_val (NFC)	2021-01-24 12:18:57 -08:00
Kazu Hirata	0bd8f7e194	[Target] Use llvm::append_range (NFC)	2021-01-24 12:18:56 -08:00
Kazu Hirata	2fb5578408	[CodeGen] Forward-declare TargetMachine (NFC) InstrEmitter.h needs TargetMachine but relies on a forward declaration of TargetMachine in MachineOperand.h. This patch adds a forward declaration right in InstrEmitter.h. While we are at it, this patch removes the one in MachineOperand.h, where it is unnecessary.	2021-01-24 12:18:54 -08:00
Craig Topper	4269bb7f57	[RISCV] Use SRLIWPat in the PACKUW pattern. This makes the code more tolerant if we ever change SimplifyDemandedBits to not remove 1s from the lsbs of a contiguous mask.	2021-01-24 10:41:58 -08:00
Nikita Popov	710501602d	[Utils] Use NoAliasScopeDeclInst in a few more places (NFC) In the cloning infrastructure, only track an MDNode mapping, without explicitly storing the Metadata mapping, same as is done during inlining. This makes things slightly simpler.	2021-01-24 16:24:11 +01:00
David Green	4ee0b17117	[CostModel] Tests for showing the cost of intrinsics from the vectorizer. NFC	2021-01-24 14:47:15 +00:00
Florian Hahn	91e3095774	[LTO] Move DisableVerify setting to LTOCodeGenerator class (NFC). To simplify the transition to using LTOBackend, move DisableVerify to the LTOCodeGenerator class, like most/all other options. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D95223	2021-01-24 14:14:40 +00:00
Sanjay Patel	bbdf8f6f05	[SLP] fix fast-math requirements for fmin/fmax reductions a6f0221276 enabled intersection of FMF on reduction instructions, so it is safe to ease the check here. There is still some room to improve here - it looks like we have nearly duplicate flags propagation logic inside of the LoopUtils helper but it is limited to targets that do not form reduction intrinsics (they form the shuffle expansion).	2021-01-24 08:55:56 -05:00
David Zarzycki	1c88e8b5ca	Fix x86 exegesis tests after c042aff8860df3cad2b274bf0a495e83ae36ddee In c042aff8860df3cad2b274bf0a495e83ae36ddee, unused FileCheck prefixes became an error, which exposed some testing bugs in four exegesis tests. I've tried my best to either fix the testing bugs, or expand the testing to cover more scenarios. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D95287	2021-01-24 08:51:06 -05:00
David Green	2c039d2841	[AArch64] Saturating add cost tests. NFC	2021-01-24 13:49:17 +00:00
Jeroen Dobbelaere	ba40414330	[InstCombine] Remove unused llvm.experimental.noalias.scope.decl A @llvm.experimental.noalias.scope.decl is only useful if there is !alias.scope and !noalias metadata that uses the declared scope. When that is not the case for at least one of the two, the intrinsic call can as well be removed. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D95141	2021-01-24 13:55:50 +01:00
Jeroen Dobbelaere	2c513aff8c	[LoopRotate] Use llvm.experimental.noalias.scope.decl for duplicating noalias metadata as needed Similar to D92887, LoopRotation also needs to duplicate the noalias scopes when rotating a `@llvm.experimental.noalias.scope.decl` across a block boundary. This is based on the version from the Full Restrict patches (D68511). The problem it fixes also showed up in Transforms/Coroutines/ex5.ll after D93040 (when enabling strict checking with -verify-noalias-scope-decl-dom). Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D94306	2021-01-24 13:53:13 +01:00
Jeroen Dobbelaere	76b232f00a	[LoopUnroll] Use llvm.experimental.noalias.scope.decl for duplicating noalias metadata as needed This is a fix for https://bugs.llvm.org/show_bug.cgi?id=39282. Compared to D90104, this version is based on part of the Full Restrict patches (D68484) and uses the `@llvm.experimental.noalias.scope.decl` intrinsic to track the location where !noalias and !alias.scope scopes have been introduced. This allows us to only duplicate the scopes that are really needed. Notes: - it also includes changes and tests from D90104 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D92887	2021-01-24 13:48:20 +01:00
Lang Hames	f8db80b74e	[examples] Fix "Target does not support MC emission!" in HowToUseJIT example. Patch by Shivam Gupta. Thanks Shivam! Differential Revision: https://reviews.llvm.org/D92280	2021-01-24 22:11:54 +11:00
Nikita Popov	a78d45f1a3	[ValueTracking] Don't assume readonly function will return This is similar to D94106, but for the isGuaranteedToTransferExecutionToSuccessor() helper. We should not assume that readonly functions will return, as this is only true for mustprogress functions (in which case we already infer willreturn). As with the DCE change, for now continue assuming that readonly intrinsics will return, as not all target intrinsics have been annotated yet. Differential Revision: https://reviews.llvm.org/D95288	2021-01-24 10:40:21 +01:00
Craig Topper	7c6fbeb405	[RISCV] Make the code in MatchSLLIUW ignore the lower bits of the AND mask where the shift has guaranteed zeros. This avoids being dependent on SimplifyDemandedBits having cleared those bits. It could make sense to teach SimplifyDemandedBits to keep all lower bits 1 in an AND mask when possible. This could be implemented with slli+srli in the general case rather than needing to materialize the constant.	2021-01-24 00:34:45 -08:00
Lang Hames	cc37457339	[JITLink] Use edge kind names for fixups in EHFrameEdgeFixer. Previously FDE field names were used, but the fixup kind used for a field can vary based on the pointer encoding. This change will improve readability / maintainability when EH-frame support is added to JITLink/ELF.	2021-01-24 15:38:04 +11:00
Ben Shi	a22a29def0	[AVR] Optimize 8-bit int shift Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D90678	2021-01-24 11:04:37 +08:00
Michael Kruse	01519ebfe5	[OpenMPIRBuilder] Silence compiler warning. NFC. Address the compiler warning OMPIRBuilder.cpp:1232:27: comparison of integers of different signs: 'size_t' (aka 'unsigned long') and 'int' [-Wsign-compare]	2021-01-23 21:00:37 -06:00
Michael Kruse	d945273b52	[OpenMPIRBuilder] Implement tileLoops. The tileLoops method implements the code generation part of the tile directive introduced in OpenMP 5.1. It takes a list of loops forming a loop nest, tiles it, and returns the CanonicalLoopInfo representing the generated loops. The implementation takes n CanonicalLoopInfos, n tile size Values and returns 2*n new CanonicalLoopInfos. The input CanonicalLoopInfos are invalidated and BBs not reused in the new loop nest removed from the function. In a modified version of D76342, I was able to correctly compile and execute a tiled loop nest. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D92974	2021-01-23 19:39:29 -06:00
Craig Topper	c76bfbf6f9	[RISCV] Group some Zbs isel patterns together and remove a stale comment. NFC	2021-01-23 16:45:05 -08:00
Craig Topper	c414ad9fcb	[RISCV] Add isel patterns to remove masks on SLO/SRO shift amounts.	2021-01-23 15:57:41 -08:00
Craig Topper	418421be2d	[RISCV] Add test cases for SRO/SLO with shift amounts masked to bitwidth-1. NFC The sro/slo instructions ignore extra bits in the shift amount, so we can ignore the mask just like we do for sll, srl, and sra.	2021-01-23 15:45:51 -08:00
Craig Topper	5e12df72cc	[RISCV] Add isel patterns to remove (and X, 31) from sllw/srlw/sraw shift amounts. We try to do this during DAG combine with SimplifyDemandedBits, but it fails if there are multiple nodes using the AND. For example, multiple shifts using the same shift amount.	2021-01-23 15:08:18 -08:00
Roman Lebedev	fa6d8bb131	[NFC][SimplifyCFG] Extract CloneInstructionsIntoPredecessorBlockAndUpdateSSAUses() out of PerformBranchToCommonDestFolding() To be used in PerformValueComparisonIntoPredecessorFolding()	2021-01-24 00:54:55 +03:00
Roman Lebedev	f60e6717e0	[NFC][SimplifyCFG] Perform early-continue in FoldValueComparisonIntoPredecessors() per-pred loop	2021-01-24 00:54:54 +03:00
Roman Lebedev	4e528a34c7	[NFC][SimplifyCFG] Extract PerformValueComparisonIntoPredecessorFolding() out of FoldValueComparisonIntoPredecessors() Less nested code is much easier to follow and modify.	2021-01-24 00:54:54 +03:00
Nikita Popov	15041a14bb	[IR] Add NoAliasScopeDeclInst (NFC) Add an intrinsic type class to represent the llvm.experimental.noalias.scope.decl intrinsic, to make code working with it a bit nicer by hiding the metadata extraction from view.	2021-01-23 22:40:32 +01:00
Arthur Eubanks	7939adb8b8	[NewPM][opt] Make -enable-new-pm default to LLVM_ENABLE_NEW_PASS_MANAGER This is controlled by the ENABLE_EXPERIMENTAL_NEW_PASS_MANAGER CMake flag. https://lists.llvm.org/pipermail/llvm-dev/2021-January/147993.html Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D95254	2021-01-23 12:36:09 -08:00
Arthur Eubanks	9f31f4aef8	[test] Pin dead-calls-willreturn.ll to legacy PM The new PM inliner does not delete dead calls.	2021-01-23 12:35:36 -08:00
Nikita Popov	0b9bd04fa8	[PhaseOrdering] Add tests for PR44461 and PR48844 (NFC) In both cases, optimization is prevented because "br X == C || X == C2" is converted into a switch. In one case loop rotation is blocked, in the other vectorization.	2021-01-23 21:24:54 +01:00
Nikita Popov	e47540d476	[SimplifyCFG] Regenerate test checks (NFC)	2021-01-23 21:24:54 +01:00
Kazu Hirata	ad24e08f1c	Revert "[Target] Use llvm::append_range (NFC)" This reverts commit cc7a23828657f35f706343982cf96bb6583d4d73. The X86WinEHState.cpp hunk seems to break certain builds.	2021-01-23 11:25:27 -08:00
Florian Hahn	5b8c530938	[FuzzMutate] Add mutator to modify instruction flags. This patch adds a new InstModificationIRStrategy to mutate flags/options for instructions. For example, it may add or remove nuw/nsw flags from add, mul, sub, shl instructions or change the predicate for icmp instructions. Subtle changes such as those mentioned above should lead to a more interesting range of inputs. The presence or absence of overflow flags can expose subtle bugs, for example. Reviewed By: bogner Differential Revision: https://reviews.llvm.org/D94905	2021-01-23 19:05:20 +00:00
Kazu Hirata	dcbeaf027c	[llvm] Use pop_back_val (NFC)	2021-01-23 10:56:33 -08:00
Kazu Hirata	f3e18e1dd6	[Target] Use llvm::append_range (NFC)	2021-01-23 10:56:31 -08:00
Kazu Hirata	f4934140cd	[llvm] Forward-declare ICFLoopSafetyInfo (NFC) LoopUtils.h needs ICFLoopSafetyInfo but relies on a forward declaration of ICFLoopSafetyInfo in IVDescriptors.h. This patch adds a forward declaration right in LoopUtils.h. While we are at it, this patch removes the one in IVDescriptors.h, where it is unnecessary.	2021-01-23 10:56:30 -08:00
Florian Hahn	1a0245b43a	[InstCombine] Set MadeIRChange in replaceInstUsesWith. Some utilities used by InstCombine, like SimplifyLibCalls, may add new instructions and replace the uses of a call, but return nullptr because the inserted call produces multiple results. Previously, the replaced library calls would get removed by InstCombine's deleter, but after 292077072ec1279d89d21873fe900061e55ef936 this may not happen, if the willreturn attribute is missing. As a work-around, update replaceInstUsesWith to set MadeIRChange, if it replaces any uses. This catches the cases where it is used as replacer by utilities used by InstCombine and seems useful in general; updating uses will modify the IR. This fixes an expensive-check failure when replacing @__sinpif/@__cospifi with @__sincospif_sret.	2021-01-23 17:52:59 +00:00
Ben Shi	0d8fd96903	[AVR] Optimize 16-bit comparison with constant Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D93976	2021-01-24 00:38:57 +08:00
Sanjay Patel	77d1393c15	[SLP] fix fast-math-flag propagation on FP reductions As shown in the test diffs, we could miscompile by propagating flags that did not exist in the original code. The flags required for fmin/fmax reductions will be fixed in a follow-up patch.	2021-01-23 11:17:20 -05:00
Sanjay Patel	6c70b83a23	[SLP] add reduction test with mixed fast-math-flags; NFC	2021-01-23 11:17:20 -05:00
Florian Hahn	283961f4c8	[Local] Treat calls that may not return as being alive. With the addition of the `willreturn` attribute, functions that may not return (e.g. due to an infinite loop) are well defined, if they are not marked as `willreturn`. This patch updates `wouldInstructionBeTriviallyDead` to not consider calls that may not return as dead. This patch still provides an escape hatch for intrinsics, which are still assumed as willreturn unconditionally. It will be removed once all intrinsics definitions have been reviewed and updated. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D94106	2021-01-23 16:05:14 +00:00
Ben Shi	dbeacbd88e	[AVR] Optimize 8-bit logic left/right shifts Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D89047	2021-01-23 23:54:16 +08:00
LLVM GN Syncbot	06a1ff4c94	[gn build] Port 0057cc5a215e	2021-01-23 14:07:39 +00:00
Roman Lebedev	cbc12c9267	[SimplifyCFG] Change 'LoopHeaders' to be ArrayRef<WeakVH>, not a naked set, thus avoiding dangling pointers If I change it to AssertingVH instead, a number of existing tests fail, which means we don't consistently remove from the set when deleting blocks, which means newly-created blocks may happen to appear in that set if they happen to occupy the same memory chunk as did some block that was in the set originally. There are many places where we delete blocks, and while we could probably consistently delete from LoopHeaders when deleting a block in transforms located in SimplifyCFG.cpp itself, transforms located elsewhere (Local.cpp/BasicBlockUtils.cpp) also may delete blocks, and it doesn't seem good to teach them to deal with it. Since we at most only ever delete from LoopHeaders, let's just delegate to WeakVH to do that automatically. But to be honest, personally, I'm not sure that the idea behind LoopHeaders is sound.	2021-01-23 16:48:35 +03:00
LLVM GN Syncbot	f4c75ab0c3	[gn build] Port 2325157c0568	2021-01-23 13:38:51 +00:00
Nikita Popov	e42864bbb1	[LSR] Add test for PR46943 (NFC) LSR should be dropping nowrap flags when adding new postinc users.	2021-01-23 13:53:09 +01:00
Florian Hahn	438026988c	[LTO] Store target attributes as vector of strings (NFC). The target features are obtained as a list of features/attributes. Instead of storing them in a single string, store the vector. This matches lto::Config's behavior and simplifies the transition to lto::backend(). Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D95224	2021-01-23 12:11:58 +00:00
Jeroen Dobbelaere	3fda99577d	[InlineFunction] Use llvm.experimental.noalias.scope.decl for noalias arguments. Insert a llvm.experimental.noalias.scope.decl intrinsic that identifies where a noalias argument was inlined. This patch includes some refactorings from D90104. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D93040	2021-01-23 12:10:57 +01:00


210482 Commits