llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Rahul Joshi	b39d7177a6	[NFC] Use appropriate names for `for_each` and `transform` template parameters Differential Revision: https://reviews.llvm.org/D83848	2020-07-16 09:35:16 -07:00
David Green	f529a69d02	[BasicAA] Fix -basicaa-recphi for geps with negative offsets As shown in D82998, the basic-aa-recphi option can cause miscompiles for gep's with negative constants. The option checks for recursive phi, that recurse through a contant gep. If it finds one, it performs aliasing calculations using the other phi operands with an unknown size, to specify that an unknown number of elements after the initial value are potentially accessed. This works fine expect where the constant is negative, as the size is still considered to be positive. So this patch expands the check to make sure that the constant is also positive. Differential Revision: https://reviews.llvm.org/D83576	2020-07-16 17:22:40 +01:00
LLVM GN Syncbot	9506f9a061	[gn build] Port 1360e140cc7	2020-07-16 16:14:13 +00:00
Vy Nguyen	86e9087897	[llvm-exegesis] Add benchmark latency option on X86 that uses LBR for more precise measurements. Starting with Skylake, the LBR contains the precise number of cycles between the two consecutive branches. Making use of this will hopefully make the measurements more precise than the existing methods of using RDTSC. Differential Revision: https://reviews.llvm.org/D77422	2020-07-16 12:12:46 -04:00
Louis Dionne	d189420bce	[CMake] Enforce the minimum CMake version to be at least 3.13.4 This commit changes the warning for CMake < 3.13.4 into a fatal error. The intent is to revert and re-apply this simple commit until all build bots are migrated to CMake >= 3.13.4. This is part of the effort discussed on llvm-dev here: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140578.html Differential Revision: https://reviews.llvm.org/D78646	2020-07-16 10:49:11 -04:00
Florian Hahn	99ff7e1519	[SCCP] Add test cases for adding !range to call-sites.	2020-07-16 15:34:58 +01:00
Xing GUO	6bb624d77f	[DWARFYAML] Implement the .debug_str_offsets section. This patch helps add support for emitting the .debug_str_offsets section to yaml2elf. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D83853	2020-07-16 22:33:13 +08:00
David Green	d9cdd45f36	[BasicAA] Add additional negative phi tests. NFC	2020-07-16 15:32:38 +01:00
Petar Avramovic	a46cb78476	AMDGPU/GlobalISel: Legalize s64->s16 G_SITOFP/G_UITOFP Add widenScalar for TypeIdx == 0 for G_SITOFP/G_UITOFP. Legailize, using widenScalar, as s64->s32 G_SITOFP/G_UITOFP followed by s32->s16 G_FPTRUNC. Differential Revision: https://reviews.llvm.org/D83880	2020-07-16 16:31:57 +02:00
Jay Foad	47db2fc583	[PowerPC] Precommit 64-bit funnel shift test cases	2020-07-16 15:20:52 +01:00
Florian Hahn	f350dae7c4	[SCCP] Only track returns of functions with non-void ret ty (NFC). There is no need to add functions with void return types to the set of tracked return values. This does not change functionality, because we such functions do not have return values and we never update or access them.	2020-07-16 15:15:19 +01:00
James Y Knight	1e2065572a	Remove TwoAddressInstructionPass::sink3AddrInstruction. This function has a bug which will incorrectly reschedule instructions after an INLINEASM_BR (which can branch). (The bug may also allow scheduling past a throwing-CALL, I'm not certain.) I could fix that bug, but, as the removed FIXME notes, it's better to attempt rescheduling before converting to 3-addr form, as that may remove the need to convert in the first place. In fact, the code to do such reordering was added to this pass only a few months later, in 2011, via the addition of the function rescheduleMIBelowKill. That code does not contain the same bug. The removal of the sink3AddrInstruction function is not a no-op: in some cases it would move an instruction post-conversion, when rescheduleMIBelowKill would not move the instruction pre-converison. However, this does not appear to be important: the machine instruction scheduler can reorder the after-conversion instructions, in any case. This patch fixes a kernel panic 4.4 LTS x86_64 Linux kernels, when built with clang after 4b0aa5724feaa89a9538dcab97e018110b0e4bc3. Link: https://github.com/ClangBuiltLinux/linux/issues/1085 Differential Revision: https://reviews.llvm.org/D83708	2020-07-16 10:02:52 -04:00
Jay Foad	5903937557	[PowerPC] Use CHECK-LABEL for better diagnostics	2020-07-16 13:41:29 +01:00
Roman Lebedev	f174aef3be	Reland "[NFC] SimplifyCFG: refactor/deduplicate command-line settings override handling" Initially i forgot to stage the SimplifyCFGPass::SimplifyCFGPass() change to actually take the passed params..	2020-07-16 15:25:11 +03:00
David Green	22c19a3e72	[ARM] Add a PreferNoCSEL option. NFC This disables CSEL, falling back to the old predicated move behaviour for cases where that is useful for debugging.	2020-07-16 12:42:07 +01:00
Paul Walker	bb56ec9a61	[SVE] Add lowering for scalable vector fadd, fdiv, fmul and fsub operations. Lower the operations to predicated variants. This is prep work required for fixed length code generation but also fixes a bug whereby these operations fail selection when "unpacked" vector types (e.g. MVT::nxv2f32) are used. This patch also adds the missing "unpacked" patterns for FMA. Differential Revision: https://reviews.llvm.org/D83765	2020-07-16 11:31:35 +00:00
Roman Lebedev	6ce66f09d3	Revert "[NFC] SimplifyCFG: refactor/deduplicate command-line settings override handling" Seems to be breaking the bots. This reverts commit 740a1da108ab9097268b509c85ed9ede7f4d5df5.	2020-07-16 14:27:07 +03:00
Georgii Rymar	6a5758618b	[yaml2obj] - Fix an issue with NoHeaders key. When setting the NoHeaders to false, the e_shnum field wasn't set correctly. This patch fixes this bug. Differential revision: https://reviews.llvm.org/D83941	2020-07-16 14:22:36 +03:00
Roman Lebedev	38861762dc	[NFC] SimplifyCFG: refactor/deduplicate command-line settings override handling	2020-07-16 13:40:02 +03:00
Roman Lebedev	78ebeae023	[NFC] SimplifyCFGPass::SimplifyCFGPass(): use default SimplifyCFGOptions - we aren't deviating from them here	2020-07-16 13:40:02 +03:00
Roman Lebedev	e0797fbf7b	Reland "[NFC] SimplifyCFGOptions: drop multi-parameter ctor, use default member-init" This reverts commit 5831e86190966d58385678eb74b26aefacbfd101, which reverted commit 90c1b0442a031d6cad686fdc4e5d3db03c3603a6 in preparation for reverting commit b2018198c32a0535bb1f5bb5b40fbcf50d8d47b7 in commit 1067d3e176ea7b0b1942c163bf8c6c90107768c1 due to the introducton of a dependency cycle. Now that the other revert is reverted with a fix, this can be relanded.	2020-07-16 13:40:01 +03:00
Roman Lebedev	e7e1fdf7ba	Reland "[NFCI] createCFGSimplificationPass(): migrate to also take SimplifyCFGOptions" This reverts commit 1067d3e176ea7b0b1942c163bf8c6c90107768c1, which reverted commit b2018198c32a0535bb1f5bb5b40fbcf50d8d47b7, because it introduced a Dependency Cycle between Transforms/Scalar and Transforms/Utils. So let's just move SimplifyCFGOptions.h into Utils/, thus avoiding the cycle.	2020-07-16 13:40:01 +03:00
Pavel Iliin	ced6d5f200	[ARM] VBIT/VBIF support added. Vector bitwise selects are matched by pseudo VBSP instruction and expanded to VBSL/VBIT/VBIF after register allocation depend on operands registers to minimize extra copies.	2020-07-16 11:25:53 +01:00
Sjoerd Meijer	e1b03eb222	Follow up of 2b3c505d0f6e: fixed a typo, and added some more formatting. NFC.	2020-07-16 11:16:48 +01:00
David Green	a9b961b8da	[ARM] CSEL generation This adds a peephole optimisation to turn a t2MOVccr that could not be folded into any other instruction into a CSEL on 8.1-m. The t2MOVccr would usually be expanded into a conditional mov, that becomes an IT; MOV pair. We can instead generate a CSEL instruction, which can potentially be smaller and allows better register allocation freedom, which can help reduce codesize. Performance is more variable and may depend on the micrarchitecture details, but initial results look good. If we need to control this per-cpu, we can add a subtarget feature as we need it. Original patch by David Penry. Differential Revision: https://reviews.llvm.org/D83566	2020-07-16 11:10:53 +01:00
Kerry McLaughlin	30eb603e95	[SVE][CodeGen] Legalisation of masked loads and stores Summary: This patch modifies IncrementMemoryAddress to use a vscale when calculating the new address if the data type is scalable. Also adds tablegen patterns which match an extract_subvector of a legal predicate type with zip1/zip2 instructions Reviewers: sdesmalen, efriedma, david-arm Reviewed By: efriedma, david-arm Subscribers: tschuett, hiraditya, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83137	2020-07-16 10:55:45 +01:00
Florian Hahn	aa7315a8f1	[Matrix] Also run lowering during -O0. Currently the backends cannot lower the matrix intrinsics directly and rely on the lowering to vector instructions happening in the middle-end. At the moment, this means the backend crashes when matrix types extension code is compiled with -O0, e.g. http://green.lab.llvm.org/green/job/test-suite-verify-machineinstrs-aarch64-O0-g/7902/ This patch enables also runs the lowering with -O0 in the middle-end as a temporary solution. Long term, a lightweight version of the lowering should run in the backend, on demand.	2020-07-16 10:51:31 +01:00
Max Kazantsev	4858da5c88	[Test] Add test that shows how SimplifyCFG may insert redunant Phi It happens when a block cannot be threaded because of a convergent function.	2020-07-16 16:23:11 +07:00
Petar Avramovic	7057b4d851	AMDGPU/GlobalISel: Select G_FREEZE Select G_FREEZE in the same way that COPY is selected. Differential Revision: https://reviews.llvm.org/D83031	2020-07-16 11:10:48 +02:00
Max Kazantsev	dda3bbf0e0	Re-enable "[InstCombine] Simplify boolean Phis with const inputs using CFG" This reverts commit b893822e32ffe3c1dcf4d5ac0571a282582d72b2. + Clang test fixes + Insertion point fix for landing pads	2020-07-16 16:09:08 +07:00
Adrian Kuegel	eab3a83f04	Revert "[NFCI] createCFGSimplificationPass(): migrate to also take SimplifyCFGOptions" This reverts commit b2018198c32a0535bb1f5bb5b40fbcf50d8d47b7. This commit introduced a Dependency Cycle between Transforms/Scalar and Transforms/Utils. Transforms/Scalar already depends on Transforms/Utils, so if SimplifyCFGOptions.h is moved to Scalar, and Utils/Local.h still depends on it, we have a cycle.	2020-07-16 10:54:10 +02:00
Adrian Kuegel	728e199470	Revert "[NFC] SimplifyCFGOptions: drop multi-parameter ctor, use default member-init" This reverts commit 90c1b0442a031d6cad686fdc4e5d3db03c3603a6. This is based on another commit which also needs to be reverted. The other commit introduced a Dependency Cycle between Transforms/Scalar and TransformUtils. Scalar already depends (in many ways) on TransformUtils, so making TransformUtils depend on Scalar should be avoided.	2020-07-16 10:32:50 +02:00
Mikael Holmen	95bbed36d0	[MasmParser] Remove unused method emitStructValue to silence warning The method was added in bc8e262afe83 and has been unused ever since so remove it to silence a gcc warning.	2020-07-16 09:36:17 +02:00
Craig Topper	2c95c9a618	[X86] Allow lsl/lar to be parsed with a GR16, GR32, or GR64 as source register. This matches GNU assembler behavior. Operand size is determined only from the destination register.	2020-07-15 23:51:37 -07:00
Max Kazantsev	c2145a0c83	Revert "[InstCombine] Simplify boolean Phis with const inputs using CFG" This reverts commit 00472067c34ccbceb2fad4b905524f3c780bb7d5. Need to fix failing clang tests.	2020-07-16 12:58:39 +07:00
Amy Kwan	1157465de4	[PowerPC][Power10] Fix VINS* (vector insert byte/half/word) instructions to have i32 arguments. Previously, the vins* intrinsic was incorrectly defined to have its second and third argument arguments as an i64. This patch fixes the second and third argument of the vins* instruction and intrinsic to have i32s instead. Differential Revision: https://reviews.llvm.org/D83497	2020-07-16 00:30:24 -05:00
Max Kazantsev	be5802be10	[InstCombine] Simplify boolean Phis with const inputs using CFG This patch adds simplification for pattern: ``` if (cond) / \ ... ... \ / p = phi [true] [false] ... br p, succ_1, succ_2 ``` If we can prove that top block's branches dominate respective inputs of a block that has a Phi with constant inputs, we can use the branch condition (maybe inverted) instead of Phi. This will make proofs of implication for further jump threading more transparent. Differential Revision: https://reviews.llvm.org/D81375 Reviewed By: xbolva00	2020-07-16 12:06:10 +07:00
Craig Topper	340c0ed23d	Revert "[InstSimplify] Remove select ?, undef, X -> X and select ?, X, undef -> X transforms" and subsequent patches This reverts most of the following patches due to reports of miscompiles. I've left the added test cases with comments updated to be FIXMEs. 1cf6f210a2e [IR] Disable select ? C : undef -> C fold in ConstantFoldSelectInstruction unless we know C isn't poison. 469da663f2d [InstSimplify] Re-enable select ?, undef, X -> X transform when X is provably not poison 122b0640fc9 [InstSimplify] Don't fold vectors of partial undef in SimplifySelectInst if the non-undef element value might produce poison ac0af12ed2f [InstSimplify] Add test cases for opportunities to fold select ?, X, undef -> X when we can prove X isn't poison 9b1e95329af [InstSimplify] Remove select ?, undef, X -> X and select ?, X, undef -> X transforms	2020-07-15 22:02:33 -07:00
Kiran Kumar T P	62537b4233	[flang][OpenMP] Enhance parser support for taskwait construct to OpenMP 5.0 Summary: This patch enhances parser support for taskwait construct to OpenMP 5.0. 2.17.5 taskwait Construct !$omp taskwait [clause[ [,] clause] ... ] where clause is one of the following: depend([depend-modifier,]dependence-type : locator-list) The patch includes code changes and testcase modifications. Reviewed By: Valentin Clement, Kiran Chandramohan Differential Revision: https://reviews.llvm.org/D82255	2020-07-16 10:10:59 +05:30
George Rokos	d67d0200e4	Fix lit test related to declare mapper patch D67833.	2020-07-15 20:31:36 -07:00
Carl Ritson	c5860207c6	[AMDGPU] Update VMEM scalar write hazard mitigation sequence Using s_waitcnt_depctr 0xffe3 is potentially faster than v_nop. Reviewed By: rampitec, foad Differential Revision: https://reviews.llvm.org/D83872	2020-07-16 11:37:45 +09:00
George Rokos	02e72ee959	[OpenMP 5.0] Codegen support to pass user-defined mapper functions to runtime This patch implements the code generation to use OpenMP 5.0 declare mapper (a.k.a. user-defined mapper) constructs. Patch written by Lingda Li. Differential Revision: https://reviews.llvm.org/D67833	2020-07-15 18:11:43 -07:00
Quentin Colombet	300655df66	[CalcSpillWeights] Propagate the fact that a live-interval is not spillable When we calculate the weight of a live-interval, add some code to check if the original live-interval was markied as not spillable and if so, progagate that information down to the new interval. Previously we would just recompute a weight for the new interval, thus, we could in theory just spill live-intervals marked as not spillable by just splitting them. That goes against the spirit of a non-spillable live-interval. E.g., previously we could do: v1 = // v1 must not be spilled ... = v1 Split: v1 = // v1 must not be spilled ... v2 = v1 // v2 can be spilled ... v3 = v2 // v3 can be spilled = v3 There's no test case for that one as we would need to split a non-spillable live-interval without using LiveRangeEdit to see this happening. RegAlloc inserts non-spillable intervals only as part of the spilling mechanism, thus at this point the intervals are not splittable anymore. On top of that, RegAlloc uses the LiveRangeEdit API, which already properly propagate that information. In other words, this could only happen if a target was to mark a live-interval as not spillable before register allocation and split it without using LRE, e.g., through LiveIntervals::splitSeparateComponent.	2020-07-15 17:57:36 -07:00
dfukalov	ad2755be86	[AMDGPU][CostModel] Improve cost estimation for fused {fadd\|fsub}(a,fmul(b,c)) Summary: If result of fmul(b,c) has one use, in almost all cases (except denormals are IEEE) the pair of operations will be fused in one fma/mad/mac/etc. Reviewers: rampitec Reviewed By: rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits, kerbowa Tags: #llvm Differential Revision: https://reviews.llvm.org/D83919	2020-07-16 03:06:38 +03:00
Roman Lebedev	70e59b1628	[NFC] SimplifyCFGOptions: drop multi-parameter ctor, use default member-init Likewise, just use the builder pattern. Taking multiple params is unmaintainable.	2020-07-16 01:48:34 +03:00
Roman Lebedev	fc354c8afc	[NFCI] createCFGSimplificationPass(): migrate to also take SimplifyCFGOptions Taking so many parameters is simply unmaintainable. We don't want to include the entire llvm/Transforms/Utils/Local.h into llvm/Transforms/Scalar.h so i've split SimplifyCFGOptions into it's own header.	2020-07-16 01:27:54 +03:00
Roman Lebedev	0d7bdb01fc	[NFCI] CFGSimplifyPass: change (the only) constructor to take SimplifyCFGOptions Taking that long list of parameters is already simply unmaintainable.	2020-07-16 01:27:53 +03:00
Craig Topper	9c23a3f346	[X86] Teach assembler parser to accept lsl and lar with a 64 or 32 source register when the destination is a 64 register. Previously we only accepted a 32-bit source with a 64-bit dest. Accepting 64-bit as well is more consistent with gas behavior. I think maybe we should accept 16 bit register as well, but I'm not sure.	2020-07-15 15:17:06 -07:00
Roman Lebedev	1d426b3c0e	[NFC][SimplifyCFG] HoistThenElseCodeToIf(): after hoisting terminator, do return Changed, not just true Otherwise, if Changed was still false before that, we would not account for that hoist in NumHoistCommonCode statistic.	2020-07-16 00:32:48 +03:00
Roman Lebedev	01fd9b5b86	[NFC][SimplifyCFG] HoistThenElseCodeToIf(): count number of common instruction "blocks" hoisted I.e. out of all the times HoistThenElseCodeToIf() was called, how many times did it actually hoist something?	2020-07-16 00:21:56 +03:00

1 2 3 4 5 ...

200223 Commits