llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 02:33:06 +01:00

Author	SHA1	Message	Date
serge-sans-paille	c5e705c5f8	Harmonize Python shebang Differential Revision: https://reviews.llvm.org/D83857	2020-07-16 21:53:45 +02:00
Matt Arsenault	6bfa8ac3f9	Fix incorrect file path in documentation	2020-07-16 15:53:11 -04:00
Matt Arsenault	39e76f9342	AMDGPU: Add a few more missing test for AGPR tuple copying	2020-07-16 15:53:11 -04:00
Craig Topper	ab11c1e42d	[X86] Change the tuning settings for pentium4 to be more modern since its the default 32-bit cpu in clang Alternative to D83897. I believe the big change here is that I removed slow unaligned memory 16 Down side that it may adversely effect tuning if someone explicitly targets -march=pentium4 and expects pentium4 tuned code. Of course pentium4 is so old our default behavior with the previous settings may not have been the best either. Reviewed By: echristo, RKSimon Differential Revision: https://reviews.llvm.org/D83913	2020-07-16 12:51:25 -07:00
Matt Arsenault	92817d9c2a	AMDGPU: Add missing tests for copyPhysReg AGPR tuples	2020-07-16 15:27:57 -04:00
Mircea Trofin	5f8ff6c8ac	[llvm] Moved InlineSizeEstimatorAnalysis test to .ll Summary: Following guidance in https://llvm.org/docs/TestingGuide.html#testing-analysis Reviewers: mehdi_amini Subscribers: mgorny, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83918	2020-07-16 12:25:16 -07:00
LLVM GN Syncbot	207caa9df8	[gn build] Port 5e8b4be9f85	2020-07-16 19:08:09 +00:00
Wouter van Oortmerssen	53d8604583	[WebAssembly] Triple::wasm64 related cleanup Differential Revision: https://reviews.llvm.org/D83713	2020-07-16 12:01:10 -07:00
Eric Christopher	3a125536d7	Temporarily Revert "[AssumeBundles] Use operand bundles to encode alignment assumptions" due to the performance bugs filed in https://bugs.llvm.org/show_bug.cgi?id=46753. An SROA change soon may obviate some of these problems. This reverts commit 8d09f20798ac180b1749276bff364682ce0196ab.	2020-07-16 11:54:04 -07:00
Nadav Rotem	0cf1290686	[InjectTLIMappings] Use StringRef instead of std::string for FN name. https://reviews.llvm.org/D83797	2020-07-16 11:53:04 -07:00
Zakk Chen	4f4e8c6413	[RISCV] Add support for -mcpu option. Summary: 1. gcc uses `-march` and `-mtune` flag to chose arch and pipeline model, but clang does not have `-mtune` flag, we uses `-mcpu` to chose both infos. 2. Add SiFive e31 and u54 cpu which have default march and pipeline model. 3. Specific `-mcpu` with rocket-rv[32\|64] would select pipeline model only, and use the driver's arch choosing logic to get default arch. Reviewers: lenary, asb, evandro, HsiangKai Reviewed By: lenary, asb, evandro Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D71124	2020-07-16 11:46:22 -07:00
Nadav Rotem	875d6beaad	[LiveVariables] Replace std::vector with SmallVector. Replace std::vector with SmallVector to reduce the number of mallocs. This method is frequently executed, and the number of elements in the vector is typically small. https://reviews.llvm.org/D83920	2020-07-16 11:39:54 -07:00
Thomas Lively	9ac94b78c6	[WebAssembly] Implement v128.select Although the SIMD spec proposal does not specifically include a select instruction, the select instruction in MVP WebAssembly is polymorphic over the selected types, so it is able to work on v128 values when they are enabled. This patch introduces a new variant of the select instruction for each legal vector type. Additional ISel patterns are adapted from the SELECT_I32 and SELECT_I64 patterns. Depends on D83736. Differential Revision: https://reviews.llvm.org/D83737	2020-07-16 11:37:25 -07:00
Nadav Rotem	8c87e8c6cf	[TableGen] Change std::vector to SmallVector The size of VTList that is pushed into this container is usually 1, but often 6 or 7. Change the vector to SmallVector to eliminate frequent mallocs. This happens hundreds of thousands of times in each tablegen execution during the LLVM/clang build. https://reviews.llvm.org/D83849	2020-07-16 11:32:44 -07:00
Matt Arsenault	51efd8682f	AMDGPU: Move handling of AGPR copies to a separate function This is in preparation for fixing multiple problems with the way AGPR copies are handled, but this change is NFC itself. First, it's relying on recursively calling copyPhysReg, which is losing information necessary to get correct super register handling. Second, it's constructing a new RegScavenger and doing a O(N^2) walk on every single sub-spill for every AGPR tuple copy. Third, it's using the forward form of the scavenger, and not using the preferred backwards scan.	2020-07-16 14:32:24 -04:00
Arthur Eubanks	09cfe7939a	[SCEV] Fix ScalarEvolution tests under NPM Many tests use opt's -analyze feature, which does not translate well to NPM and has better alternatives. The alternative here is to explicitly add a pass that calls ScalarEvolution::print(). The legacy pass manager RUNs aren't changing, but they are now pinned to the legacy pass manager. For each legacy pass manager RUN, I added a corresponding NPM RUN using the 'print<scalar-evolution>' pass. For compatibility with update_analyze_test_checks.py and existing test CHECKs, 'print<scalar-evolution>' now prints what -analyze prints per function. This was generated by the following Python script and failures were manually fixed up: import sys for i in sys.argv: with open(i, 'r') as f: s = f.read() with open(i, 'w') as f: for l in s.splitlines(): if "RUN:" in l and ' -analyze ' in l and '\\' not in l: f.write(l.replace(' -analyze ', ' -analyze -enable-new-pm=0 ')) f.write('\n') f.write(l.replace(' -analyze ', ' -disable-output ').replace(' -scalar-evolution ', ' "-passes=print<scalar-evolution>" ').replace(" \| ", " 2>&1 \| ")) f.write('\n') else: f.write(l) There are a couple failures still in ScalarEvolution under NPM, but those are due to other unrelated naming conflicts. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D83798	2020-07-16 11:24:07 -07:00
Thomas Lively	ea404b6fcd	[WebAssembly] Autogenerate tests for simd-select.ll Updating the simd-select.ll tests manually with consistent named regexps for the register numbers was taking more time than it was worth, so this patch updates that test file to have autogenerated output. This is not a significant readability regression because the tests in that file are all very small. Depends on D83734. Differential Revision: https://reviews.llvm.org/D83736	2020-07-16 11:19:09 -07:00
Thomas Lively	1cc997c0a7	[WebAssembly] Lower vselect to v128.bitselect We were previously expanding vselect and matching on the expansion to generate bitselects, but in some cases the expansion would be further combined and a bitselect would not get generated. This patch improves codegen in those cases by legalizing vselect and lowering it to v128.bitselect. The old pattern that matches the expansion is still useful for lowering IR that already uses the expansion rather than a select operation. Differential Revision: https://reviews.llvm.org/D83734	2020-07-16 11:11:19 -07:00
Craig Topper	00bbcf7ab4	[X86] Add test case for PR46455.	2020-07-16 11:06:55 -07:00
Matt Arsenault	ccceacf068	DAG: Try scalarizing when expanding saturating add/sub In an upcoming AMDGPU patch, the scalar cases will be legal and vector ops should be scalarized, rather than producing a long sequence of vector ops which will also need to be scalarized. Use a lazy heuristic that seems to work and improves the thumb2 MVE test.	2020-07-16 14:05:16 -04:00
Denis Antrushin	86fd847c5d	MIR Statepoint refactoring. Part 1: Basic MI level changes. Basic support for variadic-def MIR Statepoint: - Change TableGen STATEPOINT description to variadic out list (For self-documentation purpose; by itself it does not affect code generation in any way). - Update StatepointOpers helper class to handle variadic defs. - Update MachineVerifier to properly handle them, too. With this change, new Statepoint instruction can be passed through backend (excluding ISEL) without errors. Full change set is available at D81603. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D81645	2020-07-17 00:57:21 +07:00
Matt Arsenault	67958cd15e	Use findEnumAttribute helper for preallocated	2020-07-16 13:50:49 -04:00
Matt Arsenault	e12de24e43	IR: Rename Argument::hasPassPointeeByValueAttr to prepare for byref When the byref attribute is added, there will need to be two similar functions for the existing cases which have an associate value copy, and byref which does not. Most, but not all of the existing uses will use the existing version. The associated size function added by D82679 also needs to contextually differ, and will help eliminate a few places still relying on pointee element types.	2020-07-16 13:50:49 -04:00
Matt Arsenault	bd9b22ee76	ValueTracking: Fix isKnownNonZero for non-0 null pointers for byval The IR doesn't have a proper concept of invalid pointers, and "null" constants are just all zeros (though it really needs one). I think it's not possible to break this for AMDGPU due to the copy semantics of byval. If you have an original stack object at 0, the byval copy will be placed above it so I don't think it's really possible to hit a 0 address.	2020-07-16 13:50:49 -04:00
Nico Weber	0ee3da9a31	[gn build] Fix merge script mishap	2020-07-16 13:46:45 -04:00
Matt Arsenault	56d084f606	AMDGPU: Rename gfx9 version of v_add_i32/v_sub_i32 The carry-out opcode is renamed, so eliminate the deceptive _gfx9, which looked like the encoded instruction. The real encoded version was named _gfx9_gfx9. Move it into the VI encoding namespace. The gfx9 namespace is just to deal with the renamed instructions that reinterpret the opcode. When codegened, it would fail to find the real instruction since it wasn't in the right namespace.	2020-07-16 13:32:05 -04:00
Jinsong Ji	f958fcb9e1	[docs] fix ident in llvm-exegesis.rst	2020-07-16 17:30:09 +00:00
Matt Arsenault	6705a324ed	AMDGPU: Rename add/sub with carry out instructions The hardware has created a real mess in the naming for add/sub, which have been renamed basically every generation. Switch the carry out pseudos to have the gfx9/gfx10 names. We were using the original SI/CI v_add_i32/v_sub_i32 names. Later targets reintroduced these names as carryless instructions with a saturating clamp bit, which we do not define. Do this rename so we can unambiguously add these missing instructions. The carry-in versions should also be renamed, but at least those had a consistent _u32 name to begin with. The 16-bit instructions were also renamed, but aren't ambiguous. This does regress assembler error message quality in some cases. In mismatched wave32/wave64 situations, this will switch from "unsupported instruction" to "invalid operand", with the error pointing at the wrong position. I couldn't quite follow how the assembler selects these, but the previous behavior seemed accidental to me. It looked like there was a partial attempt to handle this which was never completed (i.e. there is an AMDGPUOperand::isBoolReg but it isn't used for anything).	2020-07-16 13:16:30 -04:00
Jinsong Ji	eacbb3ef70	[docs][lldb] Fix lldb item in releasenotes Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D83962	2020-07-16 17:07:53 +00:00
Michael Forster	070c7b821a	Add hashing support for std::tuple Summary: All tuple values are passed directly to hash_combine. This is inspired by the implementation used for Swift: `4a1b4edbe1` `845f3829b9` Reviewers: gribozavr2 Reviewed By: gribozavr2 Subscribers: dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83887	2020-07-16 19:01:25 +02:00
Florian Hahn	9c302c2155	[Matrix] Add test for running matrix lowering with -O0.	2020-07-16 17:56:33 +01:00
Rahul Joshi	b39d7177a6	[NFC] Use appropriate names for `for_each` and `transform` template parameters Differential Revision: https://reviews.llvm.org/D83848	2020-07-16 09:35:16 -07:00
David Green	f529a69d02	[BasicAA] Fix -basicaa-recphi for geps with negative offsets As shown in D82998, the basic-aa-recphi option can cause miscompiles for gep's with negative constants. The option checks for recursive phi, that recurse through a contant gep. If it finds one, it performs aliasing calculations using the other phi operands with an unknown size, to specify that an unknown number of elements after the initial value are potentially accessed. This works fine expect where the constant is negative, as the size is still considered to be positive. So this patch expands the check to make sure that the constant is also positive. Differential Revision: https://reviews.llvm.org/D83576	2020-07-16 17:22:40 +01:00
LLVM GN Syncbot	9506f9a061	[gn build] Port 1360e140cc7	2020-07-16 16:14:13 +00:00
Vy Nguyen	86e9087897	[llvm-exegesis] Add benchmark latency option on X86 that uses LBR for more precise measurements. Starting with Skylake, the LBR contains the precise number of cycles between the two consecutive branches. Making use of this will hopefully make the measurements more precise than the existing methods of using RDTSC. Differential Revision: https://reviews.llvm.org/D77422	2020-07-16 12:12:46 -04:00
Louis Dionne	d189420bce	[CMake] Enforce the minimum CMake version to be at least 3.13.4 This commit changes the warning for CMake < 3.13.4 into a fatal error. The intent is to revert and re-apply this simple commit until all build bots are migrated to CMake >= 3.13.4. This is part of the effort discussed on llvm-dev here: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140578.html Differential Revision: https://reviews.llvm.org/D78646	2020-07-16 10:49:11 -04:00
Florian Hahn	99ff7e1519	[SCCP] Add test cases for adding !range to call-sites.	2020-07-16 15:34:58 +01:00
Xing GUO	6bb624d77f	[DWARFYAML] Implement the .debug_str_offsets section. This patch helps add support for emitting the .debug_str_offsets section to yaml2elf. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D83853	2020-07-16 22:33:13 +08:00
David Green	d9cdd45f36	[BasicAA] Add additional negative phi tests. NFC	2020-07-16 15:32:38 +01:00
Petar Avramovic	a46cb78476	AMDGPU/GlobalISel: Legalize s64->s16 G_SITOFP/G_UITOFP Add widenScalar for TypeIdx == 0 for G_SITOFP/G_UITOFP. Legailize, using widenScalar, as s64->s32 G_SITOFP/G_UITOFP followed by s32->s16 G_FPTRUNC. Differential Revision: https://reviews.llvm.org/D83880	2020-07-16 16:31:57 +02:00
Jay Foad	47db2fc583	[PowerPC] Precommit 64-bit funnel shift test cases	2020-07-16 15:20:52 +01:00
Florian Hahn	f350dae7c4	[SCCP] Only track returns of functions with non-void ret ty (NFC). There is no need to add functions with void return types to the set of tracked return values. This does not change functionality, because we such functions do not have return values and we never update or access them.	2020-07-16 15:15:19 +01:00
James Y Knight	1e2065572a	Remove TwoAddressInstructionPass::sink3AddrInstruction. This function has a bug which will incorrectly reschedule instructions after an INLINEASM_BR (which can branch). (The bug may also allow scheduling past a throwing-CALL, I'm not certain.) I could fix that bug, but, as the removed FIXME notes, it's better to attempt rescheduling before converting to 3-addr form, as that may remove the need to convert in the first place. In fact, the code to do such reordering was added to this pass only a few months later, in 2011, via the addition of the function rescheduleMIBelowKill. That code does not contain the same bug. The removal of the sink3AddrInstruction function is not a no-op: in some cases it would move an instruction post-conversion, when rescheduleMIBelowKill would not move the instruction pre-converison. However, this does not appear to be important: the machine instruction scheduler can reorder the after-conversion instructions, in any case. This patch fixes a kernel panic 4.4 LTS x86_64 Linux kernels, when built with clang after 4b0aa5724feaa89a9538dcab97e018110b0e4bc3. Link: https://github.com/ClangBuiltLinux/linux/issues/1085 Differential Revision: https://reviews.llvm.org/D83708	2020-07-16 10:02:52 -04:00
Jay Foad	5903937557	[PowerPC] Use CHECK-LABEL for better diagnostics	2020-07-16 13:41:29 +01:00
Roman Lebedev	f174aef3be	Reland "[NFC] SimplifyCFG: refactor/deduplicate command-line settings override handling" Initially i forgot to stage the SimplifyCFGPass::SimplifyCFGPass() change to actually take the passed params..	2020-07-16 15:25:11 +03:00
David Green	22c19a3e72	[ARM] Add a PreferNoCSEL option. NFC This disables CSEL, falling back to the old predicated move behaviour for cases where that is useful for debugging.	2020-07-16 12:42:07 +01:00
Paul Walker	bb56ec9a61	[SVE] Add lowering for scalable vector fadd, fdiv, fmul and fsub operations. Lower the operations to predicated variants. This is prep work required for fixed length code generation but also fixes a bug whereby these operations fail selection when "unpacked" vector types (e.g. MVT::nxv2f32) are used. This patch also adds the missing "unpacked" patterns for FMA. Differential Revision: https://reviews.llvm.org/D83765	2020-07-16 11:31:35 +00:00
Roman Lebedev	6ce66f09d3	Revert "[NFC] SimplifyCFG: refactor/deduplicate command-line settings override handling" Seems to be breaking the bots. This reverts commit 740a1da108ab9097268b509c85ed9ede7f4d5df5.	2020-07-16 14:27:07 +03:00
Georgii Rymar	6a5758618b	[yaml2obj] - Fix an issue with NoHeaders key. When setting the NoHeaders to false, the e_shnum field wasn't set correctly. This patch fixes this bug. Differential revision: https://reviews.llvm.org/D83941	2020-07-16 14:22:36 +03:00
Roman Lebedev	38861762dc	[NFC] SimplifyCFG: refactor/deduplicate command-line settings override handling	2020-07-16 13:40:02 +03:00

1 2 3 4 5 ...

200254 Commits