llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Juneyoung Lee	2f184e9475	[Utils] Add missing freeze and poison keyword highlights This patch adds missing keyword highlights for freeze and poison Reviewed By: MaskRay, porglezomp Differential Revision: https://reviews.llvm.org/D104017	2021-06-14 09:21:26 +09:00
Eric Astor	d15098fdc8	[ms] [llvm-ml] When parsing MASM, "jmp short" instructions are case insensitive Handle "short" in a case-insensitive fashion in MASM. Required to correctly parse z_Windows_NT-586_asm.asm from the OpenMP runtime. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D104195	2021-06-13 18:36:00 -04:00
Eric Astor	679dc9bc3b	[ms] [llvm-ml] Fix capitalization of the ignored CPU directives These directives are matched in lowercase, so make sure to use lowercase for their P suffix. Differential Revision: https://reviews.llvm.org/D104206	2021-06-13 18:34:42 -04:00
Eric Astor	3e051c60b8	Fix misspelled instruction in X86 assembly parser Did not correctly handle "jecxz short <address>". Discovered while working on LLVM-ML; shows up in z_Windows_NT-586_asm.asm from the OpenMP runtime Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D104194	2021-06-13 18:34:15 -04:00
David Green	a615d4a362	[DSE] Extra multiblock loop tests, NFC. Some of these can be DSE'd, some of which cannot. Useful in D100464.	2021-06-13 22:30:42 +01:00
LemonBoy	71d85d4af9	[SPARC] Legalize truncation and extension between fp128 and half Lower truncations and expansions between fp128 and half values into libcalls. Expand truncating stores into two separate truncation and a store operations. Reviewed By: jrtc27 Differential Revision: https://reviews.llvm.org/D104185	2021-06-13 20:05:15 +02:00
Nikita Popov	478596b756	[LoopUnroll] Test multi-exit runtime unrolling with predictable exit (NFC) The (prior to prologue insertion) predictable exit shouldn't get folded here. Make sure it isn't...	2021-06-13 18:48:38 +02:00
Simon Pilgrim	0b64dd4442	RawError.h - remove unused <string> include. NFCI.	2021-06-13 17:32:57 +01:00
Simon Pilgrim	3f3834f7e5	BoundsChecking.cpp - tidy implicit header dependencies. NFCI. We don't use <vector> but we do use std::pair (<utility>)	2021-06-13 17:08:15 +01:00
Simon Pilgrim	836026294d	DIPrinter.h - tidy implicit header dependencies. NFCI. We don't use <string> but we do use std::unique_ptr (<memory>) and llvm::Optional<>	2021-06-13 17:00:15 +01:00
Simon Pilgrim	910cf30f57	DetailedRecordsBackend.cpp - printSectionHeading - avoid std::string creation/copies. Don't create std::string from constant c-strings or pass std::string by value - we can use StringRef instead.	2021-06-13 16:49:40 +01:00
Simon Pilgrim	9ec7689e46	DetailedRecordsBackend.cpp - tidy implicit header dependencies. NFCI. We don't use <algorithm>, <set> or <vector>, but we do use std::pair (<utility>).	2021-06-13 16:27:17 +01:00
Simon Pilgrim	fdadecc8f8	ProfiledCallGraph.h - remove unused <string> include. NFCI.	2021-06-13 15:19:25 +01:00
Simon Pilgrim	20d33f98a5	RegUsageInfoPropagate.cpp - remove unused <string> and <map> includes. NFCI.	2021-06-13 15:19:24 +01:00
Simon Pilgrim	f71e5f90f8	MachOObjectFile.cpp - remove unused <string> include. NFCI.	2021-06-13 15:19:24 +01:00
Simon Pilgrim	cf2264bfb7	DWARFDebugFrame.cpp - remove unused <string> include. NFCI.	2021-06-13 15:19:24 +01:00
Simon Pilgrim	052e3ea653	GVN.cpp - remove unused <vector> include. NFCI.	2021-06-13 14:06:32 +01:00
Simon Pilgrim	8c6f5b0343	LoopUnrollAndJamPass.cpp - remove unused <vector> include. NFCI.	2021-06-13 14:06:32 +01:00
David Green	9fd9749580	[ARM] Introduce t2WhileLoopStartTP This adds t2WhileLoopStartTP, similar to the t2DoLoopStartTP added in D90591. It keeps a reference to both the tripcount register and the element count register, so that the ARMLowOverheadLoops pass in the backend can pick the correct one without having to search for it from the operand of a VCTP. Differential Revision: https://reviews.llvm.org/D103236	2021-06-13 13:55:34 +01:00
Sanjay Patel	416150a164	[InstCombine] fold ctlz/cttz of bool types https://alive2.llvm.org/ce/z/tX4pUT	2021-06-13 08:26:40 -04:00
Simon Pilgrim	cf219f88a5	ArgumentPromotion.cpp - remove unused <string> include. NFCI.	2021-06-13 13:03:47 +01:00
Simon Pilgrim	d6c6c4cfea	VPlanSLP.cpp - tidy implicit header dependencies. NFCI. We don't use std::string and std::vector, but we do use std::pair and std::max.	2021-06-13 12:37:17 +01:00
Lang Hames	d271ed7707	[JITLink][MachO] Add missing testcase. This test was accidentally left out of f9649d123db.	2021-06-13 20:43:49 +10:00
Kristina Bessonova	9a828c143e	[ARM][NEON] Combine base address updates for vld1Ndup intrinsics Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D103836	2021-06-13 11:18:32 +02:00
Luo, Yuanke	426a9ac6ee	[X86] Check immediate before get it. For CMP imm instruction, when the operand 1 is symbol address we should check if it is immediate first. Here is the example code. `CMP64mi32 $noreg, 8, killed renamable $rcx, @d, $noreg, @a, implicit-def $eflags` Many thanks to Craig, Topper for the test case to reproduce this issue. Differential Revision: https://reviews.llvm.org/D104037	2021-06-13 15:40:52 +08:00
Luo, Yuanke	c9854fe645	Revert "[X86] Check immediate before get it." This reverts commit 9eb2f723c24523194b833779d20b027bf89a4f55.	2021-06-13 13:55:38 +08:00
Shoaib Meenai	2f43148446	[runtimes] Fix umbrella component targets When we're building the runtimes for multiple platform targets, we create umbrella build targets for each distribution component, but those targets didn't have any dependencies and were just no-ops. Make the umbrella target depend on the sub-targets for each platform to fix this, which is consistent with the behavior of the umbrella targets for each runtime, and also consistent with the behavior when we've only specified the default target.	2021-06-12 19:49:44 -07:00
David Blaikie	d3ee11b29a	llvm-objcopy: fix section size truncation/extension when dumping sections Since this only comes up with inputs containing sections at least 4GB large (I guess I could use a bzero section or something, so the input file doesn't have to be 4GB, but even then the output file would have to be 4GB, right?) I've skipped testing this. If there's a nice way to test this without needing 4GB inputs or output files. The subtlety here is demonstrated by this code: struct t { operator uint64_t(); }; static_assert(std::is_same_v<int, decltype(std::declval<bool>() ? 0 : std::declval<t>())>); static_assert(std::is_same_v<uint64_t, decltype(std::declval<bool>() ? 0 : std::declval<uint64_t>())>); Because of this difference, the original source code was getting an int type (truncating the actual size) and then extending it again, resulting in bogus values (I haven't thought through this hard enough to explain why the resulting value was 0xffff... - sign extension, possible UB, but in any case it's the wrong answer - in this particular case I was looking at that resulted in a size so large that we couldn't open a file large enough to write to and ended up with a rather vague: error: 'file_name.o': Invalid argument	2021-06-12 19:00:10 -07:00
Luo, Yuanke	4f7d0be5fe	[X86] Check immediate before get it. For CMP imm instruction, when the operand 1 is symbol address we should check if it is immediate first. Here is the example code. `CMP64mi32 $noreg, 8, killed renamable $rcx, @d, $noreg, @a, implicit-def $eflags` Many thanks to Craig, Topper for the test case to reproduce this issue. Differential Revision: https://reviews.llvm.org/D104037	2021-06-13 09:08:40 +08:00
Roman Lebedev	209d27cb4a	[NFC][X86][Codegen] Add shuffle test that would benefit from sorting in reduceBuildVecToShuffle()	2021-06-13 00:07:48 +03:00
Ian McIntyre	78819ccd55	[llvm-objcopy] Exclude empty sections in IHexWriter output IHexWriter was evaluating a section's physical address when deciding if that section should be written to an output. This approach does not account for a zero-sized section that has the same physical address as a sized section. The behavior varies from GNU objcopy, and may result in a HEX file that does not include all program sections. The IHexWriter now excludes zero-sized sections when deciding what should be written to the output. This affects the contents of the writer's `Sections` collection; we will not try to insert multiple sections that could have the same physical address. The behavior seems consistent with GNU objcopy, which always excludes empty sections, no matter the address. The new test case evaluates the IHexWriter behavior when provided a variety of empty sections that overlap or append a filled section. See the input file's comments for more information. Given that test input, and the change to the IHexWriter, GNU objcopy and llvm-objcopy produce the same output. Reviewed By: jhenderson, MaskRay, evgeny777 Differential Revision: https://reviews.llvm.org/D101332	2021-06-12 12:23:07 -07:00
Xun Li	ab9fd44679	[CHR] Don't run ControlHeightReduction if any BB has address taken This patch addresses https://bugs.llvm.org/show_bug.cgi?id=50610. In the computed goto pattern, there is usually a list of basic blocks that are all targets of an indirectbr instruction, and each basic block also has its address taken and stored in a variable. The CHR pass could potentially clone these basic blocks, which would generate a cloned version of the indirectbr and cloned versions of all basic blocks in the list. However, these cloned basic blocks will not have their addresses taken and stored anywhere, so a later SimplifyCFG pass will simply remove all of them, resulting in incorrect code. To fix this, when searching for scopes, we skip scopes that contain BBs with addresses taken. Added a few test cases. Reviewed By: aeubanks, wenlei, hoy Differential Revision: https://reviews.llvm.org/D103867	2021-06-12 10:29:53 -07:00
Craig Topper	e263936592	[X86] Add ISD::FREEZE and ISD::AssertAlign to the list of opcodes that don't guarantee upper 32 bits are zero. The freeze issue was reported here https://llvm.discourse.group/t/bug-or-feature-freeze-instruction/3639 I don't have a test for AssertAlign. I just noticed it was missing and assume it should be similar to the other two Asserts. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D104178	2021-06-12 09:52:29 -07:00
Florian Hahn	c1b8ea5789	[VPlan] Add more sinking/merging tests with predicated loads/stores.	2021-06-12 15:36:51 +01:00
Florian Hahn	a3f4e168f5	Revert "Allow signposts to take advantage of deferred string substitution" This reverts commit 4fc93a3a1f95ef5a0a57750fc621f2411ea445a8 because it breaks LLDB builds on certain macOS platform & SDK combinations, e.g. http://green.lab.llvm.org/green/job/lldb-cmake-standalone/3288/consoleFull#-195476041949ba4694-19c4-4d7e-bec5-911270d8a58c	2021-06-12 12:08:25 +01:00
Kristina Bessonova	175bc341a3	[lit] Attempt for fix tests failing because of 'warning: non-portable path to file' This is an attempt to fix clang test failures due to 'nonportable-include-path' warnings on Windows when a path to llvm-project's base directory contains some uppercase letters (excluding a drive letter). The issue originates from 2 problems: * discovery.py loads site config in lower case, causing all the paths based on __file__ and requested within the config file to be in lowercase as well, * neither os.path.abspath() nor os.path.realpath() (both used to obtain paths of config files, sources, object directories, etc) returns paths in the correct case on Windows (at least not consistently across Python versions). As the os.path library doesn't seem to provide any reliable way to restore the case for paths on Windows, this patch proposes to use pathlib.resolve(). pathlib has been part of Python since 3.4, while llvm lit requires Python 3.6. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D103014	2021-06-12 12:49:03 +02:00
Florian Hahn	f2662d35c8	Revert "[X86FixupLEAs] Transform the sequence LEA/SUB to SUB/SUB" This reverts commit 1b748faf2bae246e2fc77d88420df13c2e60f4df because it breaks building the llvm-test-suite with -verify-machineinstrs on X86: http://green.lab.llvm.org/green/job/test-suite-verify-machineinstrs-x86_64-O3/9585/ Running llc -verify-machineinstr on X86 crashes on the IR below: target datalayout = "e-m:o-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128" %struct.widget = type { i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, [16 x [16 x i16]], [6 x [32 x i32]], [16 x [16 x i32]], [4 x [12 x [4 x [4 x i32]]]], [16 x i32], i8, i32, i32*, i32, i32, i32, i32, i32, %struct.baz, %struct.wobble.1, i32, i32, i32, i32, i32, i32, %struct.quux.2, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, [3 x i32], i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32**, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, [3 x [2 x i32]], [3 x [2 x i32]], i32, i32, i64, i64, %struct.zot.3, %struct.zot.3, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32 } %struct.baz = type { i32, i32, i32, i32, i32, i32, i32, i32, i32, %struct.snork, %struct.wombat.0, %struct.wobble, i32, i32, i32, i32, i32, i32, i32, i32, i32 (%struct.widget, %struct.eggs), i32, i32, i32, i32 } %struct.snork = type { %struct.spam, %struct.zot, i32 (%struct.wombat, %struct.widget, %struct.snork) } %struct.spam = type { i32, i32, i32, i32, i8, i32 } %struct.zot = type { i32, i32, i32, i32, i32, i8, i32* } %struct.wombat = type { i32, i32, i32, i32, i32, i32, i32, i32, void (i32, i32, i32, i32), void (%struct.wombat, %struct.widget, %struct.zot)* } %struct.wombat.0 = type { [4 x [11 x %struct.quux]], [2 x [9 x 
%struct.quux]], [2 x [10 x %struct.quux]], [2 x [6 x %struct.quux]], [4 x %struct.quux], [4 x %struct.quux], [3 x %struct.quux] } %struct.quux = type { i16, i8 } %struct.wobble = type { [2 x %struct.quux], [4 x %struct.quux], [3 x [4 x %struct.quux]], [10 x [4 x %struct.quux]], [10 x [15 x %struct.quux]], [10 x [15 x %struct.quux]], [10 x [5 x %struct.quux]], [10 x [5 x %struct.quux]], [10 x [15 x %struct.quux]], [10 x [15 x %struct.quux]] } %struct.eggs = type { [1000 x i8], [1000 x i8], [1000 x i8], i32, i32, i32, i32, i32, i32, i32, i32 } %struct.wobble.1 = type { i32, [2 x i32], i32, i32, %struct.wobble.1, %struct.wobble.1, i32, [2 x [4 x [4 x [2 x i32]]]], i32, i64, i64, i32, i32, [4 x i8], [4 x i8], i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32, i32 } %struct.quux.2 = type { i32, i32, i32, i32, i32, %struct.quux.2* } %struct.zot.3 = type { i64, i16, i16, i16 } define void @blam(%struct.widget* %arg, i32 %arg1) local_unnamed_addr { bb: %tmp = load i32, i32* undef, align 4 %tmp2 = sdiv i32 %tmp, 6 %tmp3 = sdiv i32 undef, 6 %tmp4 = load i32, i32* undef, align 4 %tmp5 = icmp eq i32 %tmp4, 4 %tmp6 = select i1 %tmp5, i32 %tmp3, i32 %tmp2 %tmp7 = getelementptr inbounds [4 x [4 x i32]], [4 x [4 x i32]]* undef, i64 0, i64 0, i64 0 %tmp8 = zext i16 undef to i32 %tmp9 = zext i16 undef to i32 %tmp10 = load i16, i16* undef, align 2 %tmp11 = zext i16 %tmp10 to i32 %tmp12 = zext i16 undef to i32 %tmp13 = zext i16 undef to i32 %tmp14 = zext i16 undef to i32 %tmp15 = load i16, i16* undef, align 2 %tmp16 = zext i16 %tmp15 to i32 %tmp17 = zext i16 undef to i32 %tmp18 = sub nsw i32 %tmp8, %tmp9 %tmp19 = shl nsw i32 undef, 1 %tmp20 = add nsw i32 %tmp19, %tmp18 %tmp21 = sub nsw i32 %tmp11, %tmp12 %tmp22 = shl nsw i32 undef, 1 %tmp23 = add nsw i32 %tmp22, %tmp21 %tmp24 = sub nsw i32 %tmp13, %tmp14 %tmp25 = shl nsw i32 undef, 1 %tmp26 = add nsw i32 %tmp25, %tmp24 %tmp27 = sub nsw i32 %tmp16, %tmp17 %tmp28 = shl nsw i32 undef, 1 %tmp29 = add nsw i32 
%tmp28, %tmp27 %tmp30 = sub nsw i32 %tmp20, %tmp29 %tmp31 = sub nsw i32 %tmp23, %tmp26 %tmp32 = shl nsw i32 %tmp30, 1 %tmp33 = add nsw i32 %tmp32, %tmp31 store i32 %tmp33, i32* undef, align 4 %tmp34 = mul nsw i32 %tmp31, -2 %tmp35 = add nsw i32 %tmp34, %tmp30 store i32 %tmp35, i32* undef, align 4 %tmp36 = select i1 %tmp5, i32 undef, i32 undef br label %bb37 bb37: ; preds = %bb %tmp38 = load i32, i32* undef, align 4 %tmp39 = ashr i32 %tmp38, %tmp6 %tmp40 = load i32, i32* undef, align 4 %tmp41 = sdiv i32 %tmp39, %tmp40 store i32 %tmp41, i32* undef, align 4 ret void }	2021-06-12 11:41:38 +01:00
Florian Hahn	daff62b038	Revert "[X86FixupLEAs] Sub register usage of LEA dest should block LEA/SUB optimization" This reverts commit f35bcea1d4748889b8240defdf00cb7a71cbe070 because it depends on 1b748faf2bae246e2fc77d88420df13c2e60f4df, which breaks building the llvm-test-suite with -verify-machineinstrs on X86. See 154adc0f135cff3f8a8861c335d2b88c8049d098 for more details.	2021-06-12 11:40:47 +01:00
madhur13490	1d0f3b309b	[AMDGPU][IndirectCalls] Fix register usage propagation for indirect/external calls This patch computes the max SGPRs and VGPRs used by the module in the presence of indirect calls and makes that the register requirement for functions/kernels which make indirect calls. This patch also refactors code in AMDGPUSubTarget.cpp, adding a "base" variant of getMaxNumSGPRs which is used by the MachineFunction and new Function versions. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D103636	2021-06-12 11:59:34 +05:30
spupyrev	09c0e58fa9	A post-processing for BFI inference The current implementation for computing relative block frequencies does not correctly handle control-flow graphs containing irreducible loops. This results in suboptimally generated binaries, whose perf can be up to 5% worse than optimal. To resolve the problem, we apply a post-processing step, which iteratively updates block frequencies based on the frequencies of their predecessors. This corresponds to finding the stationary point of the Markov chain by an iterative method aka "PageRank computation". The algorithm takes at most O(|E| * IterativeBFIMaxIterations) steps but typically converges faster. It is turned on by passing option `use-iterative-bfi-inference` and applied only for functions containing profile data and irreducible loops. Tested on SPEC06/17, where it is helping to get correct profile counts for one of the binaries (403.gcc). In prod binaries, we've seen a speedup of up to 2%-5% for binaries containing functions with hot irreducible loops. Reviewed By: hoy, wenlei, davidxl Differential Revision: https://reviews.llvm.org/D103289	2021-06-11 21:46:04 -07:00
Adrian Prantl	5f69f29b5c	Allow signposts to take advantage of deferred string substitution One nice feature of the os_signpost API is that format string substitutions happen in the consumer, not the logging application. LLVM's current Signpost class doesn't take advantage of this though and instead always uses a static "Begin/End %s" format string. This patch uses variadic macros to allow the API to be used as intended. Unfortunately, the primary use-case I had in mind (the LLDB_SCOPED_TIMER() macro) does not get much better from this, because __PRETTY_FUNCTION__ is not a macro, but a static string, so signposts created by LLDB_SCOPED_TIMER() still use a static "%s" format string. At least LLDB_SCOPED_TIMERF() works as intended. This reapplies the previously reverted patch with support for platforms where signposts are unavailable. Differential Revision: https://reviews.llvm.org/D103575	2021-06-11 16:52:34 -07:00
Adrian Prantl	5ed934f9e8	Revert "Allow signposts to take advantage of deferred string substitution" I forgot to make the LLDB macro conditional on Linux. This reverts commit 541ccd1c1bb23e1e20a382844b35312c0caffd79.	2021-06-11 16:46:34 -07:00
Andrew Litteken	77b6ee14d4	[IRSim] Strip out the findSimilarity call from the constructor Both doInitialize and runOnModule were running the entire analysis due to the actual work being done in the constructor. Strip it out here and only get the similarity during runOnModule. Author: lanza Reviewers: AndrewLitteken, paquette, plofti Differential Revision: https://reviews.llvm.org/D92524	2021-06-11 18:41:28 -05:00
Adrian Prantl	111b1fef7a	Allow signposts to take advantage of deferred string substitution One nice feature of the os_signpost API is that format string substitutions happen in the consumer, not the logging application. LLVM's current Signpost class doesn't take advantage of this though and instead always uses a static "Begin/End %s" format string. This patch uses variadic macros to allow the API to be used as intended. Unfortunately, the primary use-case I had in mind (the LLDB_SCOPED_TIMER() macro) does not get much better from this, because __PRETTY_FUNCTION__ is not a macro, but a static string, so signposts created by LLDB_SCOPED_TIMER() still use a static "%s" format string. At least LLDB_SCOPED_TIMERF() works as intended. Differential Revision: https://reviews.llvm.org/D103575	2021-06-11 16:35:43 -07:00
Alexander Shaposhnikov	29e2b0118b	[llvm-objcopy][MachO] Do not strip symbols with the flag REFERENCED_DYNAMICALLY set Do not strip symbols having the flag REFERENCED_DYNAMICALLY set. Test plan: make check-all Differential revision: https://reviews.llvm.org/D104092	2021-06-11 16:34:59 -07:00
Arthur Eubanks	bd92e2af3f	[NFC][OpaquePtr] Make getMemoryParamAllocType() compatible with opaque pointers These ABI attributes now always require the type parameter. sret was missing from the first set of checks but was covered by the second set.	2021-06-11 16:01:23 -07:00
Sanjay Patel	2226e4e9b8	[InstCombine] add tests for bit manipulation intrinsics with bool values; NFC	2021-06-11 18:20:14 -04:00
Sanjay Patel	12e7c56398	[InstCombine] update test checks; NFC	2021-06-11 18:20:14 -04:00
Kevin Athey	4f02bdce75	[sanitizer] Remove numeric values from -asan-use-after-return flag. (NFC) for issue: https://github.com/google/sanitizers/issues/1394 Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D104152	2021-06-11 15:14:51 -07:00
Andrew Litteken	0fda28ba27	[IRSim] Don't copy the Mapper for createCandidatesFromSuffixTree Every invocation this was copying the Mapper for no reason. Take a const ref instead. Author: lanza Reviewers: AndrewLitteken, plofti, paquette, Differential Review: https://reviews.llvm.org/D92532	2021-06-11 16:36:23 -05:00
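
Commit 09c0e58fa9 in the log above describes updating block frequencies iteratively from the frequencies of their predecessors until a stationary point is reached. The following is a minimal Python sketch of that general idea, not the LLVM implementation: the function name, the dictionary-based graph encoding, the tolerance, and the choice to pin one unit of flow at the entry block are all assumptions made for illustration.

```python
def iterative_block_frequencies(succ_probs, entry, max_iters=1000, tol=1e-12):
    """Estimate relative block frequencies by repeated propagation.

    succ_probs maps each block name to {successor: branch probability}.
    One unit of flow is injected at `entry` on every step (an assumption
    of this sketch, not something stated in the commit message).
    """
    blocks = list(succ_probs)
    freq = {b: (1.0 if b == entry else 0.0) for b in blocks}
    for _ in range(max_iters):
        # Start each round from the injected entry flow only.
        new = {b: (1.0 if b == entry else 0.0) for b in blocks}
        # Push every block's current frequency along its outgoing edges,
        # so each block ends up with the weighted sum of its predecessors.
        for block, succs in succ_probs.items():
            for succ, prob in succs.items():
                new[succ] += freq[block] * prob
        delta = max(abs(new[b] - freq[b]) for b in blocks)
        freq = new
        if delta < tol:
            break
    return freq


# A hypothetical irreducible loop: the cycle {A, B} can be entered at A or B.
cfg = {
    "entry": {"A": 0.5, "B": 0.5},
    "A": {"B": 1.0},
    "B": {"A": 0.5, "exit": 0.5},
    "exit": {},
}
freqs = iterative_block_frequencies(cfg, "entry")
```

For this graph the fixed point can be checked by hand: A = 0.5 + 0.5 * B and B = 0.5 + A give A = 1.5 and B = 2.0, and the exit block receives 0.5 * B = 1.0, matching the single unit of flow that entered.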

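
Commit d3ee11b29a in the log above reports a section size being narrowed to int and then widened again, ending up as 0xffff... The author leaves the exact cause open. The Python sketch below models only one plausible reading, two's-complement wrap-around followed by sign extension; the helper names and the example size are hypothetical, and the real C++ behaviour may differ.

```python
def to_int32(value):
    """Model narrowing to a signed 32-bit int with two's-complement wrap."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value & 0x80000000 else value


def to_uint64(value):
    """Model widening a signed value to an unsigned 64-bit integer."""
    return value & 0xFFFFFFFFFFFFFFFF


# Hypothetical section size just under 8 GiB; its low 32 bits are all ones.
size = 0x1_FFFF_FFFF
narrowed = to_int32(size)        # low 32 bits reinterpreted as signed
roundtrip = to_uint64(narrowed)  # sign bit smeared across the upper half
```

Under these assumptions the narrowed value is negative, so widening it fills the upper 32 bits with ones instead of restoring the original size. Keeping both operands of the conditional expression as uint64_t, as the static_assert pair in the commit message shows, avoids the narrowing step.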

217145 Commits