llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 18:54:02 +01:00

Author	SHA1	Message	Date
Stanislav Mekhanoshin	86aeb69232	[AMDGPU] gfx1032 target Differential Revision: https://reviews.llvm.org/D89487	2020-10-15 12:41:18 -07:00
Thomas Lively	438da14930	Reland "[WebAssembly] v128.load{8,16,32,64}_lane instructions" This reverts commit 7c8385a352ba21cb388046290d93b53dc273cd9f with a typing fix to an instruction selection pattern.	2020-10-15 19:32:34 +00:00
Sanjay Patel	885b6533d2	[InstCombine] update tests for logic folds to exercise commuted patterns; NFC This was the intent for D88551. I also varied the types a bit for extra coverage and tried to make better test/value names.	2020-10-15 14:37:49 -04:00
Anh Tuyen Tran	23c4b1ff43	[NFC][CaptureTracking] Move static function isNonEscapingLocalObject to llvm namespace Function isNonEscapingLocalObject is a static one within BasicAliasAnalysis.cpp. It wraps around PointerMayBeCaptured of CaptureTracking, checking whether a pointer is to a function-local object, which never escapes from the function. Although at the moment, isNonEscapingLocalObject is used only by BasicAliasAnalysis, its functionality can be used by other pass(es), one of which I will put up for review very soon. Instead of copying the contents of this static function, I move it to llvm scope, and place it amongst other functions with similar functionality in CaptureTracking. The rationale for the location are: - Pointer escape and pointer being captured are actually two sides of the same coin - isNonEscapingLocalObject is wrapping around another function in CaptureTracking Reviewed By: jdoerfert (Johannes Doerfert) Differential Revision: https://reviews.llvm.org/D89465	2020-10-15 18:37:29 +00:00
David Green	d80045e272	[LV] Add a getRecurrenceBinOp and make use of it. NFC	2020-10-15 18:21:41 +01:00
Sanjay Patel	21d0e78943	[CostModel] remove cost-kind predicate for ctlz/cttz intrinsics in basic TTI implementation The cost modeling for intrinsics is a patchwork based on different expectations from the callers, so it's a mess. I'm hoping to untangle this to allow canonicalization to the new min/max intrinsics in IR. The general goal is to remove the cost-kind restriction here in the basic implementation class. Ie, if some intrinsic has throughput cost of 104, assume that it has the same size, latency, and blended costs. Effectively, an intrinsic with cost N is composed of N simple instructions. If that's not correct, the target should provide a more accurate override. The x86-64 SSE2 subtarget cost diffs require explanation: 1. The scalar ctlz/cttz are assuming "BSR+XOR+CMOV" or "TEST+BSF+CMOV/BRANCH", so not cheap. 2. The 128-bit SSE vector width versions assume cost of 18 or 26 (no explanation provided in the tables, but this corresponds to a bunch of shift/logic/compare). 3. The 512-bit vectors in the test file are scaled up by a factor of 4 from the legal vector width costs. 4. The plain latency cost-kind is not affected in this patch because that calc is diverted before we get to getIntrinsicInstrCost(). Differential Revision: https://reviews.llvm.org/D89461	2020-10-15 13:14:41 -04:00
Hiroshi Yamauchi	7e9ad11889	[PGO] Remove the old memop value profiling buckets. Following up D81682 and D83903, remove the code for the old value profiling buckets, which have been replaced with the new, extended buckets and disabled by default. Also syncing InstrProfData.inc between compiler-rt and llvm. Differential Revision: https://reviews.llvm.org/D88838	2020-10-15 10:09:49 -07:00
Sanjay Patel	9be1dfc8dc	[x86] add no 'unwind' to reduce test noise; NFC I suggested this in D89412, but the comment was missed.	2020-10-15 11:51:13 -04:00
Thomas Lively	29bda9c7c9	Revert "[WebAssembly] v128.load{8,16,32,64}_lane instructions" This reverts commit 7c6bfd90ab2ddaa60de62878c8512db0645e8452.	2020-10-15 15:49:36 +00:00
Thomas Lively	1dd8fe9b9b	[WebAssembly] v128.load{8,16,32,64}_lane instructions Prototype the newly proposed load_lane instructions, as specified in https://github.com/WebAssembly/simd/pull/350. Since these instructions are not available to origin trial users on Chrome stable, make them opt-in by only selecting them from intrinsics rather than normal ISel patterns. Since we only need rough prototypes to measure performance right now, this commit does not implement all the load and store patterns that would be necessary to make full use of the offset immediate. However, the full suite of offset tests is included to make it easy to track improvements in the future. Since these are the first instructions to have a memarg immediate as well as an additional immediate, the disassembler needed some additional hacks to be able to parse them correctly. Making that code more principled is left as future work. Differential Revision: https://reviews.llvm.org/D89366	2020-10-15 15:33:10 +00:00
sunshaoce	d5acf960dc	[RISCV] fix a mistake in RISCVInstrInfoV.td A commit of VALUVVNoVm was wrong, fixed it. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D88142	2020-10-15 23:16:53 +08:00
Simon Pilgrim	dbf16ebda1	[InstCombine] Use m_SpecificInt instead of m_APInt + comparison. NFCI.	2020-10-15 16:06:27 +01:00
Simon Pilgrim	2eb22bd836	[InstCombine] SimplifyDemandedUseBits - xor - refactor cast<ConstantInt> usage to PatternMatch. NFCI. First step towards replacing these to add full vector support.	2020-10-15 16:06:23 +01:00
Simon Pilgrim	6365cb5780	[InstCombine] InstCombineAndOrXor - refactor cast<ConstantInt> usages to PatternMatch. NFCI. First step towards replacing these to add full vector support.	2020-10-15 16:06:17 +01:00
JonChesterfield	9e44ac573e	[NFC] Fix license header from D87841	2020-10-15 15:41:11 +01:00
Paul C. Anagnostopoulos	15f7b61423	[TableGen] Add the !not and !xor operators. Update the TableGen Programmer's Reference.	2020-10-15 10:12:59 -04:00
Paul C. Anagnostopoulos	0c4c5ded27	[RISCV] [TableGen] Modify RISCVCompressInstEmitter.cpp to use getAllDerivedDefinitions().	2020-10-15 10:11:12 -04:00
Jeremy Morse	7be6e1ef11	Add "not" to an llvm-symbolizer test that expects to fail In a7b209a6d40d77b, llvm-symbolizer was adjusted to return a failure status code when it produced an error, to flag up DWARF parsing problems. The test for missing PDB file is analogous, and returns a failure status now too. This should fix the llvm-clang-win-x-armv7l buildbot croaking: http://lab.llvm.org:8011/#/builders/60/builds/77	2020-10-15 15:09:33 +01:00
Matt Arsenault	f5208b88d2	AMDGPU: Fix verifier error on killed spill of partially undef register This does unfortunately end up with extra waitcnts getting inserted that were avoided before. Ideally we would avoid the spills of these undef components in the first place.	2020-10-15 09:45:44 -04:00
Simon Pilgrim	895ec0d76a	[InstCombine] visitXor - refactor ((X^C1)>>C2)^C3 -> (X>>C2)^((C1>>C2)^C3) fold. NFCI. This is still ConstantInt-only (scalar) but is refactored to use PatternMatch to make adding vector support in the future relatively trivial.	2020-10-15 14:38:15 +01:00
Carl Ritson	35ab0dcf9e	[AMDGPU] Minimize number of s_mov generated by copyPhysReg Generate the minimal set of s_mov instructions required when expanding a SGPR copy operation in copyPhysReg. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D89187	2020-10-15 22:35:02 +09:00
Denis Antrushin	7b96615eab	[Statepoints] Remove MI limit on number of tied operands. After D87915 statepoint can have more than 15 tied operands. Remove this restriction from statepoint lowering code.	2020-10-15 19:02:38 +07:00
Adrian Kuegel	0413b3adc6	Fix unused variable warning when compiling with asserts disabled. Differential Revision: https://reviews.llvm.org/D89454	2020-10-15 12:50:19 +02:00
Jeremy Morse	023a53e89a	[DebugInstrRef] Support recording of instruction reference substitutions Add a table recording "substitutions" between pairs of <instruction, operand> numbers, from old pairs to new pairs. Post-isel optimizations are able to record the outcome of an optimization in this way. For example, if there were a divide instruction that generated the quotient and remainder, and it were replaced by one that only generated the quotient: $rax, $rcx = DIV-AND-REMAINDER $rdx, $rsi, debug-instr-num 1 DBG_INSTR_REF 1, 0 DBG_INSTR_REF 1, 1 Became: $rax = DIV $rdx, $rsi, debug-instr-num 2 DBG_INSTR_REF 1, 0 DBG_INSTR_REF 1, 1 We could enter a substitution from <1, 0> to <2, 0>, and no substitution for <1, 1> as it's no longer generated. This approach means that if an instruction or value is deleted once we've left SSA form, all variables that used the value implicitly become "optimized out", something that isn't true of the current DBG_VALUE approach. Differential Revision: https://reviews.llvm.org/D85749	2020-10-15 11:30:14 +01:00
Simon Pilgrim	5e11311f3c	[AggressiveInstCombine] foldAnyOrAllBitsSet - add uniform vector support Replace m_ConstantInt with m_APInt to support uniform vectors (with no undef elements) Adding non-undef support would involve some refactoring of the MaskOps struct but this might still be worth it.	2020-10-15 11:02:35 +01:00
Simon Pilgrim	5e945bf77e	[AggressiveInstCombine] foldAnyOrAllBitsSet - add uniform vector tests	2020-10-15 10:48:24 +01:00
Sebastian Neubauer	bbbf801cb8	[AMDGPU] Add objdump invalid metadata testcase Checks that metadata and invalid message are printed. Differential Revision: https://reviews.llvm.org/D89375	2020-10-15 11:17:14 +02:00
Denis Antrushin	8f75ef894b	[Statepoints] Unlimited tied operands. Current limit on amount of tied operands (15) sometimes is too low for statepoint. We may get couple dozens of gc pointer operands on statepoint. Review D87154 changed format of statepoint to list every gc pointer only once, which makes it trivial to find tiedness relation between statepoint operands: defs are mapped 1-1 to gc pointer operands passed on registers. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D87915	2020-10-15 16:16:11 +07:00
Georgii Rymar	feeb476dcc	[yaml2obj] - Allow specifying no tags to create empty sections in few cases. Currently we have a few sections that does not support specifying no keys for them. E.g. it is required that one of "Content", "Size" or "Entries" key is present. There is no reason to have this restriction. We can allow this and emit an empty section instead. This opens road for a simplification and generalization of the code in `validate()` that is discussed in the D89039 thread. Depends on D89039. Differential revision: https://reviews.llvm.org/D89391	2020-10-15 11:22:02 +03:00
Georgii Rymar	1e8da37adf	[yaml2obj/obj2yaml] - Add support of 'Size' and 'Content' keys for all sections. Many sections either do not have a support of `Size`/`Content` or support just a one of them, e.g only `Content`. `Section` is the base class for sections. This patch adds `Content` and `Size` members to it and removes similar members from derived classes. This allows to cleanup and generalize the code and adds a support of these keys for all sections (`SHT_MIPS_ABIFLAGS` is a only exception, it requires unrelated specific changes to be done). I had to update/add many tests to test the new functionality properly. Differential revision: https://reviews.llvm.org/D89039	2020-10-15 11:11:41 +03:00
Craig Topper	9b01853207	[TargetLowering] Replace Log2_32_Ceil with Log2_32 in SimplifySetCC ctpop combine. This combine can look through (trunc (ctpop X)). When doing this it tries to make sure the trunc doesn't lose any information from the ctpop. It does this by checking that the truncated type has more bits that Log2_32_Ceil of the ctpop type. The Ceil is unnecessary and pessimizes non-power of 2 types. For example, ctpop of i256 requires 9 bits to represent the max value of 256. But ctpop of i255 only requires 8 bits to represent the max result of 255. Log2_32_Ceil of 256 and 255 both return 8 while Log2_32 returns 8 for 256 and 7 for 255 The code with popcnt enabled is a regression for this test case, but it does match what already happens with i256 truncated to i9. Since power of 2 is more likely, I don't think it should block this change. Differential Revision: https://reviews.llvm.org/D89412	2020-10-15 01:05:07 -07:00
David Sherwood	272d0be3ed	[SVE][NFC] Replace some TypeSize comparisons in non-AArch64 Targets In most of lib/Target we know that we are not dealing with scalable types so it's perfectly fine to replace TypeSize comparison operators with their fixed width equivalents, making use of getFixedSize() and so on. Differential Revision: https://reviews.llvm.org/D89101	2020-10-15 09:01:21 +01:00
Luqman Aden	febfadd9f6	[LLD] Set alignment as part of Characteristics in TLS table. Fixes https://bugs.llvm.org/show_bug.cgi?id=46473 LLD wasn't previously specifying any specific alignment in the TLS table's Characteristics field so the loader would just assume the default value (16 bytes). This works most of the time except if you have thread locals that want specific higher alignments (e.g. 32 as in the bug) even if they specify an alignment on the thread local. This change updates LLD to take the max alignment from tls section. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D88637	2020-10-15 00:22:40 -07:00
Luqman Aden	54ac46e473	Revert "[LLD] Set alignment as part of Characteristics in TLS table." Revert individual wip commits and will instead follow up with a single commit with all the changes. Makes cherry-picking easier and will contain all the right tags. This reverts commit 32a4ad3b6ce6028a371b028cf06fa5feff9534bf. This reverts commit 7fe13af676678815989a6d0ece684687953245e7. This reverts commit 51fbc1bef657bb0f5808986555ec3517a84768c4. This reverts commit f80950a8bb985c082b26534b0e157447bf803935. This reverts commit 0778cad9f325df4d7b32b22f3dba201a16a0b8fe. This reverts commit 8b70d527d7ec1c8b9e921177119a0d906ffad4f0.	2020-10-15 00:21:36 -07:00
David Blaikie	b9463c453a	Fix llvm-symbolizer assembly-based test to require x86 and specify x86 when assembling	2020-10-15 00:01:43 -07:00
David Blaikie	64302c9d6e	llvm-symbolizer: Exit non-zero when DWARF parsing errors have been rendered	2020-10-14 23:42:00 -07:00
Carl Ritson	bb0a1482e8	[AMDGPU] Pre-commit test for D89187	2020-10-15 15:29:07 +09:00
David Blaikie	3e93456b5c	llvm-symbolizer: Ensure non-zero exit when an error is printed (this doesn't cover all cases - libDebugInfoDWARF has a default error handler that prints errors without any exit code handling - I'll be following up with a patch for that after this)	2020-10-14 22:48:39 -07:00
David Blaikie	19cabd9a15	llvm-dwarfdump: Exit non-zero on an error path	2020-10-14 22:10:43 -07:00
Vinay Madhusudan	f0c50f45e9	[AArch64] Combine UADDVs to generate vector add ADD(UADDV a, UADDV b) --> UADDV(ADD a, b) This partially solves the bug: https://bugs.llvm.org/show_bug.cgi?id=46888 Meta ticket: https://bugs.llvm.org/show_bug.cgi?id=46929 Differential Revision: https://reviews.llvm.org/D88731	2020-10-15 09:56:31 +05:30
Jonas Devlieghere	60743997d4	[dsymutil] Fix handling of aliases to private external symbols dsymutil was incorrectly ignoring aliases to private extern symbols in the MachODebugMapParser. This resulted in spurious warnings about not being able to find symbols. rdar://49652389 Differential revision: https://reviews.llvm.org/D89444	2020-10-14 20:18:56 -07:00
Luqman Aden	dcba0fbd53	Mask out existing alignment bits.	2020-10-14 19:34:32 -07:00
Luqman Aden	b2a8035b8d	Fix style warnings.	2020-10-14 19:34:31 -07:00
Luqman Aden	0141307554	[LLD] Set alignment as part of Characteristics in TLS table. Differential Revision: https://reviews.llvm.org/D88637	2020-10-14 19:34:31 -07:00
Tony	3008bac6d4	[AMDGPU] Correct typos in SIMemoryLegalizer.cpp comments	2020-10-15 02:07:56 +00:00
Kazushi (Jam) Marukawa	5b2066afb8	[VE] Add vector load/store instructions Add vector registers and vector load/store instructions. Add regression tests for vector load/store instructions too. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89183	2020-10-15 09:26:55 +09:00
Kazushi (Jam) Marukawa	26a0bcf89a	[VE] Change to expand SHL_PARTS/SRA_PARTS/SRL_PARTS VE doesn't have SHL_PARTS/SRA_PARTS/SRL_PARTS instructions, so need to expand them. Add regression tests too. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D89396	2020-10-15 09:04:34 +09:00
Amara Emerson	96ddc65777	[AArch64][GlobalISel] Don't use explicit zero registers for compare results. These cause problems for later optimizations, just using an unused vreg like SelectionDAG generates better code in the end, and obviates the need for some GISel specific flag optimizations. Differential Revision: https://reviews.llvm.org/D89419	2020-10-14 16:49:33 -07:00
Reid Kleckner	4b5ed96964	[ADT] Use alignas + sizeof for inline storage, NFC AlignedCharArrayUnion is really only needed to handle the "union" case when we need memory of suitable size and alignment for multiple types. SmallVector only needs storage for one type, so use that directly.	2020-10-14 16:16:02 -07:00
Snehasish Kumar	29a7970dc1	[llvm] Update default cutoff threshold for machine function splitter. Based on internal testing at Google we found that setting the profile summary cutoff threshold to 999950 yields the best results in terms of itlb and icache metrics (as observed on Intel CPUs). default = Split out code if no profile count available for block size-% = The fraction of bytes split out of .text and .text.hot itlb = Misses per kilo instructions (MPKI) for itlb icache = Misses per kilo instructions (MPKI) for L1 icache Search1 \| cutoff \| size-% \| itlb \| icache \| \|---------\|---------\|-----------\|---------\| \| default \| 42.5861 \| 0.0822151 \| 2.46363 \| \| 999999 \| 44.9350 \| 0.0767194 \| 2.44416 \| \| 999950 \| 50.0660 \| 0.075744 \| 2.4091 \| \| 999500 \| 56.9158 \| 0.082564 \| 2.4188 \| \| 995000 \| 63.8625 \| 0.0814927 \| 2.42832 \| \| 990000 \| 71.7314 \| 0.106906 \| 2.57785 \| Search2 \| cutoff \| size-% \| itlb \| icache \| \|---------\|--------\|----------\|---------\| \| default \| 2.8845 \| 0.626712 \| 4.73245 \| \| 999999 \| 3.3291 \| 0.602309 \| 4.70045 \| \| 999950 \| 3.8577 \| 0.587842 \| 4.71632 \| \| 999500 \| 4.4170 \| 0.63577 \| 4.68351 \| \| 995000 \| 5.1020 \| 0.657969 \| 4.82272 \| \| 990000 \| 5.7153 \| 0.719122 \| 5.39496 \| Differential Revision: https://reviews.llvm.org/D89085	2020-10-14 12:48:10 -07:00

1 2 3 4 5 ...

205145 Commits