llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Ana Pazos	8024ba9bb0	[MC][PGO][PGSO] Cleanup unused MBFI in AsmPrinter Summary: Machine Block Frequency Info (MBFI) is being computed but unused in AsmPrinter. MBFI computation was introduced with PGO change D71149 and then its use was removed in D71106. No need to keep computing it. Reviewers: MaskRay, jyknight, skan, yamauchi, davidxl, efriedma, huihuiz Reviewed By: MaskRay, skan, yamauchi Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78526	2020-04-21 10:01:56 -07:00
Benjamin Kramer	f8bca900d0	Fix an unused-variable warning in Release mode.	2020-04-21 18:59:27 +02:00
Fangrui Song	e8dd5bcb82	[XRay] Change xray_instr_map sled addresses from absolute to PC relative for x86-64 xray_instr_map contains absolute addresses of sleds, which are relocated by `R_*_RELATIVE` when linked in -pie or -shared mode. By making these addresses relative to PC, we can avoid the dynamic relocations and remove the SHF_WRITE flag from xray_instr_map. We can thus save VM pages containg xray_instr_map (because they are not modified). This patch changes x86-64 and bumps the sled version to 2. Subsequent changes will change powerpc64le and AArch64. Reviewed By: dberris, ianlevesque Differential Revision: https://reviews.llvm.org/D78082	2020-04-21 09:36:09 -07:00
Sanjay Patel	1d148ac389	[InstCombine] add tests for logic-of-icmps; NFC These are mostly replicated from D78430 (instsimplify). If we implement more general transforms for instcombine, then we probably don't need to add that complexity to instsimplify.	2020-04-21 12:26:45 -04:00
Johannes Doerfert	901eef3953	[Attributor] Use a pointer value type for the OpcodeInstMap This reduces memory consumption and the need to copy complex data structures repeatedly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 490390 (320725/s) temporary memory allocations: 84601 (55330/s) peak heap memory consumption: 41.70MB peak RSS (including heaptrack overhead): 131.18MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 489359 (301144/s) temporary memory allocations: 82983 (51066/s) peak heap memory consumption: 36.76MB peak RSS (including heaptrack overhead): 126.48MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: -1031 (-10739/s) temporary memory allocations: -1618 (-16854/s) peak heap memory consumption: -4.94MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-21 11:20:09 -05:00
Johannes Doerfert	7e065e876c	[Attributor] Use a pointer value type for the QueryMap This reduces memory consumption and the need to copy complex data structures repeatedly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 596180 (374484/s) temporary memory allocations: 84979 (53378/s) peak heap memory consumption: 52.14MB peak RSS (including heaptrack overhead): 139.79MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 489200 (303285/s) temporary memory allocations: 83406 (51708/s) peak heap memory consumption: 41.70MB peak RSS (including heaptrack overhead): 131.76MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: -106980 (-5094285/s) temporary memory allocations: -1573 (-74904/s) peak heap memory consumption: -10.44MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-21 11:20:04 -05:00
Johannes Doerfert	e1f644edc6	[Attributor] Use a pointer value type for the access kind -> accesses map This reduces memory consumption and the need to copy complex data structures repeatedly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 616219 (381559/s) temporary memory allocations: 83294 (51575/s) peak heap memory consumption: 72.15MB peak RSS (including heaptrack overhead): 160.04MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 595004 (357145/s) temporary memory allocations: 83840 (50324/s) peak heap memory consumption: 52.14MB peak RSS (including heaptrack overhead): 138.32MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: -21215 (-415980/s) temporary memory allocations: 546 (10705/s) peak heap memory consumption: -20.01MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-21 11:20:02 -05:00
Johannes Doerfert	100494f780	[Attributor] Pass the Attributor to the AbstractAttribute constructors AbstractAttribute::initialize is used to initialize the deduction and the object we do not always call it. To make sure we have the option to initialize the object even if initialize is not called we pass the Attributor to AbstractAttribute constructors now.	2020-04-21 11:20:02 -05:00
Johannes Doerfert	5f65e29ab6	[Attributor] Use a pointer value type for the AAMap This reduces memory consumption and the need to copy complex data structures repeatedly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 613353 (376521/s) temporary memory allocations: 83636 (51341/s) peak heap memory consumption: 75.64MB peak RSS (including heaptrack overhead): 162.97MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 616575 (349929/s) temporary memory allocations: 83650 (47474/s) peak heap memory consumption: 72.15MB peak RSS (including heaptrack overhead): 159.81MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: 3222 (24225/s) temporary memory allocations: 14 (105/s) peak heap memory consumption: -3.49MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-21 11:19:58 -05:00
Stefan Pintilie	b69f1904be	[PowerPC][Future] Add offsets to PC Relative relocations. This is an optimization that applies to global addresses and allows for the following transformation: Convert this: paddi r3, 0, symbol@PCREL, 1 ld r4, 8(r3) To this: pld r4, symbol@PCREL+8(0), 1 An instruction is saved and the linker can do the addition when the symbol is resolved. Differential Revision: https://reviews.llvm.org/D76160	2020-04-21 11:08:19 -05:00
Kang Zhang	70d4ad5aec	[PowerPC] Add a new test case expand-isel-liveness.mir	2020-04-21 16:00:34 +00:00
Nick Desaulniers	35d2b83690	[InlineSpiller] simplify insertReload() NFC Summary: The repeated use of std::next() on a MachineBasicBlock::iterator was clever, but we only need to reconstruct the iterator post creation of the spill instruction. This helps simplifying where we plan to place the spill, as discussed in D77849. From here, we can simplify the code a little by flipping the return code of a helper. Reviewers: efriedma Reviewed By: efriedma Subscribers: qcolombet, hiraditya, llvm-commits, srhines Tags: #llvm Differential Revision: https://reviews.llvm.org/D78520	2020-04-21 08:31:20 -07:00
Jay Foad	bd08402d21	[AMDGPU] Remove selectSGPRVectorRegClassID. NFC. This was yet another function that had to be updated whenever you added a new register class. Remove it by refactoring its only caller to use standard helper functions from SIRegisterInfo. Differential Revision: https://reviews.llvm.org/D78557	2020-04-21 16:29:21 +01:00
Sean Fertile	ac14acbc42	[PowerPC][AIX][NFC] Fix use of FileCheck variable in lit test.	2020-04-21 10:56:46 -04:00
Pavel Labath	4e52ec5b13	[DWARFDebugLine] Check for errors when parsing v2 file/dir lists Summary: Without this we could silently accept an invalid prologue because the default DataExtractor behavior is to return an empty string when reaching the end of file. And empty string is also used to terminate these lists. This makes the parsing code slightly more complicated, but this complexity will go away once the parser starts working with truncating data extractors. The reason I am doing it this way is because without this, the truncation would regress the quality of error messages (right now, we produce bad error messages only near EOF, but truncation would make everything behave as if it was near EOF). Reviewers: dblaikie, probinson, jhenderson Subscribers: hiraditya, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77555	2020-04-21 16:55:36 +02:00
Pavel Labath	3b6fda8c6b	[DWARFDataExtractor] Add a "truncating" constructor Summary: This constructor allows us to create a new DWARFDataExtractor which will only present a subrange of an entire debug section. Since debug sections typically consist of multiple contributions, it is expected that one will create a new data extractor for each contribution in order to avoid unexpectedly running off into the next one. This is very useful for unifying the flows for detecting parse errors. Without it, the code needs to consider two very different scenarios: 1. If there is another contribution after the current one, the DataExtractor functions will just start reading from there. This is detectable by comparing the current offset against the known end-of-contribution offset. 2. If this is the last contribution, the data extractor will just start returning zeroes (or other default values). This situation can not be detected by checking the parsing offset, as this will not be advanced in case of errors. Using a truncated data extractor simplifies the code (and reduces cognitive load) by making these two cases behave identically -- a running off the end of a contribution will _always_ produce an EOF error (if one uses error-aware parsing methods) or return default values. Reviewers: dblaikie, probinson, jhenderson, ikudrin Subscribers: aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77556	2020-04-21 16:48:09 +02:00
Pavel Iliin	294e4ed894	[AArch64][NFC] One more intrinsic test.	2020-04-21 15:20:07 +01:00
Simon Pilgrim	a69c2ce309	X86ISelLowering.cpp - clang-format to fix col80 limit. NFC.	2020-04-21 15:18:23 +01:00
Sanjay Patel	0aa2c9a1ec	[InstCombine] improve types/names for logic-of-icmp helper function; NFC	2020-04-21 10:16:45 -04:00
Simon Pilgrim	4bf9b69a3f	SHA1.h - remove unnecessary ArrayRef.h/StringRef.h includes. NFC. By moving the update(StringRef) wrapper into SHA1.cpp we can depend just on system headers.	2020-04-21 15:12:17 +01:00
Florian Hahn	e99abe0a68	[VPlan] Make various tryTo* helpers private and mark as const (NFC). The individual tryTo* helpers do not need to be public. Also, the builder contained two consecutive public: sections, which is not necessary. Moved the remaining public methods after the constructor. Also make some of the tryTo* helpers const. Reviewers: gilr, rengolin, Ayal, hsaito Reviewed by: gilr Differential Revision: https://reviews.llvm.org/D78288	2020-04-21 14:49:02 +01:00
Sanjay Patel	3790b83344	[InstCombine] improve types/names for logic-of-icmp helper functions; NFC	2020-04-21 09:18:22 -04:00
Georgii Rymar	501805a2da	[yaml2obj] - Verify that sections are sorted by their file offsets when creating segments. This validates that sections listed for a segment in the YAML declaration are ordered by their file offsets. It might help to simplify the file size computation, but also is useful by itself as helps to avoid issues in test cases and to maintain their readability. Differential revision: https://reviews.llvm.org/D78361	2020-04-21 15:50:42 +03:00
Fraser Cormack	c91e30954f	Let targets adjust physical output- and anti-deps Differential Revision: https://reviews.llvm.org/D78380	2020-04-21 13:45:03 +01:00
Kerry McLaughlin	5992856e99	[AArch64][SVE] Add addressing mode for contiguous loads & stores Summary: This patch adds the register + register addressing mode for SVE contiguous load and store intrinsics (LD1 & ST1) Reviewers: sdesmalen, fpetrogalli, efriedma, rengolin Reviewed By: fpetrogalli Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78509	2020-04-21 12:04:43 +01:00
Simon Pilgrim	f7ad752f8a	TGLexer.h - add missing <vector> include. Looks like we are implicitly depending on <vector> but not all machines will include it.	2020-04-21 11:57:10 +01:00
Sam Parker	8eac61ddd3	Attempting fix buildbot getUserCost is faulting for some builders.	2020-04-21 11:42:21 +01:00
Simon Pilgrim	fdae127fbd	TGParser.h - cleanup includes and forward declarations. NFC. Replace Twine.h/SourceMgr.h includes with forward declarations and include in TGParser.cpp Remove forward declarations we already have to include in Record.h	2020-04-21 11:32:58 +01:00
Simon Pilgrim	667c8b4b59	TGLexer.h - cleanup includes and forward declarations. NFC. Replace ArrayRef.h with a forward declaration and include in ArrayRef.cpp Remove SMLoc forward declaration as we already have to include SMLoc.h	2020-04-21 11:32:58 +01:00
Simon Pilgrim	03c8087cf9	AliasAnalysisSummary.h - cleanup includes and forward declarations. NFC. Push InstrTypes.h include down to AliasAnalysisSummary.cpp	2020-04-21 11:32:58 +01:00
Kazushi (Jam) Marukawa	93fe251d64	[VE] Create a TargetInfo header. NFC Summary: Move the declarations of getThe<Name>Target() functions into a new header in TargetInfo and make users of these functions include this new header in order to follow other architectures. Differential Revision: https://reviews.llvm.org/D78543	2020-04-21 11:42:17 +02:00
Sander de Smalen	5b076d66c9	[SVEIntrinsicOpts] NFC: Remove unused isReinterpretFromBool for no-assert builds isReinterpretFromBool's only use is in an assert, which causes a warning that the function is defined but not used in no-assert builds.	2020-04-21 09:49:22 +01:00
LLVM GN Syncbot	26f7fe74a1	[gn build] Port 2214b9076f1	2020-04-21 08:34:22 +00:00
Sam Parker	3a66ca47fd	[TTI] Remove getOperationCost This API call has been used recently with, a very valid, expectation that it would do something useful but it doesn't actually query any backend information. So, remove this method and merge its functionality into getUserCost. As well as that, also use getCastInstrCost to get a proper cost from the backend for the concerned instructions though we only currently return the answer if it's considered free. The default implementation now also checks int/ptr conversions too, as well as truncs and bitcasts. Differential Revision: https://reviews.llvm.org/D76124	2020-04-21 09:15:34 +01:00
Sam Parker	443c6d595b	[ARM][ParallelDSP] Handle squaring multiplies The logic in ARMParallelDSP is setup to merge two 16-bits loads into a 32-bit load and feed them into the smlads. This requires that four loads are combined for the four inputs, but there wasn't actually a check for this. Differential Revision: https://reviews.llvm.org/D78492	2020-04-21 08:39:56 +01:00
Craig Topper	9d4cb2dec8	[DenseMap] Don't capture the BucketEnd pointer before an operation that might change the number of buckets. This code was added in 887efa51c1e0e43ca684ed78b92dbc3a0720881b to fix reverse iteration. The call to InsertIntoBucket/InsertIntoBucketWithLookup can change the number of buckets which will invalidate the BucketEnd. So don't cache it and calculate it when creating the iterator.	2020-04-21 00:36:34 -07:00
Craig Topper	098fedfbf7	[CallSite removal][Instrumentation] Use CallBase instead of CallSite in AddressSanitizer/DataFlowSanitizer/MemorySanitizer. NFC Differential Revision: https://reviews.llvm.org/D78524	2020-04-20 22:39:14 -07:00
Max Kazantsev	4c8fb07eca	[LICM][NFC] Reorder checks to speed up things slightly Side effect check is made faster than potentially heavy other checks.	2020-04-21 11:34:44 +07:00
Craig Topper	747225250b	[Local] Update getOrEnforceKnownAlignment/getKnownAlignment to use Align/MaybeAlign. Differential Revision: https://reviews.llvm.org/D78443	2020-04-20 21:31:44 -07:00
Johannes Doerfert	5fe96b2ac1	[OpenMPOpt] Make the combination of `ident_t` deterministic Before we kept the first applicable `ident_t` during deduplication of runtime calls. The problem is that "first" is dependent on the iteration order of a DenseMap. Since the proper solution, which is to combine the information from all `ident_t`, should be deterministic on its own, we will not try to make the iteration order deterministic. Instead, we will create a fresh `ident_t` if there is not a unique existing `ident_t*` to pick.	2020-04-20 23:27:08 -05:00
Johannes Doerfert	04401a1993	[OpenMPOpt] Use a pointer value type in map The value type was a set before which can easily lead to excessive memory usage and copying. We use a pointer to a vector instead now.	2020-04-20 23:27:08 -05:00
Johannes Doerfert	2b1b7faf12	[OpenMPOpt] Make the SCC a vector to ensure deterministic results	2020-04-20 23:27:08 -05:00
Fangrui Song	1148aba0c6	Reland D76675 [llvm-objcopy] Match GNU behaviour regarding file symbols Don't error on Config.KeepFileSymbols for COFF and Mach-O. Original description: GNU objcopy removes STT_FILE symbols for strip-debug operations, and keeps them for --discard-all operation. Match their behaviour for llvm-objcopy. Bug: https://github.com/android/ndk/issues/1212 Differential Revision: https://reviews.llvm.org/D76675	2020-04-20 21:18:48 -07:00
Yi Kong	920851081d	Revert "[llvm-objcopy] Match GNU behaviour regarding file symbols" This reverts commit 7c65e88d0bc85ff2732a4e23c397ff842b97b828. Broke non ELF targets.	2020-04-21 12:04:01 +08:00
Yi Kong	1e469f4962	[llvm-objcopy] Match GNU behaviour regarding file symbols GNU objcopy removes STT_FILE symbols for strip-debug operations, and keeps them for --discard-all operation. Match their behaviour for llvm-objcopy. Bug: https://github.com/android/ndk/issues/1212 Differential Revision: https://reviews.llvm.org/D76675	2020-04-21 11:30:04 +08:00
Shengchen Kan	2b533e246f	[MC][NFC] Use camelCase style for functions in MCObjectStreamer	2020-04-20 20:09:20 -07:00
Shengchen Kan	88597bc560	[MC][Bugfix] Remove redundant parameter for relaxInstruction Summary: Before this patch, `relaxInstruction` takes three arguments, the first argument refers to the instruction before relaxation and the third argument is the output instruction after relaxation. There are two quite strange things: 1) The first argument's type is `const MCInst &`, the third argument's type is `MCInst &`, but they may be aliased to the same variable 2) The backends of ARM, AMDGPU, RISC-V, Hexagon assume that the third argument is a fresh uninitialized `MCInst` even if `relaxInstruction` may be called like `relaxInstruction(Relaxed, STI, Relaxed)` in a loop. In this patch, we drop the thrid argument, and let `relaxInstruction` directly modify the given instruction. Also, this patch fixes the bug https://bugs.llvm.org/show_bug.cgi?id=45580, which is introduced by D77851, and breaks the assumption of ARM, AMDGPU, RISC-V, Hexagon. Reviewers: Razer6, MaskRay, jyknight, asb, luismarques, enderby, rtaylor, colinl, bcain Reviewed By: Razer6, MaskRay, bcain Subscribers: bcain, nickdesaulniers, nathanchance, wuzish, annita.zhang, arsenm, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, tpr, sbc100, jgravelle-google, kristof.beyls, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78364	2020-04-21 11:06:55 +08:00
Yonghong Song	07d39317f2	BPF: fix a CORE optimization bug For the test case in this patch like below struct t { int a; } __attribute__((preserve_access_index)); int foo(void ); int test(struct t arg) { long param[1]; param[0] = (long)&arg->a; return foo(param); } The IR right before BPF SimplifyPatchable phase: %1:gpr = LD_imm64 @"llvm.t:0:0$0:0" %2:gpr = LDD killed %1:gpr, 0 %3:gpr = ADD_rr %0:gpr(tied-def 0), killed %2:gpr STD killed %3:gpr, %stack.0.param, 0 After SimplifyPatchable phase, the incorrect IR is generated: %1:gpr = LD_imm64 @"llvm.t:0:0$0:0" %3:gpr = ADD_rr %0:gpr(tied-def 0), killed %1:gpr CORE_MEM killed %3:gpr, 306, %0:gpr, @"llvm.t:0:0$0:0" Note that CORE_MEM pseudo op is introduced to encode memory operations related to CORE. In the above, we intend to check whether we have a store like (%3:gpr + 0) = ... and if this is the case, we could replace it with (%0:gpr + @"llvm.t:0:0$0:0"_ = ... Unfortunately, in the above, IR for the store is *(%stack.0.param + 0) = %3:gpr and transformation should not happen. Note that we won't have problem if the actual CORE dereference (arg->a) happens. This patch fixed the problem by skip CORE optimization if the use of ADD_rr result is not the base address of the store operation. Differential Revision: https://reviews.llvm.org/D78466	2020-04-20 19:54:51 -07:00
Mircea Trofin	252ef476b0	[llvm][NFC][CallSite] Remove CallSite from ArgumentPromotion Reviewers: dblaikie, craig.topper Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78528	2020-04-20 19:33:42 -07:00
Johannes Doerfert	62c312e2e4	[Attributor] Use the BumpPtrAllocator in InformationCache as well We now also use the BumpPtrAllocator from the Attributor in the InformationCache. The lifetime of objects in either is pretty much the same and it should result in consistently good performance regardless of the allocator. Doing so requires to call more constructors manually but so far that does not seem to be problematic or messy. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 615359 (368257/s) temporary memory allocations: 83315 (49859/s) peak heap memory consumption: 75.64MB peak RSS (including heaptrack overhead): 163.43MB total memory leaked: 269.04KB ``` After: ``` calls to allocation functions: 613042 (359555/s) temporary memory allocations: 83322 (48869/s) peak heap memory consumption: 75.64MB peak RSS (including heaptrack overhead): 162.92MB total memory leaked: 269.04KB ``` Difference: ``` calls to allocation functions: -2317 (-68147/s) temporary memory allocations: 7 (205/s) peak heap memory consumption: 2.23KB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ---	2020-04-20 21:12:41 -05:00

... 2 3 4 5 6 ...

195599 Commits