llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Kazu Hirata	607e84883f	[llvm] Use llvm::is_sorted (NFC)	2021-01-27 23:25:39 -08:00
Kazu Hirata	3902d5e888	[DebugInfo] Forward-declare PDBFile (NFC) NativeEnumInjectedSources.h needs PDBFile but relies on a forward declaration of PDBFile in InjectedSourceStream.h. This patch adds a forward declaration right in NativeEnumInjectedSources.h. While we are at it, this patch removes the one in InjectedSourceStream.h, where it is unnecessary.	2021-01-27 23:25:38 -08:00
Ben Shi	d588dec200	[AVR] Optimize 16-bit int shift Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D90092	2021-01-28 15:10:11 +08:00
Hongtao Yu	9b80fe63e4	[CSSPGO] Support of CS profiles in extended binary format. This change brings up support of context-sensitive profiles in the format of extended binary. Existing sample profile reader/writer/merger code is being tweaked to reflect the fact of bracketed input contexts, like (`[...]`). The paired brackets are also needed in extbinary profiles because we don't yet have an otherwise good way to tell calling contexts apart from regular function names since the context delimiter `@` can somehow serve as a part of the C++ mangled names. Reviewed By: wmi, wenlei Differential Revision: https://reviews.llvm.org/D95547	2021-01-27 21:29:46 -08:00
Craig Topper	ae2c374959	[RISCV] Copy isUnneededShiftMask from X86. In d2927f786e877410d90c1e6f0e0c7d99524529c5, I added patterns to remove (and X, 31) from sllw/srlw/sraw shift amounts. There is code in SelectionDAGISel.cpp that knows to use computeKnownBits to fill in bits of the mask that were removed by SimplifyDemandedBits based on bits being known zero. The non-W shift patterns use immbottomxlenset which allows the mask to have more than log2(xlen) trailing ones, but doesn't have a call to computeKnownBits to fill in bits of the mask that may have been cleared by SimplifyDemandedBits. This patch copies code from X86 to handle more than log2(xlen) bottom bits set and uses computeKnownBits to fill in missing bits before counting. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D95422	2021-01-27 20:46:10 -08:00
Fangrui Song	78c4743b21	IntrinsicEmitter: Change IntrinsicsToAttributesMap from uint8_t[] to uint16_t[] We need at least 252 UniqAttributes now, which will soon overflow. Actually with downstream backends we can easily use up the last few values. So bump to uint16_t.	2021-01-27 20:34:35 -08:00
Serge Pavlov	58bed2df84	[Support] Fix build for Haiku This change fixes two issues with building LLVM on Haiku. The first issue is that LLVM requires wait4(), which on Haiku is hidden behind the _BSD_SOURCE feature flag when using the --std=c++14 flag. Additionally, the wait4() function is only available in libbsd.so, so this is now a dependency. The other fix is that Haiku does not have the (non-standard) rusage.maxrss member, so by default the used memory info will be set to 0 on this platform. Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D87920 Patch by Niels Sascha Reedijk.	2021-01-28 10:50:04 +07:00
Carl Ritson	90c5c37149	[AMDGPU][NFC] Pre-commit test for D95509	2021-01-28 12:37:58 +09:00
Carl Ritson	832b5dd984	[AMDGPU][NFC] Generate llvm.amdgcn.set.inactive tests This is a pre-commit for D95509.	2021-01-28 11:43:36 +09:00
David Blaikie	94830306c1	DebugInfo: Fix bug in addr+offset exprloc to use DWARFv5 addrx op instead of DWARFv4 GNU extension	2021-01-27 18:39:44 -08:00
Stanislav Mekhanoshin	348c15fbb0	[AMDGPU] Do not reassign spilled registers We cannot call LRM::unassign() if LRM::assign() was never called before, these are symmetrical calls. There are two ways of assigning a physical register to virtual, via LRM::assign() and via VRM::assignVirt2Phys(). LRM::assign() will call the VRM to assign the register and then update LiveIntervalUnion. Inline spiller calls VRM directly and thus LiveIntervalUnion never gets updated. A call to LRM::unassign() then asserts about inconsistent liveness. We have to note that not all callers of the InlineSpiller even have LRM to pass, RegAllocPBQP does not have it, so we cannot always pass LRM into the spiller. The only way to get into that spiller LRE_DidCloneVirtReg() call is from LiveRangeEdit::eliminateDeadDefs if we split an LI. This patch refuses to reassign a LiveInterval created by a split to workaround the problem. In fact we cannot reassign a spill anyway as all registers of the needed class are occupied and we are spilling. Fixes: SWDEV-267996 Differential Revision: https://reviews.llvm.org/D95489	2021-01-27 16:29:05 -08:00
Fangrui Song	612a586751	[llvm-c] Move LLVMX86_AMXTypeKind & LLVMPoisonValueValueKind to the bottom to avoid value changes compared with LLVM<=11 Fixes PR48905	2021-01-27 16:28:04 -08:00
Teresa Johnson	75870000ad	[LTO] Prevent devirtualization for symbols dynamically exported Identify dynamically exported symbols (--export-dynamic[-symbol=], --dynamic-list=, or definitions needed to preempt shared objects) and prevent their LTO visibility from being upgraded. This helps avoid use of whole program devirtualization when there may be overrides in dynamic libraries. Differential Revision: https://reviews.llvm.org/D91583	2021-01-27 15:54:13 -08:00
Roman Lebedev	dfdf4c2e00	[CodeGen] IndirectBrExpandPass: preserve Dominator Tree, if available This fully de-pessimizes the common case of no indirectbr's (where we don't actually need to do anything to preserve domtree) and avoids domtree recomputation in the case there were indirectbr's. Note that two indirectbr's could have a common successor, and not all successors of an indirectbr are meant to survive the expansion. Though, the code assumes that an indirectbr doesn't have duplicate successors; those should have been deduplicated by simplifycfg or something already.	2021-01-28 01:58:53 +03:00
Jessica Paquette	5c03b478d2	[AArch64][GlobalISel] Allow vector store legalization into 128-bit-wide types We are allowed to store 128-bit-wide values using the q registers on AArch64. GlobalISel was clamping the number of elements in vector stores into 64 bits instead. This results in some poor codegen like below: https://godbolt.org/z/E56dq8 ``` ; SDAG uses a stp + q registers in both cases here. define void @float(<16 x float> %val, <16 x float>* %ptr) { store <16 x float> %val, <16 x float>* %ptr ret void } define void @double(<8 x double> %val, <8 x double>* %ptr) { store <8 x double> %val, <8 x double>* %ptr ret void } ``` This adds similar legalization for vector stores with s8 and s16 elements. Differential Revision: https://reviews.llvm.org/D95107	2021-01-27 14:46:44 -08:00
David Blaikie	19cd3f5c02	DebugInfo: Deduplicate addresses in debug_addr Experimental, using non-existent DWARF support to use an expr for the location involving an addr_index (to compute address + offset so addresses can be reused in more places). The global variable debug info had to be deferred until the end of the module (so bss variables would all be emitted first - so their labels would have the relevant section). Non-bss variables seemed to not have their label assigned to a section even at the end of the module, so I didn't know what to do there. Also, the hashing code is broken - doesn't know how to hash these expressions (& isn't hashing anything inside subprograms, which seems problematic), so for test purposes this change just skips the hash computation. (GCC's actually overly sensitive in its hash function, it seems - I'm forgetting the specific case right now - anyway, we might want to just use the frontend-known file hash and give up on optimistic .dwo/.dwp reuse)	2021-01-27 14:00:43 -08:00
James Y Knight	37f683eeb0	Itanium Mangling: Mangle `__alignof__` differently than `alignof`. The two operations have acted differently since Clang 8, but were unfortunately mangled the same. The new mangling uses new "vendor extended expression" syntax proposed in https://github.com/itanium-cxx-abi/cxx-abi/issues/112 GCC had the same mangling problem, https://gcc.gnu.org/PR88115, and will hopefully be switching to the same mangling as implemented here. Additionally, fix the mangling of `__uuidof` to use the new extension syntax, instead of its previous nonstandard special-case. Adjusts the demangler accordingly. Differential Revision: https://reviews.llvm.org/D93922	2021-01-27 16:46:51 -05:00
Craig Topper	5638b4271f	[RISCV] Move RISCVVPseudosTable from RISCVBaseInfo.h to RISCVInstrInfo.h. NFC RISCVBaseInfo.h belongs to the MC layer, but the Pseudo instructions are only used by the CodeGen layer. So it makes sense to keep this table in the CodeGen layer.	2021-01-27 13:38:26 -08:00
Varun Gandhi	52a271678f	[Demangle] Support demangling Swift calling convention in MS demangler. Previously, Clang was able to mangle the Swift calling convention but 'MicrosoftDemangle.cpp' was not able to demangle it. Reviewed By: compnerd, rnk Differential Revision: https://reviews.llvm.org/D95053	2021-01-27 13:24:54 -08:00
David Green	4d950d6390	[ARM] Add BE check variants for soft-fp test. NFC	2021-01-27 21:19:46 +00:00
Craig Topper	cea57db5fd	[RISCV] Reduce field sizes in searchable tables to reduce binary size.	2021-01-27 12:24:01 -08:00
Craig Topper	a9e4190a0c	[RISCV] Rework fault first only load isel. -Remove the ISD opcode for READ_VL. Just emit the MachineSDNode directly. -Move segmented fault first only load intrinsic handling completely to RISCVISelDAGToDAG.cpp and emit the ReadVL MachineSDNode there instead of lowering to ISD opcodes first.	2021-01-27 11:51:41 -08:00
Sanjay Patel	2ae45edb62	[LoopVectorize] use IR fast-math-flags exclusively (not FP function attributes) I am trying to untangle the fast-math-flags propagation logic in the vectorizers (see a6f022127 for SLP). The loop vectorizer has a mix of checking FP function attributes, IR-level FMF, and just wrong assumptions. I am trying to avoid regressions while fixing this, and I think the IR-level logic is good enough for that, but it's hard to say for sure. This would be the 1st step in the clean-up. The existing test that I changed to include 'fast' actually shows a miscompile: the function only had the equivalent of nnan, but we created new instructions that had fast (all FMF set). This is similar to the example in https://llvm.org/PR35538 Differential Revision: https://reviews.llvm.org/D95452	2021-01-27 14:17:11 -05:00
Julian Lettner	8ef46f8e36	[lit][NFC] Use enum to represent test order Use an enum to more explicitly represent the test execution order.	2021-01-27 11:10:17 -08:00
LLVM GN Syncbot	7f9359f621	[gn build] Port 0b50fa99452f	2021-01-27 18:55:59 +00:00
Fangrui Song	b3b970744d	[ThinLTO] Add Visibility bits to GlobalValueSummary::GVFlags Imported functions and variables get the visibility from the module supplying the definition. However, non-imported definitions do not get the visibility from (ELF) the most constraining visibility among all modules or (Mach-O) the visibility of the prevailing definition. This patch * adds visibility bits to GlobalValueSummary::GVFlags * computes the result visibility and propagates it to all definitions Protected/hidden can imply dso_local which can enable some optimizations (this is stronger than GVFlags::DSOLocal because the implied dso_local can be leveraged for ELF -shared while default visibility dso_local has to be cleared for ELF -shared). Note: we don't have summaries for declarations, so for ELF if a declaration has the most constraining visibility, the result visibility may not be that one. Differential Revision: https://reviews.llvm.org/D92900	2021-01-27 10:43:51 -08:00
Craig Topper	96c406a887	[FaultsMaps][llvm-objdump] Move FaultMapParser to Object/. Remove CodeGen dependency from llvm-objdump FaultsMapParser lived in CodeGen and was forcing llvm-objdump to link CodeGen and everything CodeGen depends on. This was previously attempted in r240364 to fix a link failure. The CodeGen dependency was independently added to fix the same link failure, and that ended up being kept. Removing the dependency seems like the correct layering for llvm-objdump. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D95414	2021-01-27 10:39:59 -08:00
Craig Topper	5d45e25ce3	[RISCV] Group the legal vector types into lists we can iterate over in the RISCVISelLowering constructor Remove the RISCVVMVTs namespace because I don't think it provides a lot of value. If we change the mappings we'd likely have to add or remove things from the list anyway. Add a wrapper around addRegisterClass that can determine the register class from the fixed size of the type. Reviewed By: frasercrmck, rogfer01 Differential Revision: https://reviews.llvm.org/D95491	2021-01-27 10:20:12 -08:00
Valentin Clement	f7b64e36c0	[flang][openacc] Allow multiple wait clauses kernels loop and enter data had too restrictive a constraint for the wait clause. The wait clause is allowed multiple times and not only once. This patch fixes this problem. Reviewed By: SouraVX Differential Revision: https://reviews.llvm.org/D95469	2021-01-27 13:18:46 -05:00
Florian Hahn	5d270f12d2	[LoopUtils] Pass SCEVExpander instead SE to addRuntimeChecks. This gives the user control over which expander to use, which in turn allows the user to decide what to do with the expanded instructions. Used in D75980. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D94295	2021-01-27 17:36:19 +00:00
Paul C. Anagnostopoulos	8871b402ec	[TableGen] [DetailedRecords] Print record name that is null string as "" Differential Revision: https://reviews.llvm.org/D95312 Add a test for the backend.	2021-01-27 10:41:46 -05:00
Simon Pilgrim	b96bc31852	ExecutionDomainFix.cpp - use const refs in for-range loops. NFCI. Avoid unnecessary copies. Reported by clang-tidy.	2021-01-27 15:39:32 +00:00
Simon Pilgrim	bd3db94c3e	[Support] CommandLine.cpp - Fix clang-tidy namespace comment warnings. NFCI. Ensure namespace braces have the correct comment with them	2021-01-27 15:39:31 +00:00
Simon Pilgrim	588dbadb28	[Support] Fix clang-tidy auto warnings. NFCI. Use auto pointer/reference to fix llvm-qualified-auto remarks.	2021-01-27 15:39:31 +00:00
Roman Lebedev	080389bd22	[CodeGen] SafeStack: preserve DominatorTree if it is available While this is mostly NFC right now, because only ARM happens to run this pass with DomTree available before it, and required after it, more backends will be affected once the SimplifyCFG's switch for domtree preservation is flipped, and DwarfEHPrepare also preserves the domtree.	2021-01-27 18:32:35 +03:00
Roman Lebedev	aaa9ac5eb9	[NFC] StackProtector: be consistent and initialize DominatorTreeWrapperPass We already ask for it, so it might be good to ensure that it is actually initialized before us. Doesn't seem to matter in practice though.	2021-01-27 18:32:35 +03:00
Valentin Clement	1973b57fb0	[flang][openacc] Fix clause restriction for exit data directive Restrictions on clauses for the EXIT DATA directive were not fully correct. This patch fixes the situation. The async, if and finalize clauses are allowed only once. Reviewed By: SouraVX Differential Revision: https://reviews.llvm.org/D95470	2021-01-27 10:07:19 -05:00
Valentin Clement	202f767031	[flang][openacc] Fix clause restriction for host_data directive Restrictions on clauses for the HOST_DATA directive were not fully correct. This patch fixes the situation. The if and if_present clauses are allowed only once. Reviewed By: SouraVX Differential Revision: https://reviews.llvm.org/D95473	2021-01-27 10:06:33 -05:00
Florian Hahn	54e6d8f840	[LoopUnswitch] Add test cases not partially unswitched due to cost. This pre-commits tests for D95468.	2021-01-27 13:10:33 +00:00
Jeremy Morse	1f513e79c9	[DWARF] Create subprogram's DIE in DISubprogram's unit This is a fix for PR48790. Over in D70350, subprogram DIEs were permitted to be shared between CUs. However, the creation of a subprogram DIE can be triggered early, from other CUs. The subprogram definition is then created in one CU, and when the function is actually emitted children are attached to the subprogram that expect to be in another CU. This breaks internal CU references in the children. Fix this by redirecting the creation of subprogram DIEs in getOrCreateContextDIE to the CU specified by its DISubprogram definition. This ensures that the subprogram DIE is always created in the correct CU. Differential Revision: https://reviews.llvm.org/D94976	2021-01-27 12:36:14 +00:00
Mindong Chen	9ad5ebcbb2	[SCEV] Fix incorrect loop exit count analysis. In computeLoadConstantCompareExitLimit, the addrec used to compute the exit count should be from the loop which the exiting block belongs to. Reviewed by: mkazantsev Differential Revision: https://reviews.llvm.org/D92367	2021-01-27 19:36:05 +08:00
Sjoerd Meijer	215e8fb34a	[MachineLICM][MachineSink] Move SinkIntoLoop to MachineSink. This moves SinkIntoLoop from MachineLICM to MachineSink. The motivation for this work is that hoisting is a canonicalisation transformation, but we do not really have a good story to sink instructions back if that is better, e.g. to reduce live-ranges, register pressure and spilling. This has been discussed a few times on the list, the latest thread is: https://lists.llvm.org/pipermail/llvm-dev/2020-December/147184.html There it was pointed out that we have the LoopSink IR pass, but that works on IR, lacks register pressure information, and is focused on profile guided optimisations, and then we have MachineLICM and MachineSink that both perform sinking. MachineLICM is more about hoisting and CSE'ing of hoisted instructions. It also contained a very incomplete and disabled-by-default SinkIntoLoop feature, which we now move to MachineSink. Getting loop-sinking to do something useful is going to be at least a 3-step approach: 1) This is just moving the code and is almost an NFC, but contains a bug fix. This uses helper function `isLoopInvariant` that was factored out in D94082 and added to MachineLoop. 2) A first functional change to make loop-sink a little bit less restrictive, which it really is at the moment, is the change in D94308. This lets it do more (alias) analysis using functions in MachineSink, making it a bit more powerful. Nothing changes much: still off by default. But it shows that MachineSink is a better home for this, and it starts using its functionality like `hasStoreBetween`, and in the next step we can use `isProfitableToSinkTo`. 3) This is going to be the interesting step: decision making when and how many instructions to sink. This will be driven by the register pressure, and deciding if reducing live-ranges and loop sinking will help in better performance. 4) Once we are happy with 3), this should be enabled by default, that should be the end goal of this exercise. Differential Revision: https://reviews.llvm.org/D93694	2021-01-27 10:49:56 +00:00
David Green	5c8cfcc0df	[AArch64] Add vector saturating add intrinsic costs This adds sadd.sat, uadd.sat, ssub.sat and usub.sat costs for AArch64, similar to how they were recently added for ARM. Differential Revision: https://reviews.llvm.org/D95292	2021-01-27 10:38:32 +00:00
Fraser Cormack	0f5c801164	[RISCV] Fix a codegen crash in getSetCCResultType This patch fixes some crashes coming from `RISCVISelLowering::getSetCCResultType`, which would occasionally return an EVT constructed from an invalid MVT, which has a null Type pointer. The attached test shows this happening currently for some fixed-length vectors, which hit this issue when the V extension was enabled, even though they're not legal types under the V extension. The fix was also pre-emptively extended to scalable vectors which can't be represented as an MVT, even though a test case couldn't be found for them. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D95434	2021-01-27 10:22:54 +00:00
Jay Foad	fbb73224ea	[AMDGPU] Write "GFX6-GFX9" instead of "GFX6-9" in docs ... and similarly for some other cases. This is for consistency and to make it easier to search for mentions of a particular architecture. Differential Revision: https://reviews.llvm.org/D95453	2021-01-27 10:07:07 +00:00
David Green	cfa651ce67	[ARM] Add neon FP16 scalar_to_vector patterns. This adds some simple fp16 scalar_to_vector patterns, preventing a selection failure if this came up. Differential Revision: https://reviews.llvm.org/D95427	2021-01-27 09:59:15 +00:00
Cassie Jones	211141f4c9	[Test][AArch64] Use named vregs in overflow legalization tests. NFC	2021-01-27 04:40:49 -05:00
Cassie Jones	cf15864409	[AArch64][GlobalISel] Make G_SADDE and G_SSUBE legal This makes G_SADDE and G_SSUBE legal in preparation for further work legalizing overflowing operations. It's fine that they don't have an instruction selector implementation yet, because G_UADDE and G_USUBE are already legal on AArch64 without an instruction selector implementation. This completes the set of G_[SU]{ADD,SUB}[EO] operations on AArch64. Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D95325	2021-01-27 04:36:17 -05:00
Alexey Bader	efed872d6e	Fix an error about implicit fallthrough during self build - new tag for ittapi. A fix has been implemented in the ittapi repo to fix an error about implicit fallthrough in a switch that was occurring during self build. A new tag has been created for that fix. This is to update the tag. Reviewed By: bader Differential Revision: https://reviews.llvm.org/D95462 Patch by Zahira Ammarguellat.	2021-01-27 08:55:52 +03:00
Kazu Hirata	923c60906b	[llvm-objdump] Use append_range (NFC)	2021-01-26 20:00:19 -08:00
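
The [RISCV] isUnneededShiftMask entry above (ae2c374959) rests on the observation that a shift reads only the low log2(xlen) bits of its amount, so an AND on the shift amount is redundant when its mask has at least that many trailing ones, and that mask bits cleared by SimplifyDemandedBits can be filled back in from bits known to be zero. The following is a minimal Python sketch of that reasoning only, not the LLVM code: the names `sll`, `trailing_ones` and `is_unneeded_shift_mask` are hypothetical, and `known_zero` stands in for the result of computeKnownBits.

```python
def sll(value: int, amount: int, xlen: int = 64) -> int:
    """Model a left shift that reads only the low log2(xlen) bits of `amount`."""
    shamt = amount & (xlen - 1)
    return (value << shamt) & ((1 << xlen) - 1)


def trailing_ones(mask: int) -> int:
    """Count the consecutive one bits starting at bit 0."""
    count = 0
    while mask & 1:
        mask >>= 1
        count += 1
    return count


def is_unneeded_shift_mask(mask: int, width: int, known_zero: int = 0) -> bool:
    """Return True if `x & mask` can be replaced by `x` as a shift amount.

    `width` is the number of low bits the shift actually reads.
    `known_zero` (an assumption of this sketch) marks bits of x known to be 0;
    mask bits cleared at those positions do not change x, so they are
    filled back in before counting trailing ones.
    """
    return trailing_ones(mask | known_zero) >= width
```

With a 64-bit shift (`width` of 6), a mask of 63 or 127 is unneeded, a mask of 31 is not, and a mask of 0b111110 becomes unneeded once bit 0 of the shift amount is known to be zero.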


210470 Commits