llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Sebastian Neubauer	7e4be9501b	[AMDGPU] Add amdgpu_gfx calling convention Add a calling convention called amdgpu_gfx for real function calls within graphics shaders. For the moment, this uses the same calling convention as other calls in amdgpu, with registers excluded for return address, stack pointer and stack buffer descriptor. Differential Revision: https://reviews.llvm.org/D88540	2020-11-09 16:51:44 +01:00
Momchil Velikov	74a3336afb	[ARM][MachineOutliner] Emit more CFI instructions This patch makes the outliner emit CFI instructions in a few more places: * after LR is restored, but before the return in an outlined function * around save/restore of LR to/from a register at calls to outlined functions * around save/restore of LR to/from the stack at calls to outlined functions The latter two only when the function does NOT spill LR. If the function spills LR, then outliner generated saves/restores around calls are not considered interesting for unwinding the frame. Differential Revision: https://reviews.llvm.org/D89483	2020-11-09 15:26:18 +00:00
Sam Tebbs	f1a46b83c8	[ARM][LowOverheadLoops] Merge a VCMP and the new VPST into a VPT There were cases where a VCMP and a VPST were merged even if the VCMP didn't have the same defs of its operands as the VPST. This is fixed by adding RDA checks for the defs. This however gave rise to cases where the new VPST created would precede the un-merged VCMP and so would fail a predicate mask assertion since the VCMP wasn't predicated. This was solved by converting the VCMP to a VPT instead of inserting the new VPST. Differential Revision: https://reviews.llvm.org/D90461	2020-11-09 15:03:48 +00:00
Sjoerd Meijer	64e65d36e2	[LoopFlatten] FlattenInfo bookkeeping. NFC. Introduce struct FlattenInfo to group some of the bookkeeping. Besides this being a bit of a clean-up, it is a prep step for next additions (D90640). I could take things a bit further, but thought this was a good first step also not to make this change too large. Differential Revision: https://reviews.llvm.org/D90408	2020-11-09 14:50:26 +00:00
Florian Hahn	a912c32467	[VPlan] Print result value for loads in VPWidenMemoryInst (NFC). For loads, print the result value.	2020-11-09 14:01:29 +00:00
Florian Hahn	fa1fae0b2b	[VPlan] Add isStore helper to VPWidenMemoryInstructionRecipe (NFC). Move logic to check if the recipe is a store to a helper for easier reuse.	2020-11-09 14:01:29 +00:00
Jay Foad	ef136f65b2	[AMDGPU] Remove unused DisableDecoder machinery. NFC. This has been unused since D24738.	2020-11-09 13:53:27 +00:00
Florian Hahn	1c03782e21	[VPlan] Use VPValue def for VPWidenCall. This patch turns VPWidenCall into a VPValue and uses it during VPlan construction and codegeneration instead of the plain IR reference where possible. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D84681	2020-11-09 13:29:41 +00:00
David Green	6a6e7afc31	[ARM] Remove kill flags between VCMP and insertion point When we fold a VCMP into a VPST instruction any kill flags between the old VCMP position and the new insertion point need to be removed, in order to keep the verifier happy. Differential Revision: https://reviews.llvm.org/D90964	2020-11-09 13:17:53 +00:00
Lucas Prates	890ac39cb5	[ARM][AArch64] Adding Neoverse V1 CPU support Add support for the Neoverse V1 CPU to the ARM and AArch64 backends. This is based on patches from Mark Murray and Victor Campos. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D90765	2020-11-09 13:15:40 +00:00
Francesco Petrogalli	f5046224cf	[llvm][AArch64] Simplify (and (sign_extend..) #bitmask). Fold VT = (and (sign_extend NarrowVT to VT) #bitmask) into VT = (zero_extend NarrowVT) With this combine, the test replaces a sign extended load + an unsigned extension with a zero extended load to render one of the operands of the last multiplication. BEFORE \| AFTER f_i16_i32: \| f_i16_i32: .fnstart \| .fnstart ldrsh r0, [r0] \| ldrh r1, [r1] ldrsh r1, [r1] \| ldrsh r0, [r0] smulbb r0, r1, r0 \| smulbb r0, r0, r1 uxth r1, r1 \| mul r0, r0, r1 mul r0, r0, r1 \| bx lr bx lr \| Reviewed By: resistor Differential Revision: https://reviews.llvm.org/D90605	2020-11-09 12:53:36 +00:00
Florian Hahn	a9b449ce40	[VPlan] Add printOperands helper to VPUser (NFC). Factor out the code for printing operands of a VPUser so it can be re-used when printing other recipes.	2020-11-09 12:30:57 +00:00
LemonBoy	3c49ee343c	[InstCombine] Fix constant-folding of overflowing arithmetic ops on vectors Feeding vector values to `InstCombiner::OptimizeOverflowCheck` produces a scalar boolean flag if it proves the overflow check can be eliminated. This causes `InstCombiner::CreateOverflowTuple` to crash as it correctly expects a vector of i1 values instead. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D89628	2020-11-09 14:41:07 +03:00
Michał Górny	1cdecf37fe	[llvm] [Support] Fix segv if argv0 is null in getMainExecutable() When LLDB Python bindings are used and stack backtraces are enabled for logging, getMainExecutable() is called with argv0 being null. This caused the fallback function getprogpath() (used on FreeBSD, NetBSD and Linux) to segfault. Make it handle null executable name gracefully. Differential Revision: https://reviews.llvm.org/D91012	2020-11-09 11:35:11 +01:00
Georgii Rymar	f954f43df3	[yaml2obj] - ProgramHeaders: introduce FirstSec/LastSec instead of Sections list. Imagine we have a YAML declaration of a few sections: `foo1`, `<unnamed 2>`, `foo3`, `foo4`. To put them into a segment we can do (1): ``` Sections: - Section: foo1 - Section: foo4 ``` or we can use (2): ``` Sections: - Section: foo1 - Section: foo3 - Section: foo4 ``` or (3) : ``` Sections: - Section: foo1 ## "(index 2)" here is a name that we automatically created for an unnamed section. - Section: (index 2) - Section: foo3 - Section: foo4 ``` It looks really confusing that we don't have to list all of the sections. At first I tried to make this rule stricter and report an error when there is a gap (i.e. when a section is included into a segment, but not listed explicitly). This did not work perfectly, because such an approach conflicts with unnamed sections/fills (see (3)). This patch drops the "Sections" key and introduces 2 keys instead: `FirstSec` and `LastSec`. Both are optional. Differential revision: https://reviews.llvm.org/D90458	2020-11-09 13:00:50 +03:00
Georgii Rymar	04b897752e	Recommit: [llvm-readelf/obj] - Allow dumping of ELF header even if some elements are corrupt. This is a recommit of D90903 with fixes for BB: 1) Used std::move<> when returning Expected<> (http://lab.llvm.org:8011/#/builders/112/builds/913) 2) Fixed the name of the temporary file in file-headers.test (http://lab.llvm.org:8011/#/builders/36/builds/1269) (a local old temporary file was used before) For creating `ELFObjectFile` instances we have the factory method `ELFObjectFile<ELFT>::create(MemoryBufferRef Object)`. The problem with this method is that it scans the section header to locate some sections. When a file is truncated or has broken fields in the ELF header, this approach does not allow us to create the `ELFObjectFile` and dump the ELF header. This is https://bugs.llvm.org/show_bug.cgi?id=40804 This patch suggests a solution - it allows delaying the scanning of sections in `ELFObjectFile<ELFT>::create`. It now allows user code to call an object initialization (`initContent()`) later. With that it is possible, for example, for dumpers just to dump the file header and exit. By default initialization is still performed as before, which helps to keep the logic of existing callers untouched. I've experimented with different approaches while working on this patch. I think this approach is better than doing initialization of sections (i.e. scan of them) on demand, because normally users of the `ELFObjectFile` API expect to work with a valid object. In most cases when a section header table can't be read (because of an error), we don't have to continue to work with the object. So we probably don't need to implement a more complex API. Differential revision: https://reviews.llvm.org/D90903	2020-11-09 12:53:53 +03:00
Tim Northover	5d0b348cd7	[MergeFunctions] fix function attribute comparison in FunctionComparator The comparison of AttributeSets stopped after seeing a matching type attribute. Subsequent mismatching attributes were not detected causing a crash.	2020-11-09 09:19:11 +00:00
Georgii Rymar	aaa86f8a5c	Revert "[llvm-readelf/obj] - Allow dumping of ELF header even if some elements are corrupt." This reverts commit ea8a0b8b29eb08d3f0f6ac40942a2d8e98ab57ee. It broke BBots. http://lab.llvm.org:8011/#/builders/14/builds/1439 http://lab.llvm.org:8011/#/builders/112/builds/913	2020-11-09 11:50:50 +03:00
Georgii Rymar	0e596f45f4	[llvm-readelf/obj] - Allow dumping of ELF header even if some elements are corrupt. For creating `ELFObjectFile` instances we have the factory method `ELFObjectFile<ELFT>::create(MemoryBufferRef Object)`. The problem with this method is that it scans the section header to locate some sections. When a file is truncated or has broken fields in the ELF header, this approach does not allow us to create the `ELFObjectFile` and dump the ELF header. This is https://bugs.llvm.org/show_bug.cgi?id=40804 This patch suggests a solution - it allows delaying the scanning of sections in `ELFObjectFile<ELFT>::create`. It now allows user code to call an object initialization (`initContent()`) later. With that it is possible, for example, for dumpers just to dump the file header and exit. By default initialization is still performed as before, which helps to keep the logic of existing callers untouched. I've experimented with different approaches while working on this patch. I think this approach is better than doing initialization of sections (i.e. scan of them) on demand, because normally users of the `ELFObjectFile` API expect to work with a valid object. In most cases when a section header table can't be read (because of an error), we don't have to continue to work with the object. So we probably don't need to implement a more complex API. Differential revision: https://reviews.llvm.org/D90903	2020-11-09 11:27:07 +03:00
Georgii Rymar	b24d31e79d	[yaml2obj] - Implement BBAddrMapSection::getEntries(). NFC. This allows using the generic field validation mechanism that we have. The behavior (i.e. an error reported) remains the same.	2020-11-09 11:11:57 +03:00
Michael Liao	171536d3a1	[GlobalsAA] Teach to handle `addrspacecast`.	2020-11-09 00:04:52 -05:00
Sanjay Patel	de04fc5560	[InstSimplify] allow vector folds for (Pow2C << X) == NonPow2C Existing pre-conditions seem to be correct: https://rise4fun.com/Alive/lCLB Name: non-zero C1 Pre: !isPowerOf2(C1) && isPowerOf2(C2) && C1 != 0 %sub = shl i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false Name: one == C2 Pre: !isPowerOf2(C1) && isPowerOf2(C2) && C2 == 1 %sub = shl i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false Name: nuw Pre: !isPowerOf2(C1) && isPowerOf2(C2) %sub = shl nuw i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false Name: nsw Pre: !isPowerOf2(C1) && isPowerOf2(C2) %sub = shl nsw i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false	2020-11-08 09:52:05 -05:00
Simon Pilgrim	1f0f66fcab	[DSE] Don't dereference a dyn_cast<> result - use cast<> instead. NFCI. We were relying on the dyn_cast<> succeeding - better to use cast<> and have it assert that it's the correct type than dereference a null result.	2020-11-08 13:07:45 +00:00
Simon Pilgrim	3bc6fadf68	[InstCombine] foldSelectFunnelShift - block poison in funnel shift value As raised by @nlopes on D90382 - if this is not a rotate then the select was blocking poison from the 'shift-by-zero' non-TVal, but a funnel shift won't - so freeze it.	2020-11-08 12:58:30 +00:00
Florian Hahn	738feb9acf	[LoopInterchange] Skip non SCEV-able operands in cost function. This fixes a crash when trying to get a SCEV expression for operands that are not SCEV-able.	2020-11-08 11:41:19 +00:00
Pedro Tammela	b59848c68a	[Reg2Mem] add support for the new pass manager This patch refactors the pass to accommodate the new pass manager boilerplate. Differential Revision: https://reviews.llvm.org/D91005	2020-11-08 11:14:05 +00:00
Arthur Eubanks	35a4ce12d4	Revert "[NewPM] Provide method to run all pipeline callbacks, used for -O0" This reverts commit ae38540042668675dd16c642d850115f217ea59f. As well as some follow-up test fixes. The original change causes new-pass-manager.ll to fail when polly is enabled.	2020-11-08 00:32:35 -08:00
Craig Topper	8e0220801f	[X86] Improve lowering of fptoui Invert the select condition when masking in the sign bit of a fptoui operation. Also, rather than lowering the sign mask to select/xor and expecting the select to get cleaned up later, directly lower to shift/xor. Patch by Layton Kifer! Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D90658	2020-11-07 23:50:03 -08:00
Craig Topper	9510c04a9c	[RISCV] Remove assertsexti32 from a couple B extension isel patterns that don't demand the sign extended bits.	2020-11-07 22:43:16 -08:00
Kazu Hirata	6c4e8bd385	[Mem2Reg] Use llvm::count instead of std::count (NFC)	2020-11-07 20:18:47 -08:00
Kazu Hirata	9abe698dc8	[JumpThreading] Fix function names (NFC)	2020-11-07 19:35:03 -08:00
Carl Ritson	339223bd21	[AMDGPU] SIWholeQuadMode fix mode insertion when SCC always defined Fix a crash when SCC is defined until end of block and mode change must be inserted in SCC live region. Reviewed By: mceier Differential Revision: https://reviews.llvm.org/D90997	2020-11-08 11:14:57 +09:00
Craig Topper	c3928d2bd0	[RISCV] Use (not X) instead of (xor X, -1) in isel patterns to improve readability. NFC	2020-11-07 11:50:52 -08:00
Jonas Devlieghere	85b15f0ad5	[DWARFLinker] Convert analyzeContextInfo to a work list (NFC) Convert analyzeContextInfo to a work list using the same approach I used to remove the recursion from lookForDIEsToKeep. This fixes the crash reported in https://llvm.org/PR48029. Tested using the reproducer attached to PR48029 as well as by comparing the clang MD5 hashes before and after the change (with and without gmodules). Differential revision: https://reviews.llvm.org/D90873	2020-11-07 10:46:09 -08:00
Nikita Popov	80a7041502	[BasicAA] Unify struct/other offset (NFC) The distinction between StructOffset and OtherOffset has been originally introduced by 82069c44ca39df9d506e16bfb0ca2481866dd0bb, which applied different reasoning to both offset kinds. However, this distinction was not actually correct, and has been fixed by c84e77aeaefccb8d0c4c508b8017dcad80607f53. Since then, we only ever consider the sum StructOffset + OtherOffset, so we may as well store it in that form directly.	2020-11-07 18:56:05 +01:00
Nikita Popov	c44d13b67c	[BasicAA] Use smul_ov helper (NFCI) Instead of performing the multiplication in double the bit width and using active bits to determine overflow, use the existing smul_ov() APInt method to detect overflow. The smul_ov() implementation is not particularly efficient, but it's still better than doing this in a wide, usually 128-bit, type.	2020-11-07 18:14:48 +01:00
Nikita Popov	c77875582f	[CaptureTracking] Add statistics (NFC) Add basic statistics on the number of pointers that have been determined to maybe capture / not capture.	2020-11-07 12:57:00 +01:00
Nikita Popov	32c88b4c53	[CaptureTracking] Early abort on too many uses (NFCI) If there are too many uses, we should directly return -- there's no point in inspecting the remaining uses in the worklist, as we have to conservatively assume a capture anyway. This also means that tooManyUses() gets called exactly once, rather than potentially many times. This restores the behavior prior to e9832dfdf366ddffba68164adb6855d17c9f87c1, where this was accidentally changed while moving the AddUses logic into a closure, thus making the return a return from the closure rather than the whole function.	2020-11-07 11:52:08 +01:00
Nikita Popov	c9414e5876	[CaptureTracking] Correctly handle multiple uses in one instruction If the same value is used multiple times in the same instruction, CaptureTracking may end up reporting the wrong use as being captured, and/or report the same use as being captured multiple times. Make sure that all checks take the use operand number into account, rather than performing unreliable comparisons against the used value. I'm not sure whether this can cause any problems in practice, but at least some capture trackers (ArgUsesTracker, AACaptureUseTracker) do care about which call argument is captured.	2020-11-07 11:31:20 +01:00
Nikita Popov	5c9c72b83d	[CaptureTracking] Avoid duplicate shouldExplore() check (NFCI) We check shouldExplore() before adding uses to the worklist, so uses that should not be explored will not reach captured() in the first place.	2020-11-07 10:16:58 +01:00
Kazu Hirata	f462bf9b0e	[BranchProbabilityInfo] Simplify getEdgeProbability (NFC) The patch simplifies BranchProbabilityInfo::getEdgeProbability by handling two cases separately, depending on whether we have edge probabilities. - If we have edge probabilities, then add up probabilities for successors being equal to Dst. - Otherwise, return the number of occurrences divided by the total number of successors. Differential Revision: https://reviews.llvm.org/D90980	2020-11-06 22:47:22 -08:00
Atmn Patel	3dd5790777	Revert "[LoopDeletion] Allows deletion of possibly infinite side-effect free loops" This reverts commit 0b17c6e4479d62bd4ff05c48d6cdf340b198832f. This patch causes a compile-time error in SCEV.	2020-11-07 00:32:12 -05:00
Fangrui Song	840325e8ac	AsmPrinter/Dwarf*: Use llvm::Register instead of unsigned	2020-11-06 21:00:28 -08:00
Fangrui Song	d460b6115e	[AsmPrinter] Rename ByteStreamer::EmitInt8 to emitInt8 to be consistent with other emit*	2020-11-06 20:02:56 -08:00
Jonas Devlieghere	34b67b4c77	[DWARFLinker] Add CompileUnit::getInfo helper that takes a DWARFDie (NFC) Eliminate the need to go through the DIE index by passing the DIE to CompileUnit::getInfo directly. Before: unsigned Idx = Unit->getOrigUnit().getDIEIndex(Die); CompileUnit::DIEInfo &Info = Unit->getInfo(Idx); After: CompileUnit::DIEInfo &Info = Unit->getInfo(Die);	2020-11-06 19:37:44 -08:00
Atmn Patel	51ad1efef5	[LoopDeletion] Allows deletion of possibly infinite side-effect free loops From C11 and C++11 onwards, a forward-progress requirement has been introduced for both languages. In the case of C, loops with non-constant conditionals that do not have any observable side-effects (as defined by 6.8.5p6) can be assumed by the implementation to terminate, and in the case of C++, this assumption extends to all functions. The clang frontend will emit the `mustprogress` function attribute for C++ functions (D86233, D85393, D86841) and emit the loop metadata `llvm.loop.mustprogress` for every loop in C11 or later that has a non-constant conditional. This patch modifies LoopDeletion so that only loops with the `llvm.loop.mustprogress` metadata or loops contained in functions that are required to make progress (`mustprogress` or `willreturn`) are checked for observable side-effects. If these loops do not have an observable side-effect, then we delete them. Loops without observable side-effects that do not satisfy the above conditions will not be deleted. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D86844	2020-11-06 22:06:58 -05:00
Elvina Yakubova	7acee7fe26	[AArch64] Add pipeline model for HiSilicon's TSV110 This patch adds the scheduling and cost model for TSV110. Reviewed by: SjoerdMeijer, bryanpkc Differential Revision: https://reviews.llvm.org/D89972	2020-11-07 01:23:00 +03:00
Eric Astor	fb1b9af1ea	[ms] [llvm-ml] Allow arbitrary strings as integer constants MASM interprets strings in expression contexts as integers expressed in big-endian base-256, treating each character as its ASCII representation. This completely eliminates the need to special-case single-character strings. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D90788	2020-11-06 17:15:49 -05:00
Atmn Patel	f15e3a4579	[LoopDeletion] Remove dead loops with no exit blocks Currently, LoopDeletion refuses to remove dead loops with no exit blocks because it cannot statically determine the control flow after it removes the block. This leads to miscompiles if the loop is an infinite loop and should've been removed. Differential Revision: https://reviews.llvm.org/D90115	2020-11-06 17:08:34 -05:00
Rahman Lavaee	2405269073	[obj2yaml] [yaml2obj] Add yaml support for SHT_LLVM_BB_ADDR_MAP section. YAML support allows us to better test the feature in the subsequent patches. The implementation is quite similar to the .stack_sizes section. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D88717	2020-11-06 12:44:42 -08:00
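
The llvm-ml entry above (fb1b9af1ea) says MASM reads a string in an expression context as a big-endian base-256 integer, one ASCII character per digit. A minimal sketch of that rule, written in Python rather than taken from the patch; the function name `masm_string_to_int` is hypothetical and is not part of llvm-ml:

```python
def masm_string_to_int(text: str) -> int:
    """Read an ASCII string as a big-endian base-256 integer.

    Hypothetical helper illustrating the rule in the commit message;
    it is not the llvm-ml implementation.
    """
    value = 0
    for byte in text.encode("ascii"):
        # Each character is one base-256 digit; earlier characters
        # are more significant (big-endian).
        value = value * 256 + byte
    return value


print(hex(masm_string_to_int("A")))   # 0x41
print(hex(masm_string_to_int("AB")))  # 0x4142
```

A single-character string is just the one-digit case of the same loop, which matches the commit's remark that single-character strings no longer need special handling.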

140922 Commits