llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Hiroshi Yamauchi	49175f8a79	[PGO][PGSO] Use IsColdXNthPercentile for sample PGO. Summary: This performs better for sample PGO. NFC as PGSOColdCodeOnlyForSamplePGO is still true. Reviewers: davidxl Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75550	2020-03-05 09:54:54 -08:00
Jordan Rupprecht	654f0b7002	[llvm-readobj] Include section name of notes. This changes the output of `llvm-readelf -n` from: ``` Displaying notes found at file offset 0x<...> with length 0x<...>: ``` to: ``` Displaying notes found in: .note.foo ``` And similarly, adds a `Name:` field to the `llvm-readobj -n` output for notes. This change not only increases GNU compatibility, it also makes it much easier to read notes. Note that we still fall back to printing the file offset/length in cases where we don't have a section name, such as when printing notes in program headers or printing notes in a partially stripped file (GNU readelf does the same). Fixes llvm.org/PR41339. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D75647	2020-03-05 09:53:14 -08:00
Philip Reames	636d13a938	[X86/MC] Factor out common code [NFC]	2020-03-05 09:43:41 -08:00
Pablo Barrio	0456b1384a	Fix MemTagSanitizer docs to point at Armv8.5-A MTE The Memory Tagging Extension was introduced in Armv8.5-A.	2020-03-05 17:23:58 +00:00
Rodrigo Dominguez	463c945344	AMDGPU: Add/Fix tests for image atomic intrinsic. Summary: Add tests for 64-bit image atomic swap and cmpswap. Fix tests for 32-bit image atomic add. Change-Id: Ibb7619749c1ad504b24aa1c5f3185417a3013f3c Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, jfb, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75295	2020-03-05 12:18:15 -05:00
David Stuttard	a1cc4559df	AMDGPU: Fix SMRD test in trivially disjoint mem access code Summary: This seems like an obvious error - cut and paste issue? The change does make a change to one of the lit tests - it stops s_buffer_load re-ordering past an MUBUF instruction (which is not surprising). Change-Id: I80be99de5b62af4f42e91af2591b76a52ac9efa6 Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75686	2020-03-05 17:14:01 +00:00
Chris Bowler	9320aeabf1	[AIX] Extend int arguments to register width when passed in stack memory. This is a follow up to the previous patch: [AIX] Implement caller arguments passed in stack memory. This corrects a defect in AIX 64-bit where an i32 is written to the stack with stw (4 bytes) rather than the expected std (8 bytes.) Integer arguments pass on the stack as images of their register representation. I also took the opportunity to tidy up some of the calling convention AIX tests I added in my last commit. This patch adds the missed assembly expected output for the stack arg int case, which would have caught this problem. Differential Revision: https://reviews.llvm.org/D75126	2020-03-05 11:49:16 -05:00
Juneyoung Lee	82b7ae1c5a	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison look into branch conditions of dominating blocks' terminators Summary: ``` br i1 c, BB1, BB2: BB1: use1(c) BB2: use2(c) ``` In BB1 and BB2, c is never undef or poison because otherwise the branch would have triggered UB. This is a resubmission of 952ad47 with crash fix of llvm/test/Transforms/LoopRotate/freeze-crash.ll. Checked with Alive2 Reviewers: xbolva00, spatel, lebedev.ri, reames, jdoerfert, nlopes, sanjoy Reviewed By: reames Subscribers: jdoerfert, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75401	2020-03-06 01:08:35 +09:00
Sanjay Patel	85bde64f46	[VectorCombine] add tests for different extract indexes; NFC	2020-03-05 10:33:21 -05:00
Florian Hahn	45b8bf3b33	[VPlan] Use consecutive numbers to print VPValues instead of addresses. Currently when printing VPValues we use the object address, which makes it hard to distinguish VPValues as they usually are large numbers with varying distance between them. This patch adds a simple slot tracker, similar to the ModuleSlotTracker used for IR values. In order to dump a VPValue or anything containing a VPValue, a slot tracker for the enclosing VPlan needs to be created. The existing VPlanPrinter can take care of that for the existing code. We assign consecutive numbers to each VPValue we encounter in a reverse post order traversal of the VPlan. Reviewers: rengolin, hsaito, fhahn, Ayal, dorit, gilr Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D73078	2020-03-05 14:55:15 +00:00
Daniel Kiss	77f345b07b	[AArch64] Harmonize print format of hint instructions. Summary: Hint instructions printed as "hint\t#hintnum" except in case of ARM v8.3a instruction only "hint #hintnum" is printed. This patch changes all format to the fist one. Reviewers: pbarrio, LukeCheeseman, vsk Reviewed By: vsk Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75625	2020-03-05 15:35:24 +01:00
Simon Pilgrim	c63367859f	Fix use-after-move warning. NFCI.	2020-03-05 14:22:25 +00:00
Simon Pilgrim	f057e90767	Fix "Value stored to 'RegForm' is never read" static analyzer warnings. NFC.	2020-03-05 14:22:24 +00:00
Simon Pilgrim	4abc657be6	Fix static analyzer uninitialized variable warning. NFCI.	2020-03-05 14:22:24 +00:00
Krasimir Georgiev	358a57e659	Revert "[BFI] Use CallbackVH to notify BFI about deletion of basic blocks" This reverts commit 8975aa6ea8172963d6532caa8ed2a6f6e0074a02. Causes a compilation warning: llvm-project/llvm/include/llvm/Analysis/BlockFrequencyInfoImpl.h:1037:43: warning: 'llvm::BlockFrequencyInfoImpl<llvm::BasicBlock>::BFICallbackVH' has virtual functions but non-virtual destructor [-Wnon-virtual-dtor] class BlockFrequencyInfoImpl<BasicBlock>::BFICallbackVH : public CallbackVH { ^ 1 warning generated.	2020-03-05 14:40:16 +01:00
Igor Kudrin	2ded1a0b45	Fix typos in comment marks.	2020-03-05 20:01:45 +07:00
Sanjay Patel	3910866e84	[VectorCombine] add x86 AVX run to test for better coverage; NFC	2020-03-05 07:54:31 -05:00
Daniil Suchkov	72bf7205ba	[BFI] Use CallbackVH to notify BFI about deletion of basic blocks With AssertingVHs instead of bare pointers in BlockFrequencyInfoImpl::Nodes (but without CallbackVHs) ~1/36 of all tests ran by make check fail. It means that there are users of BFI that delete basic blocks while keeping BFI. Some of those transformations add new basic blocks, so if a new basic block happens to be allocated at address where an already deleted block was and we don't explicitly set block frequency for that new block, BFI will report some non-default frequency for the block even though frequency for the block was never set. Inliner is an example of a transformation that adds and removes BBs while querying and updating BFI. With this patch, thanks to updates via CallbackVH, BFI won't keep stale pointers in its Nodes map. This is a resubmission of 408349a25d0f5a012003f84c95b49bcc7782fa70 with fixed MSVC compilation errors. Reviewers: davidxl, yamauchi, asbirlea, fhahn, fedor.sergeev Reviewed-By: asbirlea, davidxl Tags: #llvm Differential Revision: https://reviews.llvm.org/D75341	2020-03-05 18:55:07 +07:00
Daniil Suchkov	11c3969199	Revert "[BFI] Use CallbackVH to notify BFI about deletion of basic blocks" Reverting the patch because it causes compilation failure on MSVC. This reverts commit 408349a25d0f5a012003f84c95b49bcc7782fa70.	2020-03-05 18:27:42 +07:00
Daniil Suchkov	cc62ec6eb1	[BFI] Use CallbackVH to notify BFI about deletion of basic blocks With AssertingVHs instead of bare pointers in BlockFrequencyInfoImpl::Nodes (but without CallbackVHs) ~1/36 of all tests ran by make check fail. It means that there are users of BFI that delete basic blocks while keeping BFI. Some of those transformations add new basic blocks, so if a new basic block happens to be allocated at address where an already deleted block was and we don't explicitly set block frequency for that new block, BFI will report some non-default frequency for the block even though frequency for the block was never set. Inliner is an example of a transformation that adds and removes BBs while querying and updating BFI. With this patch, thanks to updates via CallbackVH, BFI won't keep stale pointers in its Nodes map. Reviewers: davidxl, yamauchi, asbirlea, fhahn, fedor.sergeev Reviewed-By: asbirlea, davidxl Tags: #llvm Differential Revision: https://reviews.llvm.org/D75341	2020-03-05 18:10:36 +07:00
Sam Parker	6e096c0e3b	[ARM][MVE] Enable SHRN for tail predication These instructions don't swap lanes so make them valid. Differential Revision: https://reviews.llvm.org/D75667	2020-03-05 11:00:45 +00:00
LLVM GN Syncbot	cf41359f7a	[gn build] Port cada5b881b6	2020-03-05 10:56:10 +00:00
Igor Kudrin	a4fab2c27b	[DebugInfo] Do not truncate 64-bit values when dumping CIEs and FDEs. This fixes printing long values that might reside in CIE and FDE, including offsets, lengths, and addresses. Differential Revision: https://reviews.llvm.org/D73887	2020-03-05 17:37:28 +07:00
Igor Kudrin	6b87aa0046	[DebugInfo] Refine the condition to detect CIEs. The condition was not accurate enough and could interpret some FDEs in .eh_frame or 64-bit DWARF .debug_frame sections as CIEs. Even though such FDEs are unlikely in a normal situation, the wrong interpretation could hide an issue in a buggy generator. Differential Revision: https://reviews.llvm.org/D73886	2020-03-05 17:37:09 +07:00
Georgii Rymar	3b5de8c142	[Object/ELF] - Fix a position calculation expression in ELFFile<ELFT>::getEntry(). It fixes now what 1c991f907a43d7a56e82dd67a76514843841ed9a tried to fix. (A test case failture on 32-bit Arch Linux) On 32-bit hosts it still fails (because it truncates the `Pos` value to 32 bits). It seems happens because of `sizeof` that returns `size_t`, which has a different size on 32/64 bits hosts. I've tested on a 32-bit host and verified that relocation-errors.test test and other LLVM tools tests pass now.	2020-03-05 12:49:31 +03:00
Daniil Suchkov	1ae4357369	[Test] Add a regression test for failure introduced by 952ad4701cf0d8da79789f6b83ddaa386c60d535	2020-03-05 16:32:37 +07:00
Daniil Suchkov	0be08100f6	Revert "[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison look into branch conditions of dominating blocks' terminators" That commit causes SIGSEGV on some simple tests. This reverts commit 952ad4701cf0d8da79789f6b83ddaa386c60d535.	2020-03-05 16:32:36 +07:00
serge-sans-paille	2cab2132ef	Avoid dangling reference on SectionList Bug spotted by https://cookieplmonster.github.io/2020/02/01/emulator-bug-llvm-bug/ Basically, holding references to object inside a resized vector is a bad idea. Differential Revision: https://reviews.llvm.org/D75110	2020-03-05 09:42:24 +01:00
Jun Ma	80803bd47c	[Coroutines] Optimized coroutine elision based on reachability Differential Revision: https://reviews.llvm.org/D75440	2020-03-05 14:43:50 +08:00
David Blaikie	557ca60d45	X86AsmBackend.cpp: #ifndef NDEBUG some only-used-in-asserts variables to fix the -Werror non-asserts build	2020-03-04 22:36:24 -08:00
Lang Hames	87a444bd04	[ORC] Remove hard dependency on libobjc when using MachOPlatform with LLJIT. The LLJIT::MachOPlatformSupport class used to unconditionally attempt to register __objc_selrefs and __objc_classlist sections. If libobjc had not been loaded this resulted in an assertion, even if no objc sections were actually present. This patch replaces this unconditional registration with a check that no objce sections are present if libobjc has not been loaded. This will allow clients to use MachOPlatform with LLJIT without requiring libobjc for non-objc code.	2020-03-04 21:49:28 -08:00
Sameer Sahasrabuddhe	d13ab42a5a	StructurizeCFG: simplify phi nodes when possible After structurization, some phi nodes can have a single incoming edge and can be simplified away. This change runs a simplify query on all phis that are either modified or added by the structurizer. This also moves some phis closer to their use as a side benefit. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D75500	2020-03-05 10:33:15 +05:30
Craig Topper	cac4f78ec3	[X86] Simplify the code at the end of lowerShuffleAsBroadcast. The original code could create a bitcast from f64 to i64 and back on 32-bit targets. This was only working because getBitcast was able to fold the casts away to avoid leaving the illegal i64 type. Now we handle the scalar case directly by broadcasting using the scalar type as the element type. Then bitcasting to the final VT. This works since we ensure the scalar type is the same size as the final VT element type. No more casts to i64. For the vector case, we cast to VT or subvector of VT. And then do the broadcast. I think this all matches what we generated before, just in a more readable way.	2020-03-04 20:45:02 -08:00
Philip Reames	7d0a2e9be8	Consistently capitalize a variable [NFC] One instance in a copy paste was pointed out in a review, fix all instances at once.	2020-03-04 20:00:08 -08:00
Michael Trent	4f4788bcfc	Fix dyld opcode *_ADD_ADDR_IMM_SCALED error detection. Summary: Move the check for malformed REBASE_OPCODE_ADD_ADDR_IMM_SCALED and BIND_OPCODE_DO_BIND_ADD_ADDR_IMM_SCALED opcodes after the immediate has been applied to the SegmentOffset. This fixes specious errors where SegmentOffset is pointing between two sections when trying to correct the SegmentOffset value. Update the regression tests to verify the proper error message. Reviewers: pete, ab, lhames, steven_wu, jhenderson Reviewed By: pete Subscribers: hiraditya, dexonsmith, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75629	2020-03-04 19:57:45 -08:00
Igor Kudrin	0ceba89b0b	[DebugInfo] Avoid crashing on an invalid section identifier. A DWARFSectionKind is read from input. It is not validated on parsing, so an unexpected value may result in reaching llvm_unreachable() in DWARFUnitIndex::getColumnHeader() when dumping the index section. Differential Revision: https://reviews.llvm.org/D75609	2020-03-05 10:54:43 +07:00
QingShan Zhang	5321b1b59f	[DAGCombine] Check the uses of negated floating constant and remove the hack PowerPC hits an assertion due to somewhat the same reason as https://reviews.llvm.org/D70975. Though there are already some hack, it still failed with some case, when the operand 0 is NOT a const fp, it is another fma that with const fp. And that const fp is negated which result in multi-uses. A better fix is to check the uses of the negated const fp. If there are already use of its negated value, we will have benefit as no extra Node is added. Differential revision: https://reviews.llvm.org/D75501	2020-03-05 03:42:50 +00:00
Jim Lin	efbb9a592b	[AVR][NFC] Use Register instead of unsigned Summary: Use Register type for variables instead of unsigned type. Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75595	2020-03-05 11:38:24 +08:00
Greg Clayton	8024597fa7	Fix buildbots with merge that didn't happen for 4050b01ba9ece02721ec496383baee219ca8cc2b.	2020-03-04 19:28:24 -08:00
Greg Clayton	990fd897af	Fix GSYM tests to run the yaml files and fix test failures on some machines. YAML files were not being run during lit testing as there was no lit.local.cfg file. Once this was fixed, some buildbots would fail due to a StringRef that pointed to a std::string inside of a temporary llvm::Triple object. These issues are fixed here by making a local triple object that stays around long enough so the StringRef points to valid data. Fixed memory sanitizer bot bugs as well. Differential Revision: https://reviews.llvm.org/D75390	2020-03-04 19:14:08 -08:00
hsmahesha	0a487e92a9	AMDGPU/GlobalISel: Support llvm.trap and llvm.debugtrap intrinsics Summary: Lower trap and debugtrap intrinsics to AMDGPU machine instruction(s). Reviewers: arsenm, nhaehnle, kerbowa, cdevadas, t-tye, kzhuravl Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, yaxunl, rovka, dstuttard, tpr, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74688	2020-03-05 08:16:57 +05:30
Shengchen Kan	159cc6860a	[X86] Add a private member function determinePaddingPrefix for X86AsmBackend Summary: X86 can reduce the bytes of NOP by padding instructions with prefixes to get a better peformance in some cases. So a private member function `determinePaddingPrefix` is added to determine which prefix is the most suitable. Reviewers: annita.zhang, reames, MaskRay, craig.topper, LuoYuanke, jyknight Reviewed By: reames Subscribers: llvm-commits, dexonsmith, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D75357	2020-03-05 09:26:33 +08:00
Philip Reames	836c9c8d77	[X86] Relax existing instructions to reduce the number of nops needed for alignment purposes If we have an explicit align directive, we currently default to emitting nops to fill the space. As discussed in the context of the prefix padding work for branch alignment (D72225), we're allowed to play other tricks such as extending the size of previous instructions instead. This patch will convert near jumps to far jumps if doing so decreases the number of bytes of nops needed for a following align. It does so as a post-pass after relaxation is complete. It intentionally works without moving any labels or doing anything which might require another round of relaxation. The point of this patch is mainly to mock out the approach. The optimization implemented is real, and possibly useful, but the main point is to demonstrate an approach for implementing such "pad previous instruction" approaches. The key notion in this patch is to treat padding previous instructions as an optional optimization, not as a core part of relaxation. The benefit to this is that we avoid the potential concern about increasing the distance between two labels and thus causing further potentially non-local code grown due to relaxation. The downside is that we may miss some opportunities to avoid nops. For the moment, this patch only implements a small set of existing relaxations.. Assuming the approach is satisfactory, I plan to extend this to a broader set of instructions where there are obvious "relaxations" which are roughly performance equivalent. Note that this patch doesn't change which instructions are relaxable. We may wish to explore that separately to increase optimization opportunity, but I figured that deserved it's own separate discussion. There are possible downsides to this optimization (and all "pad previous instruction" variants). The major two are potentially increasing instruction fetch and perturbing uop caching. (i.e. the usual alignment risks) Specifically: * If we pad an instruction such that it crosses a fetch window (16 bytes on modern X86-64), we may cause the decoder to have to trigger a fetch it wouldn't have otherwise. This can effect both decode speed, and icache pressure. * Intel's uop caching have particular restrictions on instruction combinations which can fit in a particular way. By moving around instructions, we can both cause misses an change misses into hits. Many of the most painful cases are around branch density, so I don't expect this to be too bad on the whole. On the whole, I expect to see small swings (i.e. the typical alignment change problem), but nothing major or systematic in either direction. Differential Revision: https://reviews.llvm.org/D75203	2020-03-04 16:52:35 -08:00
Matt Arsenault	7b8e05d66e	Add constexpr to DenormalMode constructors This will allow their use in member initializers in a future commit.	2020-03-04 18:46:46 -05:00
Matt Arsenault	bb3d51d74e	X86: Generate mir checks in sqrt test	2020-03-04 18:46:46 -05:00
Stefan Gränitz	81cfd93bf4	[ORC] Decompose LazyCallThroughManager::callThroughToSymbol() Summary: Decompose callThroughToSymbol() into findReexport(), resolveSymbol(), notifyResolved() and reportCallThroughError(). This allows derived classes to reuse the functionality while adding their own code in between. Reviewers: lhames Reviewed By: lhames Subscribers: hiraditya, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75084	2020-03-05 00:24:23 +01:00
Craig Topper	9526630da6	[X86] Convert vXi1 vectors to xmm/ymm/zmm types via getRegisterTypeForCallingConv rather than using CCPromoteToType in the td file Previously we tried to promote these to xmm/ymm/zmm by promoting in the X86CallingConv.td file. But this breaks when we run out of xmm/ymm/zmm registers and need to fall back to memory. We end up trying to create a non-sensical scalar to vector. This lead to an assertion. The new tests in avx512-calling-conv.ll all trigger this assertion. Since we really want to treat these types like we do on avx2, it seems better to promote them before the calling convention code gets involved. Except when the calling convention is one that passes the vXi1 type in a k register. The changes in avx512-regcall-Mask.ll are because we indicated that xmm/ymm/zmm types should be passed indirectly for the Win64 ABI before we go to the common lines that promoted the vXi1 types. This caused the promoted types to be picked up by the default calling convention code. Now we promote them earlier so they get passed indirectly as though they were xmm/ymm/zmm. Differential Revision: https://reviews.llvm.org/D75154	2020-03-04 15:02:32 -08:00
shafik	1ac2fd4baf	[dsymutil] Fix template stripping in getDIENames(...) to account for overloaded operators Currently dsymutil when generating accelerator tables will attempt to strip the template parameters from names for subroutines. For some overload operators which contain < in their names e.g. operator< the current method ends up stripping the operator name as well, we just end up with the name operator in the table for each case. Differential Revision: https://reviews.llvm.org/D75545	2020-03-04 14:54:31 -08:00
Craig Topper	25a844d351	[X86] Disable commuting for the first source operand of zero masked scalar fma intrinsic instructions. I believe this is the correct fix for D75506 rather than disabling all commuting. We can still commute the remaining two sources. Differential Revision:m https://reviews.llvm.org/D75526	2020-03-04 14:35:53 -08:00
Matt Arsenault	b25fbd212d	AMDGPU: Remove VOP3OpSelMods0 complex pattern Use default operand of 0 instead.	2020-03-04 17:18:22 -05:00

1 2 3 4 5 ...

192962 Commits