llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
David Green	3bac3332a1	[llvm-mca] Add a -mattr flag This adds a -mattr flag to llvm-mca, for cases where the -mcpu option does not contain all optional features. Differential Revision: https://reviews.llvm.org/D68190 llvm-svn: 373358	2019-10-01 17:41:38 +00:00
Vedant Kumar	60bb1d584e	[ReleaseProcess] Document requirement to set MACOSX_DEPLOYMENT_TARGET llvm-svn: 373356	2019-10-01 17:10:45 +00:00
Philip Reames	262d461975	[IndVars] An implementation of loop predication without a need for speculation This patch implements a variation of a well-known technique for JIT compilers - we have an implementation in tree as LoopPredication - but with an interesting twist. This version does not assume the ability to execute a path which wasn't taken in the original program (such as a guard or widenable.condition intrinsic). The benefit is that this works for arbitrary IR from any frontend (including C/C++/Fortran). The tradeoff is that it's restricted to read only loops without implicit exits. This builds on SCEV, and can thus eliminate the loop varying portion of any early exit where all exits are understandable by SCEV. A key advantage is that fixing deficiencies exposed in SCEV - already found one while writing test cases - will also benefit all of full redundancy elimination (and most other loop transforms). I haven't seen anything in the literature which quite matches this. Given that, I'm not entirely sure that keeping the name "loop predication" is helpful. Anyone have suggestions for a better name? This is analogous to partial redundancy elimination - since we remove the condition flowing around the backedge - and has some parallels to our existing transforms which try to make conditions invariant in loops. Factoring wise, I chose to put this in IndVarSimplify since it's generally applicable to all workloads. I could split this off into its own pass, but we'd then probably want to add that new pass every place we use IndVars. One solid argument for splitting it off into its own pass is that this transform is "too good". It breaks a huge number of existing IndVars test cases as they tend to be simple read only loops. At the moment, I've opted it off by default, but if we add this to IndVars and enable, we'll have to update around 20 test files to add side effects or disable this transform. 
Near term plan is to fuzz this extensively while off by default, reflect and discuss on the factoring issue mentioned just above, and then enable by default. I also need to give some thought to supporting widenable conditions in this framing. Differential Revision: https://reviews.llvm.org/D67408 llvm-svn: 373351	2019-10-01 17:03:44 +00:00
Matt Arsenault	40d9d9ea98	AMDGPU/GlobalISel: Increase max legal size to 1024 There are 1024 bit register classes defined for AGPRs. Additionally OpenCL defines vectors up to 16 x i64, and this helps those tests legalize. llvm-svn: 373350	2019-10-01 16:35:06 +00:00
Craig Topper	077ba42c7b	[X86] Add a VBROADCAST_LOAD ISD opcode representing a scalar load broadcasted to a vector. Summary: This adds the ISD opcode and a DAG combine to create it. There are probably some places where we can directly create it, but I'll leave that for future work. This updates all of the isel patterns to look for this new node. I had to add a few additional isel patterns for aligned extloads which we should probably fix with a DAG combine or something. This does mean that the broadcast load folding for avx512 can no longer match a broadcasted aligned extload. There's still some work to do here for combining a broadcast of a broadcast_load. We also need to improve extractelement or demanded vector elements of a broadcast_load. I'll try to get those done before I submit this patch. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68198 llvm-svn: 373349	2019-10-01 16:28:20 +00:00
Jay Foad	e112e8cd7d	[AMDGPU] Add VerifyScheduling support. Summary: This is cut and pasted from the corresponding GenericScheduler functions. Reviewers: arsenm, atrick, tstellar, vpykhtin Subscribers: MatzeB, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68264 llvm-svn: 373346	2019-10-01 15:45:47 +00:00
Simon Pilgrim	6bc0c00509	[DAG][X86] Convert isNegatibleForFree/GetNegatedExpression to a target hook (PR42863) This patch converts the DAGCombine isNegatibleForFree/GetNegatedExpression into overridable TLI hooks. The intention is to let us extend existing FNEG combines to work more generally with negatible float ops, allowing it to work with target specific combines and opcodes (e.g. X86's FMA variants). Unlike the SimplifyDemandedBits, we can't just handle target nodes through a Target callback, we need to do this as an override to allow targets to handle generic opcodes as well. This does mean that the target implementations have to duplicate some checks (recursion depth etc.). Partial reversion of rL372756 - I've identified the infinite loop issue inside the X86 override but haven't fixed it yet so I've only (re)committed the common TargetLowering refactoring part of the patch. Differential Revision: https://reviews.llvm.org/D67557 llvm-svn: 373343	2019-10-01 15:32:04 +00:00
Jakub Kuderski	c1f3d6cc06	[Dominators][CodeGen] Add MachinePostDominatorTree verification Summary: This patch implements Machine PostDominator Tree verification and ensures that the verification doesn't fail the in-tree tests. MPDT verification can be enabled using `verify-machine-dom-info` -- the same flag used by Machine Dominator Tree verification. Flipping the flag revealed that MachineSink falsely claimed to preserve CFG and MDT/MPDT. This patch fixes that. Reviewers: arsenm, hliao, rampitec, vpykhtin, grosser Reviewed By: hliao Subscribers: wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68235 llvm-svn: 373341	2019-10-01 15:23:27 +00:00
Simon Pilgrim	947cafdb7b	Revert rL349624 : Let TableGen write output only if it changed, instead of doing so in cmake, attempt 2 Differential Revision: https://reviews.llvm.org/D55842 ----------------- As discussed on PR43385 this is causing Visual Studio msbuilds to perpetually rebuild all tablegen generated files llvm-svn: 373338	2019-10-01 13:39:43 +00:00
Djordje Todorovic	3ef644a00f	Revert "Reland "[utils] Implement the llvm-locstats tool"" This reverts commit rL373317 due to test failure on the clang-s390x-linux build bot. llvm-svn: 373336	2019-10-01 13:21:15 +00:00
David Bolvansky	bfbcc92d79	Revert [InstCombine] sprintf(dest, "%s", str) -> memccpy(dest, str, 0, MAX) Seems to be slower than memcpy + strlen. llvm-svn: 373335	2019-10-01 13:19:04 +00:00
David Bolvansky	0f61b8fdbf	[InstCombine] sprintf(dest, "%s", str) -> memccpy(dest, str, 0, MAX) llvm-svn: 373333	2019-10-01 13:03:10 +00:00
Michal Gorny	0b0a55e861	[llvm-exegesis/lib] Fix missing linkage to MCParser Otherwise, shared-lib build fails with: lib64/libLLVMExegesis.a(SnippetFile.cpp.o): In function `llvm::exegesis::readSnippets(llvm::exegesis::LLVMState const&, llvm::StringRef)': SnippetFile.cpp:(.text._ZN4llvm8exegesis12readSnippetsERKNS0_9LLVMStateENS_9StringRefE+0x31f): undefined reference to `llvm::createMCAsmParser(llvm::SourceMgr&, llvm::MCContext&, llvm::MCStreamer&, llvm::MCAsmInfo const&, unsigned int)' SnippetFile.cpp:(.text._ZN4llvm8exegesis12readSnippetsERKNS0_9LLVMStateENS_9StringRefE+0x41c): undefined reference to `llvm::MCAsmParser::setTargetParser(llvm::MCTargetAsmParser&)' collect2: error: ld returned 1 exit status llvm-svn: 373332	2019-10-01 13:02:48 +00:00
Sam Parker	c6b7001dcc	[NFC][ARM][MVE] More tests Add some tail predication tests with fast math. llvm-svn: 373331	2019-10-01 13:02:14 +00:00
Simon Pilgrim	5291c85727	DIExpression::createFragmentExpression - silence static analyzer DIExpression* null dereference warning with an assertion. NFCI. llvm-svn: 373326	2019-10-01 11:25:38 +00:00
Simon Pilgrim	54d6405d6c	VirtualFileSystem - replace dyn_cast<>+assert with cast<> calls. NFCI. Silences a number of clang static analyzer null dereference warnings. llvm-svn: 373325	2019-10-01 11:25:29 +00:00
Simon Pilgrim	e2c0fb5218	ObjectFile makeTriple - silence static analyzer dyn_cast<COFFObjectFile> null dereference warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<COFFObjectFile> directly and if not assert will fire for us. llvm-svn: 373324	2019-10-01 11:25:17 +00:00
Simon Pilgrim	b86cf28787	InstrProf - avoid static analyzer dyn_cast<ConstantInt> null dereference warning. The static analyzer is warning about a potential null dereference, as we're already earlying-out for a null Constant pointer I've just folded this into a dyn_cast_or_null<ConstantInt>. No test case, this is by inspection only. llvm-svn: 373322	2019-10-01 10:38:30 +00:00
Simon Pilgrim	48776a7651	ConstantFold - ConstantFoldSelectInstruction - assume constant vector elements are constant. NFCI. Goes a bit further than rL372743 which added the early out - elements should be Constant so use cast<Constant> instead (and rely on the assert if anything fails). llvm-svn: 373321	2019-10-01 10:22:01 +00:00
George Rimar	1323de275a	[obj2yaml] - Fix BB after r373315. The success return value for data extractor's cursor should also be checked. llvm-svn: 373319	2019-10-01 10:02:47 +00:00
Djordje Todorovic	cecb4ad7fc	Reland "[utils] Implement the llvm-locstats tool" The tool reports verbose output for the DWARF debug location coverage. For each variable or formal parameter DIE, llvm-locstats computes the percentage of the code section bytes where it is in scope for which it has a location description. Line 0 shows the number (and the percentage) of DIEs with no location information, while line 100 shows the number (and the percentage) of DIEs where there is location information in all code section bytes (where the variable or parameter is in scope). Line 50..59 shows the number (and the percentage) of DIEs where the location information covers between 50 and 59 percent of its scope. Differential Revision: https://reviews.llvm.org/D66526 llvm-svn: 373317	2019-10-01 09:59:15 +00:00
George Rimar	1513465226	[yaml2obj] - Allow specifying custom Link values for SHT_HASH section. This allows setting any sh_link values for SHT_HASH sections. Differential revision: https://reviews.llvm.org/D68214 llvm-svn: 373316	2019-10-01 09:54:40 +00:00
George Rimar	9dc51d1303	[yaml2obj/obj2yaml] - Add support for SHT_HASH sections. SHT_HASH specification is: http://www.sco.com/developers/gabi/latest/ch5.dynamic.html#hash In short the format is the following: it has 2 uint32 fields in its header: nbucket and nchain followed by (nbucket + nchain) uint32 values. This patch allows dumping and parsing such sections. Differential revision: https://reviews.llvm.org/D68085 llvm-svn: 373315	2019-10-01 09:45:59 +00:00
Diana Picus	3533492850	Fixup r373278: Move test to X86 directory ...since it's using an x86 triple. llvm-svn: 373314	2019-10-01 09:27:20 +00:00
Clement Courbet	133859bfc5	[llvm-exegesis][NFC] Refactor X86 tests fixtures into a base class. Reviewers: gchatelet, a.sidorin Subscribers: tschuett, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68262 llvm-svn: 373313	2019-10-01 09:20:36 +00:00
Dmitri Gribenko	0235033ca1	Revert "[OCaml] Handle nullptr in Llvm.global_initializer" This reverts commit r373299. It broke tests: http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/18485 llvm-svn: 373311	2019-10-01 08:29:07 +00:00
Dmitri Gribenko	e08712db9b	Revert "GlobalISel: Handle llvm.read_register" This reverts commit r373294. It broke Clang's CodeGen/arm64-microsoft-status-reg.cpp: http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/18483 llvm-svn: 373310	2019-10-01 08:24:01 +00:00
Sam Parker	85dccb5a7b	[NFC][HardwareLoops] Update some iterators llvm-svn: 373309	2019-10-01 07:53:28 +00:00
Craig Topper	8b8482596d	[X86] Consider isCodeGenOnly in the EVEX2VEX pass to make VMAXPD/PS map to the non-commutable VEX instruction. Use EVEX2VEX override to fix the scalar instructions. Previously the match was ambiguous and VMAXPS/PD and VMAXCPS/PD were mapped to the same VEX instruction. But we should keep the commutableness when changing the opcode. llvm-svn: 373303	2019-10-01 07:10:09 +00:00
Heejin Ahn	46235d2c6f	[WebAssembly] Make sure EH pads are preferred in sorting Summary: In CFGSort, we try to make EH pads have higher priorities as soon as they are ready to be sorted, to prevent creation of unwind destination mismatches in CFGStackify. We did that by making priority queues' comparison function prefer EH pads, but it was possible for an EH pad to be popped from `Preferred` queue and then not sorted immediately and enter `Ready` queue instead in a certain condition. This patch makes sure that special condition does not consider EH pads as its candidates. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68229 llvm-svn: 373302	2019-10-01 06:53:28 +00:00
Heejin Ahn	168270d8a6	[WebAssembly] Unstackify regs after fixing unwinding mismatches Summary: Fixing unwind mismatches for exception handling can result in splicing existing BBs and moving some of the instructions to new BBs. In this case some of the stackified def registers in the original BB can be used in the split BB. For example, we have this BB and suppose %r0 is a stackified register. ``` bb.1: %r0 = call @foo ... use %r0 ... ``` After fixing unwind mismatches in CFGStackify, `bb.1` can be split and some instructions can be moved to a newly created BB: ``` bb.1: %r0 = call @foo bb.split (new): ... use %r0 ... ``` In this case we should make %r0 un-stackified, because its use is now in another BB. When splitting a BB, this CL unstackifies all def registers that have uses in the new split BB. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68218 llvm-svn: 373301	2019-10-01 06:21:53 +00:00
Aditya Kumar	606b1e6641	[OCaml] Handle nullptr in Llvm.global_initializer LLVMGetInitializer returns nullptr in case there is no initializer. There is not much that can be done with nullptr in OCaml, not even test if it is null. Also, there does not seem to be a C or OCaml API to test if there is an initializer. So this diff changes Llvm.global_initializer to return an option. Differential Revision: https://reviews.llvm.org/D65195 Reviewed by: whitequark Authored by: kren1 llvm-svn: 373299	2019-10-01 03:45:09 +00:00
Matt Arsenault	bb7496a929	AMDGPU/GlobalISel: Select s1 src G_SITOFP/G_UITOFP llvm-svn: 373298	2019-10-01 02:23:20 +00:00
Yuanfang Chen	efb78fa23a	Remove an undefined constructor introduced by r373244. llvm-svn: 373297	2019-10-01 02:08:14 +00:00
Matt Arsenault	58ef717b2b	AMDGPU/GlobalISel: Add support for init.exec intrinsics The existing wave32 behavior seems broken and incomplete, but this reproduces it. llvm-svn: 373296	2019-10-01 02:07:25 +00:00
Matt Arsenault	181dd44e96	AMDGPU/GlobalISel: Allow scc/vcc alternative mappings for s1 constants llvm-svn: 373295	2019-10-01 02:07:19 +00:00
Matt Arsenault	7ccd848288	GlobalISel: Handle llvm.read_register SelectionDAG has a bunch of machinery to defer this to selection time for some reason. Just directly emit a copy during IRTranslator. The x86 usage does somewhat questionably check hasFP, which could depend on the whole function being at minimum translated. This does lose the convergent bit if the callsite had it, which may be a problem. We also lose that in general for intrinsics, which may also be a problem. llvm-svn: 373294	2019-10-01 02:07:16 +00:00
Matt Arsenault	99f21e87cc	AMDGPU/GlobalISel: Avoid creating shift of 0 in arg lowering This is sort of papering over the fact that we don't run a combiner anywhere, but avoiding creating 2 instructions in the first place is easy. llvm-svn: 373293	2019-10-01 01:44:46 +00:00
Matt Arsenault	5fbe025edb	TLI: Remove DAG argument from getRegisterByName Replace with the MachineFunction. X86 is the only user, and only uses it for the function. This removes one obstacle from using this in GlobalISel. The other is the more tolerable EVT argument. The X86 use of the function seems questionable to me. It checks hasFP, before frame lowering. llvm-svn: 373292	2019-10-01 01:44:39 +00:00
Fangrui Song	9112a48c7b	[llvm-readobj/llvm-readelf] Delete --arm-attributes (alias for --arch-specific) D68110 added --arch-specific (supported by GNU readelf) and made --arm-attributes an alias for it. The tests were later migrated to use --arch-specific. Note, llvm-readelf --arch-specific currently just uses llvm-readobj style output for ARM attributes. The readelf-style output is not implemented. Reviewed By: compnerd, kongyi, rupprecht Differential Revision: https://reviews.llvm.org/D68196 llvm-svn: 373291	2019-10-01 01:31:15 +00:00
Craig Topper	5dcc2cdb53	[X86] Add test case to show missed opportunity to shrink a constant index to a gather in order to avoid splitting. Also add a test case for an index that could be shrunk, but would create a narrow type. We can go ahead and do it; we just need to be before type legalization. Similar test cases for scatter as well. llvm-svn: 373290	2019-10-01 01:27:52 +00:00
Matt Arsenault	8e2a689581	AMDGPU/GlobalISel: Select G_UADDO/G_USUBO llvm-svn: 373288	2019-10-01 01:23:13 +00:00
Matt Arsenault	23ad46189f	GlobalISel: Implement widenScalar for G_SITOFP/G_UITOFP sources Legalize 16-bit G_SITOFP/G_UITOFP for AMDGPU. llvm-svn: 373287	2019-10-01 01:06:48 +00:00
Matt Arsenault	5c1daa07b2	AMDGPU/GlobalISel: Legalize G_GLOBAL_VALUE Handle other cases besides LDS. Mostly a straight port of the existing handling, without the intermediate custom nodes. llvm-svn: 373286	2019-10-01 01:06:43 +00:00
David Blaikie	84bfbd29ff	DebugInfo: Add parsing support for debug_loc base address specifiers llvm-svn: 373278	2019-10-01 00:29:13 +00:00
Evandro Menezes	beefde3233	[SimplifyLibCalls] Define the value of the Euler number This patch fixes the build break on Windows hosts. There must be a better way of accessing the equivalent POSIX math constant `M_E`. llvm-svn: 373274	2019-09-30 23:21:02 +00:00
David Blaikie	d0e7211801	DebugInfo: Simplify section label caching/usage llvm-svn: 373273	2019-09-30 23:19:10 +00:00
Amaury Sechet	bbfbfdfe83	Add partial bswap test to the X86 backend. NFC llvm-svn: 373271	2019-09-30 22:52:28 +00:00
Amaury Sechet	a5458d458c	[DAGCombiner] Clang format MatchRotate. NFC llvm-svn: 373269	2019-09-30 21:41:52 +00:00
David Blaikie	ebe369bba1	Remove else-after-return llvm-svn: 373266	2019-09-30 21:02:06 +00:00
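
The SHT_HASH layout summarized in commit 9dc51d1303 above (a header of two uint32 fields, nbucket and nchain, followed by nbucket + nchain uint32 values) can be illustrated with a small sketch. This is not the yaml2obj/obj2yaml code: the function name `parse_sht_hash` is hypothetical, and little-endian byte order is assumed by default, since the commit message does not state an endianness.

```python
import struct


def parse_sht_hash(data: bytes, little_endian: bool = True):
    """Split raw SHT_HASH section bytes into (buckets, chains).

    Hypothetical helper for illustration only. The byte order is an
    assumption; a real tool would take it from the ELF header.
    """
    order = "<" if little_endian else ">"
    if len(data) < 8:
        raise ValueError("section too small for the nbucket/nchain header")

    # Header: two uint32 fields, nbucket then nchain.
    nbucket, nchain = struct.unpack_from(order + "II", data, 0)

    # Body: (nbucket + nchain) uint32 values, buckets first.
    expected = 8 + 4 * (nbucket + nchain)
    if len(data) != expected:
        raise ValueError(f"expected {expected} bytes, got {len(data)}")

    values = struct.unpack_from(f"{order}{nbucket + nchain}I", data, 8)
    return list(values[:nbucket]), list(values[nbucket:])


# Build a tiny section with 3 buckets and 2 chain entries, then parse it back.
raw = struct.pack("<7I", 3, 2, 10, 20, 30, 40, 50)
buckets, chains = parse_sht_hash(raw)
```

The size check rejects sections whose length does not match the header counts, which is one plausible way to detect a malformed section; how the actual tools handle that case is not described in the log.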

185851 Commits