llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Adrian Prantl	2f0f7953de	Generalize llvm::replaceDbgDeclare and actually support the use-case that is mentioned in the documentation (inserting a deref before the plus_uconst). llvm-svn: 320203	2017-12-08 21:58:18 +00:00
Florian Hahn	aac67b4df6	[CodeExtractor] Add debug locations for new call and branch instrs. Summary: If a partially inlined function has debug info, we have to add debug locations to the call instruction calling the outlined function. We use the debug location of the first instruction in the outlined function, as the introduced call transfers control to this statement and there is no other equivalent line in the source code. We also use the same debug location for the branch instruction added to jump from artificial entry block for the outlined function, which just jumps to the first actual basic block of the outlined function. Reviewers: davide, aprantl, rriddle, dblaikie, danielcdh, wmi Reviewed By: aprantl, rriddle, danielcdh Subscribers: eraman, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D40413 llvm-svn: 320199	2017-12-08 21:49:03 +00:00
Dan Gohman	7819b72558	[WebAssembly] Reapply r319186: "Support bitcasted function addresses with varargs." This puts the functionality under control of a command-line option which is off by default to avoid breaking existing setups. llvm-svn: 320197	2017-12-08 21:27:00 +00:00
Dan Gohman	b714195257	[WebAssemby] Re-apply r320041: "Support main functions with alternate signatures." This includes a fix so that it doesn't transform declarations, and it puts the functionality under control of a command-line option which is off by default to avoid breaking existing setups. llvm-svn: 320196	2017-12-08 21:18:21 +00:00
Evandro Menezes	40a55dc70f	[AArch64] Add Exynos to host detection Differential revision: https://reviews.llvm.org/D40985 llvm-svn: 320195	2017-12-08 21:09:59 +00:00
Konstantin Zhuravlyov	9b3981c8d5	AMDGPU/GCN: Bring processors in sync with AMDGPUUsage - Add gfx704 - Change bonaire to gfx704 - Remove gfx804 - Remove gfx901 - Remove gfx903 Differential Revision: https://reviews.llvm.org/D40046 llvm-svn: 320194	2017-12-08 20:52:28 +00:00
Simon Pilgrim	a4c877ae58	[X86] CMOV pseudo instructions shouldn't need scheduling info as they should be lowered early llvm-svn: 320193	2017-12-08 20:42:35 +00:00
Simon Pilgrim	bea62f28c1	[X86][X87] Tag x87 load/store instructions scheduler classes llvm-svn: 320192	2017-12-08 20:31:48 +00:00
Craig Topper	de19ce3756	[X86] Teach lowering to only let through (insert_subvector (vXi1 zeros), subvec, 0) for vector sizes that have native KSHIFT support. For narrow sizes we'll widen the zero vector and widen the insert. Then do an extract_subvector to get back down to correct size. This allows us to remove some patterns from the isel table that had to COPY_TO_REGCLASS to an oversized register, do the shift and then COPY_TO_REGCLASS back to the narrow register. Now this is represented explicitly in the DAG. This seems to have perturbed the register allocation in one of the tests, but the number of instructions didn't change. llvm-svn: 320190	2017-12-08 20:10:33 +00:00
Simon Pilgrim	7071d8986f	[X86][X87] Tag x87 float compare instructions scheduler classes llvm-svn: 320189	2017-12-08 20:10:31 +00:00
Matt Arsenault	31e8314d0b	AMDGPU: Set IntrReadMem on memtime intrinsics llvm-svn: 320188	2017-12-08 20:01:02 +00:00
Matt Arsenault	76f90573fb	AMDGPU: image_getlod and image_getresinfo do not read memory llvm-svn: 320187	2017-12-08 20:00:57 +00:00
Matt Arsenault	071224ac12	AMDGPU: Preserve MMO in adjustWritemask Follow up to r319705. Currently the MMO is produced after this in the custom inserter, so this doesn't change anything yet. llvm-svn: 320186	2017-12-08 20:00:45 +00:00
Xinliang David Li	b3f9d82232	Revert r320104: infinite loop profiling bug fix Causes unexpected memory issue with New PM this time. The new PM invalidates BPI but not BFI, leaving the reference to BPI from BFI invalid. Abandon this patch. There is a more general solution which also handles runtime infinite loop (but not statically). llvm-svn: 320180	2017-12-08 19:38:07 +00:00
Brian M. Rzycki	35d2478535	[JumpThreading] Minor comment cleanup. NFC. (test commit) llvm-svn: 320179	2017-12-08 19:36:32 +00:00
Simon Pilgrim	6642e6b86e	[X86][MPX] Tag TSX/HLE/SGX instructions scheduler classes Currently tagged these as system instructions. llvm-svn: 320177	2017-12-08 19:26:22 +00:00
Konstantin Zhuravlyov	1f1d5b7879	AMDGPU: Report Arg's Value name in metadata if kernel_arg_name metadata is not available Differential Revision: https://reviews.llvm.org/D40924 llvm-svn: 320176	2017-12-08 19:22:12 +00:00
Michael Trent	4d0db94081	Reverting r320166 to fix test failures. llvm-svn: 320174	2017-12-08 19:09:26 +00:00
Simon Pilgrim	808f8d4de2	[X86][MPX] Tag MPX instructions scheduler classes Currently tagged these as system instructions, once we have uses for them (ASAN?) and they are faster we will need to improve on this. llvm-svn: 320173	2017-12-08 19:03:42 +00:00
Sanjay Patel	cd710da732	[x86] use hasAVX2() rather than hasInt256(); NFC These are aliases, but the thing we're checking here is that the target has vpsllv*, not that the data type is 256-bit. Those instructions exist for 128-bit vectors too...but sadly, not for all element sizes. llvm-svn: 320170	2017-12-08 18:35:51 +00:00
Simon Pilgrim	10899212ed	[X86] Tag move immediate instructions scheduler classes llvm-svn: 320169	2017-12-08 18:35:40 +00:00
Michael Trent	09dad69aba	Updated llvm-objdump to display local relocations in Mach-O binaries Summary: llvm-objdump's Mach-O parser was updated in r306037 to display external relocations for MH_KEXT_BUNDLE file types. This change extends the Macho-O parser to display local relocations for MH_PRELOAD files. When used with the -macho option relocations will be displayed in a historical format. rdar://35778019 Reviewers: enderby Reviewed By: enderby Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40867 llvm-svn: 320166	2017-12-08 17:51:04 +00:00
Simon Pilgrim	e0407a99f3	[X86][SHA] Tag SHA instructions scheduler classes Put these under VecIMul itinerary classes for now - seems to be a good average value llvm-svn: 320161	2017-12-08 16:38:41 +00:00
Simon Pilgrim	0fcfdac391	[X86] Tag VIA PadLock crypto instructions scheduler classes llvm-svn: 320159	2017-12-08 16:06:40 +00:00
Simon Pilgrim	9ab46ba243	[X86] Tag PKU/INVPCID/RDPID/SMAP/SMX/PTWRITE system instructions scheduler classes llvm-svn: 320158	2017-12-08 15:48:37 +00:00
Alexey Bataev	1d998a57b0	[InstCombine] PR35354: Convert store(bitcast, load bitcast (select (Cond, &V1, &V2)) --> store (, load (select(Cond, load &V1, load &V2))) Summary: If we have the code like this: ``` float a, b; a = std::max(a ,b); ``` it is converted into something like this: ``` %call = call dereferenceable(4) float* @_ZSt3maxIfERKT_S2_S2_(float* nonnull dereferenceable(4) %a.addr, float* nonnull dereferenceable(4) %b.addr) %1 = bitcast float* %call to i32* %2 = load i32, i32* %1, align 4 %3 = bitcast float* %a.addr to i32* store i32 %2, i32* %3, align 4 ``` After inlinning this code is converted to the next: ``` %1 = load float, float* %a.addr %2 = load float, float* %b.addr %cmp.i = fcmp fast olt float %1, %2 %__b.__a.i = select i1 %cmp.i, float* %a.addr, float* %b.addr %3 = bitcast float* %__b.__a.i to i32* %4 = load i32, i32* %3, align 4 %5 = bitcast float* %arrayidx to i32* store i32 %4, i32* %5, align 4 ``` This pattern is not recognized as minmax pattern. Patch solves this problem by converting sequence ``` store (bitcast, (load bitcast (select ((cmp V1, V2), &V1, &V2)))) ``` to a sequence ``` store (,load (select((cmp V1, V2), &V1, &V2))) ``` After this the code is recognized as minmax pattern. Reviewers: RKSimon, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40304 llvm-svn: 320157	2017-12-08 15:32:10 +00:00
Simon Pilgrim	8d591300e2	[X86][AVX512] Tag CLWB instruction to CLFLUSH/PREFETCH scheduler class llvm-svn: 320156	2017-12-08 15:19:10 +00:00
Simon Pilgrim	9b6cd76713	[X86][AVX512] Tag AVX512_512_SEXT_MASK_* instructions scheduler classes Match VPTERNLOG which these pseudos will eventually alias to llvm-svn: 320154	2017-12-08 15:17:32 +00:00
Tim Renouf	108460f836	[AMDGPU] add labels to +DumpCode output Summary: +DumpCode is a hack to embed disassembly in the ELF file. This commit fixes it to include labels, to make it slightly more useful. Reviewers: arsenm, kzhuravl Subscribers: nhaehnle, timcorringham, dstuttard, llvm-commits, t-tye, yaxunl, wdng, kzhuravl Differential Revision: https://reviews.llvm.org/D40169 llvm-svn: 320146	2017-12-08 14:09:34 +00:00
Max Kazantsev	43be39711f	[NFC] Rename variable from Cond to Pred to make it more sound llvm-svn: 320144	2017-12-08 12:54:32 +00:00
Max Kazantsev	b55d0a6618	[SCEV] Fix predicate usage in computeExitLimitFromICmp In this method, we invoke `SimplifyICmpOperands` which takes the `Cond` predicate by reference and may change it along with `LHS` and `RHS` SCEVs. But then we invoke `computeShiftCompareExitLimit` with Values from which the SCEVs have been derived, these Values have not been modified while `Cond` could be. One of possible outcomes of this is that we may falsely prove that an infinite loop ends within some finite number of iterations. In this patch, we save the original `Cond` and pass it along with original operands. This logic may be removed in future once `computeShiftCompareExitLimit` works with SCEVs instead of value operands. Reviewed By: sanjoy Differential Revision: https://reviews.llvm.org/D40953 llvm-svn: 320142	2017-12-08 12:19:45 +00:00
Francis Visoiu Mistrih	23a48f9c77	[CodeGen] Move printing MO_MachineBasicBlock operands to MachineOperand::print Work towards the unification of MIR and debug output by refactoring the interfaces. llvm-svn: 320141	2017-12-08 11:48:02 +00:00
Francis Visoiu Mistrih	f10deaa987	[CodeGen] Move printing MO_CImmediate operands to MachineOperand::print Work towards the unification of MIR and debug output by refactoring the interfaces. llvm-svn: 320140	2017-12-08 11:40:06 +00:00
Gadi Haber	55d27cdd65	[X86][Haswell]: Updating the scheduling information for the Haswell subtarget. Updated the scheduling information for the Haswell subtarget with the following changes: Regrouped the instructions after adding appropriate load + store latencies. Added scheduling for missing instructions such as the GATHER instrs. The changes were made after revisiting the latencies impact of all memory uOps. Reviewers: RKSimon, zvi, craig.topper, apilipenko Differential Revision: https://reviews.llvm.org/D40021 Change-Id: Iaf6c1f5169add1552845a8a566af4e5a359217a7 llvm-svn: 320137	2017-12-08 09:48:44 +00:00
Igor Laevsky	afa69531bc	[FuzzMutate] Correctly insert sinks and sources around invoke instructions Differential Revision: https://reviews.llvm.org/D40840 llvm-svn: 320136	2017-12-08 08:53:16 +00:00
Craig Topper	49ee09bdf6	[X86] Always consider inserting a vXi1 vector into the lsbs of a zero vector to be legal during lowering. Add isel patterns to emit shifts. Previously we only allowed these through if the subvector came from a compare or test instruction which we would again check for during isel. With this change we only check for the compare and test instructions during isel and have fallback patterns that emit the shifts if needed. I noticed that in a lot of cases we don't actually see the compare during lowering and rely on an odd legalization of concat_vectors with a zero vector as the second argument. This keeps the concat_vectors around long enough for a later dag combine to expose the compare then we re-legalize the concat_vectors and catch the compare. llvm-svn: 320134	2017-12-08 08:10:58 +00:00
Abderrazek Zaafrani	7af4e0afb5	[AArch64] Avoid SIMD interleaved store instruction for Exynos. Replace interleaved store instructions by equivalent and more efficient instructions based on latency cost model. Https://reviews.llvm.org/D38196 llvm-svn: 320123	2017-12-08 00:58:49 +00:00
Derek Schuff	25c8677cb5	Revert "[WebAssemby] Support main functions with alternate signatures." This reverts commit 959e37e669b0c3cfad4cb9f1f7c9261ce9f5e9ae. That commit doesn't handle the case where main is declared rather than defined, in particular the even-more special case where main is a prototypeless declaration (which is of course the one actually used by musl currently). llvm-svn: 320121	2017-12-08 00:39:54 +00:00
Craig Topper	1f33f8f1b9	[X86] Handle alls version of vXi1 insert_vector_elt with a constant index without falling back to shuffles. We previously only supported inserting to the LSB or MSB where it was easy to zero to perform an OR to insert. This change effectively extracts the old value and the new value, xors them together and then xors that single bit with the correct location in the original vector. This will cancel out the old value in the first xor leaving the new value in the position. The way I've implemented this uses 3 shifts and two xors and uses an additional register. We can avoid the additional register at the cost of another shift. llvm-svn: 320120	2017-12-08 00:16:09 +00:00
Craig Topper	674cb8ec15	[X86] Fix indentation. NFC llvm-svn: 320119	2017-12-08 00:15:57 +00:00
Bill Seurer	53615e1ec1	[PowerPC][asan] Update asan to handle changed memory layouts in newer kernels In more recent Linux kernels with 47 bit VMAs the layout of virtual memory for powerpc64 changed causing the address sanitizer to not work properly. This patch adds support for 47 bit VMA kernels for powerpc64 and fixes up test cases. https://reviews.llvm.org/D40907 There is an associated patch for compiler-rt. Tested on several 4.x and 3.x kernel releases. llvm-svn: 320109	2017-12-07 22:53:33 +00:00
Zachary Turner	1e778523f5	[DebugInfo] Fix register variables not showing up in pdb. Previously, when linking against libcmt from the MSVC runtime, lld-link /verbose would show "Ignoring unknown symbol record with kind 0x1006". It turns out this was because TypeIndexDiscovery did not handle S_REGISTER records, so these records were not getting properly remapped. Patch by: Alexnadre Ganea Differential Revision: https://reviews.llvm.org/D40919 llvm-svn: 320108	2017-12-07 22:51:16 +00:00
Alina Sbirlea	0e9a4ac953	[ModRefInfo] Make enum ModRefInfo an enum class [NFC]. Summary: Make enum ModRefInfo an enum class. Changes to ModRefInfo values should be done using inline wrappers. This should prevent future bit-wise opearations from being added, which can be more error-prone. Reviewers: sanjoy, dberlin, hfinkel, george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40933 llvm-svn: 320107	2017-12-07 22:41:34 +00:00
Eric Christopher	22c363f5c5	Temporarily revert "[PowerPC] Allow tail calls of fastcc functions from C CallingConv functions." It is causing sanitizer failures on llvm tests in a bootstrapped compiler. No bot link since it's currently down, but following up to get the bot up. This reverts commit r319218. llvm-svn: 320106	2017-12-07 22:26:19 +00:00
Xinliang David Li	736038bd8f	[PGO] detect infinite loop and form MST properly Differential Revision: http://reviews.llvm.org/D40873 llvm-svn: 320104	2017-12-07 22:23:28 +00:00
Jessica Paquette	6dd05e4ce4	[MachineOutliner] Fix offset overflow check The offset overflow check before was incorrect. It would always give the correct result, but it was comparing the SCALED potential fixed-up offset against an UNSCALED minimum/maximum. As a result, the outliner was missing a bunch of frame setup/destroy instructions that ought to have been safe to outline. This fixes that, and adds an instruction to the .mir test that failed the old test. llvm-svn: 320090	2017-12-07 21:51:43 +00:00
Mark Searles	3f47e53999	[AMDGPU] Revert "[AMDGPU] Add options for waitcnt pass debugging; add instr count in debug output." Patch caused a buildbot failure; http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/15733/steps/build_Lld/logs/stdio : lib/Target/AMDGPU/SIInsertWaitcnts.cpp:396:11: error: private field 'InstCnt' is not used [-Werror,-Wunused-private-field] int32_t InstCnt = 0; ^ 1 error generated. " This reverts commit 71627f79010aafe74fdcba901bba28dd7caa0869. llvm-svn: 320086	2017-12-07 21:14:41 +00:00
Mark Searles	335078bea8	[AMDGPU] Add options for waitcnt pass debugging; add instr count in debug output. -amdgpu-waitcnt-forcezero={1\|0} Force all waitcnt instrs to be emitted as s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0) -amdgpu-waitcnt-forceexp=<n> Force emit a s_waitcnt expcnt(0) before the first <n> instrs -amdgpu-waitcnt-forcelgkm=<n> Force emit a s_waitcnt lgkmcnt(0) before the first <n> instrs -amdgpu-waitcnt-forcevm=<n> Force emit a s_waitcnt vmcnt(0) before the first <n> instrs Differential Revision: https://reviews.llvm.org/D40091 llvm-svn: 320084	2017-12-07 20:36:39 +00:00
Mark Searles	33a8064970	[AMDGPU] Add GCNHazardRecognizer::checkInlineAsmHazards() and GCNHazardRecognizer::checkVALUHazardsHelper(). checkInlineAsmHazards() checks INLINEASM for hazards that we particularly care about (so not exhaustive); this patch adds a check for INLINEASM that defs vregs that hold data-to-be stored by immediately preceding store of more than 8 bytes. If the instr were not within an INLINEASM, this scenario would be handled by checkVALUHazard(). Add checkVALUHazardsHelper(), which will be called by both checkVALUHazards() and checkInlineAsmHazards(). Differential Revision: https://reviews.llvm.org/D40098 llvm-svn: 320083	2017-12-07 20:34:25 +00:00
Craig Topper	ec3fb09954	[X86] Fix InsertBitToMaskVector to only issue KSHIFTS of native size so that upper bits are properly zeroed. There's no v2i1 or v4i1 kshift, and v8i1 is only supported with AVXDQ. Isel has fake patterns to extend these types to native shifts, but makes no guarantees about the value of any bits shifted in when shifting right. This patch promotes the vector to a type that supports a native shift first and only allows inserting into the msb of a native sized shift. I've constructed this in a way that doesn't do the promotion if we're going to fallback to using a xmm/ymm/zmm shuffle. I think I have a plan to remove the shuffle fall back entirely. In which case we this can be simplified, but I wanted to fix the correctness issue first. llvm-svn: 320081	2017-12-07 20:10:04 +00:00

1 2 3 4 5 ...

108736 Commits