llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Craig Topper	8295be1621	[X86] Allow vpclmulqdq instructions to be commuted during isel to allow load folding. The commuting patterns for the AVX version actually still had priority over the new patterns. llvm-svn: 318800	2017-11-21 21:05:21 +00:00
Craig Topper	dd308dec13	[X86] Add BITALG, VAES, VBMI2, VNNI, VPCLMULQDQ, and VPOPCNTDQ instructions to icelake CPU. This is based on table 1-1 of the October 2017 revision of Intel® Architecture Instruction Set Extensions and Future Features Programming Reference llvm-svn: 318799	2017-11-21 21:05:18 +00:00
Nirav Dave	71ae010599	Avoid unecessary opsize byte in segment move to memory Segment moves to memory are always 16-bit. Remove invalid 32 and 64 bit variants. Recommiting with missing clang inline assembly test change. Fixes PR34478. Reviewers: rnk, craig.topper Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D39847 llvm-svn: 318797	2017-11-21 19:28:13 +00:00
Craig Topper	4812755896	[X86] Sort bits in getHostCPUFeatures again. llvm-svn: 318792	2017-11-21 18:50:41 +00:00
Chad Rosier	d593be8413	[AArch64] Mark mrs of TPIDR_EL0 (thread pointer) as having side effects. This partially reverts r298851. The the underlying issue is that we don't currently model the dependency between mrs (read system register) and msr (write system register) instructions. Something like the below should never be reordered: msr TPIDR_EL0, x0 ;; set thread pointer mrs x8, TPIDR_EL0 ;; read thread pointer but was being reordered after r298851. The functional part of the patch that wasn't reverted needed to remain in place in order to not break r299462. PR35317 llvm-svn: 318788	2017-11-21 18:08:34 +00:00
Hans Wennborg	bdbe363676	Fix r318786 llvm-svn: 318787	2017-11-21 18:00:01 +00:00
Nuno Lopes	7eb86bf989	removed unused private method decl. NFC llvm-svn: 318786	2017-11-21 17:53:19 +00:00
Hans Wennborg	05ce93732b	Rename test/Transforms/CountingFunctionInserter -> EntryExitInstrumenter The pass was renamed in r318195. llvm-svn: 318784	2017-11-21 17:22:19 +00:00
Hans Wennborg	2ebede8b36	EntryExitInstrumenter: support __cyg_profile_func_enter_bare It works just like __cyg_profile_func_enter but takes no arguments. llvm-svn: 318783	2017-11-21 17:22:19 +00:00
Oliver Stannard	1e82259f07	[ARM] Remove pre-UAL FLDM/FSTM aliases These are pre-UAL syntax, and we don't support any other pre-UAL instructions, with the exception of FLDMX/FSTMX, which don't have a UAL equivalent. Therefore there's no reason to keep them or their AsmParser hacks around. With the AsmParser hacks removed, the FLDMX and FSTMX instructions get the same operand diagnostics as the UAL instructions. Differential revision: https://reviews.llvm.org/D39196 llvm-svn: 318777	2017-11-21 16:20:25 +00:00
Alina Sbirlea	77a10244b2	Add MemorySSA as loop dependency, disabled by default [NFC]. Summary: First step in adding MemorySSA as dependency for loop pass manager. Adding the dependency under a flag. New pass manager: MSSA pointer in LoopStandardAnalysisResults can be null. Legacy and new pass manager: Use cl::opt EnableMSSALoopDependency. Disabled by default. Reviewers: sanjoy, davide, gberry Subscribers: mehdi_amini, Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D40274 llvm-svn: 318772	2017-11-21 15:45:46 +00:00
Oliver Stannard	33706d76ee	[ARM] Don't omit non-default predication code This was causing the (invalid) predicated versions of the NEON VRINTX and VRINTZ instructions to be accepted, with the condition code being ignored. Also, there is no NEON VRINTR instruction, so that part of the check was not necessary. Differential revision: https://reviews.llvm.org/D39193 llvm-svn: 318771	2017-11-21 15:34:15 +00:00
Oliver Stannard	bdf17e56f9	[Asm] Improve "too few operands" errors - We can still emit this error if the actual instruction has two or more operands missing compared to the expected one. - We should only emit this error once per instruction. Differential revision: https://reviews.llvm.org/D36746 llvm-svn: 318770	2017-11-21 15:16:50 +00:00
Oliver Stannard	b6954c234d	[Asm] Finish matching once end of formal and actual lists reached (NFC) This is NFC, as the matcher would continue looping up to the maximum number of operands with no effect, but this should improve performance a bit, and makes the debug trace clearer. Differential revision: https://reviews.llvm.org/D36744 llvm-svn: 318769	2017-11-21 15:12:05 +00:00
Sander de Smalen	e797a890ac	Revert r318759 due to make check-all failure on Windows llvm-svn: 318768	2017-11-21 15:07:43 +00:00
Oliver Stannard	b6bf719f2a	[ARM] Add diagnostics for SPR/DPR lists Differential revision: https://reviews.llvm.org/D39195 llvm-svn: 318766	2017-11-21 15:06:01 +00:00
Alexey Bataev	9bb90dea87	[InstCombine] Test for PR35354: unable to vectorize loop with std::max on floats, NFC. llvm-svn: 318764	2017-11-21 14:49:13 +00:00
Sam Kolton	72071c43fc	[AMDGPU] SDWA: remove omod src operand for VOP2b instructions Summary: VOP2b instructions (v_subbrev_u32, v_add_i32 ...) shouldn't support OMod operand in SDWA encoding Reviewers: rampitec, dp Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D40172 llvm-svn: 318761	2017-11-21 14:11:59 +00:00
Alex Bradbury	164d043067	[RISCV][NFC] Remove unnecessary {} around single statement if block Almost too trivial to worry about, but it seems worth having consistency with upcoming commits. llvm-svn: 318760	2017-11-21 12:41:41 +00:00
Sander de Smalen	57583b3ffa	[TableGen] AsmMatcher: Fix bug with reported diagnostic for operand. Summary: The generated diagnostic by the AsmMatcher isn't always applicable to the AsmOperand. This is because the code will only update the diagnostic if it is more specific than the previous diagnostic. However, when having validated operands and 'moved on' to a next operand (for some instruction/alias for which all previous operands are valid), if the diagnostic is InvalidOperand, than that should be set as the diagnostic, not the more specific message about a previous operand for some other instruction/alias candidate. Reviewers: craig.topper, olista01, rengolin, stoklund Reviewed By: olista01 Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D40011 llvm-svn: 318759	2017-11-21 12:26:06 +00:00
Simon Pilgrim	17fc3553e8	[X86][XOP] Add missing scheduler classes to XOP instructions All match equivalent basic classes (WritePHAdd, WriteFAdd etc.) according to both the AMD 15h SOG and Agner's tables. llvm-svn: 318758	2017-11-21 12:02:18 +00:00
Alex Bradbury	c88d30059b	[RISCV][NFC] Clean up RISCVDAGToDAGISel::Select As pointed out in post-commit review of r318738, `return ReplaceNode(..)` when both ReplaceNode and the current function return void is confusing. This patch moves to using a more obvious early return, and moves to just using an if to catch the one case we currently care about. A future patch that adds further custom instruction selection can introduce a switch. llvm-svn: 318757	2017-11-21 12:00:19 +00:00
Martell Malone	1b00723cb5	[ARM] Use SEH exceptions on thumbv7-windows Reviewers: mstorsjo Differential Revision: https://reviews.llvm.org/D40286 llvm-svn: 318756	2017-11-21 11:30:20 +00:00
Simon Pilgrim	7d84f55b6d	[X86][LWP] Add missing LWP itinerary class to lwpins instructions It's on all other LWP instruction but I missed it from lwpins, despite similar scheduling behaviour. llvm-svn: 318751	2017-11-21 11:17:11 +00:00
Eugene Leviant	db429d866b	[MI scheduler] Fix VADD and VSUB in cortex-a57 model This patch fixes instregex for interger vector add/sub instructions Differential revision: https://reviews.llvm.org/D40254 llvm-svn: 318749	2017-11-21 11:01:28 +00:00
Coby Tayree	fe22c86371	[x86][icelake]BITALG vpopcnt{b,w} Differential Revision: https://reviews.llvm.org/D40213 llvm-svn: 318748	2017-11-21 10:32:42 +00:00
Diana Picus	211036d05f	[ARM GlobalISel] Add comment for r318398. NFC. Mention the purpose of the BICri tests added by r318398, as requested in post-commit review. llvm-svn: 318747	2017-11-21 10:17:02 +00:00
Coby Tayree	194b252eca	[x86][icelake]VNNI Introducing Vector Neural Network Instructions, consisting of: vpdpbusd{s} vpdpwssd{s} Differential Revision: https://reviews.llvm.org/D40208 llvm-svn: 318746	2017-11-21 10:04:28 +00:00
Coby Tayree	c6c4bff339	[x86][icelake]vbmi2 introducing vbmi2, consisting of vpcompress{b,w} vpexpand{b,w} vpsh{l,r}d{w,d,q} vpsh{l,r}dv{w,d,q} Differential Revision: https://reviews.llvm.org/D40206 llvm-svn: 318745	2017-11-21 09:48:44 +00:00
NAKAMURA Takumi	be536e28e1	SLPVectorizer.cpp: Avoid std::stable_sort(properlyDominates()). properlyDominates() shouldn't be used as sort key. It causes different output between stdlibc++ and libc++. Instead, I introduced RPOT. In most cases, it works for CSE. llvm-svn: 318743	2017-11-21 09:41:01 +00:00
Coby Tayree	836d1e6a37	[x86][icelake]vpclmulqdq introduction an icelake promotion of pclmulqdq Differential Revision: https://reviews.llvm.org/D40101 llvm-svn: 318741	2017-11-21 09:30:33 +00:00
Coby Tayree	48de83a1a7	[x86][icelake]VAES introduction an icelake promotion of AES Differential Revision: https://reviews.llvm.org/D40078 llvm-svn: 318740	2017-11-21 09:11:41 +00:00
Alex Bradbury	43a1fee3ed	[RISCV] Use register X0 (ZERO) for constant 0 The obvious approach of defining a pattern like the one below actually doesn't work: `def : Pat<(i32 0), (i32 X0)>;` As was noted when Lanai made this change (https://reviews.llvm.org/rL288215), attempting to handle the constant 0 in tablegen leads to assertions due to a physical register being used where a virtual register is expected. llvm-svn: 318738	2017-11-21 08:23:08 +00:00
Alex Bradbury	cc724f15ca	[RISCV] Support and tests for a variety of additional LLVM IR constructs Previous patches primarily ensured that codegen was possible for the standard RISC-V instructions. However, there are a number of IR inputs that wouldn't be appropriately lowered. This patch both adds test cases and supports lowering for a number of these cases: * Improved sext/zext/trunc support * Support for setcc variants that don't map directly to RISC-V instructions * Lowering mul, and hence support for external symbols * addc, adde, subc, sube * mulhs, srem, mulhu, urem, udiv, sdiv * {srl,sra,shl}_parts * brind * br_jt * bswap, ctlz, cttz, ctpop * rotl, rotr * BlockAddress operands Differential Revision: https://reviews.llvm.org/D29938 llvm-svn: 318737	2017-11-21 08:11:03 +00:00
Alex Bradbury	dba566bb03	[RISCV] Implement lowering of ISD::SELECT Although ISD::SELECT_CC is a more natural match for RISCVISD::SELECT_CC (and ultimately the integer RISC-V conditional branch instructions), we choose to expand ISD::SELECT_CC and lower ISD::SELECT. The appropriate compare+branch will be created in the case where an ISD::SELECT condition value is created by an ISD::SETCC node, which operates on XLen types. Other datatypes such as floating point don't have conditional branch instructions, and lowering ISD::SELECT allows more flexibility for handling these cases. Differential Revision: https://reviews.llvm.org/D29937 llvm-svn: 318735	2017-11-21 07:51:32 +00:00
Dean Michael Berris	57f2739291	[XRay] Use optimistic logging model for FDR mode Summary: Before this change, the FDR mode implementation relied on at thread-exit handling to return buffers back to the (global) buffer queue. This introduces issues with the initialisation of the thread_local objects which, even through the use of pthread_setspecific(...) may eventually call into an allocation function. Similar to previous changes in this line, we're finding that there is a huge potential for deadlocks when initialising these thread-locals when the memory allocation implementation is also xray-instrumented. In this change, we limit the call to pthread_setspecific(...) to provide a non-null value to associate to the key created with pthread_key_create(...). While this doesn't completely eliminate the potential for the deadlock(s), it does allow us to still clean up at thread exit when we need to. The change is that we don't need to do more work when starting and ending a thread's lifetime. We also have a test to make sure that we actually can safely recycle the buffers in case we end up re-using the buffer(s) available from the queue on multiple thread entry/exits. This change cuts across both LLVM and compiler-rt to allow us to update both the XRay runtime implementation as well as the library support for loading these new versions of the FDR mode logging. Version 2 of the FDR logging implementation makes the following changes: * Introduction of a new 'BufferExtents' metadata record that's outside of the buffer's contents but are written before the actual buffer. This data is associated to the Buffer handed out by the BufferQueue rather than a record that occupies bytes in the actual buffer. * Removal of the "end of buffer" records. This is in-line with the changes we described above, to allow for optimistic logging without explicit record writing at thread exit. The optimistic logging model operates under the following assumptions: * Threads writing to the buffers will potentially race with the thread attempting to flush the log. To avoid this situation from occuring, we make sure that when we've finalized the logging implementation, that threads will see this finalization state on the next write, and either choose to not write records the thread would have written or write the record(s) in two phases -- first write the record(s), then update the extents metadata. * We change the buffer queue implementation so that once it's handed out a buffer to a thread, that we assume that buffer is marked "used" to be able to capture partial writes. None of this will be safe to handle if threads are racing to write the extents records and the reader thread is attempting to flush the log. The optimism comes from the finalization routine being required to complete before we attempt to flush the log. This is a fairly significant semantics change for the FDR implementation. This is why we've decided to update the version number for FDR mode logs. The tools, however, still need to be able to support older versions of the log until we finally deprecate those earlier versions. Reviewers: dblaikie, pelikan, kpw Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D39526 llvm-svn: 318733	2017-11-21 07:16:57 +00:00
Craig Topper	0604471011	[X86] Simplify type constraints for AVX2 masked gather. We don't need separate 32 and 64 node types. We can use SDTCisInt and SDTCisSameSizeAs to ensure the mask size the result type and is integer. llvm-svn: 318732	2017-11-21 06:28:15 +00:00
Serguei Katkov	6fb481ba59	Revert "[CGP] Enable complex addr mode (2nd attempt)" Revert the patch rl318728 causing buildbot hangs-ups. llvm-svn: 318731	2017-11-21 06:03:43 +00:00
Craig Topper	b0d4cbb3b0	[X86] Simplify the predicates for avx2 masked gather patterns. We don't need a dyn_cast and we only need to check the type of the index. The base ptr is guaranteed to be scalar. llvm-svn: 318730	2017-11-21 06:01:20 +00:00
Rafael Espindola	709b1b61bb	move static function. NFC llvm-svn: 318729	2017-11-21 05:35:45 +00:00
Serguei Katkov	e8b6e750ba	[CGP] Enable complex addr mode (2nd attempt) 2nd attempt to enable complex addr modes after fix of the crash by rL318638. llvm-svn: 318728	2017-11-21 05:31:47 +00:00
Yaxun Liu	5069545c52	[AMDGPU] Fix DAGTypeLegalizer::SplitInteger for shift amount type DAGTypeLegalizer::SplitInteger uses default pointer size as shift amount constant type, which causes less performant ISA in amdgcn---amdgiz target since the default pointer type is i64 whereas the desired shift amount type is i32. This patch fixes that by using TLI.getScalarShiftAmountTy in DAGTypeLegalizer::SplitInteger. The X86 change is necessary since splitting i512 requires shifting amount of 256, which cannot be held by i8. Differential Revision: https://reviews.llvm.org/D40148 llvm-svn: 318727	2017-11-21 02:29:54 +00:00
Rafael Espindola	e5791cf034	Split a rename_handle out of rename on windows. llvm-svn: 318725	2017-11-21 01:52:44 +00:00
Richard Trieu	1e6ea96860	Add default values for member functions. Initialize IsVis2 and IsVis3 in SparcSubtarget::initializeSubtargetDependencies. MSan detected uninitialized read of IsVis3 after r318704. Initializing the variables to false will prevent undefined behavior. llvm-svn: 318724	2017-11-21 01:45:17 +00:00
Zachary Turner	7c8a28c1eb	Re-revert "Refactor debuginfo-tests." This is still breaking greendragon. At this point I give up until someone can fix the greendragon bots, and I will probably abandon this effort in favor of using a private github repository. llvm-svn: 318722	2017-11-21 01:20:28 +00:00
David Blaikie	0d3708bcad	xray-record-yaml.h: Remove unused file llvm-svn: 318715	2017-11-21 00:33:17 +00:00
David Blaikie	0710bb8ab4	llvm-rc/ResourceScriptTokenList.h: Turns this into a .def file to imply that it's non-modular Also undef the macros at the end of the file to make it easier to use. llvm-svn: 318714	2017-11-21 00:23:19 +00:00
David Blaikie	f412868b63	XRayRecord.h: Add missing #include llvm-svn: 318713	2017-11-21 00:23:19 +00:00
David Blaikie	2b177f1e03	YAML/XRay/std::vector: Fix ODR violation by removing local specialization There's a generic partial specialization for all std::vector<T> that does what's desired, so no need for this full specialization that's causing an ODR violation anyway. llvm-svn: 318712	2017-11-21 00:23:17 +00:00
Davide Italiano	7f9b83e34d	[SCCP] If we replace with a constant, we can't replace with a range. This microoptimization is NFC. llvm-svn: 318711	2017-11-21 00:21:52 +00:00

1 2 3 4 5 ...

156925 Commits