llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
David Majnemer	95fedaaedc	Use the range variant of transform instead of unpacking begin/end No functionality change is intended. llvm-svn: 278476	2016-08-12 04:32:42 +00:00
David Majnemer	9880e078f0	Use the range variant of remove_if instead of unpacking begin/end No functionality change is intended. llvm-svn: 278475	2016-08-12 04:32:37 +00:00
David Majnemer	93f4b65cd3	Use the range variant of count_if instead of unpacking begin/end No functionality change is intended. llvm-svn: 278474	2016-08-12 04:32:29 +00:00
David Majnemer	319d420e44	Use the range variant of find/find_if instead of unpacking begin/end If the result of the find is only used to compare against end(), just use is_contained instead. No functionality change is intended. llvm-svn: 278469	2016-08-12 03:55:06 +00:00
David Majnemer	ae16160dfe	Use the range variant of find_if instead of unpacking begin/end No functionality change is intended. llvm-svn: 278443	2016-08-12 00:18:03 +00:00
David Majnemer	85242fb9f9	Use the range variant of find instead of unpacking begin/end If the result of the find is only used to compare against end(), just use is_contained instead. No functionality change is intended. llvm-svn: 278433	2016-08-11 22:21:41 +00:00
Vyacheslav Klochkov	fba6edf883	X86-FMA3: Implemented commute transformation for EVEX/AVX512 FMA3 opcodes. This helped improve memory-folding and register coalescing optimizations. Also, this patch fixed the tracker #17229. Reviewer: Craig Topper. Differential Revision: https://reviews.llvm.org/D23108 llvm-svn: 278431	2016-08-11 22:07:33 +00:00
David Majnemer	5423e4bff5	Use range algorithms instead of unpacking begin/end No functionality change is intended. llvm-svn: 278417	2016-08-11 21:15:00 +00:00
Krzysztof Parzyszek	c1b900dd08	[Hexagon] Allow non-returning calls in hardware loops llvm-svn: 278416	2016-08-11 21:14:25 +00:00
Matt Arsenault	ba6f871af4	AMDGPU: Remove unused tablegen utilities llvm-svn: 278414	2016-08-11 21:08:43 +00:00
Wei Ding	f99a9aad71	AMDGPU : Add intrinsic for instruction v_cvt_pk_u8_f32 Differential Revision: http://reviews.llvm.org/D23336 llvm-svn: 278403	2016-08-11 20:34:48 +00:00
Matt Arsenault	35f283a410	AMDGPU: Prune includes llvm-svn: 278391	2016-08-11 19:18:50 +00:00
Krzysztof Parzyszek	04f5a651f5	[Hexagon] Standardize "select" pseudo-instructions - PS_pselect: general register pairs - PS_vselect: vector registers (+ 128B version) - PS_wselect: vector register pairs (+ 128B version) llvm-svn: 278390	2016-08-11 19:12:18 +00:00
Krzysztof Parzyszek	fb4a7f1227	[Hexagon] Skip byval arguments when checking parameter attributes From the point of view of register assignment, byval parameters are ignored: a byval parameter is not going to be assigned to a register, and it will not affect the assignments of subsequent parameters. When matching registers with parameters in the bit tracker, make sure to skip byval parameters before advancing the registers. llvm-svn: 278375	2016-08-11 18:15:16 +00:00
Matt Arsenault	7a366c7517	AMDGPU: Fix crashes on memory functions llvm-svn: 278369	2016-08-11 17:31:42 +00:00
Matt Arsenault	a7b500f129	AArch64: Assert on analyzeBranch failing llvm-svn: 278366	2016-08-11 17:22:59 +00:00
Matt Arsenault	ebc69d3073	AMDGPU: Remove custom getSubReg This was kind of confusing, the subregister class shouldn't really be necessary. llvm-svn: 278362	2016-08-11 17:15:32 +00:00
Matt Arsenault	278cead855	AMDGPU: Remove unused tracking of flat instructions llvm-svn: 278361	2016-08-11 17:15:28 +00:00
Duncan P. N. Exon Smith	51ce2b263f	Hexagon: Avoid dereferencing end() in HexagonCopyToCombine::findPairable Check for end() before skipping through debug values. This avoids dereferencing end() when the instruction is the final one in the basic block. (It still assumes that a debug value will not be the final instruction in the basic block. No tests seemed to violate that.) Many Hexagon tests trigger this, but they happen to magically pass right now. I found this because WIP patches for PR26753 convert them into crashes. llvm-svn: 278355	2016-08-11 16:40:03 +00:00
Wei Ding	afed1ab282	AMDGPU : Add LLVM intrinsics for SAD related instructions. Differential Revision: http://reviews.llvm.org/D23133 llvm-svn: 278354	2016-08-11 16:33:53 +00:00
Duncan P. N. Exon Smith	def4d1cdf6	X86: Use operator lookup for operator==, NFC Avoid relying on the MachineInstrBundleIterator operator== being implemented as a member function. llvm-svn: 278347	2016-08-11 15:51:29 +00:00
Valery Pykhtin	ea5c9fff44	Revert "[AMDGPU] fix failure on printing of non-existing instruction operands." This reverts revision 278333, newly added test failed. llvm-svn: 278336	2016-08-11 14:22:05 +00:00
Valery Pykhtin	90cb3bbb89	[AMDGPU] fix failure on printing of non-existing instruction operands. Differential revision: https://reviews.llvm.org/D23323 llvm-svn: 278333	2016-08-11 13:49:46 +00:00
Simon Pilgrim	3d0b7d6e80	Fixed VS2015 (Update 3) warning - differing const/volatile qualifiers for overridden function Dropped the const qualifier to match llvm::CallLowering::lowerCall llvm-svn: 278329	2016-08-11 12:19:43 +00:00
Igor Breger	68a61a0447	[AVX512] Fix extractelement i1 lowering. The previous implementation (not custom) doesn't enforce zeroing off upper bits. The assumption is that i1 PRODUCER (truncate and extractelement) must zero all upper bits, so i1 CONSUMER instructions ( test, zext, save, etc) can be done without additional zeroing. Make extractelement i1 lowering custom for all vector i1. Differential Revision: http://reviews.llvm.org/D23246 llvm-svn: 278328	2016-08-11 12:13:46 +00:00
Marina Yatsina	671f7eb81b	Avoid false dependencies of undef machine operands This patch helps avoid false dependencies on undef registers by updating the machine instructions' undef operand to use a register that the instruction is truly dependent on, or use a register with clearance higher than Pref. Pseudo example: loop: xmm0 = ... xmm1 = vcvtsi2sdl eax, xmm0<undef> ... = inst xmm0 jmp loop In this example, selecting xmm0 as the undef register creates false dependency between loop iterations. This false dependency cannot be solved by inserting an xor before vcvtsi2sdl because xmm0 is alive at the point of the vcvtsi2sdl instruction. Selecting a different register instead of xmm0, especially a register that is not used in the loop, will eliminate this problem. Differential Revision: https://reviews.llvm.org/D22466 llvm-svn: 278321	2016-08-11 07:32:08 +00:00
Craig Topper	8bbc225e62	[AVX-512] Promote 512-bit integer loads to v8i64 similar to what is done for 128/256-bit vectors for overall consistency. llvm-svn: 278318	2016-08-11 06:04:07 +00:00
Craig Topper	27e352c1ed	[AVX-512] Add patterns to allow EVEX encoded stores of v16i16/v8i16/v16i8/v32i8 even when BWI is not supported. llvm-svn: 278317	2016-08-11 06:04:04 +00:00
Craig Topper	e8b95618e0	[AVX-512] Fix the 128-bit and 256-bit nontemporal load patterns with elements type other than i64. These loads have all been promoted to v2i64/v4i64 loads so we need bitcasts or we end up selecting VMOVDQA32/VMOVDQU32 instead. llvm-svn: 278316	2016-08-11 06:04:00 +00:00
Dominic Chen	f6f7f50802	[WebAssembly] Cleanup trailing whitespace Summary: Test for commit access. Subscribers: jfb, dschuff Differential Revision: https://reviews.llvm.org/D23392 llvm-svn: 278313	2016-08-11 04:10:56 +00:00
Tim Northover	cd8fd28f8c	GlobalISel: implement simple function calls on AArch64. We're still limited in the arguments we support, but this at least handles the basic cases. llvm-svn: 278293	2016-08-10 21:44:01 +00:00
Changpeng Fang	179532ade9	AMDGPU/SI: Implement amdgcn image intrinsics with sampler Summary: This patch defines and implements amdgcn image intrinsics with sampler. 1. define vdata type to be llvm_anyfloat_ty, address type to be llvm_anyfloat_ty, and rsrc type to be llvm_anyint_ty. As a result, we expect the intrinsics name to have three suffixes to overload each of these three types; 2. D128 as well as two other flags are implied in the three types, for example, if you use v8i32 as resource type, then r128 is 0! 3. don't expose TFE flag, and other flags are exposed in the instruction order: unrm, glc, slc, lwe and da. Differential Revision: http://reviews.llvm.org/D22838 Reviewed by: arsenm and tstellarAMD llvm-svn: 278291	2016-08-10 21:15:30 +00:00
Matt Arsenault	cb12ba3447	AMDGPU: s_setpc_b64 should be an indirect branch llvm-svn: 278278	2016-08-10 19:20:02 +00:00
Matt Arsenault	806c7ea5a9	AMDGPU: Set sizes on control flow pseudos llvm-svn: 278276	2016-08-10 19:11:51 +00:00
Matt Arsenault	8eb3b846e7	AMDGPU: Remove empty file comment llvm-svn: 278275	2016-08-10 19:11:48 +00:00
Matt Arsenault	b6ebde3d1d	AMDGPU: Remove unnecessary cast llvm-svn: 278274	2016-08-10 19:11:45 +00:00
Matt Arsenault	5267e59706	AMDGPU: Change insertion point of si_mask_branch Insert before the skip branch if one is created. This is a somewhat more natural placement relative to the skip branches, and makes it possible to implement analyzeBranch for skip blocks. The test changes are mostly due to a quirk where the block label is not emitted if there is a terminator that is not also a branch. llvm-svn: 278273	2016-08-10 19:11:42 +00:00
Matt Arsenault	d534109bca	AMDGPU: Use CreateStackObject instead of CreateSpillStackObject I'm not sure what the difference is, but no other target uses this for emergency spill slots. llvm-svn: 278272	2016-08-10 19:11:36 +00:00
Sanjay Patel	eb88a9636a	[x86, AVX] allow FP vector select folding to bitwise logic ops (PR28895) This handles the case in: https://llvm.org/bugs/show_bug.cgi?id=28895 ...but we are not getting all of the possibilities yet. Eg, we use 'X86::FANDN' for scalar FP select combines. That enhancement is filed as: https://llvm.org/bugs/show_bug.cgi?id=28925 Differential Revision: https://reviews.llvm.org/D23337 llvm-svn: 278270	2016-08-10 19:00:11 +00:00
Krzysztof Parzyszek	8197d268d7	[Hexagon] Remove unused variants of LO/HI instructions llvm-svn: 278266	2016-08-10 18:40:36 +00:00
Simon Pilgrim	2ca47e753e	[X86][SSE] Dropped blend(insertps(x,y),zero) combine - this is now handled by target shuffle chain combining llvm-svn: 278260	2016-08-10 18:10:29 +00:00
Krzysztof Parzyszek	458d8ce010	[Hexagon] Simplify the SplitConst32/64 pass llvm-svn: 278256	2016-08-10 18:05:47 +00:00
Krzysztof Parzyszek	bdc1668cd8	[Hexagon] Add extra patterns for single-precision min/max instructions llvm-svn: 278252	2016-08-10 17:56:24 +00:00
Krzysztof Parzyszek	57fa692f90	[Hexagon] Fix table-gen decode conflict warnings for CONST32/64 llvm-svn: 278247	2016-08-10 17:22:24 +00:00
Krzysztof Parzyszek	631100a1eb	[Hexagon] Use integer instructions for floating point immediates Floating point instructions use general purpose registers, so the few instructions that can put floating point immediates into registers are, in fact, integer instructions. Use them explicitly instead of having pseudo-instructions specifically for dealing with floating point values. Simplify the constant loading instructions (from sdata) to have only two: one for 32-bit values and one for 64-bit values: CONST32 and CONST64. llvm-svn: 278244	2016-08-10 16:46:36 +00:00
Roger Ferrer Ibanez	4597590001	Fix build break of VS 2013 debug builds In debug mode extra macros are enabled for several C++ algorithms. Some of them may cause unfortunate build failures. This commit adds a redundant operator() to work around one of those troublesome macros which was hit accidentally by change r278012. llvm-svn: 278241	2016-08-10 16:39:58 +00:00
Krzysztof Parzyszek	fc9436e726	[Hexagon] Delete HexagonSelectCCInfo.td This file is not used. The location assignment of call arguments and return values is implemented directly in HexagonISelLowering. llvm-svn: 278237	2016-08-10 16:23:53 +00:00
Krzysztof Parzyszek	b6174c9c27	[Hexagon] Remove unneeded/unused ISD opcodes ARGEXTEND and FCONST32 llvm-svn: 278236	2016-08-10 16:20:33 +00:00
Simon Pilgrim	534f12b752	[X86][SSE] Add support for combining target shuffles to MOVSS/MOVSD Only do this on pre-SSE41 targets where we should be lowering to BLENDPS/BLENDPD instead llvm-svn: 278228	2016-08-10 14:15:41 +00:00
Simon Pilgrim	c0a2a41fc2	[X86][SSE] Only treat SM_SentinelUndef as UNDEF in shuffle mask predicates isUndefOrEqual and isUndefOrInRange treated all -ve shuffle mask values as UNDEF, now it has to be SM_SentinelUndef (-1) We already have asserts to check that lowered SHUFFLE_VECTOR indices are in the range -1 <= index < 2*masksize (or masksize for unary shuffles) llvm-svn: 278218	2016-08-10 12:55:25 +00:00
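
Several of the commits above (r278433 through r278476) replace calls that unpack begin()/end() with range variants, and use is_contained when the result of a find is only compared against end(). The following is a minimal sketch of that pattern, not the LLVM implementation: the sketch namespace, the helper definitions, and the hasOpcodeOld/hasOpcodeNew functions are hypothetical stand-ins written for illustration only.

```cpp
#include <algorithm>
#include <cassert>
#include <iterator>
#include <vector>

namespace sketch {

// Range variant of std::find: takes the whole range instead of two iterators.
// (Illustrative stand-in; not copied from LLVM.)
template <typename R, typename T>
auto find(R &&Range, const T &Val) -> decltype(std::begin(Range)) {
  return std::find(std::begin(Range), std::end(Range), Val);
}

// True if Element occurs anywhere in Range. This covers the common case
// where the iterator returned by find is only compared against end().
template <typename R, typename T>
bool is_contained(R &&Range, const T &Element) {
  return std::find(std::begin(Range), std::end(Range), Element) !=
         std::end(Range);
}

} // namespace sketch

// Before: begin/end are unpacked by hand and the result is compared to end().
inline bool hasOpcodeOld(const std::vector<int> &Opcodes, int Opc) {
  return std::find(Opcodes.begin(), Opcodes.end(), Opc) != Opcodes.end();
}

// After: the same query expressed with the range helper.
inline bool hasOpcodeNew(const std::vector<int> &Opcodes, int Opc) {
  return sketch::is_contained(Opcodes, Opc);
}

// Small self-check showing that both forms agree.
inline void example() {
  std::vector<int> Opcodes = {10, 20, 30};
  assert(hasOpcodeOld(Opcodes, 20) == hasOpcodeNew(Opcodes, 20));
  assert(hasOpcodeOld(Opcodes, 99) == hasOpcodeNew(Opcodes, 99));
  assert(sketch::find(Opcodes, 30) == Opcodes.begin() + 2);
}
```

Both forms compute the same answer, which is consistent with the "No functionality change is intended" note in those commit messages. The range form only removes the repeated container name and the explicit comparison against end().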


38827 Commits