llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	0c614eb08a	[X86][XOP] Support for VPERMIL2PD/VPERMIL2PS 2-input shuffle instructions This patch begins adding support for lowering to the XOP VPERMIL2PD/VPERMIL2PS shuffle instructions - adding the X86ISD::VPERMIL2 opcode and cleaning up the usage. The internal llvm intrinsics were assuming the shuffle mask operand was the same type as the float/double input operands (I guess to simplify the intrinsic definitions in X86InstrXOP.td to a single value type). These needed changing to integer types (matching the clang builtin and the AMD intrinsics definitions), an auto upgrade path is added to convert old calls. Mask decoding/target shuffle support will be added in future patches. Differential Revision: http://reviews.llvm.org/D20049 llvm-svn: 271633	2016-06-03 08:06:03 +00:00
Craig Topper	1981be1e4f	[X86] Fix some isel patterns to remove an operand from some multiclasses. NFC llvm-svn: 271631	2016-06-03 05:58:52 +00:00
Craig Topper	7dae6773ea	[AVX512] Ensure EVEX vpshufd, vpshuflw, and vpshufhw have isel priority over the VEX encoded ones. llvm-svn: 271629	2016-06-03 05:31:04 +00:00
Craig Topper	8371de0a99	[AVX512] Fix shuffle comment printing for EVEX encoded PSHUFD, PSHUFHW, and PSHUFLW. llvm-svn: 271628	2016-06-03 05:31:00 +00:00
Craig Topper	d4d080a520	[X86] Simplify a multiclass to remove a parameter. NFC llvm-svn: 271627	2016-06-03 05:30:56 +00:00
Craig Topper	89720ac9ae	[X86] Remove unnecessary pattern predicates from the vector bit cast patterns. The types have to be legal and there are no alternative patterns. Saves almost 200 bytes in isel table. llvm-svn: 271625	2016-06-03 04:15:27 +00:00
Craig Topper	3bff73b92e	[X86] Cleanup formatting a bit to align similar parts of adjacent lines. llvm-svn: 271624	2016-06-03 04:15:25 +00:00
Craig Topper	132b738bf7	[X86] Remove redundant bitcast patterns for 128/256-bit vectors. These only differ from the SSE/AVX versions by the register class, but register class has no bearing on isel. llvm-svn: 271623	2016-06-03 04:15:22 +00:00
Derek Schuff	b7000e8219	Revert "[WebAssembly] Emit type signatures for declared functions" This reverts r271599, it broke the integration tests. More places than I expected had nontrival return types in imports, or else the check was wrong. llvm-svn: 271606	2016-06-02 23:02:44 +00:00
Derek Schuff	f0dc82710c	[WebAssembly] Emit type signatures for declared functions Under emscripten, C code can take the address of a function implemented in Javascript (which is exposed via an import in wasm). Because imports do not have linear memory address in wasm, we need to generate a thunk to be the target of the indirect call; it call the import directly. To make this possible, LLVM needs to emit the type signatures for these functions, because they may not be called directly or referred to other than where the address is taken. This uses s new .s directive (.functype) which specifies the signature. Differential Revision: http://reviews.llvm.org/D20891 llvm-svn: 271599	2016-06-02 21:34:18 +00:00
Matt Arsenault	bc59009ab9	AMDGPU: Handle flat in getMemOpBaseRegImmOfs It can still report the base register, and the uses give up when it fails. llvm-svn: 271575	2016-06-02 20:05:20 +00:00
Sanjay Patel	02638731c1	transform obscured FP sign bit ops into a fabs/fneg using TLI hook This is effectively a revert of: http://reviews.llvm.org/rL249702 - [InstCombine] transform masking off of an FP sign bit into a fabs() intrinsic call (PR24886) and: http://reviews.llvm.org/rL249701 - [ValueTracking] teach computeKnownBits that a fabs() clears sign bits and a reimplementation as a DAG combine for targets that have IEEE754-compliant fabs/fneg instructions. This is intended to resolve the objections raised on the dev list: http://lists.llvm.org/pipermail/llvm-dev/2016-April/098154.html and: https://llvm.org/bugs/show_bug.cgi?id=24886#c4 In the interest of patch minimalism, I've only partly enabled AArch64. PowerPC, MIPS, x86 and others can enable later. Differential Revision: http://reviews.llvm.org/D19391 llvm-svn: 271573	2016-06-02 20:01:37 +00:00
Matt Arsenault	a9ec8d2f35	AMDGPU: Cleanup load tests There are a lot of different kinds of loads to test for, and these were scattered around inconsistently with some redundancy. Try to comprehensively test all loads in a consistent way. llvm-svn: 271571	2016-06-02 19:54:26 +00:00
Matt Arsenault	2119ea8370	AMDGPU: Temporary fix for broken store combine llvm-svn: 271567	2016-06-02 19:00:55 +00:00
Matt Arsenault	28ae99d5d5	AMDGPU: Fix crashes on unknown processor name If the processor name failed to parse for amdgcn, the resulting output would have R600 ISA in it. If the processor name was missing or invalid for R600, the wavefront size would not be set and there would be crashes from missing itinerary data. Fixes crashes in future commit caused by dividing by the unset/0 wavefront size. llvm-svn: 271561	2016-06-02 18:37:16 +00:00
Ahmed Bougacha	efa3d8d88c	[X86] Define segment MI operands as regs instead of i8imm. We've been pretending that segments are i8imm since the initial support (r68645), predating the addition of the SEGMENT_REG class (r81895). That happens to works, but is wrong, and inconsistent with how we print (e.g., X86ATTInstPrinter::printMemReference) and parse them (e.g., X86Operand::addMemOperands). This change shouldn't affect any tool users, but is visible to library users or out-of-tree tablegen backends: this causes MCOperandInfo for the segment op to have an RC instead of "unknown", and TII::getRegClass to actually return something. As the registers are reserved and no vregs of the class ever created, that shouldn't change anything. No test change; no suspicious getRegClass() in X86 and CodeGen. llvm-svn: 271559	2016-06-02 18:29:15 +00:00
Matthias Braun	5a2d283ab8	AArch64: Do not test for CPUs, use SubtargetFeatures Testing for specific CPUs has a number of problems, better use subtarget features: - When some tweak is added for a specific CPU it is often desirable for the next version of that CPU as well, yet we often forget to add it. - It is hard to keep track of checks scattered around the target code; Declaring all target specifics together with the CPU in the tablegen file is a clear representation. - Subtarget features can be tweaked from the command line. To discourage people from using CPU checks in the future I removed the isCortexXX(), isCyclone(), ... functions. I added an getProcFamily() function for exceptional circumstances but made it clear in the comment that usage is discouraged. Reformat feature list in AArch64.td to have 1 feature per line in alphabetical order to simplify merging and sorting for out of tree tweaks. No functional change intended. Differential Revision: http://reviews.llvm.org/D20762 llvm-svn: 271555	2016-06-02 18:03:53 +00:00
Dimitry Andric	9beb691de0	Only attempt to detect AVG if SSE2 is available Summary: In PR29973 Sanjay Patel reported an assertion failure when a certain loop was optimized, for a target without SSE2 support. It turned out this was because of the AVG pattern detection introduced in rL253952. Prevent the assertion failure by bailing out early in `detectAVGPattern()`, if the target does not support SSE2. Also add a minimized test case. Reviewers: congh, eli.friedman, spatel Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D20905 llvm-svn: 271548	2016-06-02 17:30:49 +00:00
Geoff Berry	9151cce5f4	[PEI, AArch64] Use empty spaces in stack area for local stack slot allocation. Summary: If the target requests it, use emptry spaces in the fixed and callee-save stack area to allocate local stack objects. AArch64: Change last callee-save reg stack object alignment instead of size to leave a gap to take advantage of above change. Reviewers: t.p.northover, qcolombet, MatzeB Subscribers: rengolin, mcrosier, llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D20220 llvm-svn: 271527	2016-06-02 16:22:07 +00:00
Krzysztof Parzyszek	7187f0db2b	[Hexagon] Expand COPY pseudo-instruction Handle it locally instead of having the target-independent pass deal with it. The generic pass does not preserve implicit uses, which may be necessary. llvm-svn: 271520	2016-06-02 14:33:08 +00:00
Krzysztof Parzyszek	ab57ca3845	[RDF] Ignore implicit defs when resetting <kill> flags llvm-svn: 271519	2016-06-02 14:30:09 +00:00
Simon Pilgrim	2e72cbb66e	[X86][SSE] Replace (V)CVTTPS2DQ and VCVTTPD2DQ truncating (round to zero) f32/f64 to i32 with generic IR (llvm) This patch removes the llvm intrinsics (V)CVTTPS2DQ and VCVTTPD2DQ truncation (round to zero) conversions and auto-upgrades to FP_TO_SINT calls instead. Note: I looked at updating CVTTPD2DQ as well but this still requires a lot more work to correctly lower. Differential Revision: http://reviews.llvm.org/D20860 llvm-svn: 271510	2016-06-02 10:55:21 +00:00
Sjoerd Meijer	067c8106bd	This adds support for Cortex-A73 as an available target. Differential Revision: http://reviews.llvm.org/D20865 llvm-svn: 271508	2016-06-02 10:48:52 +00:00
Craig Topper	57e5a20592	[AVX512] Add 512-bit load/stores to fast isel. llvm-svn: 271486	2016-06-02 04:51:37 +00:00
Craig Topper	94770dc6ce	[X86] No need to use 256-bit VMOVNTPS for integer types when only AVX1 is supported. VMOVNTDQ is available with AVX1. We were getting this right for v4i64 but not the other integer types. llvm-svn: 271482	2016-06-02 04:19:48 +00:00
Craig Topper	f70345a66d	[X86] Add AVX 256-bit load and stores to fast isel. I'm not sure why this was missing for so long. This also exposed that we were picking floating point 256-bit VMOVNTPS for some integer types in normal isel for AVX1 even though VMOVNTDQ is available. In practice it doesn't matter due to the execution dependency fix pass, but it required extra isel patterns. Fixing that in a follow up commit. llvm-svn: 271481	2016-06-02 04:19:45 +00:00
Craig Topper	aa1499b742	[X86] Use uint16_t for a couple arrays of instruction opcodes. NFC llvm-svn: 271480	2016-06-02 04:19:42 +00:00
Craig Topper	38cb270826	[AVX512] Remove LOADA/LOADU/STOREA/STOREU intrinsic types now that they are unused. llvm-svn: 271479	2016-06-02 04:19:40 +00:00
Craig Topper	1887664778	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271478	2016-06-02 04:19:36 +00:00
Matt Arsenault	dffa97a6fa	AMDGPU: Fix incorrectly setting kill flag when copying register tuples This fixes some verifier errors when trackLivenessAfterRegAlloc is enabled. llvm-svn: 271446	2016-06-02 00:04:30 +00:00
Matt Arsenault	d7671dd7b8	AMDGPU: SIDebuggerInsertNops preserves CFG This saves an additional run of the DominatorTree and MachineLoopInfo llvm-svn: 271444	2016-06-02 00:04:22 +00:00
Rafael Espindola	c82586e4b5	Avoid a load for local functions. llvm-svn: 271437	2016-06-01 21:57:11 +00:00
Keno Fischer	2092f44163	[PPC64] Fix SUBFC8 Defs list Fix PR27943 "Bad machine code: Using an undefined physical register". SUBFC8 implicitly defines the CR0 register, but this was omitted in the instruction definition. Patch by Jameson Nash <jameson@juliacomputing.com> Reviewers: hfinkel Differential Revision: http://reviews.llvm.org/D20802 llvm-svn: 271425	2016-06-01 20:31:07 +00:00
Michael Zuckerman	e5673d8456	Adding back-end support to two bit scanning intrinsics Adding LLVM back-end support to two intrinsics dealing with bit scan: _bit_scan_forward and _bit_scan_reverse. Their functionality is as described in Intel intrinsics guide: https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_forward&expand=371,370 https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_reverse&expand=371,370 Commit on behalf of Omer Paparo Bivas Differential Revision: http://reviews.llvm.org/D19915 llvm-svn: 271386	2016-06-01 12:02:37 +00:00
Oliver Stannard	559228e8d5	[ARM] Add additional matching for UBFX instructions This adds an additional matcher to select UBFX(..) from SRL(AND(..)) in ARMISelDAGToDAG to help with code size. Patch by David Green. Differential Revision: http://reviews.llvm.org/D20667 llvm-svn: 271384	2016-06-01 12:01:01 +00:00
Chris Dewhurst	0ad8334427	[Sparc] Allow passing of empty structs. Passing an empty struct as a function call argument is now supported. unit tests for various scenarios added. llvm-svn: 271374	2016-06-01 08:48:56 +00:00
Craig Topper	bc9e4ba942	Revert r271362 "[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead." Looks like something isn't quite right still. Also forgot to move the test cases to an autoupgrade test. llvm-svn: 271363	2016-06-01 05:57:55 +00:00
Craig Topper	734c8343a6	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271362	2016-06-01 05:35:16 +00:00
Kevin B. Smith	21876b39ee	[X86]: Add a pattern that uses GR16_ABCD rather than GR32_ABCD to avoid falsely marking whole 32 bit register as live. Differential Revision: http://reviews.llvm.org/D20649 llvm-svn: 271341	2016-05-31 22:00:12 +00:00
Matthias Braun	b873b20639	ARM: Do not attempt to modify register class of physregs. Physregs have no associated register class, do not attempt to modify it in Thumb2InstrInfo::storeRegToStackSlot()/loadFromStackSlot(). llvm-svn: 271339	2016-05-31 21:39:12 +00:00
Rafael Espindola	11d3302464	Delete AArch64II::MO_CONSTPOOL. A constant pool holding the address of a variable in equivalent to a got entry. It produces exactly the same instruction sequence as a got use and unlike a got use this is not uniqued by the linker. llvm-svn: 271311	2016-05-31 18:31:14 +00:00
Simon Dardis	fb52d15569	[mips] Enforce compact branch register restrictions Enforce compact branch register restrictions such as the use of the zero register, both operands being the same register. Emit clear error in such cases as the issue is subtle. For bovc and bnvc, silently fixup such cases when emitting objects directly, like LLVM started doing in rL269899. Reviewers: vkalintiris, dsanders Differential Review: http://reviews.llvm.org/D20475 llvm-svn: 271301	2016-05-31 17:34:42 +00:00
Matt Arsenault	07b77e4b28	AMDGPU: Remove unused address space Also return a single StringRef instead of building a string. llvm-svn: 271296	2016-05-31 16:57:45 +00:00
Rafael Espindola	95d3b4d586	Add a use of shouldAssumeDSOLocal to ARM. Now this code path knows about position independent executables. llvm-svn: 271290	2016-05-31 15:31:55 +00:00
Krzysztof Parzyszek	4090309289	[Hexagon] Disable expanding MUX instructions that define a subregister The code in HexagonExpandCondsets.cpp does not handle those cases at the moment. llvm-svn: 271281	2016-05-31 14:27:10 +00:00
Yaron Keren	d2aa4f959e	Do not modify a std::vector while looping it. Introduced in r271244, this is probably undefined behaviour and asserts when compiled with Visual C++ debug mode. On further note, the loop is quadratic with regard to the number of successors since removeSuccessor is linear and could probably be modified to linear time. llvm-svn: 271278	2016-05-31 13:45:05 +00:00
Ranjeet Singh	2e252abc9e	[ARM] Add backend support for load/store intrinsics. Added support to map intrinsics __builtin_arm_{ldc,ldcl,ldc2,ldc2l,stc,stcl,stc2,stc2l} to their ARM instructions. Differential Revision: http://reviews.llvm.org/D20564 llvm-svn: 271271	2016-05-31 12:39:30 +00:00
Simon Pilgrim	29a89e51e6	[X86][SSE] Add load-folding patterns for (V)CVTDQ2PD (PR27291) Added patterns for (V)CVTDQ2PD -> 2f64 loading from a 64-bit source. llvm-svn: 271269	2016-05-31 12:04:35 +00:00
Simon Dardis	573359788f	[mips] bnec/beqc register constraint fix beqc and bnec cannot have $rs == $rt. Inhibit compact branch creation if that would occur. Reviewers: vkalintiris, dsanders Differential Revision: http://reviews.llvm.org/D20624 llvm-svn: 271260	2016-05-31 09:54:55 +00:00
Igor Breger	dab6232733	[AVX512] Fix intrinsic vcvtps2ph lowering. Differential Revision: http://reviews.llvm.org/D20788 llvm-svn: 271255	2016-05-31 08:04:21 +00:00
Igor Breger	cc3e7d2dd1	Fix intrinsic vbroadcast{i32\|f32}x2 lowering. Differential Revision: http://reviews.llvm.org/D20780 llvm-svn: 271254	2016-05-31 07:43:39 +00:00
Craig Topper	cb79936a4b	[AVX512] Remove masked store intrinsics. Clang now emits generic masked store intrinsics instead. The intrinsics will be autoupgraded to the same generic masked stores. llvm-svn: 271245	2016-05-31 01:50:02 +00:00
Saleem Abdulrasool	f85a029a33	X86: permit using SjLj EH on x86 targets as an option This adds support to the backed to actually support SjLj EH as an exception model. This is NOT the default model, and requires explicitly opting into it from the frontend. GCC supports this model and for MinGW can still be enabled via the `--using-sjlj-exceptions` options. Addresses PR27749! llvm-svn: 271244	2016-05-31 01:48:07 +00:00
Craig Topper	4f195e8edb	[X86] Remove SSE/AVX unaligned store intrinsics as clang no longer uses them. Auto upgrade to native unaligned store instructions. llvm-svn: 271236	2016-05-30 23:15:56 +00:00
Rafael Espindola	f002876001	Fix a crash when producing COFF. llvm-svn: 271229	2016-05-30 20:18:53 +00:00
Diana Picus	178ddc2360	[BPF] Remove exit-on-error from tests (PR27768, PR27769) The exit-on-error flag is necessary to avoid some assertions/unreachables. We can get past them by creating a few dummy nodes. Fixes PR27768, PR27769. Differential Revision: http://reviews.llvm.org/D20726 llvm-svn: 271200	2016-05-30 08:28:34 +00:00
Rafael Espindola	ceb4ee788c	Move RelaxELFRel out to llvm-mc. llvm-svn: 271160	2016-05-29 01:11:00 +00:00
Simon Pilgrim	6ec0f7efbc	[X86][SSE] (Reapplied) Replace (V)PMOVSX and (V)PMOVZX integer extension intrinsics with generic IR (llvm) This patch removes the llvm intrinsics VPMOVSX and (V)PMOVZX sign/zero extension intrinsics and auto-upgrades to SEXT/ZEXT calls instead. We already did this for SSE41 PMOVSX sometime ago so much of that implementation can be reused. Reapplied now that the the companion patch (D20684) removes/auto-upgrade the clang intrinsics has been committed. Differential Revision: http://reviews.llvm.org/D20686 llvm-svn: 271131	2016-05-28 18:03:41 +00:00
Rafael Espindola	ece382b0af	Fix production of R_X86_64_GOTPCRELX/R_X86_64_REX_GOTPCRELX. We were producing R_X86_64_GOTPCRELX for invalid instructions and sometimes producing R_X86_64_GOTPCRELX instead of R_X86_64_REX_GOTPCRELX. llvm-svn: 271118	2016-05-28 15:51:38 +00:00
Sanjay Patel	fc1048d379	[x86] avoid printing unnecessary sign bits of hex immediates in asm comments (PR20347) It would be better to check the valid/expected size of the immediate operand, but this is generally better than what we print right now. Differential Revision: http://reviews.llvm.org/D20385 llvm-svn: 271114	2016-05-28 14:58:37 +00:00
Ahmed Bougacha	396b16d3af	[X86] Try to zero elts when lowering 256-bit shuffle with PSHUFB. Otherwise we fallback to a blend of PSHUFBs later on. Differential Revision: http://reviews.llvm.org/D19661 llvm-svn: 271113	2016-05-28 14:38:04 +00:00
Rafael Espindola	1c2acf86c1	Simplify and clang-format a table. llvm-svn: 271112	2016-05-28 11:13:34 +00:00
Rafael Espindola	e3a90de4ca	Fix default reloc model on ARM. llvm-svn: 271111	2016-05-28 10:41:15 +00:00
Renato Golin	46dd534f95	Revert "Revert "Map DynamicNoPIC to Static on non-darwin."" This reverts commit r271096, as reverting it broke even more buildbots! But that also means I'll break on ARM again... :( llvm-svn: 271099	2016-05-28 04:47:13 +00:00
Renato Golin	df3d92e4b4	Revert "Map DynamicNoPIC to Static on non-darwin." This reverts commit r271052, as it broke some ARM buildbots. llvm-svn: 271096	2016-05-28 04:24:26 +00:00
Krzysztof Parzyszek	5a1924e7ea	[Hexagon] Add option to enable subregister liveness tracking llvm-svn: 271088	2016-05-28 02:02:51 +00:00
Krzysztof Parzyszek	421d04a22a	[Hexagon] Separate C8 and USR to avoid unwanted subregister composition Composing subreg_loreg with subreg_oveflow leads to strange results with lane masks for register classes with subreg_loreg. In particular, dead lane detection generates incorrect code. llvm-svn: 271087	2016-05-28 01:51:16 +00:00
Matthias Braun	57fd48584e	AArch64: Fix indentation llvm-svn: 271084	2016-05-28 01:06:51 +00:00
Matt Arsenault	a466d34daa	AMDGPU: Fix trailing whitespace llvm-svn: 271081	2016-05-28 00:50:51 +00:00
Matt Arsenault	f5367daffc	AMDGPU: Add fract intrinsic Remove broken patterns matching it. This was matching the unsafe math pattern and expanding the fix for the buggy instruction from the pattern. The problems are also on CI. Remove the workarounds and only use fract with unsafe math or from the intrinsic. llvm-svn: 271078	2016-05-28 00:19:52 +00:00
Rafael Espindola	0474b584d9	Start using shouldAssumeDSOLocal on ARM. Given where this is used it should be a nop. llvm-svn: 271066	2016-05-27 22:41:51 +00:00
Matthias Braun	e7a50d3005	AArch64Subtarget: Use default member initializers llvm-svn: 271057	2016-05-27 22:14:09 +00:00
Rafael Espindola	dff4498487	Map DynamicNoPIC to Static on non-darwin. DynamicNoPIC was only every used on darwin. This maps it to static on ELF. It matches what is done on X86. llvm-svn: 271052	2016-05-27 21:44:18 +00:00
Krzysztof Parzyszek	1c30a694b9	[Hexagon] Use standard macros to initialize HexagonExpandCondsets pass llvm-svn: 271045	2016-05-27 21:15:34 +00:00
Krzysztof Parzyszek	cde5da3f4a	[Hexagon] Do not create passes in the constructor of HexagonPassConfig When running mir tests, a pass created in that constructor would not be freed, leading to memory leaks. llvm-svn: 271043	2016-05-27 20:48:39 +00:00
Michael Kuperstein	ac24107aac	[X86] Detect SAD patterns and emit psadbw instructions. This recommits r267649 with a fix for PR27539. Differential Revision: http://reviews.llvm.org/D20598 llvm-svn: 271033	2016-05-27 18:53:22 +00:00
Ahmed Bougacha	38d6fb72ea	[X86] Clarify PSHUFB+blend lowering function name. NFC. Also guard against v32i8 users. llvm-svn: 271024	2016-05-27 17:58:17 +00:00
Ahmed Bougacha	e067751171	[ARM] Remove tBLXr Pat made redundant by r269101. NFCI. llvm-svn: 271023	2016-05-27 17:58:03 +00:00
Benjamin Kramer	32cac0d565	Use StringRef::startswith instead of find(...) == 0. It's faster and easier to read. llvm-svn: 271018	2016-05-27 16:54:57 +00:00
Benjamin Kramer	de8eeaec07	[sparc] Simplify a slow and verbose way of checking if a string starts with "ld". PR27904. llvm-svn: 271016	2016-05-27 16:45:37 +00:00
Benjamin Kramer	a855b3205f	Apply clang-tidy's misc-move-constructor-init throughout LLVM. No functionality change intended, maybe a tiny performance improvement. llvm-svn: 270997	2016-05-27 14:27:24 +00:00
Simon Dardis	18b7d75488	[mips] Weaken asm predicate for memory offsets The isMemWithSimmOffset predicate rejects relocations which is incorrect behaviour. Linkers and other tools should handle\|warn\|error when the field overflows. Reviewers: dsanders, vkalintiris Differential Revision: http://reviews.llvm.org/D20727 llvm-svn: 270995	2016-05-27 13:56:36 +00:00
Artem Tamazov	7523016960	[AMDGPU][llvm-mc] Square-braced-syntax for registers - make ":expr2" optional. Register numbers may be specified as assembly-time expressions. This feature can be useful in macros and alike. However, expressions are supported within sqare braces only. Sqare braces were initially intended to support specifying of multiple (pairs/quads...) registers. Syntax like v[8:8] which specifies single register is also supported. That allows expressions but looks a bit unnatural. This change supports syntax REG[EXPR]. Tests added. Differential Revision: http://reviews.llvm.org/D20588 llvm-svn: 270990	2016-05-27 12:50:13 +00:00
Benjamin Kramer	63c3e24c6a	Avoid some copies by using const references. clang-tidy's performance-unnecessary-copy-initialization with some manual fixes. No functional changes intended. llvm-svn: 270988	2016-05-27 12:30:51 +00:00
Benjamin Kramer	284d6ac8f3	Apply clang-tidy's misc-static-assert where it makes sense. Also fold conditions into assert(0) where it makes sense. No functional change intended. llvm-svn: 270982	2016-05-27 11:36:04 +00:00
Benjamin Kramer	5a2861b1b6	[sparc] Remove some unused (and undefined) declarations. No functionality change. llvm-svn: 270981	2016-05-27 10:19:03 +00:00
Benjamin Kramer	e944e2f178	[hexagon] Move BlockRanges and RDF stuff into the llvm namespace. No functional change intended. llvm-svn: 270980	2016-05-27 10:06:40 +00:00
Benjamin Kramer	1879abe239	[sparc] Move LEON passes into llvm namespace. Also give them library visiblity while there. llvm-svn: 270979	2016-05-27 10:06:27 +00:00
Simon Pilgrim	99e3cf65ff	Revert: r270973 - [X86][SSE] Replace (V)PMOVSX and (V)PMOVZX integer extension intrinsics with generic IR (llvm) llvm-svn: 270976	2016-05-27 09:02:25 +00:00
Simon Pilgrim	c8925e270b	[X86][SSE] Replace (V)PMOVSX and (V)PMOVZX integer extension intrinsics with generic IR (llvm) This patch removes the llvm intrinsics VPMOVSX and (V)PMOVZX sign/zero extension intrinsics and auto-upgrades to SEXT/ZEXT calls instead. We already did this for SSE41 PMOVSX sometime ago so much of that implementation can be reused. A companion patch (D20684) removes/auto-upgrade the clang intrinsics. Differential Revision: http://reviews.llvm.org/D20686 llvm-svn: 270973	2016-05-27 08:49:15 +00:00
Krzysztof Parzyszek	871650f3b5	[Hexagon] Enable the post-RA scheduler The aggressive anti-dependency breaker can rename the restored callee- saved registers. To prevent this, mark these registers are live on all paths to the return/tail-call instructions, and add implicit use operands for them to these instructions. llvm-svn: 270898	2016-05-26 19:44:28 +00:00
Chad Rosier	6d0c828d28	[AArch64] Generate rev16/rev32 from bswap + srl when upper bits are known zero. Canonicalize (srl (bswap i32 x), 16) to (rotr (bswap i32 x), 16), if the high 16-bits of x are zero. Similarly, canonicalize (srl (bswap i64 x), 32) to (rotr (bswap i64 x), 32), if the high 32-bits of x are zero. test_rev_w_srl16: test_rev_w_srl16: and w8, w0, #0xffff and w8, w0, #0xffff rev w8, w8 ---> rev16 w0, w8 lsr w0, w8, #16 test_rev_x_srl32: test_rev_x_srl32: rev x8, x8 ---> rev32 x0, x8 lsr x0, x8, #32 llvm-svn: 270896	2016-05-26 19:41:33 +00:00
Changpeng Fang	7c77c6e723	AMDGPU/SI: Enable load-store-opt by default. Summary: Enable load-store-opt by default, and update LIT tests. Reviewers: arsenm Differential Revision: http://reviews.llvm.org/D20694 llvm-svn: 270894	2016-05-26 19:35:29 +00:00
Artem Belevich	b0b1ecdbc5	Init member structs in constructor. Fixes build error on windows where MSVC does not support list initialization inside member initializer list. llvm-svn: 270877	2016-05-26 17:29:20 +00:00
Artem Belevich	f14e0f0a96	[NVPTX] Added NVVMIntrRange pass NVVMIntrRange adds !range metadata to calls of NVVM intrinsics that return values within known limited range. This allows LLVM to generate optimal code for indexing arrays based on tid/ctaid which is a frequently used pattern in CUDA code. Differential Revision: http://reviews.llvm.org/D20644 llvm-svn: 270872	2016-05-26 17:02:56 +00:00
Artem Tamazov	c4881d7a78	[AMDGPU][llvm-mc] s_getreg/setreg* - hwreg - factor out strings/literals etc. Hwreg(...) syntax implementation unified with sendmsg(...). Common strings moved to Utils MathExtras.h functionality utilized. Added missing build dependency in Disassembler. Differential Revision: http://reviews.llvm.org/D20381 llvm-svn: 270871	2016-05-26 17:00:33 +00:00
Artem Tamazov	d58a59d13c	Fix build warning introduced in r270552 "[AMDGPU][llvm-mc] Disassembler: support for TTMP/TBA/TMA registers." llvm-svn: 270859	2016-05-26 15:52:16 +00:00
Simon Pilgrim	3ed5f417f8	[X86][SSE] When lowering a 256-bit shuffle as PMOVZX, reduce the input vector to the lower 128-bit subvector. Most often as not this is what it started out as, the extraction is zero-cost on AVX and the PMOVZX/PMOVSX folding logic is based around 128-bit loads. llvm-svn: 270858	2016-05-26 15:40:36 +00:00
Krzysztof Parzyszek	30a7bb4136	[Hexagon] Select the aggressive anti-dependency breaker llvm-svn: 270857	2016-05-26 15:38:50 +00:00
Diana Picus	e53f2eed3e	[AMDGPU] Remove exit-on-error flag from test (PR27762) Similar to r269948, but for argument lowering. Fixes PR27762 Differential Revision: http://reviews.llvm.org/D20430 llvm-svn: 270856	2016-05-26 15:24:55 +00:00
Diana Picus	83ac5c0a7d	[BPF] Remove exit-on-error flag in test (PR27767) The exit-on-error flag is needed to avoid an assert where llvm::SelectionDAGISel::LowerArguments doesn't create enough arguments. Fill up with zeroes to reach the right number of args. Fixes PR27767. Differential Revision: http://reviews.llvm.org/D20571 llvm-svn: 270855	2016-05-26 15:23:50 +00:00
Chad Rosier	be99101f8a	[AArch64] Generate a BFI/BFXIL from 'or (and X, MaskImm), OrImm'. If and only if the value being inserted sets only known zero bits. This combine transforms things like and w8, w0, #0xfffffff0 movz w9, #5 orr w0, w8, w9 into movz w8, #5 bfxil w0, w8, #0, #4 The combine is tuned to make sure we always reduce the number of instructions. We avoid churning code for what is expected to be performance neutral changes (e.g., converted AND+OR to OR+BFI). Differential Revision: http://reviews.llvm.org/D20387 llvm-svn: 270846	2016-05-26 13:27:56 +00:00
Rafael Espindola	e388c6f6a2	Use shouldAssumeDSOLocal on AArch64. This reduces code duplication and now AArch64 also handles PIE. llvm-svn: 270844	2016-05-26 12:42:55 +00:00
Igor Breger	d6da40dfa4	[AVX512] Fix intrinsic cmp{sd\|ss} lowering. Differential Revision: http://reviews.llvm.org/D20615 llvm-svn: 270843	2016-05-26 12:42:25 +00:00
Chris Dewhurst	716ef61879	[Sparc] Extend the assembler printing support for Sparc back-end. Allows display of floating-point registers and display of assembler meta-data output. llvm-svn: 270829	2016-05-26 07:28:31 +00:00
Justin Lebar	db58249ac7	[NVPTX] Don't (incorrectly) say that the NVVMReflect pass preserves all analyses. Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D20585 llvm-svn: 270790	2016-05-25 23:12:38 +00:00
Rafael Espindola	c606f7449a	Don't repeat name in comment and git-clang-format. llvm-svn: 270785	2016-05-25 22:44:06 +00:00
Rafael Espindola	c2bb75e7bf	Sort includes. llvm-svn: 270769	2016-05-25 21:37:29 +00:00
Simon Pilgrim	2acf33db0b	Simplify std::all_of/any_of predicates by using llvm::all_of/any_of. NFCI. llvm-svn: 270753	2016-05-25 20:41:11 +00:00
Rafael Espindola	2224908028	Fix shouldAssumeDSOLocal for private linkage. llvm-svn: 270746	2016-05-25 19:55:16 +00:00
Matt Arsenault	1f6cee6a4f	AMDGPU: Fix v2i64/v2f64 bitcasts These operations tend to get promoted away to v4i32 so this doesn't happen often. llvm-svn: 270740	2016-05-25 18:07:36 +00:00
Matt Arsenault	e21e61958d	AMDGPU: Fix inconsistent lowering of select of vectors f32 vectors would use a sequence of BFI instructions instead of unrolled cmp + select. This was better in the case of a VALU select with SGPR inputs, but we don't have a way of dealing with that in the DAG. llvm-svn: 270731	2016-05-25 17:34:58 +00:00
Sanjay Patel	e582594538	[x86] avoid code explosion from LoopVectorizer for gather loop (PR27826) By making pointer extraction from a vector more expensive in the cost model, we avoid the vectorization of a loop that is very likely to be memory-bound: https://llvm.org/bugs/show_bug.cgi?id=27826 There are still bugs related to this, so we may need a more general solution to avoid vectorizing obviously memory-bound loops when we don't have HW gather support. Differential Revision: http://reviews.llvm.org/D20601 llvm-svn: 270729	2016-05-25 17:27:54 +00:00
Sanjay Patel	289425eb9f	[x86, AVX] allow explicit calls to VZERO* to modify state in VZeroUpperInserter pass (PR27823) As noted in the review, there are still problems, so this doesn't the bug completely. Differential Revision: http://reviews.llvm.org/D20529 llvm-svn: 270718	2016-05-25 16:39:47 +00:00
Simon Pilgrim	1a1ddc32da	[X86][SSE] Replace (V)CVTDQ2PD(Y) and (V)CVTPS2PD(Y) lossless conversion intrinsics with generic IR Followup to D20528 clang patch, this removes the (V)CVTDQ2PD(Y) and (V)CVTPS2PD(Y) llvm intrinsics and auto-upgrades to sitofp/fpext instead. Differential Revision: http://reviews.llvm.org/D20568 llvm-svn: 270678	2016-05-25 08:59:18 +00:00
Craig Topper	4710ab1424	[X86] Remove the llvm.x86.sse2.storel.dq intrinsic. It hasn't been used in a long time. llvm-svn: 270677	2016-05-25 06:56:32 +00:00
Nirav Dave	9f0b74dd18	Soften assertion in AMDGPU emitPrologue. [AMDGPU] emitPrologue looks for an unused unallocated SGPR that is not the scratch descriptor. Continue search if unused register found fails other requirements. Reviewers: arsenm, tstellarAMD, nhaehnle Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D20526 llvm-svn: 270646	2016-05-25 01:45:42 +00:00
Dan Gohman	e46ddfaa34	[WebAssembly] Put __stack_pointer in the offset field of loads and stores. Instead of this: i32.const $push10=, __stack_pointer i32.load $push11=, 0($pop10) Emit this: i32.const $push10=, 0 i32.load $push11=, __stack_pointer($pop10) It's not currently clear which is better, though there's a chance the second form may be better at overall compression. We can revisit this when we have more data; for now it makes sense to make PEI consistent with isel. Differential Revision: http://reviews.llvm.org/D20411 llvm-svn: 270635	2016-05-24 23:47:41 +00:00
Konstantin Zhuravlyov	480521dd43	[AMDGPU][NFC] Rename ReserveTrapVGPRs -> ReserveRegs Differential Revision: http://reviews.llvm.org/D20081 llvm-svn: 270594	2016-05-24 18:37:18 +00:00
Sam Kolton	5c1a0d6afe	[AMDGPU] Assembler: rework parsing of optional operands. Summary: Change process of parsing of optional operands. All optional operands use same parsing method - parseOptionalOperand(). No default values are added to OperandsVector. Get rid of WORKAROUND_USE_DUMMY_OPERANDS_INSTEAD_MUTIPLE_DEFAULT_OPERANDS. Reviewers: tstellarAMD, vpykhtin, artem.tamazov, nhaustov Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D20527 llvm-svn: 270556	2016-05-24 12:38:33 +00:00
Artem Tamazov	068739d10c	[AMDGPU][llvm-mc] Disassembler: support for TTMP/TBA/TMA registers. Differential Revision: http://reviews.llvm.org/D20476 llvm-svn: 270552	2016-05-24 12:05:16 +00:00
Igor Breger	29f643cd8e	[llvm][AVX512][intrinsics] Fix vperm{b\|w\|d\|q\|ps\|pd} intrinsics. Index is second argument to buildin function but it is first instruction operand. Differential Revision: http://reviews.llvm.org/D20515 llvm-svn: 270548	2016-05-24 11:06:22 +00:00
Sagar Thakur	5eb36c5e30	[MIPS][LLVM-MC] Fix Disassemble of Negative Offset Patch by Nitesh Jain. Summary: The type of Imm in MipsDisassembler.cpp was incorrect since SignExtend64 return int64_t type.As per the MIPSr6 doc ,the offset is added to the address of the instruction following the branch (not the branch itself), to form a PC-relative effective target address hence “4” is added to the offset. The offset of some test case are update to reflect the changes due to “ + 4 ” offset and new test case for negative offset are added. Reviewers: dsanders, vkalintiris Differential Revision: http://reviews.llvm.org/D17540 llvm-svn: 270542	2016-05-24 09:57:10 +00:00
Simon Pilgrim	de7240c8f6	[CostModel][X86][XOP] Added XOP costmodel for BITREVERSE Now that we have a nice fast VPPERM solution. Added framework for future intrinsic costs as well. llvm-svn: 270537	2016-05-24 08:17:50 +00:00
Dan Gohman	c3ac030898	[WebAssembly] Basic TargetTransformInfo support for SIMD128. llvm-svn: 270508	2016-05-23 22:47:07 +00:00
James Y Knight	f767eef5cb	[SPARC] Fix 8 and 16-bit atomic load and store. They were accidentally using the 32-bit load/store instruction for 8/16-bit operations, due to incorrect patterns (8/16-bit cmpxchg and atomicrmw will be fixed in subsequent changes) llvm-svn: 270486	2016-05-23 20:33:00 +00:00
Sanjay Patel	62f9635295	fix typo; NFC llvm-svn: 270469	2016-05-23 18:01:20 +00:00
Sanjay Patel	7d5178a58b	use range-loop; NFCI llvm-svn: 270467	2016-05-23 18:00:50 +00:00
Dan Gohman	b74c5addf0	[WebAssembly] Speed up LiveIntervals updating. Use the more specific LiveInterval::removeSegment instead of LiveInterval::shrinkToUses when we know the specific range that's being removed. llvm-svn: 270463	2016-05-23 17:42:57 +00:00
Krzysztof Parzyszek	3964c84399	[Hexagon] Move some debug-only variable declarations into DEBUG llvm-svn: 270459	2016-05-23 17:31:30 +00:00
Aaron Ballman	aea6907e5f	Removing a switch statement that contains only a default label; NFC. llvm-svn: 270444	2016-05-23 15:52:59 +00:00
Diana Picus	f989b6c3b6	[BPF] Remove exit-on-error flag in test (PR27766) The exit-on-error flag on the many_args1.ll test is needed to avoid an unreachable in BPFTargetLowering::LowerCall. We can also avoid it by ignoring any superfluous arguments to the call (i.e. any arguments after the first 5). Fixes PR27766. Differential Revision: http://reviews.llvm.org/D20471 v2 of r270419 llvm-svn: 270440	2016-05-23 14:57:19 +00:00
Renato Golin	317bd564ab	Reverts "[BPF] Remove exit-on-error flag in test (PR27766)" This patch reverts r270419 because it broke a lot of buildbots, mostly Windows. We'd like help in investigating the issues, but for now, it should stay out. llvm-svn: 270433	2016-05-23 13:02:11 +00:00
Diana Picus	0a143f0f02	[BPF] Remove exit-on-error flag in test (PR27766) The exit-on-error flag on the many_args1.ll test is needed to avoid an unreachable in BPFTargetLowering::LowerCall. We can also avoid it by ignoring any superfluous arguments to the call (i.e. any arguments after the first 5). Fixes PR27766 llvm-svn: 270419	2016-05-23 12:33:34 +00:00
Chris Dewhurst	14488dfcf0	[Sparc] LEON erratum fix - Delay Slot Filler modification. This code should have been with the previous check-in (r270417) and prevents the DelaySlotFiller pass being utilized in functions where the erratum fix has been applied as this will break the run-time code. llvm-svn: 270418	2016-05-23 11:52:28 +00:00
Chris Dewhurst	11fab31200	[Sparc][LEON] LEON Erratum fix. Insert NOP after LD or LDF instruction. Due to an erratum in some versions of LEON, we must insert a NOP after any LD or LDF instruction to ensure the processor has time to load the value correctly before using it. This pass will implement that erratum fix. The code will have no effect for other Sparc, but non-LEON processors. Differential Review: http://reviews.llvm.org/D20353 llvm-svn: 270417	2016-05-23 10:56:36 +00:00
Sam Kolton	59aa17c27c	[AMDGPU] Assembler: refactor parsing of modifiers and immediates. Allow modifiers for imms. Reviewers: nhaustov, tstellarAMD Subscribers: kzhuravl, arsenm Differential Revision: http://reviews.llvm.org/D20166 llvm-svn: 270415	2016-05-23 09:59:02 +00:00
Jacob Baungard Hansen	1036b51c05	Test commit llvm-svn: 270414	2016-05-23 09:41:44 +00:00
Craig Topper	62ac946928	[X86] Use instruction aliases to replace custom asm parser code for optimizing moves to use 2 byte VEX prefix. llvm-svn: 270394	2016-05-23 04:02:27 +00:00
Craig Topper	ae0615175a	[AVX512] Add patterns to implement stores of extracts of least signficant subvectors using XMM or YMM stores instead of the vector extract instructions. Similar is already done for AVX and we had lost it going to AVX512VL. llvm-svn: 270383	2016-05-22 23:44:33 +00:00
Sanjay Patel	b2763427c2	[x86, AVX] don't add a vzeroupper if that's what the code is already doing (PR27823) This isn't the complete fix, but it handles the trivial examples of duplicate vzero* ops in PR27823: https://llvm.org/bugs/show_bug.cgi?id=27823 ...and amusingly, the bogus cases already exist as regression tests, so let's take this baby step. We'll need to do more in the general case where there's legitimate AVX usage in the function + there's already a vzero in the code. Differential Revision: http://reviews.llvm.org/D20477 llvm-svn: 270378	2016-05-22 20:22:47 +00:00
Igor Breger	7450d33b3c	[AVX512] Implement missing patterns for any_extend load lowering. Differential Revision: http://reviews.llvm.org/D20513 llvm-svn: 270357	2016-05-22 10:21:04 +00:00
Craig Topper	40ab0cb8bb	[AVX512] The AVX512 file only need subtract_subvector index 0 patterns where the source is 512-bits. The 256-bit source patterns were redundant with AVX. llvm-svn: 270356	2016-05-22 07:40:58 +00:00
Craig Topper	ec1c660b77	[AVX512] Add an AddedComplexity line to the 512-bit insert_subvector undef index 0 patterns. This gives them higher priority than the memory patterns. This matches AVX1/2. llvm-svn: 270355	2016-05-22 07:40:40 +00:00
Craig Topper	b6b424b279	[AVX512] Change the AddedComplexity on some patterns to match their AVX/SSE equivalents. This helps group them close together in the isel tables and enable table compression. llvm-svn: 270354	2016-05-22 06:09:34 +00:00
Craig Topper	a6980162a3	[AVX512] Add a couple patterns to fix some cases where two vector mask inversions could appear in a row. llvm-svn: 270344	2016-05-22 00:39:30 +00:00
Craig Topper	0754f22fa5	[AVX512] Remove seemingly unnecessary AddedComplexity adjustment. llvm-svn: 270343	2016-05-22 00:39:27 +00:00
Craig Topper	c1c9bee262	[X86] Remove unnecessary alignment check on patterns that use VEXTRACTF128 for integer types when only AVX1 is supported. llvm-svn: 270335	2016-05-21 22:50:18 +00:00
Craig Topper	ff8ed4829f	[AVX512] Add patterns for extracting subvectors and storing to memory. llvm-svn: 270334	2016-05-21 22:50:14 +00:00
Craig Topper	e6ae74c913	[AVX512] Capitalize the Z in VEXTRACTPSzmr. Lowercase z has been primarily used to indicating the zero masking behavior which is not the case here. NFC llvm-svn: 270333	2016-05-21 22:50:11 +00:00

1 2 3 4 5 ...

37826 Commits