llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
James Molloy	4482987575	[ARM64] Port over missing subtarget features, and CPU definitions from AArch64. llvm-svn: 206198	2014-04-14 17:38:00 +00:00
Daniel Sanders	5698dfd860	[mips] Fix fcopysign for MIPS-IV and add the test. Summary: This was another incorrect use of hasMips64() vs isGP64bit(). Depends on D3344 Reviewers: matheusalmeida, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3347 llvm-svn: 206187	2014-04-14 16:24:12 +00:00
Daniel Sanders	b3f0be8f2e	[mips] Fix more incorrect uses of HasMips64 and isMips64() Summary: - Conditional moves acting on 64-bit GPR's should require MIPS-IV rather than MIPS64 - ISD::MUL, and ISD::MULH[US] should be lowered on all 64-bit ISA's Patch by David Chisnall His work was sponsored by: DARPA, AFRL I've added additional testcases to cover as much of the codegen changes affecting MIPS-IV as I can. Where I've been unable to find an existing MIPS64 testcase that can be re-used for MIPS-IV (mainly tests covering ISD::GlobalAddress and similar), I at least agree that MIPS-IV should behave like MIPS64. Further testcases that are fixed by this patch will follow in my next commit. The testcases from that commit that fail for MIPS-IV without this patch are: LLVM :: CodeGen/Mips/2010-07-20-Switch.ll LLVM :: CodeGen/Mips/cmov.ll LLVM :: CodeGen/Mips/eh-dwarf-cfa.ll LLVM :: CodeGen/Mips/largeimmprinting.ll LLVM :: CodeGen/Mips/longbranch.ll LLVM :: CodeGen/Mips/mips64-f128.ll LLVM :: CodeGen/Mips/mips64directive.ll LLVM :: CodeGen/Mips/mips64ext.ll LLVM :: CodeGen/Mips/mips64fpldst.ll LLVM :: CodeGen/Mips/mips64intldst.ll LLVM :: CodeGen/Mips/mips64load-store-left-right.ll LLVM :: CodeGen/Mips/sint-fp-store_pattern.ll Reviewers: dsanders Reviewed By: dsanders CC: matheusalmeida Differential Revision: http://reviews.llvm.org/D3343 llvm-svn: 206183	2014-04-14 15:44:42 +00:00
Tim Northover	1ddf6e933e	ARM64: remove buggy REV16 pattern. The 32-bit pattern is still valid: 0123 -> 3210 -> 1032. llvm-svn: 206172	2014-04-14 12:59:52 +00:00
Tim Northover	837be1f97d	AArch64/ARM64: enable directcond.ll test on ARM64. Code change is because optimizeCompareInstr didn't know how to pull the condition code out of FCSEL instructions. llvm-svn: 206171	2014-04-14 12:51:06 +00:00
Tim Northover	3a59e7c449	ARM64: add patterns for csXYZ with reversed operands. AArch64 tests for this, and it's obviously a good idea. Have to invert the condition code, of course. llvm-svn: 206170	2014-04-14 12:51:02 +00:00
Tim Northover	0f5179b30d	ARM64: add support for AArch64's addsub_ext.ll There was one definite issue in ARM64 (the off-by-1 check for whether a shift could be folded in) and one difference that is probably correct: ARM64 didn't fold nodes with multiple uses into the arithmetic operations unless optimising for code size. llvm-svn: 206168	2014-04-14 12:50:50 +00:00
Tim Northover	614708ff8e	ARM64: optimise (cmp x, (sub 0, y)) to (cmn x, y). This transformation is only valid when being used for an EQ or NE comparison since the flags change otherwise. llvm-svn: 206167	2014-04-14 12:50:47 +00:00
Richard Osborne	6d5512a94e	[XCore] Don't create invalid MKMSK instructions inside loadImmediate(). Summary: Previously loadImmediate() would produce MKMSK instructions with invalid immediate values such as mkmsk r0, 9. Fix this by checking the mask size is valid. Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3289 llvm-svn: 206163	2014-04-14 12:30:35 +00:00
Hal Finkel	c4a623f8d4	[PowerPC] [Constant Hoisting] Enable constant hoisting on PPC Implements the various TTI functions to enable constant hoisting on PPC. The only significant test-suite change is this: MultiSource/Benchmarks/VersaBench/bmm/bmm - 20% speedup (which essentially reverses the slowdown from r206120). llvm-svn: 206141	2014-04-13 23:02:40 +00:00
Hal Finkel	4da0e32e2a	[PowerPC] Fix rlwimi isel when mask is not constant We had been using the known-zero values of the operand of the or to construct the mask for an rlwimi; this is not quite correct, but fine when the mask is constant. When the mask is constant, then the known zeros of the operand must be a superset of the zeros in the mask. However, when the mask is not a constant, then there might be bits in the operand that are not known to be zero that, at runtime, might be zero in the mask. Therefore, we check that any bits not known to be zero are known to be one in the mask. Otherwise, we can't fold the mask with the or and shift. This was revealed as a miscompile of MultiSource/Benchmarks/BitBench/drop3/drop3 when I started experimenting with constant hoisting. llvm-svn: 206136	2014-04-13 17:10:58 +00:00
David Blaikie	cedce0ba4f	Fix instruction debug info location during legalization I found this from a particular GDB test suite case of inlining (something similar is provided as a test case) but came across a few other related cases (other callers of the same functions, and one other instance of the same coding mistake in a separate function). I'm not sure what the best way to test this is (let alone to cover the other cases I discovered), so hopefully this suffices - open to ideas. llvm-svn: 206130	2014-04-13 06:39:55 +00:00
Lang Hames	f72ca1f7d5	[X86] unique_ptr'ify one of X86GenericDisassembler's members. llvm-svn: 206127	2014-04-13 04:09:16 +00:00
Hal Finkel	a1849e7ac8	[PowerPC] Implement some additional TLI callbacks Add implementations of: bool isLegalICmpImmediate(int64_t Imm) const bool isLegalAddImmediate(int64_t Imm) const bool isTruncateFree(Type Ty1, Type Ty2) const bool isTruncateFree(EVT VT1, EVT VT2) const bool shouldConvertConstantLoadToIntImm(const APInt &Imm, Type *Ty) const Unfortunately, this regresses counter-register-based loop formation because some of the loops now end up in forms where SE cannot compute loop counts. Nevertheless, the test-suite results favor committing: SingleSource/Benchmarks/BenchmarkGame/puzzle: 26% speedup MultiSource/Benchmarks/FreeBench/analyzer/analyzer: 21% speedup MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan: 20% speedup SingleSource/Benchmarks/Polybench/linear-algebra/kernels/trisolv/trisolv: 19% speedup SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gesummv/gesummv: 15% speedup MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2: 2% speedup MultiSource/Benchmarks/VersaBench/bmm/bmm: 26% slowdown llvm-svn: 206120	2014-04-12 21:52:38 +00:00
Benjamin Kramer	e5a211292c	Spell the specialization namespace correctly. Not sure why clang didn't diagnose this (GCC does). llvm-svn: 206117	2014-04-12 18:45:24 +00:00
Benjamin Kramer	224e38407d	Make helper static and place random global into the llvm namespace. llvm-svn: 206116	2014-04-12 18:39:57 +00:00
Benjamin Kramer	f6c0615b06	Retire llvm::array_endof in favor of non-member std::end. While there make array_lengthof constexpr if we have support for it. llvm-svn: 206112	2014-04-12 16:15:53 +00:00
Juergen Ributzka	23e31a5f36	[ARM64] Never hoist the shift value of a shift instruction. There is no need to check if we want to hoist the immediate value of a shift instruction. Simply return TCC_Free right away. llvm-svn: 206101	2014-04-12 02:53:51 +00:00
Juergen Ributzka	2cdca434db	[ARM64] Fix the cost model for cheap large constants. Originally the cost model would give up for large constants and just return the maximum cost. This is not what we want for constant hoisting, because some of these constants are large in bitwidth, but are still cheap to materialize. This commit fixes the cost model to either return TCC_Free if the cost cannot be determined, or accurately calculate the cost even for large constants (bitwidth > 128). This fixes <rdar://problem/16591573>. llvm-svn: 206100	2014-04-12 02:36:28 +00:00
Jim Grosbach	51592aeaa3	X86: Remove TargetMachine CPU auto-detection. This logic is properly in the realm of whatever is creating the TargetMachine. This makes plain 'llc foo.ll' consistent across heterogenous machines. llvm-svn: 206094	2014-04-12 01:34:29 +00:00
Chad Rosier	f94fdcad79	[AArch64] Implement the isLegalAddressingMode and getScalingFactorCost APIs. llvm-svn: 206089	2014-04-12 00:14:23 +00:00
Louis Gerbarg	e84c6abeec	Add ARM64 CLS patterns This patch adds patterns to generate the cls instruction for ARM64. Includes tests for 64 bit and 32 bit operands. rdar://15611957 llvm-svn: 206079	2014-04-11 22:27:58 +00:00
Matt Arsenault	63e365489a	R600: Check if a sextload should be used for parameter loads. Through some oddity where truncate (sextload x) isn't folded into an anyextload for vectors, the sextload remains if the vector isn't immediately scalarized. This keeps the expected zextload instructions in the kernel-args test when small type vectors aren't scalarized. llvm-svn: 206070	2014-04-11 20:59:54 +00:00
Lang Hames	960e8b3cb0	Remove redundant symbolization support from MCDisassembler interface. MCDisassembler has an MCSymbolizer member that is meant to take care of symbolizing during disassembly, but it also has several methods that enable the disassembler to do symbolization internally (i.e. without an attached symbolizer object). There is no need for this duplication, but ARM64 had been making use of it. This patch moves the ARM64 symbolization logic out of ARM64Disassembler and into an ARM64ExternalSymbolizer class, and removes the duplicated MCSymbolizer functionality from the MCDisassembler interface. Symbolization will now be done exclusively through MCSymbolizers. There should be no impact on disassembly for any platform, but this allows us to tidy up the MCDisassembler interface and simplify the process of (and invariants related to) disassembler setup. llvm-svn: 206063	2014-04-11 20:07:58 +00:00
Matt Arsenault	e7a5c69ed5	R600/SI: Refactor SOPC classes slightly. Better match what is done for VOPC to eventually prefer selecting these. llvm-svn: 206048	2014-04-11 19:25:18 +00:00
Matt Arsenault	65fde80ac6	Move ExtractVectorElements to SelectionDAG. This seems generally useful, and makes sense to go along with SplitVector. llvm-svn: 206041	2014-04-11 17:47:30 +00:00
David Blaikie	1573e6e09f	Implement depth_first and inverse_depth_first range factory functions. Also updated as many loops as I could find using df_begin/idf_begin - strangely I found no uses of idf_begin. Is that just used out of tree? Also a few places couldn't use df_begin because either they used the member functions of the depth first iterators or had specific ordering constraints (I added a comment in the latter case). Based on a patch by Jim Grosbach. (Jim - you just had iterator_range<T> where you needed iterator_range<idf_iterator<T>>) llvm-svn: 206016	2014-04-11 01:50:01 +00:00
Jim Grosbach	1560d0c9a4	[ARM64,C++11] Range'ify use-lists iterators in address type promotion. llvm-svn: 206013	2014-04-11 01:13:10 +00:00
Jim Grosbach	351004c3a4	[ARM64,C++11]: Range'ify use-list iterators in DAGToDAG. llvm-svn: 206007	2014-04-11 00:27:22 +00:00
Jim Grosbach	cbc54c6ebd	[ARM64,C++11]: More range-based loop simplification. llvm-svn: 206006	2014-04-11 00:27:19 +00:00
Reid Kleckner	f99741400f	Move the segmented stack switch to a function attribute This removes the -segmented-stacks command line flag in favor of a per-function "split-stack" attribute. Patch by Luqman Aden and Alex Crichton! llvm-svn: 205997	2014-04-10 22:58:43 +00:00
Jim Grosbach	0cd1c34275	[ARM64,C++11]: Range'ify loops in InstrInfo. llvm-svn: 205992	2014-04-10 22:00:18 +00:00
Jim Grosbach	24a37167ab	[ARM64,C++11]: Range'ify loops in the conditional-compare pass. llvm-svn: 205988	2014-04-10 21:49:24 +00:00
Kevin Enderby	7eca8cdb16	For the ARM integrated assembler add checking of the alignments on vld/vst instructions. And report errors for alignments that are not supported. While this is a large diff and a big test case, the changes are very straightforward. But pretty much had to touch all vld/vst instructions changing the addrmode to one of the new ones that were added, which will do the proper checking for the specific instruction. FYI, re-committing this with a tweak so MemoryOp's default constructor is trivial and will work with MSVC 2012. Thanks to Reid Kleckner and Jim Grosbach for help with the tweak. rdar://11312406 llvm-svn: 205986	2014-04-10 20:18:58 +00:00
Daniel Sanders	1f079bb11c	[mips] NotMips64 predicate is really a test for 32-bit GPR's. Summary: Similarly, the HasMips64 on the 64-bit move InstAlias is a test for 64-bit GPR's. No functional change. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3263 llvm-svn: 205968	2014-04-10 15:00:28 +00:00
Daniel Sanders	d6fb64ca58	[mips] Switch the MIPS-III and MIPS-IV assembler tests to use -mcpu=mips4. Summary: It is now the smallest superset for these ISA's. FeatureMips4 now contains FeatureFPIdx since [ls][dw]xc1 were added in MIPS-IV. Made the FPIdx feature bit lowercase so that it can be used in the -mattr option. Depends on D3274 Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3275 llvm-svn: 205964	2014-04-10 13:16:49 +00:00
NAKAMURA Takumi	29db5b1f13	ARM64/*/LLVMBuild.txt: Prune redundant deps. llvm-svn: 205963	2014-04-10 12:46:13 +00:00
NAKAMURA Takumi	d13a996dc0	LLVMBuild.txt: Add missing dependencies. llvm-svn: 205962	2014-04-10 11:16:47 +00:00
NAKAMURA Takumi	b6baf3b8a1	LLVMBuild.txt: Reformat. llvm-svn: 205961	2014-04-10 11:16:17 +00:00
NAKAMURA Takumi	5041f185fc	Fix abuse of StringRef on ARM64SysReg::MRSMapper::toString(Val, Valid). FIXME: Could we use SmallString here? llvm-svn: 205950	2014-04-10 03:05:59 +00:00
Saleem Abdulrasool	ee8a89e416	ARM64: add an explicit cast to silence a silly warning GCC 4.8 complains with: warning: enumeral and non-enumeral type in conditional expression Although this is silly and harmless in this case, add an explicit cast to silence the warning. llvm-svn: 205949	2014-04-10 02:48:10 +00:00
Juergen Ributzka	c19a8638e5	[ARM64] Fix immediate cost calculation for types larger than i64. The immediate cost calculation code was hitting an assertion in the included test case, because APInt was still internally 128-bits. Truncating it to 64-bits fixed the issue. Fixes <rdar://problem/16572521>. llvm-svn: 205947	2014-04-10 01:36:59 +00:00
Reid Kleckner	52c8f727f3	Revert "For the ARM integrated assembler add checking of the alignments on vld/vst instructions. And report errors for alignments that are not supported." It doesn't build with MSVC 2012, because MSVC doesn't allow union members that have non-trivial default constructors. This change added 'SMLoc AlignmentLoc' to MemoryOp, which made MemoryOp's default ctor non-trivial. This reverts commit r205930. llvm-svn: 205944	2014-04-10 00:52:14 +00:00
Jim Grosbach	366b84c436	Add support for load folding of avx1 logical instructions AVX supports logical operations using an operand from memory. Unfortunately because integer operations were not added until AVX2 the AVX1 logical operation's types were preventing the isel from folding the loads. In a limited number of cases the peephole optimizer would fold the loads, but most were missed. This patch adds explicit patterns with appropriate casts in order for these loads to be folded. The included test cases run on reduced examples and disable the peephole optimizer to ensure the folds are being pattern matched. Patch by Louis Gerbarg <lgg@apple.com> rdar://16355124 llvm-svn: 205938	2014-04-09 23:39:25 +00:00
Kevin Enderby	a2db407730	For the ARM integrated assembler add checking of the alignments on vld/vst instructions. And report errors for alignments that are not supported. While this is a large diff and a big test case, the changes are very straightforward. But pretty much had to touch all vld/vst instructions changing the addrmode to one of the new ones that were added, which will do the proper checking for the specific instruction. rdar://11312406 llvm-svn: 205930	2014-04-09 21:32:59 +00:00
Chad Rosier	0a639ba5a6	[AArch64] Implement the isZExtFree APIs. llvm-svn: 205926	2014-04-09 20:51:21 +00:00
Chad Rosier	b6bf098390	[AArch64] Implement the isTruncateFree API. In AArch64 i64 to i32 truncate operation is a subregister access. This allows more opportunities for LSR optimization to eliminate variables of different types (i32 and i64). llvm-svn: 205925	2014-04-09 20:43:40 +00:00
Bob Wilson	9cddd45364	Simple fix for build failures resulting from r205867. llvm-svn: 205918	2014-04-09 18:34:45 +00:00
Justin Holewinski	b035f9f3e4	[NVPTX] Add preliminary intrinsics and codegen support for textures/surfaces This commit adds intrinsics and codegen support for the surface read/write and texture read instructions that take an explicit sampler parameter. Codegen operates on image handles at the PTX level, but falls back to direct replacement of handles with kernel arguments if image handles are not enabled. Note that image handles are explicitly disabled for all target architectures in this change (to be enabled later). llvm-svn: 205907	2014-04-09 15:39:15 +00:00
Justin Holewinski	80f133a62c	[NVPTX] Add support for addrspacecast in global variable initializers, including emitting generic() when casting to address space 0. llvm-svn: 205906	2014-04-09 15:39:11 +00:00
Justin Holewinski	256f672482	[NVPTX] Add query support for read-write images and managed variables This also fixes a bug in the annotation cache where the cache will not be cleared between modules if multiple modules are compiled in the same process. llvm-svn: 205905	2014-04-09 15:38:52 +00:00
Alp Toker	111bd28e59	Fix some doc and comment typos llvm-svn: 205899	2014-04-09 14:47:27 +00:00
Bradley Smith	0ce8c1944a	[ARM64] Change SYS without a register to an alias to make disassembling more consistent. llvm-svn: 205898	2014-04-09 14:44:58 +00:00
Bradley Smith	94dd081d21	[ARM64] Correctly disassemble ISB operand as ISB not DBarrier. llvm-svn: 205897	2014-04-09 14:44:54 +00:00
Bradley Smith	9253af77f9	[ARM64] Properly support both apple and standard syntax for FMOV llvm-svn: 205896	2014-04-09 14:44:49 +00:00
Bradley Smith	f3c6d7d337	[ARM64] Flag setting logical/add/sub immediate instructions don't use SP. llvm-svn: 205895	2014-04-09 14:44:44 +00:00
Bradley Smith	b0ff0afe88	[ARM64] Conditional branches must always print their condition code, even AL. llvm-svn: 205894	2014-04-09 14:44:39 +00:00
Bradley Smith	d50126665d	[ARM64] Fix disassembly logic for extended loads/stores with 32-bit registers. llvm-svn: 205893	2014-04-09 14:44:36 +00:00
Bradley Smith	a616f00fd7	[ARM64] When printing a pre-indexed address with #0 , the ', #0 ' is not optional. llvm-svn: 205892	2014-04-09 14:44:31 +00:00
Bradley Smith	7002e863ee	[ARM64] Add missing shifted register MVN alias to ORN llvm-svn: 205891	2014-04-09 14:44:26 +00:00
Bradley Smith	c5c6cabc6d	[ARM64] SXTW/UXTW are only valid aliases for 32-bit operations. llvm-svn: 205890	2014-04-09 14:44:22 +00:00
Bradley Smith	da19fd8b49	[ARM64] Fix canonicalisation of MOVs. MOV is too complex to be modelled by a dumb alias. llvm-svn: 205889	2014-04-09 14:44:18 +00:00
Bradley Smith	ce010e57ab	[ARM64] Fixup ADR/ADRP parsing such that they accept immediates and all label types llvm-svn: 205888	2014-04-09 14:44:12 +00:00
Bradley Smith	f4dd60e27e	[ARM64] Ensure sp is decoded as SP, not XZR in LD1 instructions. llvm-svn: 205887	2014-04-09 14:44:07 +00:00
Bradley Smith	6fa2739377	[ARM64] Tighten up the special casing in emitting arithmetic extends. UXTW should only be translated when the instruction uses WSP, not SP. Vice versa for UXTX and 64-bit instructions. llvm-svn: 205886	2014-04-09 14:44:03 +00:00
Bradley Smith	c33112b3a6	[ARM64] Rename LR to the UAL-compliant 'X30'. llvm-svn: 205885	2014-04-09 14:43:59 +00:00
Bradley Smith	8923d46955	[ARM64] Rename FP to the UAL-compliant 'X29'. llvm-svn: 205884	2014-04-09 14:43:50 +00:00
Bradley Smith	12bcb7711a	[ARM64] Add a PostEncoderMethod to FCMP - the Rm field should canonically be zero but should be decoded/disassembled with any value. llvm-svn: 205883	2014-04-09 14:43:40 +00:00
Bradley Smith	7a63e7691e	[ARM64] SCVTF and FCVTZS/U are undefined if scale<5> == 0. llvm-svn: 205882	2014-04-09 14:43:35 +00:00
Bradley Smith	a41c6988db	[ARM64] EXT and EXTR instructions on v8i8 and W regs respectively must have the top bit of their immediate clear. llvm-svn: 205881	2014-04-09 14:43:31 +00:00
Bradley Smith	38e65e2910	[ARM64] Scaled fixed-point FCVTZSs should also have bit 29 set to zero. llvm-svn: 205880	2014-04-09 14:43:27 +00:00
Bradley Smith	5d3f0b60b1	[ARM64] UBFM/BFM is undefined on w registers when imms<5> or immr<5> is 1. llvm-svn: 205879	2014-04-09 14:43:24 +00:00
Bradley Smith	1828a1cae0	[ARM64] Floating point to fixed point scaled conversions are only available on fcvtzs and fcvtzu. llvm-svn: 205878	2014-04-09 14:43:20 +00:00
Bradley Smith	b7310fe4cb	[ARM64] Port over the PostEncoderMethod fix for SMULH/UMULH from AArch64. llvm-svn: 205877	2014-04-09 14:43:15 +00:00
Bradley Smith	5b9ef7909e	[ARM64] Add missing tlbi operands and error for extra/missing register on tlbi aliases. llvm-svn: 205876	2014-04-09 14:43:11 +00:00
Bradley Smith	92cc212005	[ARM64] Rework system register parsing to overcome SPSel clash in MSR variants. llvm-svn: 205875	2014-04-09 14:43:06 +00:00
Bradley Smith	ac62b533a8	[ARM64] Port over the PostEncoderMethod from AArch64 for exclusive loads and stores, so the unused register fields are set to all-ones canonically but are recognised with any value. llvm-svn: 205874	2014-04-09 14:43:01 +00:00
Bradley Smith	d4bc8aed04	[ARM64] Use PStateMapper to ensure that MSRcpsr operands are validated during disassembly. llvm-svn: 205873	2014-04-09 14:42:56 +00:00
Bradley Smith	7a8ebd7b93	[ARM64] Remove PrefetchOp and use ARM64PRFM instead. llvm-svn: 205872	2014-04-09 14:42:53 +00:00
Bradley Smith	c795ca6141	[ARM64] Add WZR to isGPR32Register, since every use needs to check for this anyway. llvm-svn: 205871	2014-04-09 14:42:49 +00:00
Bradley Smith	b4b24c5ace	[ARM64] Remove ARM64SYS. llvm-svn: 205870	2014-04-09 14:42:45 +00:00
Bradley Smith	c997adc65f	[ARM64] Move CPSRField and DBarrier operands over to AArch64-style disassembly and assembly. This removes the last users of namespace ARM64SYS. llvm-svn: 205869	2014-04-09 14:42:42 +00:00
Bradley Smith	e1a3984e57	[ARM64] Switch the decoder, disassembler, instprinter and asmparser over to using AArch64-style system registers, and fix up test failures discovered in the process. llvm-svn: 205868	2014-04-09 14:42:36 +00:00
Bradley Smith	6a073e5a8f	[ARM64] Move ARM64BaseInfo.{cpp,h} into a Utils/ subdirectory, a la AArch64. These files are required in the decoder, disassembler and parser, and a layering violation was imminent. llvm-svn: 205867	2014-04-09 14:42:27 +00:00
Bradley Smith	1f2b3f2200	[ARM64] Copy the named immediate operand mapping logic and enums from AArch64. AArch64's named immediate mapping and parsing is much more advanced than ARM64's. No functionality change - they're currently living side by side while I switch uses over. llvm-svn: 205866	2014-04-09 14:42:16 +00:00
Bradley Smith	3450b9ab8d	[ARM64] Shifted register ALU ops are reserved if sf=0 and imm6<5>=1, and also (for add/sub only) if shift=11. llvm-svn: 205865	2014-04-09 14:42:11 +00:00
Bradley Smith	1414cc8e91	[ARM64] Add support for NV condition code (exists only for valid assembly/disassembly, equivalent to AL) llvm-svn: 205864	2014-04-09 14:42:07 +00:00
Bradley Smith	736d891c7c	[ARM64] Add missing 1Q -> 1q vector kind alias llvm-svn: 205863	2014-04-09 14:42:01 +00:00
Bradley Smith	731e82da35	[ARM64] Add parsing for vector lists such as {v0.8b-v3.8b} llvm-svn: 205862	2014-04-09 14:41:58 +00:00
Bradley Smith	7dde0bc1bc	[ARM64] Correctly alias LSL to UXTW for 32bit instruction variants, rather than UXTX llvm-svn: 205861	2014-04-09 14:41:53 +00:00
Bradley Smith	b19c89eadc	[ARM64] STRHro and STRBro were not being decoded at all. llvm-svn: 205860	2014-04-09 14:41:49 +00:00
Bradley Smith	dccf46dc88	[ARM64] MOVK with sf=0 and hw<1>=1 is unallocated. Shift amount for ADD/SUB instructions is unallocated if shift > 4. llvm-svn: 205859	2014-04-09 14:41:45 +00:00
Bradley Smith	06234dd176	[ARM64] Register-offset loads and stores with the 'option' field equal to 00x or 10x are undefined. llvm-svn: 205858	2014-04-09 14:41:38 +00:00
Elena Demikhovsky	56ab81fd87	AVX-512: insert element to mask vector; store i1 data Implemented INSERT_VECTOR_ELT operation for v16i1 and v8i1 vectors; Implemented "store" for i1 type llvm-svn: 205850	2014-04-09 12:37:50 +00:00
Daniel Sanders	00ab14e29e	Re-commit: [mips] abs.[ds], and neg.[ds] should be allowed regardless of -enable-no-nans-fp-math Summary: They behave in accordance with the Has2008 and ABS2008 configuration bits of the processor which are used to select between the 1985 and 2008 versions of IEEE 754. In 1985 mode, these instructions are arithmetic (i.e. they raise invalid operation exceptions when given NaN), in 2008 mode they are non-arithmetic (i.e. they are copies). nmadd.[ds], and nmsub.[ds] are still subject to -enable-no-nans-fp-math because the ISA spec does not explicitly state that they obey Has2008 and ABS2008. Fixed the issue with the previous version of this patch (r205628). A pre-existing 'let Predicate =' statement was removing some predicates that were necessary for FP64 to behave correctly. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3274 llvm-svn: 205844	2014-04-09 09:56:43 +00:00
Matt Arsenault	ffd08a2504	R600/SI: Match not instruction. llvm-svn: 205837	2014-04-09 07:16:16 +00:00
Tim Northover	06d767ccba	ARM64: scalarize v1i64 mul operation This is the second part of fixing PR19367. llvm-svn: 205836	2014-04-09 07:07:02 +00:00
Tim Northover	2dd1fcef1d	ARM64: add pattern for <1 x i64> custom not node. This should fix PR19367. llvm-svn: 205835	2014-04-09 06:55:39 +00:00
Saleem Abdulrasool	31abd6d01f	ARM MC: 80 column llvm-svn: 205833	2014-04-09 06:18:26 +00:00
Saleem Abdulrasool	b8c4915b58	ARM MC: sort source files in CMakeLists llvm-svn: 205832	2014-04-09 06:18:23 +00:00
Craig Topper	52173239da	[C++11] Replace some comparisons with 'nullptr' with simple boolean checks to reduce verbosity. llvm-svn: 205829	2014-04-09 04:20:00 +00:00
Juergen Ributzka	ef9790c95c	[Constant Hoisting][ARM64] Enable constant hoisting for ARM64. This implements the target-hooks for ARM64 to enable constant hoisting. This fixes <rdar://problem/14774662> and <rdar://problem/16381500>. llvm-svn: 205791	2014-04-08 20:39:59 +00:00
Hal Finkel	0e97afbdb4	[PowerPC] Don't return false from PPC::isVSLDOIShuffleMask PPC::isVSLDOIShuffleMask should return -1, not false, when the shuffle predicate should be false. Noticed by inspection; no test case (yet). llvm-svn: 205787	2014-04-08 19:00:27 +00:00
Kevin Enderby	699a083c54	Fix the ARM VLD3 (single 3-element structure to all lanes) size 16 double-spaced registers instruction printing. This: vld3.16 {d0[], d2[], d4[]}, [r4]! was being printed as: vld3.16 {d0[], d1[], d2[]}, [r4]! rdar://16531387 llvm-svn: 205779	2014-04-08 18:00:52 +00:00
NAKAMURA Takumi	99b861df4d	X86MCAsmInfoGNUCOFF: Set PointerSize as 8 for targeting x64. It caused DW_LNE_set_address to be misemitted on x64. FIXME: I haven't investigated whether CalleeSaveStackSlotSize should be 8. llvm-svn: 205772	2014-04-08 15:28:50 +00:00
Tim Northover	0fdbe96fff	ARM64: fix fmsub patterns which assumed accum operand was first Confusingly, the NEON fmla instructions put the accumulator first but the scalar versions put it at the end (like the fma lib function & LLVM's intrinsic). This should fix PR19345, assuming there's only one issue. llvm-svn: 205758	2014-04-08 12:23:51 +00:00
Elena Demikhovsky	73f5b6faba	AVX-512: Added fp_to_uint and uint_to_fp patterns. llvm-svn: 205754	2014-04-08 07:24:02 +00:00
David Majnemer	5a37407a86	X86: Split the relocation selection up Before, we would have conditional operators where one side of the operator would be of type RelocationTypeAMD64 and the other is of type RelocationTypeI386. GCC would noisily warn with a -Wenum-compare diagnostic. Instead, refactor the code so it is more like the X86 ELF object writer. llvm-svn: 205752	2014-04-08 02:15:13 +00:00
Jim Grosbach	6d4f9711f8	Tidy up comments a bit. Punctuation, grammar, formatting, etc.. llvm-svn: 205749	2014-04-07 23:47:23 +00:00
Jim Grosbach	d60c669b41	ARM64: Range based for loop in ARM64PromoteConstant pass llvm-svn: 205748	2014-04-07 23:47:21 +00:00
Jim Grosbach	62557551a4	ARM64: Clean up file header comment a bit. llvm-svn: 205747	2014-04-07 23:14:38 +00:00
Reed Kotler	2b01770807	Reverting commit r205628 due to mips64 issues. llvm-svn: 205741	2014-04-07 22:11:40 +00:00
Tom Stellard	5e0d95bd87	R600/SI: Handle INSERT_SUBREG in SIFixSGPRCopies llvm-svn: 205732	2014-04-07 19:45:45 +00:00
Tom Stellard	557024a30d	R600: Match 24-bit arithmetic patterns in a Target DAGCombine Moving these patterns from TableGen files to PerformDAGCombine() should allow us to generate better code by eliminating unnecessary shifts and extensions earlier. This also fixes a bug where the MAD pattern was calling SimplifyDemandedBits with a 24-bit mask on the first operand even when the full pattern wasn't being matched. This occasionally resulted in some instructions being incorrectly deleted from the program. v2: - Fix bug with 64-bit mul llvm-svn: 205731	2014-04-07 19:45:41 +00:00
Tom Stellard	607fdb662d	R600: Replace dyn_cast + assert with cast llvm-svn: 205730	2014-04-07 19:31:13 +00:00
Matt Arsenault	518774e8c9	Use std::swap llvm-svn: 205723	2014-04-07 16:44:26 +00:00
Matt Arsenault	625d4b3956	Use .data() instead of &x[0] llvm-svn: 205722	2014-04-07 16:44:24 +00:00
David Blaikie	3b8c0a19e7	MachineInstr: introduce explicit_operands and implicit_operands ranges Makes iteration over implicit and explicit machine operands more explicit (har har). Inspired by code review discussion for r205565. llvm-svn: 205680	2014-04-05 22:42:04 +00:00
Saleem Abdulrasool	83256a5d2d	ARM: consolidate MachO checks for ARM asm parser This consolidates the duplicated MachO checks in the directive parsing for various directives that are unsupported for Mach-O. The error message change is unimportant as this restores the behaviour to that prior to the addition of the new directive handling. Furthermore, use a more direct check for MachO targeting rather than an indirect feature check of the assembler. Also simplify the test execution command to avoid temporary files. Furthermore, perform the check in both object and assembly emission. Whether all non-applicable directives are handled is another question. .fnstart is marked as being unsupported, however, the complementary .fnend is not. The additional unwinding directives are also still honoured. This change does not change that, though, it would be good to validate and mark them as being unsupported if they are unsupported for the MachO emission. llvm-svn: 205678	2014-04-05 22:09:51 +00:00
Hal Finkel	216842b276	[PowerPC] Remove unused TM member variable to unbreak build Fix "error: private field 'TM' is not used [-Werror,-Wunused-private-field]" llvm-svn: 205660	2014-04-05 00:16:28 +00:00
Hal Finkel	ade2d32df0	[PowerPC] Adjust load/store costs in PPCTTI This provides more realistic costs for the insert/extractelement instructions (which are load/store pairs), accounts for the cheap unaligned Altivec load sequence, and for unaligned VSX load/stores. Bad news: MultiSource/Applications/sgefa/sgefa - 35% slowdown (this will require more investigation) SingleSource/Benchmarks/McGill/queens - 20% slowdown (we no longer vectorize this, but it was a constant store that was scalarized) MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2 - 2% slowdown Good news: SingleSource/Benchmarks/Shootout/ary3 - 54% speedup SingleSource/Benchmarks/Shootout-C++/ary - 40% speedup MultiSource/Benchmarks/Ptrdist/ks/ks - 35% speedup MultiSource/Benchmarks/FreeBench/neural/neural - 30% speedup MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt - 20% speedup Unfortunately, estimating the costs of the stack-based scalarization sequences is hard, and adjusting these costs is like a game of whac-a-mole :( I'll revisit this again after we have better codegen for vector extloads and truncstores and unaligned load/stores. llvm-svn: 205658	2014-04-04 23:51:18 +00:00
Hal Finkel	e6580e744f	[PowerPC] PPCTTI Cleanup Remove the declaration of an unimplemented function. llvm-svn: 205657	2014-04-04 23:51:11 +00:00
Matt Arsenault	7b6a70a9cf	Add DAG parameter to ComputeNumSignBitsForTargetNode This way, you can check the number of sign bits in the operands. The depth parameter it already has is pretty useless without this. llvm-svn: 205649	2014-04-04 20:13:13 +00:00
Matt Arsenault	dacc649f1d	Fix tabs llvm-svn: 205648	2014-04-04 20:13:08 +00:00
Kai Nacke	96b8da430d	[mips] Add Octeon cnMips instructions seqi/snei and v3mulu/vmm0/vmulu. This patch adds the Octeon cnMips instructions seqi/snei and v3mulu/vmm0/vmulu. It is only for the assembler. Test case is included. Reviewed by: Daniel.Sanders@imgtec.com llvm-svn: 205631	2014-04-04 16:21:59 +00:00
Hal Finkel	e63f5074c7	[PowerPC] Add a full condition code register to make the "cc" clobber work gcc inline asm supports specifying "cc" as a clobber of all condition registers. Add just enough modeling of the full register to make this work. Fixed PR19326. llvm-svn: 205630	2014-04-04 15:15:57 +00:00
Daniel Sanders	66ab94b282	[mips] abs.[ds], and neg.[ds] should be allowed regardless of -enable-no-nans-fp-math Summary: They behave in accordance with the Has2008 and ABS2008 configuration bits of the processor which are used to select between the 1985 and 2008 versions of IEEE 754. In 1985 mode, these instructions are arithmetic (i.e. they raise invalid operation exceptions when given NaN), in 2008 mode they are non-arithmetic (i.e. they are copies). nmadd.[ds], and nmsub.[ds] are still subject to -enable-no-nans-fp-math because the ISA spec does not explicitly state that they obey Has2008 and ABS2008. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3274 llvm-svn: 205628	2014-04-04 14:52:54 +00:00
Tim Northover	421793ce9a	ARM64: handle v1i1 types arising from setcc properly. There were several overlapping problems here, and this solution is closely inspired by the one adopted in AArch64 in r201381. Firstly, scalarisation of v1i1 setcc operations simply fails if the input types are legal. This is fixed in LegalizeVectorTypes.cpp this time, and allows AArch64 code to be simplified slightly. Second, vselect with such a setcc feeding into it ends up in ScalarizeVectorOperand, where it's not handled. I experimented with an implementation, but found that whatever DAG came out was rather horrific. I think Hao's DAG combine approach is a good one for quality, though there are edge cases it won't catch (to be fixed separately). Should fix PR19335. llvm-svn: 205625	2014-04-04 14:49:21 +00:00
Stepan Dyatkovskiy	c02dc549f6	Fix for PR18921 (LDRD/STRD part): Removed "GNU Assembler extension (compatibility)" definitions from ARMInstrInfo.td. Fixed ARMAsmParser::ParseInstruction GNU compatibility branch, so it now also works for Thumb mode. Added new tests. llvm-svn: 205622	2014-04-04 10:17:56 +00:00
Tim Northover	d1d0ccfca1	ARM64: use regalloc-friendly COPY_TO_REGCLASS for bitcasts The previous patterns directly inserted FMOV or INS instructions into the DAG for scalar_to_vector & bitconvert patterns. This is horribly inefficient and can generate lots more GPR <-> FPR register traffic than necessary. It's much better to emit instructions the register allocator understands so it can coalesce the copies when appropriate. It led to at least one ISelLowering hack to avoid the problems, which was incorrect for v1i64 (FPR64 has no dsub). It can now be removed entirely. This should also fix PR19331. llvm-svn: 205616	2014-04-04 09:03:09 +00:00
Tim Northover	95abd1f95a	ARM64: add 128-bit MLA operations to the custom selection code. Without this change, the llvm_unreachable kicked in. The code pattern being spotted is rather non-canonical for 128-bit MLAs, but it can happen and there's no point in generating sub-optimal code for it just because it looks odd. Should fix PR19332. llvm-svn: 205615	2014-04-04 09:03:02 +00:00
Stepan Dyatkovskiy	3d7016986d	Fixed register class in STRD instruction for Thumb2 mode. llvm-svn: 205612	2014-04-04 08:14:13 +00:00
Craig Topper	694437e2ef	Make consistent use of MCPhysReg instead of uint16_t throughout the tree. llvm-svn: 205610	2014-04-04 05:16:06 +00:00
Jim Grosbach	8f614ea1a4	ARM: Range based for-loop over block predecessors. No functional change. llvm-svn: 205604	2014-04-04 02:11:03 +00:00
Jim Grosbach	ba2a01b6d7	ARM: Use range-based for loops in frame lowering. No functional change. llvm-svn: 205602	2014-04-04 02:10:55 +00:00
Quentin Colombet	419aeb287d	Revert r205599, the commit was not intended to have so many changes llvm-svn: 205600	2014-04-04 02:02:49 +00:00
Quentin Colombet	b4d3858ea5	[RegAllocGreedy][Last Chance Recoloring] Emit diagnostics when last chance recoloring cut-offs are hit. This is related to PR18747. Patch by MAYUR PANDEY <mayur.p@samsung.com> llvm-svn: 205599	2014-04-04 01:58:57 +00:00
Saleem Abdulrasool	c448b35de7	MIPS: remove vim swap file llvm-svn: 205595	2014-04-04 01:19:54 +00:00
Jim Grosbach	7c096ae6e1	Tidy up. Space before ':' in range-based for loops. llvm-svn: 205585	2014-04-03 23:43:26 +00:00
Jim Grosbach	0813a7bbeb	Tidy up. 80 columns. llvm-svn: 205584	2014-04-03 23:43:22 +00:00
Jim Grosbach	7c9088c82d	Tidy up. Trailing whitespace. llvm-svn: 205583	2014-04-03 23:43:18 +00:00
Jim Grosbach	a1594fa87a	Fix typo. llvm-svn: 205582	2014-04-03 23:43:12 +00:00
Eli Bendersky	dacc8f8c1e	Optimize away unnecessary address casts. Removes unnecessary casts from non-generic address spaces to the generic address space for certain code patterns. Patch by Jingyue Wu. llvm-svn: 205571	2014-04-03 21:18:25 +00:00
Lang Hames	9b407b8dd3	[ARM64] Teach the ARM64DeadRegisterDefinition pass to respect implicit-defs. When rematerializing through truncates, the coalescer may produce instructions with dead defs, but live implicit-defs of subregs: E.g. %X1<def,dead> = MOVi64imm 2, %W1<imp-def>; %X1:GPR64, %W1:GPR32 These instructions are live, and their definitions should not be rewritten. Fixes <rdar://problem/16492408> llvm-svn: 205565	2014-04-03 20:51:08 +00:00
Tom Stellard	981d7d5d7e	R600: Correct opcode for BFE_INT According to AMD documentation, the correct opcode for BFE_INT is 0x5, not 0x4 Fixes Arithm/Absdiff.Mat/3 OpenCV test Patch by: Bruno Jiménez llvm-svn: 205562	2014-04-03 20:19:29 +00:00
Tom Stellard	76577a21a1	R600/SI: Lower 64-bit immediates using REG_SEQUENCE llvm-svn: 205561	2014-04-03 20:19:27 +00:00
Tim Northover	daeb0caa3e	ARM: tell LLVM about zext properties of ldrexb/ldrexh Implementing this via ComputeMaskedBits has two advantages: + It actually works. DAGISel doesn't deal with the chains properly in the previous pattern-based solution, so they never trigger. + The information can be used in other DAG combines, as well as the trivial "get rid of truncs". For example if the trunc is in a different basic block. rdar://problem/16227836 llvm-svn: 205540	2014-04-03 15:10:35 +00:00
Daniel Sanders	a6bc64ce8a	[mips] Implement ehb, ssnop, and pause in assembler Summary: Add negative tests for pause Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3246 llvm-svn: 205537	2014-04-03 13:21:51 +00:00
Tim Northover	d8ab3a6d2f	ARM: skip cmpxchg failure barrier if ordering is monotonic. The terminal barrier of a cmpxchg expansion will be either Acquire or SequentiallyConsistent. In either case it can be skipped if the operation has Monotonic requirements on failure. rdar://problem/15996804 llvm-svn: 205535	2014-04-03 13:06:54 +00:00
Zoran Jovanovic	712fef1943	Implementation of 16-bit microMIPS instructions MFHI and MFLO. Differential Revision: http://llvm-reviews.chandlerc.com/D3141 llvm-svn: 205532	2014-04-03 12:47:34 +00:00


27965 Commits