llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Job Noorman	49c1205681	Make sure SP is always aligned on a 2 byte boundary llvm-svn: 193320	2013-10-24 09:32:31 +00:00
Amara Emerson	de52a239bd	[AArch64] Fix NZCV reg live-in bug in F128CSEL codegen. When generating the IfTrue basic block during the F128CSEL pseudo-instruction handling, the NZCV live-in for the newly created BB wasn't being added. This caused a fault during MI-sched/live range calculation when the predecessor for the fall-through BB didn't have a live-in for phys-reg as expected. llvm-svn: 193316	2013-10-24 08:28:24 +00:00
Elena Demikhovsky	da06b9b278	AVX-512: added VCVTPH2PS, VCVTPS2PH with intrinsics llvm-svn: 193312	2013-10-24 07:16:35 +00:00
Yaron Keren	3fb42fb8b5	(this is a corrected patch) Calling _chkstk is required on ELF as well as COFF on Windows. Without _chkstk, functions requiring large stack crash in initialization code. Previous code tested for COFF format but not Mach-O and this patch modifies the code to test for Windows OS (both Windows target and MingW target) but not Mach-O object format: Looks like macho environment was used to build some EFI code. Credits to Andrew MacPherson. llvm-svn: 193289	2013-10-23 23:37:01 +00:00
Rafael Espindola	b6d34eea66	Revert "Calling _chkstk is required on ELF as well as COFF on Windows. Without _chkstk functions requiring large stack crash in initialization code. Previous code tested for COFF format but not Mach-O and this patch modifies the code to test for Windows." This reverts commit r193263. It is causing CodeGen/X86/mingw-alloca.ll to fail. llvm-svn: 193275	2013-10-23 21:45:09 +00:00
Benjamin Kramer	701e41bb58	X86: Custom lower sext v16i8 to v16i16, and the corresponding truncate. Also update the cost model. llvm-svn: 193270	2013-10-23 21:06:07 +00:00
Yaron Keren	56f5c84f6c	Calling _chkstk is required on ELF as well as COFF on Windows. Without _chkstk functions requiring large stack crash in initialization code. Previous code tested for COFF format but not Mach-O and this patch modifies the code to test for Windows. Credits to Andrew MacPherson. llvm-svn: 193263	2013-10-23 19:40:07 +00:00
Benjamin Kramer	8ed652c269	X86: Custom lower zext v16i8 to v16i16. On sandy bridge (PR17654) we now get vpxor %xmm1, %xmm1, %xmm1 vpunpckhbw %xmm1, %xmm0, %xmm2 vpunpcklbw %xmm1, %xmm0, %xmm0 vinsertf128 $1, %xmm2, %ymm0, %ymm0 On haswell it's a simple vpmovzxbw %xmm0, %ymm0 There is a maze of duplicated and dead transforms and patterns in this area. Remove the dead custom lowering of zext v8i16 to v8i32, that's already handled by LowerAVXExtend. llvm-svn: 193262	2013-10-23 19:19:04 +00:00
Michael Liao	3b38b22386	Fix PR17631 - Skip instructions added in prolog. For specific targets, prolog may insert helper function calls (e.g. _chkstk will be called when there're more than 4K bytes allocated on stack). However, these helpers don't use/def YMM/XMM registers. llvm-svn: 193261	2013-10-23 18:32:43 +00:00
Jim Grosbach	03a64fa7b7	X86: Make concat_vectors combine a bit more conservative. Per Nadav's review comments for r192866. llvm-svn: 193252	2013-10-23 17:37:40 +00:00
Zoran Jovanovic	9e76cf1f6a	Support for microMIPS relocations 1. llvm-svn: 193247	2013-10-23 16:14:44 +00:00
Matheus Almeida	5cc3614e90	[mips][msa] Direct Object Emission support for the LSA instruction. llvm-svn: 193240	2013-10-23 13:20:07 +00:00
Daniel Sanders	9918652e43	[mips][msa] Added support for matching fexp2 from normal IR (i.e. not intrinsics) llvm-svn: 193239	2013-10-23 10:36:52 +00:00
Artyom Skrobov	2ef42e5c31	Make ARM hint ranges consistent, and add tests for these ranges llvm-svn: 193238	2013-10-23 10:14:40 +00:00
Tom Stellard	3d22dc6eef	R600/SI: Replace ffs(x) - 1 with countTrailingZeros(x) ffs(x) broke the mingw buildbot. llvm-svn: 193225	2013-10-23 03:50:25 +00:00
Tom Stellard	7df5f52e81	R600/SI: fix MIMG writemask adjustement This fixes piglit: - shaders/glsl-fs-texture2d-masked - shaders/glsl-fs-texture2d-masked-4 Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 193222	2013-10-23 02:53:47 +00:00
Tom Stellard	2b6ff7e802	R600: Fix handling of vector kernel arguments The SelectionDAGBuilder was promoting vector kernel arguments to legal types, but this won't work for R600 and SI since kernel arguments are stored in memory and can't be promoted. In order to handle vector arguments correctly we need to look at the original types from the LLVM IR function. llvm-svn: 193215	2013-10-23 00:44:32 +00:00
Tom Stellard	a1dbb396e5	R600/SI: Add support for i64 bitwise or llvm-svn: 193213	2013-10-23 00:44:19 +00:00
Tom Stellard	914bfa633e	R600/SI: Use S_LOAD_DWORD instructions for v8i32 and v16i32 llvm-svn: 193212	2013-10-23 00:44:12 +00:00
Quentin Colombet	c5a6c85c4f	[X86][FastISel] Add a comment to help understanding changes made in r192636. <rdar://problem/15192473> llvm-svn: 193199	2013-10-22 21:29:08 +00:00
Matt Arsenault	c02348a968	R600/SI: Don't assert on SCC usage llvm-svn: 193198	2013-10-22 21:11:31 +00:00
Tim Northover	e4acb9a5a2	ARM: provide diagnostics on more writeback LDM/STM instructions The set of circumstances where the writeback register is allowed to be in the list of registers is rather baroque, but I think this implements them all on the assembly parsing side. For disassembly, we still warn about an ARM-mode LDM even if the architecture revision is < v7 (the required architecture information isn't available). It's a silly instruction anyway, so hopefully no-one will mind. rdar://problem/15223374 llvm-svn: 193185	2013-10-22 19:00:39 +00:00
Tom Stellard	def55e3397	R600/SI: Use llvm_unreachable() for an always false assert llvm-svn: 193183	2013-10-22 18:42:03 +00:00
Tom Stellard	e692b77f62	R600/SI: Fix warning on non-asserts build llvm-svn: 193180	2013-10-22 18:31:45 +00:00
Tom Stellard	5908e906e2	R600: Simplify handling of private address space The AMDGPUIndirectAddressing pass was previously responsible for lowering private loads and stores to indirect addressing instructions. However, this pass was buggy and way too complicated. The only advantage it had over the new simplified code was that it saved one instruction per direct write to private memory. This optimization likely has a minimal impact on performance, and we may be able to duplicate it using some other transformation. For the private address space, we now: 1. Lower private loads/store to Register(Load\|Store) instructions 2. Reserve part of the register file as 'private memory' 3. After regalloc lower the Register(Load\|Store) instructions to MOV instructions that use indirect addressing. llvm-svn: 193179	2013-10-22 18:19:10 +00:00
Tom Stellard	4b021afc5e	R600: Remove unused InstrInfo::getMovImmInstr() function llvm-svn: 193178	2013-10-22 18:19:01 +00:00
Matheus Almeida	06be00bac8	[mips][msa] Direct Object Emission support for conditional branches. These branches have a 16-bit offset (R_MIPS_PC16). List of conditional branch instructions: bnz.{b,h,w,d} bnz.v bz.{b,h,w,d} bz.v llvm-svn: 193157	2013-10-22 09:43:32 +00:00
Elena Demikhovsky	3136868b1d	AVX-512: aligned / unaligned load and store for 512-bit integer vectors. llvm-svn: 193156	2013-10-22 09:19:28 +00:00
Craig Topper	8b2b2a7210	Replace (V)MOVZDI2PDIrr/rm instructions with patterns that select (V)MOVDI2PDIrr/rm. llvm-svn: 193146	2013-10-22 04:35:20 +00:00
Jim Grosbach	4ccaf97952	ARM: Thumb2 copy for GPRPair needs to use thumb instructions. Use tMOVr instead of plain MOVr. rdar://15193017 llvm-svn: 193139	2013-10-22 02:29:37 +00:00
Jim Grosbach	fb572e0475	ARM: Clean up copyPhysReg() a bit. No functional change, just cleaning things up for readability. llvm-svn: 193138	2013-10-22 02:29:35 +00:00
Chad Rosier	838b6065b8	[AArch64] Add the constraint to NEON scalar mla/mls instructions. llvm-svn: 193117	2013-10-21 20:11:47 +00:00
Lang Hames	df2443e32e	X86 vector element shift-by-immediate instructions take i8 immediates. Make the instruction defenitions and ISEL reflect this. Prior to this patch these instructions took an i32i8imm, and the high bits were dropped during encoding. This led to incorrect behavior for shifts by immediates higher than 255. This patch fixes that issue by detecting large immediate shifts and returning constant zero (for logical shifts) or capping the shift amount at an encodable value (for arithmetic shifts). Fixes <rdar://problem/14968098> llvm-svn: 193096	2013-10-21 17:51:24 +00:00
Elena Demikhovsky	dceb9534bf	AVX-512: MUL operation lowering for v8i64 llvm-svn: 193083	2013-10-21 13:27:34 +00:00
Matheus Almeida	8cdecf1482	[mips][msa] Direct Object Emission support for LD/ST instructions. llvm-svn: 193082	2013-10-21 13:07:13 +00:00
Matheus Almeida	49a9f349f1	[mips][msa] Direct Object Emission support for LDI instructions. llvm-svn: 193081	2013-10-21 12:56:20 +00:00
Matheus Almeida	c0cc50e68b	[mips][msa] Direct Object Emission support for MOVE.v. llvm-svn: 193080	2013-10-21 12:43:54 +00:00
Matheus Almeida	e54e855dbb	[mips][msa] Direct Object Emission support for CTCMSA and CFCMSA. These instructions are logically related as they allow read/write of MSA control registers. Currently MSA control registers are emitted by number but hopefully that will change as soon as GAS starts accepting them by name as that would make the assembly easier to read. llvm-svn: 193078	2013-10-21 12:26:50 +00:00
Matheus Almeida	1fede43958	[mips][msa] Direct Object Emission of SPLAT instruction. llvm-svn: 193077	2013-10-21 12:07:26 +00:00
Matheus Almeida	1760b2c642	[mips][msa] Fix definition of SLD instruction. The second parameter of the SLD intrinsic is the number of columns (GPR) to slide left the source array. llvm-svn: 193076	2013-10-21 11:47:56 +00:00
Nadav Rotem	fd357159bc	Mark some command line flags as hidden llvm-svn: 193013	2013-10-18 23:38:13 +00:00
Hans Wennborg	c1a311233c	MC asm parser: allow ?'s in symbol names, and handle @'s in names in MS asm This is another (final?) stab at making us able to parse our own asm output on Windows. Symbols on Windows often contain @'s and ?'s in their names. Our asm parser didn't like this. ?'s were not allowed, and @'s were intepreted as trying to reference PLT/GOT/etc. We can't just add quotes around the bad names, since e.g. for MinGW, we use gas to assemble, and it doesn't like quotes in some places (notably in .def directives). This commit makes us allow ?'s in symbol names, and @'s in symbol names for MS assembly. Differential Revision: http://llvm-reviews.chandlerc.com/D1978 llvm-svn: 193000	2013-10-18 20:46:28 +00:00
Richard Barton	4f1967c83f	Pure refactoring change. Patch by Artyom Skrobov. llvm-svn: 192977	2013-10-18 14:41:50 +00:00
Benjamin Kramer	5a7ac7cb25	R600: Remove \ at EOL from ascii art comments. Completely harmless, but GCC likes to warn about it even when the next line is a comment. llvm-svn: 192974	2013-10-18 14:12:50 +00:00
Richard Barton	cb6c32ac32	Add hint disassembly syntax for 16-bit Thumb hint instructions. Patch by Artyom Skrobov llvm-svn: 192972	2013-10-18 14:09:49 +00:00
Chad Rosier	163fdd3e73	[AArch64] Add support for NEON scalar extract narrow instructions. llvm-svn: 192970	2013-10-18 14:03:24 +00:00
Silviu Baranga	0e6bbf1a98	Add hardware division as a default feature on Cortex-A15. Also add test cases to check this, and change diagnostics for the hwdiv-arm feature to something useful. llvm-svn: 192963	2013-10-18 10:18:40 +00:00
Hans Wennborg	33576424a9	Revert "Re-commit r192758 - MC: quote tricky symbol names in asm output" This caused the clang-native-mingw32-win7 buildbot to break. The assembler was complaining about the following lines that were showing up in the asm for CrashRecoveryContext.cpp: movl $"__ZL16ExceptionHandlerP19_EXCEPTION_POINTERS@4", 4(%eax) calll "_AddVectoredExceptionHandler@8" .def "__ZL16ExceptionHandlerP19_EXCEPTION_POINTERS@4"; "__ZL16ExceptionHandlerP19_EXCEPTION_POINTERS@4": calll "_RemoveVectoredExceptionHandler@4" Reverting for now. llvm-svn: 192940	2013-10-18 02:14:40 +00:00
David Peixotto	839f0f98a5	17309 ARM backend incorrectly lowers COPY_STRUCT_BYVAL_I32 for thumb1 targets This commit implements the correct lowering of the COPY_STRUCT_BYVAL_I32 pseudo-instruction for thumb1 targets. Previously, the lowering of COPY_STRUCT_BYVAL_I32 generated the post-increment forms of ldr/ldrh/ldrb instructions. Thumb1 does not have the post-increment form of these instructions so the generated assembly contained invalid instructions. Passing the generated assembly to gcc caused it to complain with an error like this: Error: cannot honor width suffix -- `ldrb r3,[r0],#1' and the integrated assembler would generate an object file with an invalid instruction encoding. This commit contains a small test case that demonstrates the problem with thumb1 targets as well as an expanded test case that more throughly tests the lowering of byval struct passing for arm, thumb1, and thumb2 targets. llvm-svn: 192916	2013-10-17 19:52:05 +00:00
David Peixotto	22f270719b	Refactor lowering for COPY_STRUCT_BYVAL_I32 This commit refactors the lowering of the COPY_STRUCT_BYVAL_I32 pseudo-instruction in the ARM backend. We introduce a new helper class that encapsulates all of the operations needed during the lowering. The operations are implemented for each subtarget in different subclasses. Currently only arm and thumb2 subtargets are supported. This refactoring was done to easily implement support for thumb1 subtargets. This initial patch does not add support for thumb1, but is only a refactoring. A follow on patch will implement the support for thumb1 subtargets. No intended functionality change. llvm-svn: 192915	2013-10-17 19:49:22 +00:00

1 2 3 4 5 ...

25961 Commits