llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Robert Lytton	ee42d27153	XCore target: implement exception handling llvm-svn: 194564	2013-11-13 10:19:31 +00:00
Reed Kotler	3d6497041f	Allow the code which returns the length for inline assembler to know specifically about the .space directive. This allows us to force large blocks of code to appear in test cases for things like constant islands without having to make giant test cases to force things like long branches to take effect. llvm-svn: 194555	2013-11-13 04:37:52 +00:00
Andrew Trick	8a9e174bba	Add a test case to verify that misusing anyregcc crashes as expected. llvm-svn: 194553	2013-11-13 03:46:19 +00:00
Matt Arsenault	9c10e82e9e	R600: Fix selection failure on EXTLOAD llvm-svn: 194547	2013-11-13 02:39:07 +00:00
Juergen Ributzka	b47be624ea	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. This patch reapplies r193676 with an additional fix for the Hexagon backend. The SystemZ backend has already been fixed by r194148. The Type Legalizer recognizes that VSELECT needs to be split, because the type is too wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask type for the given target. Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to successfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. Reviewed by Nadav llvm-svn: 194542	2013-11-13 01:57:54 +00:00
Andrew Trick	12470267da	Cleanup the stackmap operand folding code and fix a corner case. I still don't know how to refer to the fixed operands symbolically. I plan to look into it. llvm-svn: 194529	2013-11-12 22:58:39 +00:00
Akira Hatanaka	99c10a8e6d	[mips] Fix a bug in function CC_MipsO32_FP64. The second double precision argument was not being passed in $f14. llvm-svn: 194522	2013-11-12 22:16:18 +00:00
Akira Hatanaka	eb13575b41	[mips] Run test case with command line option -mattr=+fp64. llvm-svn: 194519	2013-11-12 22:06:45 +00:00
Akira Hatanaka	2df5406920	[mips] Fix and re-enable a test case that has been disabled for a long time. llvm-svn: 194510	2013-11-12 21:03:57 +00:00
Andrew Trick	56e6608cf0	Simplify operand folding when rematerializing a load. We already know how to fold a reload from a frameindex without analyzing the load instruction. Generalize this to handle any frameindex load. This streamlines the logic for rematerializing loads from stack arguments. As a side effect, it allows stackmaps to record a stack argument location without spilling it. Verified no effect on codegen for llvm test-suite. llvm-svn: 194497	2013-11-12 18:06:12 +00:00
Daniel Sanders	7fd9efa092	[mips][msa] Enable inline assembly for MSA. Like GCC, this re-uses the 'f' constraint and a new 'w' print-modifier: asm ("ldi.w %w0, 1" : "=f"(result)); Unlike GCC, the 'w' print-modifier is not _required_ to produce the intended output. This is a consequence of differences in the internal handling of the registers in each compiler. To be source-compatible between the compilers, users must use the 'w' print-modifier. MSA registers (including control registers) are supported in clobber lists. llvm-svn: 194476	2013-11-12 12:56:01 +00:00
Daniel Sanders	8932f5bd6d	[mips][msa] Added support for matching bclr, and bclri from normal IR (i.e. not intrinsics) llvm-svn: 194471	2013-11-12 10:45:18 +00:00
Bradley Smith	d0276d3c63	[ARM] Add support for FP_HP_extension build attribute llvm-svn: 194470	2013-11-12 10:38:05 +00:00
Daniel Sanders	464a7ad75b	[mips][msa] Added support for matching bset, bseti, bneg, and bnegi from normal IR (i.e. not intrinsics) llvm-svn: 194469	2013-11-12 10:31:49 +00:00
Daniel Sanders	dc0cf6755d	[mips][msa] Change constant used in ori tests to avoid conflict with bseti (also xori to avoid bnegi) Upcoming commit(s) are going to add support for bseti and bnegi. This would cause some existing tests to (correctly) change behaviour and emit a different instruction. This patch prevents this by changing the constant used in ori and xori tests so that they will not be matchable by the bseti and bnegi patterns when these instructions are matchable from normal IR. llvm-svn: 194467	2013-11-12 10:14:18 +00:00
Robert Lytton	3962d1cdf0	XCore target: fix bug in aligning 'byval i8*' on the stack llvm-svn: 194466	2013-11-12 10:11:35 +00:00
Robert Lytton	7bf99af88b	XCore target test for hidden declaration llvm-svn: 194465	2013-11-12 10:11:30 +00:00
Robert Lytton	584459d7ea	Add XCore support for ATOMIC_FENCE. ATOMIC_FENCE is lowered to a compiler barrier which is codegen only. There is no need to emit any instructions since the XCore provides sequential consistency. Original patch by Richard Osborne llvm-svn: 194464	2013-11-12 10:11:26 +00:00
Robert Lytton	d18548882b	XCore target: return error for unsupported alignment llvm-svn: 194463	2013-11-12 10:11:05 +00:00
Matt Arsenault	70be5dff43	R600/SI: Change formatting of printed registers. Print the range of registers used with a single letter prefix. This better matches what the shader compiler produces and is overall less obnoxious than concatenating all of the subregister names together. Instead of SGPR0, it will print s0. Instead of SGPR0_SGPR1, it will print s[0:1] and so on. There doesn't appear to be a straightforward way to get the actual register info in the InstPrinter, so this parses the generated name to print with the new syntax. The required test changes are pretty nasty, and register matching regexes are now worse. Since there isn't a way to add to a variable in FileCheck, some of the tests now don't check the exact number of registers used, but I don't think that will be a real problem. llvm-svn: 194443	2013-11-12 02:35:51 +00:00
Reed Kotler	c6c2273def	Change the default branch instruction to be the 16 bit variety for mips16. This has no material effect at this time since we don't have a direct object emitter for mips16 and the assembler can't tell them apart. I place a comment "16 bit inst" for those so that I can tell them apart in the output. The constant island pass has only been minimally changed to allow this. More complete branch work is forthcoming but this is the first step. llvm-svn: 194442	2013-11-12 02:27:12 +00:00
Matt Arsenault	7970dec395	R600/SI: Add test that fails due to requiring i64 mul for pointers llvm-svn: 194433	2013-11-11 23:31:02 +00:00
Andrew Trick	9a4f1fc067	Fix the recently added anyregcc convention to handle spilled operands. Fixes <rdar://15432754> [JS] Assertion: "Folded a def to a non-store!" The primary purpose of anyregcc is to prevent a patchpoint's call arguments and return value from being spilled. They must be available in a register, although the calling convention does not pin the register. It's up to the front end to avoid using this convention for calls with more arguments than allocatable registers. llvm-svn: 194428	2013-11-11 22:40:25 +00:00
Vincent Lejeune	54d9c8726b	R600: Use function inputs to represent data stored in gpr llvm-svn: 194425	2013-11-11 22:10:24 +00:00
Akira Hatanaka	3e34a7bec2	[mips] Partially revert r193641. Stack alignment should not be determined by the floating point register mode. llvm-svn: 194423	2013-11-11 21:49:03 +00:00
Justin Holewinski	a0efbf3a7e	[NVPTX] Properly handle bitcast ConstantExpr when checking for the alignment of function parameters llvm-svn: 194410	2013-11-11 19:28:19 +00:00
Justin Holewinski	0d1f2863f9	[NVPTX] Fix logic error in loading vector parameters of more than 4 components llvm-svn: 194409	2013-11-11 19:28:16 +00:00
Chad Rosier	8d7ebe36dd	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194406	2013-11-11 19:11:11 +00:00
Chad Rosier	4848250116	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194394	2013-11-11 18:04:07 +00:00
Daniel Sanders	a3d78a0bb1	Vector forms of SHL, SRA, and SRL can be constant folded using SimplifyVBinOp too Reviewers: dsanders Reviewed By: dsanders CC: llvm-commits, nadav Differential Revision: http://llvm-reviews.chandlerc.com/D1958 llvm-svn: 194393	2013-11-11 17:23:41 +00:00
Matheus Almeida	568c6ffeab	[mips][msa] CHECK-DAG-ize MSA 3r-a.ll test. No functional changes. llvm-svn: 194391	2013-11-11 16:46:20 +00:00
Matheus Almeida	c62765e970	[mips][msa] CHECK-DAG-ize MSA 2rf_int_float.ll test. No functional changes. llvm-svn: 194390	2013-11-11 16:38:55 +00:00
Matheus Almeida	a747f4d24f	[mips][msa] CHECK-DAG-ize MSA 2rf_float_int.ll test. No functional changes. llvm-svn: 194389	2013-11-11 16:31:46 +00:00
Matheus Almeida	c1afcbf128	[mips][msa] CHECK-DAG-ize MSA 2rf.ll test. No functional changes. llvm-svn: 194387	2013-11-11 16:24:53 +00:00
Matheus Almeida	7ff082f91c	[mips][msa] CHECK-DAG-ize MSA 2r.ll test. No functional changes. llvm-svn: 194386	2013-11-11 16:16:53 +00:00
Hal Finkel	2d9d341e70	Add PPC option for full register names in asm On non-Darwin PPC systems, we currently strip off the register name prefix prior to instruction printing. So instead of something like this: mr r3, r4 we print this: mr 3, 4 The first form is the default on Darwin, and is understood by binutils, but not yet understood by our integrated assembler. Once our integrated-as understands full register names as well, this temporary option will be replaced by tying this functionality to the verbose-asm option. The numeric-only form is compatible with legacy assemblers and tools, and is also gcc's default on most PPC systems. On the other hand, it is harder to read, and there are some analysis tools that expect full register names. llvm-svn: 194384	2013-11-11 14:58:40 +00:00
Reed Kotler	0e6ffc6bfa	Mostly finish up constant islands port for Mips for load constants. Still need to finish the branch part. Still lots more review of the code, clean up and testing. llvm-svn: 194337	2013-11-10 00:09:26 +00:00
Akira Hatanaka	601f7aebe8	[mips] Make sure there is a chain edge dependency between loads that read formal arguments on the stack and stores created afterwards. We need this to ensure tail call optimized function calls do not write over the argument area of the stack before it is read out. llvm-svn: 194309	2013-11-09 02:38:51 +00:00
Juergen Ributzka	a748d55906	[Stackmap] Materialize the jump address within the patchpoint noop slide. This patch moves the jump address materialization inside the noop slide. This enables patching of the materialization itself or its complete removal. This patch also adds the ability to define scratch registers that can be used safely by the code called from the patchpoint intrinsic. At least one scratch register is required, because that one is used for the materialization of the jump address. This patch depends on D2009. Differential Revision: http://llvm-reviews.chandlerc.com/D2074 Reviewed by Andy llvm-svn: 194306	2013-11-09 01:51:33 +00:00
Juergen Ributzka	f27436b708	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a particular order into a specified set of registers. Instead it is up to the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Quentin Colombet	6833715219	[VirtRegMap] Fix for PR17825. Do not ignore noreturn definitions when setting isPhysRegUsed if the unwind information is required. Indeed, the runtime may need a correct stack to be able to unwind the call. llvm-svn: 194271	2013-11-08 18:14:17 +00:00
Tim Northover	e68673eeb6	ARM: fold prologue/epilogue sp updates into push/pop for code size ARM prologues usually look like: push {r7, lr} sub sp, sp, #4 If code size is extremely important, this can be optimised to the single instruction: push {r6, r7, lr} where we don't actually care about the contents of r6, but pushing it subtracts 4 from sp as a side effect. This should implement such a conversion, predicated on the "minsize" function attribute (-Oz) since I've yet to find any code it actually makes faster. llvm-svn: 194264	2013-11-08 17:18:07 +00:00
Vincent Lejeune	5f1f106136	R600: Fix LowerUDIVREM llvm-svn: 194153	2013-11-06 17:36:04 +00:00
Jiangning Liu	9c0eb8e7ba	Implement AArch64 Neon instruction set Perm. llvm-svn: 194123	2013-11-06 03:35:27 +00:00
Jiangning Liu	1cdd311f06	Implement AArch64 Neon instruction set Bitwise Extract. llvm-svn: 194118	2013-11-06 02:25:49 +00:00
Andrew Trick	5a2a400cf1	Slightly change the way stackmap and patchpoint intrinsics are lowered. MorphNodeTo is not safe to call during DAG building. It eagerly deletes dependent DAG nodes which invalidates the NodeMap. We could expose a safe interface for morphing nodes, but I don't think it's worth it. Just create a new MachineNode and replaceAllUsesWith. My understanding of the SD design has been that we want to support early target opcode selection. That isn't very well supported, but generally works. It seems reasonable to rely on this feature even if it isn't widely used. llvm-svn: 194102	2013-11-05 22:44:04 +00:00
Jiangning Liu	59b8117b0b	Implement AArch64 Neon Crypto instruction classes AES, SHA, and 3 SHA. llvm-svn: 194085	2013-11-05 17:42:05 +00:00
Reed Kotler	787735b38c	Fix r194019 as requested by Eric Christopher. Submit the basic port of the rest of ARM constant islands code to Mips. Two test cases are added which reflect the next level of functionality: constants getting moved to water areas that are out of range from the initial placement at the end of the function and basic blocks being split to create water when none exists that can be used. There is a bunch of this code that is not complete and has been marked with IN_PROGRESS. I will finish cleaning this all up during the next week or two and submit the rest of the test cases. I have eliminated some code for dealing with inline assembly because to me it unnecessarily complicates things and some of the newer features of llvm like function attributes and builtin assembler give me better tools to solve the alignment issues created there. Also, for Mips16 I even have the option of not doing constant islands in the presence of inline assembler if I choose. When everything has been completed I will summarize the port and notify people that are knowledgeable regarding the ARM Constant Islands code so they can review it in its entirety if they wish. llvm-svn: 194053	2013-11-05 08:14:14 +00:00
Hao Liu	386d8dd5a6	Implement AArch64 post-index vector load/store multiple N-element structure class SIMD(lselem-post). Including following 14 instructions: 4 ld1 insts: post-index load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: post-index load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: post-index store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: post-index store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 194043	2013-11-05 03:39:32 +00:00
Kevin Qin	63fa5c1ef6	Implemented aarch64 neon intrinsic vcopy_lane with float type. llvm-svn: 194041	2013-11-05 02:03:59 +00:00
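
The R600/SI entry above (70be5dff43) describes printing registers by parsing the generated name: SGPR0 becomes s0 and SGPR0_SGPR1 becomes s[0:1]. A minimal Python sketch of that renaming follows. It is an illustration only, not the InstPrinter code from the commit; the function name format_register, the regular expression, and the rule that the prefix is the lowercased first letter of any "<letter>GPR<n>" name are assumptions made for this example.

```python
import re

# Hypothetical pattern: one uppercase letter, "GPR", then a register index.
# Only the SGPR examples come from the commit message; other letters are assumed.
_REG = re.compile(r"^([A-Z])GPR(\d+)$")


def format_register(name: str) -> str:
    """Rewrite a concatenated subregister name into the short range form."""
    parts = name.split("_")
    matches = [_REG.match(part) for part in parts]
    if not all(matches):
        return name  # not a recognised register tuple; leave it unchanged
    prefixes = {m.group(1) for m in matches}
    if len(prefixes) != 1:
        return name  # mixed register classes; leave it unchanged
    prefix = prefixes.pop().lower()
    first = int(matches[0].group(2))
    last = int(matches[-1].group(2))
    if len(parts) == 1:
        return f"{prefix}{first}"
    return f"{prefix}[{first}:{last}]"


print(format_register("SGPR0"))
print(format_register("SGPR0_SGPR1"))
```

The sketch assumes the subregisters in a tuple are listed in ascending order, so only the first and last indices are read; names that do not match the pattern are returned unchanged.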

8540 Commits