llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 23:42:52 +01:00

Author	SHA1	Message	Date
Tilmann Scheller	14c2ce0a1e	ARM: Add GPR register class excluding LR for use with the ADR instruction. This improves code generation for jump tables by avoiding the emission of "mov pc, lr" which could fool the processor into believing this is a return from a function causing mispredicts. The code generation logic for jump tables uses ADR to materialize the address of the jump target. Patch by Daniel Stewart! llvm-svn: 190043	2013-09-05 11:10:31 +00:00
Richard Sandiford	399318ba38	[SystemZ] Add NC, OC and XC For now these are just used to handle scalar ANDs, ORs and XORs in which all operands are memory. llvm-svn: 190041	2013-09-05 10:36:45 +00:00
Venkatraman Govindaraju	b3ea970660	[Sparc] Correctly handle call to functions with ReturnsTwice attribute. In sparc, setjmp stores only the registers %fp, %sp, %i7 and %o7. longjmp restores the stack, and the callee-saved registers (all local/in registers: %i0-%i7, %l0-%l7) using the stored %fp and register windows. However, this does not guarantee that the longjmp will restore the registers, as they were when the setjmp was called. This is because these registers may be clobbered after returning from setjmp, but before calling longjmp. This patch prevents the registers %i0-%i5, %l0-l7 to live across the setjmp call using the register mask. llvm-svn: 190033	2013-09-05 05:32:16 +00:00
Andrew Trick	330821bce0	mi-sched: Force bottom up scheduling for generic targets. Fast register pressure tracking currently only takes effect during bottom up scheduling. Forcing this is a bit faster and simpler for targets that don't have many scheduling constraints and don't need top-down scheduling. llvm-svn: 190014	2013-09-04 23:54:00 +00:00
Eric Christopher	fd11e8a82d	Expand and rewrite comment. llvm-svn: 189998	2013-09-04 21:23:23 +00:00
Arnold Schwaighofer	22610acac7	Change swift/vldm test case to be less dependent on allocation order 'Force' values in registers using the calling convention. Now, we only depend on the calling convention and that the allocator performs copy coalescing. llvm-svn: 189985	2013-09-04 20:51:06 +00:00
Vincent Lejeune	4fd20e35e6	R600: Use shared op optimization when checking cycle compatibility llvm-svn: 189981	2013-09-04 19:53:54 +00:00
Vincent Lejeune	4a8c23c168	R600: Non vector only instruction can be scheduled on trans unit llvm-svn: 189980	2013-09-04 19:53:46 +00:00
Vincent Lejeune	95def9718e	R600: Remove fmul.v4f32.ll test which is redundant with fmul.ll llvm-svn: 189978	2013-09-04 19:53:22 +00:00
Arnold Schwaighofer	ac9a5042d8	Swift: Only build vldm/vstm with q register aligned register lists Unaligned vldm/vstm need more uops and therefore are slower in general on swift. radar://14522102 llvm-svn: 189961	2013-09-04 17:41:16 +00:00
Silviu Baranga	f1ba2ead74	Fix scheduling for vldm/vstm instructions that load/store more than 32 bytes on Cortex-A9. This also makes the existing code more compact. llvm-svn: 189958	2013-09-04 17:05:18 +00:00
Venkatraman Govindaraju	30d6d6f6c9	[Sparc] Fix an assertion failure while lowering fcmp on long double. This assertion is triggered because an integer constant is created with wrong type. llvm-svn: 189948	2013-09-04 15:15:20 +00:00
Hao Liu	b344ca7aa3	Inplement aarch64 neon instructions in AdvSIMD(shift). About 24 shift instructions: sshr,ushr,ssra,usra,srshr,urshr,srsra,ursra,sri,shl,sli,sqshlu,sqshl,uqshl,shrn,sqrshrun,sqshrn,uqshr,sqrshrn,uqrshrn,sshll,ushll and 4 convert instructions: scvtf,ucvtf,fcvtzs,fcvtzu llvm-svn: 189925	2013-09-04 09:28:24 +00:00
Jim Grosbach	4f219b5c7e	Revert "Revert "ARM: Improve pattern for isel mul of vector by scalar."" This reverts commit r189648. Fixes for the previously failing clang-side arm_neon_intrinsics test cases will be checked in separately. llvm-svn: 189841	2013-09-03 20:08:17 +00:00
Richard Sandiford	2543e2b36c	[SystemZ] Add support for TMHH, TMHL, TMLH and TMLL For now this just handles simple comparisons of an ANDed value with zero. The CC value provides enough information to do any comparison for a 2-bit mask, and some nonzero comparisons with more populated masks, but that's all future work. llvm-svn: 189819	2013-09-03 15:38:35 +00:00
Venkatraman Govindaraju	ab95772300	[Sparc] Add support for soft long double (fp128). llvm-svn: 189780	2013-09-03 04:11:59 +00:00
Venkatraman Govindaraju	eaf96576da	[Sparc] Implement spill and load for long double(f128) registers. llvm-svn: 189768	2013-09-02 18:32:45 +00:00
Tilmann Scheller	e8598ae406	ARM: Default to the Swift CPU when targeting armv7s/thumbv7s. Test cases adjusted accordingly. This fixes rdar://14871821. llvm-svn: 189766	2013-09-02 17:09:01 +00:00
Tilmann Scheller	9205705478	Revert 189756 for now, it doesn't match what rdar://14871821 really wants. What we really want is to enable Swift by default for *v7s triples (and there already seems to be some logic which attempts to do that). In that case the iOS version doesn't matter. llvm-svn: 189763	2013-09-02 15:48:17 +00:00
Tilmann Scheller	9e0d3ff678	ARM: Default to Swift when compiling for iOS 6 or later. Test cases adjusted accordingly. This fixes rdar://14871821. llvm-svn: 189756	2013-09-02 12:01:58 +00:00
NAKAMURA Takumi	26f34484dd	FileCheck-ize three tests of llvm/test/CodeGen/X86/h-register(s). llvm-svn: 189755	2013-09-02 12:00:53 +00:00
NAKAMURA Takumi	6f2ef5f35a	llvm/test/CodeGen/X86: Update tests with -mattr=-bmi not to take BMI, corresponding to Craig's r189742. AMD Piledriver builder detected failures. llvm-svn: 189754	2013-09-02 12:00:46 +00:00
Craig Topper	6009a9c268	Create BEXTR instructions for (and ((sra or srl) x, imm), (2**size - 1)). Fixes PR17028. llvm-svn: 189742	2013-09-02 07:53:17 +00:00
Elena Demikhovsky	49a9b5e2c9	AVX-512: gather-scatter tests; added foldable instructions; Specify GATHER/SCATTER as heavy instructions. llvm-svn: 189736	2013-09-02 07:12:29 +00:00
Elena Demikhovsky	04a636836e	AVX-512: Added GATHER and SCATTER instructions. llvm-svn: 189729	2013-09-01 14:24:41 +00:00
Reed Kotler	6c6fac6244	Make sure we don't generate stubs for any of these functions because they don't exist in libc. This is really not the right way to solve this problem; but it's not clear to me at this time exactly what is the right way. If we create stubs here, they will cause link errors because these functions do not exist in libc. llvm-svn: 189727	2013-09-01 04:12:59 +00:00
Bill Schmidt	65bf01a470	[PowerPC] Call support for fast-isel. This patch adds fast-isel support for calls (but not intrinsic calls or varargs calls). It also removes a badly-formed assert. There are some new tests just for calls, and also for folding loads into arguments on calls to avoid extra extends. llvm-svn: 189701	2013-08-30 22:18:55 +00:00
Reed Kotler	3c79328838	Fix a problem with dual mips16/mips32 mode. When the underlying processor has hard float, when you compile the mips32 code you have to make sure that it knows to compile any mips32 routines as hard float. I need to clean up the way mips16 hard float is specified but I need to first think through all the details. Mips16 always has a form of soft float, the difference being whether the underlying hardware has floating point. So it's not really necessary to pass the -soft-float to llvm since soft-float is always true for mips16 by virtue of the fact that it will not register floating point registers. By using this fact, I can simplify the way this is all handled. llvm-svn: 189690	2013-08-30 19:40:56 +00:00
Bill Schmidt	886231ba0f	[PowerPC] Add handling for conversions to fast-isel. Yet another chunk of fast-isel code. This one handles various conversions involving floating-point. (It also includes some miscellaneous handling throughout the back end for LWA_32 and LWAX_32 that should have been part of the load-store patch.) llvm-svn: 189677	2013-08-30 15:18:11 +00:00
Craig Topper	dad5a27c09	Teach X86 backend to create BMI2 BZHI instructions from (and X, (add (shl 1, Y), -1)). Fixes PR17038. llvm-svn: 189653	2013-08-30 06:52:21 +00:00
Michael Gottesman	113e9285a1	Revert "ARM: Improve pattern for isel mul of vector by scalar." This reverts commit r189619. The commit was breaking the arm_neon_intrinsic test. llvm-svn: 189648	2013-08-30 05:36:14 +00:00
Andrew Trick	447dd8fc9e	mi-sched: improve the generic register pressure comparison. Only compare pressure within the same set. When multiple sets are affected, we prioritize the most constrained set. llvm-svn: 189641	2013-08-30 04:27:29 +00:00
Andrew Trick	3c849ec211	mi-sched: Precompute a PressureDiff for each instruction, adjust for liveness later. Created SUPressureDiffs array to hold the per node PDiff computed during DAG building. Added a getUpwardPressureDelta API that will soon replace the old one. Compute PressureDelta here from the precomputed PressureDiffs. Updating for liveness will come next. llvm-svn: 189640	2013-08-30 03:49:48 +00:00
Bill Schmidt	07bcdd6b9c	[PowerPC] Handle selection of compare instructions in fast-isel. Mostly trivial patch adding support for compares. The meat of the work was added with the branch support. llvm-svn: 189639	2013-08-30 03:16:48 +00:00
Bill Schmidt	19810417cb	[PowerPC] Miscellaneous fast-isel test cases. Here are a few more tests that now pass after the recent fast-isel commits. llvm-svn: 189637	2013-08-30 02:43:08 +00:00
Bill Schmidt	b3f46e50b4	[PowerPC] Add loads, stores, and related things to fast-isel. This is the next big chunk of fast-isel code. The primary purpose is to implement selection of loads and stores, but there is a lot of drag-along to support this. The common code to analyze addresses for both loads and stores is substantial. It's also necessary to add the materialization code for global values. Related to load-store processing is the code to fold loads into integer extends, since otherwise we generate lots of redundant instructions. We also need to add some overrides to some FastEmit routines to ensure we don't assign GPR 0 to a virtual register when this would change the meaning of an instruction. I added handling selection of a few binary arithmetic instructions, to enable committing some test cases I wrote a while back. Finally, ap couple of miscellaneous changes: * I cleaned up some poor style from a previous patch in PPCISelLowering.cpp, pointed out by David Blaikie. * I enlarged the Addr.Offset field to avoid sign problems with 32-bit offsets. llvm-svn: 189636	2013-08-30 02:29:45 +00:00
Jim Grosbach	7089633cb9	ARM: Improve pattern for isel mul of vector by scalar. In addition to recognizing when the multiply's second argument is coming from an explicit VDUPLANE, also look for a plain scalar f32 reference and reference it via the corresponding vector lane. rdar://14870054 llvm-svn: 189619	2013-08-29 22:41:46 +00:00
Elena Demikhovsky	f05835d923	AVX-512: added extend and truncate instructions. llvm-svn: 189580	2013-08-29 11:56:53 +00:00
Tim Northover	02c638e450	ARM: Use "dmb sy" for barriers on M-class CPUs The usual default of "dmb ish" (inner-shareable) isn't even a valid instruction on v6M or v7M (well, it does the same thing but software is strongly discouraged from using it) so we should emit a full-system barrier there. llvm-svn: 189483	2013-08-28 14:39:19 +00:00
Tim Northover	490c4c1bda	ARM: remove unused v(add\|sub)hn and vqdml[as]l intrinsics. Clang is now generating cleaner IR, so this removes the old variants which should be completely unused. llvm-svn: 189481	2013-08-28 14:33:33 +00:00
Tim Northover	e4e6bb8e0e	ARM: add patterns for vqdmlal with separate vqdmull and vqadds The vqdmlal and vqdmlls instructions are really just a fused pair consisting of a vqdmull.sN and a vqadd.sN. This adds patterns to LLVM so that we can switch Clang's CodeGen over to generating these instead of the special vqdmlal intrinsics. llvm-svn: 189480	2013-08-28 12:15:16 +00:00
Daniel Sanders	7d6b0c31fc	[mips][msa] Added bnz.df, bnz.v, bz.df, and bz.v These intrinsics are legalized to V(ALL\|ANY)_(NON)?ZERO nodes, are matched as SN?Z_[BHWDV]_PSEUDO pseudo's, and emitted as a branch/mov sequence to evaluate to 0 or 1. Note: The resulting code is sub-optimal since it doesnt seem to be possible to feed the result of an intrinsic directly into a brcond. At the moment it uses (SETCC (VALL_ZERO $ws), 0, SETEQ) and similar which unnecessarily evaluates the boolean twice. llvm-svn: 189478	2013-08-28 12:14:50 +00:00
Daniel Sanders	86a3b104b1	[mips][msa] Added load/store intrinsics. llvm-svn: 189476	2013-08-28 12:04:29 +00:00
Elena Demikhovsky	2f3377ea54	AVX-512: added SQRT, VRSQRT14, VCOMISS, VUCOMISS, VRCP14, VPABS llvm-svn: 189472	2013-08-28 11:21:58 +00:00
Daniel Sanders	6583601738	[mips][msa] Added move.v llvm-svn: 189471	2013-08-28 10:44:47 +00:00
Richard Sandiford	9fc2e5cdff	[SystemZ] Add support for TMHH, TMHL, TMLH and TMLL For now just handles simple comparisons of an ANDed value with zero. The CC value provides enough information to do any comparison for a 2-bit mask, and some nonzero comparisons with more populated masks, but that's all future work. llvm-svn: 189469	2013-08-28 10:31:43 +00:00
Daniel Sanders	21800e80c1	[mips][msa] Added cfcmsa, and ctcmsa The MSA control registers have been added as reserved registers, and are only used via ISD::Copy(To\|From)Reg. The intrinsics are lowered into these nodes. llvm-svn: 189468	2013-08-28 10:26:24 +00:00
Daniel Sanders	3740f20366	[mips][msa] Added f[cs]af, f[cs]or, f[cs]ueq, f[cs]ul[et], f[cs]une, fsun, ftrunc_[su], hadd_[su], hsub_[su], sr[al]r, sr[al]ri llvm-svn: 189467	2013-08-28 10:12:09 +00:00
Daniel Sanders	6d33546b4a	[mips][msa] Summarize tests Adds a comment to the start of each test summarizing the area the test covers. llvm-svn: 189465	2013-08-28 10:02:29 +00:00
Richard Sandiford	96af6a5cf1	[SystemZ] Extend memcmp support to all constant lengths This uses the infrastructure added for memcpy and memmove in r189331. llvm-svn: 189458	2013-08-28 09:01:51 +00:00

1 2 3 4 5 ...

8098 Commits