llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 23:42:52 +01:00

Author	SHA1	Message	Date
Daniel Sanders	6ffe6fc99c	[mips][msa] Added support for matching mod_[us] from normal IR (i.e. not intrinsics) llvm-svn: 191737	2013-10-01 10:22:35 +00:00
Elena Demikhovsky	84c6cd222d	AVX-512: Added X86vzmovl patterns llvm-svn: 191733	2013-10-01 08:38:02 +00:00
Manman Ren	ad317a135a	TBAA: update tbaa format from scalar format to struct-path aware format. llvm-svn: 191690	2013-09-30 18:17:55 +00:00
Manman Ren	799fd39420	TBAA: remove !tbaa from testing cases when they are not needed. llvm-svn: 191689	2013-09-30 18:17:35 +00:00
Robert Wilhelm	6b36431ffa	Fix spelling intruction -> instruction. llvm-svn: 191610	2013-09-28 11:46:15 +00:00
Tom Stellard	1cb4ba2a4d	R600: Fix handling of NAN in comparison instructions We were completely ignoring the unorder/ordered attributes of condition codes and also incorrectly lowering seto and setuo. Reviewed-by: Vincent Lejeune<vljn at ovi.com> llvm-svn: 191603	2013-09-28 02:50:50 +00:00
Akira Hatanaka	e5351a10fe	[mips] Make sure loads from lazy-binding entries do not get CSE'd or hoisted out of loops. Previously, two consecutive calls to function "func" would result in the following sequence of instructions: 1. load $16, %got(func)($gp) // load address of lazy-binding stub. 2. move $25, $16 3. jalr $25 // jump to lazy-binding stub. 4. nop 5. move $25, $16 6. jalr $25 // jump to lazy-binding stub again. With this patch, the second call directly jumps to func's address, bypassing the lazy-binding resolution routine: 1. load $25, %got(func)($gp) // load address of lazy-binding stub. 2. jalr $25 // jump to lazy-binding stub. 3. nop 4. load $25, %got(func)($gp) // load resolved address of func. 5. jalr $25 // directly jump to func. llvm-svn: 191591	2013-09-28 00:12:32 +00:00
Yunzhong Gao	e51da27a74	Adding intrinsics to the llvm backend for TBM instruction set. Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1750 llvm-svn: 191539	2013-09-27 18:38:42 +00:00
Manman Ren	2ef9ca7627	TBAA: handle scalar TBAA format and struct-path aware TBAA format. Remove the command line argument "struct-path-tbaa" since we should not depend on command line argument to decide which format the IR file is using. Instead, we check the first operand of the tbaa tag node, if it is a MDNode, we treat it as struct-path aware TBAA format, otherwise, we treat it as scalar TBAA format. When clang starts to use struct-path aware TBAA format no matter whether struct-path-tbaa is no, and we can auto-upgrade existing bc files, the support for scalar TBAA format can be dropped. Existing testing cases are updated to use the struct-path aware TBAA format. llvm-svn: 191538	2013-09-27 18:34:27 +00:00
Richard Sandiford	e1db330ce8	[SystemZ] Rein back the use of block operations The backend tries to use block operations like MVC, NC, OC and XC for simple scalar operations. For correctness reasons, it rejects any case in which the regions might partially overlap. However, for performance reasons, it should also reject cases where the regions might be equal, since the instruction might then not use the fast path. This fixes a performance regression seen in bzip2. We may want to limit the optimisation even more in future, or even remove it entirely, but I'll try with this for now. llvm-svn: 191525	2013-09-27 15:29:20 +00:00
Richard Sandiford	cae9d29151	[SystemZ] Improve handling of PC-relative addresses The backend previously folded offsets into PC-relative addresses whereever possible. That's the right thing to do when the address can be used directly in a PC-relative memory reference (using things like LRL). But if we have a register-based memory reference and need to load the PC-relative address separately, it's better to use an anchor point that could be shared with other accesses to the same area of the variable. Fixes a FIXME. llvm-svn: 191524	2013-09-27 15:14:04 +00:00
Daniel Sanders	0987676281	[mips][msa] Implemented insert.d intrinsic. This intrinsic is lowered into an equivalent INSERT_VECTOR_ELT which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191521	2013-09-27 13:36:54 +00:00
Daniel Sanders	3c43957555	[mips][msa] Implemented fill.d intrinsic. This intrinsic is lowered into an equivalent BUILD_VECTOR which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191519	2013-09-27 13:20:41 +00:00
Daniel Sanders	935673af60	[mips][msa] Implemented copy_[us].d intrinsic. This intrinsic is lowered into equivalent copy_s.w instructions during legalization. llvm-svn: 191518	2013-09-27 13:04:21 +00:00
Daniel Sanders	8c83ddcdd2	[mips][msa] Implemented insert_vector_elt for v4f32 and v2f64. For v4f32 and v2f64, INSERT_VECTOR_ELT is matched by a pseudo-insn which is later expanded to appropriate insve.[wd] insns. llvm-svn: 191515	2013-09-27 12:31:32 +00:00
Daniel Sanders	0bb1b5a37f	[mips][msa] Implemented extract_vector_elt for v4f32 or v2f64 For v4f32 and v2f64, EXTRACT_VECTOR_ELT is matched by a pseudo-insn which may be expanded to subregister copies and/or instructions as appropriate. llvm-svn: 191514	2013-09-27 12:17:32 +00:00
Andrea Di Biagio	a10165167b	Remove superfluous comment accidentally checked-in. llvm-svn: 191513	2013-09-27 12:13:58 +00:00
Daniel Sanders	0f009e6be5	[mips][msa] Added support for MSA registers to copyPhysReg llvm-svn: 191512	2013-09-27 12:03:51 +00:00
Daniel Sanders	8e7e5fd076	[mips][msa] Added support for matching splati from normal IR (i.e. not intrinsics) Updated some of the vshf since they (correctly) emit splati's now llvm-svn: 191511	2013-09-27 11:48:57 +00:00
Andrea Di Biagio	a96ff5eeac	Re-apply the change from r191393 with fix for pr17380. This change fixes the problem reported in pr17380 and re-add the dagcombine transformation ensuring that the value types are always legal if the transformation is triggered after Legalization took place. Added the test case from pr17380. llvm-svn: 191509	2013-09-27 11:37:05 +00:00
Daniel Sanders	d13fea547a	[mips][msa] MSA requires FR=1 mode (64-bit FPU register file). Report fatal error when using it in FR=0 mode. llvm-svn: 191498	2013-09-27 10:08:31 +00:00
Daniel Sanders	6a20248b3a	[mips][msa] Expand all truncstores and loadexts for MSA as well as DSP llvm-svn: 191496	2013-09-27 09:44:59 +00:00
Daniel Sanders	27836999cd	[mips][msa] Added missing check in performSRACombine Reviewers: jacksprat, dsanders Reviewed By: dsanders Differential Revision: http://llvm-reviews.chandlerc.com/D1755 llvm-svn: 191495	2013-09-27 09:25:29 +00:00
Weiming Zhao	c16af8ee70	Fix PR 17372: Emitting PLD for stack address for ARM Thumb2 t2PLDi12, t2PLDi8, t2PLDs was omitted in Thumb2InstrInfo. This patch fixes it. llvm-svn: 191441	2013-09-26 17:25:10 +00:00
Bill Schmidt	b5aca928c2	[PowerPC] Fix PR17354: Generate nop after local calls for PIC code. When generating code for shared libraries, even local calls may be intercepted, so we need a nop after the call for the linker to fix up the TOC. Test case adapted from the one provided in PR17354. llvm-svn: 191440	2013-09-26 17:09:28 +00:00
Andrea Di Biagio	0901efb8fb	Revert r191393 since it caused pr17380. llvm-svn: 191438	2013-09-26 16:54:01 +00:00
Venkatraman Govindaraju	af5985d1f5	[Sparc] Implements exception handling in SPARC with DwarfCFI. llvm-svn: 191432	2013-09-26 15:11:00 +00:00
Amara Emerson	80d8b3db1e	[ARM] Use the load-acquire/store-release instructions optimally in AArch32. Patch by Artyom Skrobov. llvm-svn: 191428	2013-09-26 12:22:36 +00:00
Weiming Zhao	14a079be0c	Fix PR 17368: disable vector mul distribution for square of add/sub for ARM Generally, it is desirable to distribute (a + b) * c to ac + bc for ARM with VMLx forwarding, where a, b and c are vectors. However, for (a + b)(a + b), distribution will result in one extra instruction. With distribution: x = a + b (add) y = a x (mul) z = y + b * y (mla) Without distribution: x = a + b (add) z = x * x (mul) This patch checks if a mul is a square of add/sub. If yes, skip distribution. llvm-svn: 191410	2013-09-25 23:12:06 +00:00
Josh Magee	2c804b5636	Test commit. Removed trailing whitespace. llvm-svn: 191402	2013-09-25 22:07:48 +00:00
Reed Kotler	ea8c398b50	Fix a bad typo in the inline assembly code for mips16 pic fp stubs and make one cosmetic cleanup to make it look the same as gcc in this area; adjusting test cases. llvm-svn: 191400	2013-09-25 20:58:50 +00:00
Andrea Di Biagio	1968361975	Teach DAGCombiner how to canonicalize dags according to the rule (shl (zext (shr A, X)), X) => (zext (shl (shr A, X), X)). The rule only triggers when there are no other uses of the zext to avoid materializing more instructions. This helps the DAGCombiner understand that the shl/shr sequence can then be converted into an and instruction. llvm-svn: 191393	2013-09-25 19:01:01 +00:00
Quentin Colombet	b8a9667008	[PR16882] Ignore noreturn definitions when setting isPhysRegUsed. PEI inserts a save/restore sequence for the link register, according to the information it gets from the MachineRegisterInfo. MachineRegisterInfo is populated by the VirtRegMap pass. This pass was not aware of noreturn calls and was registering the definitions of these calls the same way as regular operations. Modify VirtRegPass so that it does not set the isPhysRegUsed information for registers only defined by noreturn calls. The rational is that a noreturn call is the "last instruction" of the program (if it returns the behavior is undefined), so everything that is defined by it cannot be used and will not interfere with anything else. Therefore, it is pointless to account for then. llvm-svn: 191349	2013-09-25 00:26:17 +00:00
Andrew Trick	3b462e7046	CriticalAntiDepBreaker is no longer needed for armv7 scheduling. This is being disabled because it is no longer needed for performance. It is only used by postRAscheduler which is also planned for removal, and it is implemented with an out-dated view of register liveness. It consideres aliases instead of register units, assumes valid kill flags, and assumes implicit uses on partial register defs. Kill flags and implicit operands are error prone and impossible to verify. We should gradually eliminate dependence on them in the postRA phases. Targets that still benefit from this should move to the MI scheduler. If that doesn't solve the problem, then we should add a hook to regalloc to optimize reload placement. llvm-svn: 191348	2013-09-25 00:26:16 +00:00
Eli Friedman	bdb3e2822e	Add missing check to SETCC optimization. PR17338. llvm-svn: 191337	2013-09-24 22:50:14 +00:00
Daniel Sanders	d110591231	[mips][msa] Added support for matching pckev, and pckod from normal IR (i.e. not intrinsics) llvm-svn: 191306	2013-09-24 14:53:25 +00:00
Daniel Sanders	48059bf5ef	[mips][msa] Added support for matching ilv[lr], ilvod, and ilvev from normal IR (i.e. not intrinsics) llvm-svn: 191304	2013-09-24 14:36:12 +00:00
Daniel Sanders	db41b542e8	[mips][msa] Added support for matching shf from normal IR (i.e. not intrinsics) llvm-svn: 191302	2013-09-24 14:20:00 +00:00
Daniel Sanders	7c64721346	[mips][msa] Added support for matching vshf from normal IR (i.e. not intrinsics) llvm-svn: 191301	2013-09-24 14:02:15 +00:00
Daniel Sanders	e154d03143	[mips][msa] Remove the VSPLAT and VSPLATD nodes in favour of matching BUILD_VECTOR. Most constant BUILD_VECTOR's are matched using ComplexPatterns which cover bitcasted as well as normal vectors. However, it doesn't seem to be possible to match ldi.[bhwd] in a type-agnostic manner (e.g. to support the widest range of immediates, it should be possible to use ldi.b to load v2i64) using TableGen so ldi.[bhwd] is matched using custom code in MipsSEISelDAGToDAG.cpp This made the majority of the constant splat BUILD_VECTOR lowering redundant. The only transformation remaining for constant splats is when an (up-to) 32-bit constant splat is possible but the value does not fit into a 10-bit signed integer. In this case, the BUILD_VECTOR is transformed into a bitcasted BUILD_VECTOR so that fill.[bhw] can be used to splat the vector from a GPR32 register (which is initialized using the usual lui/addui sequence). There are no additional tests since this is a re-implementation of previous functionality. The change is intended to make it easier to implement some of the upcoming instruction selection patches since they can rely on existing support for BUILD_VECTOR's in the DAGCombiner. compare_float.ll changed slightly because a BITCAST is no longer introduced during legalization. llvm-svn: 191299	2013-09-24 13:33:07 +00:00
Daniel Sanders	1c08f8b17d	[mips][msa] Non-constant BUILD_VECTOR's should be expanded to INSERT_VECTOR_ELT instead of memory operations. The resulting code is the same length, but doesnt cause memory traffic or latency. llvm-svn: 191297	2013-09-24 13:16:15 +00:00
Daniel Sanders	d201758a30	[mips][msa] Added partial support for matching fmax_a from normal IR (i.e. not intrinsics) This covers the case where fmax_a can be used to implement ISD::FABS. llvm-svn: 191296	2013-09-24 13:02:08 +00:00
Daniel Sanders	fe71effbbd	[mips][msa] Added support for matching andi, ori, nori, and xori from normal IR (i.e. not intrinsics) llvm-svn: 191293	2013-09-24 12:32:47 +00:00
Daniel Sanders	f05ed8bd9a	[mips][msa] Added support for matching max, maxi, min, mini from normal IR (i.e. not intrinsics) llvm-svn: 191291	2013-09-24 12:18:31 +00:00
Daniel Sanders	0167ec55f4	[mips][msa] Added support for matching bsel and bseli from normal IR (i.e. not intrinsics) This required correcting the definition of the bsel and bseli intrinsics. llvm-svn: 191290	2013-09-24 12:04:44 +00:00
Daniel Sanders	9a3de1f604	[mips][msa] Added support for matching comparisons from normal IR (i.e. not intrinsics) MIPS SelectionDAG changes: * Added VCEQ, VCL[ET]_[SU] nodes to represent vector comparisons that produce a bitmask. llvm-svn: 191286	2013-09-24 10:46:19 +00:00
Daniel Sanders	362149b5a7	[mips][msa] Added support for matching slli, srai, and srli from normal IR (i.e. not intrinsics) llvm-svn: 191285	2013-09-24 10:28:18 +00:00
NAKAMURA Takumi	3b910496bf	llvm/test/CodeGen/AArch64/neon-scalar-reduce-pairwise.ll: Use -mtriple here, or aach64-pecoff might be misassumed on win32 hosts. llvm-svn: 191275	2013-09-24 04:14:29 +00:00
Jiangning Liu	5867567c41	Initial support for Neon scalar instructions. Patch by Ana Pazos. 1.Added support for v1ix and v1fx types. 2.Added Scalar Pairwise Reduce instructions. 3.Added initial implementation of Scalar Arithmetic instructions. llvm-svn: 191263	2013-09-24 02:47:27 +00:00
Michael Gottesman	a2ef7dd057	[stackprotector] Forgot to add in PR number to test case. llvm-svn: 191261	2013-09-24 02:10:55 +00:00

1 2 3 4 5 ...

8245 Commits