llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 23:42:52 +01:00

Author	SHA1	Message	Date
Richard Sandiford	7921f75ba9	Handle (shl (anyext (shr ...))) in SimpilfyDemandedBits This is really an extension of the current (shl (shr ...)) -> shl optimization. The main difference is that certain upper bits must also not be demanded. The motivating examples are the first two in the testcase, which occur in llvmpipe output. llvm-svn: 192783	2013-10-16 10:26:19 +00:00
NAKAMURA Takumi	ab81f8a305	Revert r192758 (and r192759), "MC: Better handling of tricky symbol and section names" GNU AS didn't like quotes in symbol names. Error: junk at end of line, first unrecognized character is `"' .def "@feat.00"; "@feat.00" = 1 Reproduced on Cygwin's 2.23.52.20130309 and mingw32's 2.20.1.20100303. llvm-svn: 192775	2013-10-16 08:22:49 +00:00
Rafael Espindola	3779f822e1	Add a triple to this test. llvm-svn: 192767	2013-10-16 02:27:33 +00:00
Rafael Espindola	c17b7cf2ed	Add support for metadata representing .ident directives. llvm-svn: 192764	2013-10-16 01:49:05 +00:00
Hans Wennborg	3b3efddc64	MC: Better handling of tricky symbol and section names Because of win32 mangling, we produce symbol and section names with funny characters in them, most notably @ characters. MC would choke on trying to parse its own assembly output. This patch addresses that by: - Making @ trigger quoting of symbol names - Also quote section names in the same way - Just parse section names like other identifiers (to allow for quotes) - Don't assume @ signifies a symbol variant if it is in a string. Differential Revision: http://llvm-reviews.chandlerc.com/D1945 llvm-svn: 192758	2013-10-16 01:20:40 +00:00
Andrew Trick	e3e67d4a0a	Enable MI Sched for x86. This changes the SelectionDAG scheduling preference to source order. Soon, the SelectionDAG scheduler can be bypassed saving a nice chunk of compile time. Performance differences that result from this change are often a consequence of register coalescing. The register coalescer is far from perfect. Bugs can be filed for deficiencies. On x86 SandyBridge/Haswell, the source order schedule is often preserved, particularly for small blocks. Register pressure is generally improved over the SD scheduler's ILP mode. However, we are still able to handle large blocks that require latency hiding, unlike the SD scheduler's BURR mode. MI scheduler also attempts to discover the critical path in single-block loops and adjust heuristics accordingly. The MI scheduler relies on the new machine model. This is currently unimplemented for AVX, so we may not be generating the best code yet. Unit tests are updated so they don't depend on SD scheduling heuristics. llvm-svn: 192750	2013-10-15 23:33:07 +00:00
Chad Rosier	3e791b2408	[AArch64] Add support for NEON scalar signed saturating absolute value and scalar signed saturating negate instructions. llvm-svn: 192733	2013-10-15 21:18:44 +00:00
Manman Ren	39d1a84681	Struct byval: fix a copy-paste error for thumb2. PR17309 llvm-svn: 192730	2013-10-15 19:42:32 +00:00
Michael Liao	1081bbac6c	Fix PR17546 - Type of index used in extract_vector_elt or insert_vector_elt supposes to be TLI.getVectorIdxTy() which is pointer type on most targets. It'd better to truncate (or zero-extend in case it's changed later) it to mask element type to guarantee they are matching instead of asserting that. llvm-svn: 192722	2013-10-15 17:51:58 +00:00
Michael Liao	a94d0a900a	Fix PR16807 - Lower signed division by constant powers-of-2 to target-independent DAG operators instead of target-dependent ones to support them better on targets where vector types are legal but shift operators on that types are illegal. E.g., on AVX, PSRAW is only available on <8 x i16> though <16 x i16> is a legal type. llvm-svn: 192721	2013-10-15 17:51:02 +00:00
Daniel Sanders	21c7c7cd9b	[mips][msa] Added support for build_vector for v4f32 and v2f64. llvm-svn: 192699	2013-10-15 13:14:41 +00:00
Richard Sandiford	86798c4d26	[SystemZ] Use A(G)SI when spilling the target of a constant addition llvm-svn: 192681	2013-10-15 08:42:59 +00:00
Job Noorman	54f125fb4b	Fix MSP430 calling convention to match MSPGCC llvm-svn: 192678	2013-10-15 08:19:39 +00:00
NAKAMURA Takumi	8c0f09fed1	llvm/test/CodeGen/X86/break-avx-dep.ll: Relax an expression to be matched to also r[89], not only rXX. llvm-svn: 192675	2013-10-15 06:36:36 +00:00
Andrew Trick	e196a05dc8	Improve on r192635, ExeDepsFix for avx, and add a test case. rdar:15221834 False AVX register dependencies cause 5x slowdown on flops-5/6 and significant slowdown on several others. This was blocking the switch to MI-Sched. llvm-svn: 192669	2013-10-15 03:39:43 +00:00
Akira Hatanaka	29e44ea3aa	[mips] Transfer kill flag to the newly created operand. llvm-svn: 192662	2013-10-15 01:06:30 +00:00
Quentin Colombet	cb4b84532c	[X86][FastISel] During X86 fastisel, the address of indirect call was resolved through bitcast, ptrtoint, and inttoptr instructions. This is valid only if the related instructions are in that same basic block, otherwise we may reference variables that were not live accross basic blocks resulting in undefined virtual registers. The bug was exposed when both SDISel and FastISel were used within the same function, i.e., one basic block is issued with FastISel and another with SDISel, as demonstrated with the testcase. <rdar://problem/15192473> llvm-svn: 192636	2013-10-14 22:32:09 +00:00
Nick Lewycky	0da8d88a82	Fix a typo, in a comment, in a test. llvm-svn: 192632	2013-10-14 22:02:53 +00:00
Eric Christopher	1a04817b81	Revert part of a fix from 2010, changes since then: a) x86-64 TLS has been documented b) the code path should use movq for the correct relocation to be generated. I've also added a fixme for the test case that we should improve the code generated, it should look something like is documented in the tls abi document. llvm-svn: 192631	2013-10-14 21:52:26 +00:00
Will Dietz	ad27c13a64	MachineSink: Fix and tweak critical-edge breaking heuristic. Per original comment, the intention of this loop is to go ahead and break the critical edge (in order to sink this instruction) if there's reason to believe doing so might "unblock" the sinking of additional instructions that define registers used by this one. The idea is that if we have a few instructions to sink "together" breaking the edge might be worthwhile. This commit makes a few small changes to help better realize this goal: First, modify the loop to ignore registers defined by this instruction. We don't sink definitions of physical registers, and sinking an SSA definition isn't going to unblock an upstream instruction. Second, ignore uses of physical registers. Instructions that define physical registers are rejected for sinking, and so moving this one won't enable moving any defining instructions. As an added bonus, while virtual register use-def chains are generally small due to SSA goodness, iteration over the uses and definitions (used by hasOneNonDBGUse) for physical registers like EFLAGS can be rather expensive in practice. (This is the original reason for looking at this) Finally, to keep things simple continue to only consider this trick for registers that have a single use (via hasOneNonDBGUse), but to avoid spuriously breaking critical edges only do so if the definition resides in the same MBB and therefore this one directly blocks it from being sunk as well. If sinking them together is meant to be, let the iterative nature of this pass sink the definition into this block first. Update tests to accomodate this change, add new testcase where sinking avoids pipeline stalls. llvm-svn: 192608	2013-10-14 16:57:17 +00:00
Chad Rosier	40761dc629	[AArch64] Add support for NEON scalar integer compare instructions. llvm-svn: 192596	2013-10-14 14:37:20 +00:00
Bernard Ogden	f482ee15d7	Add Cortex-A57 support llvm-svn: 192591	2013-10-14 13:17:07 +00:00
Bernard Ogden	ec0167a2ce	Add subtarget feature support for Cortex-A53 Some previous implicit defaults have changed, for example FP and NEON are now on by default. llvm-svn: 192590	2013-10-14 13:16:57 +00:00
Elena Demikhovsky	c460e7e50a	Fixed a bug in dynamic allocation memory on stack. The alignment of allocated space was wrong, see Bugzila 17345. Done by Zvi Rackover <zvi.rackover@intel.com>. llvm-svn: 192573	2013-10-14 07:26:51 +00:00
Vincent Lejeune	7594bd2071	R600: improve dump of S_WAITCNT llvm-svn: 192557	2013-10-13 17:56:28 +00:00
Vincent Lejeune	316b632e03	R600: Use masked read sel for texture instructions llvm-svn: 192554	2013-10-13 17:56:10 +00:00
Vincent Lejeune	b337ac16bc	R600: fix swizzle export llvm-svn: 192553	2013-10-13 17:56:04 +00:00
Benjamin Kramer	3000b81f9a	Force a CPU on test so it doesn't depend on microarchitectural scheduling decisions. llvm-svn: 192532	2013-10-12 11:17:12 +00:00
Reed Kotler	9efb450361	For Mips16, start to consolidate all forms of 32 bit literal loading so that they can be better handled and optimized in the Mips16 constant island code. llvm-svn: 192520	2013-10-12 02:19:08 +00:00
Matt Arsenault	289accc07f	R600: Add scalar i32 add test llvm-svn: 192501	2013-10-11 21:03:41 +00:00
Matt Arsenault	d5c3e13cc5	Use CHECK-LABEL llvm-svn: 192500	2013-10-11 21:03:39 +00:00
Matthias Braun	f96d183309	Remove kill flags after if conversion if necessary When if converting something like: true: ... = R0<kill> false: ... = R0<kill> then the instructions of the true block must not have a <kill> flag anymore, as the instruction of the false block follow and do still read the R0 value. Specifically this patch determines the set of register live-in in the false block (possibly after simulating the liveness changes of the duplicated instructions). Each of these live-in registers mustn't be killed. llvm-svn: 192482	2013-10-11 19:04:37 +00:00
Quentin Colombet	7ba3455dfc	[DAGCombiner] Load slicing test case: attempt to really fix the buildbots (used sse4.2 instead of avx!). <rdar://problem/14477220> llvm-svn: 192480	2013-10-11 18:54:49 +00:00
Quentin Colombet	c02e5604f4	[DAGCombiner] Reapply load slicing (192471) with a test that explicitly set sse4.2 support. This should fix the buildbots. Original commit message: [DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192476	2013-10-11 18:29:42 +00:00
Quentin Colombet	fd0097531f	[DAGCombiner] Revert load slicing (r192471), until I figure out why it fails on ubuntu. llvm-svn: 192474	2013-10-11 18:17:17 +00:00
Matthias Braun	434fbd854b	Revert "Tests: Be less dependent on a specific schedule/regalloc" This reverts r192454 Apparently FileCheck isn't as smart as I though and does not enforce a topological order between variable defs+uses. llvm-svn: 192472	2013-10-11 18:09:19 +00:00
Quentin Colombet	b60dc81c8b	[DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192471	2013-10-11 18:01:14 +00:00
Amara Emerson	bf6dcda63c	[ARM] Fix FP ABI attributes with no VFP enabled. llvm-svn: 192458	2013-10-11 16:03:43 +00:00
Matthias Braun	4beef11e35	Tests: Be less dependent on a specific schedule/regalloc llvm-svn: 192454	2013-10-11 15:40:12 +00:00
Matheus Almeida	73759d3a3b	[mips][msa] Improves robustness of the test by enhancing pattern matching. llvm-svn: 192446	2013-10-11 13:18:01 +00:00
Justin Holewinski	9769d1f0ef	[NVPTX] Switch from StrongPHIElimination to PHIElimination in NVPTXTargetMachine, and add some missing optimization passes to addOptimizedRegAlloc Fixes PR17529 llvm-svn: 192445	2013-10-11 12:39:39 +00:00
Justin Holewinski	f7d6ae0d5b	Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom comments for implicit defs For NVPTX, this fixes a crash where the emitImplicitDef implementation was expecting physical registers, while NVPTX uses virtual registers (with a couple of exceptions). Now, the implicit def comment will be emitted as a true PTX register name. Other targets can use this to customize the output of implicit def comments. Fixes PR17519 llvm-svn: 192444	2013-10-11 12:39:36 +00:00
Amara Emerson	83afefcfe3	[ARM] Add a test case for disabled neon/fpu features. llvm-svn: 192440	2013-10-11 11:07:00 +00:00
Daniel Sanders	3649e05b17	[mips][msa] Added support for matching maddv.[bhwd], and msubv.[bhwd] from normal IR (i.e. not intrinsics) llvm-svn: 192438	2013-10-11 10:50:42 +00:00
Daniel Sanders	9bec7b823b	[mips][msa] Added support for matching fmsub.[wd] from normal IR (i.e. not intrinsics) llvm-svn: 192435	2013-10-11 10:27:32 +00:00
Robert Lytton	864d2bd56d	XCore target fix bug in emitArrayBound() causing segmentation fault llvm-svn: 192434	2013-10-11 10:27:13 +00:00
Robert Lytton	12def987ea	XCore target does not emit '.hidden' or '.protected' attributes llvm-svn: 192433	2013-10-11 10:27:00 +00:00
Robert Lytton	b441cef9c5	XCore target: fix bug in XCoreLowerThreadLocal.cpp When a ConstantExpr which uses a thread local is part of a PHI node instruction, the insruction that replaces the ConstantExpr must be inserted in the predecessor block, in front of the terminator instruction. If the predecessor block has multiple successors, the edge is first split. llvm-svn: 192432	2013-10-11 10:26:48 +00:00
Robert Lytton	e5a2d050ac	XCore target: add XCoreTargetLowering::isZExtFree() llvm-svn: 192431	2013-10-11 10:26:29 +00:00
Daniel Sanders	253e018134	[mips][msa] Added support for matching fmadd.[wd] from normal IR (i.e. not intrinsics) llvm-svn: 192430	2013-10-11 10:14:25 +00:00

1 2 3 4 5 ...

8377 Commits