llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Arnold Schwaighofer	1ecca5fd68	ARM NEON: Handle v16i8 and v8i16 reverse shuffles Lower reverse shuffles to a vrev64 and a vext instruction instead of the default legalization of storing and loading to the stack. This is important because we generate reverse shuffles in the loop vectorizer when we reverse store to an array. uint8_t Arr[N]; for (i = 0; i < N; ++i) Arr[N - i - 1] = ... radar://13171760 llvm-svn: 174929	2013-02-12 01:58:32 +00:00
Krzysztof Parzyszek	272abb00e0	Extend Hexagon hardware loop generation to handle various additional cases: - variety of compare instructions, - loops with no preheader, - arbitrary lower and upper bounds. llvm-svn: 174904	2013-02-11 21:37:55 +00:00
Justin Holewinski	f41b45202c	[NVPTX] Remove NoCapture from address space conversion intrinsics. NoCapture is not valid in this case, and was causing incorrect optimizations. llvm-svn: 174896	2013-02-11 18:56:35 +00:00
Reed Kotler	fbd845c0d9	Add the 16 bit version of addiu. To the assembler, the 16 and 32 bit are the same so we put in the comment field an indicator when we think we are emitting the 16 bit version. For the direct object emitter, the difference is important as well as for other passes which need an accurate count of program size. There will be other similar putbacks to this for various instructions. llvm-svn: 174747	2013-02-08 21:42:56 +00:00
Hal Finkel	624f5d5d67	DAGCombiner: Constant folding around pre-increment loads/stores Previously, even when a pre-increment load or store was generated, we often needed to keep a copy of the original base register for use with other offsets. If all of these offsets are constants (including the offset which was combined into the addressing mode), then this is clearly unnecessary. This change adjusts these other offsets to use the new incremented address. llvm-svn: 174746	2013-02-08 21:35:47 +00:00
Bob Wilson	d9dfcce74f	Revert 172027 and 174336. Remove diagnostics about over-aligned stack objects. Aside from the question of whether we report a warning or an error when we can't satisfy a requested stack object alignment, the current implementation of this is not good. We're not providing any source location in the diagnostics and the current warning is not connected to any warning group so you can't control it. We could improve the source location somewhat, but we can do a much better job if this check is implemented in the front-end, so let's do that instead. <rdar://problem/13127907> llvm-svn: 174741	2013-02-08 20:35:15 +00:00
Reed Kotler	434681ac07	When Mips16 frames grow large, the immediate field may exceed the maximum allowed size for the instruction. This code uses RegScavenger to fix this. We sometimes need 2 registers for Mips16 so we must handle things differently than how register scavenger is normally used. llvm-svn: 174696	2013-02-08 03:57:41 +00:00
Tom Stellard	32a764306e	R600: Add support for SET_DX10 instructions These instructions compare two floating point values and return an integer true (-1) or false (0) value. When compiling code generated by the Mesa GLSL frontend, the SET_DX10 instructions save us four instructions for most branch decisions that use floating-point comparisons. llvm-svn: 174609	2013-02-07 14:02:35 +00:00
Tom Stellard	a29c349245	R600: Add tests for unsupported condition codes. All of the le and lt variants are unsupported. llvm-svn: 174608	2013-02-07 14:02:33 +00:00
Tom Stellard	4ecff0777e	R600: Fix assembly name for SETGT_INT llvm-svn: 174607	2013-02-07 14:02:27 +00:00
Reed Kotler	8ac7f84606	Make sure we call externals from libraries properly when -static. For example, when we are doing mips16 hard float or soft float. llvm-svn: 174583	2013-02-07 04:34:51 +00:00
Reed Kotler	b3d71de768	Enable jumps when in -static mode. llvm-svn: 174580	2013-02-07 03:49:51 +00:00
Eli Bendersky	1854305220	This is a follow-up on r174446, now taking Atom processors into account. Atoms use LEA for updating SP in prologs/epilogs, and the exact LEA opcode depends on the data model. Also reapplying the test case which was added and then reverted (because of Atom failures), this time specifying explicitly the CPU in addition to the triple. The test case now checks all variations (data mode, cpu Atom vs. Core). llvm-svn: 174542	2013-02-06 20:43:57 +00:00
Tim Northover	a6ee94525f	Implement external weak (ELF) symbols on AArch64 Weakly defined symbols should evaluate to 0 if they're undefined at link-time. This is impossible to do with the usual address generation patterns, so we should use a literal pool entry to materlialise the address. llvm-svn: 174518	2013-02-06 16:43:33 +00:00
Eli Bendersky	6ac7b7b5cf	Remove this test in the meantime, since it won't pass on Atom. Atom uses lea to move the stack pointer in prologs/epilogs. I will fix the test and add it back later. llvm-svn: 174484	2013-02-06 03:15:00 +00:00
Manman Ren	6edff4edb0	Attempt to recover gdb bot after r174445. Failure: undefined symbol 'Lline_table_start0'. Root-cause: we use a symbol subtraction to calculate at_stmt_list, but the line table entries are not dumped in the assembly. Fix: use zero instead of a symbol subtraction for Compile Unit 0. llvm-svn: 174479	2013-02-06 00:59:41 +00:00
Eli Bendersky	6f676af6f1	Test for r174446 llvm-svn: 174464	2013-02-05 23:31:48 +00:00
Manman Ren	b9bd895a06	Dwarf: support for LTO where a single object file can have multiple line tables We generate one line table for each compilation unit in the object file. Reviewed by Eric and Kevin. rdar://problem/13067005 llvm-svn: 174445	2013-02-05 21:52:47 +00:00
Akira Hatanaka	df9480569b	[mips] Do not use function CC_MipsN_VarArg unless the function being analyzed is a vararg function. The original code was examining flag OutputArg::IsFixed to determine whether CC_MipsN_VarArg or CC_MipsN should be called. This is not correct, since this flag is often set to false when the function being analyzed is a non-variadic function. llvm-svn: 174442	2013-02-05 21:18:11 +00:00
Owen Anderson	0c8aed61df	Reapply r174343, with a fix for a scary DAG combine bug where it failed to differentiate between the alignment of the base point of a load, and the overall alignment of the load. This caused infinite loops in DAG combine with the original application of this patch. ORIGINAL COMMIT LOG: When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174431	2013-02-05 19:24:39 +00:00
Jyotsna Verma	31124b1816	Hexagon: Use TFR_cond with cmpb.[eq,gt,gtu] to handle zext( set[ne,eq,gt,ugt] (...) ) type of dag patterns. llvm-svn: 174429	2013-02-05 19:20:45 +00:00
Jyotsna Verma	a3fc230d1b	Hexagon: Add testcase for post-increment store instructions. llvm-svn: 174419	2013-02-05 18:23:51 +00:00
Chad Rosier	c645395ab5	[SjLj Prepare] When demoting an invoke instructions to the stack, if the normal edge is critical, then split it so we can insert the store. rdar://13126179 llvm-svn: 174418	2013-02-05 18:23:10 +00:00
Jyotsna Verma	774837ab41	Hexagon: Use multiclass for absolute addressing mode stores. llvm-svn: 174412	2013-02-05 18:15:34 +00:00
Jakob Stoklund Olesen	3149c42e66	Add a test case for PR14750. This was fixed by r174402. llvm-svn: 174405	2013-02-05 18:04:15 +00:00
Tom Stellard	edd3e9004d	R600: Add tests for instruction predicates llvm-svn: 174393	2013-02-05 17:09:13 +00:00
Tom Stellard	be6496bed8	R600: Emit function name in the AsmPrinter Emitting the function name allows us to check for it in the FileCheck tests so we can make sure FileCheck is checking the output of the correct function. llvm-svn: 174392	2013-02-05 17:09:11 +00:00
Jyotsna Verma	56ef42a2ac	Hexagon: Add V4 compare instructions. Enable relationship mapping for the existing instructions. llvm-svn: 174389	2013-02-05 16:42:24 +00:00
NAKAMURA Takumi	d21517b7e6	Revert r174343, "When the target-independent DAGCombiner inferred a higher alignment for a load," It caused hangups in compiling clang/lib/Parse/ParseDecl.cpp and clang/lib/Driver/Tools.cpp in stage2 on some hosts. llvm-svn: 174374	2013-02-05 14:44:16 +00:00
Logan Chien	95ad6bcb45	Link .ARM.exidx with corresponding text section. The sh_link in the ELF section header of .ARM.exidx should be filled with the section index of the corresponding text section. llvm-svn: 174372	2013-02-05 14:18:59 +00:00
Jack Carter	3dfa61ae2c	This patch that sets the EmitAlias flag in td files and enables the instruction printer to print aliased instructions. Due to usage of RegisterOperands a change in common code (utils/TableGen/AsmWriterEmitter.cpp) is required to get the correct register value if it is a RegisterOperand. Contributer: Vladimir Medic llvm-svn: 174358	2013-02-05 08:32:10 +00:00
Owen Anderson	0d5236250e	When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174343	2013-02-05 06:25:30 +00:00
Manman Ren	5380cead1a	[Stack Alignment] emit warning instead of a hard error Per discussion in rdar://13127907, we should emit a hard error only if people write code where the requested alignment is larger than achievable and assumes the low bits are zeros. A warning should be good enough when we are not sure if the source code assumes the low bits are zeros. rdar://13127907 llvm-svn: 174336	2013-02-04 23:45:08 +00:00
Jyotsna Verma	e97a68f8b5	Hexagon: Add V4 combine instructions and some more Def Pats for V2. llvm-svn: 174331	2013-02-04 15:52:56 +00:00
Benjamin Kramer	d07b68b101	Disable a couple more vector splat optimizations on PPC. I didn't see those because the test case used "not grep". FileCheck the test and XFAIL it, preserving the old optimization, so this can be fixed eventually. llvm-svn: 174330	2013-02-04 15:52:32 +00:00
Benjamin Kramer	ae05ca2d32	X86: Open up some opportunities for constant folding by postponing shift lowering. Fixes PR15141. llvm-svn: 174327	2013-02-04 15:19:33 +00:00
Benjamin Kramer	aa2475fd87	SelectionDAG: Teach FoldConstantArithmetic how to deal with vectors. This required disabling a PowerPC optimization that did the following: input: x = BUILD_VECTOR <i32 16, i32 16, i32 16, i32 16> lowered to: tmp = BUILD_VECTOR <i32 8, i32 8, i32 8, i32 8> x = ADD tmp, tmp The add now gets folded immediately and we're back at the BUILD_VECTOR we started from. I don't see a way to fix this currently so I left it disabled for now. Fix some trivially foldable X86 tests too. llvm-svn: 174325	2013-02-04 15:19:18 +00:00
David Blaikie	7c3ec60da7	Remove the (apparently) unnecessary debug info metadata indirection. The main lists of debug info metadata attached to the compile_unit had an extra layer of metadata nodes they went through for no apparent reason. This patch removes that (& still passes just as much of the GDB 7.5 test suite). If anyone can show evidence as to why these extra metadata nodes are there I'm open to reverting this patch & documenting why they're there. llvm-svn: 174266	2013-02-02 05:56:24 +00:00
Reed Kotler	fcde15ab12	Start static relocation implementation for mips16. This checkin makes hello world work. llvm-svn: 174264	2013-02-02 04:07:35 +00:00
Shuxin Yang	af2e8dd42f	rdar://13126763 Fix a bug in DAGCombine. The symptom is mistakenly optimizing expression "x + xx" into "x 3.0". llvm-svn: 174239	2013-02-02 00:22:03 +00:00
Bill Schmidt	d3beefd1a4	LLVM enablement for some older PowerPC CPUs llvm-svn: 174230	2013-02-01 22:59:51 +00:00
David Sehr	59597001bc	Two changes relevant to LEA and x32: 1) allows the use of RIP-relative addressing in 32-bit LEA instructions under x86-64 (ILP32 and LP64) 2) separates the size of address registers in 64-bit LEA instructions from control by ILP32/LP64. llvm-svn: 174208	2013-02-01 19:28:09 +00:00
Jyotsna Verma	5e4467cefa	Hexagon: Test case to confirm generation of indexed loads with zero offset. llvm-svn: 174196	2013-02-01 16:40:06 +00:00
Tim Northover	62526ce9c9	Add explicit triples to AArch64 tests Only Linux is supported at the moment, and other platforms quickly fault. As a result these tests would fail on non-Linux hosts. It may be worth making the tests more generic again as more platforms are supported. llvm-svn: 174170	2013-02-01 11:40:47 +00:00
Tom Stellard	91336f7259	R600: Fold clamp, neg, abs Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174099	2013-01-31 22:11:54 +00:00
Lang Hames	ff8ee44f3a	When lowering memcpys to loads and stores, make sure we don't promote alignments past the natural stack alignment. llvm-svn: 174085	2013-01-31 20:23:43 +00:00
Tim Northover	e2b0519ed8	Add AArch64 as an experimental target. This patch adds support for AArch64 (ARM's 64-bit architecture) to LLVM in the "experimental" category. Currently, it won't be built unless requested explicitly. This initial commit should have support for: + Assembly of all scalar (i.e. non-NEON, non-Crypto) instructions (except the late addition CRC instructions). + CodeGen features required for C++03 and C99. + Compilation for the "small" memory model: code+static data < 4GB. + Absolute and position-independent code. + GNU-style (i.e. "__thread") TLS. + Debugging information. The principal omission, currently, is performance tuning. This patch excludes the NEON support also reviewed due to an outbreak of batshit insanity in our legal department. That will be committed soon bringing the changes to precisely what has been approved. Further reviews would be gratefully received. llvm-svn: 174054	2013-01-31 12:12:40 +00:00
Eric Christopher	ae708feb79	Check and allow floating point registers to select the size of the register for inline asm. This conforms to how gcc allows for effective casting of inputs into gprs (fprs is already handled). llvm-svn: 174008	2013-01-31 00:50:46 +00:00
Eli Bendersky	18a780aca3	Replace some more greps with FileChecks in tests llvm-svn: 174006	2013-01-31 00:44:12 +00:00
Eli Bendersky	dc78605596	Rewrite this test properly with a FileCheck instead of greps llvm-svn: 173997	2013-01-31 00:11:52 +00:00

1 2 3 4 5 ...

6871 Commits