llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 22:12:57 +02:00

Author	SHA1	Message	Date
Venkatraman Govindaraju	b02c5bdb65	[Sparc] Make floating point branch instruction formats to accept %fcc0-%fcc1 conditional registers as input. No functionality change. llvm-svn: 202614	2014-03-02 04:43:45 +00:00
Venkatraman Govindaraju	6aea38dc6d	[Sparc] Add support for parsing fcmp with %fcc registers. llvm-svn: 202610	2014-03-02 03:39:39 +00:00
Venkatraman Govindaraju	591cfa469a	[Sparc] Add register class for floating point conditional flags (%fcc0 - %fcc3). llvm-svn: 202604	2014-03-02 02:12:33 +00:00
Venkatraman Govindaraju	1c2ec775cb	[SparcV9] Add support for parsing branch instructions with prediction. llvm-svn: 202602	2014-03-01 22:03:07 +00:00
Hal Finkel	4937443651	Remove extra truncs/exts around i32 bit operations on PPC64 This generalizes the code to eliminate extra truncs/exts around i1 bit operations to also do the same on PPC64 for i32 bit operations. This eliminates a fairly prevalent code wart: int foo(int a) { return a == 5 ? 7 : 8; } On PPC64, because of the extension implied by the ABI, this would generate: cmplwi 0, 3, 5 li 12, 8 li 4, 7 isel 3, 4, 12, 2 rldicl 3, 3, 0, 32 blr where the 'rldicl 3, 3, 0, 32', the extension, is completely unnecessary. At least for the single-BB case (which is all that the DAG combine mechanism can handle), this unnecessary extension is no longer generated. llvm-svn: 202600	2014-03-01 21:36:57 +00:00
Venkatraman Govindaraju	44a4d4b894	[Sparc] Add support for parsing annulled branch instructions. llvm-svn: 202599	2014-03-01 20:08:48 +00:00
Venkatraman Govindaraju	30d1614f94	[Sparc] Add support for parsing sparcv9 instructions addc/subc/addccc/subccc. llvm-svn: 202598	2014-03-01 18:54:52 +00:00
Venkatraman Govindaraju	96169e7898	[Sparc] Add missing ALU instruction patterns. llvm-svn: 202597	2014-03-01 17:51:00 +00:00
Benjamin Kramer	803ba41365	Now that we have C++11, turn simple functors into lambdas and remove a ton of boilerplate. No intended functionality change. llvm-svn: 202588	2014-03-01 11:47:00 +00:00
Venkatraman Govindaraju	9a36f7b46b	[Sparc] Add support to decode unimp instruction. llvm-svn: 202581	2014-03-01 09:28:18 +00:00
Venkatraman Govindaraju	043ff79772	[Sparc] Add support to decode negative simm13 operands in the sparc disassembler. llvm-svn: 202578	2014-03-01 09:11:57 +00:00
Venkatraman Govindaraju	fcbe857272	[Sparc] Add support for decoding call instructions in the sparc disassembler. llvm-svn: 202577	2014-03-01 08:30:58 +00:00
Venkatraman Govindaraju	1eb4e172be	[Sparc] Add support to disassemble sparc memory instructions. llvm-svn: 202575	2014-03-01 07:46:33 +00:00
Venkatraman Govindaraju	9242ded302	[Sparc] Implement writeNopData. Emit actual NOP instruction instead of just filling with zeroes. llvm-svn: 202572	2014-03-01 05:45:09 +00:00
Venkatraman Govindaraju	dd81a65959	[Sparc] Teach SparcAsmParser to emit correct relocations for PIC code. llvm-svn: 202571	2014-03-01 05:07:21 +00:00
Venkatraman Govindaraju	a60509dc31	[Sparc] 80 column rule. No functionality change. llvm-svn: 202565	2014-03-01 02:28:34 +00:00
Venkatraman Govindaraju	789e2fd1b7	[Sparc] Add support for parsing directives in SparcAsmParser. llvm-svn: 202564	2014-03-01 02:18:04 +00:00
Venkatraman Govindaraju	439a7d90a6	[Sparc] Emit 'restore' instead of 'restore %g0, %g0, %g0'. This improves the readability of the generated code. llvm-svn: 202563	2014-03-01 01:04:26 +00:00
Tom Stellard	f4a7aeb04e	R600: Verify all instructions in the AsmPrinter on debug builds Make a call to R600's implementation of verifyInstruction() to check that instructions are only using legal operands. llvm-svn: 202544	2014-02-28 21:36:41 +00:00
Tom Stellard	6280afdecd	R600/SI: Expand all v16[if]32 operations llvm-svn: 202543	2014-02-28 21:36:37 +00:00
Zoran Jovanovic	9c1887bef4	Fixed operand of SC microMIPS instruction. llvm-svn: 202526	2014-02-28 18:22:56 +00:00
Zoran Jovanovic	ebb68d0712	Fixed encoding of SYSCALL microMIPS instruction. llvm-svn: 202523	2014-02-28 18:17:08 +00:00
Zoran Jovanovic	d914bd8ae2	Revert revision 202518 because of wrong commit message. llvm-svn: 202521	2014-02-28 18:14:16 +00:00
Zoran Jovanovic	43ca53260b	Fix operand of SC instruction. llvm-svn: 202518	2014-02-28 18:02:17 +00:00
Evgeniy Stepanov	da60e2a9aa	X86Operand is extracted into individual header. X86Operand is extracted into individual header, because it allows to create an arbitrary memory operand and append it to MCInst. It'll be reused in X86 inline assembly instrumentation. Patch by Yuri Gorshenin. llvm-svn: 202496	2014-02-28 12:28:07 +00:00
NAKAMURA Takumi	ee6de3fa2e	Reorder Mips/MCTargetDesc/CMakeLists.txt. llvm-svn: 202483	2014-02-28 10:18:21 +00:00
Sasa Stankovic	1eac2858b7	[mips] Add MipsNaClELFStreamer.cpp to CMakeLists.txt. llvm-svn: 202482	2014-02-28 10:14:12 +00:00
Sasa Stankovic	b0018b8bdb	[mips] Implement NaCl sandboxing of indirect jumps: * Align targets of indirect jumps to instruction bundle boundaries (in MI layer). * Add masking instructions before indirect jumps (in MC layer). Differential Revision: http://llvm-reviews.chandlerc.com/D2847 llvm-svn: 202479	2014-02-28 10:00:38 +00:00
Hal Finkel	1970087008	Swap PPC isel operands to allow for 0-folding The PPC isel instruction can fold 0 into the first operand (thus eliminating the need to materialize a zero-containing register when the 'true' result of the isel is 0). When the isel is fed by a bit register operation that we can invert, do so as part of the bit-register-operation peephole routine. llvm-svn: 202469	2014-02-28 06:11:16 +00:00
Hal Finkel	3bd3f4e287	Trying to unbreak the darwin11 builder The CR bit tracking code broke PPC/Darwin; trying to get it working again... (the darwin11 builder, which defaults to the darwin ABI when running PPC tests, asserted when running test/CodeGen/PowerPC/inverted-bool-compares.ll) llvm-svn: 202459	2014-02-28 01:17:25 +00:00
Hal Finkel	94f3724df6	Try to unbreak the C++11 build Cannot use negative numbers in case statements without running afoul of -Wc++11-narrowing. llvm-svn: 202455	2014-02-28 00:45:27 +00:00
Hal Finkel	883c64377d	Add CR-bit tracking to the PowerPC backend for i1 values This change enables tracking i1 values in the PowerPC backend using the condition register bits. These bits can be treated on PowerPC as separate registers; individual bit operations (and, or, xor, etc.) are supported. Tracking booleans in CR bits has several advantages: - Reduction in register pressure (because we no longer need GPRs to store boolean values). - Logical operations on booleans can be handled more efficiently; we used to have to move all results from comparisons into GPRs, perform promoted logical operations in GPRs, and then move the result back into condition register bits to be used by conditional branches. This can be very inefficient, because the throughput of these CR <-> GPR moves have high latency and low throughput (especially when other associated instructions are accounted for). - On the POWER7 and similar cores, we can increase total throughput by using the CR bits. CR bit operations have a dedicated functional unit. Most of this is more-or-less mechanical: Adjustments were needed in the calling-convention code, support was added for spilling/restoring individual condition-register bits, and conditional branch instruction definitions taking specific CR bits were added (plus patterns and code for generating bit-level operations). This is enabled by default when running at -O2 and higher. For -O0 and -O1, where the ability to debug is more important, this feature is disabled by default. Individual CR bits do not have assigned DWARF register numbers, and storing values in CR bits makes them invisible to the debugger. It is critical, however, that we don't move i1 values that have been promoted to larger values (such as those passed as function arguments) into bit registers only to quickly turn around and move the values back into GPRs (such as happens when values are returned by functions). A pair of target-specific DAG combines are added to remove the trunc/extends in: trunc(binary-ops(binary-ops(zext(x), zext(y)), ...) and: zext(binary-ops(binary-ops(trunc(x), trunc(y)), ...) In short, we only want to use CR bits where some of the i1 values come from comparisons or are used by conditional branches or selects. To put it another way, if we can do the entire i1 computation in GPRs, then we probably should (on the POWER7, the GPR-operation throughput is higher, and for all cores, the CR <-> GPR moves are expensive). POWER7 test-suite performance results (from 10 runs in each configuration): SingleSource/Benchmarks/Misc/mandel-2: 35% speedup MultiSource/Benchmarks/Prolangs-C++/city/city: 21% speedup MultiSource/Benchmarks/MiBench/automotive-susan: 23% speedup SingleSource/Benchmarks/CoyoteBench/huffbench: 13% speedup SingleSource/Benchmarks/Misc-C++/Large/sphereflake: 13% speedup SingleSource/Benchmarks/Misc-C++/mandel-text: 10% speedup SingleSource/Benchmarks/Misc-C++-EH/spirit: 10% slowdown MultiSource/Applications/lemon/lemon: 8% slowdown llvm-svn: 202451	2014-02-28 00:27:01 +00:00
Andrew Trick	a8fdecaea7	Provide a target override for the latest regalloc heuristic. This is a temporary workaround for native arm linux builds: PR18996: Changing regalloc order breaks "lencod" on native arm linux builds. llvm-svn: 202433	2014-02-27 21:37:33 +00:00
Roman Divacky	f36febf578	Lower FNEG just like FABS to fneg[ds] and fmov[ds], thus avoiding expensive libcall. Also, Qp_neg is not implemented on at least FreeBSD. This is also what gcc is doing. llvm-svn: 202422	2014-02-27 19:26:29 +00:00
Adrian Prantl	d7f77dd966	Debug info: Remove ARMAsmPrinter::EmitDwarfRegOp(). AsmPrinter can now scan the register file for sub- and super-registers. No functionality change intended. (Tests are updated because the comments in the assembler output are different.) llvm-svn: 202416	2014-02-27 17:56:08 +00:00
Richard Osborne	947c19eaa0	[XCore] Support functions returning more than 4 words. If a function returns a large struct by value return the first 4 words in registers and the rest on the stack in a location reserved by the caller. This is needed to support the xC language which supports functions returning an arbitrary number of return values. This is r202397 reapplied with a fix to avoid an uninitialized read of a member. llvm-svn: 202414	2014-02-27 17:47:54 +00:00
Richard Osborne	f1c5c83f06	[XCore] Make LowerCallResult a static function. No functionality change. This is r202396 reapplied with no changes. llvm-svn: 202413	2014-02-27 17:47:48 +00:00
Rafael Espindola	7a4b8493b1	Remove MCPureStreamer. We moved MCJIT to use native object formats a long time ago and R600 now uses ELF, so it was dead. llvm-svn: 202408	2014-02-27 16:17:34 +00:00
Richard Osborne	f8fb4e8a7f	Revert r202396, r202397. These are causing test failures, revert for now. llvm-svn: 202398	2014-02-27 14:24:13 +00:00
Richard Osborne	cb6866dfec	[XCore] Support functions returning more than 4 words. Summary: If a function returns a large struct by value return the first 4 words in registers and the rest on the stack in a location reserved by the caller. This is needed to support the xC language which supports functions returning an arbitrary number of return values. Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2889 llvm-svn: 202397	2014-02-27 14:00:40 +00:00
Richard Osborne	35b73c788e	[XCore] Make LowerCallResult a static function. No functionality change. llvm-svn: 202396	2014-02-27 14:00:34 +00:00
Richard Osborne	5ac74685fd	[XCore] Target optimized library function __memcpy_4() Summary: If the src, dst and size of a memcpy are known to be 4 byte aligned we can call __memcpy_4() instead of memcpy(). Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2871 llvm-svn: 202395	2014-02-27 13:39:07 +00:00
Richard Osborne	75c16f2bf4	[XCore] Add dag combines for instructions that ignore some input bits. These instructions ignore the high bits of one of their input operands - try and use this to simplify the code. llvm-svn: 202394	2014-02-27 13:20:11 +00:00
Richard Osborne	f815df9c6e	[XCore] Provide information about known zero bits of resource instructions. llvm-svn: 202393	2014-02-27 13:20:06 +00:00
Craig Topper	6d9ad3a694	[X86] Fix Uses/Defs lists for INS, OUTS, SCAS, CMPS, LODS llvm-svn: 202348	2014-02-27 05:08:25 +00:00
Craig Topper	e406fab3df	[X86] Add RAX/EAX/AX Uses/Defs to XCHG RAX/EAX/AX instructions. llvm-svn: 202347	2014-02-27 04:27:00 +00:00
Craig Topper	4fcab63947	[X86] Add RAX/EAX/AX/AL Uses/Defs to the absolute memory location move instructions. Patch by Florian Lukas with some additional instructions fixed by me. Fixes PR18975. llvm-svn: 202345	2014-02-27 04:07:57 +00:00
Michel Danzer	8edacce1de	R600/SI: Optimize SI_KILL for constant operands If the SI_KILL operand is constant, we can either clear the exec mask if the operand is negative, or do nothing otherwise. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202337	2014-02-27 01:47:09 +00:00
Michel Danzer	0ddce64f7c	R600/SI: Allow SI_KILL for geometry shaders Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202336	2014-02-27 01:47:02 +00:00
Matt Arsenault	4ecfd35fdd	R600: Remove unnecessary build_vector pattern. It is already fully handled in AMDGPUISelDAGToDAG. llvm-svn: 202312	2014-02-26 23:00:58 +00:00
Quentin Colombet	e639a79f72	Lower unsigned vsetcc to psubus in certain cases The current approach to lower a vsetult is to flip the sign bit of the operands, swap the operands and then use a (signed) pcmpgt. psubus (unsigned saturating subtract) can be used to emulate a vsetult more efficiently: + case ISD::SETULT: { + // If the comparison is against a constant we can turn this into a + // setule. With psubus, setule does not require a swap. This is + // beneficial because the constant in the register is no longer + // destructed as the destination so it can be hoisted out of a loop. I also enable lowering via psubus in a few other cases where it's clearly beneficial: setule and setuge if minu/maxu cannot be used. rdar://problem/14338765 Patch by Adam Nemet <anemet@apple.com>. llvm-svn: 202301	2014-02-26 21:39:12 +00:00
Aaron Ballman	9e5315239d	Silencing an MSVC signed comparison warning. llvm-svn: 202295	2014-02-26 20:22:20 +00:00
Artyom Skrobov	94122a0879	ARMv8 IfConversion must skip narrow instructions that a) define CPSR and b) wouldn't affect CPSR in an IT block llvm-svn: 202257	2014-02-26 11:27:28 +00:00
Daniel Sanders	50f3bc6330	[mips] Treat -mcpu=generic the same way as an empty CPU string. Summary: This should fix the MCJIT unit tests that were broken by r201792 on the MIPS buildbot. MIPS currently uses the default implementation of sys::getHostCPUName() which always returns "generic". For now, we will accept "generic" and coerce it to "mips32" or "mips64" depending on the target architecture like we do for empty CPU names. Reviewers: jacksprat, matheusalmeida Reviewed By: jacksprat Differential Revision: http://llvm-reviews.chandlerc.com/D2878 llvm-svn: 202253	2014-02-26 10:20:15 +00:00
Craig Topper	ab427284c0	[x86] Add same itinerary to SYSEXIT64 as SYSEXIT for consistency. llvm-svn: 202240	2014-02-26 06:50:27 +00:00
Craig Topper	f8c9edf05c	[x86] Remove some unused instruction format classes. llvm-svn: 202234	2014-02-26 06:06:38 +00:00
Craig Topper	0fa9073645	[x86] Simplify disassembler code slightly. llvm-svn: 202233	2014-02-26 06:01:21 +00:00
Rafael Espindola	1b07b35205	Use DataLayout from the module when easily available. Eventually DataLayoutPass should go away, but for now that is the only easy way to get a DataLayout in some APIs. This patch only changes the ones that have easy access to a Module. One interesting issue with sometimes using DataLayoutPass and sometimes fetching it from the Module is that we have to make sure they are equivalent. We can get most of the way there by always constructing the pass with a Module. In fact, the pass could be changed to point to an external DataLayout instead of owning one to make this stricter. Unfortunately, the C api passes a DataLayout, so it has to be up to the caller to make sure the pass and the module are in sync. llvm-svn: 202204	2014-02-25 23:25:17 +00:00
Tom Stellard	c49658a11c	R600: Don't unconditionally unroll loops with private memory accesses This causes the size of the scrypt kernel to explode and eats all the memory on some systems. llvm-svn: 202195	2014-02-25 21:36:21 +00:00
Tom Stellard	3dafad8efc	R600/SI: Custom select 64-bit ADD llvm-svn: 202194	2014-02-25 21:36:18 +00:00
Hal Finkel	08c64addef	Account for 128-bit integer operations in PPCCTRLoops We need to abort the formation of counter-register-based loops where there are 128-bit integer operations that might become function calls. llvm-svn: 202192	2014-02-25 20:51:50 +00:00
Richard Osborne	d5250f323a	[XCore] Add intrinsic for CLRPT (clear port time) instruction. llvm-svn: 202172	2014-02-25 17:31:15 +00:00
Richard Osborne	127dc9d63c	[XCore] Add intrinsic for EDU (event disable unconditional) instruction. llvm-svn: 202171	2014-02-25 17:31:06 +00:00
Rafael Espindola	32da4bdd4b	Make DataLayout a plain object, not a pass. Instead, have a DataLayoutPass that holds one. This will allow parts of LLVM don't don't handle passes to also use DataLayout. llvm-svn: 202168	2014-02-25 17:30:31 +00:00
Richard Osborne	871fa66400	[XCore] Prefer to word align functions. The behaviour of the XCore's instruction buffer means that the performance of the same code sequence can differ depending on whether it starts at a 4 byte aligned address or not. Since we don't model the instruction buffer in the backend we have no way of knowing for sure if it is beneficial to word align a specific function. However, in the absence of precise modelling, it is better on balance to word align functions because: * It makes a fetch-nop while executing the prologue slightly less likely. * If we don't word align functions then a small perturbation in one function can have a dramatic knock on effect. If the size of the function changes it might change the alignment and therefore the performance of all the functions that happen to follow it in the binary. This butterfly effect makes it harder to reason about and measure the performance of code. llvm-svn: 202163	2014-02-25 16:37:15 +00:00
Alp Toker	f3e1a22860	Fix typos llvm-svn: 202107	2014-02-25 04:21:15 +00:00
Rafael Espindola	6c834371d9	Make some DataLayout pointers const. No functionality change. Just reduces the noise of an upcoming patch. llvm-svn: 202087	2014-02-24 23:12:18 +00:00
Albrecht Kadlec	7a0ac75c6a	trivial test commit llvm-svn: 202084	2014-02-24 22:18:38 +00:00
Matt Arsenault	3af294610c	Fix unused variable llvm-svn: 202080	2014-02-24 21:16:50 +00:00
Matt Arsenault	a3de4dc001	R600/SI - Add new CI arithmetic instructions. Does not yet include larger part required to match v_mad_i64_i32 / v_mad_u64_u32. llvm-svn: 202077	2014-02-24 21:01:28 +00:00
Matt Arsenault	cee8954d45	R600: Make check clearer. The check is clearer as southern islands or later, rather than checking for later than northern islands. llvm-svn: 202076	2014-02-24 21:01:23 +00:00
Matt Arsenault	9b1fec610f	Fix DOT4 missing from getTargetOpcodeName llvm-svn: 202075	2014-02-24 21:01:21 +00:00
Quentin Colombet	282bf4e578	[X86][SchedModel] Add missing scheduling model for SSE related instructions. The patch defines new or refines existing generic scheduling classes to match the behavior of the SSE instructions. It also maps those scheduling classes on the related SSE instructions. <rdar://problem/15607571> llvm-svn: 202065	2014-02-24 19:33:51 +00:00
Roman Divacky	2ff7280e46	Add a dwarf number to the Y register. llvm-svn: 202057	2014-02-24 18:41:31 +00:00
Rafael Espindola	d89ca7eab7	Replace the F_Binary flag with a F_Text one. After this I will set the default back to F_None. The advantage is that before this patch forgetting to set F_Binary would corrupt a file on windows. Forgetting to set F_Text produces one that cannot be read in notepad, which is a better failure mode :-) llvm-svn: 202052	2014-02-24 18:20:12 +00:00
Christian Pirker	1c907c9022	Add AArch64 big endian Target (aarch64_be) llvm-svn: 202024	2014-02-24 11:34:50 +00:00
Elena Demikhovsky	ade0be1dbb	AVX-512: Fixed encoding of VPCMPEQ and VPCMPGT llvm-svn: 202015	2014-02-24 10:08:30 +00:00
Benjamin Kramer	bb5b968592	SPARC: Implement TRAP lowering. Matches what GCC emits. llvm-svn: 201994	2014-02-23 21:43:52 +00:00
Saleem Abdulrasool	686f45ad24	ARMAsmParser: whitespace llvm-svn: 201989	2014-02-23 17:45:36 +00:00
Saleem Abdulrasool	05ca7814d9	ARM IAS: support .align without parameters .align is handled specially on certain targets. .align without any parameters on ARM indicates a default alignment (4). Handle the special case in the target parser, but fall back to the generic parser for the normal version. llvm-svn: 201988	2014-02-23 17:45:32 +00:00
Elena Demikhovsky	1804845947	AVX-512: Fixed encoding of VPTESTMQ llvm-svn: 201980	2014-02-23 14:28:35 +00:00
Saleem Abdulrasool	39ff879a52	ARM IAS: support .short and .hword This adds support for the .short and its alias .hword for adding literal values into the object file. This is similar to the .word directive, however, rather than inserting a value of 4 bytes, adds a 2-byte value. llvm-svn: 201968	2014-02-23 06:22:09 +00:00
Logan Chien	a067775c77	Move get[S\|U]LEB128Size() to LEB128.h. This commit moves getSLEB128Size() and getULEB128Size() from MCAsmInfo to LEB128.h and removes some copy-and-paste code. Besides, this commit also adds some unit tests for the LEB128 functions. llvm-svn: 201937	2014-02-22 14:00:39 +00:00
Juergen Ributzka	92359248ea	[Stackmaps] Move the target-independent frame index elimination for stackmaps and patchpoints into target-specific code. The lowering of the frame index for stackmaps and patchpoints requires some target-specific magic and should therefore be handled in the target-specific eliminateFrameIndex method. This is related to <rdar://problem/16106219> llvm-svn: 201904	2014-02-21 23:29:32 +00:00
Kevin Qin	e05e6b31e1	[AArch64] Add register constraints to avoid generating STLXR and STXR with unpredictable behavior. llvm-svn: 201841	2014-02-21 07:45:48 +00:00
Rafael Espindola	1f7e9d4bed	Rename a few more DataLayout variables. llvm-svn: 201833	2014-02-21 01:53:35 +00:00
Benjamin Kramer	fdf9b7dac1	Remove unnecessary copy of array_lengthof. llvm-svn: 201798	2014-02-20 17:36:31 +00:00
Oliver Stannard	ce7688d8cc	AArch64: __va_list.__stack must be 8-byte aligned The va_start macro for AArch64 must set va_list.__stack to the address following the last named argument on the stack, rounded up to an alignment of 8 bytes. llvm-svn: 201797	2014-02-20 17:19:26 +00:00
Chad Rosier	ebcee99c02	[AArch64] Add support for TargetTransformInfo Analysis. llvm-svn: 201793	2014-02-20 16:00:08 +00:00
Daniel Sanders	1f73ab934b	[mips] Make it impossible to have UnknownABI in CodeGen and Integrated Assembler. Summary: This removes the need to coerce UnknownABI to the default ABI (O32 for MIPS32, N64 for MIPS64 []) in both MipsSubtarget and MipsAsmParser. Clang has been updated to disable both possible default ABI's before enabling the ABI it intends to use. [] N64 being the default for MIPS64 is not actually correct. However N32 is not fully implemented/tested yet. Depends on: D2830 Reviewers: jacksprat, matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D2832 Differential Revision: http://llvm-reviews.chandlerc.com/D2846 llvm-svn: 201792	2014-02-20 14:58:19 +00:00
NAKAMURA Takumi	122c55ae1d	[CMake] Move intrinsics_gen to lib/Target out of add_public_tablegen_target. add_public_tablegen_target is used somewhere. llvm-svn: 201787	2014-02-20 13:42:30 +00:00
Daniel Sanders	742e6aefa1	[mips] Make mips64 the default CPU for the mips64 architecture Summary: This is consistent with the integrated assembler. All mips64 codegen tests previously passed -mcpu. Removed -mcpu from blez_bgez.ll and const-mult.ll to cover the default case. Ideally, the two implementations of selectMipsCPU() will be merged but it's proven difficult to find a home for the function that doesn't cause link errors. For now, we'll hoist the common functionality into a function and mark it with FIXME's. Reviewers: jacksprat, matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D2830 llvm-svn: 201782	2014-02-20 13:13:33 +00:00
Craig Topper	80c9d78b97	[x86] Switch PAUSE instruction to use XS prefix instead of HasREPPrefix. Remove HasREPPrefix support from disassembler table generator since its now only used by CodeGenOnly instructions. llvm-svn: 201767	2014-02-20 07:59:43 +00:00
Elena Demikhovsky	f4fb18943a	AVX-512: Fixed compilation issue llvm-svn: 201761	2014-02-20 07:00:10 +00:00
Elena Demikhovsky	af8e1ef280	AVX-512: Assembly parsing of broadcast semantic in AVX-512; imlemented by Nis Zinovy (zinovy.y.nis@intel.com) Fixed truncate i32 to i1; a test will be provided in the next commit. llvm-svn: 201757	2014-02-20 06:34:39 +00:00
Reed Kotler	325c6f3182	Make one statement easier to understand from post commmit feedback from a review of the previous patch that introduced this week. llvm-svn: 201723	2014-02-19 22:11:45 +00:00
Roman Divacky	1a91fd1bdc	Expand 64bit {SHL,SHR,SRA}_PARTS on sparcv9. llvm-svn: 201718	2014-02-19 21:35:39 +00:00
Rafael Espindola	60133f3afe	move getNameWithPrefix and getSymbol to TargetMachine. TargetLoweringBase is implemented in CodeGen, so before this patch we had a dependency fom Target to CodeGen. This would show up as a link failure of llvm-stress when building with -DBUILD_SHARED_LIBS=ON. This fixes pr18900. llvm-svn: 201711	2014-02-19 20:30:41 +00:00
Rafael Espindola	aea6192f20	Add back r201608, r201622, r201624 and r201625 r201608 made llvm corretly handle private globals with MachO. r201622 fixed a bug in it and r201624 and r201625 were changes for using private linkage, assuming that llvm would do the right thing. They all got reverted because r201608 introduced a crash in LTO. This patch includes a fix for that. The issue was that TargetLoweringObjectFile now has to be initialized before we can mangle names of private globals. This is trivially true during the normal codegen pipeline (the asm printer does it), but LTO has to do it manually. llvm-svn: 201700	2014-02-19 17:23:20 +00:00
Christian Pirker	1d938fb7c4	Test commit - remove the new line to lib/Target/AArch64/AArch64TargetMachine.cpp. llvm-svn: 201698	2014-02-19 16:58:28 +00:00
Daniel Sanders	b1311110b0	[mips] In the integrated assembler, select the default feature bits by changing the CPU value. This is consistent with the way CodeGen acheives this. However, CodeGen always selects mips32 (even when the architecture is mips64). llvm-svn: 201694	2014-02-19 16:13:26 +00:00
Christian Pirker	be6d8a86fb	Test commit - added a new line to lib/Target/AArch64/AArch64TargetMachine.cpp. llvm-svn: 201692	2014-02-19 16:07:32 +00:00
Daniel Sanders	a4b8f677a1	[mips] Use llvm::Triple in ParseMipsTriple() instead of manually parsing it No functional change. llvm-svn: 201689	2014-02-19 15:55:21 +00:00
Daniel Sanders	a5d02b49e6	[mips] Remove unused NotN64 predicate llvm-svn: 201682	2014-02-19 15:16:47 +00:00
Cameron McInally	7173a45caf	Fix AVX512 vector sqrt assembly strings. llvm-svn: 201681	2014-02-19 15:16:09 +00:00
Daniel Jasper	bf4e7d8ac3	Revert r201622 and r201608. This causes the LLVMgold plugin to segfault. More information on the replies to r201608. llvm-svn: 201669	2014-02-19 12:26:01 +00:00
Tim Northover	1b102abe53	X86 CodeGenPrep: sink shufflevectors before shifts On x86, shifting a vector by a scalar is significantly cheaper than shifting a vector by another fully general vector. Unfortunately, because SelectionDAG operates on just one basic block at a time, the shufflevector instruction that reveals whether the right-hand side of a shift is really a scalar is often not visible to CodeGen when it's needed. This adds another handler to CodeGenPrepare, to sink any useful shufflevector instructions down to the basic block where they're used, predicated on a target hook (since on other architectures, doing so will often just introduce extra real work). rdar://problem/16063505 llvm-svn: 201655	2014-02-19 10:02:43 +00:00
Craig Topper	de3c74571e	Remove special FP opcode maps and instead add enough MRM_XX formats to handle all the FP operations. This increases format by 1 bit, but decreases opcode map by 1 bit so the TSFlags size doesn't change. llvm-svn: 201649	2014-02-19 08:25:02 +00:00
Craig Topper	b56c73e1c5	Reduce size of map field in X86 TSFlags since it now requires less bits. llvm-svn: 201646	2014-02-19 07:29:07 +00:00
Craig Topper	7d159c5e98	Put some of the X86 formats in a more logical order. llvm-svn: 201645	2014-02-19 06:59:13 +00:00
Craig Topper	5b20c52fcc	Remove A6/A7 opcode maps. They can all be handled with a TB map, opcode of 0xa6/0xa7, and adding MRM_C0/MRM_E0 forms. Removes 376K from the disassembler tables. llvm-svn: 201641	2014-02-19 05:34:21 +00:00
Rafael Espindola	d39a573c72	Fix PR18743. The IR @foo = private constant i32 42 is valid, but before this patch we would produce an invalid MachO from it. It was invalid because it would use an L label in a section where the liker needs the labels in order to atomize it. One way of fixing it would be to just reject this IR in the backend, but that would not be very front end friendly. What this patch does is use an 'l' prefix in sections that we know the linker requires symbols for atomizing them. This allows frontends to just use private and not worry about which sections they go to or how the linker handles them. One small issue with this strategy is that now a symbol name depends on the section, which is not available before codegen. This is not a problem in practice. The reason is that it only happens with private linkage, which will be ignored by the non codegen users (llvm-nm and llvm-ar). llvm-svn: 201608	2014-02-18 22:24:57 +00:00
Rafael Espindola	c898de3245	Rename a DebugLoc variable to DbgLoc and a DataLayout to DL. This is quiet a bit less confusing now that TargetData was renamed DataLayout. llvm-svn: 201606	2014-02-18 22:05:46 +00:00
Ana Pazos	9cdade7a3e	[AArch64] Expanded sin, cos, pow with FP vector types inputs llvm-svn: 201601	2014-02-18 20:31:05 +00:00
Robert Lytton	3f025fc96b	XCore target: Handle common linkage llvm-svn: 201563	2014-02-18 11:21:59 +00:00
Robert Lytton	73848eb640	XCore target: addMemOperand as necessary BuildMI instructions were not including MachineMemOperand information. This was discovered by 'SingleSource/Benchmarks/Stanford/Oscar' failing due to a FrameIndex load incorrectly being hoisted by postra-machine-licm. No other tests have been found to fail. llvm-svn: 201562	2014-02-18 11:21:53 +00:00
Robert Lytton	296ff43f53	XCore target: Fix llvm.eh.return and EH info register handling llvm-svn: 201561	2014-02-18 11:21:48 +00:00
Tim Northover	83bbdcb246	GlobalMerge: move "-global-merge" option to the pass itself. It's rather odd to have the flag enabling and disabling this pass only affect a single target. llvm-svn: 201559	2014-02-18 11:17:29 +00:00
Tim Northover	448249fd73	X86: use vpsllvd (& friends) for 16-bit shifts on Haswell llvm-svn: 201558	2014-02-18 11:15:32 +00:00
Craig Topper	947a05e5c4	Add PS prefix to some classes I missed in r201538. llvm-svn: 201551	2014-02-18 08:24:22 +00:00
Craig Topper	b5b81fb98b	Add a bunch of OpSize32 tags to 64-bit mode only instructions to match their 32-bit mode counterparts for cases where there is also a OpSize16 instruction. llvm-svn: 201550	2014-02-18 08:18:29 +00:00
Elena Demikhovsky	8091d0ad88	AVX-512: Fixed size of mask registers llvm-svn: 201546	2014-02-18 07:52:26 +00:00
Jiangning Liu	9508c695c8	Fix a typo about lowering AArch64 va_copy. llvm-svn: 201541	2014-02-18 02:37:42 +00:00
Craig Topper	de78f4304d	Add an x86 prefix encoding for instructions that would decode to a different instruction with 0xf2/f3/66 were in front of them, but don't themselves have a prefix. For now this doesn't change any bbehavior, but plan to use it to fix some bugs in the disassembler. llvm-svn: 201538	2014-02-18 00:21:49 +00:00
Kevin Enderby	0a635e7acf	Fix the arm assembler so that this malformed instruction: ldrd r6, r7 [r2, #15] simply gives an error and does not triggers an assertion. As Jim points out, the diagnostic is really strange here, but fixing that would be more complicated. The missing comma results in the parser expecting a construct like r2[2], which is the vector index thing the error message is talking about. That's not what the user intended, though, and there's nothing else in the instruction that looks at all like a vector. Yet more fallout from not having a real parser here and trying to do context-free generic matching for addressing modes. rdar://15097243 llvm-svn: 201531	2014-02-17 21:45:27 +00:00
Craig Topper	3e74ac0d93	Fix diassembler handling of rex.b when mod=00/01/10 and bbb=101. Mod=00 should ignore the base register entirely. Mod=01/10 should treat this as R13 plus displacment. Fixes PR18860. llvm-svn: 201507	2014-02-17 10:03:43 +00:00
Elena Demikhovsky	0e85630ee2	AVX-512: implemented zext fron i1 to i16 llvm-svn: 201502	2014-02-17 07:29:33 +00:00
Mark Seaborn	a1a8c0677a	Use 16 byte stack alignment for NaCl on ARM NaCl's ARM ABI uses 16 byte stack alignment, so set that in ARMSubtarget.cpp. Using 16 byte alignment exposes an issue in code generation in which a varargs function leaves a 4 byte gap between the values of r1-r3 saved to the stack and the following arguments that were passed on the stack. (Previously, this code only needed to support 4 byte and 8 byte alignment.) With this issue, llc generated: varargs_func: sub sp, sp, #16 push {lr} sub sp, sp, #12 add r0, sp, #16 // Should be 20 stm r0, {r1, r2, r3} ldr r0, .LCPI0_0 // Address of va_list add r1, sp, #16 str r1, [r0] bl external_func Fix the bug by checking for "Align > 4". Also simplify the code by using OffsetToAlignment(), and update comments. Differential Revision: http://llvm-reviews.chandlerc.com/D2677 llvm-svn: 201497	2014-02-16 18:59:48 +00:00
Rafael Espindola	ed0a04d469	Remove dead code, we already require cmake 2.8.8. llvm-svn: 201495	2014-02-16 14:36:26 +00:00
Elena Demikhovsky	2cdad2b3d4	AVX-512: simpyfied BUILD_VECTOR for masks; fixed cmp/test sequence llvm-svn: 201487	2014-02-16 11:34:23 +00:00
Saleem Abdulrasool	f0e7fa2121	ARM IAS: (partially) support .arch_extension directive This adds a partial implementation of the .arch_extension directive to the integrated ARM assembler. There are a number of limitations to this implementation arising from the target backend support rather than the implementation itself. Namely, iWMMXT (v1 and v2), Maverick, and XScale support is not present in the ARM backend. Currently, there is no check for A-class only (needed for virt), and no ARMv6k detection (needed for os and sec). The remainder of the extensions are fully supported. llvm-svn: 201471	2014-02-16 00:16:41 +00:00
Craig Topper	17b586d7c5	Add opcode extension forms of MOV8ri/MOV16ri/MOV32ri. llvm-svn: 201463	2014-02-15 07:29:18 +00:00
Reed Kotler	22855ad786	This patch has two main functions: 1) Fix a specific bug when certain conversion functions are called in a program compiled as mips16 with hard float and the program is linked as c++. There are two libraries that are reversed in the link order with gcc/g++ and clang/clang++ for mips16 in this case and the proper stubs will then not be called. These stubs are normally handled in the Mips16HardFloat pass but in this case we don't know at that time that we need to generate the stubs. This must all be handled later in code generation and we have moved this functionality to MipsAsmPrinter. When linked as C (gcc or clang) the proper stubs are linked in from libc. 2) Set up the infrastructure to handle 90% of what is in the Mips16HardFloat pass in this new area of MipsAsmPrinter. This is a more logical place to handle this and we have known for some time that we needed to move the code later and not implement it using inline asm as we do now but it was not clear exactly where to do this and what mechanism should be used. Now it's clear to us how to do this and this patch contains the infrastructure to move most of this to MipsAsmPrinter but the actual moving will be done in a follow on patch. The same infrastructure is used to fix this current bug as described in #1. This change was requested by the list during the original putback of the Mips16HardFloat pass but was not practical for us do at that time. llvm-svn: 201426	2014-02-14 19:16:39 +00:00
Artyom Skrobov	10498a713c	Generate the DWARF stack frame decode operations in the function prologue for ARM/Thumb functions. Patch by Keith Walker! llvm-svn: 201423	2014-02-14 17:19:07 +00:00
Kevin Qin	fa58a631ae	[AArch64 NEON] Fix a bug to avoid using floating type as condition type in lowering SELECT_CC. llvm-svn: 201395	2014-02-14 09:41:15 +00:00
Jiangning Liu	5da69caef9	Enable AArch64 NEON by default. llvm-svn: 201385	2014-02-14 04:38:09 +00:00
Hao Liu	022a50cb21	[AArch64]Fix the assertion failure caused by "v1i1 SETCC" DAG node. As v1i1 is illegal, the type legalizer tries to scalarize such node. But if the type operands of SETCC is legal, the scalarization algorithm will cause an assertion failure. llvm-svn: 201381	2014-02-14 02:21:56 +00:00
Juergen Ributzka	40016f4730	[X86] Don't mark movabsq as cheap-as-move - it isn't that cheap. A simple register copy on X86 is just 3 bytes, whereas movabsq is a 10 byte instruction. Marking movabsq as not beeing cheap will allow LICM to move it out of the loop and it also prevents unnecessary rematerializations if the value is needed in more than one register. llvm-svn: 201377	2014-02-14 00:51:13 +00:00
Tom Stellard	988925aeae	R600/SI: Expand all v8[if]32 operations llvm-svn: 201371	2014-02-13 23:34:15 +00:00
Tom Stellard	309a624102	R600/SI: Add a pattern for i32 anyext Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 201370	2014-02-13 23:34:13 +00:00
Tom Stellard	4b0c3551df	R600/SI: Completely Disable TypeRewriter on compute llvm-svn: 201369	2014-02-13 23:34:12 +00:00
Tom Stellard	4447febe55	R600/SI: Split global vector loads with more than 4 elements llvm-svn: 201368	2014-02-13 23:34:10 +00:00
Daniel Sanders	7a3a160940	Re-commit: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call Summary: AsmPrinter::EmitInlineAsm() will no longer use the EmitRawText() call for targets with mature MC support. Such targets will always parse the inline assembly (even when emitting assembly). Targets without mature MC support continue to use EmitRawText() for assembly output. The hasRawTextSupport() check in AsmPrinter::EmitInlineAsm() has been replaced with MCAsmInfo::UseIntegratedAs which when true, causes the integrated assembler to parse inline assembly (even when emitting assembly output). UseIntegratedAs is set to true for targets that consider any failure to parse valid assembly to be a bug. Target specific subclasses generally enable the integrated assembler in their constructor. The default value can be overridden with -no-integrated-as. All tests that rely on inline assembly supporting invalid assembly (for example, those that use mnemonics such as 'foo' or 'hello world') have been updated to disable the integrated assembler. Changes since review (and last commit attempt): - Fixed test failures that were missed due to configuration of local build. (fixes crash.ll and a couple others). - Fixed tests that happened to pass because the local build was on X86 (should fix 2007-12-17-InvokeAsm.ll) - mature-mc-support.ll's should no longer require all targets to be compiled. (should fix ARM and PPC buildbots) - Object output (-filetype=obj and similar) now forces the integrated assembler to be enabled regardless of default setting or -no-integrated-as. (should fix SystemZ buildbots) Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2686 llvm-svn: 201333	2014-02-13 14:44:26 +00:00
Tim Northover	5e3ca32797	ARM: remove floating-point patterns for @llvm.arm.neon.vabs The front-end is now generating the generic @llvm.fabs for this operation now, so the extra patterns are no longer needed. llvm-svn: 201314	2014-02-13 10:44:30 +00:00
Oliver Stannard	f7cc40a705	Add Cortex-A53 and Cortex-A57 cores to the AArch64 backend llvm-svn: 201305	2014-02-13 09:46:11 +00:00
Hao Liu	386fc0d8ae	[AArch64]Fix the problems that can't select mul/add/sub of v1i8/v1i16/v1i32 types. As this problems are similar to shl/sra/srl, also add patterns for shift nodes. llvm-svn: 201298	2014-02-13 05:42:33 +00:00
Hao Liu	ee04163cfe	[AArch64]Add support for spilling FPR8/FPR16. llvm-svn: 201287	2014-02-13 02:36:58 +00:00
Andrea Di Biagio	594ea331ef	[Vectorizer] Add a new 'OperandValueKind' in TargetTransformInfo called 'OK_NonUniformConstValue' to identify operands which are constants but not constant splats. The cost model now allows returning 'OK_NonUniformConstValue' for non splat operands that are instances of ConstantVector or ConstantDataVector. With this change, targets are now able to compute different costs for instructions with non-uniform constant operands. For example, On X86 the cost of a vector shift may vary depending on whether the second operand is a uniform or non-uniform constant. This patch applies the following changes: - The cost model computation now takes into account non-uniform constants; - The cost of vector shift instructions has been improved in X86TargetTransformInfo analysis pass; - BBVectorize, SLPVectorizer and LoopVectorize now know how to distinguish between non-uniform and uniform constant operands. Added a new test to verify that the output of opt '-cost-model -analyze' is valid in the following configurations: SSE2, SSE4.1, AVX, AVX2. llvm-svn: 201272	2014-02-12 23:43:47 +00:00
Andrea Di Biagio	b682c0a265	[X86] Teach the backend how to lower vector shift left into multiply rather than scalarizing it. Instead of expanding a packed shift into a sequence of scalar shifts, the backend now tries (when possible) to convert the vector shift into a vector multiply. Before this change, a shift of a MVT::v8i16 vector by a build_vector of constants was always scalarized into a long sequence of "vector extracts + scalar shifts + vector insert". With this change, if there is SSE2 support, we emit a single vector multiply. This change also affects SSE4.1, AVX, AVX2 shifts: - A shift of a MVT::v4i32 vector by a build_vector of non uniform constants is now lowered when possible into a single SSE4.1 vector multiply. - Packed v16i16 shift left by constant build_vector are now expanded when possible into a single AVX2 vpmullw. This change also improves the lowering of AVX512f vector shifts. Added test CodeGen/X86/vec_shift6.ll with some code examples that are affected by this change. llvm-svn: 201271	2014-02-12 23:42:28 +00:00
Daniel Sanders	656c4d360b	Revert r201237+r201238: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call It introduced multiple test failures in the buildbots. llvm-svn: 201241	2014-02-12 15:39:20 +00:00

1 2 3 4 5 ...

27379 Commits