llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 12:02:58 +02:00

Author	SHA1	Message	Date
Justin Holewinski	d5636664a4	[NVPTX] Add GenericToNVVM IR converter to better handle idiomatic LLVM IR inputs This converter currently only handles global variables in address space 0. For these variables, they are promoted to address space 1 (global memory), and all uses are updated to point to the result of a cvta.global instruction on the new variable. The motivation for this is address space 0 global variables are illegal since we cannot declare variables in the generic address space. Instead, we place the variables in address space 1 and explicitly convert the pointer to address space 0. This is primarily intended to help new users who expect to be able to place global variables in the default address space. llvm-svn: 182254	2013-05-20 12:13:32 +00:00
Justin Holewinski	fda22b94b1	[NVPTX] Fix i1 kernel parameters and global variables. ABI rules say we need to use .u8 for i1 parameters for kernels. llvm-svn: 182253	2013-05-20 12:13:28 +00:00
Stepan Dyatkovskiy	adb91e7ed9	PR15868 fix. Introduction: When stack alignment is 8 and the size of the parameter part held in GPRs is not N*8, we add padding to the GPRs part, so the part's last byte must be recovered at address K*8-1. We need to do this because the remaining (stack) part of the parameter starts at address K*8, and we need to "attach" the "GPRs head" to it without gaps: Stack: |---- 8 bytes block ----| |---- 8 bytes block ----| |---- 8 bytes... [ [padding] [GPRs head] ] [ ------ Tail passed via stack ------ ... FIX: Note, once we have added padding we need to correct *all* Arg offsets that come after the padded one. That's why we need this fix: Arg offsets were never corrected before this patch. See the new test cases included in the patch. We also don't need to insert padding for byval parameters that are stored in GPRs only. We need to pad only the last byval parameter, and only when it extends beyond the GPRs and stack alignment = 8. However, the stack area allocated for recovered byval params must satisfy the "Size mod 8 = 0" restriction. This patch reduces stack usage in some cases: we can reduce ArgRegsSaveArea, since inner byval params of size N*4 bytes may be "packed" with alignment 4 in some cases. llvm-svn: 182237	2013-05-20 08:01:34 +00:00
Jakob Stoklund Olesen	dad0022424	Also expand 64-bit bitcasts. llvm-svn: 182229	2013-05-20 01:01:43 +00:00
Jakob Stoklund Olesen	e033165fd4	Implement spill and fill of I64Regs. llvm-svn: 182228	2013-05-20 00:53:25 +00:00
Jakob Stoklund Olesen	f777e6d131	Mark i64 SETCC as expand so it is turned into a SELECT_CC. llvm-svn: 182227	2013-05-20 00:28:36 +00:00
Benjamin Kramer	1e23dfa473	Replace some bit operations with simpler ones. No functionality change. llvm-svn: 182226	2013-05-19 22:01:57 +00:00
Jakob Stoklund Olesen	50d419ac93	Don't use %g0 to materialize 0 directly. The wired physreg doesn't work on tied operands like on MOVXCC. Add a README note to fix this later. llvm-svn: 182225	2013-05-19 21:47:13 +00:00
Jakob Stoklund Olesen	f4ec84c2d4	Select i64 values with %icc conditions. llvm-svn: 182224	2013-05-19 20:38:21 +00:00
Jakob Stoklund Olesen	1948cf9ca7	Add floating point selects on %xcc predicates. llvm-svn: 182222	2013-05-19 20:33:11 +00:00
Jakob Stoklund Olesen	c53b3587a3	Implement SPselectfcc for i64 operands. Also clean up the arguments to all the MOVCC instructions so the operands always are (true-val, false-val, cond-code). llvm-svn: 182221	2013-05-19 20:20:54 +00:00
Venkatraman Govindaraju	d432ed013f	[Sparc] Rearrange integer registers' allocation order so that register allocator will use I and G registers before using L and O registers. Also, enable registers %g2-%g4 to be used in application and %g5 in 64 bit mode. llvm-svn: 182219	2013-05-19 20:07:20 +00:00
Jakob Stoklund Olesen	702a7c68c9	Handle i64 FrameIndex nodes in SPARC v9 mode. llvm-svn: 182216	2013-05-19 19:14:24 +00:00
Hal Finkel	d4eb9291fa	Check InlineAsm clobbers in PPCCTRLoops We don't need to reject all inline asm as using the counter register (most does not). Only those that explicitly clobber the counter register need to prevent the transformation. llvm-svn: 182191	2013-05-18 09:20:39 +00:00
Tim Northover	5256d08973	AArch64: add CMake dependency to fix very parallel builds llvm-svn: 182190	2013-05-18 08:17:47 +00:00
David Majnemer	a21386b571	X86: Bad peephole interaction between adc, MOV32r0 The peephole tries to reorder MOV32r0 instructions such that they are before the instruction that modifies EFLAGS. The problem is that the peephole does not consider the case where the instruction that modifies EFLAGS also depends on the previous state of EFLAGS. Instead, walk backwards until we find an instruction that has a def for EFLAGS but does not have a use. If we find such an instruction, insert the MOV32r0 before it. If it cannot find such an instruction, skip the optimization. llvm-svn: 182184	2013-05-18 01:02:03 +00:00
Matt Arsenault	118196f0ca	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
JF Bastien	cbcaf8db77	Support unaligned load/store on more ARM targets This patch matches GCC behavior: the code used to only allow unaligned load/store on ARM for v6+ Darwin, it will now allow unaligned load/store for v6+ Darwin as well as for v7+ on Linux and NaCl. The distinction is made because v6 doesn't guarantee support (but LLVM assumes that Apple controls hardware+kernel and therefore have conformant v6 CPUs), whereas v7 does provide this guarantee (and Linux/NaCl behave sanely). The patch keeps the -arm-strict-align command line option, and adds -arm-no-strict-align. They behave similarly to GCC's -mstrict-align and -mnostrict-align. I originally encountered this discrepancy in FastIsel tests which expect unaligned load/store generation. Overall this should slightly improve performance in most cases because of reduced I$ pressure. llvm-svn: 182175	2013-05-17 23:49:01 +00:00
Rafael Espindola	aabd77b198	Fix the build in c++11 mode. The errors were: non-constant-expression cannot be narrowed from type 'int64_t' (aka 'long') to 'uint32_t' (aka 'unsigned int') in initializer list and non-constant-expression cannot be narrowed from type 'long' to 'uint32_t' (aka 'unsigned int') in initializer list llvm-svn: 182168	2013-05-17 22:45:52 +00:00
Vincent Lejeune	c8aad4509a	R600: Lower int_load_input to copyFromReg instead of Register node It solves a bug uncovered by dot4 patch where the register class of int_load_input use was ignored. llvm-svn: 182130	2013-05-17 16:51:06 +00:00
Vincent Lejeune	5a2e018ab6	R600: Use bottom up scheduling algorithm llvm-svn: 182129	2013-05-17 16:50:56 +00:00
Vincent Lejeune	2bf65b1826	R600: Use depth first scheduling algorithm It should increase PV substitution opportunities and lower gpr usage (pending computations path are "flushed" sooner) llvm-svn: 182128	2013-05-17 16:50:44 +00:00
Vincent Lejeune	a140216a0a	R600: Replace big texture opcode switch in scheduler by usesTC/usesVC llvm-svn: 182127	2013-05-17 16:50:37 +00:00
Vincent Lejeune	152473c61c	R600: Relax some vector constraints on Dot4. Dot4 now uses 8 scalar operands instead of 2 vector ones, which allows the register coalescer to remove some unneeded COPYs. This patch also defines some structures/functions that can be used to handle every vector instruction (CUBE, Cayman special instructions...) in a similar fashion. llvm-svn: 182126	2013-05-17 16:50:32 +00:00
Vincent Lejeune	0c663b698a	R600: Improve texture handling llvm-svn: 182125	2013-05-17 16:50:20 +00:00
Vincent Lejeune	b57cb76b6d	R600: Rename 128 bit registers. Almost all instructions that take a 128-bit reg as input (fetch, export...) are able to swizzle their argument and output. Instead of printing the default swizzle for each 128-bit reg, rename T.XYZW to T and let instructions print potentially optimized swizzles themselves. llvm-svn: 182124	2013-05-17 16:50:09 +00:00
Vincent Lejeune	d391d51989	R600: Some factorization llvm-svn: 182123	2013-05-17 16:50:02 +00:00
Vincent Lejeune	bf991c018d	R600: Factorize Fetch size limit inside AMDGPUSubTarget llvm-svn: 182122	2013-05-17 16:49:55 +00:00
Vincent Lejeune	d39a89783b	R600: prettier dump of clamp llvm-svn: 182121	2013-05-17 16:49:49 +00:00
Tom Stellard	a4cc081e08	R600: Fix encoding for R600 family GPUs Reviewed-by: Vincent Lejeune <vljn@ovi.com> https://bugs.freedesktop.org/show_bug.cgi?id=64193 https://bugs.freedesktop.org/show_bug.cgi?id=64257 https://bugs.freedesktop.org/show_bug.cgi?id=64320 NOTE: This is a candidate for the 3.3 branch. llvm-svn: 182113	2013-05-17 15:23:21 +00:00
Tom Stellard	b91da0601d	R600: Pass MCSubtargetInfo reference to R600CodeEmitter llvm-svn: 182112	2013-05-17 15:23:12 +00:00
Venkatraman Govindaraju	989fb74a1c	[Sparc] Implements hasReservedCallFrame and hasFP. This is to generate correct framesetup code when the function has variable sized allocas. llvm-svn: 182108	2013-05-17 15:14:34 +00:00
Benjamin Kramer	e9efb3252f	X86: Make shuffle -> shift conversion more aggressive about undefs. Shuffles that only move an element into position 0 of the vector are common in the output of the loop vectorizer and often generate suboptimal code when SSSE3 is not available. Lower them to vector shifts if possible. We still prefer palignr over psrldq because it has higher throughput on sandybridge. llvm-svn: 182102	2013-05-17 14:48:34 +00:00
Ulrich Weigand	e299bf2813	[PowerPC] Fix hi/lo encoding in old-style code emitter This patch implements the equivalent change to r182091/r182092 in the old-style code emitter. Instead of having two separate 16-bit immediate encoding routines depending on the instruction, this patch introduces a single encoder that checks the machine operand flags to decide whether the low or high half of a symbol address is required. Since now both encoders make no further distinction between "symbolLo" and "symbolHi", the .td operand can now use a single getS16ImmEncoding method. Tested by running the old-style JIT tests on 32-bit Linux. llvm-svn: 182097	2013-05-17 14:14:12 +00:00
Ulrich Weigand	24cfcf8f49	[PowerPC] Merge/rename PPC fixup types Now that fixup_ppc_ha16 and fixup_ppc_lo16 are being treated exactly the same everywhere, it no longer makes sense to have two fixup types. This patch merges them both into a single type fixup_ppc_half16, and renames fixup_ppc_lo16_ds to fixup_ppc_half16ds for consistency. (The half16 and half16ds names are taken from the description of relocation types in the PowerPC ABI.) No change in code generation expected. llvm-svn: 182092	2013-05-17 12:37:21 +00:00
Ulrich Weigand	89ebba5af6	[PowerPC] Fix processing of ha16/lo16 fixups The current PowerPC MC back end distinguishes between fixup_ppc_ha16 and fixup_ppc_lo16, which are determined by the instruction the fixup applies to, and uses this distinction to decide whether a fixup ought to resolve to the high or the low part of a symbol address. This isn't quite correct, however. It is valid -if unusual- assembler to use, e.g. li 1, symbol@ha or lis 1, symbol@l Whether the high or the low part of the address is used depends solely on the @ suffix, not on the instruction. In addition, both li 1, symbol and lis 1, symbol are valid, assuming the symbol address fits into 16 bits; again, both will then refer to the actual symbol value (so li will load the value itself, while lis will load the value shifted by 16). To fix this, two places need to be adapted. If the fixup cannot be resolved at assembler time, a relocation needs to be emitted via PPCELFObjectWriter::getRelocType. This routine already looks at the VK_ type to determine the relocation. The only problem is that it will reject any _LO modifier in a ha16 fixup and vice versa. This is simply incorrect; any of those modifiers ought to be accepted for either fixup type. If the fixup can be resolved at assembler time, adjustFixupValue currently selects the high bits of the symbol value if the fixup type is ha16. Again, this is incorrect; see the above example lis 1, symbol Now, in theory we'd have to respect a VK_ modifier here. However, in fact common code never even attempts to resolve symbol references using any nontrivial VK_ modifier at assembler time; it will always fall back to emitting a reloc and letting the linker handle it. If this ever changes, presumably there'd have to be a target callback to resolve VK_ modifiers. We'd then have to handle @ha etc. there. llvm-svn: 182091	2013-05-17 12:36:29 +00:00
Benjamin Kramer	5781bb221a	Don't cast away constness. llvm-svn: 182086	2013-05-17 11:39:41 +00:00
Christian Konig	34f0d6eaf2	R600/SI: return undef instead of null for skipped arguments This is a candidate for the stable branch. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=64694 Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182084	2013-05-17 09:46:48 +00:00
Venkatraman Govindaraju	62af2fad30	[Sparc] Prevent instructions that define or use %o7 from being placed in a call's delay slot. llvm-svn: 182063	2013-05-16 23:53:29 +00:00
Akira Hatanaka	3848727973	[mips] Improve instruction selection for pattern (store (fp_to_sint $src), $ptr). Previously, three instructions were needed: trunc.w.s $f0, $f2 mfc1 $4, $f0 sw $4, 0($2) Now we need only two: trunc.w.s $f0, $f2 swc1 $f0, 0($2) llvm-svn: 182053	2013-05-16 21:17:15 +00:00
Rafael Espindola	21effc7220	Remove addFrameMove. Now that we have good testing, remove addFrameMove and create cfi instructions directly. llvm-svn: 182052	2013-05-16 21:02:15 +00:00
Akira Hatanaka	af33bc35ec	[mips] Factor out unaligned store lowering code. llvm-svn: 182050	2013-05-16 20:45:17 +00:00
Jack Carter	0c75cd3c10	Mips assembler: Add TwoOperandConstraint definitions This patch removes alias definition for addiu $rs,$imm and instead uses the TwoOperandAliasConstraint field in the ArithLogicI instruction class. This way all instructions that inherit ArithLogicI class have the same macro defined. The usage examples are added to test files. Patch by Vladimir Medic llvm-svn: 182048	2013-05-16 20:24:27 +00:00
Jack Carter	27313685a3	Mips td file formatting: white space and long lines llvm-svn: 182047	2013-05-16 20:08:49 +00:00
Hal Finkel	80fddc9af7	Create a new preheader in PPCCTRLoops to avoid counter register clobbers Some IR-level instructions (such as FP <-> i64 conversions) are not chained w.r.t. the mtctr intrinsic and yet may become function calls that clobber the counter register. At the selection-DAG level, these might be reordered with the mtctr intrinsic causing miscompiles. To avoid this situation, if an existing preheader has instructions that might use the counter register, create a new preheader for the mtctr intrinsic. This extra block will be remerged with the old preheader at the MI level, but will prevent unwanted reordering at the selection-DAG level. llvm-svn: 182045	2013-05-16 19:58:38 +00:00
Akira Hatanaka	ba455f200e	[mips] Test case for r182042. Add comment. llvm-svn: 182044	2013-05-16 19:57:23 +00:00
Akira Hatanaka	fe7cc5cbd7	[mips] Fix instruction selection pattern for sint_to_fp node to avoid emitting an invalid instruction sequence. Rather than emitting an int-to-FP move instruction and an int-to-FP conversion instruction during instruction selection, we emit a pseudo instruction which gets expanded post-RA. Without this change, register allocation can possibly insert a floating point register move instruction between the two instructions, which is not valid according to the ISA manual. mtc1 $f4, $4 # int-to-fp move instruction. mov.s $f2, $f4 # move contents of $f4 to $f2. cvt.s.w $f0, $f2 # int-to-fp conversion. llvm-svn: 182042	2013-05-16 19:48:37 +00:00
Jack Carter	8986125dda	Mips assembler: Add branch macro definitions This patch adds bnez and beqz instructions which represent alias definitions for bne and beq instructions as follows: bnez $rs,$imm => bne $rs,$zero,$imm beqz $rs,$imm => beq $rs,$zero,$imm The corresponding test cases are added. Patch by Vladimir Medic llvm-svn: 182040	2013-05-16 19:40:19 +00:00
Akira Hatanaka	8513e9139f	[mips] Fix indentation. llvm-svn: 182036	2013-05-16 18:42:42 +00:00
Akira Hatanaka	8857b3a45c	[mips] Delete unused enum value. llvm-svn: 182035	2013-05-16 18:40:12 +00:00
Ulrich Weigand	7b22c7a38a	[PowerPC] Use true offset value in "memrix" machine operands This is the second part of the change to always return "true" offset values from getPreIndexedAddressParts, tackling the case of "memrix" type operands. This is about instructions like LD/STD that only have a 14-bit field to encode immediate offsets, which are implicitly extended by two zero bits by the machine, so that in effect we can access 16-bit offsets as long as they are a multiple of 4. The PowerPC back end currently handles such instructions by carrying the 14-bit value (as it will get encoded into the actual machine instructions) in the machine operand fields for such instructions. This means that those values are in fact not the true offset, but rather the offset divided by 4 (and then truncated to an unsigned 14-bit value). Like in the case fixed in r182012, this makes common code operations on such offset values not work as expected. Furthermore, there doesn't really appear to be any strong reason why we should encode machine operands this way. This patch therefore changes the encoding of "memrix" type machine operands to simply contain the "true" offset value as a signed immediate value, while enforcing the rules that it must fit in a 16-bit signed value and must also be a multiple of 4. This change must be made simultaneously in all places that access machine operands of this type. However, just about all those changes make the code simpler; in many cases we can now just share the same code for memri and memrix operands. llvm-svn: 182032	2013-05-16 17:58:02 +00:00
Hal Finkel	7daa616e35	PPC32 cannot form counter loops around i64 FP conversions On PPC32, i64 FP conversions are implemented using runtime calls (which clobber the counter register). These must be excluded. llvm-svn: 182023	2013-05-16 16:52:41 +00:00
Aaron Ballman	42af887d8c	Fixing a 64-bit conversion warning in MSVC. llvm-svn: 182018	2013-05-16 16:03:36 +00:00
Rafael Espindola	4c7120e048	Remove dead calls to addFrameMove. Without a PROLOG_LABEL present, the cfi instructions are never printed. llvm-svn: 182016	2013-05-16 15:08:37 +00:00
Ulrich Weigand	08228b8354	[PowerPC] Report true displacement value from getPreIndexedAddressParts DAGCombiner::CombineToPreIndexedLoadStore calls a target routine to decompose a memory address into a base/offset pair. It expects the offset (if constant) to be the true displacement value in order to perform optional additional optimizations; in particular, to convert other uses of the original pointer into uses of the new base pointer after pre-increment. The PowerPC implementation of getPreIndexedAddressParts, however, simply calls SelectAddressRegImm, which returns a TargetConstant. This value is appropriate for encoding into the instruction, but it is not always usable as the true displacement value: - Its type is always MVT::i32, even on 64-bit, where addresses ought to be i64 ... this causes the optimization to simply always fail on 64-bit due to this line in DAGCombiner: // FIXME: In some cases, we can be smarter about this. if (Op1.getValueType() != Offset.getValueType()) { - Its value is truncated to an unsigned 16-bit value if negative. This causes the above optimization to generate wrong code. This patch fixes both problems by simply returning the true displacement value (in its original type). This doesn't affect any other user of the displacement. llvm-svn: 182012	2013-05-16 14:53:05 +00:00
Richard Sandiford	cb335bb295	[SystemZ] Tweak register array comment llvm-svn: 182007	2013-05-16 13:39:02 +00:00
Patrik Hagglund	a0ea76e714	Removed unused variable, detected by gcc -Wunused-but-set-variable. Leftover from r181979. llvm-svn: 181993	2013-05-16 08:37:22 +00:00
Rafael Espindola	537821c785	Delete dead code. llvm-svn: 181982	2013-05-16 04:59:17 +00:00
Rafael Espindola	24bf7876c2	Don't call addFrameMove on XCore. getExceptionHandlingType is not ExceptionHandling::DwarfCFI on xcore, so getFrameInstructions is never called. There is no point creating cfi instructions if they are never used. llvm-svn: 181979	2013-05-16 04:16:25 +00:00
Rafael Espindola	92a3518a62	Removed dead code. llvm-svn: 181975	2013-05-16 03:34:58 +00:00
Reed Kotler	fb71c30979	Patch number 2 for mips16/32 floating point interoperability stubs. This creates stubs that help Mips32 functions call Mips16 functions which have floating point parameters that are normally passed in floating point registers. llvm-svn: 181972	2013-05-16 02:17:42 +00:00
Derek Schuff	8af06f2ba6	Revert "Support unaligned load/store on more ARM targets" This reverts r181898. llvm-svn: 181944	2013-05-15 23:07:43 +00:00
Rafael Espindola	e1f815ebd9	Delete dead code. llvm-svn: 181941	2013-05-15 22:27:35 +00:00
Hal Finkel	ae8f6158eb	undef setjmp in PPCCTRLoops Trying to unbreak the VS build by copying some undef code from Utils/LowerInvoke.cpp. llvm-svn: 181938	2013-05-15 22:20:24 +00:00
David Majnemer	8ce4c34d1d	X86: Remove redundant test instructions Increase the number of instructions LLVM recognizes as setting the ZF flag. This allows us to remove test instructions that redundantly recalculate the flag. llvm-svn: 181937	2013-05-15 22:03:08 +00:00
Hal Finkel	91bd48d046	Implement PPC counter loops as a late IR-level pass The old PPCCTRLoops pass, like the Hexagon pass version from which it was derived, could only handle some simple loops in canonical form. We cannot directly adapt the new Hexagon hardware loops pass, however, because the Hexagon pass contains a fundamental assumption that non-constant-trip-count loops will contain a guard, and this is not always true (the result being that incorrect negative counts can be generated). With this commit, we replace the pass with a late IR-level pass which makes use of SE to calculate the backedge-taken counts and safely generate the loop-count expressions (including any necessary max() parts). This IR level pass inserts custom intrinsics that are lowered into the desired decrement-and-branch instructions. The most fragile part of this new implementation is that interfering uses of the counter register must be detected on the IR level (and, on PPC, this also includes any indirect branches in addition to function calls). Also, to make all of this work, we need a variant of the mtctr instruction that is marked as having side effects. Without this, machine-code level CSE, DCE, etc. illegally transform the resulting code. Hopefully, this can be improved in the future. This new pass is smaller than the original (and much smaller than the new Hexagon hardware loops pass), and can handle many additional cases correctly. In addition, the preheader-creation code has been copied from LoopSimplify, and after we decide on where it belongs, this code will be refactored so that it can be explicitly shared (making this implementation even smaller). The new test-case files ctrloop-{le,lt,ne}.ll have been adapted from tests for the new Hexagon pass. There are a few classes of loops that this pass does not transform (noted by FIXMEs in the files), but these deficiencies can be addressed within the SE infrastructure (thus helping many other passes as well). llvm-svn: 181927	2013-05-15 21:37:41 +00:00
Rafael Espindola	5ef2f39b92	Cleanup relocation sorting for ELF. We want the order to be deterministic on all platforms. NAKAMURA Takumi fixed that in r181864. This patch is just two small cleanups: * Move the function to the cpp file. It is only passed to array_pod_sort. * Remove the ppc implementation which is now redundant llvm-svn: 181910	2013-05-15 18:22:01 +00:00
NAKAMURA Takumi	04ff46b750	PPCISelLowering.h: Escape \@ in comments. [-Wdocumentation] llvm-svn: 181907	2013-05-15 18:01:35 +00:00
NAKAMURA Takumi	9f1d12c5aa	Whitespace. llvm-svn: 181906	2013-05-15 18:01:28 +00:00
Derek Schuff	d07b32ae79	Support unaligned load/store on more ARM targets This patch matches GCC behavior: the code used to only allow unaligned load/store on ARM for v6+ Darwin, it will now allow unaligned load/store for v6+ Darwin as well as for v7+ on other targets. The distinction is made because v6 doesn't guarantee support (but LLVM assumes that Apple controls hardware+kernel and therefore have conformant v6 CPUs), whereas v7 does provide this guarantee (and Linux behaves sanely). Overall this should slightly improve performance in most cases because of reduced I$ pressure. Patch by JF Bastien llvm-svn: 181897	2013-05-15 16:08:30 +00:00
Ulrich Weigand	0325b4909d	[PowerPC] Remove need for adjustFixupOffset hack Now that applyFixup understands differently-sized fixups, we can define fixup_ppc_lo16/fixup_ppc_lo16_ds/fixup_ppc_ha16 to properly be 2-byte fixups, applied at an offset of 2 relative to the start of the instruction text. This has the benefit that if we actually need to generate a real relocation record, its address will come out correctly automatically, without having to fiddle with the offset in adjustFixupOffset. Tested on both 64-bit and 32-bit PowerPC, using external and integrated assembler. llvm-svn: 181894	2013-05-15 15:07:06 +00:00
Richard Sandiford	1ccc224047	[SystemZ] Make use of SUBTRACT HALFWORD Thanks to Ulrich Weigand for noticing that this instruction was missing. llvm-svn: 181893	2013-05-15 15:05:29 +00:00
Ulrich Weigand	08f27cb2c6	[PowerPC] Correctly handle fixups of other than 4 byte size The PPCAsmBackend::applyFixup routine handles the case where a fixup can be resolved within the same object file. However, this routine is currently hard-coded to assume the size of any fixup is always exactly 4 bytes. This is sort-of correct for fixups on instruction text; even though it only works because several of what really would be 2-byte fixups are presented as 4-byte fixups instead (requiring another hack in PPCELFObjectWriter::adjustFixupOffset to clean it up). However, this assumption breaks down completely for fixups on data, which legitimately can be of any size (1, 2, 4, or 8). This patch makes applyFixup aware of fixups of varying sizes, introducing a new helper routine getFixupKindNumBytes (along the lines of what the ARM back end does). Note that in order to handle fixups of size 8, we also need to fix the return type of adjustFixupValue to uint64_t to avoid truncation. Tested on both 64-bit and 32-bit PowerPC, using external and integrated assembler. llvm-svn: 181891	2013-05-15 15:01:46 +00:00
Richard Sandiford	85e457ca32	[SystemZ] Add more future work items to the README Based on an analysis by Ulrich Weigand. llvm-svn: 181882	2013-05-15 12:53:31 +00:00
Arnold Schwaighofer	6df028f783	ARM ISel: Don't create illegal types during LowerMUL The transformation happening here is that we want to turn a "mul(ext(X), ext(X))" into a "vmull(X, X)", stripping off the extension. We have to make sure that X still has a valid vector type - possibly recreate an extension to a smaller type. In case of an extload of a memory type smaller than 64 bit we used to create an ext(load()). The problem with doing this - instead of recreating an extload - is that an illegal type is exposed. This patch fixes this by creating extloads instead of ext(load()) sequences. Fixes PR15970. radar://13871383 llvm-svn: 181842	2013-05-14 22:33:24 +00:00
Bill Schmidt	b5b390b792	Implement the PowerPC system call (sc) instruction. Instruction added at request of Roman Divacky. Tested via asm-parser. llvm-svn: 181821	2013-05-14 19:35:45 +00:00
Jyotsna Verma	a1b968ec45	Hexagon: Pass to replace transfer/copy instructions with combine instructions where possible. llvm-svn: 181817	2013-05-14 18:54:06 +00:00
Eric Christopher	ad3c788829	Reapply "Subtract isn't commutative, fix this for MMX psub." with a somewhat randomly chosen cpu that will minimize cpu specific differences on bots. llvm-svn: 181814	2013-05-14 18:33:40 +00:00
Eric Christopher	9db4fb70b3	Temporarily revert "Subtract isn't commutative, fix this for MMX psub." It's causing failures on the atom bot. llvm-svn: 181812	2013-05-14 18:20:42 +00:00
Eric Christopher	e3d05e6f9d	Subtract isn't commutative, fix this for MMX psub. Patch by Andrea DiBiagio. llvm-svn: 181809	2013-05-14 17:52:05 +00:00
Jyotsna Verma	980fae33f3	Hexagon: Add patterns to generate 'combine' instructions. llvm-svn: 181805	2013-05-14 17:16:38 +00:00
Jyotsna Verma	86beda7e47	Hexagon: ArePredicatesComplement should not restrict itself to TFRs. llvm-svn: 181803	2013-05-14 16:36:34 +00:00
Bill Schmidt	8c1e12a2ff	PPC32: Fix stack collision between FP and CR save areas. The changes to CR spill handling missed a case for 32-bit PowerPC. The code in PPCFrameLowering::processFunctionBeforeFrameFinalized() checks whether CR spill has occurred using a flag in the function info. This flag is only set by storeRegToStackSlot and loadRegFromStackSlot. spillCalleeSavedRegisters does not call storeRegToStackSlot, but instead produces MI directly. Thus we don't see the CR is spilled when assigning frame offsets, and the CR spill ends up colliding with some other location (generally the FP slot). This patch sets the flag in spillCalleeSavedRegisters for PPC32 so that the CR spill is properly detected and gets its own slot in the stack frame. llvm-svn: 181800	2013-05-14 16:08:32 +00:00
Jyotsna Verma	8bbeb8be87	Hexagon: Remove dead-code after unconditional return from addPreSched2. llvm-svn: 181797	2013-05-14 15:33:27 +00:00
Tom Stellard	5f6b8d7e47	R600/SI: Add processor type for Hainan asic Patch by: Alex Deucher Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181792	2013-05-14 14:42:56 +00:00
Richard Sandiford	77796bbc7d	[SystemZ] Add disassembler support llvm-svn: 181777	2013-05-14 10:17:52 +00:00
Richard Sandiford	4850b2341b	[SystemZ] Rework handling of constant PC-relative operands The GNU assembler treats things like: brasl %r14, 100 in the same way as: brasl %r14, .+100 rather than as a branch to absolute address 100. We implemented this in LLVM by creating an immediate operand rather than the usual expr operand, and by handling immediate operands specially in the code emitter. This was undesirable for (at least) three reasons: - the specialness of immediate operands was exposed to the backend MC code, rather than being limited to the assembler parser. - in disassembly, an immediate operand really is an absolute address. (Note that this means reassembling printed disassembly can't recreate the original code.) - it would interfere with any assembly manipulation that we might try in future. E.g. operations like branch shortening can change the relative position of instructions, but any code that updates sym+offset addresses wouldn't update an immediate "100" operand in the same way as an explicit ".+100" operand. This patch changes the implementation so that the assembler creates a "." label for immediate PC-relative operands, so that the operand to the MCInst is always the absolute address. The patch also adds some error checking of the offset. llvm-svn: 181773	2013-05-14 09:47:26 +00:00
Richard Sandiford	67070ba7fc	[SystemZ] Remove bogus isAsmParserOnly Marking instructions as isAsmParserOnly stops them from being disassembled. However, in cases where separate asm and codegen versions exist, we actually want to disassemble to the asm ones. No functional change intended. llvm-svn: 181772	2013-05-14 09:38:07 +00:00
Richard Sandiford	e46cf32917	[SystemZ] Match operands to fields by name rather than by order The SystemZ port currently relies on the order of the instruction operands matching the order of the instruction field lists. This isn't desirable for disassembly, where the two are matched only by name. E.g. the R1 and R2 fields of an RR instruction should have corresponding R1 and R2 operands. The main complication is that addresses are compound operands, and as far as I know there is no mechanism to allow individual suboperands to be selected by name in "let Inst{...} = ..." assignments. Luckily it doesn't really matter though. The SystemZ instruction encoding groups all address fields together in a predictable order, so it's just as valid to see the entire compound address operand as a single field. That's the approach taken in this patch. Matching by name in turn means that the operands to COPY SIGN and CONVERT TO FIXED instructions can be given in natural order. (It was easier to do this at the same time as the rename, since otherwise the intermediate step was too confusing.) No functional change intended. llvm-svn: 181771	2013-05-14 09:36:44 +00:00
Richard Sandiford	a64d960eda	[SystemZ] Match operands to fields by name rather than by order The SystemZ port currently relies on the order of the instruction operands matching the order of the instruction field lists. This isn't desirable for disassembly, where the two are matched only by name. E.g. the R1 and R2 fields of an RR instruction should have corresponding R1 and R2 operands. The main complication is that addresses are compound operands, and as far as I know there is no mechanism to allow individual suboperands to be selected by name in "let Inst{...} = ..." assignments. Luckily it doesn't really matter though. The SystemZ instruction encoding groups all address fields together in a predictable order, so it's just as valid to see the entire compound address operand as a single field. That's the approach taken in this patch. Matching by name in turn means that the operands to COPY SIGN and CONVERT TO FIXED instructions can be given in natural order. (It was easier to do this at the same time as the rename, since otherwise the intermediate step was too confusing.) No functional change intended. llvm-svn: 181769	2013-05-14 09:28:21 +00:00
Reed Kotler	8e03705e31	Fix typo. llvm-svn: 181759	2013-05-14 06:00:01 +00:00
Reed Kotler	a1fa48c538	Removed an unnamed namespace and forgot to make two of the functions inside "static". llvm-svn: 181754	2013-05-14 02:13:45 +00:00
Reed Kotler	cade566d36	This is the first of three patches which create stubs used for Mips16/32 floating point interoperability. When Mips16 code calls external functions that would normally have some of their parameters or return values passed in floating point registers, it needs (Mips32) helper functions to do this because while in Mips16 mode there is no ability to access the floating point registers. In Pic mode, this is done with a set of predefined functions in libc. This case is already handled in llvm for Mips16. In static relocation mode, for efficiency reasons, the compiler generates stubs that the linker will use if it turns out that the external function is a Mips32 function. (If it's Mips16, then it does not need the helper stubs). These stubs are identically named and the linker knows about these tricks and will not create multiple copies and will delete them if they are not needed. llvm-svn: 181753	2013-05-14 02:00:24 +00:00
Jack Carter	c0737806ad	Mips assembler: Assembler macro ADDIU $rs,imm This patch adds an alias for the addiu instruction which enables the following syntax: addiu $rs,imm The macro is translated as: addiu $rs,$rs,imm Contributor: Vladimir Medic llvm-svn: 181729	2013-05-13 20:26:46 +00:00
Bill Schmidt	d57de4ce17	Fix goofy commentary in PPCTargetObjectFile.cpp. llvm-svn: 181725	2013-05-13 19:40:36 +00:00
Bill Schmidt	c7fd4630b3	PPC64: Constant initializers with dynamic relocations go in .data.rel.ro. This fixes warning messages observed in the oggenc application test in projects/test-suite. Special handling is needed for the 64-bit PowerPC SVR4 ABI when a constant is initialized with a pointer to a function in a shared library. Because a function address is implemented as the address of a function descriptor, the use of copy relocations can lead to problems with initialization. GNU ld therefore replaces copy relocations with dynamic relocations to be resolved by the dynamic linker. This means the constant cannot reside in the read-only data section, but instead belongs in .data.rel.ro, which is designed for constants containing dynamic relocations. The implementation creates a class PPC64LinuxTargetObjectFile inheriting from TargetLoweringObjectFileELF, which behaves like its parent except to place constants of this sort into .data.rel.ro. The test case is reduced from the oggenc application. llvm-svn: 181723	2013-05-13 19:34:37 +00:00
Akira Hatanaka	aaa3035d45	[mips] Add option -mno-ldc1-sdc1. This option is used when the user wants to avoid emitting double precision FP loads and stores. Double precision FP loads and stores are expanded to single precision instructions after register allocation. llvm-svn: 181718	2013-05-13 18:23:35 +00:00
Akira Hatanaka	4ac9f3dc5b	[mips] Define a helper function which creates an instruction with the same operands as the prototype instruction but with a different opcode. llvm-svn: 181714	2013-05-13 17:57:42 +00:00
Akira Hatanaka	083fece809	[mips] Rename functions. No functionality changes. llvm-svn: 181713	2013-05-13 17:43:19 +00:00
Rafael Espindola	44f36ace35	Remove unused fields and arguments. llvm-svn: 181706	2013-05-13 14:34:48 +00:00
Mihai Popa	7a2df35483	The purpose of the patch is to fix the syntax of ARM mrc and mrc2 instructions when they are used to write to the APSR. In this case, the destination operand should be APSR_nzcv, and the encoding of the target should be 0b1111 (same as for PC). In pre-UAL syntax, this form used the PC register as a textual target. This is still allowed for backward compatibility. llvm-svn: 181705	2013-05-13 14:10:04 +00:00
Lang Hames	9019f5804f	Correctly preserve the input chain for potential tailcall nodes whose return values are bitcasts. The chain had previously been being clobbered with the entry node to the dag, which sometimes caused other code in the function to be erroneously deleted when tailcall optimization kicked in. <rdar://problem/13827621> llvm-svn: 181696	2013-05-13 10:21:19 +00:00
Duncan Sands	526a10c53f	Suppress GCC compiler warnings in release builds about variables that are only read in asserts. llvm-svn: 181689	2013-05-13 07:50:47 +00:00
Rafael Espindola	237980d752	Remove the MachineMove class. It was just a less powerful and more confusing version of MCCFIInstruction. A side effect is that, since MCCFIInstruction uses dwarf register numbers, calls to getDwarfRegNum are pushed out, which should allow further simplifications. I left the MachineModuleInfo::addFrameMove interface unchanged since this patch was already fairly big. llvm-svn: 181680	2013-05-13 01:16:13 +00:00
Rafael Espindola	245de3a31f	Change getFrameMoves to return a const reference. To add a frame now there is a dedicated addFrameMove which also takes care of constructing the move itself. llvm-svn: 181657	2013-05-11 02:38:11 +00:00
Reed Kotler	88fbecdc6f	Check-in of the first of several patches to finish implementation of mips16/mips32 floating point interoperability. This patch fixes returns from mips16 functions so that if the function was in fact called by a mips32 hard float routine, then values that would have been returned in floating point registers are so returned. Mips16 mode has no floating point instructions so there is no way to load values into floating point registers. This is needed when returning float, double, single complex, double complex in the Mips ABI. Helper functions in libc for mips16 are available to do this. For efficiency purposes, these helper functions have a different calling convention from normal Mips calls. Registers v0,v1,a0,a1 are used to pass parameters instead of a0,a1,a2,a3. This is because v0,v1,a0,a1 are the natural registers used to return floating point values in soft float. These values can then be moved to the appropriate floating point registers with no extra cost. The only register that is modified is ra in this call. The helper functions make sure that the return values are in the floating point registers that they would be in if soft float was not in effect (which it is for mips16, though the soft float is implemented using a mips32 library that uses hard float). llvm-svn: 181641	2013-05-10 22:25:39 +00:00
Jyotsna Verma	9c76c6e3f4	Fix unused variable error. Earlier, this variable was used in an assert and was causing failure on darwin. llvm-svn: 181630	2013-05-10 21:44:02 +00:00
Jyotsna Verma	f20d85d4ad	Hexagon: Fix switch statements in GetDotOldOp and IsNewifyStore. No functionality change. llvm-svn: 181628	2013-05-10 20:58:11 +00:00
Jyotsna Verma	2dfc0b2d13	Hexagon: Fix switch cases in HexagonVLIWPacketizer.cpp. llvm-svn: 181624	2013-05-10 20:27:34 +00:00
Rafael Espindola	4e52b3900b	Fix the R600 build. llvm-svn: 181621	2013-05-10 18:31:42 +00:00
Chad Rosier	7da7292b4e	[ms-inline asm] Fix a crasher when we fail on a direct match. The issue was that the MatchingInlineAsm and VariantID args to the MatchInstructionImpl function weren't being set properly. Specifically, when parsing intel syntax, the parser thought it was parsing inline assembly in the at&t dialect; that will never be the case. The crash was caused when the emitter tried to emit the instruction, but the operands weren't set. When parsing inline assembly we only set the opcode, not the operands, which is used to lookup the instruction descriptor. rdar://13854391 and PR15945 Also, this commit reverts r176036. Now that we're correctly parsing the intel syntax the pushad/popad don't match properly. I've reimplemented that fix using a MnemonicAlias. llvm-svn: 181620	2013-05-10 18:24:17 +00:00
Rafael Espindola	d05c5e1727	Remove unused argument. llvm-svn: 181618	2013-05-10 18:16:59 +00:00
Rafael Espindola	4f14a6f0b8	Remove unused function. llvm-svn: 181606	2013-05-10 16:53:12 +00:00
Logan Chien	d5b8ea6c58	Implement AsmParser for ARM unwind directives. This commit implements the AsmParser for fnstart, fnend, cantunwind, personality, handlerdata, pad, setfp, save, and vsave directives. This commit fixes some minor issue in the ARMELFStreamer: * The switch back to corresponding section after the .fnend directive. * Emit the unwind opcode while processing .fnend directive if there is no .handlerdata directive. * Emit the unwind opcode to .ARM.extab while processing .handlerdata even if .personality directive does not exist. llvm-svn: 181603	2013-05-10 16:17:24 +00:00
Tom Stellard	7edf38bf1f	R600: Remove AMDILPeeopholeOptimizer and replace optimizations with tablegen patterns The BFE optimization was the only one we were actually using, and it was emitting an intrinsic that we don't support. https://bugs.freedesktop.org/show_bug.cgi?id=64201 Reviewed-by: Christian König <christian.koenig@amd.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181580	2013-05-10 02:09:45 +00:00
Tom Stellard	ed363c57b2	R600: Expand SUB for v2i32/v4i32 Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181579	2013-05-10 02:09:39 +00:00
Tom Stellard	3ca3d250c6	R600: Expand MUL for v4i32/v2i32 Fixes piglit test for OpenCL builtin mul24, and allows mad24 to run. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181578	2013-05-10 02:09:34 +00:00
Tom Stellard	56fef8261c	R600: Expand SRA for v4i32/v2i32 v2: Add v4i32 test Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181577	2013-05-10 02:09:29 +00:00
Tom Stellard	5d4a5a0d37	R600: Expand vselect for v4i32 and v2i32 v2: Add vselect v4i32 test Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181576	2013-05-10 02:09:24 +00:00
Chad Rosier	1c5336508d	[x86AsmParser] It's valid to stop parsing an operand at an immediate. rdar://13854369 and PR15944 llvm-svn: 181564	2013-05-09 23:48:53 +00:00
Bill Wendling	59bb428cc9	Generate a compact unwind encoding in the face of a stack alignment push. We generate a `push' of a random register (%rax) if the stack needs to be aligned by the size of that register. However, this could mess up compact unwind generation. In particular, we want to still generate compact unwind in the presence of this monstrosity. Check if the push is of the %rax/%eax register. If it is and it's marked with the `FrameSetup' flag, then we can generate a compact unwind encoding for the function only if the push is the last FrameSetup instruction. llvm-svn: 181540	2013-05-09 20:10:38 +00:00
Jyotsna Verma	c849e91c17	Hexagon: Remove switch cases from GetDotNewPredOp and isPostIncrement functions. No functionality change. llvm-svn: 181535	2013-05-09 19:16:07 +00:00
Jyotsna Verma	15d448ba0c	Hexagon: Use relation map for getMatchingCondBranchOpcode() and getInvertedPredicatedOpcode() functions instead of switch cases. llvm-svn: 181530	2013-05-09 18:25:44 +00:00
Bill Wendling	76d1441b1e	Simplify the code a bit. The compact unwind registers were defined in two different places. It's better just to place them in the function that uses them and specify that this is a 64-bit or 32-bit machine. No functionality change. llvm-svn: 181529	2013-05-09 18:21:45 +00:00
Richard Osborne	7504cb9f47	[XCore] Fix handling of functions where only the LR is spilled. Previously we only checked if the LR required saving if the frame size was non zero. However, because the caller reserves 1 word for the callee to use that doesn't count towards our frame size, it is possible for the LR to need saving and for the frame size to be 0. We didn't hit this when the LR needed saving because of a function call, because the 1 word of stack we must allocate for our callee means the frame size is always non zero in this case. However we can hit this case if the LR is clobbered in inline asm. llvm-svn: 181520	2013-05-09 16:43:42 +00:00
Akira Hatanaka	a1d814e7b8	[mips] Add instruction selection pattern for (seteq $LHS, 0). llvm-svn: 181459	2013-05-08 19:38:04 +00:00
Roman Divacky	d1f5c02d87	Remove unused isLegalAddressImmediate() method. llvm-svn: 181452	2013-05-08 17:51:39 +00:00
Ulrich Weigand	1c76f9fa7c	[PowerPC] Fix regression in generating @ha/@l relocs The patch I committed as revision 167864 introduced a regression that causes LLVM to no longer generate appropriate relocs for @ha/@l symbol references (but fail an assertion instead). This is fixed here by re-enabling support for the VK_PPC_GAS_HA16/ VK_PPC_GAS_LO16 variant kinds (and their Darwin variants) in PPCELFObjectWriter.cpp. Tested by running projects/test-suite in -m32 mode with the integrated assembler forced on. A standalone test case will be committed shortly as well. llvm-svn: 181450	2013-05-08 17:50:07 +00:00
Bill Schmidt	7f1a2b5212	Fix handling of anonymous aggregate parameters for powerpc*-apple-darwin8. This fixes bug 15821 similarly to the powerpc64-linux fix for bug 14779. Patch by David Fang. llvm-svn: 181449	2013-05-08 17:22:33 +00:00
Stepan Dyatkovskiy	13432c8af4	For r181148: fixed warning 'enumeral and non-enumeral type in conditional expression'. llvm-svn: 181437	2013-05-08 14:51:27 +00:00
Hal Finkel	1636de8210	PPCInstrInfo::optimizeCompareInstr should not optimize FP compares The floating-point record forms on PPC don't set the condition register bits based on a comparison with zero (like the integer record forms do), but rather based on the exception status bits. llvm-svn: 181423	2013-05-08 12:16:14 +00:00
Preston Gurd	ed3b81e028	Corrected Atom latencies for SSE SQRT instructions. llvm-svn: 181346	2013-05-07 19:57:34 +00:00
Jyotsna Verma	37863260ff	Hexagon: Fix Small Data support to handle -G 0 correctly. llvm-svn: 181344	2013-05-07 19:53:00 +00:00
Hal Finkel	1725b5191f	Cleanup PPCInstrInfo::optimizeCompareInstr Implement suggestions by Bill Schmidt in post-commit review. No functionality change intended. llvm-svn: 181338	2013-05-07 17:49:55 +00:00
Jyotsna Verma	5307666fe8	Reverting r181331. Missing file, HexagonSplitConst32AndConst64.cpp, from lib/Target/Hexagon/CMakeLists.txt. llvm-svn: 181334	2013-05-07 17:12:35 +00:00
Jyotsna Verma	af0c734e1b	Hexagon: Fix Small Data support to handle -G 0 correctly. llvm-svn: 181331	2013-05-07 16:42:15 +00:00
Jyotsna Verma	71c6bf55f2	Hexagon: Set accessSize and addrMode on all load/store instructions. llvm-svn: 181324	2013-05-07 15:06:29 +00:00
Michael Kuperstein	25056babb2	Re-enable AVX detection on x64 platforms. llvm-svn: 181313	2013-05-07 14:05:33 +00:00
Richard Sandiford	a9580c775a	[SystemZ] Fix InitMCCodeGenInfo call createSystemZMCCodeGenInfo was not passing the optimization level to InitMCCodeGenInfo(), so -O0 would be ignored. Fixes DebugInfo/namespace.ll after the changes in r181271. llvm-svn: 181312	2013-05-07 12:56:31 +00:00
Tom Stellard	3a335f24af	R600/SI: Add intrinsic for MIMG IMAGE_GET_RESINFO opcode Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181269	2013-05-06 23:02:19 +00:00
Tom Stellard	57e8e4e921	R600/SI: Handle arbitrary destination type in SITargetLowering::adjustWritemask Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181268	2013-05-06 23:02:15 +00:00
Tom Stellard	d2ec929c52	R600/SI: Add intrinsic for texture image loading Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181267	2013-05-06 23:02:12 +00:00
Tom Stellard	4ed2501894	R600/SI: Add pattern for uint_to_fp Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181266	2013-05-06 23:02:07 +00:00
Tom Stellard	7d53018f9b	R600/SI: Add patterns for integer maxima / minima Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181265	2013-05-06 23:02:04 +00:00
Tom Stellard	2c5ed6e6ce	R600/SI: Add pattern for AMDGPU.trunc intrinsic Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181263	2013-05-06 23:02:00 +00:00
Krzysztof Parzyszek	b05e065c7b	Print IR from Hexagon MI passes with -print-before/after-all. llvm-svn: 181255	2013-05-06 21:58:00 +00:00
Krzysztof Parzyszek	1f0992737e	Cleanup of the HexagonTargetMachine setup. llvm-svn: 181250	2013-05-06 21:25:45 +00:00
Jyotsna Verma	0ec07a2dbc	Hexagon: Add multiclass/encoding bits for the New-Value Jump instructions. llvm-svn: 181235	2013-05-06 18:49:23 +00:00
Krzysztof Parzyszek	afd38d8a5a	Make references to HexagonTargetMachine "const". llvm-svn: 181233	2013-05-06 18:38:37 +00:00
Tom Stellard	740d847e2c	R600: Remove dead code from the CodeEmitter v2 v2: - Replace switch statement with TSFlags query Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181229	2013-05-06 17:50:57 +00:00
Tom Stellard	fb8e73f3af	R600: Emit config values in register / value pairs Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181228	2013-05-06 17:50:51 +00:00
Tom Stellard	6c3f6e1b02	R600: Stop emitting the instruction type byte before each instruction Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181225	2013-05-06 17:50:44 +00:00
Tom Stellard	ebe049fd75	R600: Emit ISA for CALL_FS_* instructions Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181223	2013-05-06 17:50:26 +00:00
Ulrich Weigand	558cec9d9b	[SystemZ] Add back end This adds the actual lib/Target/SystemZ target files necessary to implement the SystemZ target. Note that at this point, the target cannot yet be built since the configure bits are missing. Those will be provided shortly by a follow-on patch. This version of the patch incorporates feedback from reviews by Chris Lattner and Anton Korobeynikov. Thanks to all reviewers! Patch by Richard Sandiford. llvm-svn: 181203	2013-05-06 16:15:19 +00:00
Ulrich Weigand	d9194d871d	[PowerPC] Fix memory corruption in AsmParser As pointed out by Evgeniy Stepanov, assigning a std::string temporary to a StringRef is not a good idea. Rework MatchRegisterName to avoid using the .lower routine. llvm-svn: 181192	2013-05-06 11:16:57 +00:00
Evan Cheng	4d42f5b1d5	ARM AnalyzeBranch should conservatively return true when it sees a predicated indirect branch at the end of the BB. Otherwise the if-converter or branch folding pass may incorrectly update its successor info if it considers BB as falling through to the next BB. rdar://13782395 llvm-svn: 181161	2013-05-05 18:06:32 +00:00
Richard Osborne	816f899c45	[XCore] Add LDAPB instructions. With the change the disassembler now supports the XCore ISA in its entirety. llvm-svn: 181155	2013-05-05 13:36:53 +00:00
Richard Osborne	f5ff5a7e6f	[XCore] Update LDAP to use pcrel_imm. llvm-svn: 181154	2013-05-05 13:33:10 +00:00
Richard Osborne	0777489aca	[XCore] Rename calltarget -> pcrel_imm. No functionality change. llvm-svn: 181153	2013-05-05 13:29:02 +00:00
Richard Osborne	6600501755	[XCore] Add BLRB instructions. llvm-svn: 181152	2013-05-05 13:24:16 +00:00
Richard Osborne	4a26a37a60	[XCore] Remove '-' from back branch asm syntax. Instead operands are treated as negative immediates where the sign bit is implicit in the instruction encoding. llvm-svn: 181151	2013-05-05 13:20:22 +00:00
Stepan Dyatkovskiy	c06cd03f6e	For ARM backend, fixed "byval" attribute support. Now even small structures (small enough to be stored in GPRs) can be passed with byval. In regression tests the following function prototypes are checked: PR15293: %artz = type { i32 } define void @foo(%artz* byval %s) define void @foo2(%artz* byval %s, i32 %p, %artz* byval %s2) foo: "s" stored in R0 foo2: "s" stored in R0, "s2" stored in R2. The following AAPCS rules are checked: 5.5 Parameters Passing, C.4 and C.5, "ParamSize" is parameter size in 32bit words: -- NSAA != 0, NCRN < R4 and NCRN+ParamSize > R4. Parameter should be sent to the stack; NCRN := R4. -- NSAA != 0, and NCRN < R4, NCRN+ParamSize < R4. Parameter stored in GPRs; NCRN += ParamSize. llvm-svn: 181148	2013-05-05 07:48:36 +00:00
David Majnemer	d5ba4da281	Remove a recently redundant transform from X86ISelLowering. X86ISelLowering has support to treat: (icmp ne (and (xor %flags, -1), (shl 1, flag)), 0) as if it were actually: (icmp eq (and %flags, (shl 1, flag)), 0) However, r179386 has code at the InstCombine level to handle this. llvm-svn: 181145	2013-05-05 02:00:10 +00:00
Dmitri Gribenko	82c92dc3dd	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Nadav Rotem	8c293c00fc	whitespace llvm-svn: 181137	2013-05-04 23:27:32 +00:00
Nadav Rotem	1330c4432b	Fix an odd comment. llvm-svn: 181136	2013-05-04 23:24:56 +00:00
Tim Northover	390335941f	AArch64: enable MCJIT and tests now that everything passes. This removes dire warnings about AArch64 being unsupported and enables the tests when appropriate on this platform. llvm-svn: 181135	2013-05-04 20:14:22 +00:00
Tim Northover	7f0690a028	AArch64: implement first relocation required for MCJIT R_AARCH64_PCREL32 is present in even trivial .eh_frame sections and so is required to compile any function without the "nounwind" attribute. This change implements very basic infrastructure in the RuntimeDyldELF file and allows (for example) the test-shift.ll MCJIT test to pass on AArch64. llvm-svn: 181131	2013-05-04 20:13:59 +00:00
Tim Northover	33a7a57446	Build system changes to enable MCJIT on AArch64 These changes just allow AArch64 to take part in the MCJIT world when built correctly. llvm-svn: 181130	2013-05-04 20:13:52 +00:00
Richard Osborne	49987866b1	Fix buildbot failure on 64 bit linux due to std::max() having different operand types. llvm-svn: 181128	2013-05-04 17:41:01 +00:00
Richard Osborne	0eea664ede	[XCore] Remove unused operand type. llvm-svn: 181127	2013-05-04 17:30:05 +00:00
Richard Osborne	9a767b4821	[XCore] Make use of the target independent global address offset folding. This lets us remove some custom code that matched constant offsets from globals at instruction selection time as a special addressing mode. No intended functionality change. llvm-svn: 181126	2013-05-04 17:24:33 +00:00
Richard Osborne	2935de3971	[XCore] Simplify code that checks for an aligned base plus a constant. The code now makes use of ComputeMaskedBits, SelectionDAG::isBaseWithConstantOffset and TargetLowering::isGAPlusOffset where appropriate reducing the amount of logic needed in XCoreISelLowering. No intended functionality change. llvm-svn: 181125	2013-05-04 17:17:10 +00:00
Richard Osborne	22dc54e15d	[XCore] Move lowering of thread local storage to a separate pass. Thread local storage is not supported by the XMOS linker so we handle thread local variables by lowering the variable to an array of n elements (where n is the number of hardware threads per core, currently 8 for all XMOS devices) indexed by the current thread ID. Previously this lowering was spread across the XCoreISelLowering and the XCoreAsmPrinter classes. Moving this to a separate pass should be much cleaner. llvm-svn: 181124	2013-05-04 17:01:55 +00:00
Tim Northover	9422845262	AArch64: assert code model is small for TLS accesses Supporting TLS in the large memory model is rather difficult at the moment, so make sure no-one gets into difficulties by mistake. llvm-svn: 181121	2013-05-04 16:54:11 +00:00
Tim Northover	d4f2cac7b6	AArch64: support literal pool access in large memory model. llvm-svn: 181120	2013-05-04 16:54:07 +00:00
Tim Northover	4ef2500d01	AArch64: support large code model for jump-tables llvm-svn: 181119	2013-05-04 16:54:00 +00:00
Tim Northover	ece66eacb2	AArch64: implement support for blockaddress in large code model llvm-svn: 181118	2013-05-04 16:53:53 +00:00
Tim Northover	87645e02c0	AArch64: implement large code model access to global variables. The MOVZ/MOVK instruction sequence may not be the most efficient (a literal-pool load could be better) but adding that would require reinstating the ConstantIslands pass. For now the sequence is correct, and that's enough. Beware, as of commit GNU ld does not appear to support the relocations needed for this. Its primary purpose (for now) will be to support JITed code, since in that case there is no guarantee of where your code will end up in memory relative to external symbols it references. llvm-svn: 181117	2013-05-04 16:53:46 +00:00
Richard Osborne	cb8c7c4310	[XCore] Use static relocation model by default. This allows us to get rid of a hack in XCoreTargetObjectFile where the DataRel* sections were overridden. llvm-svn: 181116	2013-05-04 16:40:58 +00:00
Krzysztof Parzyszek	ab0f078360	Use consistent function names. llvm-svn: 181090	2013-05-04 01:30:49 +00:00
Amara Emerson	036eb4649d	Revert r181009. llvm-svn: 181079	2013-05-03 23:57:17 +00:00
Reed Kotler	b89d9a0181	Remove some unneeded pseudos in the presence of the naked function attribute. llvm-svn: 181072	2013-05-03 23:17:24 +00:00
Ulrich Weigand	6d4495723d	[PowerPC] Avoid using '$' in generated assembler code PowerPC assemblers are supposed to support a stand-alone '$' symbol as an alternative of '.' to refer to the current PC. This does not work in the LLVM assembler parser yet. To avoid bootstrap failures when using the LLVM assembler as system assembler, this patch modifies the assembler source code generated by LLVM to avoid using '$' (and simply use '.' instead). llvm-svn: 181054	2013-05-03 19:53:04 +00:00
Ulrich Weigand	c7ad3c20c4	[PowerPC] Add some Book II instructions to AsmParser This patch adds a couple of Book II instructions (isync, icbi) to the PowerPC assembler parser. These are needed when bootstrapping clang with the integrated assembler forced on, because they are used in inline asm statements in the code base. The test case adds the full list of Book II storage control instructions, including associated extended mnemonics. Again, those that are not yet supported are marked as FIXME. llvm-svn: 181052	2013-05-03 19:51:09 +00:00
Ulrich Weigand	4b44c2d06f	[PowerPC] Support extended mnemonics in AsmParser This patch adds infrastructure to support extended mnemonics in the PowerPC assembler parser. It adds support specifically for those extended mnemonics that LLVM will itself generate. The test case lists all extended mnemonics according to the PowerPC ISA v2.06 Book I, but marks those not yet supported as FIXME. llvm-svn: 181051	2013-05-03 19:50:27 +00:00
Ulrich Weigand	d9b4cff835	[PowerPC] Add assembler parser This adds assembler parser support to the PowerPC back end. The parser will run for any powerpc-- and powerpc64-- triples, but was tested only on 64-bit Linux. The supported syntax is intended to be compatible with the GNU assembler. The parser does not yet support all PowerPC instructions, but it does support anything that is generated by LLVM itself. There is no support for testing restricted instruction sets yet, i.e. the parser will always accept any instructions it knows, no matter what feature flags are given. Instruction operands will be checked for validity and errors generated. (Error handling in general could still be improved.) The patch adds a number of test cases to verify instruction and operand encodings. The tests currently cover all instructions from the following PowerPC ISA v2.06 Book I facilities: Branch, Fixed-point, Floating-Point, and Vector. Note that a number of these instructions are not yet supported by the back end; they are marked with FIXME. A number of follow-on check-ins will add extra features. When they are all included, LLVM passes all tests (including bootstrap) when using clang -cc1as as the system assembler. llvm-svn: 181050	2013-05-03 19:49:39 +00:00
Akira Hatanaka	5f295bccfc	[mips] Split the DSP control register and define one register for each of its fields. This removes false dependencies between DSP instructions which access different fields of the control register. Implicit register operands are added to instructions RDDSP and WRDSP after instruction selection, depending on the value of the mask operand. llvm-svn: 181041	2013-05-03 18:37:49 +00:00
Tom Stellard	2165728987	R600: Expand vector or, shl, srl, and xor nodes llvm-svn: 181035	2013-05-03 17:21:31 +00:00
Tom Stellard	9015d64c2e	R600: BFI_INT is a vector-only instruction llvm-svn: 181034	2013-05-03 17:21:24 +00:00
Tom Stellard	f2fd0109a0	R600: Add pattern for SHA-256 Ma function This can be optimized using the BFI_INT instruction. llvm-svn: 181033	2013-05-03 17:21:20 +00:00
Tom Stellard	00f307d8e4	R600: Clean up comments in Processors.td llvm-svn: 181032	2013-05-03 17:21:14 +00:00
Amara Emerson	863672f436	Add support for reading ARM ELF build attributes. Build attribute sections can now be read if they exist via ELFObjectFile, and the llvm-readobj tool has been extended with an option to dump this information if requested. Regression tests are also included which exercise these features. Also update the docs with a fixed ARM ABI link and a new link to the Addenda which provides the build attributes specification. llvm-svn: 181009	2013-05-03 11:36:35 +00:00
Benjamin Kramer	96ad18d591	X86: Add target description for btver2; make autodetection logic aware of AVX. llvm-svn: 181005	2013-05-03 10:20:08 +00:00
Aaron Ballman	e1ad194ed5	Unbreaking the non-x86 build bots by protecting the AVX test code properly. llvm-svn: 180992	2013-05-03 02:52:21 +00:00
Aaron Ballman	38fdf1efd6	Correctly testing for AVX support in x86 based off code from Hosts.cpp. llvm-svn: 180991	2013-05-03 02:39:21 +00:00
Reid Kleckner	488fd277c1	Fix missing include in Hexagon code for Release+Asserts llvm-svn: 180983	2013-05-03 00:54:56 +00:00
John McCall	a69fa9df25	In MC asm parsing, account for the possibility of whitespace within the "identifier" parsed by the frontend callback by skipping forward until we've consumed a token that ends at the point dictated by the callback. In addition, inform the callback when it's parsing an unevaluated operand (e.g. mov eax, LENGTH A::x) as opposed to an evaluated one (e.g. mov eax, [A::x]). This commit depends on a clang commit. llvm-svn: 180978	2013-05-03 00:15:41 +00:00
Akira Hatanaka	ab6ee99fe0	[mips] Handle reading, writing or copying of ccond field of DSP control register. - Define pseudo instructions which store or load ccond field of the DSP control register. - Emit the pseudos in MipsSEInstrInfo::storeRegToStack and loadRegFromStack. - Expand the pseudos before callee-scan save. - Emit instructions RDDSP or WRDSP to copy between ccond field and GPRs. llvm-svn: 180969	2013-05-02 23:07:05 +00:00
Jyotsna Verma	08d387d6f8	reverting r180953 llvm-svn: 180964	2013-05-02 22:10:59 +00:00

24450 Commits