llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 22:42:46 +02:00

Author	SHA1	Message	Date
Bob Wilson	42f80596ca	pr9367: Add missing predicated BLX instructions. Patch by Jyun-Yan You, with some minor adjustments and a testcase from me. llvm-svn: 126915	2011-03-03 01:41:01 +00:00
Kevin Enderby	58cc960338	Fixes an assertion failure while disassembling ARM rsbs reg/reg form. Patch by Ted Kremenek! llvm-svn: 126895	2011-03-02 23:08:33 +00:00
Renato Golin	967b93c6e3	Fixing a bug when printing fpu text to object file. Patch by Mans Rullgard. llvm-svn: 126882	2011-03-02 21:20:09 +00:00
Tilmann Scheller	c557d1eeb4	Add Win64 thiscall calling convention. llvm-svn: 126862	2011-03-02 19:29:22 +00:00
David Greene	2fd6d03bc9	[AVX] Fix mask predicates for 256-bit UNPCKLPS/D and implement missing patterns for them. Add a SIMD test subdirectory to hold tests for SIMD instruction selection correctness and quality. ' llvm-svn: 126845	2011-03-02 17:23:43 +00:00
Che-Liang Chiou	8ab0f86f1b	ptx: fix lint and compiler warnings llvm-svn: 126838	2011-03-02 07:58:46 +00:00
Che-Liang Chiou	3529b49230	Add 64-bit addressing to PTX backend - Add '64bit' sub-target option. - Select 32-bit/64-bit loads/stores based on '64bit' option. - Fix function parameter order. Patch by Justin Holewinski llvm-svn: 126837	2011-03-02 07:36:48 +00:00
Che-Liang Chiou	2e7bb6da4c	Extend initial support for primitive types in PTX backend - Allow i16, i32, i64, float, and double types, using the native .u16, .u32, .u64, .f32, and .f64 PTX types. - Allow loading/storing of all primitive types. - Allow primitive types to be passed as parameters. - Allow selection of PTX Version and Shader Model as sub-target attributes. - Merge integer/floating-point test cases for load/store. - Use .u32 instead of .s32 to conform to output from NVidia nvcc compiler. Patch by Justin Holewinski llvm-svn: 126824	2011-03-02 03:20:28 +00:00
Duncan Sands	859a335e92	Add datalayout information for the IEEE quad precision fp128 type. llvm-svn: 126780	2011-03-01 20:56:50 +00:00
Bill Wendling	304dda7810	Narrow right shifts need to encode their immediates differently from a normal shift. 16-bit: imm6<5:3> = '001', 8 - <imm> is encded in imm6<2:0> 32-bit: imm6<5:4> = '01',16 - <imm> is encded in imm6<3:0> 64-bit: imm6<5> = '1', 32 - <imm> is encded in imm6<4:0> llvm-svn: 126723	2011-03-01 01:00:59 +00:00
Chris Lattner	871d62dc5b	add a note llvm-svn: 126719	2011-03-01 00:24:51 +00:00
Renato Golin	986151bc09	Fix .fpu printing in ARM assembly, regarding bug http://llvm.org/bugs/show_bug.cgi?id=8931 llvm-svn: 126689	2011-02-28 22:04:27 +00:00
Kevin Enderby	da76779962	Add missing whitespace in the formatting. llvm-svn: 126687	2011-02-28 21:45:12 +00:00
Chris Lattner	355d573721	fix a signed comparison warning. llvm-svn: 126682	2011-02-28 20:50:35 +00:00
David Greene	3bc73b0ae9	[AVX] Add decode support for VUNPCKLPS/D instructions, both 128-bit and 256-bit forms. Because the number of elements in a vector does not determine the vector type (4 elements could be v4f32 or v4f64), pass the full type of the vector to decode routines. llvm-svn: 126664	2011-02-28 19:06:56 +00:00
Kevin Enderby	a1c2ea4ba0	Fix the arm's disassembler for blx that was building an MCInst without the needed two predicate operands before the imm operand. llvm-svn: 126662	2011-02-28 18:46:31 +00:00
Evan Cheng	4e6d375744	Fix a typo which cause dag combine crash. rdar://9059537. llvm-svn: 126661	2011-02-28 18:45:27 +00:00
Stuart Hastings	539d4e1460	Support for byval parameters on ARM. Will be enabled by a forthcoming patch to the front-end. Radar 7662569. llvm-svn: 126655	2011-02-28 17:17:53 +00:00
Kalle Raiskila	cc5b703c81	Add branch hinting for SPU. The implemented algorithm is overly simplistic (just speculate all branches are taken)- this is work in progress. llvm-svn: 126651	2011-02-28 14:08:24 +00:00
Che-Liang Chiou	4026d01040	Add preliminary support for .f32 in the PTX backend. - Add appropriate TableGen patterns for fadd, fsub, fmul. - Add .f32 as the PTX type for the LLVM float type. - Allow parameters, return values, and global variable declarations to accept the float type. - Add appropriate test cases. Patch by Justin Holewinski llvm-svn: 126636	2011-02-28 06:34:09 +00:00
Benjamin Kramer	0bdf517525	Silence enum conversion warnings. llvm-svn: 126578	2011-02-27 18:13:53 +00:00
NAKAMURA Takumi	b35d45a714	Target/X86: Always emit "push/pop GPRs" in prologue/epilogue and emit "spill/reload frames" for XMMs. It improves Win64's prologue/epilogue but it would not affect ia32 and amd64 (lack of nonvolatile XMMs). llvm-svn: 126568	2011-02-27 08:47:19 +00:00
Benjamin Kramer	412ffed4f0	Add some DAGCombines for (adde 0, 0, glue), which are useful to optimize legalized code for large integer arithmetic. 1. Inform users of ADDEs with two 0 operands that it never sets carry 2. Fold other ADDs or ADDCs into the ADDE if possible It would be neat if we could do the same thing for SETCC+ADD eventually, but we can't do that in target independent code. llvm-svn: 126557	2011-02-26 22:48:07 +00:00
Owen Anderson	bd26993873	Allow targets to specify a the type of the RHS of a shift parameterized on the type of the LHS. llvm-svn: 126518	2011-02-25 21:41:48 +00:00
Cameron Zwarich	974208a607	Roll out r126425 and r126450 to see if it fixes the failures on the buildbots. llvm-svn: 126488	2011-02-25 16:30:32 +00:00
Bob Wilson	6bbffe19e9	Add patterns to use post-increment addressing for Neon VST1-lane instructions. llvm-svn: 126477	2011-02-25 06:42:42 +00:00
Evan Cheng	56354c17d9	Fix typo. llvm-svn: 126467	2011-02-25 01:29:29 +00:00
Evan Cheng	fbdcea4b2e	Each prologue may have multiple vpush instructions to store callee-saved D registers since the vpush list may not have gaps. Make sure the stack adjustment instruction isn't moved between them. Ditto for vpop in epilogues. Sorry, can't reduce a small test case. rdar://9043312 llvm-svn: 126457	2011-02-25 00:24:46 +00:00
Chris Lattner	55119c81aa	remove command line option debugging hook. llvm-svn: 126441	2011-02-24 21:53:03 +00:00
Devang Patel	f2b2417c2c	Enable DebugInfo support for COFF object files. Patch by Nathan Jeffords! llvm-svn: 126425	2011-02-24 21:04:00 +00:00
Richard Osborne	a8df984a31	Add XCore intrinsic for eeu instruction. llvm-svn: 126384	2011-02-24 13:39:18 +00:00
Evan Cheng	9db7b1367d	Fix bug in X86 folding / unfolding table. Int_CMPSDrm and Int_CMPSSrm memory operands starts at index 2, not 1. rdar://9045024 PR9305 llvm-svn: 126359	2011-02-24 02:36:52 +00:00
Richard Osborne	d9564589f6	Add XCore intrinsic for clre instruction. llvm-svn: 126322	2011-02-23 18:52:05 +00:00
Richard Osborne	4a55817288	Add llvm.xcore.waitevent intrinsic. The effect of this intrinsic is to enable events on the thread and wait until a resource is ready to event. The vector of the resource that is ready is returned. llvm-svn: 126320	2011-02-23 18:35:59 +00:00
Richard Osborne	aaac1b01fd	Add XCore intrinsic for the setv instruction. llvm-svn: 126315	2011-02-23 16:46:37 +00:00
Richard Osborne	2374e9683e	Fix format for setc instruction. llvm-svn: 126314	2011-02-23 15:20:16 +00:00
Richard Osborne	aa39bf94b4	Add XCore intrinsic for settw instruction. llvm-svn: 126313	2011-02-23 14:45:03 +00:00
Evan Cheng	98e040ea71	Change VFPNeonA8 definition to make the code easier to read. llvm-svn: 126298	2011-02-23 02:35:33 +00:00
Evan Cheng	da40bcab44	More fcopysign correctness and performance fix. The previous codegen for the slow path (when values are in VFP / NEON registers) was incorrect if the source is NaN. The new codegen uses NEON vbsl instruction to copy the sign bit. e.g. vmov.i32 d1, #0x80000000 vbsl d1, d2, d0 If NEON is not available, it uses integer instructions to copy the sign bit. rdar://9034702 llvm-svn: 126295	2011-02-23 02:24:55 +00:00
David Greene	7b0539174a	[AVX] General VUNPCKL codegen support. llvm-svn: 126264	2011-02-22 23:31:46 +00:00
Joerg Sonnenberger	67e0eb235d	Use the same (%dx) hack for in[bwl] as for out[bwl]. llvm-svn: 126244	2011-02-22 20:40:09 +00:00
Evan Cheng	f540b0e0f6	VFP single precision arith instructions can go down to NEON pipeline, but on Cortex-A8 only. llvm-svn: 126238	2011-02-22 19:53:14 +00:00
Roman Divacky	f028b1614b	Stack alignment is 16 bytes on FreeBSD/i386 too. llvm-svn: 126226	2011-02-22 17:30:05 +00:00
Evan Cheng	f7c6f8580b	Guard against de-referencing MBB.end(). llvm-svn: 126192	2011-02-22 07:07:59 +00:00
Evan Cheng	6e3d087477	available_externally (hidden or not) GVs are always accessed via stubs. rdar://9027648. llvm-svn: 126191	2011-02-22 06:58:34 +00:00
Eric Christopher	58b95654bc	Only use blx for external function calls on thumb, these could be fixed up by the dynamic linker, but it's better to use the correct instruction to begin with. Fixes rdar://9011034 llvm-svn: 126176	2011-02-22 01:37:10 +00:00
Joerg Sonnenberger	9dceff5417	Recognize loopz and loopnz as aliases for loope and loopne. From Dimitry Andric. llvm-svn: 126168	2011-02-22 00:43:07 +00:00
Rafael Espindola	e4a04cce2b	Implement xgetbv and xsetbv. Patch by Jai Menon. llvm-svn: 126165	2011-02-22 00:35:18 +00:00
Evan Cheng	aaa5bd52f4	Skipping over debugvalue instructions to determine whether the split spot is in a IT block. rdar://9030770 llvm-svn: 126159	2011-02-21 23:40:47 +00:00
Devang Patel	d5c4589795	Revert r124611 - "Keep track of incoming argument's location while emitting LiveIns." In other words, do not keep track of argument's location. The debugger (gdb) is not prepared to see line table entries for arguments. For the debugger, "second" line table entry marks beginning of function body. This requires some coordination with debugger to get this working. - The debugger needs to be aware of prolog_end attribute attached with line table entries. - The compiler needs to accurately mark prolog_end in line table entries (at -O0 and at -O1+) llvm-svn: 126155	2011-02-21 23:21:26 +00:00
Sean Callanan	8aaf83f2b8	Fixed a bug in the X86 disassembler where a member of the X86 instruction decode structure was being interpreted as being in units of bits, although it is actually stored in units of bytes. llvm-svn: 126147	2011-02-21 21:55:05 +00:00
Richard Osborne	bd0e21b5ca	Add XCore intrinsics for various instructions on ports. llvm-svn: 126132	2011-02-21 18:23:30 +00:00
Duncan Sands	77c058dc70	The stack should be 16 byte aligned on 32 bit solaris. Patch by Yuri. llvm-svn: 126130	2011-02-21 17:37:17 +00:00
Chris Lattner	e7dc7e1e5b	a serious "compare CSE" issue that is nontrivial to get right, but which is responsible for us doing really bad things to 256.bzip2. llvm-svn: 126126	2011-02-21 17:03:47 +00:00
NAKAMURA Takumi	a03e9f0267	Target/X86/X86FastISel: [PR6275] Fix Win32's dllimport function with fastisel. "dllimport" function must not be GlobalVariable, but Function. It is enough to check with GlobalValue. test/CodeGen/X86/dll-linkage.ll is updated to check llc -O0. llvm-svn: 126110	2011-02-21 04:50:06 +00:00
Venkatraman Govindaraju	1a5bbc7f1e	Generate correct Sparc32 ABI compliant code for functions that return a struct. llvm-svn: 126108	2011-02-21 03:42:44 +00:00
Chris Lattner	c373140c8b	add a missed loop deletion case. llvm-svn: 126103	2011-02-21 02:13:39 +00:00
Chris Lattner	8760c28fe1	add an idiom that loop idiom could theoretically catch. llvm-svn: 126101	2011-02-21 01:33:38 +00:00
Cameron Zwarich	3384d8f317	A lo/hi mul has higher latency than an imul r,ri, e.g. 5 cycles compared to 3 on Core 2 and Nehalem, so the code we generate is better than GCC's here. llvm-svn: 126100	2011-02-21 01:29:32 +00:00
Cameron Zwarich	b7e676db6c	The signed version of our "magic number" computation for the integer approximation of a constant had a minor typo introduced when copying it from the book, which caused it to favor negative approximations over positive approximations in many cases. Positive approximations require fewer operations beyond the multiplication. In the case of division by 3, we still generate code that is a single instruction larger than GCC's code. llvm-svn: 126097	2011-02-21 00:22:02 +00:00
Eric Christopher	568548ce13	If both operands are loads from stores in memory we can't use movlpd/movlps since one needs to be a register operand. Just use movss instead of forcing an operand into a register. Fixes PR9239 llvm-svn: 126072	2011-02-20 05:04:42 +00:00
Oscar Fuentes	59c8ae34f7	Use explicit add_subdirectory's for LLVM target sublibraries instead of testing for its presence at cmake time. This way the build automatically regenerates the makefiles when a svn update brings in a new sublibrary. llvm-svn: 126068	2011-02-20 02:55:27 +00:00
Eli Friedman	0ad25251cb	Minor x86 README updates. llvm-svn: 126054	2011-02-19 21:54:28 +00:00
Chris Lattner	7cd801727d	implement PR9264: disambiguating 'bt mem, imm' as a btl. This is reasonable to do since all bt-mem forms do the same thing. llvm-svn: 126047	2011-02-19 21:06:36 +00:00
Eric Christopher	67a5a75e28	Fix typos. llvm-svn: 126018	2011-02-19 03:19:09 +00:00
Joerg Sonnenberger	4652f152e4	Avoid dangling else warnings. llvm-svn: 126004	2011-02-19 00:43:45 +00:00
Chris Lattner	a0dede2c21	add a way to disable all builtins, wire it up to opt's -disable-simplifylibcalls flag. llvm-svn: 125978	2011-02-18 22:34:03 +00:00
Oscar Fuentes	6e5d344a2e	Move library stuff out of the toplevel CMakeLists.txt file. llvm-svn: 125968	2011-02-18 22:06:14 +00:00
Chris Lattner	63dfb2c797	introduce a new TargetLibraryInfo pass, which transformations can use to query about available library functions. For now this just has memset_pattern16, which exists on darwin, but it can be extended for a bunch of other things in the future. llvm-svn: 125965	2011-02-18 21:50:34 +00:00
Bruno Cardoso Lopes	d97e3e6dad	Fix style and a typo llvm-svn: 125949	2011-02-18 19:49:06 +00:00
Bruno Cardoso Lopes	ad05904e0b	Add assembly parsing support for "msr" and also fix its encoding. Also add testcases for the disassembler to make sure it still works for "msr". llvm-svn: 125948	2011-02-18 19:45:59 +00:00
Chris Lattner	2aebf9f4ad	add a poor division by constant case. llvm-svn: 125832	2011-02-18 05:35:49 +00:00
Joerg Sonnenberger	efa8090e2a	Recognize monitor/mwait with explicit register arguments llvm-svn: 125805	2011-02-18 00:48:11 +00:00
Joerg Sonnenberger	9f8f3a2c59	Recognize leavel and leaveq aliases for leave. Validate encoding of leave in 64bit mode. llvm-svn: 125795	2011-02-17 23:36:39 +00:00
David Greene	244920d662	[AVX] Recorganize X86ShuffleDecode into its own library (LLVMX86Utils.a) to break cyclic library dependencies between LLVMX86CodeGen.a and LLVMX86AsmParser.a. Previously this code was in a header file and marked static but AVX requires some additional functionality here that won't be used by all clients. Since including unused static functions causes a gcc compiler warning, keeping it as a header would break builds that use -Werror. Putting this in its own library solves both problems at once. llvm-svn: 125765	2011-02-17 19:18:59 +00:00
Dan Gohman	71117af2db	The labyrinthine X86 backend no longer appears to require these patterns. llvm-svn: 125759	2011-02-17 18:50:19 +00:00
NAKAMURA Takumi	00228d0c2c	Triple::MinGW64 is deprecated and removed. We can use Triple::MinGW32 generally. No one uses *-mingw64. mingw-w64 is represented as {i686\|x86_64}-w64-mingw32. In llvm side, i686 and x64 can be treated as similar way. llvm-svn: 125747	2011-02-17 12:24:17 +00:00
NAKAMURA Takumi	8d39c3a632	Fix whitespace. llvm-svn: 125746	2011-02-17 12:23:50 +00:00
Duncan Sands	e0ece264ba	This has been implemented. llvm-svn: 125738	2011-02-17 08:16:56 +00:00
Chris Lattner	035876162f	add some notes on compares + binops. Remove redundant entries. llvm-svn: 125702	2011-02-17 01:43:46 +00:00
Chris Lattner	9f4e529571	Add a few missed xforms from GCC PR14753 llvm-svn: 125681	2011-02-16 19:16:34 +00:00
Stuart Hastings	47e45a32a8	Swap VT and DebugLoc operands of getExtLoad() for consistency with other getNode() methods. Radar 9002173. llvm-svn: 125665	2011-02-16 16:23:55 +00:00
Eli Friedman	b409f8da64	Remove outdated README entry. llvm-svn: 125660	2011-02-16 07:41:19 +00:00
Eli Friedman	5f848d70fa	Remove outdated README entry. llvm-svn: 125659	2011-02-16 07:18:18 +00:00
Eli Friedman	30a64ae1b9	Update README entry. llvm-svn: 125658	2011-02-16 07:17:44 +00:00
Rafael Espindola	b59fdeb3de	Add support for pushsection and popsection. Patch by Joerg Sonnenberger. llvm-svn: 125629	2011-02-16 01:08:29 +00:00
Evan Cheng	d3928a2c3a	Some single precision VFP instructions may be executed on NEON pipeline, but not double precision ones. llvm-svn: 125624	2011-02-16 00:35:02 +00:00
Jakob Stoklund Olesen	d8c18daea5	Teach ARMLoadStoreOptimizer to remove kill flags from merged instructions as well. This is necessary to avoid a crash in certain tangled situations where a kill flag is first correctly moved to a merged instruction, and then needs to be moved again: STR %R0, a... STR %R0<kill>, b... First becomes: STR %R0, b... STM a, %R0<kill>, ... and then: STM a, %R0, ... STM b, %R0<kill>, ... We can now remove the kill flag from the merged STM when needed. 8960050. llvm-svn: 125591	2011-02-15 19:51:58 +00:00
Duncan Sands	061150ac1b	Spelling fix: consequtive -> consecutive. llvm-svn: 125563	2011-02-15 09:23:02 +00:00
Bob Wilson	43bf86b10d	Remove unused bitvectors that record ARM callee-saved registers. llvm-svn: 125534	2011-02-14 23:40:38 +00:00
Bruno Cardoso Lopes	5eb7668012	A fail to match coprocessor number and register number must fail instead of assert. llvm-svn: 125521	2011-02-14 21:10:33 +00:00
Bruno Cardoso Lopes	e65a98b127	Fix encoding and add parsing support for the arm/thumb CPS instruction: - Add custom operand matching for imod and iflags. - Rename SplitMnemonicAndCC to SplitMnemonic since it splits more than CC from mnemonic. - While adding ".w" as an operand, don't change "Head" to avoid passing the wrong mnemonic to ParseOperand. - Add asm parser tests. - Add disassembler tests just to make sure it can catch all cps versions. llvm-svn: 125489	2011-02-14 13:09:44 +00:00
Chris Lattner	bcf2d46d8a	Enhance ComputeMaskedBits to know that aligned frameindexes have their low bits set to zero. This allows us to optimize out explicit stack alignment code like in stack-align.ll:test4 when it is redundant. Doing this causes the code generator to start turning FI+cst into FI\|cst all over the place, which is general goodness (that is the canonical form) except that various pieces of the code generator don't handle OR aggressively. Fix this by introducing a new SelectionDAG::isBaseWithConstantOffset predicate, and using it in places that are looking for ADD(X,CST). The ARM backend in particular was missing a lot of addressing mode folding opportunities around OR. llvm-svn: 125470	2011-02-13 22:25:43 +00:00
Reid Kleckner	0e68b2ed88	Add encodings and mnemonics for FXSAVE64 and FXRSTOR64. These are just FXSAVE and FXRSTOR with REX.W prefixes. These versions use 64-bit pointer values instead of 32-bit pointer values in the memory map they dump and restore. llvm-svn: 125446	2011-02-12 23:24:13 +00:00
Venkatraman Govindaraju	3cc16c2b89	Prevent IMPLICIT_DEF/KILL to become a delay filler instruction in SPARC backend. llvm-svn: 125444	2011-02-12 19:02:33 +00:00
Benjamin Kramer	19bcaa5d51	Add a note about SSE4.1 roundss/roundsd. llvm-svn: 125438	2011-02-12 17:58:16 +00:00
Jim Grosbach	c359122d78	AsmMatcher custom operand parser failure enhancements. Teach the AsmMatcher handling to distinguish between an error custom-parsing an operand and a failure to match. The former should propogate the error upwards, while the latter should continue attempting to parse with alternative matchers. Update the ARM asm parser accordingly. llvm-svn: 125426	2011-02-12 01:34:40 +00:00
Nate Begeman	0a8f9ff53b	Implement sdiv & udiv for <4 x i16> and <8 x i8> NEON vector types. This avoids moving each element to the integer register file and calling __divsi3 etc. on it. llvm-svn: 125402	2011-02-11 20:53:29 +00:00
Rafael Espindola	bb94ca00f7	Remove std::string version of getNameWithPrefix. llvm-svn: 125363	2011-02-11 05:23:09 +00:00
Evan Cheng	7cfe7b71e6	Fix buggy fcopysign lowering. This define float @foo(float %x, float %y) nounwind readnone { entry: %0 = tail call float @copysignf(float %x, float %y) nounwind readnone ret float %0 } Was compiled to: vmov s0, r1 bic r0, r0, #-2147483648 vmov s1, r0 vcmpe.f32 s0, #0 vmrs apsr_nzcv, fpscr it lt vneglt.f32 s1, s1 vmov r0, s1 bx lr This fails to copy the sign of -0.0f because it's lost during the float to int conversion. Also, it's sub-optimal when the inputs are in GPR registers. Now it uses integer and + or operations when it's profitable. And it's correct! lsrs r1, r1, #31 bfi r0, r1, #31, #1 bx lr rdar://8984306 llvm-svn: 125357	2011-02-11 02:28:55 +00:00

1 2 3 4 5 ...

17273 Commits