llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Benjamin Kramer	583d3f2591	Make CHECK lines a bit less strict so they also match code generated for win64. Hopefully brings the windows buildbots back to life. llvm-svn: 180630	2013-04-26 21:04:21 +00:00
Benjamin Kramer	11723aa321	X86: Now that we have a canonical form for vector integer abs, match it into pabs. llvm-svn: 180600	2013-04-26 12:05:21 +00:00
Benjamin Kramer	7ce75fb032	DAGCombiner: Canonicalize vector integer abs in the same way we do it for scalars. This already helps SSE2 x86 a lot because it lacks an efficient way to represent a vector select. The long term goal is to enable the backend to match a canonicalized pattern into a single instruction (e.g. vabs or pabs). llvm-svn: 180597	2013-04-26 09:19:19 +00:00
Preston Gurd	0547d81fdb	This patch adds the X86FixupLEAs pass, which will reduce instruction latency for certain models of the Intel Atom family, by converting instructions into their equivalent LEA instructions, when it is both useful and possible to do so. llvm-svn: 180573	2013-04-25 20:29:37 +00:00
Chad Rosier	3030f76a0d	[inline asm] Add a test case for r180226. The specific issue is that the inline assembly is requesting a 64-bit register, which is invalid for i386. rdar://13731657 llvm-svn: 180445	2013-04-25 17:10:21 +00:00
Rafael Espindola	f7c86d97a1	Move test from grep to FileCheck. llvm-svn: 180092	2013-04-23 12:03:27 +00:00
Arnaud A. de Grandmaison	087fe129d8	Cleanup: test source files do not need to be executable llvm-svn: 180003	2013-04-22 08:02:43 +00:00
David Blaikie	9bfe15c313	Revert "Revert "PR14606: debug info imported_module support"" This reverts commit r179840 with a fix to test/DebugInfo/two-cus-from-same-file.ll I'm not sure why that test only failed on ARM & MIPS and not X86 Linux, even though the debug info was clearly invalid on all of them, but this ought to fix it. llvm-svn: 179996	2013-04-22 06:12:31 +00:00
Bill Wendling	61eb6957c5	Remove tbaa metadata. llvm-svn: 179970	2013-04-21 01:38:25 +00:00
Stephen Lin	98df7358cd	Minor renaming of tests (for consistency with an in-development patch) llvm-svn: 179954	2013-04-20 16:21:26 +00:00
Benjamin Kramer	a7e8f887fe	Don't litter .s files in test directory. llvm-svn: 179937	2013-04-20 10:43:40 +00:00
Stephen Lin	9d99ba2071	Add CodeGen support for functions that always return arguments via a new parameter attribute 'returned', which is taken advantage of in target-independent tail call opportunity detection and in ARM call lowering (when placed on an integral first parameter). llvm-svn: 179925	2013-04-20 05:14:40 +00:00
Stephen Lin	65c1101eba	Allow tail call opportunity detection through nested and/or multiple iterations of extractelement/insertelement indirection llvm-svn: 179924	2013-04-20 04:27:51 +00:00
Anton Korobeynikov	f95220dd8b	Do not mangle in MS-way the globals with magic \001 in the name. Based on the patch by David Nadlinger! llvm-svn: 179889	2013-04-19 21:20:56 +00:00
Bill Wendling	e8c6d1cb09	Make test slightly more readable. llvm-svn: 179888	2013-04-19 21:14:59 +00:00
Bill Wendling	7256108f6f	Add a testcase to make sure we generate the proper compact unwind section for a function that cannot produce a compact unwind encoding. llvm-svn: 179887	2013-04-19 21:07:11 +00:00
Eric Christopher	88bdd26cc9	Revert "PR14606: debug info imported_module support" This reverts commit r179836 as it seems to have caused test failures. llvm-svn: 179840	2013-04-19 07:47:16 +00:00
David Blaikie	46f35f8e56	PR14606: debug info imported_module support Adding another CU-wide list, in this case of imported_modules (since they should be relatively rare, it seemed better to add a list where each element had a "context" value, rather than add a (usually empty) list to every scope). This takes care of DW_TAG_imported_module, but to fully address PR14606 we'll need to expand this to cover DW_TAG_imported_declaration too. llvm-svn: 179836	2013-04-19 06:57:04 +00:00
Benjamin Kramer	aeff9e581b	X86: Add an SSE2 lowering for 64 bit compares when pcmpgtq (SSE4.2) isn't available. This pattern started popping up in vectorized min/max reductions. llvm-svn: 179797	2013-04-18 21:37:45 +00:00
Derek Schuff	c55a3d43a9	Allow misaligned stores in x86 fast-isel. In X86FastISel::X86SelectStore(), improperly aligned stores are rejected and handled by the DAG-based ISel. However, X86FastISel::X86SelectLoad() makes no such requirement. There doesn't appear to be an x86 architectural correctness issue with allowing potentially unaligned store instructions. This patch removes this restriction. Patch by Jim Stichnot. llvm-svn: 179774	2013-04-18 17:41:08 +00:00
Eli Bendersky	802610971f	This patch teaches x86 fast-isel to generate the native div/idiv instructions for the sdiv/srem/udiv/urem bitcode instructions. This is done for the i8, i16, and i32 types, as well as i64 for the x86_64 target. Patch by Jim Stichnoth llvm-svn: 179715	2013-04-17 20:10:13 +00:00
Tim Northover	b5dc8bb136	Avoid outputting temporary test file into source tree. llvm-svn: 179532	2013-04-15 15:49:13 +00:00
Andrew Trick	2bd87ad8d4	Further generalize this scheduler test. The order of copies depends on queue order, which is not very stable. llvm-svn: 179456	2013-04-13 07:37:27 +00:00
Andrew Trick	fb2a8d10f8	Fix a dislexic regex. llvm-svn: 179455	2013-04-13 07:29:21 +00:00
Andrew Trick	1ef71359cd	Add a missing REQUIRES: asserts llvm-svn: 179453	2013-04-13 06:12:46 +00:00
Andrew Trick	861493bc4f	MI-Sched: schedule physreg copies. The register allocator expects minimal physreg live ranges. Schedule physreg copies accordingly. This is slightly tricky when they occur in the middle of the scheduling region. For now, this is handled by rescheduling the copy when its associated instruction is scheduled. Eventually we may instead bundle them, but only if we can preserve the bundles as parallel copies during regalloc. llvm-svn: 179449	2013-04-13 06:07:40 +00:00
Nadav Rotem	f96cc4976d	Fix the test on linux by setting the triple and the align format llvm-svn: 179354	2013-04-12 01:07:16 +00:00
Nadav Rotem	662256bafa	Add a flag to align all basic blocks in the function. When debugging performance regressions we often ask ourselves if the regression that we see is due to poor isel/sched/ra or due to some micro-architetural problem. When comparing two code sequences one good way to rule out front-end bottlenecks (and other the issues) is to force code alignment. This pass adds a flag that forces the alignment of all of the basic blocks in the program. llvm-svn: 179353	2013-04-12 00:48:32 +00:00
Preston Gurd	8e4196dfe6	Use FileCheck instead of grep. llvm-svn: 179322	2013-04-11 21:39:01 +00:00
Eli Bendersky	0ce49fd520	Add a CHECK-NOT for a more faithful translation of the original grep \| count 2. Thanks to Reid Kleckner for catching this. llvm-svn: 179289	2013-04-11 14:43:19 +00:00
Michael Liao	877d1576e6	Optimize vector select from all 0s or all 1s As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane, vector select could be simplified to AND/OR or removed if one or both values being selected is all 0s or all 1s. llvm-svn: 179267	2013-04-11 05:15:54 +00:00
Michael Liao	87125582e9	Enhance bool simplifcation in X86 to handle more cases This patch is revised based on patch from Victor Umansky <victor.umansky@intel.com>. More cases are handled in X86's bool simplification, i.e. - SETCC_CARRY - value is truncated to i1 with AND As a by-product, PR5443 is also fixed. llvm-svn: 179265	2013-04-11 04:43:09 +00:00
Eli Bendersky	90daaa543a	Rewrite some of the test/CodeGen/X86 tests to use FileCheck instead of grep llvm-svn: 179241	2013-04-10 23:30:20 +00:00
Evan Cheng	9f82233851	__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in xmm0 / xmm1. rdar://13599493 llvm-svn: 179141	2013-04-10 01:26:07 +00:00
Timur Iskhodzhanov	c004e9db2d	Make the test/CodeGen/X86/win32_sret.ll reliable on any CPU by explicitly specifying the -mcpu llvm-svn: 178885	2013-04-05 17:05:56 +00:00
Andrew Trick	6da34cd35e	RegisterPressure heuristics currently require signed comparisons. llvm-svn: 178823	2013-04-05 00:31:34 +00:00
Timur Iskhodzhanov	0976f711d6	Temporarily relax the WIN32 checks in the SRet test to fix the Atom D2700 bot llvm-svn: 178635	2013-04-03 12:17:15 +00:00
Timur Iskhodzhanov	ecd533f0ec	Fix SRet for thiscall in i686-pc-win32 llvm-svn: 178634	2013-04-03 11:27:54 +00:00
NAKAMURA Takumi	d8a9117bcb	llvm/test/CodeGen/X86: Unmark them out of XFAIL:cygming, in atomic{32\|64}.ll and handle-move.ll, corresponding to r178549. This reverts r176808, r176798, and r177914. llvm-svn: 178583	2013-04-02 22:35:08 +00:00
Jakob Stoklund Olesen	b0a5a72daf	Don't attempt MTM heuristics without a scheduling model present. This should fix the PPC buildbots. llvm-svn: 178558	2013-04-02 18:26:45 +00:00
Chad Rosier	908153170e	[fast-isel] Use the correct API to disable FastLowerArguments for Win64. llvm-svn: 178549	2013-04-02 16:31:41 +00:00
Preston Gurd	fca710bf70	Simplify test cases for Atom preferring call register indirect over call memory indirect (32 and 64 bit). llvm-svn: 178541	2013-04-02 14:25:06 +00:00
Arnold Schwaighofer	a2a475a83d	Merge load/store sequences with adresses: base + index + offset We would also like to merge sequences that involve a variable index like in the example below. int index = *idx++ int i0 = c[index+0]; int i1 = c[index+1]; b[0] = i0; b[1] = i1; By extending the parsing of the base pointer to handle dags that contain a base, index, and offset we can handle examples like the one above. The dag for the code above will look something like: (load (i64 add (i64 copyfromreg %c) (i64 signextend (i8 load %index)))) (load (i64 add (i64 copyfromreg %c) (i64 signextend (i32 add (i32 signextend (i8 load %index)) (i32 1))))) The code that parses the tree ignores the intermediate sign extensions. However, if there is a sign extension it needs to be on all indexes. (load (i64 add (i64 copyfromreg %c) (i64 signextend (add (i8 load %index) (i8 1)))) vs (load (i64 add (i64 copyfromreg %c) (i64 signextend (i32 add (i32 signextend (i8 load %index)) (i32 1))))) radar://13536387 llvm-svn: 178483	2013-04-01 18:12:58 +00:00
Benjamin Kramer	790bd5fb50	X86: Promote sitofp <8 x i16> to <8 x i32> when AVX is available. A vector sext + sitofp is a lot cheaper than 8 scalar conversions. llvm-svn: 178448	2013-03-31 12:49:15 +00:00
Benjamin Kramer	86e90ea8b4	DAGCombine: visitXOR can replace a node without returning it, bail out in that case. Fixes the crash reported in PR15608. llvm-svn: 178429	2013-03-30 21:28:18 +00:00
Benjamin Kramer	50725426cb	Change '@SECREL' suffix to GAS-compatible '@SECREL32'. '@SECREL' is what is used by the Microsoft assembler, but GNU as expects '@SECREL32'. With the patch, the MC-generated code works fine in combination with a recent GNU as (2.23.51.20120920 here). Patch by David Nadlinger! Differential Revision: http://llvm-reviews.chandlerc.com/D429 llvm-svn: 178427	2013-03-30 16:21:50 +00:00
Timur Iskhodzhanov	d7d35221f7	Exclude the X86/complex-fca.ll test at it probably wasn't supposed to work on Windows llvm-svn: 178375	2013-03-29 21:54:00 +00:00
Benjamin Kramer	279e5cfa9a	Remove the old CodePlacementOpt pass. It was superseded by MachineBlockPlacement and disabled by default since LLVM 3.1. llvm-svn: 178349	2013-03-29 17:14:24 +00:00
Michael Liao	427149cbcf	Add support of RDSEED defined in AVX2 extension llvm-svn: 178314	2013-03-28 23:41:26 +00:00
Michael Liao	aec693ab31	Enhance boolean simplification to handle 16-/64-bit RDRAND - RDRAND always clears the destination value when a random value is not available (i.e. CF == 0). This value is truncated or zero-extended as the false boolean value to be returned. Boolean simplification needs to skip this 'zext' or 'trunc' node. llvm-svn: 178312	2013-03-28 23:38:52 +00:00

1 2 3 4 5 ...

3951 Commits