llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Bill Wendling	55b4dfcd9d	Micro-optimization: This code: float floatingPointComparison(float x, float y) { double product = (double)x * y; if (product == 0.0) return product; return product - 1.0; } produces this: _floatingPointComparison: 0000000000000000 cvtss2sd %xmm1,%xmm1 0000000000000004 cvtss2sd %xmm0,%xmm0 0000000000000008 mulsd %xmm1,%xmm0 000000000000000c pxor %xmm1,%xmm1 0000000000000010 ucomisd %xmm1,%xmm0 0000000000000014 jne 0x00000004 0000000000000016 jp 0x00000002 0000000000000018 jmp 0x00000008 000000000000001a addsd 0x00000006(%rip),%xmm0 0000000000000022 cvtsd2ss %xmm0,%xmm0 0000000000000026 ret The "jne/jp/jmp" sequence can be reduced to this instead: _floatingPointComparison: 0000000000000000 cvtss2sd %xmm1,%xmm1 0000000000000004 cvtss2sd %xmm0,%xmm0 0000000000000008 mulsd %xmm1,%xmm0 000000000000000c pxor %xmm1,%xmm1 0000000000000010 ucomisd %xmm1,%xmm0 0000000000000014 jp 0x00000002 0000000000000016 je 0x00000008 0000000000000018 addsd 0x00000006(%rip),%xmm0 0000000000000020 cvtsd2ss %xmm0,%xmm0 0000000000000024 ret for a savings of 2 bytes. This xform can happen when we recognize that jne and jp jump to the same "true" MBB, the unconditional jump would jump to the "false" MBB, and the "true" branch is the fall-through MBB. llvm-svn: 97766	2010-03-05 00:24:26 +00:00
Johnny Chen	b6d35fd803	Drop the ".w" qualifier for t2UXTB16* instructions as there is no 16-bit version of either sxtb16 or uxtb16, and the unified syntax does not specify ".w". llvm-svn: 97760	2010-03-04 22:24:41 +00:00
Bob Wilson	188e15d7a5	pr6478: The frame pointer spill frame index is only defined when there is a frame pointer. llvm-svn: 97755	2010-03-04 21:42:36 +00:00
Bob Wilson	73b96c00d2	pr6480: Don't try producing ld/st-multiple instructions when the address is an undef value. This is only going to come up for bugpoint-reduced tests -- correct programs will not access memory at undefined addresses -- so it's not worth the effort of doing anything more aggressive. llvm-svn: 97745	2010-03-04 21:04:38 +00:00
Jakob Stoklund Olesen	3408cd6de1	Fix the remaining MUL8 and DIV8 to define AX instead of AL,AH. These instructions technically define AL,AH, but a trick in X86ISelDAGToDAG reads AX in order to avoid reading AH with a REX instruction. Fix PR6489. llvm-svn: 97742	2010-03-04 20:42:07 +00:00
Dan Gohman	265f85f6d8	Fix recognition of 16-bit bswap for C front-ends which emit the clobber registers in a different order. llvm-svn: 97741	2010-03-04 19:58:08 +00:00
John McCall	f762d454a4	Teach lit to honor conditional directives. The syntax is: IF(condition(value)): If the value satisfies the condition, the line is processed by lit; otherwise it is skipped. A test with no unignored directives is resolved as Unsupported. The test suite is responsible for defining conditions; conditions are unary functions over strings. I've defined two conditions in the LLVM test suite, TARGET (with values like those in TARGETS_TO_BUILD) and BINDING (with values like those in llvm_bindings). So for example you can write: IF(BINDING(ocaml)): RUN: %blah %s -o - and the RUN line will only execute if LLVM was configured with the ocaml bindings. llvm-svn: 97726	2010-03-04 09:36:50 +00:00
Nick Lewycky	58ab63e179	Make the 'icmp pred trunc(ext(X)), CST --> icmp pred X, ext(trunc(CST))' transformation much more careful. Truncating binary '01' to '1' sounds like it's safe until you realize that it switched from positive to negative under a signed interpretation, and that depends on the icmp predicate. Also a few miscellaneous cleanups. llvm-svn: 97721	2010-03-04 06:54:10 +00:00
Erick Tryzelaar	0b21835716	Expose the rest of the llvm-c scalar opts to ocaml. llvm-svn: 97685	2010-03-03 23:51:34 +00:00
Chris Lattner	9e230fb6b2	fix incorrect folding of icmp with undef, PR6481. llvm-svn: 97659	2010-03-03 19:46:03 +00:00
Dan Gohman	da13ee1220	Revert r97580; that's not the right way to fix this. llvm-svn: 97639	2010-03-03 04:36:42 +00:00
Bill Wendling	d1f658563d	This test case: long test(long x) { return (x & 123124) \| 3; } Currently compiles to: _test: orl $3, %edi movq %rdi, %rax andq $123127, %rax ret This is because instruction and DAG combiners canonicalize (or (and x, C), D) -> (and (or, D), (C \| D)) However, this is only profitable if (C & D) != 0. It gets in the way of the 3-addressification because the input bits are known to be zero. llvm-svn: 97616	2010-03-03 00:35:56 +00:00
Erick Tryzelaar	f04f234444	Remove module providers from ocaml. llvm-svn: 97609	2010-03-02 23:59:00 +00:00
Chris Lattner	9c9c1158cb	Fix some issues in WalkChainUsers dealing with CopyToReg/CopyFromReg/INLINEASM. These are annoying because they have the same opcode before an after isel. Fix this by setting their NodeID to -1 to indicate that they are selected, just like what automatically happens when selecting things that end up being machine nodes. With that done, give IsLegalToFold a new flag that causes it to ignore chains. This lets the HandleMergeInputChains routine be the one place that validates chains after a match is successful, enabling the new hotness in chain processing. This smarter chain processing eliminates the need for "PreprocessRMW" in the X86 and MSP430 backends and enables MSP to start matching it's multiple mem operand instructions more aggressively. I currently #if out the dead code in the X86 backend and MSP backend, I'll remove it for real in a follow-on patch. The testcase changes are: test/CodeGen/X86/sse3.ll: we generate better code test/CodeGen/X86/store_op_load_fold2.ll: PreprocessRMW was miscompiling this before, we now generate correct code Convert it to filecheck while I'm at it. test/CodeGen/MSP430/Inst16mm.ll: Add a testcase for mem/mem folding to make anton happy. :) llvm-svn: 97596	2010-03-02 22:20:06 +00:00
Chris Lattner	d25f212f9f	this testcase is failing because pic16 doesn't define a reg/reg xor pattern. I have no plans to fix this XFAIL. llvm-svn: 97587	2010-03-02 20:48:24 +00:00
Erick Tryzelaar	0b0e6ace2c	Add support for use to ocaml. llvm-svn: 97586	2010-03-02 20:32:32 +00:00
Chris Lattner	f84b94d738	xfail this for now. llvm-svn: 97584	2010-03-02 19:53:25 +00:00
Dan Gohman	f06941597a	When expanding an expression such as (A + B + C + D), sort the operands by loop depth and emit loop-invariant subexpressions outside of loops. This speeds up MultiSource/Applications/viterbi and others. llvm-svn: 97580	2010-03-02 19:32:21 +00:00
Chris Lattner	845db3b26d	clean up some testcases. llvm-svn: 97576	2010-03-02 18:56:03 +00:00
Chris Lattner	2019e2922f	Fix the xfail I added a couple of patches back. The issue was that we weren't properly handling the case when interior nodes of a matched pattern become dead after updating chain and flag uses. Now we handle this explicitly in UpdateChainsAndFlags. llvm-svn: 97561	2010-03-02 07:50:03 +00:00
Chris Lattner	0b41a42411	Rewrite chain handling validation and input TokenFactor handling stuff now that we don't care about emulating the old broken behavior of the old isel. This eliminates the 'CheckChainCompatible' check (along with IsChainCompatible) which did an incorrect and inefficient scan up the chain nodes which happened as the pattern was being formed and does the validation at the end in HandleMergeInputChains when it forms a structural pattern. This scans "down" the graph, which means that it is quickly bounded by nodes already selected. This also handles token factors that get "trapped" in the dag. Removing the CheckChainCompatible nodes also shrinks the generated tables by about 6K for X86 (down to 83K). There are two pieces remaining before I can nuke PreprocessRMW: 1. I xfailed a test because we're now producing worse code in a case that has nothing to do with the change: it turns out that our use of MorphNodeTo will leave dead nodes in the graph which (depending on how the graph is walked) end up causing bogus uses of chains and blocking matches. This is really bad for other reasons, so I'll fix this in a follow-up patch. 2. CheckFoldableChainNode needs to be improved to handle the TF. llvm-svn: 97539	2010-03-02 02:22:10 +00:00
Dan Gohman	56a20fc5eb	Fix several places to handle vector operands properly. Based on a patch by Micah Villmow for PR6438. llvm-svn: 97538	2010-03-02 02:14:38 +00:00
Dan Gohman	1625456786	Non-affine post-inc SCEV expansions have more code which must be emitted after the increment. Make sure the insert position reflects this. This fixes PR6453. llvm-svn: 97537	2010-03-02 01:59:21 +00:00
Dan Gohman	37bf232609	Floating-point add, sub, and mul are now spelled fadd, fsub, and fmul, respectively. llvm-svn: 97531	2010-03-02 01:11:08 +00:00
Chris Lattner	c0839055a9	Fix PR2590 by making PatternSortingPredicate actually be ordered correctly. Previously it would get in trouble when two patterns were too similar and give them nondet ordering. We force this by using the record ID order as a fallback. The testsuite diff is due to alpha patterns being ordered slightly differently, the change is a semantic noop afaict: < lda $0,-100($16) --- > subq $16,100,$0 llvm-svn: 97509	2010-03-01 22:09:11 +00:00
Devang Patel	9f858ad942	Remove tests that checks @llvm.dbg.stoppoint handling. llvm-svn: 97493	2010-03-01 20:33:48 +00:00
Chris Lattner	04209058b9	stop using anders-aa llvm-svn: 97492	2010-03-01 20:24:50 +00:00
Chris Lattner	ac2f5c24a0	stop using anders-aa llvm-svn: 97491	2010-03-01 20:24:05 +00:00
Chris Lattner	5649a97c00	remove andersen's tests. llvm-svn: 97490	2010-03-01 20:23:15 +00:00
Devang Patel	6dd4084f57	@llvm.dbg.stoppoint intrinsic is not used anymore. Delete dead testcase. llvm-svn: 97489	2010-03-01 19:46:08 +00:00
Devang Patel	ef282ea4c2	Update to use new debug info encoding scheme. As a bonus, now the test passes! llvm-svn: 97487	2010-03-01 19:41:26 +00:00
Devang Patel	66fd0f6b4b	Remove this test because it checks wheter optimizer handled @llvm.dbg.global_variable appropriately or not. LLVM does not use this scheme to encode debug info for global variables any more. llvm-svn: 97480	2010-03-01 19:14:25 +00:00
Devang Patel	4853c9c8d1	Remove test to check bugfix in handing debug info for global variables using intrinsics. Now, debug info for global variable is encoded using metadata. The old code path is now history and there is no need to have a test to check a bug fix in old code path. llvm-svn: 97477	2010-03-01 19:09:55 +00:00
Devang Patel	f05e20ef3c	Remove dead test. llvm-svn: 97474	2010-03-01 19:04:23 +00:00
Devang Patel	cdb2c39383	Replace test case that uses @llvm.dbg.* intrinsic with a test that uses metadata. llvm-svn: 97473	2010-03-01 19:02:51 +00:00
Devang Patel	be1150d535	These two tests check whether oprimizer safely ignores @llvm.dbg.stoppoint intrinsic or not. This intrinsic is not used anymore. llvm-svn: 97468	2010-03-01 18:45:28 +00:00
Devang Patel	271f21327d	This test checks whether LICM ignores @llvm.dbg.stoppoint intrinsics appropriately or not. Now, llvm does not use this intrinsic. Remove this test. llvm-svn: 97466	2010-03-01 18:32:27 +00:00
Devang Patel	6853e2432e	Rewrite test to test VLA using new debug info encoding scheme. llvm-svn: 97465	2010-03-01 18:30:58 +00:00
Devang Patel	c56aee014c	Remove this generic debug info intrinsic test. LLVM does not use this llvm.dbg.stoppoint intrinsic anymore. There are tests to check new implementation, which attaches location information directly with an instruction using metadata. llvm-svn: 97464	2010-03-01 18:30:08 +00:00
Dan Gohman	5e58ab0b56	LLVM instruction syntax doesn't have trailing semicolons. llvm-svn: 97456	2010-03-01 17:53:15 +00:00
Erick Tryzelaar	264323d31e	Add support getting the operands of a User to ocaml. llvm-svn: 97414	2010-02-28 20:45:03 +00:00
Erick Tryzelaar	ff1a75de6d	Add support for global aliases to ocaml. llvm-svn: 97413	2010-02-28 20:44:58 +00:00
Erick Tryzelaar	c0bff2bbc9	Add support for inserting inline asm to ocaml. llvm-svn: 97412	2010-02-28 20:44:53 +00:00
Chris Lattner	74db1864da	add some random nounwinds. llvm-svn: 97411	2010-02-28 20:36:49 +00:00
Erick Tryzelaar	7d687cbb84	Add support for getting a null pointer. llvm-svn: 97380	2010-02-28 09:46:27 +00:00
Erick Tryzelaar	17cb74d29a	Add a way to look up a type by it's name in a module. llvm-svn: 97379	2010-02-28 09:46:21 +00:00
Erick Tryzelaar	c6e78e3503	Add support for global variables in an address space for llvm-c and ocaml. llvm-svn: 97377	2010-02-28 09:46:13 +00:00
Erick Tryzelaar	08d9f8e8fe	Add indirect br support to llvm-c and ocaml. llvm-svn: 97376	2010-02-28 09:46:06 +00:00
Erick Tryzelaar	9401ea01cc	Add metadata functions to llvm-c and ocaml. llvm-svn: 97375	2010-02-28 09:45:59 +00:00
Erick Tryzelaar	f7bbb83b17	Add the new builder arthmetic instructions to llvm-c and ocaml. llvm-svn: 97372	2010-02-28 05:51:43 +00:00

1 2 3 4 5 ...

9378 Commits