llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Evan Cheng	b1ddb193c7	Fix PR4975. Avoid referencing empty vector. llvm-svn: 99840	2010-03-29 21:27:30 +00:00
Evan Cheng	aafcb722f9	Pool allocate SDDbgValue nodes. llvm-svn: 99836	2010-03-29 20:48:30 +00:00
Chris Lattner	24b8a8bebd	add a statistic for the # times isel has to backtrack. llvm-svn: 99774	2010-03-28 19:46:56 +00:00
Chris Lattner	5ad69fe4b6	finally remove the immAllOnesV_bc/immAllZerosV_bc patterns and those derived from them. These are obnoxious because they were written as: PatLeaf<(bitconvert). Not having an argument was foiling adding better type checking for operand count matching up with what was required (in this case, bitconvert always requires an operand!) llvm-svn: 99759	2010-03-28 08:43:23 +00:00
Chris Lattner	78dc3c322f	comply with the wishes of a fixme. llvm-svn: 99742	2010-03-28 05:55:17 +00:00
Chris Lattner	918dd19018	now that (parallel) is gone and a variety of bugs in targets are cleaned up, we can remove an old fixme. llvm-svn: 99741	2010-03-28 05:54:03 +00:00
Chris Lattner	3f060e3216	add an optimized form of OPC_EmitMergeInputChains for the 1, 0 and 1, 1 cases which are by-far the most frequent. This shrinks the X86 isel table from 77014 -> 74657 bytes. llvm-svn: 99740	2010-03-28 05:50:16 +00:00
Chris Lattner	bacf4edba3	don't add nodes to the now-dead nodes list multiple times, this can cause a crash on crazy situations in msp430 when morph-node-to is disabled. llvm-svn: 99739	2010-03-28 05:28:31 +00:00
Chris Lattner	bb65543f01	don't add flag nodes with chain results to the NowDeadNodes list multiple times when MorphNodeTo can't be applied. llvm-svn: 99735	2010-03-28 04:54:33 +00:00
Chris Lattner	85756c913d	improve -debug-only=isel comments for cases when we don't enter a scope due to obviously false predicate. llvm-svn: 99723	2010-03-27 18:54:50 +00:00
Bill Wendling	3f00f8044f	Forgot the part where we handle the ".llvm.eh.catch.all.value". llvm-svn: 99697	2010-03-27 01:24:30 +00:00
Anton Korobeynikov	add7caa0f3	Add few missed libcalls and correct names for others. llvm-svn: 99656	2010-03-26 21:32:14 +00:00
Evan Cheng	1f419ac4da	LiveVariables should clear kill / dead markers first. This allows us to remove a hack in the scheduler. llvm-svn: 99597	2010-03-26 02:12:24 +00:00
Chris Lattner	bea905d2e9	fix a valgrind error on copy-constructor-synthesis.cpp, which is caused when the custom insertion hook deletes the instruction, then we try to set dead flags on it. Neither the code that I added nor the code that was there before was safe. llvm-svn: 99538	2010-03-25 18:49:10 +00:00
Evan Cheng	8c19404e5c	Scheduler assumes SDDbgValue nodes are in source order. That's true currently. But add an assertion to verify it. llvm-svn: 99501	2010-03-25 07:16:57 +00:00
Chris Lattner	13f444cdf1	Change tblgen to emit FOOISD opcode names as two bytes instead of one byte. This is important because we're running up to too many opcodes to fit in a byte and it is aggrevated by FIRST_TARGET_MEMORY_OPCODE making the numbering sparse. This just bites the bullet and bloats out the table. In practice, this increases the size of the x86 isel table from 74.5K to 76K. I think we'll cope :) This fixes rdar://7791648 llvm-svn: 99494	2010-03-25 06:33:05 +00:00
Evan Cheng	0ad874a05c	Remove a fixme that doesn't make sense any more. llvm-svn: 99489	2010-03-25 06:02:53 +00:00
Evan Cheng	0917f4a575	Make sure SDDbgValue.Invalid is initialized to false by all the constructors. llvm-svn: 99487	2010-03-25 05:50:26 +00:00
Chris Lattner	391902aa30	Make the NDEBUG assertion stronger and more clear what is happening. Enhance scheduling to set the DEAD flag on implicit defs more aggressively. Before, we'd set an implicit def operand to dead if it were present in the SDNode corresponding to the machineinstr but had no use. Now we do it in this case AND if the implicit def does not exist in the SDNode at all. This exposes a couple of problems: one is the FIXME, which causes a live intervals crash on CodeGen/X86/sibcall.ll. The second is that it makes machinecse and licm more aggressive (which is a good thing) but also exposes a case where licm hoists a set0 and then it doesn't get resunk. Talking to codegen folks about both these issues, but I need this patch in in the meantime. llvm-svn: 99485	2010-03-25 05:40:48 +00:00
Chris Lattner	7382099e8f	reapply 99444/99445, which I speculatively reverted in r99453. llvm-svn: 99482	2010-03-25 04:41:16 +00:00
Evan Cheng	abefdb6d68	Change how dbg_value sdnodes are converted into machine instructions. Their placement should be determined by the relative order of incoming llvm instructions. The scheduler will now use the SDNode ordering information to determine where to insert them. A dbg_value instruction is inserted after the instruction with the last highest source order and before the instruction with the next highest source order. It will optimize the placement by inserting right after the instruction that produces the value if they have consecutive order numbers. Here is a theoretical example that illustrates why the placement is important. tmp1 = store tmp1 -> x ... tmp2 = add ... ... call ... store tmp2 -> x Now mem2reg comes along: tmp1 = dbg_value (tmp1 -> x) ... tmp2 = add ... ... call ... dbg_value (tmp2 -> x) When the debugger examine the value of x after the add instruction but before the call, it should have the value of tmp1. Furthermore, for dbg_value's that reference constants, they should not be emitted at the beginning of the block (since they do not have "producers"). This patch also cleans up how SDISel manages DbgValue nodes. It allow a SDNode to be referenced by multiple SDDbgValue nodes. When a SDNode is deleted, it uses the information to find the SDDbgValues and invalidate them. They are not deleted until the corresponding SelectionDAG is destroyed. llvm-svn: 99469	2010-03-25 01:38:16 +00:00
Chris Lattner	0bbe76ccc4	revert 99444/99445. This doesn't cause the failure of 2006-07-19-stwbrx-crash.ll for me, but it's the only likely patch in the blame list of several bots. Lets see if this fixes it. llvm-svn: 99453	2010-03-24 23:41:19 +00:00
Chris Lattner	c7a6ab89fb	remove dead argument. llvm-svn: 99445	2010-03-24 22:47:12 +00:00
Chris Lattner	d581e0398e	split EmitNode in half to reduce indentation. llvm-svn: 99444	2010-03-24 22:45:47 +00:00
Dan Gohman	56a90ff2a1	Remove the ConvertActions table and associated code, which is unused. llvm-svn: 99372	2010-03-24 00:53:38 +00:00
Dan Gohman	ce6d4394a8	Revert 99335. getTypeToExpandTo's iterative behavior is actually needed here. llvm-svn: 99339	2010-03-23 22:44:42 +00:00
Dan Gohman	7814ddee81	Remove getTypeToExpandTo, since it isn't adding much value beyond just calling getTypeToTransformTo. llvm-svn: 99335	2010-03-23 22:15:31 +00:00
Mon P Wang	005a544af8	Fixed a widening bug where we were not using the correct size for the load llvm-svn: 98920	2010-03-19 01:19:52 +00:00
Anton Korobeynikov	23c07f492e	Get rid of target-specific nodes for fp16 <-> fp32 conversion. llvm-svn: 98888	2010-03-18 22:35:37 +00:00
Dan Gohman	ff37afdc4b	Define placement new wrappers for BumpPtrAllocator and RecyclingAllocator to allow client code to be simpler, and simplify several clients. llvm-svn: 98847	2010-03-18 18:49:47 +00:00
Bob Wilson	a8b566d854	Fix pr6543: svn r88806 changed MachineJumpTableInfo::getJumpTableIndex() to always create a new jump table. The intention was to avoid merging jump tables in SelectionDAGBuilder, and to wait for the branch folding pass to merge tables. Unfortunately, the same getJumpTableIndex() method is also used to merge tables in branch folding, so as a result of this change branch tables are never merged. Worse, the branch folding code is expecting getJumpTableIndex to always return the index of an existing table, but with this change, it never does so. In at least some cases, e.g., pr6543, this creates references to non-existent tables. I've fixed the problem by adding a new createJumpTableIndex function, which will always create a new table, and I've changed getJumpTableIndex to only look at existing tables. llvm-svn: 98845	2010-03-18 18:42:41 +00:00
Devang Patel	dd5c1fcba9	Fix comment. llvm-svn: 98830	2010-03-18 16:41:16 +00:00
Devang Patel	9ad039e65f	Debug info intrinsic does not intefer during tail call optimization. llvm-svn: 98778	2010-03-17 23:52:37 +00:00
Chris Lattner	cf7f134913	reapply r98656 unmodified, which exposed the asmprinter not handling constant unions. llvm-svn: 98680	2010-03-16 21:25:55 +00:00
Daniel Dunbar	faade5305c	Revert r98656, its breaking all over the place. llvm-svn: 98662	2010-03-16 19:35:34 +00:00
Chris Lattner	7a96045d0a	improve support for uniontype and ConstantUnion, patch by Tim Northover! llvm-svn: 98656	2010-03-16 19:15:03 +00:00
Devang Patel	db8c479e0d	Create SDDbgValue for dbg_value intrinsics and remember its connections with DAG nodes. This is a work in progress. Patch by Dale Johannesen! llvm-svn: 98568	2010-03-15 19:15:44 +00:00
Devang Patel	3244526a3b	Emit dwarf variable info communicated by code generator through DBG_VALUE machine instructions. This is a work in progress. llvm-svn: 98556	2010-03-15 18:33:46 +00:00
Chris Lattner	a8e4282df3	SIGN_EXTEND from the same type as the dest is valid. llvm-svn: 98548	2010-03-15 16:15:56 +00:00
Chris Lattner	89c2d22d3d	sink the call to VT.getSizeInBits() down into its uses, not all unary nodes necessarily have a simple result type. llvm-svn: 98547	2010-03-15 16:05:15 +00:00
Duncan Sands	217cec1786	Turn calls to copysignl into an FCOPYSIGN node. Handle FCOPYSIGN nodes with ppc_f128 type by having the type legalizer turn these back into a call to copysignl. llvm-svn: 98514	2010-03-14 21:08:40 +00:00
Evan Cheng	f91618ae9f	Rename SDDbgValue.h to SDNodeDbgValue.h for consistency. llvm-svn: 98513	2010-03-14 19:56:39 +00:00
Chris Lattner	70eca8f78e	fix ShrinkDemandedOps to not leave dead nodes around, fixing PR6607 llvm-svn: 98512	2010-03-14 19:46:02 +00:00
Chris Lattner	5393443ed6	rewrite ShrinkDemandedOps to be faster and indent less, no functionality change. llvm-svn: 98511	2010-03-14 19:43:04 +00:00
Chris Lattner	f656ba2910	make -view-isel-dags print after the 'ShrinkDemandedOps' pass. llvm-svn: 98509	2010-03-14 19:27:55 +00:00
Anton Korobeynikov	2cb4451ae6	Make default expansion for FP16 <-> FP32 nodes into libcalls llvm-svn: 98501	2010-03-14 18:42:24 +00:00
Anton Korobeynikov	95f830f289	Add DAG nodes to represent FP16 <-> FP32 intrinsics llvm-svn: 98500	2010-03-14 18:42:15 +00:00
Chris Lattner	2bdb0765f8	fix AsmPrinter::GetBlockAddressSymbol to always return a unique label instead of trying to form one based on the BB name (which causes collisions if the name is empty). This fixes PR6608 llvm-svn: 98495	2010-03-14 17:53:23 +00:00
Chris Lattner	9331acc6d7	get MMI out of the label uniquing business, just go to MCContext to get unique assembler temporary labels. llvm-svn: 98489	2010-03-14 08:36:50 +00:00
Chris Lattner	5fef80c5aa	change the LabelSDNode to be EHLabelSDNode and make it hold an MCSymbol. Make the EH_LABEL MachineInstr hold its label with an MCSymbol instead of ID. Fix a bug in MMI.cpp which would return labels named "Label4" instead of "label4". llvm-svn: 98463	2010-03-14 02:33:54 +00:00
Chris Lattner	149cf816bb	change EH related stuff (other than EH_LABEL) to use MCSymbol instead of label ID's. This cleans up and regularizes a bunch of code and makes way for future progress. Unfortunately, this pointed out to me that JITDwarfEmitter.cpp is largely copy and paste from DwarfException/MachineModuleInfo and other places. This is very sad and disturbing. :( One major change here is that TidyLandingPads moved from being called in DwarfException::BeginFunction to being called in DwarfException::EndFunction. There should not be any functionality change from doing this, but I'm not an EH expert. llvm-svn: 98459	2010-03-14 01:41:15 +00:00
Duncan Sands	086a80eee3	Revert turning copysignl into a COPYSIGN node for the moment: ppc calls copysignl with a 128 bit ppc long double, resulting in a node that the type legalizer doesn't know how to expand. llvm-svn: 98357	2010-03-12 17:41:34 +00:00
Duncan Sands	aeed41b97a	Now that it's supported, turn copysignl into a COPYSIGN node. llvm-svn: 98348	2010-03-12 12:13:59 +00:00
Duncan Sands	5dcc3b328d	Fix PR6522: implement copysign expansion for x86 long double (it seems that FreeBSD doesn't have copysignl). Done by removing a bunch of assumptions from the code. This may also help with sparc 128 bit floats. llvm-svn: 98346	2010-03-12 11:45:06 +00:00
Chris Lattner	80ab250a1c	fix PR6577, a bug in sdbuilder lowering select instructions whose true value was not Val#0. llvm-svn: 98336	2010-03-12 07:15:36 +00:00
Dan Gohman	6b1b9e37d7	Remove getWidenVectorType, which is no longer used. llvm-svn: 98289	2010-03-11 21:39:57 +00:00
Evan Cheng	e1c0438e7b	In case of tail call size of Ins and InVals may not match. llvm-svn: 98277	2010-03-11 19:38:18 +00:00
Daniel Dunbar	b24134670c	Remove dead include. llvm-svn: 98225	2010-03-11 02:28:48 +00:00
Chris Lattner	7cd70b8066	fix PR6533 by updating the br(xor) code to remember the case when it looked past a trunc. llvm-svn: 98203	2010-03-10 23:46:44 +00:00
Dale Johannesen	29afbd39e4	Cosmetic: lengthen names and improve comments. llvm-svn: 98202	2010-03-10 23:37:24 +00:00
Dale Johannesen	987770c05d	Progress towards shepherding debug info through SelectionDAG. No functional effect yet. This is still evolving and should not be viewed as final. llvm-svn: 98195	2010-03-10 22:13:47 +00:00
Dan Gohman	4c22c7a665	Fix another bitwidth calculation to handle vector types; based on a patch by Micah Villmow for PR6572. llvm-svn: 98188	2010-03-10 21:04:53 +00:00
Dan Gohman	0d69c61fec	Attempt to make this debug output meaningful, both in the case of multibyte opcodes and in the case of multiple scopes. llvm-svn: 98036	2010-03-09 02:15:05 +00:00
Dan Gohman	cc7ed51fa3	Print the correct index in the "match failed at index" message. llvm-svn: 98013	2010-03-09 00:07:36 +00:00
Dale Johannesen	b87c6c82e6	Add Order to SDDbgValue llvm-svn: 97939	2010-03-08 05:39:50 +00:00
Chris Lattner	f3ee582f23	Use Other as a sentinel instead of iAny. llvm-svn: 97914	2010-03-07 07:45:08 +00:00
Dale Johannesen	f0c8e76a85	Add some new bits of debug info handling. No functional change yet. llvm-svn: 97855	2010-03-06 00:03:23 +00:00
Dan Gohman	00a652eea0	Reapply r97778 and r97779, enabled only for unsigned i64 to f64 conversions. llvm-svn: 97854	2010-03-06 00:00:55 +00:00
Jakob Stoklund Olesen	67476519d7	Avoid creating bad PHI instructions when BR is being const-folded. llvm-svn: 97836	2010-03-05 21:49:10 +00:00
Chris Lattner	80aaccb987	Fix PR6497, a bug where we'd fold a load into an addc node which has a flag. That flag in turn was used by an already-selected adde which turned into an ADC32ri8 which used a selected load which was chained to the load we folded. This flag use caused us to form a cycle. Fix this by not ignoring chains in IsLegalToFold even in cases where the isel thinks it can. llvm-svn: 97791	2010-03-05 06:19:13 +00:00
Chris Lattner	d7353d219f	inline a small function with one call site. llvm-svn: 97789	2010-03-05 05:49:45 +00:00
Dan Gohman	ebdb1743d3	Revert r97778 and r97779. They're somehow breaking llvm-gcc builds. llvm-svn: 97781	2010-03-05 02:40:23 +00:00
Dan Gohman	0a01ba144d	Fix these constants to be more portable. llvm-svn: 97779	2010-03-05 02:13:10 +00:00
Dan Gohman	e5b9ea020f	Rewrite i64-to-f64 conversion using an algorithm which handles rounding correctly. This implementation is a generalization of the x86_64 code in compiler-rt. This fixes rdar://7683708. llvm-svn: 97778	2010-03-05 02:00:46 +00:00
Chris Lattner	860cbbb031	add a statistic for # times fastisel fails. llvm-svn: 97738	2010-03-04 19:46:56 +00:00
Dan Gohman	632e2a2b8c	Fix a typo Duncan noticed. llvm-svn: 97735	2010-03-04 19:11:28 +00:00
Chris Lattner	2bbca2de9e	change the new isel matcher to emit ComplexPattern matches as the very last thing before node emission. This should dramatically reduce the number of times we do 'MatchAddress' on X86, speeding up compile time. This also improves comments in the tables and shrinks the table a bit, now down to 80506 bytes for x86. llvm-svn: 97703	2010-03-04 01:23:08 +00:00
Dan Gohman	9f6d374ab7	Fix more code to work properly with vector operands. Based on a patch my Micah Villmow for PR6465. llvm-svn: 97692	2010-03-04 00:23:16 +00:00
Chris Lattner	19007009c8	inline CannotYetSelectIntrinsic into CannotYetSelect and simplify. llvm-svn: 97690	2010-03-04 00:21:16 +00:00
Dan Gohman	cdc603ecae	Fix a bug in SelectionDAG's ReplaceAllUsesWith in the case where CSE and recursive RAUW calls delete a node from the use list, invalidating the use list iterator. There's currently no known way to reproduce this in an unmodified LLVM, however there's no fundamental reason why a SelectionDAG couldn't be formed which would trigger this case. llvm-svn: 97665	2010-03-03 21:33:37 +00:00
Chris Lattner	9e7f00c3aa	add some of the more obscure predicate types to the Scope accelerator. llvm-svn: 97652	2010-03-03 07:46:25 +00:00
Chris Lattner	9889ed8c45	speed up scope node processing: if the first element of a scope entry we're about to process is obviously going to fail, don't bother pushing a scope only to have it immediately be popped. This avoids a lot of scope stack traffic in common cases. Unfortunately, this requires duplicating some of the predicate dispatch. To avoid duplicating the actual logic I pulled each predicate out to its own static function which gets used in both places. llvm-svn: 97651	2010-03-03 07:31:15 +00:00
Chris Lattner	92a814205f	introduce a new SwitchTypeMatcher node (which is analogous to SwitchOpcodeMatcher) and have DAGISelMatcherOpt form it. This speeds up selection, particularly for X86 which has lots of variants of instructions with only type differences. llvm-svn: 97645	2010-03-03 06:28:15 +00:00
Bill Wendling	65baaf9499	Use APInt instead of zext value. llvm-svn: 97631	2010-03-03 01:58:01 +00:00
Bill Wendling	d1f658563d	This test case: long test(long x) { return (x & 123124) \| 3; } Currently compiles to: _test: orl $3, %edi movq %rdi, %rax andq $123127, %rax ret This is because instruction and DAG combiners canonicalize (or (and x, C), D) -> (and (or, D), (C \| D)) However, this is only profitable if (C & D) != 0. It gets in the way of the 3-addressification because the input bits are known to be zero. llvm-svn: 97616	2010-03-03 00:35:56 +00:00
Chris Lattner	9c9c1158cb	Fix some issues in WalkChainUsers dealing with CopyToReg/CopyFromReg/INLINEASM. These are annoying because they have the same opcode before an after isel. Fix this by setting their NodeID to -1 to indicate that they are selected, just like what automatically happens when selecting things that end up being machine nodes. With that done, give IsLegalToFold a new flag that causes it to ignore chains. This lets the HandleMergeInputChains routine be the one place that validates chains after a match is successful, enabling the new hotness in chain processing. This smarter chain processing eliminates the need for "PreprocessRMW" in the X86 and MSP430 backends and enables MSP to start matching it's multiple mem operand instructions more aggressively. I currently #if out the dead code in the X86 backend and MSP backend, I'll remove it for real in a follow-on patch. The testcase changes are: test/CodeGen/X86/sse3.ll: we generate better code test/CodeGen/X86/store_op_load_fold2.ll: PreprocessRMW was miscompiling this before, we now generate correct code Convert it to filecheck while I'm at it. test/CodeGen/MSP430/Inst16mm.ll: Add a testcase for mem/mem folding to make anton happy. :) llvm-svn: 97596	2010-03-02 22:20:06 +00:00
Chris Lattner	0c14477270	run HandleMergeInputChains even if we only have one input chain. llvm-svn: 97581	2010-03-02 19:34:59 +00:00
Chris Lattner	2019e2922f	Fix the xfail I added a couple of patches back. The issue was that we weren't properly handling the case when interior nodes of a matched pattern become dead after updating chain and flag uses. Now we handle this explicitly in UpdateChainsAndFlags. llvm-svn: 97561	2010-03-02 07:50:03 +00:00
Chris Lattner	d23cbd049d	I was confused about this, it turns out that MorphNodeTo does delete ex-operands that become dead. llvm-svn: 97559	2010-03-02 07:14:49 +00:00
Chris Lattner	bd1d913a9d	factor node morphing out to its own helper method. llvm-svn: 97558	2010-03-02 06:55:04 +00:00
Chris Lattner	1707a88a2c	Sink InstructionSelect() out of each target into SDISel, and rename it DoInstructionSelection. Inline "SelectRoot" into it from DAGISelHeader. Sink some other stuff out of DAGISelHeader into SDISel. Eliminate the various 'Indent' stuff from various targets, which dates to when isel was recursive. 17 files changed, 114 insertions(+), 430 deletions(-) llvm-svn: 97555	2010-03-02 06:34:30 +00:00
Chris Lattner	9a28d163c2	Use the right induction variable. llvm-svn: 97541	2010-03-02 02:37:23 +00:00
Chris Lattner	0b41a42411	Rewrite chain handling validation and input TokenFactor handling stuff now that we don't care about emulating the old broken behavior of the old isel. This eliminates the 'CheckChainCompatible' check (along with IsChainCompatible) which did an incorrect and inefficient scan up the chain nodes which happened as the pattern was being formed and does the validation at the end in HandleMergeInputChains when it forms a structural pattern. This scans "down" the graph, which means that it is quickly bounded by nodes already selected. This also handles token factors that get "trapped" in the dag. Removing the CheckChainCompatible nodes also shrinks the generated tables by about 6K for X86 (down to 83K). There are two pieces remaining before I can nuke PreprocessRMW: 1. I xfailed a test because we're now producing worse code in a case that has nothing to do with the change: it turns out that our use of MorphNodeTo will leave dead nodes in the graph which (depending on how the graph is walked) end up causing bogus uses of chains and blocking matches. This is really bad for other reasons, so I'll fix this in a follow-up patch. 2. CheckFoldableChainNode needs to be improved to handle the TF. llvm-svn: 97539	2010-03-02 02:22:10 +00:00
Dan Gohman	56a20fc5eb	Fix several places to handle vector operands properly. Based on a patch by Micah Villmow for PR6438. llvm-svn: 97538	2010-03-02 02:14:38 +00:00
Bill Wendling	5990930d72	Remove dead parameter passing. llvm-svn: 97536	2010-03-02 01:55:18 +00:00
Chris Lattner	53bd8b1717	remove dead code. llvm-svn: 97529	2010-03-02 00:40:26 +00:00
Chris Lattner	e6f86e288c	refactor some code out of OPC_EmitMergeInputChains into a new helper function. llvm-svn: 97525	2010-03-02 00:00:03 +00:00
Chris Lattner	4ecd0eb275	remove all but one version of SelectionDAG::MorphNodeTo (the most general) the others are dead. llvm-svn: 97511	2010-03-01 22:20:05 +00:00
Chris Lattner	b65ac4a796	Accelerate isel dispatch for tables that start with a top-level OPC_SwitchOpcode to use a table lookup instead of having to go through the interpreter for this. llvm-svn: 97469	2010-03-01 18:47:11 +00:00
Dan Gohman	99c98139c7	Fix optimization of ISD::TRUNCATE on vector operands. Based on a patch by Micah Villmow for PR6335. llvm-svn: 97461	2010-03-01 17:59:21 +00:00
Chris Lattner	f62ad24616	some trivial microoptimizations. llvm-svn: 97441	2010-03-01 07:43:08 +00:00
Chris Lattner	cdfa80eaaf	eliminate the CheckMultiOpcodeMatcher code and have each ComplexPattern at the root be generated multiple times, once for each opcode they are part of. This encourages factoring because the opcode checks get treated just like everything else in the matcher. llvm-svn: 97439	2010-03-01 07:17:40 +00:00
Chris Lattner	8529ea0237	add a new OPC_SwitchOpcode which is semantically equivalent to a scope where every child starts with a CheckOpcode, but executes more efficiently. Enhance DAGISelMatcherOpt to form it. This also fixes a bug in CheckOpcode: apparently the SDNodeInfo objects are not pointer comparable, we have to compare the enum name. llvm-svn: 97438	2010-03-01 06:59:22 +00:00
Chris Lattner	4408939f12	eliminate GetInt1/2 llvm-svn: 97426	2010-02-28 22:38:43 +00:00
Chris Lattner	afa7d2eacc	hoist the new isel interpreter out of DAGISelHeader.h (which gets #included into the middle of each target's DAGISel class) into a .cpp file where it is only compiled once. llvm-svn: 97425	2010-02-28 22:37:22 +00:00
Chris Lattner	61a0c6674e	enhance the new isel to handle the 'node already exists' case of MorphNodeTo directly. llvm-svn: 97417	2010-02-28 21:36:14 +00:00
Chris Lattner	f6124c5583	simplify this code, return only ever has zero or one operands. llvm-svn: 97408	2010-02-28 18:53:13 +00:00
Evan Cheng	94051bc37e	Re-apply 97040 with fix. This survives a ppc self-host llvm-gcc bootstrap. llvm-svn: 97310	2010-02-27 07:36:59 +00:00
Dale Johannesen	516867fd79	Move dbg_value generation to target-independent FastISel, as X86 is currently the only FastISel target. Per review. llvm-svn: 97255	2010-02-26 20:01:55 +00:00
Dan Gohman	17447493ea	Fix ExpandVectorBuildThroughStack for the case where the operands are themselves vectors. Based on a patch by Micah Villmow for PR6338. llvm-svn: 97165	2010-02-25 20:30:49 +00:00
Dan Gohman	084112437d	Revert r97064. Duncan pointed out that bitcasts are defined in terms of store and load, which means bitcasting between scalar integer and vector has endian-specific results, which undermines this whole approach. llvm-svn: 97137	2010-02-25 15:20:39 +00:00
Chris Lattner	5ca790deef	clean up various VT manipulations, patch by Micah Villmow! PR6337 llvm-svn: 97072	2010-02-24 22:44:06 +00:00
Dan Gohman	424e8f22d0	Make getTypeSizeInBits work correctly for array types; it should return the number of value bits, not the number of bits of allocation for in-memory storage. Make getTypeStoreSize and getTypeAllocSize work consistently for arrays and vectors. Fix several places in CodeGen which compute offsets into in-memory vectors to use TargetData information. This fixes PR1784. llvm-svn: 97064	2010-02-24 22:05:23 +00:00
Chris Lattner	75062cedf7	convert cycle checker to smallptrset, add comments and make it more elegant. llvm-svn: 97059	2010-02-24 21:34:04 +00:00
Chris Lattner	fbafa903b5	revert david's patch which does not even build. llvm-svn: 97057	2010-02-24 21:25:08 +00:00
David Greene	68c9ece3da	Use a SmallPtrSet as suggested by Chris. llvm-svn: 97056	2010-02-24 20:59:49 +00:00
Daniel Dunbar	24c99e027e	Speculatively revert r97011, "Re-apply 96540 and 96556 with fixes.", again in the hopes of fixing PPC bootstrap. llvm-svn: 97040	2010-02-24 17:05:47 +00:00
Dan Gohman	c0c6077fed	When forming SSE min and max nodes for UGE and ULE comparisons, it's necessary to swap the operands to handle NaN and negative zero properly. Also, reintroduce logic for checking for NaN conditions when forming SSE min and max instructions, fixed to take into consideration NaNs and negative zeros. This allows forming min and max instructions in more cases. llvm-svn: 97025	2010-02-24 06:52:40 +00:00
Chris Lattner	52a02205d8	Change the scheduler from adding nodes in allnodes order to adding them in a determinstic order (bottom up from the root) based on the structure of the graph itself. This updates tests for some random changes, interesting bits: CodeGen/Blackfin/promote-logic.ll no longer crashes. I have no idea why, but that's good right? CodeGen/X86/2009-07-16-LoadFoldingBug.ll also fails, but now compiles to have one fewer constant pool entry, making the expected load that was being folded disappear. Since it is an unreduced mass of gnast, I just removed it. This fixes PR6370 llvm-svn: 97023	2010-02-24 06:11:37 +00:00
Chris Lattner	b1b5df8a16	add node #'s to debug dumps. llvm-svn: 97019	2010-02-24 04:24:44 +00:00
Evan Cheng	5787cd9349	Re-apply 96540 and 96556 with fixes. llvm-svn: 97011	2010-02-24 01:42:31 +00:00
Chris Lattner	80c14ff96b	make selectnodeto set the nodeid to -1. This makes it more akin to creating a new node then replacing uses. llvm-svn: 97000	2010-02-23 23:01:35 +00:00
Chris Lattner	f6e9f39042	fix a bug in findNonImmUse (used by IsLegalToFold) where nodes with no id's would cause early exit allowing IsLegalToFold to return true instead of false, producing a cyclic dag. This was striking the new isel because it isn't using SelectNodeTo yet, which theoretically is just an optimization. llvm-svn: 96972	2010-02-23 19:32:27 +00:00
Chris Lattner	4d568129c4	Print node ID's in dumps and views if set. llvm-svn: 96971	2010-02-23 19:31:18 +00:00
David Greene	dad0944642	Speed up cycle checking significantly by caching results. llvm-svn: 96956	2010-02-23 17:37:50 +00:00
Duncan Sands	5d5cce2e19	Revert commits 96556 and 96640, because commit 96556 breaks the dragonegg self-host build. I reverted 96640 in order to revert 96556 (96640 goes on top of 96556), but it also looks like with both of them applied the breakage happens even earlier. The symptom of the 96556 miscompile is the following crash: llvm[3]: Compiling AlphaISelLowering.cpp for Release build cc1plus: /home/duncan/tmp/tmp/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp:4982: void llvm::SelectionDAG::ReplaceAllUsesWith(llvm::SDNode, llvm::SDNode, llvm::SelectionDAG::DAGUpdateListener*): Assertion `(!From->hasAnyUseOfValue(i) \|\| From->getValueType(i) == To->getValueType(i)) && "Cannot use this version of ReplaceAllUsesWith!"' failed. Stack dump: 0. Running pass 'X86 DAG->DAG Instruction Selection' on function '@_ZN4llvm19AlphaTargetLowering14LowerOperationENS_7SDValueERNS_12SelectionDAGE' g++: Internal error: Aborted (program cc1plus) This occurs when building LLVM using LLVM built by LLVM (via dragonegg). Probably LLVM has miscompiled itself, though it may have miscompiled GCC and/or dragonegg itself: at this point of the self-host build, all of GCC, LLVM and dragonegg were built using LLVM. Unfortunately this kind of thing is extremely hard to debug, and while I did rummage around a bit I didn't find any smoking guns, aka obviously miscompiled code. Found by bisection. r96556 \| evancheng \| 2010-02-18 03:13:50 +0100 (Thu, 18 Feb 2010) \| 5 lines Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" r96640 \| evancheng \| 2010-02-19 01:34:39 +0100 (Fri, 19 Feb 2010) \| 16 lines Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96672	2010-02-19 11:30:41 +00:00
Evan Cheng	32031f7404	Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96640	2010-02-19 00:34:39 +00:00
Evan Cheng	9af06dfc83	Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" llvm-svn: 96556	2010-02-18 02:13:50 +00:00
David Greene	d93fb1f15d	Make the non-temporal bit "significant" in MemSDNodes so they aren't CSE'd or otherwise combined with temporal MemSDNodes. llvm-svn: 96505	2010-02-17 20:21:42 +00:00
Chris Lattner	955ce23e27	sink special case "cannotyetselect" for intrinsics out of the tblgen splatted code into the implementation. llvm-svn: 96460	2010-02-17 06:28:22 +00:00
Duncan Sands	1b33dd3c83	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Evan Cheng	df9a8c9d49	Fix a memory leak. Patch by Nicolas Geoffray. llvm-svn: 96295	2010-02-15 23:16:53 +00:00
Evan Cheng	b5fe25544c	Split SelectionDAGISel::IsLegalAndProfitableToFold to IsLegalToFold and IsProfitableToFold. The generic version of the later simply checks whether the folding candidate has a single use. This allows the target isel routines more flexibility in deciding whether folding makes sense. The specific case we are interested in is folding constant pool loads with multiple uses. llvm-svn: 96255	2010-02-15 19:41:07 +00:00
David Greene	4f983d569c	Add non-temporal flags and remove an assumption of default arguments. llvm-svn: 96240	2010-02-15 17:00:31 +00:00
Duncan Sands	2acaf3609c	Uniformize the names of type predicates: rather than having isFloatTy and isInteger, we now have isFloatTy and isIntegerTy. Requested by Chris! llvm-svn: 96223	2010-02-15 16:12:20 +00:00
Jakob Stoklund Olesen	3aca1b0249	Use array_pod_sort instead of std::sort for improved code size. Use SmallVector instead of std::vector for better speed when indirectbr has few successors. llvm-svn: 95879	2010-02-11 18:06:56 +00:00
Jakob Stoklund Olesen	215d9f3898	Remove duplicate successors from indirectbr instructions before building the machine CFG. This makes early tail duplication run 60 times faster when compiling the Firefox JavaScript interpreter, see PR6186. llvm-svn: 95831	2010-02-11 00:34:18 +00:00
Mon P Wang	c17e781f35	The previous fix of widening divides that trap was too fragile as it depends on custom lowering and requires that certain types exist in ValueTypes.h. Modified widening to check if an op can trap and if so, the widening algorithm will apply only the op on the defined elements. It is safer to do this in widening because the optimizer can't guarantee removing unused ops in some cases. llvm-svn: 95823	2010-02-10 23:37:45 +00:00
Dan Gohman	92b6122204	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
Evan Cheng	8bee7fb61d	Now that ShrinkDemandedOps() is separated out from DAG combine. It sometimes leave some obvious nops which dag combine used to clean up afterwards e.g. (trunk (ext n)) -> n. Look for them and squash them. llvm-svn: 95757	2010-02-10 02:17:34 +00:00
Evan Cheng	8c2662f96a	Emit an error for illegal inline asm constraint (which uses illegal type) rather than asserting. llvm-svn: 95746	2010-02-10 01:21:02 +00:00
Dale Johannesen	c9f253214e	Fix comments to reflect renaming elsewhere. llvm-svn: 95730	2010-02-10 00:11:11 +00:00
David Greene	d2c34ce826	Only dump output in debug mode. llvm-svn: 95711	2010-02-09 23:03:05 +00:00
Chris Lattner	7acf9be6c4	move target-independent opcodes out of TargetInstrInfo into TargetOpcodes.h. #include the new TargetOpcodes.h into MachineInstr. Add new inline accessors (like isPHI()) to MachineInstr, and start using them throughout the codebase. llvm-svn: 95687	2010-02-09 19:54:29 +00:00
Dale Johannesen	5ddd318b03	Apply the 95471 fix to SelectionDAGBuilder as well; we can get in here if FastISel gives up in a block. (Actually the two copies of this need to be unified. Later.) llvm-svn: 95579	2010-02-08 21:53:27 +00:00
Dan Gohman	f113e5466c	In guaranteed tailcall mode, don't decline the tailcall optimization for blocks ending in "unreachable". llvm-svn: 95565	2010-02-08 20:34:14 +00:00
Dale Johannesen	83f31d3511	After Victor's latest commits I am seeing null addresses in dbg.declare; ignore this for the moment to prevent things from breaking. llvm-svn: 95471	2010-02-06 02:26:02 +00:00
Evan Cheng	94fe5501b7	When the scheduler unfold a load folding instruction it move some of the predecessors to the unfolded load. It decides what gets moved to the load by checking whether the new load is using the predecessor as an operand. The check neglects the cases whether the predecessor is a flagged scheduling unit. rdar://7604000 llvm-svn: 95339	2010-02-05 01:27:11 +00:00
Evan Cheng	ce5962aaf9	Fix typo Duncan noticed. llvm-svn: 95322	2010-02-04 19:07:06 +00:00
Evan Cheng	3c93245a64	It's too risky to eliminate sext / zext of call results for tail call optimization even if the caller / callee attributes completely match. The callee may have been bitcast'ed (or otherwise lied about what it's doing). llvm-svn: 95282	2010-02-04 02:45:02 +00:00
Evan Cheng	e273e42195	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Evan Cheng	d9cf09b0d6	Allow all types of callee's to be tail called. But avoid automatic tailcall if the callee is a result of bitcast to avoid losing necessary zext / sext etc. llvm-svn: 95195	2010-02-03 03:28:02 +00:00
Evan Cheng	9057fea7ef	Revert 95130. llvm-svn: 95160	2010-02-02 23:55:14 +00:00
Evan Cheng	48375fbf4f	Pass callsite return type to TargetLowering::LowerCall and use that to check sibcall eligibility. llvm-svn: 95130	2010-02-02 21:29:10 +00:00
Mon P Wang	65b01b6ce7	Improve EXTRACT_VECTOR_ELT patch based on comments from Duncan llvm-svn: 95012	2010-02-01 22:15:09 +00:00
Chris Lattner	af2e10ddef	eliminate a bunch of pointless LLVMContext arguments. llvm-svn: 95001	2010-02-01 20:48:08 +00:00
Dale Johannesen	1483e87700	fix PR 6157. Testcase pending. llvm-svn: 94996	2010-02-01 19:54:53 +00:00
Mon P Wang	f9fa3aa2c0	Fixed a couple of optimization with EXTRACT_VECTOR_ELT that assumes the result type is the same as the element type of the vector. EXTRACT_VECTOR_ELT can be used to extended the width of an integer type. This fixes a bug for Generic/vector-casts.ll on a ppc750. llvm-svn: 94990	2010-02-01 19:03:18 +00:00
Duncan Sands	6b277c2823	Change the SREM case to match the logic in the IR version ComputeMaskedBits. llvm-svn: 94805	2010-01-29 09:45:26 +00:00
Bill Wendling	d408e7be4e	Assign the ordering of SDNodes in a much less intrusive fashion. After the "visit*" method is called, take the newly created nodes, walk them in a DFS fashion, and if they don't have an ordering set, then give it one. llvm-svn: 94757	2010-01-28 21:51:40 +00:00
Jim Grosbach	7474937004	Update of 94055 to track the IR level call site information via an intrinsic. This allows code gen and the exception table writer to cooperate to make sure landing pads are associated with the correct invoke locations. llvm-svn: 94726	2010-01-28 01:45:32 +00:00
Evan Cheng	237629e476	Eliminate target hook IsEligibleForTailCallOptimization. Target independent isel should always pass along the "tail call" property. Change target hook LowerCall's parameter "isTailCall" into a refernce. If the target decides it's impossible to honor the tail call request, it should set isTailCall to false to make target independent isel happy. llvm-svn: 94626	2010-01-27 00:07:07 +00:00
Evan Cheng	c674548602	Allow some automatic tailcall optimization without changing ABI. llvm-svn: 94611	2010-01-26 23:13:04 +00:00
Chris Lattner	967a37fb26	eliminate the TargetLowering::UsesGlobalOffsetTable bool, which is subsumed by TargetLowering::getJumpTableEncoding(). Change uses of it to be more specific. llvm-svn: 94529	2010-01-26 06:53:37 +00:00
Chris Lattner	d87c50833a	Move getJTISymbol from MachineJumpTableInfo to MachineFunction, which is more convenient, and change getPICJumpTableRelocBaseExpr to take a MachineFunction to match. Next, move the X86 code that create a PICBase symbol to X86TargetLowering::getPICBaseSymbol from X86MCInstLower::GetPICBaseSymbol, which was an asmprinter specific library. This eliminates a 'gross hack', and allows us to implement X86ISelLowering::getPICJumpTableRelocBaseExpr which now calls it. This in turn allows us to eliminate the X86AsmPrinter::printPICJumpTableSetLabel method, which was the only overload of printPICJumpTableSetLabel. llvm-svn: 94526	2010-01-26 06:28:43 +00:00
Chris Lattner	a9b1ea03e9	add a new MachineJumpTableInfo::getJTISymbol method, use it to implement the default TargetLowering::getPICJumpTableRelocBaseExpr llvm-svn: 94523	2010-01-26 05:58:28 +00:00
Chris Lattner	3b2bf7ab66	stub out a new target hook, need some refactoring before I can implement it. llvm-svn: 94521	2010-01-26 05:30:30 +00:00
Evan Cheng	548d00d77c	Implement cond ? -1 : 0 with sbb. llvm-svn: 94490	2010-01-26 02:00:44 +00:00
Dale Johannesen	d75f9dc3ff	Generate DEBUG_VALUE comments on x86. The (limited) dbg.declare's we currently generate go through both register allocators without perturbing the results. llvm-svn: 94480	2010-01-26 00:09:58 +00:00
Chris Lattner	efdc572e44	Rearrange handling of jump tables. Highlights: 1. MachineJumpTableInfo is now created lazily for a function the first time it actually makes a jump table instead of for every function. 2. The encoding of jump table entries is now described by the MachineJumpTableInfo::JTEntryKind enum. This enum is determined by the TLI::getJumpTableEncoding() hook, instead of by lots of code scattered throughout the compiler that "knows" that jump table entries are always 32-bits in pic mode (for example). 3. The size and alignment of jump table entries is now calculated based on their kind, instead of at machinefunction creation time. Future work includes using the EntryKind in more places in the compiler, eliminating other logic that "knows" the layout of jump tables in various situations. llvm-svn: 94470	2010-01-25 23:26:13 +00:00
Chris Lattner	5a57121631	make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. llvm-svn: 94378	2010-01-24 20:43:08 +00:00
Mon P Wang	d4d1cbb72b	It seems better to scalarize vectors of size 1 instead of widening them. Add support to widen SETCC. llvm-svn: 94342	2010-01-24 00:24:43 +00:00
Mon P Wang	871ea08e40	Improved widening loads by adding support for wider loads if the alignment allows. Fixed a bug where we didn't use a vector load/store for PR5626. llvm-svn: 94338	2010-01-24 00:05:03 +00:00
Bill Wendling	7449bb010b	Remove the '-disable-scheduling' flag and replace it with the 'source' option of the '-pre-RA-sched' flag. It actually makes more sense to do it this way. Also, keep track of the SDNode ordering by default. Eventually, we would like to make this ordering a way to break a "tie" in the scheduler. However, doing that now breaks the "CodeGen/X86/abi-isel.ll" test for 32-bit Linux. llvm-svn: 94308	2010-01-23 10:26:57 +00:00
Evan Cheng	a238930f0b	Enable pre-regalloc scheduling load clustering by default. llvm-svn: 94255	2010-01-22 23:49:45 +00:00
Chris Lattner	276811b58a	Stop building RTTI information for most llvm libraries. Notable missing ones are libsupport, libsystem and libvmcore. libvmcore is currently blocked on bugpoint, which uses EH. Once it stops using EH, we can switch it off. This #if 0's out 3 unit tests, because gtest requires RTTI information. Suggestions welcome on how to fix this. llvm-svn: 94164	2010-01-22 06:49:46 +00:00
Evan Cheng	72dbbce547	Teach pre-regalloc scheduler to schedule loads from nearby addresses. It may improve cache locality. This is controlled by -cluster-loads for now. llvm-svn: 94148	2010-01-22 03:36:51 +00:00
Evan Cheng	c3ad04c825	Trim unneeded includes. llvm-svn: 94105	2010-01-21 21:44:43 +00:00
Jim Grosbach	57ca094a52	back this out for now. Growing Function is not good. llvm-svn: 94097	2010-01-21 20:10:22 +00:00
Jim Grosbach	cf6e6e1c79	Make sure that landing pad entries in the EH call site table are in the proper order for SjLj style exception handling. llvm-svn: 94055	2010-01-21 00:43:30 +00:00
David Greene	80fdc554d0	When XDEBUG is enabled, check for SelectionDAG cycles at some key points. This will help us find future problems like the one described in PR6019. llvm-svn: 94019	2010-01-20 20:13:31 +00:00
David Greene	1908ff46da	Add some asserts to check SelectionDAG problems earlier. llvm-svn: 93960	2010-01-20 00:59:23 +00:00
Dan Gohman	34b548b94a	Fold (add x, shl(0 - y, n)) -> sub(x, shl(y, n)), to simplify some code that SCEVExpander can produce when running on behalf of LSR. llvm-svn: 93949	2010-01-19 23:30:49 +00:00
David Greene	fde2825063	Add some new debugging APIs to print out "raw" SelectionDAGs to make understanding CannotYTetSelect and other errors easier. llvm-svn: 93901	2010-01-19 20:37:34 +00:00
Dale Johannesen	e1ba7ecf45	Revert 93811 per request. llvm-svn: 93818	2010-01-19 00:10:52 +00:00
Dale Johannesen	0b8b2713d3	Enable code to emit dbg.declare as DEBUG_VALUE comments (fast isel, X86). This doesn't seem to break any functionality, but will introduce cases where -g affects the generated code. I'll be fixing that. llvm-svn: 93811	2010-01-18 23:34:55 +00:00
Evan Cheng	5cf9d23e4e	Canonicalize -1 - x to ~x. Instcombine does this but apparently there are situations where this pattern will escape the optimizer and / or created by isel. Here is a case that's seen in JavaScriptCore: %t1 = sub i32 0, %a %t2 = add i32 %t1, -1 The dag combiner pattern: ((c1-A)+c2) -> (c1+c2)-A will fold it to -1 - %a. llvm-svn: 93773	2010-01-18 21:38:44 +00:00
Kenneth Uildriks	d6b30baf78	When checking for sret-demotion, it needs to use legal types. When using the return value of an sret-demoted call, it needs to use possibly illegal types that match the declared Type of the callee. llvm-svn: 93667	2010-01-16 23:37:33 +00:00
David Greene	d8faccbeab	Add some debug routines to SelectionDAG to dump full DAGs. print/dumpWithDepth allows one to dump a DAG up to N levels deep. dump/printWithFullDepth prints the whole DAG, subject to a depth limit on 100 in the default case (to prevent infinite recursion). Have CannotYetSelect to a dumpWithFullDepth so it is clearer exactly what the non-matching DAG looks like. llvm-svn: 93538	2010-01-15 19:43:23 +00:00
Victor Hernandez	c1b5223e76	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. It also strips old llvm.dbg.declare intrinsics that did not pass metadata as the first argument. llvm-svn: 93531	2010-01-15 19:04:09 +00:00
Victor Hernandez	97d7107d5e	Revert r93504 because older uses of llvm.dbg.declare intrinsics need to be auto-upgraded llvm-svn: 93515	2010-01-15 17:36:47 +00:00
Victor Hernandez	aee71b4e81	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. llvm-svn: 93504	2010-01-15 03:37:48 +00:00
Jim Grosbach	7239c4c92e	fix 80-column violations llvm-svn: 93487	2010-01-15 00:36:15 +00:00
Dan Gohman	7c596d2b00	Fix a codegen abort seen in 483.xalancbmk. llvm-svn: 93417	2010-01-14 03:08:49 +00:00
Dan Gohman	2cd1b789c7	Update a partially obsolete comment. llvm-svn: 93228	2010-01-12 04:32:35 +00:00
Dan Gohman	da0bcb49b5	Fix a typo in a comment. llvm-svn: 93227	2010-01-12 04:30:26 +00:00
Jakob Stoklund Olesen	f1c71ef6ba	Avoid adding PHI arguments for a predecessor that has gone away when a BRCOND was constant folded. This fixes PR5980. llvm-svn: 93184	2010-01-11 21:02:33 +00:00
Mon P Wang	e8470bbcc4	Disable transformation of select of two loads to a select of address and then a load if the loads are not in the default address space because the transformation discards src value info. llvm-svn: 93180	2010-01-11 20:12:49 +00:00
Dan Gohman	3708af1c59	Revert an earlier change to SIGN_EXTEND_INREG for vectors. The VTSDNode really does need to be a vector type, because TargetLowering::getOperationAction for SIGN_EXTEND_INREG uses that type, and it needs to be able to distinguish between vectors and scalars. Also, fix some more issues with legalization of vector casts. llvm-svn: 93043	2010-01-09 02:13:55 +00:00
Evan Cheng	afa00d14db	Dan pointed out checking whether a node is dead by comparing its opcode to ISD::DELETED_NODE is not safe. Use a DAGUpdateListener to remove dead nodes from work list instead. llvm-svn: 93031	2010-01-09 00:21:08 +00:00
Evan Cheng	f96a9ec02b	ReplaceAllUsesOfValueWith may delete other nodes that the one being replaced. Do not delete dead nodes again. llvm-svn: 92988	2010-01-08 02:36:12 +00:00
Chris Lattner	e0199dff81	Fix rdar://7517201, a regression introduced by r92849. When folding a and(any_ext(load)) both the any_ext and the load have to have only a single use. This removes the anyext-uses.ll testcase which started failing because it is unreduced and unclear what it is testing. llvm-svn: 92950	2010-01-07 21:59:23 +00:00
Chris Lattner	f68e328a99	factor this code better and reduce nesting at the same time, no functionality change. llvm-svn: 92948	2010-01-07 21:53:27 +00:00
Evan Cheng	4523041394	APInt'fy TargetLowering::SimplifySetCC to fix PR5963. llvm-svn: 92943	2010-01-07 20:58:44 +00:00
Benjamin Kramer	22ec2524fd	Use pop_back_val instead of back()+pop_back. llvm-svn: 92918	2010-01-07 17:27:56 +00:00
Evan Cheng	f9dade0634	Comment. llvm-svn: 92850	2010-01-06 19:43:21 +00:00
Evan Cheng	25dcf9b830	Teach dag combine to fold the following transformation more aggressively: (OP (trunc x), (trunc y)) -> (trunc (OP x, y)) Unfortunately this simple change causes dag combine to infinite looping. The problem is the shrink demanded ops optimization tend to canonicalize expressions in the opposite manner. That is badness. This patch disable those optimizations in dag combine but instead it is done as a late pass in sdisel. This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look pass ISD::TRUNCATE in various places. llvm-svn: 92849	2010-01-06 19:38:29 +00:00
Bill Wendling	b4f0d6e3b6	The previous code could potentially cause a cycle. Allow ordering w.r.t. a 0 order. llvm-svn: 92810	2010-01-06 00:23:35 +00:00
Bill Wendling	41e18c3512	Only check the ordering if there is an ordering for each nodes. llvm-svn: 92807	2010-01-06 00:09:23 +00:00
Bill Wendling	b7d6746476	Add a semi-primitive form of scheduling via the "SDNode ordering" to the bottom-up scheduler. We prefer the lower order number. llvm-svn: 92806	2010-01-05 23:48:12 +00:00
Bill Wendling	7e9607ab56	Don't assign the shift the same type as the variable being shifted. This could result in illegal types for the SHL operator. llvm-svn: 92797	2010-01-05 22:39:10 +00:00
Dan Gohman	9ef9e2c758	Don't use the ISD::NodeType enum for SDNode opcodes, as CodeGen uses several kinds of opcode values which are not declared within that enum. This fixes PR5946. llvm-svn: 92794	2010-01-05 22:26:32 +00:00
Benjamin Kramer	e90a3c66c4	Avoid going through the LLVMContext for type equality where it's safe to dereference the type pointer. llvm-svn: 92726	2010-01-05 13:12:22 +00:00
Devang Patel	9c02d20409	Delete renaming use of dead dbg intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start llvm-svn: 92672	2010-01-05 01:47:06 +00:00
David Greene	e5bee4794b	Change errs() to dbgs(). llvm-svn: 92597	2010-01-05 01:26:11 +00:00
David Greene	b1764ac8cb	Change errs() to dbgs(). llvm-svn: 92581	2010-01-05 01:25:11 +00:00
David Greene	54706ebebb	Change errs() to dbgs(). llvm-svn: 92580	2010-01-05 01:25:09 +00:00
David Greene	4f5be2f621	Change errs() to dbgs(). llvm-svn: 92579	2010-01-05 01:25:04 +00:00
David Greene	bf6025f893	Change errs() to dbgs(). llvm-svn: 92578	2010-01-05 01:25:00 +00:00
David Greene	2c9dbf7b18	Change errs() to dbgs(). llvm-svn: 92577	2010-01-05 01:24:57 +00:00
David Greene	3a4b23b913	Change errs() to dbgs(). llvm-svn: 92576	2010-01-05 01:24:54 +00:00
David Greene	a9fbae472c	Change errs() to dbgs(). llvm-svn: 92575	2010-01-05 01:24:53 +00:00
David Greene	5201625e9c	Change errs() to dbgs(). llvm-svn: 92574	2010-01-05 01:24:50 +00:00
David Greene	aaf3acc471	Change errs() to dbgs(). llvm-svn: 92573	2010-01-05 01:24:48 +00:00
David Greene	4c9eb54e83	Change errs() to dbgs(). llvm-svn: 92572	2010-01-05 01:24:45 +00:00
David Greene	216139c074	Change errs() to dbgs(). llvm-svn: 92571	2010-01-05 01:24:43 +00:00
David Greene	82adcc9c8f	Change errs() to dbgs(). llvm-svn: 92570	2010-01-05 01:24:40 +00:00
David Greene	616c94ab91	Change errs() to dbgs(). llvm-svn: 92569	2010-01-05 01:24:36 +00:00
David Greene	b22b413dff	Change errs() to dbgs(). llvm-svn: 92568	2010-01-05 01:24:34 +00:00
Dan Gohman	9bcfdf98f1	Change SelectCode's argument from SDValue to SDNode , to make it more clear what information these functions are actually using. This is also a micro-optimization, as passing a SDNode around is simpler than passing a { SDNode *, int } by value or reference. llvm-svn: 92564	2010-01-05 01:24:18 +00:00
Dan Gohman	2754bdcfdb	Use a pointer type rather than MVT::Other for the ExternalSymbol node used in an inline asm. llvm-svn: 92512	2010-01-04 21:00:54 +00:00
Chris Lattner	fe8af82cd4	Teach codegen to handle: (X != null) \| (Y != null) --> (X\|Y) != 0 (X == null) & (Y == null) --> (X\|Y) == 0 so that instcombine can stop doing this for pointers. This is part of PR3351, which is a case where instcombine doing this for pointers (inserting ptrtoint) is pessimizing code. llvm-svn: 92406	2010-01-02 00:00:03 +00:00
Chris Lattner	330323f780	whitespace cleanup llvm-svn: 92404	2010-01-01 23:37:34 +00:00
Mikhail Glushenkov	e862cc48f3	Fix a warning on gcc 4.4. SelectionDAGBuilder.cpp:4294: warning: suggest explicit braces to avoid ambiguous ‘else’ llvm-svn: 92395	2010-01-01 04:41:36 +00:00
Mikhail Glushenkov	0fba686958	Trailing whitespace, 80-col violations. llvm-svn: 92394	2010-01-01 04:41:22 +00:00
Chris Lattner	44298d184a	Teach codegen to lower llvm.powi to an efficient (but not optimal) multiply sequence when the power is a constant integer. Before, our codegen for std::pow(.., int) always turned into a libcall, which was really inefficient. This should also make many gfortran programs happier I'd imagine. llvm-svn: 92388	2010-01-01 03:32:16 +00:00
Chris Lattner	d981b3fc91	remove a bunch of unneeded functions. llvm-svn: 92263	2009-12-29 09:32:19 +00:00
Chris Lattner	84e9de4a58	Final step in the metadata API restructuring: move the getMDKindID/getMDKindNames methods to LLVMContext (and add convenience methods to Module), eliminating MetadataContext. Move the state that it maintains out to LLVMContext. llvm-svn: 92259	2009-12-29 09:01:33 +00:00
Chris Lattner	9ec640a902	This is a major cleanup of the instruction metadata interfaces that I asked Devang to do back on Sep 27. Instead of going through the MetadataContext class with methods like getMD() and getMDs(), just ask the instruction directly for its metadata with getMetadata() and getAllMetadata(). This includes a variety of other fixes and improvements: previously all Value*'s were bloated because the HasMetadata bit was thrown into value, adding a 9th bit to a byte. Now this is properly sunk down to the Instruction class (the only place where it makes sense) and it will be folded away somewhere soon. This also fixes some confusion in getMDs and its clients about whether the returned list is indexed by the MDID or densely packed. This is now returned sorted and densely packed and the comments make this clear. This introduces a number of fixme's which I'll follow up on. llvm-svn: 92235	2009-12-28 23:41:32 +00:00
Chris Lattner	cd3aa9d1ff	rename getMDKind -> getMDKindID, make it autoinsert if an MD Kind doesn't exist already, eliminate registerMDKind. Tidy up a bunch of random stuff. llvm-svn: 92225	2009-12-28 20:45:51 +00:00
Sanjiv Gupta	d17915f6f0	Allow targets to specify the return type of libcalls that are generated for floating point comparisons, rather than hard-coding them as i32. llvm-svn: 92199	2009-12-28 02:40:33 +00:00
Bill Wendling	3fbb708d4f	Remove dead store. llvm-svn: 92190	2009-12-28 01:51:30 +00:00
Bill Wendling	08d36672e1	Remove dead variable. llvm-svn: 92189	2009-12-28 01:48:56 +00:00
Bill Wendling	5badb477fa	Remove dead variable. llvm-svn: 92188	2009-12-28 01:47:48 +00:00
Bill Wendling	008e2e1e7f	Remove dead variable. llvm-svn: 92180	2009-12-28 01:02:21 +00:00
Bill Wendling	091861587d	Remove dead variable. llvm-svn: 92178	2009-12-28 01:00:12 +00:00
Chris Lattner	4e96d36f72	handle equality memcmp of 8 bytes on x86-64 with two unaligned loads and a compare. On other targets we end up with a call to memcmp because we don't want 16 individual byte loads. We should be able to use movups as well, but we're failing to select the generated icmp. llvm-svn: 92107	2009-12-24 01:07:17 +00:00
Chris Lattner	5d3919d5f9	move an optimization for memcmp out of simplifylibcalls and into SDISel. This optimization was causing simplifylibcalls to introduce type-unsafe nastiness. This is the first step, I'll be expanding the memcmp optimizations shortly, covering things that we really really wouldn't want simplifylibcalls to do. llvm-svn: 92098	2009-12-24 00:37:38 +00:00
Nuno Lopes	6abe311a0f	move a few more symbols to .rodata llvm-svn: 92011	2009-12-23 17:48:10 +00:00
Dale Johannesen	b4485fd8a9	Use more sensible type for flags in asms. PR 5570. Patch by Sylve`re Teissier (sorry, ASCII only). llvm-svn: 91988	2009-12-23 07:32:51 +00:00

... 3 4 5 6 7 ...

4330 Commits