llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 08:23:21 +01:00

Author	SHA1	Message	Date
Evan Cheng	6397a77e16	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Daniel Dunbar	3a0c98ca87	MC/X86: Add movq alias for movabsq, to allow matching 64-bit immediates with movq. llvm-svn: 104275	2010-05-20 20:36:29 +00:00
Dan Gohman	89f64e13fe	When canonicalizing icmp operand order to put the loop invariant operand on the left, the interesting operand is on the right. This fixes a bug where LSR was failing to recognize ICmpZero uses, which led it to be unable to reverse the induction variable in the attached testcase. Delete test/CodeGen/X86/stack-color-with-reg-2.ll, because its test is extremely fragile and hard to meaningfully update. llvm-svn: 104262	2010-05-20 19:26:52 +00:00
Bob Wilson	11aebf39f1	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Dan Gohman	c8b4555a94	Fix assembly parsing and encoding of the pushf and popf family of instructions. llvm-svn: 104231	2010-05-20 16:16:00 +00:00
Dan Gohman	52dcd5fb9a	Define the x86 pause instruction. llvm-svn: 104204	2010-05-20 01:35:50 +00:00
Dan Gohman	00b8752500	Fix the sfence instruction to use MRM_F8 instead of MRM7r, since it doesn't have a register operand. Also, use I instead of PSI, for consistency with mfence and lfence. llvm-svn: 104203	2010-05-20 01:23:41 +00:00
Bill Wendling	e7a42798bc	Match "4" or "8" depending upon if it's 32- or 64-bit. llvm-svn: 104196	2010-05-20 00:27:10 +00:00
Eric Christopher	1643e2f4c6	Once more, with feeling. llvm-svn: 104190	2010-05-20 00:07:13 +00:00
Dan Gohman	772b731ca5	Teach LSR how to cope better with unrolled loops on targets where the addressing modes don't make this trivially easy. This allows it to avoid falling into the less precise heuristics in more cases. llvm-svn: 104186	2010-05-19 23:43:12 +00:00
Chris Lattner	aedd148163	fix rdar://7986634 - match instruction opcodes case insensitively. llvm-svn: 104183	2010-05-19 23:34:33 +00:00
Bill Wendling	483ce4b9b7	Testcase for r104181. llvm-svn: 104182	2010-05-19 23:33:26 +00:00
Eric Christopher	8c8d643a87	A more combo tls testcase. llvm-svn: 104163	2010-05-19 21:19:42 +00:00
Eric Christopher	6b51010080	Few more simple tls testcases. llvm-svn: 104148	2010-05-19 20:35:15 +00:00
Jakob Stoklund Olesen	6a2bfde3c8	TwoAddressInstructionPass doesn't really know how to merge live intervals when lowering REG_SEQUENCE instructions. Insert copies for REG_SEQUENCE sources not killed to avoid breaking later passes. llvm-svn: 104146	2010-05-19 20:08:00 +00:00
Eric Christopher	fd72aa1040	Attempt to run this test on x86 only. llvm-svn: 104143	2010-05-19 18:59:37 +00:00
Bob Wilson	e5f623ac22	Testcase to go with 104141. llvm-svn: 104142	2010-05-19 18:58:37 +00:00
Evan Cheng	6f52107b12	t2LEApcrel and tLEApcrel are re-materializable. This makes it possible to hoist more loads during machine LICM. llvm-svn: 104115	2010-05-19 07:28:01 +00:00
Evan Cheng	632cb17357	Intrinsics which do a vector compare (results are all zero or all ones) are modeled as icmp / fcmp + sext. This is turned into a vsetcc by dag combine (yes, not a good long term solution). The targets can then isel the vsetcc to the appropriate instruction. The trouble arises when the result of a vector cmp + sext is then and'ed with all ones. Instcombine will turn it into a vector cmp + zext, dag combiner will miss turning it into a vsetcc and hell breaks loose after that. Teach dag combine to turn a vector cpm + zest into a vsetcc + and 1. This fixes rdar://7923010. llvm-svn: 104094	2010-05-19 01:08:17 +00:00
Eric Christopher	aec869777c	Add a test to make sure that we're lowering the shift amount correctly. llvm-svn: 104090	2010-05-19 00:22:04 +00:00
Jakob Stoklund Olesen	f3114dbb3a	Remember to update VirtRegLastUse when spilling without killing before a call. llvm-svn: 104074	2010-05-18 22:20:09 +00:00
Dan Gohman	eaa3ee65bd	When converting a test to a cmp to fold a load, use the cmp that has an 8-bit immediate field rather than one with a wider immediate field. llvm-svn: 104064	2010-05-18 21:42:03 +00:00
Eric Christopher	79cbb29471	Quick test to make sure we're emitting the tbss section correctly. llvm-svn: 104063	2010-05-18 21:40:20 +00:00
Evan Cheng	e2980af336	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Dale Johannesen	6b88a14922	Test passed on ppc, to my surprise; if it worked there it may work everywhere... llvm-svn: 104053	2010-05-18 20:47:04 +00:00
Evan Cheng	9fc34e676d	Fix PR7162: Use source register classes and sub-indices to determine the correct register class of the definitions of REG_SEQUENCE. llvm-svn: 104050	2010-05-18 20:03:28 +00:00
Dale Johannesen	00e3e62df1	Testcase for llvm-gcc checkin 104042. llvm-svn: 104043	2010-05-18 19:03:51 +00:00
Kevin Enderby	438a36f66a	Fixed the problem with a branch to "0b" that was not parsed by llvm-mc correctly. The Lexer was incorrectly eating the newline casusing it to branch to address 0. Updated the test case to use a "0:" label and a branch to "0b". llvm-svn: 104038	2010-05-18 17:51:35 +00:00
Daniel Dunbar	739e720a21	MC/Mach-O: Implement support for setting indirect symbol table offset in section header. Also, create symbol data for LHS of assignment, to match 'as' symbol ordering better. llvm-svn: 104033	2010-05-18 17:28:24 +00:00
Daniel Dunbar	8c20c162fe	MC/X86: Implement custom lowering to make sure we match things like X86::ADC32ri $0, %eax to X86::ADC32i32 $0 llvm-svn: 104030	2010-05-18 17:22:24 +00:00
Evan Cheng	39b5115e93	FIX PR7158. SimplifyVBinOp was asserting when it fails to constant fold (op (build_vector), (build_vector)). llvm-svn: 104004	2010-05-18 00:03:40 +00:00
Evan Cheng	8aa900cf16	Fix PR7175. Insert copies of a REG_SEQUENCE source if it is used by other REG_SEQUENCE instructions. llvm-svn: 103994	2010-05-17 23:24:12 +00:00
Kevin Enderby	14c986967b	Added support in MC for Directional Local Labels. llvm-svn: 103989	2010-05-17 23:08:19 +00:00
Eric Christopher	6ea9cf8425	More data/parsing support for tls directives. Add a few more testcases and cleanup comments as well. llvm-svn: 103985	2010-05-17 22:53:55 +00:00
Evan Cheng	378d6c5d76	Fix PR7156. If the sources of a REG_SEQUENCE are all IMPLICIT_DEF's. Replace it with an IMPLICIT_DEF rather than deleting it or else it would be left without a def. llvm-svn: 103984	2010-05-17 22:09:49 +00:00
Daniel Dunbar	b18dfe6cb4	MC/Mach-O/x86: Optimal nop sequences should only be used for the .text sections, not all sections in the text segment. llvm-svn: 103981	2010-05-17 21:54:30 +00:00
Daniel Dunbar	ee5ac7a69b	MC/Mach-O: Reverse order of SymbolData scanning when emitting instructions. - This fixes a string table mismatch with 'as' when two new symbols are defined in a single instruction. llvm-svn: 103979	2010-05-17 21:19:59 +00:00
Evan Cheng	bb0a4fbe13	Careful with reg_sequence coalescing to not to overwrite sub-register indices. llvm-svn: 103971	2010-05-17 20:57:12 +00:00
Daniel Dunbar	80719b2d36	MC/Mach-O: Fix some differences in symbol flag handling. - Don't clear weak reference flag, 'as' was only "trying" to do this, it wasn't actually succeeding. - Clear the "lazy bound" bit when we mark something external. This corresponds roughly to the lazy clearing of the bit that 'as' implements in symbol_table_lookup. - The exact meaning of these flags appears pretty loose, since 'as' isn't very consistent. For now we just try to match 'as', we will clean this up one day hopefully. llvm-svn: 103964	2010-05-17 20:12:31 +00:00
Evan Cheng	3bce87c79f	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Daniel Dunbar	8f5da3624f	llvm-mc: Support reassignment of variables in one special case, when the variable has not yet been used in an expression. This allows us to support a few cases that show up in real code (mostly because gcc generates it for Objective-C on Darwin), without giving up a reasonable semantic model for assignment. llvm-svn: 103950	2010-05-17 17:46:23 +00:00
Jakob Stoklund Olesen	c07fd51d56	Avoid allocating the same physreg to multiple virtregs in one instruction. While that approach works wonders for register pressure, it tends to break everything. This should unbreak the arm-linux builder and fix a number of miscompilations. llvm-svn: 103946	2010-05-17 17:18:59 +00:00
Jakob Stoklund Olesen	40545bf117	Only use clairvoyance when defining a register, and then only if it has one use. This makes allocation independent on the ordering of use-def chains. llvm-svn: 103935	2010-05-17 04:50:57 +00:00
Eric Christopher	950f0d7892	Assume that we'll handle mangling the symbols earlier and just put the symbol to the file as we have it. Simplifies out tbss handling. llvm-svn: 103928	2010-05-17 02:13:02 +00:00
Dale Johannesen	b71d6a4150	Removing as part of previous reversion. llvm-svn: 103915	2010-05-16 20:19:40 +00:00
Dale Johannesen	cf2d4b9f91	Revert 103911; it broke a test that expects bitconvert <1xi64> -> i64 to work in MMX registers on hosts where -no-sse is the default (not mine). The right thing is to accept this and make i64->f64 conversions go through memory, but I don't have time right now. llvm-svn: 103914	2010-05-16 20:19:04 +00:00
Dale Johannesen	15dce10b5a	Make x86-64 64-bit bitconvert work when SSE is not available. (This worked as of about 6 months ago and I didn't track down exactly what broke it; I think this fix is appropriate.) llvm-svn: 103911	2010-05-16 18:22:38 +00:00
Anton Korobeynikov	925a32ae37	Add support for thiscall calling convention. Patch by Charles Davis and Steven Watanabe! llvm-svn: 103902	2010-05-16 09:08:45 +00:00
Anton Korobeynikov	314ccc5501	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Evan Cheng	85497bd415	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Bill Wendling	5fde821884	SystemZ really does mean "has calls" and not just "adjusts stack." Go ahead and replace the check with the appropriate predicate. Modify the testcase to reflect the correct code. (It should be saving callee-saved registers on the stack allocated by the calling fuction.) llvm-svn: 103829	2010-05-14 22:17:42 +00:00
Devang Patel	6bbba26bf1	Test case for r103800. llvm-svn: 103801	2010-05-14 21:04:45 +00:00
Kevin Enderby	dc13d89540	Fix so "int3" is correctly accepted, added "into" and fixed "int" with an argument, like "int $4", to not get an Assertion error. llvm-svn: 103791	2010-05-14 19:16:02 +00:00
Daniel Dunbar	89a1d9036a	MC/Mach-O/x86_64: Darwin's special "signed_N" relocation types should only be used to replace a normal relocation, not a reference to a GOT entry. llvm-svn: 103789	2010-05-14 18:53:40 +00:00
Jakob Stoklund Olesen	3eac02b22f	Simplify the handling of physreg defs and uses in RegAllocFast. This adds extra security against using clobbered physregs, and it adds kill markers to physreg uses. llvm-svn: 103784	2010-05-14 18:03:25 +00:00
Daniel Dunbar	3c261b3ff8	XFAIL the test I added with vg_leak, apparently it is the first and only llc -filetype=obj test, and -filetype=obj leaks a few objects. Added a FIXME, we need to sort out the ownership model for the various MC objects. llvm-svn: 103769	2010-05-14 07:47:51 +00:00
Daniel Dunbar	26fa01eb86	Inline Asm: Ensure buffer is newline terminated to match how the text is printed. - This is a hack, but I can't decide the best place to handle this. Chris? llvm-svn: 103765	2010-05-14 04:31:50 +00:00
Eric Christopher	ebea91f168	Add AsmParser support for darwin tbss directive. Nothing uses this yet. llvm-svn: 103757	2010-05-14 01:50:28 +00:00
Nick Lewycky	fc4c30e9e3	Actually run the test. Thanks Daniel Dunbar! llvm-svn: 103720	2010-05-13 17:41:06 +00:00
Nick Lewycky	38e49fbf52	Add testcase for r103653. llvm-svn: 103699	2010-05-13 06:00:14 +00:00
Daniel Dunbar	f254edca8d	MC/Mach-O: Add another zerofill test to improve coverage. llvm-svn: 103691	2010-05-13 01:10:28 +00:00
Jakob Stoklund Olesen	d99818256c	Take allocation hints from copy instructions to/from physregs. This causes way more identity copies to be generated, ripe for coalescing. llvm-svn: 103686	2010-05-13 00:19:43 +00:00
Chris Lattner	ca57d80f83	fix rdar://7965971 and a fixme: use ParseIdentifier in ParseDirectiveDarwinZerofill instead of hard coding the check for identifier. This allows quoted symbol names to be used. llvm-svn: 103682	2010-05-13 00:10:34 +00:00
Chris Lattner	887e8f9f53	reapply r103668 with a fix. Never make "minor syntax changes" after testing before committing. llvm-svn: 103681	2010-05-13 00:02:47 +00:00
Chris Lattner	361c115f23	revert r103668 for now, it is apparently breaking things. llvm-svn: 103677	2010-05-12 23:40:59 +00:00
Chris Lattner	91a836a9c7	moffset forms of moves are x86-32 only, make the parser lower them to the correct x86-64 instructions since we don't have a clean way to handle this in td files yet. rdar://7947184 llvm-svn: 103668	2010-05-12 23:13:36 +00:00
Chris Lattner	1960255123	fix the encoding of the obscure "moffset" forms of moves, i386 part first. rdar://7947184 llvm-svn: 103660	2010-05-12 22:48:24 +00:00
Jakob Stoklund Olesen	7d1323d9a5	Make sure to add kill flags to the last use of a virtreg when it is redefined. The X86 floating point stack pass and others depend on good kill flags. llvm-svn: 103635	2010-05-12 18:46:03 +00:00
Devang Patel	2c350fe8a7	Test case for r103633. llvm-svn: 103634	2010-05-12 18:31:04 +00:00
Dale Johannesen	9d4b3277ba	Testcase for llvm 103572 (7898991). llvm-svn: 103574	2010-05-12 05:04:20 +00:00
Daniel Dunbar	469dddef0e	MC/X86: Extend suffix matching hack to match 'q' suffix. llvm-svn: 103535	2010-05-12 00:54:20 +00:00
Daniel Dunbar	e5a79692cf	MC/Mach-O/x86_64: Add a new hook for checking whether a particular section can be diced into atoms, and adjust getAtom() to take this into account. - This fixes relocations to symbols in fixed size literal sections, for example. llvm-svn: 103532	2010-05-12 00:38:17 +00:00
Jakob Stoklund Olesen	6976c543cd	Enable a bunch more -regalloc=fast tests llvm-svn: 103531	2010-05-12 00:11:24 +00:00
Daniel Dunbar	a033860d55	MC/Mach-O/x86_64: Fix PCrel adjustment for x86_64, which was using the fixup offset instead of the fixup address as intended. llvm-svn: 103527	2010-05-11 23:53:11 +00:00
Jakob Stoklund Olesen	063844f706	Keep track of the last place a live virtreg was used. This allows us to add accurate kill markers, something the scavenger likes. Add some more tests from ARM that needed this. llvm-svn: 103521	2010-05-11 23:24:45 +00:00
Jakob Stoklund Olesen	99d5d74fb0	One more -regalloc=fast test llvm-svn: 103509	2010-05-11 20:51:07 +00:00
Jakob Stoklund Olesen	e27902ac68	Simplify the tracking of used physregs to a bulk bitor followed by a transitive closure after allocating all blocks. Add a few more test cases for -regalloc=fast. llvm-svn: 103500	2010-05-11 20:30:28 +00:00
Jakob Stoklund Olesen	442e38c4de	Mostly rewrite RegAllocFast. Sorry for the big change. The path leading up to this patch had some TableGen changes that I didn't want to commit before I knew they were useful. They weren't, and this version does not need them. The fast register allocator now does no liveness calculations. Instead it relies on kill flags provided by isel. (Currently those kill flags are also ignored due to isel bugs). The allocation algorithm is supposed to work with any subset of valid kill flags. More kill flags simply means fewer spills inserted. Registers are allocated from a working set that contains no aliases. That means most allocations can be done directly without expensive alias checks. When the working set runs out of registers we do the full alias check to find new free registers. llvm-svn: 103488	2010-05-11 18:54:45 +00:00
Daniel Dunbar	7670f69e42	MC/Mach-O x86_64: Switch to using fragment atom symbol. - This eliminates getAtomForAddress() (which was a linear search) and simplifies getAtom(). - This also fixes some correctness problems where local labels at the same address as non-local labels could be assigned to the wrong atom. llvm-svn: 103480	2010-05-11 17:22:50 +00:00
Kalle Raiskila	e302300b51	Make SPU backend not assert on jump tables. llvm-svn: 103466	2010-05-11 11:00:02 +00:00
Evan Cheng	11130a0a22	Select @llvm.trap to the special B with 1111 condition (i.e. trap) instruction. llvm-svn: 103459	2010-05-11 07:26:32 +00:00
Daniel Dunbar	d2c14ce38d	MC/Mach-O: Fix another mismatch with .weak_definition, we shouldn't use a scattered relocation entry with a .weak_definition. llvm-svn: 103443	2010-05-10 23:15:20 +00:00
Devang Patel	23f7323e7f	Enable multiple Compile Units in one module. This means now 'llvm-ld a.bc b.bc' will preserve debug info appropriately. llvm-svn: 103439	2010-05-10 22:49:55 +00:00
Chris Lattner	a0f98775e9	this really is needed. :( llvm-svn: 103434	2010-05-10 21:23:48 +00:00
Chris Lattner	bf56c070f3	just remove this, it isn't needed. llvm-svn: 103432	2010-05-10 21:01:47 +00:00
Chris Lattner	e78cc107c4	fix PR7105 by enumerating MDNodes on all @llvm.foo function calls, not just recognized intrinsics. llvm-svn: 103428	2010-05-10 20:53:17 +00:00
Chris Lattner	b3d401532c	fix a pretty obvious typo. We test things before committing them, right? llvm-svn: 103427	2010-05-10 20:51:06 +00:00
David Greene	dec8f8b802	Fix PR6875: This includes a patch by Roman Divacky to fix the initial crash. Move the actual addition of passes from PassManager::add to PassManager::addImpl. That way, when adding printer passes we won't recurse infinitely. Finally, check to make sure that we are actually adding a FunctionPass to a FunctionPassManager before doing a print before or after it. Immutable passes are strange in this way because they aren't FunctionPasses yet they can be and are added to the FunctionPassManager. llvm-svn: 103425	2010-05-10 20:24:27 +00:00
Evan Cheng	df350445c6	Be careful with operand promotion. For a binary operation, the source operands may be the same. PR7018. rdar://7939869. llvm-svn: 103419	2010-05-10 19:03:57 +00:00
Devang Patel	553c42f20b	Test case for 103414. llvm-svn: 103415	2010-05-10 17:49:40 +00:00
Kalle Raiskila	61289abcda	Fix encoding of 'sf' and 'sfh' instructions. llvm-svn: 103399	2010-05-10 08:13:49 +00:00
Chris Lattner	e74d980a02	make simplifycfg insert an llvm.trap before the 'unreachable' it introduces when it detects undefined behavior. llvm.trap generally codegens into some thing really small (e.g. a 2 byte ud2 instruction on x86) and debugging this sort of thing is "nontrivial". For example, we now compile: void foo() { (int)0 = 42; } into: _foo: pushl %ebp movl %esp, %ebp ud2 Some may even claim that this is a security hole, though that seems dubious to me. This addresses rdar://7958343 - Optimizing away null dereference potentially allows arbitrary code execution llvm-svn: 103356	2010-05-08 22:15:59 +00:00
Chris Lattner	0b442d35da	Teach instcombine to transform a bitcast/(zext\|trunc)/bitcast sequence with a vector input and output into a shuffle vector. This sort of sequence happens when the input code stores with one type and reloads with another type and then SROA promotes to i96 integers, which make everyone sad. This fixes rdar://7896024 llvm-svn: 103354	2010-05-08 21:50:26 +00:00
Chris Lattner	1037630863	Fix PR7052, patch by Jakub Staszak! llvm-svn: 103347	2010-05-08 20:01:44 +00:00
Bill Wendling	787de8fe38	Readd testcase. llvm-svn: 103335	2010-05-08 04:47:54 +00:00
Dan Gohman	4ca74b9c6c	When pruning candidate formulae out of an LSRUse, update the LSRUse's Regs set after all pruning is done, rather than trying to do it on the fly, which can produce an incomplete result. This fixes a case where heuristic pruning was stripping all formulae from a use, which led the solver to enter an infinite loop. Also, add a few asserts to diagnose this kind of situation. llvm-svn: 103328	2010-05-07 23:36:59 +00:00
Bill Wendling	4241941b52	Remove. Don't XFAIL. llvm-svn: 103321	2010-05-07 23:09:17 +00:00
Bill Wendling	3846ec7889	Temorarily revert r101984. llvm-svn: 103314	2010-05-07 22:45:36 +00:00
Dan Gohman	95040c18f4	SDDbgValues are apparently not being legalized. Fix a symptom of the problem, and not the real problem itself, by dropping debug info for i128 values. rdar://7958162. llvm-svn: 103310	2010-05-07 22:19:08 +00:00
Kevin Enderby	3807a4746a	Fix i386 relocations to Weak Definitions. The relocation entries should be external and the item to be relocated should not have the address of the symbol added in. llvm-svn: 103302	2010-05-07 21:44:23 +00:00
Dale Johannesen	1ee37ac5d4	Fix PR 7087, and probably other things, by extending getConstantFP to accept the two supported long double target types. This was not the original intent, but there are other places that assume this works and it's easy enough to do. llvm-svn: 103299	2010-05-07 21:35:53 +00:00
Devang Patel	9290f59fb8	Update test to use valid debug info. llvm-svn: 103287	2010-05-07 20:34:00 +00:00
Jim Grosbach	2db1618b44	Clean up the conditional for handling of sign_extend_inreg based on whether the extract instructions are available. rdar://7956878 llvm-svn: 103277	2010-05-07 18:34:55 +00:00
Duncan Sands	ed2ef5a987	Correct some bogus target triples. llvm-svn: 103265	2010-05-07 17:03:48 +00:00
Dan Gohman	1512bd9998	Add an LLVM IR version of code sinking. This uses the same simple algorithm as MachineSink, but it isn't constrained by MachineInstr-level details. llvm-svn: 103257	2010-05-07 15:40:13 +00:00
Nick Lewycky	3e5720a898	Revert r103133 and add testcase from PR7066. llvm-svn: 103233	2010-05-07 01:45:38 +00:00
Dale Johannesen	e5ca1fe49e	Adjust tests affected by llvm-gcc 103229. All results here match gcc-4.2. llvm-svn: 103230	2010-05-07 01:11:31 +00:00
Dan Gohman	f863ad3dc5	Disable the new unknown-location code for now. It causes a major increase in the debug line info section, and it's causing regressions in a gdb testsuite. llvm-svn: 103226	2010-05-07 01:08:53 +00:00
Daniel Dunbar	45589cd853	MC/X86: X86AbsMemAsmOperand is subclass of X86NoSegMemAsmOperand. - This fixes "leal 0, %eax", for example. llvm-svn: 103205	2010-05-06 22:39:14 +00:00
Chris Lattner	c788971da7	fix rdar://7947167 - llvm-mc doesn't match movsq llvm-svn: 103199	2010-05-06 21:48:14 +00:00
Sean Callanan	4331428e24	Eliminated the classification of control registers into %ecr_ and %rcr_, leaving just %cr_ which is what people expect. Updated the disassembler to support this unified register set. Added a testcase to verify that the registers continue to be decoded correctly. llvm-svn: 103196	2010-05-06 20:59:00 +00:00
Dan Gohman	497e752655	Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it doesn't have to guess. llvm-svn: 103194	2010-05-06 20:33:48 +00:00
Dan Gohman	3190e5c292	Add a testcase for r103135, explicitly representing unknown locations in debug line info. llvm-svn: 103189	2010-05-06 17:49:17 +00:00
Daniel Dunbar	a3731b17c0	Revert r103137, fix for $ in labels. It looks like we can't actually handle this at the token level. Consider the following horrible test case: a = 1 .globl $a movl ($a), %eax movl $a, %eax movl $$a, %eax llvm-svn: 103178	2010-05-06 14:46:38 +00:00
Chris Lattner	014a954e3d	Fix PR7054 - Assertion `Symbol->isUndefined() && "Cannot define a symbol twice!"' failed. Users can write broken code that emits the same label twice with asm renaming, detect this and emit a fatal backend error instead of aborting. llvm-svn: 103140	2010-05-06 00:05:37 +00:00
Chris Lattner	ca80f41a4d	fix rdar://7946934 - in some limited cases, the assembler should allow $ at the start of a symbol name. llvm-svn: 103137	2010-05-05 23:51:28 +00:00
Jim Grosbach	e04cc6cb43	Cleanup of ARMv7M support. Move hardware divide and Thumb2 extract/pack instructions to subtarget features and update tests to reflect. PR5717. llvm-svn: 103136	2010-05-05 23:44:43 +00:00
Jakob Stoklund Olesen	2e5d12acfa	Fix PR6520. An earlyclobber physreg must not be allocated to anything else. llvm-svn: 103133	2010-05-05 23:07:41 +00:00
Stuart Hastings	edf26ed051	Test case for pr2394 and r102979. llvm-svn: 103129	2010-05-05 22:49:33 +00:00
Jim Grosbach	7eb0b4d646	fix copy/paste oops. llvm-svn: 103122	2010-05-05 21:07:46 +00:00
Jim Grosbach	25fc725b2a	Add tests for ARMV7M divide instruction use llvm-svn: 103120	2010-05-05 20:47:15 +00:00
Jim Grosbach	9b7ae2027f	remove unneeded underscores. llvm-svn: 103114	2010-05-05 19:55:58 +00:00
Jim Grosbach	7ea67d346f	Convert to filecheck llvm-svn: 103113	2010-05-05 19:41:11 +00:00
Daniel Dunbar	9a3d46162f	MC/Mach-O: Mark absolute variable's appropriately, and add Mach-O support for writing them. - <rdar://problem/7885351> integrated assembler broken for i386 objc code llvm-svn: 103112	2010-05-05 19:01:05 +00:00
Daniel Dunbar	f5dc70a7d1	MC/Mach-O/x86_64: Relocations in debug sections should use local relocations when possible. - <rdar://problem/7934873> llvm-svn: 103092	2010-05-05 17:22:39 +00:00
Duncan Sands	7db9873b74	Use llvm.foo as the intrinsic, rather than llvm.dbg.value. Since the values passed to llvm.dbg.value were not valid for the intrinsic, it might have caused trouble one day if the verifier ever started checking for valid debug info. llvm-svn: 103038	2010-05-04 20:09:25 +00:00
Chris Lattner	b4696853af	"on the rare occasion the SPU BE produces illegal assembly - it tries to emit an add instruction of the form 'a reg, reg, imm'." Patch by Kalle Raiskila! llvm-svn: 103021	2010-05-04 17:58:46 +00:00
Daniel Dunbar	e335a71a56	MC/X86: Chris pointed that 'as' isn't consistent in accepting the long form of instructions which have no direct register usage. Darwin 'as' accepts: add $0, (%rax) but rejects mov $0, (%rax) for example. Given that, only accept suffix matches which match exactly one form. We still need to emit nice diagnostics for failures... llvm-svn: 103015	2010-05-04 17:31:02 +00:00
Daniel Dunbar	648d1ae783	MC/X86: Add "support" for matching ATT style mnemonic prefixes. - The idea is that when a match fails, we just try to match each of +'b', +'w', +'l'. If exactly one matches, we assume this is a mnemonic prefix and accept it. If all match, we assume it is width generic, and take the 'l' form. - This would be a horrible hack, if it weren't so simple. Therefore it is an elegant solution! Chris gets the credit for this particular elegant solution. :) - Next step to making this more robust is to have the X86 matcher generate the mnemonic prefix information. Ideally we would also compute up-front exactly which mnemonic to attempt to match, but this may require more custom code in the matcher than is really worth it. llvm-svn: 103012	2010-05-04 16:12:42 +00:00
Duncan Sands	a3857d3d9a	Fix a variant of PR6112 found by thinking about it: when doing RAUW of a global variable with a local variable in function F, if function local metadata M in function G was using the global then M would become function-local to both F and G, which is not allowed. See the testcase for an example. Fixed by detecting this situation and zapping the metadata operand when it occurs. llvm-svn: 103007	2010-05-04 12:43:36 +00:00
Devang Patel	ee7c7143dc	Set DW_AT_APPLE_omit_frame_ptr in endFunction() where MachineFunction is available all the time. llvm-svn: 103001	2010-05-04 06:15:30 +00:00
Devang Patel	edbef722d6	Do not ignore debug loc attached with llvm.dbg.declare while collecting debug info used by a module. llvm-svn: 102995	2010-05-04 01:05:02 +00:00
Dale Johannesen	b10ca6bf4c	Implement builtin_return_address(x) and builtin_frame_address(x) on PPC for x!=0. 7624113. llvm-svn: 102972	2010-05-03 22:59:34 +00:00
Jakob Stoklund Olesen	51ab2653d5	Check that subregisters don't have independent values in RemoveCopyByCommutingDef(). This fixes PR6941. llvm-svn: 102970	2010-05-03 22:40:32 +00:00
Dan Gohman	c024fd43c9	Fix tests to use fadd, fsub, and fmul, instead of add, sub, and mul, when the type is floating-point. llvm-svn: 102969	2010-05-03 22:36:46 +00:00
Bill Wendling	80159b87ea	Revert r102948. llvm-svn: 102964	2010-05-03 21:51:21 +00:00
Kevin Enderby	ce7a24a57f	Changed llvm-mc to use the same suffixes with floating point compare instructions as the Mac OS X darwin assembler. Some of which like 'fcoml' assembled to different opcodes. While some of the suffixes were just different. llvm-svn: 102958	2010-05-03 21:31:40 +00:00
Kevin Enderby	be2804f752	Fixed the encoding of two of the X86 movq instuctions. The Move quadword from mm to mm/m64 and the Move quadword from xmm2/mem64 to xmm1 had the incorrect encodings. llvm-svn: 102952	2010-05-03 21:03:31 +00:00
Kevin Enderby	c1eeb061e7	Fixed the encoding of the x86 push instructions. Using a 32-bit immediate value caused the a pushl instruction to be incorrectly encoding using only two bytes of immediate, causing the following 2 instruction bytes to be part of the 32-bit immediate value. Also fixed the one byte form of push to be used when the immediate would fit in a signed extended byte. Lastly changed the names to not include the 32 of PUSH32 since they actually push the size of the stack pointer. llvm-svn: 102951	2010-05-03 20:45:05 +00:00
Bill Wendling	61fab275c0	Testcase for r102947. llvm-svn: 102948	2010-05-03 20:39:35 +00:00
Devang Patel	fa560fdfc1	Check for side effects before splitting loop. Patch by Jakub Staszak! llvm-svn: 102928	2010-05-03 18:06:58 +00:00
Dan Gohman	15cb983f55	Fix a bug which prevented tail merging of return instructions in beneficial cases. See the changes in test/CodeGen/X86/tail-opts.ll and test/CodeGen/ARM/ifcvt2.ll for details. The fix is to change HashEndOfMBB to hash at most one instruction, instead of trying to apply heuristics about when it will be profitable to consider more than one instruction. The regular tail-merging heuristics are already prepared to handle the same cases, and they're more precise. Also, make test/CodeGen/ARM/ifcvt5.ll and test/CodeGen/Thumb2/thumb2-branch.ll slightly more complex so that they continue to test what they're intended to test. And, this eliminates the problem in test/CodeGen/Thumb2/2009-10-15-ITBlockBranch.ll, the testcase from PR5204. Update it accordingly. llvm-svn: 102907	2010-05-03 14:35:47 +00:00
Duncan Sands	153ad3b903	Remove the -enable-sjlj-eh option, which doesn't do anything. Remove the -enable-eh option which is only used by the JIT, and replace it with -jit-enable-eh. llvm-svn: 102865	2010-05-02 15:36:26 +00:00
Chris Lattner	afaee8e110	revert r102831. We already delete dead readonly calls in other places, killing a valid transformation is not the right answer. llvm-svn: 102850	2010-05-01 17:19:38 +00:00
Anton Korobeynikov	a3726088fa	Insert ANY_EXTEND node instead of invalid truncate during DAG Combining (X & 1), when needed. This fixes PR7001 llvm-svn: 102838	2010-05-01 12:52:34 +00:00
Anton Korobeynikov	f31181a0cc	Do folding for indirect branches, where possible llvm-svn: 102836	2010-05-01 12:28:21 +00:00
Anton Korobeynikov	9b724bd446	Implement indirect branches on MSP430 llvm-svn: 102835	2010-05-01 12:04:32 +00:00
Owen Anderson	443d813b45	Disable the call-deletion transformation introduced in r86975. Without halting analysis, it is illegal to delete a call to a read-only function. The correct solution is almost certainly to add a "must halt" attribute and only allow deletions in its presence. XFAIL the relevant testcase for now. llvm-svn: 102831	2010-05-01 08:34:28 +00:00
Chris Lattner	61a8beaae0	fix PR5009 by making CGSCCPM realize that a call was devirtualized if an indirect call site was removed and a direct one was added, not just if an indirect call site was modified to be direct. llvm-svn: 102830	2010-05-01 06:38:43 +00:00
Chris Lattner	660cc3ac57	rename test llvm-svn: 102829	2010-05-01 06:34:13 +00:00
Chris Lattner	9ee72a47c2	Implement rdar://6295824 and PR6724 with two tiny changes that can have a big effect :). The first is to enable the iterative SCC passmanager juice that kicks in when the scc passmgr detects that a function pass has devirtualized a call. In this case, it will rerun all the passes it manages on the SCC, up to the iteration count limit (4). This is useful because a function pass may devirualize a call, and we want the inliner to inline it, or pruneeh to infer stuff about it, etc. The second patch is to add all call sites to the DevirtualizedCalls list the inliner uses. This list is about to get renamed, but the jist of this is that the inliner now reconsiders all inlined call sites as candidates for further inlining. The intuition is this that in cases like this: f() { g(1); } g(int x) { h(x); } We analyze this bottom up, and may decide that it isn't profitable to inline H into G. Next step, we decide that it is profitable to inline G into F, and do so, which means that F now calls H. Even though the call from G -> H may not have been profitable to inline, the call from F -> H may be (in this case because a constant allows folding etc). In my spot checks, this doesn't have a big impact on code. For example, the LLC output for 252.eon grew from 0.02% (from 317252 to 317308) and 176.gcc actually shrunk by .3% (from 1525612 to 1520964 bytes). 252.eon never iterated in the SCC Passmgr, 176.gcc iterated at most 1 time. llvm-svn: 102823	2010-05-01 01:15:56 +00:00
Chris Lattner	b893cedad2	The inliner has traditionally not considered call sites that appear due to inlining a callee as candidates for futher inlining, but a recent patch made it do this if those call sites were indirect and became direct. Unfortunately, in bizarre cases (see testcase) doing this can cause us to infinitely inline mutually recursive functions into callers not in the cycle. Fix this by keeping track of the inline history from which callsite inline candidates got inlined from. This shouldn't affect any "real world" code, but is required for a follow on patch that is coming up next. llvm-svn: 102822	2010-05-01 01:05:10 +00:00
Bill Wendling	7b6527b94b	Test failing too much on too many platforms. llvm-svn: 102812	2010-05-01 00:12:33 +00:00
Bill Wendling	a348ff69b8	Maybe it needs sse2? llvm-svn: 102802	2010-04-30 23:19:29 +00:00
Bill Wendling	d1c5f11f72	Force 64-bit. llvm-svn: 102800	2010-04-30 22:45:20 +00:00
Chris Lattner	3eb6a9f076	Dan recently disabled recursive inlining within a function, but we were still inlining self-recursive functions into other functions. Inlining a recursive function into itself has the potential to reduce recursion depth by a factor of 2, inlining a recursive function into something else reduces recursion depth by exactly 1. Since inlining a recursive function into something else is a weird form of loop peeling, turn this off. The deleted testcase was added by Dale in r62107, since then we're leaning towards not inlining recursive stuff ever. In any case, if we like inlining recursive stuff, it should be done within the recursive function itself to get the algorithm recursion depth win. llvm-svn: 102798	2010-04-30 22:37:22 +00:00
Bill Wendling	95a4929ac7	EXTRACT_VECTOR_ELT of an INSERT_VECTOR_ELT may have the same index, but the indexes could be of a different value type. Or not even using the same SDNode for the constant (weird, I know). Compare the actual values instead of the pointers. llvm-svn: 102791	2010-04-30 22:19:17 +00:00
Jakob Stoklund Olesen	fac584ed9e	The local register allocator has to spill dirty callee saved registers before a call that might throw. The landing pad assumes that all registers are in stack slots. We used to spill those dirty CSRs after the call, and the stack slots would be wrong when arriving at the landing pad. llvm-svn: 102770	2010-04-30 21:19:29 +00:00
Devang Patel	8146cb492f	Preserve debug info attached with call instruction while eliminating dead argument. Radar 7927803 llvm-svn: 102760	2010-04-30 20:23:54 +00:00
Devang Patel	9832115581	New test. llvm-svn: 102746	2010-04-30 19:39:29 +00:00
Dan Gohman	6944fae2f6	Add lint checks for invalid uses of memory. llvm-svn: 102733	2010-04-30 19:05:00 +00:00
Dan Gohman	f6e1477a55	Add -o /dev/null to some tests which don't care about their output. llvm-svn: 102722	2010-04-30 17:42:30 +00:00
Evan Cheng	fc86d7fbdc	Fix test. llvm-svn: 102694	2010-04-30 06:00:56 +00:00
Evan Cheng	9303b11e47	Another sibcall bug. If caller and callee calling conventions differ, then it's only safe to do a tail call if the results are returned in the same way. llvm-svn: 102683	2010-04-30 01:12:32 +00:00
Jakob Stoklund Olesen	01254ea96d	Reject really weird coalescer case when trying to merge identical subregisters of different register classes. e.g. %reg1048:3<def> = EXTRACT_SUBREG %RAX<kill>, 3 Where %reg1048 is a GR32 register. This is not impossible to handle, but it is pretty hard and very rare. This should unbreak the dragonegg builder. llvm-svn: 102672	2010-04-29 23:47:46 +00:00
Evan Cheng	da89f22f3d	Load folding tail call should not use ebp / rbp after it's popped. PEI should use esp / rsp to reference frame instead. llvm-svn: 102596	2010-04-29 05:08:22 +00:00
Kevin Enderby	58bed5a913	Fixed the word sized Bit Scan Forward/Reverse instructions, they needed the Operand size override prefix to be part of their records. llvm-svn: 102556	2010-04-28 23:20:40 +00:00
Chris Lattner	45c337c939	fix this to work with objdir != srcdir llvm-svn: 102547	2010-04-28 22:34:35 +00:00
Dale Johannesen	5aa4801ddb	Fix comment. llvm-svn: 102545	2010-04-28 22:23:46 +00:00
Dale Johannesen	02cb4ed701	Test for llvm-gcc checkin 102543. llvm-svn: 102544	2010-04-28 22:17:33 +00:00
Devang Patel	8e2e813b48	Update tests. Now DBG_VALUE instruction is created only if alloca corresponding to llvm.dbg.declare is missing. llvm-svn: 102524	2010-04-28 20:27:48 +00:00
Chris Lattner	4629370fa2	fix PR6112 - When globalopt (or any other pass) does RAUW(@G, %G), metadata references in non-function-local MDNodes should drop to null. llvm-svn: 102519	2010-04-28 20:16:12 +00:00
Chris Lattner	9867c1a075	Rework global alignment computation again. Now we do round up alignment of globals to the preferred alignment, but only when there is no section specified on the global (by far the common case). llvm-svn: 102515	2010-04-28 19:58:07 +00:00
Evan Cheng	d4fe387eb8	Enable i16 to i32 promotion by default. llvm-svn: 102493	2010-04-28 08:30:49 +00:00
Evan Cheng	08e5f737d2	Update tests. llvm-svn: 102487	2010-04-28 01:53:13 +00:00
Devang Patel	570e9d53a7	Emit debug info for byval parameters. llvm-svn: 102486	2010-04-28 01:39:28 +00:00
Evan Cheng	2aaefc6167	Do not count kill, implicit_def instructions as printed instructions. llvm-svn: 102453	2010-04-27 19:38:45 +00:00
Chris Lattner	a9c1328501	round zero-byte .zerofill directives up to 1 byte. This should fix some "g++.dg-struct-layout-1" failures, rdar://7886017 llvm-svn: 102421	2010-04-27 07:41:44 +00:00
Dale Johannesen	af026229b2	Un-XFAIL this on ppc. My enabling of dbg_declare handling in ISel fixed it. llvm-svn: 102404	2010-04-27 00:01:42 +00:00
Chris Lattner	9292bad5f5	on darwin empty functions need to codegen into something of non-zero length, otherwise labels get incorrectly merged. We handled this by emitting a ".byte 0", but this isn't correct on thumb/arm targets where the text segment needs to be a multiple of 2/4 bytes. Handle this by emitting a noop. This is more gross than it should be because arm/ppc are not fully mc'ized yet. This fixes rdar://7908505 llvm-svn: 102400	2010-04-26 23:37:21 +00:00
Bob Wilson	ece63716aa	Handle register-to-register copies within the tGPR class. Radar 7896289 llvm-svn: 102396	2010-04-26 23:20:08 +00:00
Devang Patel	ac2c76f813	Use DW_AT_entry_pc instead of DW_AT_low_pc/DW_AT_high_pc pair. This simplifies debug range entries. llvm-svn: 102394	2010-04-26 22:54:28 +00:00
Dan Gohman	40561dd0ba	When checking whether the special handling for an addrec increment which doesn't dominate the header is needed, don't check whether the increment expression has computable loop evolution. While the operands of an addrec are required to be loop-invariant, they're not required to dominate any part of the loop. This fixes PR6914. llvm-svn: 102389	2010-04-26 21:46:36 +00:00
Dan Gohman	9c1b7fdc46	Add a comment to this test. llvm-svn: 102387	2010-04-26 21:37:43 +00:00
Chris Lattner	4854eab087	fix PR6921 a different way. Intead of increasing the alignment of globals with a specified alignment, we fix common variables to obey their alignment. Add a comment explaining why this behavior is important. llvm-svn: 102365	2010-04-26 18:46:46 +00:00
Chris Lattner	a8cd2ac893	Revert r102300/102301, which serious broke objc apps. llvm-svn: 102359	2010-04-26 18:30:45 +00:00
Chris Lattner	9065710fcf	fix PR6940: sitofp(undef) folds to 0.0, not undef. llvm-svn: 102358	2010-04-26 18:21:23 +00:00
Chris Lattner	241b92e4ee	testcase for PR6913 llvm-svn: 102303	2010-04-25 05:51:14 +00:00
Chris Lattner	454613a18a	this passes now. llvm-svn: 102301	2010-04-25 05:49:31 +00:00
Chris Lattner	e4a25eb35a	Fix PR6921: globals were not getting correctly rounded up to their preferred alignment unless they were common or some other special case. llvm-svn: 102300	2010-04-25 05:30:43 +00:00
Dan Gohman	42337e0ee9	Generalize LSR's OptimizeMax to handle the new kinds of max expressions that indvars may use, now that indvars is recognizing le and ge loops. llvm-svn: 102235	2010-04-24 03:13:44 +00:00
Dan Gohman	231fe284cd	ScalarEvolution support for <= and >= loops. Also, generalize ScalarEvolutions's min and max recognition to handle some new forms of min and max that this change makes more common. llvm-svn: 102234	2010-04-24 03:09:42 +00:00
Chris Lattner	ace5b97b5c	no longer xfail llvm-svn: 102220	2010-04-23 22:39:33 +00:00
Stuart Hastings	85b5c330f2	Per Chris, fuse four trivial tests using grep (r102199) into one that uses FileCheck. llvm-svn: 102216	2010-04-23 22:12:57 +00:00
Dan Gohman	6a48222bd8	Change TargetData's algorithm for computing defualt vector type alignment to match what's used in clang and GCC for __alignof, rather than trying to guess what Legalize is going to be doing. llvm-svn: 102206	2010-04-23 19:41:15 +00:00
Stuart Hastings	ad81819149	Add some missing x86 patterns for movdq2q. Fixes two (LLVM-)GCC DejaGNU testcases. Radar 6881029. llvm-svn: 102199	2010-04-23 19:03:32 +00:00
Chris Lattner	790231f95e	fix some failures my callgraph dump format change broke. llvm-svn: 102197	2010-04-23 18:38:40 +00:00
Chris Lattner	775c94002d	testcase for the bug that required a patch to be reverted. llvm-svn: 102195	2010-04-23 18:31:01 +00:00
Dan Gohman	38949c2f1f	Fix LSR to tolerate cases where ScalarEvolution initially misses an opportunity to fold add operands, but folds them after LSR has separated them out. This fixes rdar://7886751. llvm-svn: 102157	2010-04-23 01:55:05 +00:00
Chris Lattner	85dd1e42b6	disable my previous inliner patch, it appears to be busting self-host. llvm-svn: 102153	2010-04-23 00:41:03 +00:00
Chris Lattner	5d87e1be44	The inliner was choosing to not consider call sites that appear in the SCC as a result of inlining as candidates for inlining. Change this so that it does consider call sites that change from being indirect to being direct as a result of inlining. This allows it to completely "devirtualize" the testcase. llvm-svn: 102146	2010-04-22 23:37:35 +00:00
Jim Grosbach	b9dccb6103	Update ARM DAGtoDAG for matching UBFX instruction for unsigned bitfield extraction. This fixes PR5998. llvm-svn: 102144	2010-04-22 23:24:18 +00:00
Devang Patel	01dcb9fa4c	Remove the test for now. llvm-svn: 102135	2010-04-22 22:06:28 +00:00
Devang Patel	cf9bece3dd	Adjust debug range offsets for isWeakForLinker() functions. llvm-svn: 102127	2010-04-22 20:52:00 +00:00
Chris Lattner	66e308198d	add a DEBUG call so that -debug lists when CGSCCPM iterates. Fix RefreshCallGraph to use CGN->replaceCallEdge instead of hand rolling its own loop. replaceCallEdge properly maintains the reference counts of the nodes, fixing a crash exposed by the iterative callgraph stuff. llvm-svn: 102120	2010-04-22 20:42:33 +00:00
Dan Gohman	31d6b29bae	Don't attempt to analyze values which are obviously undef. This fixes some assertion failures in extreme cases. llvm-svn: 102042	2010-04-22 01:35:11 +00:00
Evan Cheng	a324da99ae	Do not try to optimize a copy that has already been marked for deletion. llvm-svn: 102027	2010-04-21 20:57:54 +00:00
Evan Cheng	dbfb7dc438	Implement -disable-non-leaf-fp-elim which disable frame pointer elimination optimization for non-leaf functions. This will be hooked up to gcc's -momit-leaf-frame-pointer option. rdar://7886181 llvm-svn: 101984	2010-04-21 03:18:23 +00:00
Johnny Chen	6e4b1607ee	Thumb instructions which have reglist operands at the end and predicate operands before reglist were not properly handled with respect to IT Block. Fix that by creating a new method ARMBasicMCBuilder::DoPredicateOperands() used by those instructions for disassembly. Add a test case. llvm-svn: 101974	2010-04-21 01:01:19 +00:00
Chris Lattner	c840cfe5c9	Implement (but don't enable) PR6724 and rdar://6295824. In short, we have RefreshCallGraph detect when a function pass devirtualizes a call, and have CGSCCPassMgr iterate (up to a count) when this happens. This allows (in the example) GVN to devirtualize the call in foo, then the inliner to inline it away. This is not currently enabled because I haven't done any analysis on the (potentially substantial) code size or performance impact of doing this, and guess what, it exposes callgraph updating bugs in various passes. This is progress though, and you can play with it by passing -max-cg-scc-iterations=5 to opt. llvm-svn: 101973	2010-04-21 00:47:40 +00:00
Evan Cheng	a0c4b2952f	- Clean up some crappy code which deals with coalescing of copies which look at extract_subreg / insert_subreg, etc. - Add support for more aggressive insert_subreg coalescing. llvm-svn: 101971	2010-04-21 00:44:22 +00:00
Dan Gohman	4d1724c3e8	Revert r101471. For tight recursive functions which have multiple recursive callsites, inlining can reduce the number of calls by exponential factors, as it does in MultiSource/Benchmarks/Olden/treeadd. More involved heuristics will be needed. llvm-svn: 101969	2010-04-21 00:43:30 +00:00
Dan Gohman	570b621976	Add another variant of this test which found a place where CodeGen's ComputeMaskedBits was being over-conservative when computing bits for an ADD. llvm-svn: 101963	2010-04-21 00:19:28 +00:00
Chris Lattner	6db0f451a7	teach the x86 address matching stuff to handle (shl (or x,c), 3) the same as (shl (add x, c), 3) when x doesn't have any bits from c set. This finishes off PR1135. Before we compiled the block to: to: LBB0_3: ## %bb cmpb $4, %dl sete %dl addb %dl, %cl movb %cl, %dl shlb $2, %dl addb %r8b, %dl shlb $2, %dl movzbl %dl, %edx movl %esi, (%rdi,%rdx,4) leaq 2(%rdx), %r9 movl %esi, (%rdi,%r9,4) leaq 1(%rdx), %r9 movl %esi, (%rdi,%r9,4) addq $3, %rdx movl %esi, (%rdi,%rdx,4) incb %r8b decb %al movb %r8b, %dl jne LBB0_1 Now we produce: LBB0_3: ## %bb cmpb $4, %dl sete %dl addb %dl, %cl movb %cl, %dl shlb $2, %dl addb %r8b, %dl shlb $2, %dl movzbl %dl, %edx movl %esi, (%rdi,%rdx,4) movl %esi, 8(%rdi,%rdx,4) movl %esi, 4(%rdi,%rdx,4) movl %esi, 12(%rdi,%rdx,4) incb %r8b decb %al movb %r8b, %dl jne LBB0_1 llvm-svn: 101958	2010-04-20 23:18:40 +00:00
Johnny Chen	9998480f92	When doing Thumb disassembly, there's no need to consider t2ADDrSPi12/t2SUBrSPi12, as their generic counterparts t2ADDri12/t2SUBri12 should suffice. llvm-svn: 101929	2010-04-20 18:45:24 +00:00
Bill Wendling	a87efb5d0f	Move CodeGen/X86/2010-04-19-DAGCombineCrash.ll into CodeGen/X86/crash.ll. Also reduce. llvm-svn: 101925	2010-04-20 18:14:47 +00:00
Johnny Chen	bd5bf58029	For t2LDRT, t2LDRBT, t2LDRHT, t2LDRSBT, and t2LDRSHT, if Rn(Inst{19-16})=='1111', transform the Opcode to the corresponding t2LDR*pci counterpart. Ref: A8.6.86 LDRT, A8.6.65 LDRBT, A8.6.77 LDRHT, A8.6.81 LDRSBT, A8.6.85 LDRSHT llvm-svn: 101915	2010-04-20 17:28:50 +00:00
Devang Patel	def402649b	Add RUN: llvm-svn: 101913	2010-04-20 17:20:10 +00:00
Chris Lattner	b66b0c36cd	Bill's change in r95336 broke empty aggregates embedded in other types. fix this by only bumping zero-byte globals up to a single byte if the entire global is zero size, fixing PR6340. This also fixes empty arrays etc to be handled correctly, and only does this on subsection-via-symbols targets (aka darwin) which is the only place where this matters. llvm-svn: 101879	2010-04-20 06:20:21 +00:00
Chris Lattner	04fb51984f	teach cellspu how to return i8 and i16 from calls, patch by Kalle Raiskila! llvm-svn: 101875	2010-04-20 05:36:09 +00:00
Chris Lattner	e5a995a834	RewriteLoopBodyWithConditionConstant can end up rewriting the condition we're unswitching on. In this case, don't try to simplify the second copy of the loop which may be dead or not, but is probably a constant now. This fixes PR6879 llvm-svn: 101870	2010-04-20 05:09:16 +00:00
Chris Lattner	7a2ce4afd8	reapply 'reject forward references to functions whose type don't match' now that the testsuite has been updated. llvm-svn: 101866	2010-04-20 04:49:11 +00:00
Bill Wendling	887dac2aa6	The visitXOR method can return the same SDNode. If so, we don't want to delete it as it's not dead. llvm-svn: 101855	2010-04-20 01:25:01 +00:00
Eric Christopher	53e7e0fcfb	Remove the palignr intrinsics now that we lower them to vector shuffles, shifts and null vectors. Autoupgrade these to what we'd lower them to. Add a testcase to exercise this. llvm-svn: 101851	2010-04-20 00:59:54 +00:00
Chris Lattner	994155dc91	Fix rdar://7879828 - crash in CallGraph, a self host issue. Arg promotion was deleting call graph nodes that still had references from the 'indirect' CGN. Like the inliner, it should only delete the function if all references are gone. llvm-svn: 101845	2010-04-20 00:46:50 +00:00
Bob Wilson	2e6cd50a50	Fix tests for Neon load/store intrinsics to match the i8* types expected by the intrinsics. The reason for those i8* types is that the intrinsics are overloaded on the vector type and we don't have a way to declare an intrinsic where one argument is an overloaded vector type and another argument is a pointer to the vector element type. The bitcasts added here will match what the frontend will typically generate when these intrinsics are used. llvm-svn: 101840	2010-04-20 00:17:16 +00:00
Dan Gohman	e52396cb52	Remove the Expr member from IVUsers. Instead of remembering the expression, just ask ScalarEvolution for it on demand. This helps IVUsers be more robust in the case of expressions changing underneath it. This fixes PR6862. llvm-svn: 101819	2010-04-19 21:48:58 +00:00
Johnny Chen	fe24cdbba8	According to A8.6.16 B (Encoding T3) and A8.3 Conditional execution -- A8.3.1 Pseudocode details of conditional, Condition bits '111x' indicate the instruction is always executed. That is, '1111' is a leagl condition field value, which is now mapped to ARMCC::AL. Also add a test case for condition field '1111'. llvm-svn: 101817	2010-04-19 21:19:52 +00:00
Devang Patel	2a21d92744	Fix typo. add a test case. llvm-svn: 101812	2010-04-19 20:31:39 +00:00
Johnny Chen	091c01cb2f	ARM disassembler did not react to recent changes to the NEON instruction table. VLD1q_UPD and VST1q_UPD have the ${dst:dregpair} operand now. llvm-svn: 101784	2010-04-19 16:20:34 +00:00
Nick Lewycky	c639c07492	Fix declarations in a few more tests. llvm-svn: 101676	2010-04-17 21:29:25 +00:00
Daniel Dunbar	0cfafd5b8f	Revert "reject forward references to functions whose type don't match", because DJG told me to! llvm-svn: 101675	2010-04-17 21:24:55 +00:00
Nick Lewycky	7abefa1195	Fix intrinsic signature in this test. llvm-svn: 101674	2010-04-17 21:12:55 +00:00
Chris Lattner	8028c98fe2	reject forward references to functions whose type don't match up with the definition (and fix a broken testcase). PR6491. llvm-svn: 101670	2010-04-17 20:45:56 +00:00
Chris Lattner	3117e48427	doh, didn't mean to check in my hackaround lit sucking. :) llvm-svn: 101663	2010-04-17 19:04:03 +00:00
Chris Lattner	99d17acb35	fix PR6332, allowing an index of zero into a zero sized array even if the element of the array has no size. llvm-svn: 101662	2010-04-17 19:02:33 +00:00
Chris Lattner	65d1e40895	teach the x86 asm parser how to handle segment prefixes in memory operands. rdar://7874844 llvm-svn: 101661	2010-04-17 18:56:34 +00:00
Chris Lattner	80a22bfcdb	testcase for r101538, patch by Nico Schmidt! llvm-svn: 101642	2010-04-17 17:22:06 +00:00
Dan Gohman	5736cd1e47	Start function numbering at 0. llvm-svn: 101638	2010-04-17 16:29:15 +00:00
Chris Lattner	42337931b2	a bunch of ssse3 instructions are misencoded to think they have an i8 field when they really do not. This fixes rdar://7840289 llvm-svn: 101629	2010-04-17 07:38:24 +00:00
Evan Cheng	d3d5e6793a	Add nounwind. llvm-svn: 101613	2010-04-17 03:43:36 +00:00
Bob Wilson	ad00f21093	Re-commit my previous SSAUpdater changes. The previous version naively tried to determine where to place PHIs by iteratively comparing reaching definitions at each block. That was just plain wrong. This version now computes the dominator tree within the subset of the CFG where PHIs may need to be placed, and then places the PHIs in the iterated dominance frontier of each definition. The rest of the patch is mostly the same, with a few more performance improvements added in. llvm-svn: 101612	2010-04-17 03:08:24 +00:00
Johnny Chen	1b78f89844	Minor change to make the test case comply with Vd<0> == '0' when Q == '1'. llvm-svn: 101559	2010-04-16 22:48:31 +00:00
Johnny Chen	44087236da	Fixed a bug in DisassembleN1RegModImmFrm() where a break stmt was missing for a case. Also, the 0xFF hex literal involved in the shift for ESize64 should be suffixed "ul" to preserve the shift result. Implemented printHex*ImmOperand() by copying from ARMAsmPrinter.cpp and added a test case for DisassembleN1RegModImmFrm()/printHex64ImmOperand(). llvm-svn: 101557	2010-04-16 22:40:20 +00:00
Johnny Chen	7ac96eac03	In the same spirit of r101524, which removed the assert() from printAddrMode2OffsetOperand(), this patch removes the assert() from printAddrMode3OffsetOperand() and adds a test case. llvm-svn: 101529	2010-04-16 19:57:21 +00:00
Johnny Chen	3cc0200f74	Multiclass LdStCop was using pre-UAL syntax LDC<c>L for the L fragment. Changed to the UAL syntax of LDCL<c>, instead. Add a test case for this change which also tests the removal of assert() from printAddrMode2OffsetOperand(). llvm-svn: 101527	2010-04-16 19:33:23 +00:00
Dan Gohman	0d55b3ecfc	Revert r101455, which fails on the llvm-arm-linux buildbot. llvm-svn: 101515	2010-04-16 18:37:31 +00:00
Dan Gohman	58add81e7d	Disable inlining of recursive calls. It can complicate tailcallelim and dependent analyses, and increase code size, so doing it profitably would require more complex heuristics. llvm-svn: 101471	2010-04-16 16:01:18 +00:00
Dan Gohman	f0457a82fa	Refine the detection of seemingly infinitely recursive calls where the callee is expected to be expanded to something else by codegen, so that normal infinitely recursive calls are still transformed. llvm-svn: 101468	2010-04-16 15:57:50 +00:00
Bill Wendling	60db5321b6	Add JIT exception handling test. llvm-svn: 101455	2010-04-16 09:04:28 +00:00

... 3 4 5 6 7 ...

10113 Commits