llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 22:12:57 +02:00

Author	SHA1	Message	Date
Jim Grosbach	d4eea7c10d	Let target asm backends see assembler flags as they go by. Use that to handle thumb vs. arm mode differences in WriteNopData(). llvm-svn: 121219	2010-12-08 01:16:55 +00:00
Jakob Stoklund Olesen	77e7ad803a	Move RABasic::addMBBLiveIns to the base class, it is generally useful. Minor optimization to the use of IntervalMap iterators. They are fairly heavyweight, so prefer SI.valid() over SI != end(). llvm-svn: 121217	2010-12-08 01:06:06 +00:00
Owen Anderson	d00dc39a11	Simplify the byte reordering logic slightly. llvm-svn: 121216	2010-12-08 00:21:33 +00:00
Owen Anderson	ba5edcfe05	VLDR fixups need special handling under Thumb. While the encoding is the same, the order of the bytes in the data stream is flipped around. llvm-svn: 121215	2010-12-08 00:18:36 +00:00
Devang Patel	0c0accf6bc	Global variable does not need linkage name. llvm-svn: 121212	2010-12-08 00:06:22 +00:00
Devang Patel	bdbff5f106	Add support to create local variable's debug info. llvm-svn: 121211	2010-12-07 23:58:00 +00:00
Rafael Espindola	790fe1d064	Layout each section independently. With the testcase in PR8711: before: 4 assembler - Number of assembler layout and relaxation steps 78563 assembler - Number of emitted assembler fragments 8693904 assembler - Number of emitted object file bytes 271223 assembler - Number of evaluated fixups 330771677 assembler - Number of fragment layouts 5958 assembler - Number of relaxed instructions 2508361 mcexpr - Number of MCExpr evaluations real 0m26.123s user 0m25.694s sys 0m0.388s after: 4 assembler - Number of assembler layout and relaxation steps 78563 assembler - Number of emitted assembler fragments 8693904 assembler - Number of emitted object file bytes 271223 assembler - Number of evaluated fixups 231507 assembler - Number of fragment layouts 5958 assembler - Number of relaxed instructions 2508361 mcexpr - Number of MCExpr evaluations real 0m2.500s user 0m2.113s sys 0m0.273s And yes, the outputs are identical :-) llvm-svn: 121207	2010-12-07 23:32:26 +00:00
Matt Beaumont-Gay	5e680ad101	Fix a warning about a variable which is only used in an assertion. llvm-svn: 121206	2010-12-07 23:26:21 +00:00
Devang Patel	cef2982b39	Add support to create variables, structs etc.. using DIBuilder. This is still work in progress. llvm-svn: 121205	2010-12-07 23:25:47 +00:00
Jakob Stoklund Olesen	9d6472e894	Switch LiveIntervalUnion from std::set to IntervalMap. This speeds up RegAllocBasic by 20%, not counting releaseMemory which becomes way faster. llvm-svn: 121201	2010-12-07 23:18:47 +00:00
Bill Wendling	45bdb13970	Cleanup in the Darwin end. No functionality change. llvm-svn: 121198	2010-12-07 23:11:00 +00:00
Evan Cheng	3bd9b95b4d	Fix a bad prologue / epilogue codegen bug where the compiler would emit illegal vpush instructions to save / restore VFP / NEON registers like this: vpush {d8,d10,d11} vpop {d8,d10,d11} vpush and vpop do not allow gaps in the register list. rdar://8728956 llvm-svn: 121197	2010-12-07 23:08:38 +00:00
Bill Wendling	4399d09458	A bit of cleanup: early exit ApplyFixup and cache the Fixup offset. No functionality change. llvm-svn: 121195	2010-12-07 23:05:20 +00:00
Jim Grosbach	77b631549c	Binary encoding for ARM tLDRspi and tSTRspi. llvm-svn: 121186	2010-12-07 21:50:47 +00:00
Owen Anderson	a23e10f29d	Fix Thumb2 encoding of the S bit. llvm-svn: 121182	2010-12-07 20:50:15 +00:00
Jim Grosbach	1aa6a676cf	Refactor the ARM CMPz* patterns to just use the normal CMP instructions when possible. They were duplicates for everything exception the source pattern before. llvm-svn: 121179	2010-12-07 20:41:06 +00:00
Evan Cheng	9af09ebf8b	Code clean up; no functionality change. llvm-svn: 121176	2010-12-07 20:11:46 +00:00
Evan Cheng	0295c17fbc	Code clean up; no functionality change. llvm-svn: 121172	2010-12-07 19:59:34 +00:00
Dan Gohman	54a78a9787	Remove the code from Function::dropAllReferences which replaced uses of the function's blocks with undef. This code isn't needed, because BasicBlock's destructor handles such uses. Also, undef isn't correct, since blockaddresses may still be used for comparisons with null. llvm-svn: 121170	2010-12-07 19:56:51 +00:00
Bruno Cardoso Lopes	e11d870459	Remove target specific node MipsISD::CMov, which is not used because all conditional moves are directly matched using tablegen patterns. If there's a need in the future, we can introduce it again llvm-svn: 121164	2010-12-07 19:04:14 +00:00
Bruno Cardoso Lopes	0e14644599	Match a pattern generated by a dag combiner opt where: (select (load (load tga0)) (load tga1)) => (load (select (load tga0) tga1)) Thanks to Akira for pointing that. llvm-svn: 121163	2010-12-07 19:00:20 +00:00
Jakob Stoklund Olesen	39e22e19bf	Simplify assertion. llvm-svn: 121162	2010-12-07 18:51:27 +00:00
Michael J. Spencer	3885add959	Support: Remove Alarm. It is unused (via local grep and google code search). llvm-svn: 121160	2010-12-07 18:41:59 +00:00
Michael J. Spencer	3dc94b3cc1	Support/PathV2: Remove const from bool return types. llvm-svn: 121157	2010-12-07 18:12:07 +00:00
Jim Grosbach	c99517ecc6	Encode the literal field for tCMPzi instruction. llvm-svn: 121153	2010-12-07 17:48:24 +00:00
Rafael Espindola	866531d633	Fix absolute recording of differences of symbols in two sections. Reduced from ctor_dtor_count-2.cpp. llvm-svn: 121152	2010-12-07 17:12:32 +00:00
Michael J. Spencer	7979bb402f	Support/PathV2: Change most functions in the path namespace to return their work via their return value instead of an out parameter. llvm-svn: 121149	2010-12-07 17:04:04 +00:00
Daniel Dunbar	6d79685e20	build: Go back to dropping __eprintf reference when building with Clang, see comment. llvm-svn: 121146	2010-12-07 16:29:44 +00:00
Benjamin Kramer	fb17a54866	Add parens to pacify gcc. llvm-svn: 121142	2010-12-07 15:50:35 +00:00
Frits van Bommel	e7f51111ce	Remove some dead code from the jump threading pass. The last uses of these functions were removed in r113852 when LazyValueInfo was permanently enabled and removed the need for them. llvm-svn: 121133	2010-12-07 13:08:07 +00:00
Jay Foad	79e18ed269	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Owen Anderson	4ad5307d6a	Don't leak the mutex when loading dynamic libraries. llvm-svn: 121119	2010-12-07 07:56:20 +00:00
Rafael Espindola	da64b6aa50	Fix relocations with weak definitions. llvm-svn: 121114	2010-12-07 05:57:28 +00:00
Chris Lattner	12c2c17ac7	reapply r121100 with a tweak to constant fold ConstExprs with TargetData (if available) as we go so that we get simple constantexprs not insane ones. This fixes the failure of clang/test/CodeGenCXX/virtual-base-ctor.cpp that the previous iteration of this patch had. llvm-svn: 121111	2010-12-07 04:33:29 +00:00
Michael J. Spencer	0ce07529b3	Support/PathV2: Cleanup separator handling. llvm-svn: 121110	2010-12-07 03:57:48 +00:00
Michael J. Spencer	a96fe51fa6	Support/PathV2: Remove the error_code return type from all functions in the path namespace. None of them return anything except for success anyway. These will be converted to returning their result soon. llvm-svn: 121109	2010-12-07 03:57:37 +00:00
Michael J. Spencer	7c3efd63d4	Support/PathV2: Move make_absolute from path to fs. llvm-svn: 121108	2010-12-07 03:57:17 +00:00
Rafael Espindola	9ede5ef045	Fix pcrel relocations that cross sections. llvm-svn: 121107	2010-12-07 03:50:14 +00:00
NAKAMURA Takumi	7947c9770a	lib/Target/X86/X86MCAsmInfo.cpp: [PR8741] On Win64, specify explicit PrivateGlobalPrefix as ".L". Or, global symbols @Lxxxx might be treated as temporal symbol by MCSymbol. llvm-svn: 121103	2010-12-07 02:43:45 +00:00
Eric Christopher	cab6997dc8	Temporarily revert r121100 as it's causing clang to fail CodeGenCXX/virtual-base-ctor.cpp. llvm-svn: 121102	2010-12-07 02:41:11 +00:00
Chris Lattner	5996a47663	fix PR8710 - teach global opt that some constantexprs are too complex to put in a global variable's initializer. llvm-svn: 121100	2010-12-07 01:59:32 +00:00
Jakob Stoklund Olesen	48ba44334f	Remove unused member. llvm-svn: 121098	2010-12-07 01:32:45 +00:00
Michael J. Spencer	2953ba9e66	Support/Unix/PathV2: Return the real error from realpath instead of any error that close or unlink set. llvm-svn: 121094	2010-12-07 01:23:39 +00:00
Michael J. Spencer	6874ebd344	Support/Unix/PathV2: Use 0770 instead of 0700 when creating a directory. Also use the standard macros instead of octal notation. llvm-svn: 121093	2010-12-07 01:23:29 +00:00
Michael J. Spencer	cfd185355b	Support/PathV2: Use SmallVector::clear instead of set_size. llvm-svn: 121092	2010-12-07 01:23:19 +00:00
Michael J. Spencer	a59a7b3965	Support/PathV2: Clarify and correct documentation. llvm-svn: 121091	2010-12-07 01:23:08 +00:00
Michael J. Spencer	898af0f235	Support/PathV2: Move current_path from path to fs and fix the Unix implementation. Unix bug spotted by Dan Gohman. llvm-svn: 121090	2010-12-07 01:22:31 +00:00
Rafael Espindola	c98cc0b286	Fix a crash reduced from gcc produced assembly. llvm-svn: 121085	2010-12-07 01:09:54 +00:00
Owen Anderson	81f8b084e6	Second attempt at converting Thumb2's LDRpci, including updating the gazillion places that need to know about it. llvm-svn: 121082	2010-12-07 00:45:21 +00:00
Rafael Espindola	8dad37785c	Sorry for such a large commit. The summary is that only MachO cares about the actuall addresses in a .o file, so it is better to let the MachO writer compute it. This is good for two reasons. First, areas that shouldn't care about addresses now don't have access to it. Second, the layout of each section is independent. I should use this in a subsequent commit to speed it up. Most of the patch is just removing the section address computation. The two interesting parts are the change on how we handle padding in the end of sections and how MachO can get the address of a-b when a and b are in different sections. Since now the expression evaluation normally doesn't know the section address, it will think that a-b needs relocation and let the MachO writer know. Once it has computed the section addresses, it calls back the expression evaluation with the section addresses to resolve these expressions. The remaining problem is the handling of padding. Currently it will create a special alignment fragment at the end. Since that fragment doesn't update the alignment of the section, it needs the real address to be computed. Since now the layout will not compute a-b with a and b in different sections, the only effect that the special alignment fragment has is update the address size of the section. This can also be done by the MachO writer. llvm-svn: 121076	2010-12-07 00:27:36 +00:00
Jim Grosbach	2d361b9318	Add fixup for Thumb1 BL/BLX instructions. llvm-svn: 121072	2010-12-06 23:57:07 +00:00
Frits van Bommel	1494a2f6fe	Implement jump threading of 'indirectbr' by keeping track of whether we're looking for ConstantInts or BlockAddresss. llvm-svn: 121066	2010-12-06 23:36:56 +00:00
Devang Patel	12459bc442	Undefined value in reg 0 may need a marker to identify end of source range. This will be used to truncate live range of DBG_VALUE instruction by register allocator and friends. llvm-svn: 121061	2010-12-06 22:48:22 +00:00
Devang Patel	6fe7fe8dd4	If dbg_declare() or dbg_value() is not lowered by isel then emit DEBUG message instead of creating DBG_VALUE for undefined value in reg0. llvm-svn: 121059	2010-12-06 22:39:26 +00:00
Rafael Espindola	82d3d8dc2c	Use references to simplify the code a bit. llvm-svn: 121050	2010-12-06 22:30:54 +00:00
Wesley Peck	ffdbf99b57	Adding bug fix that was suppose to be part of 121044. patch contributed by Jack Whitham! llvm-svn: 121049	2010-12-06 22:19:28 +00:00
Wesley Peck	996c76c27e	Fixed reversed operands for IDIV and CMP instructions in MBlaze backend. Use BRAD instead of BRD for indirect branches in MBlaze backend. patch contributed by Jack Whitham! llvm-svn: 121044	2010-12-06 22:06:49 +00:00
Jason W Kim	672ef014da	Refactor ELFObjectWriter. + ARM/X86/MBlaze now share a common RecordRelocation + ARM/X86/MBlaze arch specific routines are limited to GetRelocType() llvm-svn: 121043	2010-12-06 21:57:34 +00:00
Chris Lattner	599e271a46	replace a linear scan with a symtab lookup, reduce indentation. No functionality change. llvm-svn: 121042	2010-12-06 21:53:07 +00:00
Rafael Espindola	c726be7d0a	use getSymbolOffset. llvm-svn: 121041	2010-12-06 21:51:55 +00:00
Chris Lattner	2f134ca2cc	Use a stronger predicate here, pointed out by Duncan llvm-svn: 121040	2010-12-06 21:48:10 +00:00
Chris Lattner	2907722386	add some DEBUG statements. llvm-svn: 121038	2010-12-06 21:13:51 +00:00
Wesley Peck	b168ddedaa	Fix a 16-bit immediate value detection bug in the MBlaze delay slot filler. Address more hazards in the MBlaze delay slot filler. patch contributed by Jack Whitham! llvm-svn: 121037	2010-12-06 21:11:01 +00:00
Rafael Espindola	fd0cc5d13f	Another use of getSymbolOffset. llvm-svn: 121034	2010-12-06 19:55:05 +00:00
Rafael Espindola	65c25aef87	Remove the instruction fragment to data fragment lowering since it was causing freed data to be read. I will open a bug to track it being reenabled. llvm-svn: 121028	2010-12-06 19:08:48 +00:00
Owen Anderson	8e9cb84ea2	Revert r121021, which broke the buildbots. llvm-svn: 121026	2010-12-06 18:57:40 +00:00
Jim Grosbach	f2e0e808ba	Trailing whitespace. llvm-svn: 121024	2010-12-06 18:47:44 +00:00
Owen Anderson	0c51a02230	Improve handling of Thumb2 PC-relative loads by converting LDRpci (and friends) to Pseudos. llvm-svn: 121021	2010-12-06 18:35:51 +00:00
Jim Grosbach	6c27b4f3cf	Encode the register operand of ARM CondCode operands correctly. ARM::CPSR if the instruction is predicated, reg0 otherwise. llvm-svn: 121020	2010-12-06 18:30:57 +00:00
Jim Grosbach	c79c6290ee	The ARM AsmMatcher needs to know that the CCOut operand is a register value, not an immediate. It stores either ARM::CPSR or reg0. llvm-svn: 121018	2010-12-06 18:21:12 +00:00
Rafael Espindola	3e954d16f4	Second try at making direct object emission produce the same results as llc + llvm-mc. This time ELF is not changed and I tested that llvm-gcc bootstrap on darwin10 using darwin9's assembler and linker. llvm-svn: 121006	2010-12-06 17:27:56 +00:00
Rafael Espindola	4ec917db9b	Revert previous two patches while I try to find out how to make both linux and darwin assemblers happy :-( llvm-svn: 121004	2010-12-06 15:35:15 +00:00
Rafael Espindola	3dc2b4cba7	Add an EmitAbsValue helper method and use it in cases where we want to be sure that no relocations are used (on MochO). Fixes llc producing different output from llc + llvm-mc. llvm-svn: 121000	2010-12-06 14:53:14 +00:00
Chris Lattner	48a7310e08	Fix PR8735, a really terrible problem in the inliner's "alloca merging" optimization. Consider: static void foo() { A = alloca ... } static void bar() { B = alloca ... call foo(); } void main() { bar() } The inliner proceeds bottom up, but lets pretend it decides not to inline foo into bar. When it gets to main, it inlines bar into main(), and says "hey, I just inlined an alloca "B" into main, lets remember that. Then it keeps going and finds that it now contains a call to foo. It decides to inline foo into main, and says "hey, foo has an alloca A, and I have an alloca B from another inlined call site, lets reuse it". The problem with this of course, is that the lifetime of A and B are nested, not disjoint. Unfortunately I can't create a reasonable testcase for this: the one in the PR is both huge and extremely sensitive, because you minor tweaks end up causing foo to get inlined into bar too early. We already have tests for the basic alloca merging optimization and this does not break them. llvm-svn: 120995	2010-12-06 07:52:42 +00:00
Chris Lattner	21587c9f65	improve comment llvm-svn: 120994	2010-12-06 07:43:04 +00:00
Chris Lattner	71a4c43942	improve -debug output and comments a little. llvm-svn: 120993	2010-12-06 07:38:40 +00:00
Michael J. Spencer	b31b7d5b4e	Support/Windows: Make MinGW happy. llvm-svn: 120991	2010-12-06 06:02:07 +00:00
Michael J. Spencer	244b426701	Support/FileSystem: Add directory_iterator implementation. llvm-svn: 120989	2010-12-06 04:28:42 +00:00
Michael J. Spencer	36a2df800d	Support/PathV2: Fix append to not add a slash to empty or root paths. llvm-svn: 120988	2010-12-06 04:28:23 +00:00
Michael J. Spencer	61043e9f3a	Support/Windows: Add ScopedHandle and move some clients over to it. llvm-svn: 120987	2010-12-06 04:28:13 +00:00
Che-Liang Chiou	cd2878d421	ptx: add shift instructions llvm-svn: 120982	2010-12-06 04:00:03 +00:00
Rafael Espindola	0ba01a5b5c	Remove the getAddress getter, initialize Ordinal in the constructor and use that on the ELF writer to detect a section we created. llvm-svn: 120981	2010-12-06 03:48:09 +00:00
Rafael Espindola	bf001eed4c	Simplify a bit. llvm-svn: 120980	2010-12-06 03:36:43 +00:00
Rafael Espindola	d361a448af	Use getSymbolOffset on the COFF writer. llvm-svn: 120979	2010-12-06 03:24:04 +00:00
Rafael Espindola	f56c11276e	Don't use PadSectionToAlignment on windows. llvm-svn: 120978	2010-12-06 03:03:44 +00:00
Rafael Espindola	1b2090ef24	Add a getSymbolOffset method and use it in the ELF writer. llvm-svn: 120977	2010-12-06 02:57:26 +00:00
Chris Lattner	db6c348f31	Fix PR8728, a miscompilation I recently introduced. When optimizing memcpy's like: memcpy(A, B) memcpy(A, C) we cannot delete the first memcpy as dead if A and C might be aliases. If so, we actually get: memcpy(A, B) memcpy(A, A) which is not correct to transform into: memcpy(A, A) This patch was heavily influenced by Jakub Staszak's patch in PR8728, thanks Jakub! llvm-svn: 120974	2010-12-06 01:48:06 +00:00
Evan Cheng	4d9d54e44e	Eliminate unneeded #include's. llvm-svn: 120971	2010-12-05 23:41:43 +00:00
NAKAMURA Takumi	594d4094ca	ARM/CMakeLists.txt: Add missing MLxExpansionPass.cpp since r120960. llvm-svn: 120966	2010-12-05 23:08:57 +00:00
Evan Cheng	12561e250d	Code clean up. llvm-svn: 120965	2010-12-05 23:03:45 +00:00
Evan Cheng	854ec53564	Remove an unused variable. llvm-svn: 120964	2010-12-05 23:03:35 +00:00
Cameron Zwarich	f56ba80bb2	Some cleanup before I start committing some incremental progress on StrongPHIElimination. llvm-svn: 120961	2010-12-05 22:34:08 +00:00
Evan Cheng	fc78767730	Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Work in progress, only A+B are enabled. llvm-svn: 120960	2010-12-05 22:04:16 +00:00
Cameron Zwarich	f64c26bb9e	Remove the PHIElimination.h header, as it is no longer needed. llvm-svn: 120959	2010-12-05 21:39:42 +00:00
Frits van Bommel	e390b379ae	Fix PR 4170 by having ExtractValueInst::getIndexedType() reject out-of-bounds indexing. Also add asserts that the indices are valid in InsertValueInst::init(). ExtractValueInst already asserts when constructed with invalid indices. llvm-svn: 120956	2010-12-05 20:50:26 +00:00
Cameron Zwarich	fbe9e91d97	I forgot to actually remove the FindCopyInsertPoint() declaration from PHIElimination.h. llvm-svn: 120953	2010-12-05 19:58:57 +00:00
Cameron Zwarich	cb613dcf69	Remove the SplitCriticalEdge() method declaration from PHIElimination.h. At one time, this method existed, but now PHIElimination uses the method of the same name on MachineBasicBlock. llvm-svn: 120952	2010-12-05 19:54:23 +00:00
Cameron Zwarich	c680f44c1b	Move the FindCopyInsertPoint method of PHIElimination to a new standalone function so that it can be shared with StrongPHIElimination. llvm-svn: 120951	2010-12-05 19:51:05 +00:00
Frits van Bommel	b95594885e	Refactor jump threading. Should have no functional change other than the order of two transformations that are mutually-exclusive and the exact formatting of debug output. Internally, it now stores the ConstantInts as Constants, and actual undef values instead of nulls. llvm-svn: 120946	2010-12-05 19:06:41 +00:00
Frits van Bommel	4f39797ac2	Remove trailing whitespace. llvm-svn: 120945	2010-12-05 19:02:47 +00:00
Frits van Bommel	31cf7b99f9	Teach SimplifyCFG to turn (indirectbr (select cond, blockaddress(@fn, BlockA), blockaddress(@fn, BlockB))) into (br cond, BlockA, BlockB). llvm-svn: 120943	2010-12-05 18:29:03 +00:00
Chris Lattner	e30adfb732	Teach X86ISelLowering that the second result of X86ISD::UMUL is a flags result. This allows us to compile: void *test12(long count) { return new int[count]; } into: test12: movl $4, %ecx movq %rdi, %rax mulq %rcx movq $-1, %rdi cmovnoq %rax, %rdi jmp __Znam ## TAILCALL instead of: test12: movl $4, %ecx movq %rdi, %rax mulq %rcx seto %cl testb %cl, %cl movq $-1, %rdi cmoveq %rax, %rdi jmp __Znam Of course it would be even better if the regalloc inverted the cmov to 'cmovoq', which would eliminate the need for the 'movq %rdi, %rax'. llvm-svn: 120936	2010-12-05 07:49:54 +00:00
Chris Lattner	76601e7a99	it turns out that when ".with.overflow" intrinsics were added to the X86 backend that they were all implemented except umul. This one fell back to the default implementation that did a hi/lo multiply and compared the top. Fix this to check the overflow flag that the 'mul' instruction sets, so we can avoid an explicit test. Now we compile: void *func(long count) { return new int[count]; } into: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] seto %cl ## encoding: [0x0f,0x90,0xc1] testb %cl, %cl ## encoding: [0x84,0xc9] movq $-1, %rdi ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff] cmoveq %rax, %rdi ## encoding: [0x48,0x0f,0x44,0xf8] jmp __Znam ## TAILCALL instead of: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] testq %rdx, %rdx ## encoding: [0x48,0x85,0xd2] movq $-1, %rdi ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff] cmoveq %rax, %rdi ## encoding: [0x48,0x0f,0x44,0xf8] jmp __Znam ## TAILCALL Other than the silly seto+test, this is using the o bit directly, so it's going in the right direction. llvm-svn: 120935	2010-12-05 07:30:36 +00:00
Chris Lattner	16bafb2414	generalize the previous check to handle -1 on either side of the select, inserting a not to compensate. Add a missing isZero check that I lost somehow. This improves codegen of: void *func(long count) { return new int[count]; } from: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] testq %rdx, %rdx ## encoding: [0x48,0x85,0xd2] movq $-1, %rdi ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff] cmoveq %rax, %rdi ## encoding: [0x48,0x0f,0x44,0xf8] jmp __Znam ## TAILCALL ## encoding: [0xeb,A] to: __Z4funcl: ## @_Z4funcl movl $4, %ecx ## encoding: [0xb9,0x04,0x00,0x00,0x00] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] mulq %rcx ## encoding: [0x48,0xf7,0xe1] cmpq $1, %rdx ## encoding: [0x48,0x83,0xfa,0x01] sbbq %rdi, %rdi ## encoding: [0x48,0x19,0xff] notq %rdi ## encoding: [0x48,0xf7,0xd7] orq %rax, %rdi ## encoding: [0x48,0x09,0xc7] jmp __Znam ## TAILCALL ## encoding: [0xeb,A] llvm-svn: 120932	2010-12-05 02:00:51 +00:00
Chris Lattner	474ed0aa9b	Improve an integer select optimization in two ways: 1. generalize (select (x == 0), -1, 0) -> (sign_bit (x - 1)) to: (select (x == 0), -1, y) -> (sign_bit (x - 1)) \| y 2. Handle the identical pattern that happens with !=: (select (x != 0), y, -1) -> (sign_bit (x - 1)) \| y cmov is often high latency and can't fold immediates or memory operands. For example for (x == 0) ? -1 : 1, before we got: < testb %sil, %sil < movl $-1, %ecx < movl $1, %eax < cmovel %ecx, %eax now we get: > cmpb $1, %sil > sbbl %eax, %eax > orl $1, %eax llvm-svn: 120929	2010-12-05 01:23:24 +00:00
Bill Wendling	2b53c0830d	Initialize HasPOPCNT. llvm-svn: 120923	2010-12-04 23:57:24 +00:00
Rafael Espindola	310b851621	Once the layout is done we don't need to keep updating which fragments are valid. Addresses will not change. llvm-svn: 120921	2010-12-04 22:47:22 +00:00
Rafael Espindola	1bf8d261f1	Remember the contents of leb and dwarfline fragments when relaxing. This avoids having to evaluate the expression again when writing. llvm-svn: 120920	2010-12-04 21:58:52 +00:00
Cameron Zwarich	5e3c712e67	Remove PHIElimination's private copy of SkipPHIsAndLabels. llvm-svn: 120918	2010-12-04 20:40:15 +00:00
Benjamin Kramer	851691ddb2	Add patterns for the x86 popcnt instruction. - Also adds a new POPCNT subtarget feature that is currently enabled if the target supports SSE4.2 (nehalem) or SSE4A (barcelona). llvm-svn: 120917	2010-12-04 20:32:23 +00:00
Bill Wendling	18e834f217	Silence 'may be used uninitialized in this function' warnings. Static analysis may determine that they cannot be used uninitialized. But that might be a bit too much for the compiler to determine. llvm-svn: 120916	2010-12-04 20:20:34 +00:00
Michael J. Spencer	2cd429339f	Support/PathV2: Remove redundant calls to make_error_code. llvm-svn: 120913	2010-12-04 18:45:32 +00:00
Benjamin Kramer	e2e8053264	APInt: microoptimize a few methods. llvm-svn: 120912	2010-12-04 18:05:36 +00:00
Benjamin Kramer	009451fddc	Remove unneeded zero arrays. llvm-svn: 120910	2010-12-04 15:28:22 +00:00
Benjamin Kramer	612c7225ee	Apparently APFloat::getZero doesn't like PPCDoubleDoubles. llvm-svn: 120909	2010-12-04 14:43:08 +00:00
Benjamin Kramer	77faee6ba1	Simplify code. No functionality change. llvm-svn: 120907	2010-12-04 14:22:24 +00:00
Bob Wilson	20c65a9d33	The Thumb tADDrSPi instruction is not valid when the destination is SP. Check for that and try narrowing it to tADDspi instead. Radar 8724703. llvm-svn: 120892	2010-12-04 04:40:19 +00:00
Rafael Espindola	9215947c83	There are two reasons why we might want to use foo = a - b .long foo instead of just .long a - b First, on darwin9 64 bits the assembler produces the wrong result. Second, if "a" is the end of the section all darwin assemblers (9, 10 and mc) will not consider a - b to be a constant but will if the dummy foo is created. Split how we handle these cases. The first one is something MC should take care of. The second one has to be handled by the caller. llvm-svn: 120889	2010-12-04 03:21:47 +00:00
Michael J. Spencer	5a2272dcef	Support/FileSystem: Add status implementation. llvm-svn: 120870	2010-12-04 00:32:40 +00:00
Michael J. Spencer	090a4bd632	Support/Windows/FileSystem: Fix MinGW warnings. llvm-svn: 120868	2010-12-04 00:32:14 +00:00
Michael J. Spencer	e1360680ce	Support/FileSystem: Add file_size implementation. llvm-svn: 120867	2010-12-04 00:31:48 +00:00
Rafael Espindola	50b6170457	Next step: Only pad debug_line when the target is darwin. Add a FIXME to avoid doing that if the target is darwin10 or newer. This fixes ) Direct object emission was producing objects without the workaround on darwin9. ) Assembly printing was producing objects with the workaround on linux. llvm-svn: 120866	2010-12-04 00:31:13 +00:00
Jim Grosbach	4c10ffa7c3	Encode condition code for Thumb1 conditional branch instruction. llvm-svn: 120865	2010-12-04 00:20:40 +00:00
Jim Grosbach	1dac8796d5	Correctly size-reduce the t2CMPzrr instruction to tCMPzr when possible. tCMPzhir has undefined behavior when both source registers are low registers. rdar://8728577 llvm-svn: 120858	2010-12-03 23:54:18 +00:00
Bill Wendling	7e27f17312	Use correct variable names to match the patterns. llvm-svn: 120857	2010-12-03 23:44:24 +00:00
Jakob Stoklund Olesen	6f535251de	Also inore '()' while creating mdnode name from ObjC symbol name. llvm-svn: 120856	2010-12-03 23:40:45 +00:00
Rafael Espindola	4145acdc3b	First step in fixing MC. Make it clear that we are avoiding a bug in the darwin9 linker, what is needed to avoid it and where to get more information. Also make the workaround simpler. Just the regular end_sequence we normally create is more than 4 bytes. Tested by building cctools and ld64 from darwin9 on a darwin10 system and using those. I checked that I was able to reproduce the bootstrap failure when the the workaround was disabled. llvm-svn: 120854	2010-12-03 23:36:59 +00:00
Devang Patel	fdc570cad4	Ignore '+' while creating mdnode name from ObjC symbol name. llvm-svn: 120853	2010-12-03 23:29:30 +00:00
Jim Grosbach	3b5c857f01	Match pattern operand names to expected encoding field names. This corrects the operand encoding ordering of the instruction. llvm-svn: 120852	2010-12-03 23:21:25 +00:00
Jim Grosbach	dbefb3e7e5	Remove incorrect BL target encoding (it's similar to, but not the same as the ARM instruction). Add encoding of bits 13 and 11. llvm-svn: 120849	2010-12-03 22:33:42 +00:00
Jim Grosbach	8cef570ed9	Encode the 32-bit wide Thumb (and Thumb2) instructions with the high order halfword being emitted to the stream first. rdar://8728174 llvm-svn: 120848	2010-12-03 22:31:40 +00:00
Nate Begeman	d4310b6d7c	Revert this change since it breaks a couple of the AVX tests. I'm unclear if the tests are actually correct or not, but reverting for now. llvm-svn: 120847	2010-12-03 22:29:15 +00:00
Jakob Stoklund Olesen	8ed86d7fd3	Rename virtRegMap to avoid confusion with the VirtRegMap that it isn't. llvm-svn: 120846	2010-12-03 22:25:09 +00:00
Jakob Stoklund Olesen	8b893e3575	Coalesce debug locations when possible, causing less DBG_VALUE instructions to be emitted. llvm-svn: 120845	2010-12-03 22:25:07 +00:00
Nate Begeman	deb26223bd	Scalar f32/f64 are also subregs of ymm regs llvm-svn: 120844	2010-12-03 21:54:39 +00:00
Nate Begeman	3911dcfd71	Remove SSE1-4 disable when AVX is enabled. While this may be useful for development, it completely breaks scalar fp in xmm regs when AVX is enabled. llvm-svn: 120843	2010-12-03 21:54:14 +00:00
Jakob Stoklund Olesen	c18ef29bc6	Emit DBG_VALUE instructions from LiveDebugVariables. llvm-svn: 120842	2010-12-03 21:47:10 +00:00
Jakob Stoklund Olesen	1d753a5f7f	Also update virtRegMap when renaming virtual registers. llvm-svn: 120841	2010-12-03 21:47:08 +00:00
Jim Grosbach	c69ad2176a	When using the 'push' mnemonic for Thumb2 stmdb, be explicit when it's the 32-bit wide version by adding the .w suffix. llvm-svn: 120838	2010-12-03 20:33:01 +00:00
Benjamin Kramer	e27eff8888	Remove unused variable. llvm-svn: 120836	2010-12-03 19:55:37 +00:00
Jim Grosbach	c8ce9a3453	Reduce t2 ldr/str instructions to the correct t1 versions when there's an immediate offset. llvm-svn: 120833	2010-12-03 19:47:11 +00:00
Jason W Kim	27bbab7e31	fix ARM::fixup_arm_branch, cleanup, and share more code between ELF and Darwin llvm-svn: 120832	2010-12-03 19:40:23 +00:00
Jim Grosbach	25da270139	No need to declare EncoderMethod property anymore; just assign to it. llvm-svn: 120831	2010-12-03 19:31:00 +00:00
Jakob Stoklund Olesen	1ec6a68038	Delete the StrongPHIElimination pass, leaving only a shell. The StrongPHIElimination pass did not work, and nobody has worked on it for two years. A rewrite is underway, so I am leaving this shell pass instead of deleting it completely. llvm-svn: 120830	2010-12-03 19:21:53 +00:00
Jakob Stoklund Olesen	4cd667151d	Add IntervalMap::iterator::set{Start,Stop,Value} methods that allow limited editing of the current interval. These methods may cause coalescing, there are corresponding set*Unchecked methods for editing without coalescing. The non-coalescing methods are useful for applying monotonic transforms to all keys or values in a map without accidentally coalescing transformed and untransformed intervals. llvm-svn: 120829	2010-12-03 19:02:00 +00:00
Michael J. Spencer	6cf89ea23a	Support/FileSystem: Add equivalent implementation. llvm-svn: 120827	2010-12-03 18:49:13 +00:00
Michael J. Spencer	ce950c50c4	Support/FileSystem: Fix MinGW build. It doesn't have _chsize_s. llvm-svn: 120826	2010-12-03 18:48:56 +00:00
Jim Grosbach	d0db6c9f0e	Add FIXMEs. llvm-svn: 120824	2010-12-03 18:37:17 +00:00
Jim Grosbach	dca34b5da7	Size reduction for tPUSH come from t2STMDB_UPD, not t2STMIA_UPD. llvm-svn: 120822	2010-12-03 18:31:03 +00:00
Michael J. Spencer	60c2f39a49	And I really hate line endings. llvm-svn: 120821	2010-12-03 18:04:11 +00:00
Michael J. Spencer	94cf759d48	Support/Windows/FileSystem: Fix MinGW build. llvm-svn: 120820	2010-12-03 18:03:28 +00:00
Michael J. Spencer	a0346605ae	Support/FileSystem: Add resize_file implementation. llvm-svn: 120819	2010-12-03 17:54:07 +00:00
Michael J. Spencer	ba3176e790	Support/FileSystem: Add rename implementation. llvm-svn: 120818	2010-12-03 17:53:55 +00:00
Michael J. Spencer	1d9a2b7541	Support/FileSystem: Add remove implementation. llvm-svn: 120817	2010-12-03 17:53:43 +00:00
Michael J. Spencer	8c7c12a74e	Fix line endings. llvm-svn: 120816	2010-12-03 17:53:23 +00:00
Eric Christopher	25f7b89a9e	Apparently OS X 10.4 doesn't have __crashreporter_info__. Try to fix building on the wayback machine. llvm-svn: 120801	2010-12-03 07:45:22 +00:00
Michael J. Spencer	98d9d1509e	Support/FileSystem: Add create_symlink implementation. llvm-svn: 120800	2010-12-03 07:41:25 +00:00
Michael J. Spencer	efc69d250d	Support/FileSystem: Add create_hard_link implementation. llvm-svn: 120792	2010-12-03 05:58:41 +00:00
Michael J. Spencer	5328e94513	Support/ADT/Twine: Make toNullTerminatedStringRef not rely on UB :(. llvm-svn: 120791	2010-12-03 05:42:25 +00:00
Michael J. Spencer	96236c93ff	Support/FileSystem: Add create_director{y,ies} implementations. llvm-svn: 120790	2010-12-03 05:42:11 +00:00
Rafael Espindola	ec560cdae3	Make EmitIntValue more efficient and more like what we do for leb128. The difference is much smaller (about 0.3s) but significant. llvm-svn: 120787	2010-12-03 02:54:21 +00:00
Bill Wendling	b7df584ef7	Don't overwrite the opcode passed into the T1Special pattern. llvm-svn: 120782	2010-12-03 02:02:58 +00:00
Bill Wendling	c4858cb4c3	Add Thumb encoding for some more instructions. llvm-svn: 120780	2010-12-03 01:55:47 +00:00
Michael J. Spencer	d9f355beb0	Support/Windows/FileSystem: Remove unneeded toNullTerminatedStringRef. llvm-svn: 120777	2010-12-03 01:21:38 +00:00
Michael J. Spencer	4e1623c715	Support/FileSystem: Add unique_file and exists implementations. llvm-svn: 120776	2010-12-03 01:21:28 +00:00
Rafael Espindola	dc103b1755	Do with uleb the same trick we now do with dwarf line/address advances. This avoids creating leb128 fragments and speeds up the test in PR8711 to 33s. llvm-svn: 120774	2010-12-03 01:19:49 +00:00
Rafael Espindola	3e119b0bb4	Try to resolve symbol differences early, and if successful create a plain data fragment. This reduces the time to assemble the test in 8711 from 60s to 54s. llvm-svn: 120767	2010-12-03 00:55:40 +00:00
Bill Wendling	2f6a820abe	The tLDR instruction wasn't encoded properly: <MCInst 2251 <MCOperand Reg:70> <MCOperand Reg:66> <MCOperand Imm:0> <MCOperand Reg:0> <MCOperand Imm:14> <MCOperand Reg:0>> Notice that the "reg" here is 0, which is an invalid register. Put a check in the code for this to prevent crashing. llvm-svn: 120766	2010-12-03 00:53:22 +00:00
Devang Patel	77a1457f1a	It may not be an option to skip .debug_line if there are file reference in already emitted debug info. So, for now, emit dummy line table entry to make older linker and assemblers happy. This is not a new behavior, original AsmPrinter emitted similar line table entries. llvm-svn: 120760	2010-12-03 00:10:48 +00:00
Jim Grosbach	0bd3b0fd6c	Trailing whitespace. llvm-svn: 120748	2010-12-02 23:05:38 +00:00
Devang Patel	822facd787	Use set directive for StartMinusEndExpr. This is a fix for llvm-gcc-i386-darwin9 buildbot failure. llvm-svn: 120742	2010-12-02 21:32:30 +00:00
Jakob Stoklund Olesen	08f52108b1	Update LiveDebugVariables during coalescing. llvm-svn: 120720	2010-12-02 18:15:44 +00:00
Jim Grosbach	b6d0c8d5b1	When expanding the MOVCCi32imm, make sure to use the ARM movt/movw opcodes, not thumb2. llvm-svn: 120711	2010-12-02 16:42:25 +00:00
Jim Grosbach	78ef3199c8	Fix copy/pasto in vmin.f32 encoding. llvm-svn: 120709	2010-12-02 16:30:58 +00:00
Wesley Peck	09a4bffe09	Teaching MBlaze backend how to reverse branch conditions. llvm-svn: 120707	2010-12-02 16:17:11 +00:00
Rafael Espindola	922894345e	Add a fast path to EvaluateSymbolicAdd. This avoids computing symbol addresses which then avoids running EnsureValid. This cuts the assembly time of the testcase in PR8711 from 2:50 minutes to 1 minute. llvm-svn: 120697	2010-12-02 07:53:12 +00:00
Rafael Espindola	2dabc56340	Move EmitValueToOffset to the ObjectStreamer. llvm-svn: 120691	2010-12-02 05:59:38 +00:00
Rafael Espindola	3bc5d20c38	Add EmitInstToFragment to the generic object streamer. llvm-svn: 120690	2010-12-02 05:44:06 +00:00
Rafael Espindola	41bdd97d48	The sections that the ELF object writer has to create are very simple and contain only data. Handle them specially instead of using AddSectionToTheEnd. This moves a hack from the generic assembler to the elf writer. It is also a bit faster and should make other improvements easier. llvm-svn: 120683	2010-12-02 03:09:06 +00:00
Devang Patel	525de51e78	If tehre are not any line entry then do not try to emit .debug_line section. llvm-svn: 120637	2010-12-02 01:17:51 +00:00
Jakob Stoklund Olesen	54b6cd6d38	Implement the first half of LiveDebugVariables. Scan the MachineFunction for DBG_VALUE instructions, and replace them with a data structure similar to LiveIntervals. The live range of a DBG_VALUE is determined by propagating it down the dominator tree until a new DBG_VALUE is found. When a DBG_VALUE lives in a register, its live range is confined to the live range of the register's value. LiveDebugVariables runs before coalescing, so DBG_VALUEs are not artificially extended when registers are joined. The missing half will recreate DBG_VALUE instructions from the intervals when register allocation is complete. The pass is disabled by default. It can be enabled with the temporary command line option -live-debug-variables. llvm-svn: 120636	2010-12-02 00:37:37 +00:00
Jim Grosbach	0e71db6919	Add support for binary encoding of ARM 'adr' instructions referencing constant pool entries (LEApcrel pseudo). Ongoing saga of rdar://8542291. llvm-svn: 120635	2010-12-02 00:28:45 +00:00
Devang Patel	409a5ff824	Revert r120580. llvm-svn: 120630	2010-12-02 00:22:29 +00:00
Evan Cheng	4118b24aca	Fix and re-enable tail call optimization of expanded libcalls. llvm-svn: 120622	2010-12-01 22:59:46 +00:00
Rafael Espindola	c9af6aea7a	Remove unused argument. llvm-svn: 120621	2010-12-01 22:48:11 +00:00
Jason W Kim	7d4b30652e	fixing style nit: move class static to global static llvm-svn: 120619	2010-12-01 22:46:50 +00:00
Bill Wendling	d85ff071c0	Add a post encoder method to the VFP instructions to convert them to the Thumb2 encoding if we're in that mode. llvm-svn: 120608	2010-12-01 21:54:50 +00:00
Jim Grosbach	b1b1ff4271	Use the correct fixup type for ARM VLDR* llvm-svn: 120604	2010-12-01 21:09:40 +00:00
Rafael Espindola	16b64c646a	Rename temporary symbols if they conflict with artificial symbols created by the assembler. This was blocking parsing any large .s produced by clang for example. Fixes PR8596. llvm-svn: 120603	2010-12-01 20:46:11 +00:00
Michael J. Spencer	8908ec62e4	Support/FileSystem: Fix copy_file implementation to use toNullTerminatedStringRef instead of toStringRef. The file system APIs need c strings. llvm-svn: 120601	2010-12-01 20:37:42 +00:00
Michael J. Spencer	03bbd0f21d	Support/ADT/Twine: Add toNullTerminatedStringRef. llvm-svn: 120600	2010-12-01 20:37:30 +00:00
Jim Grosbach	b2a12afa5f	Refactor LEApcrelJT as a pseudo-instructionlowered to a cannonical ADR instruction at MC lowering. Add binary encoding information for the ADR, including fixup data for the label operand. llvm-svn: 120594	2010-12-01 19:47:31 +00:00
Michael J. Spencer	42a5674c7a	Support/FileSystem: Add copy_file implementation. Not tests yet because the file creation APIs aren't implemented. llvm-svn: 120593	2010-12-01 19:32:01 +00:00
Owen Anderson	8802c68592	Add correct encodings for STRD and LDRD, including fixup support. Additionally, update these to unified syntax. llvm-svn: 120589	2010-12-01 19:18:46 +00:00
Jason W Kim	d468d24fc9	kill trailing space llvm-svn: 120586	2010-12-01 19:07:22 +00:00
Jim Grosbach	538c57aeed	Fix a mised reloc rename spot. llvm-svn: 120585	2010-12-01 19:02:26 +00:00
Jim Grosbach	25b2b536f3	10 bits, not 12. llvm-svn: 120584	2010-12-01 18:51:32 +00:00
Devang Patel	e68cb5a5cf	Disable debug info for x86-darwin9 and earlier until PR 8715 and radar 8709290 are fixed. llvm-svn: 120580	2010-12-01 16:59:34 +00:00
Duncan Sands	cd4f56b8e2	I don't think it makes any sense to assert that the target supports SSE3 here. The user (i.e. whoever generated a call to the intrinsic in the first place) is essentially asking for a particular instruction to be placed in the assembler. If that instruction won't execute on the target machine, that's their problem not ours. Two buildbots with processors that don't support SSE3 were barfing on the apm.ll test in CodeGen/X86 because of this assertion. llvm-svn: 120574	2010-12-01 12:58:13 +00:00
Che-Liang Chiou	c61d8fa0e3	ptx: bug fix: use after free llvm-svn: 120571	2010-12-01 11:45:53 +00:00

... 2 3 4 5 6 ...

44035 Commits