llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Eric Christopher	1fb8e7458e	For types with a parent of the compile unit make sure and emit the DECL information. rdar://10855921 llvm-svn: 152876	2012-03-15 23:55:40 +00:00
Chad Rosier	e007850778	[fast-isel] Address Eli's comments for r152847. Specifically, add a test case and still allow immediate encoding, just not with cmn. rdar://11038907 llvm-svn: 152869	2012-03-15 22:54:20 +00:00
Jim Grosbach	3812c82b92	ARM case-insensitive checking for APSR_nzcv. rdar://11056591 llvm-svn: 152846	2012-03-15 21:34:14 +00:00
Matt Beaumont-Gay	7f3db984b3	line endings llvm-svn: 152832	2012-03-15 20:24:29 +00:00
Lang Hames	7918b0b225	Use vmov.f32 to materialize f32 consts on ARM. This relaxes constraints on register allocation by allowing all 32 D-registers to be used. Patch by Cameron Zwarich. llvm-svn: 152824	2012-03-15 18:49:02 +00:00
Kristof Beyls	5f7d669c67	Fix VCVT decoding (between floating-point and fixed-point, Floating-point). Patch by Richard Barton. llvm-svn: 152814	2012-03-15 17:50:29 +00:00
Rafael Espindola	ac42573389	Short term fix for pr12270 before we change dominates to handle unreachable code. While here, reduce indentation. llvm-svn: 152803	2012-03-15 15:52:59 +00:00
Nadav Rotem	8cf9105f96	When optimizing certain BUILD_VECTOR nodes into other BUILD_VECTOR nodes, add the new node into the work list because there is a potential for further optimizations. llvm-svn: 152784	2012-03-15 08:49:06 +00:00
Eric Christopher	0711b41ec6	Revert the removal of DW_AT_MIPS_linkage_name when we aren't putting out the DW_AT_name. Older gdbs unfortunately still use it to disambiguate member functions in templated classes (gdb.cp/templates.exp). rdar://11043421 (which is now deferred for a bit) llvm-svn: 152782	2012-03-15 08:19:33 +00:00
Chad Rosier	bd3e55d39c	[avx] Add patterns for VINSERTF128rm. This results in things such as vmovaps -96(%rbx), %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 to be combined to vinsertf128 $1, -96(%rbx), %ymm0, %ymm0 rdar://10643481 llvm-svn: 152762	2012-03-15 00:45:30 +00:00
Aaron Ballman	bf6eebde21	Fixed a transform crash when setting a negative size value for memset. Fixes PR12202. llvm-svn: 152756	2012-03-15 00:05:31 +00:00
Chandler Carruth	889ecbc0f8	Extend the inline cost calculation to account for bonuses due to correlated pairs of pointer arguments at the callsite. This is designed to recognize the common C++ idiom of begin/end pointer pairs when the end pointer is a constant offset from the begin pointer. With the C-based idiom of a pointer and size, the inline cost saw the constant size calculation, and this provides the same level of information for begin/end pairs. In order to propagate this information we have to search for candidate operations on a pair of pointer function arguments (or derived from them) which would be simplified if the pointers had a known constant offset. Then the callsite analysis looks for such pointer pairs in the argument list, and applies the appropriate bonus. This helps LLVM detect that half of bounds-checked STL algorithms (such as hash_combine_range, and some hybrid sort implementations) disappear when inlined with a constant size input. However, it's not a complete fix due the inaccuracy of our cost metric for constants in general. I'm looking into that next. Benchmarks showed no significant code size change, and very minor performance changes. However, specific code such as hashing is showing significantly cleaner inlining decisions. llvm-svn: 152752	2012-03-14 23:19:53 +00:00
Dan Gohman	a30e1f4576	When an invoke is marked with metadata indicating its unwind edge should be ignored by ARC optimization, don't insert new ARC runtime calls in the unwind destination. llvm-svn: 152748	2012-03-14 23:05:06 +00:00
Eric Christopher	ffe82d6846	Remove the DW_AT_MIPS_linkage name attribute when we don't need it output (we're emitting a specification already and the information isn't changing). Saves 1% on the debug information for a build of llvm. Fixes rdar://11043421 llvm-svn: 152697	2012-03-14 02:59:17 +00:00
Evan Cheng	155a7230b7	DAG combine incorrectly optimize (i32 vextract (v4i16 load $addr), c) to (i16 load $addr+csizeof(i16)) and replace uses of (i32 vextract) with the i16 load. It should issue an extload instead: (i32 extload $addr+csizeof(i16)). rdar://11035895 llvm-svn: 152675	2012-03-13 22:00:52 +00:00
Kevin Enderby	b5413ed6cc	Change the X86 assembler to not require a segment register on string instruction's destination operand like it does for the source operand. Also fix a typo in the comment for X86AsmParser::isSrcOp(). llvm-svn: 152654	2012-03-13 19:47:55 +00:00
Chris Lattner	84f83c2727	enhance jump threading to preserve TBAA information when PRE'ing loads, fixing rdar://11039258, an issue that came up when inspecting clang's bootstrapped codegen. llvm-svn: 152635	2012-03-13 18:07:41 +00:00
Dan Gohman	fa43b599ac	Teach globalopt how to evaluate an invoke with a non-void return type. llvm-svn: 152634	2012-03-13 18:01:37 +00:00
Duncan Sands	60c339c405	Generalize the "trunc(ptrtoint(x)) - trunc(ptrtoint(y)) -> trunc(ptrtoint(x-y))" optimization introduced by Chandler. llvm-svn: 152626	2012-03-13 14:07:05 +00:00
Eli Friedman	77682009bc	Fix regression from r151466: an we can't replace uses of an instruction reachable from the entry block with uses of an instruction not reachable from the entry block. PR12231. llvm-svn: 152595	2012-03-13 01:06:07 +00:00
Kevin Enderby	8afd951f49	Change the second line of the test added for r152414 to use CHECK-NEXT. Suggestion by Bill Wendling! llvm-svn: 152582	2012-03-12 21:38:09 +00:00
Kevin Enderby	9f26c75ab5	Added a missing error check for X86 assembly with mismatched base and index registers not both being 64-bit or both being 32-bit registers. llvm-svn: 152580	2012-03-12 21:32:09 +00:00
Kostya Serebryany	f5088bb8a5	[asan] move x86-specific test to a separate X86 directory with a custom lit.local.cfg file llvm-svn: 152567	2012-03-12 18:49:11 +00:00
Chandler Carruth	015ff468c2	When inlining a function and adding its inner call sites to the candidate set for subsequent inlining, try to simplify the arguments to the inner call site now that inlining has been performed. The goal here is to propagate and fold constants through deeply nested call chains. Without doing this, we loose the inliner bonus that should be applied because the arguments don't match the exact pattern the cost estimator uses. Reviewed on IRC by Benjamin Kramer. llvm-svn: 152556	2012-03-12 11:19:33 +00:00
Chandler Carruth	d1c1c98162	Teach instsimplify how to constant fold pointer differences. Typically instcombine has handled this, but pointer differences show up in several contexts where we would like to get constant folding, and cannot afford to run instcombine. Specifically, I'm working on improving the constant folding of arguments used in inline cost analysis with instsimplify. Doing this in instsimplify implies some algorithm changes. We have to handle multiple layers of all-constant GEPs because instsimplify cannot fold them into a single GEP the way instcombine can. Also, we're only interested in all-constant GEPs. The result is that this doesn't really replace the instcombine logic, it's just complimentary and focused on constant folding. Reviewed on IRC by Benjamin Kramer. llvm-svn: 152555	2012-03-12 11:19:31 +00:00
Chandler Carruth	98464723a5	FileCheck-ize this test. llvm-svn: 152554	2012-03-12 11:19:28 +00:00
Andrew Trick	db66ee17be	Move llc + target triple tests into X86 llvm-svn: 152502	2012-03-10 19:03:51 +00:00
Benjamin Kramer	dbfa526afc	Don't try to filecheck bitcode. llvm-svn: 152498	2012-03-10 18:07:46 +00:00
Bill Wendling	5f16e35eed	Make this transformation slightly less agressive and more correct. The 'CmpInst::isFalseWhenEqual' function returns 'false' for values other than simply equality. For instance, it returns 'false' for <= or >=. This isn't the correct behavior for this transformation, which is checking for strict equality and non-equality. It was causing the gcc.c-torture/execute/frame-address.c test to fail because it would completely (and incorrectly) optimize a whole function into a 'ret i32 0'. llvm-svn: 152497	2012-03-10 17:56:03 +00:00
Bill Wendling	1a3f2619a7	Fix disasm of iret, sysexit, and sysret when displayed with Intel syntax. Patch by Kay Tiong Khoo! llvm-svn: 152487	2012-03-10 07:37:27 +00:00
Kevin Enderby	15f974a5a4	Add the missing call to Error when a bad X86 scale expression is parsed. llvm-svn: 152443	2012-03-09 22:24:10 +00:00
David Meyer	d29d7cfe60	Support reading GNU symbol versions in ELFObjectFile * Add enums and structures for GNU version information. * Implement extraction of that information on a per-symbol basis (ELFObjectFile::getSymbolVersion). * Implement a generic interface, GetELFSymbolVersion(), for getting the symbol version from the ObjectFile (hides the templating). * Have llvm-readobj print out the version, when available. * Add a test for the new feature: readobj-elf-versioning.test llvm-svn: 152436	2012-03-09 20:59:52 +00:00
Dan Gohman	784659a39f	When identifying exit nodes for the reverse-CFG reverse-post-order traversal, consider nodes for which the only successors are backedges which the traversal is ignoring to be exit nodes. This fixes a problem where the bottom-up traversal was failing to visit split blocks along split loop backedges. This fixes rdar://10989035. llvm-svn: 152421	2012-03-09 18:50:52 +00:00
Kevin Enderby	1a3b6570f8	Fix the x86 disassembler to at least print the lock prefix if it is the first prefix. Added a FIXME to remind us this still does not work when it is not the first prefix. llvm-svn: 152414	2012-03-09 17:52:49 +00:00
NAKAMURA Takumi	c97ffd132b	test/MC/X86/lit.local.cfg: Fix up to detect 'X86' in targets. llvm-svn: 152406	2012-03-09 14:52:38 +00:00
Duncan Sands	8139573edf	Eliminate switch cases that can never match, for example removes all negative switch cases if the branch condition is known to be positive. Inspired by a recent improvement to GCC's VRP. llvm-svn: 152405	2012-03-09 13:45:18 +00:00
Chandler Carruth	63f95ab839	Undo a previous restriction on the inline cost calculation which Nick introduced. Specifically, there are cost reductions for all constant-operand icmp instructions against an alloca, regardless of whether the alloca will in fact be elligible for SROA. That means we don't want to abort the icmp reduction computation when we abort the SROA reduction computation. That in turn frees us from the need to keep a separate worklist and defer the ICmp calculations. Use this new-found freedom and some judicious function boundaries to factor the innards of computing the cost factor of any given instruction out of the loop over the instructions and into static helper functions. This greatly simplifies the code, and hopefully makes it more clear what is happening here. Reviewed by Eric Christopher. There is some concern that we'd like to ensure this doesn't get out of hand, and I plan to benchmark the effects of this change over the next few days along with some further fixes to the inline cost. llvm-svn: 152368	2012-03-09 02:49:36 +00:00
Chad Rosier	a10cf5e1b9	Fix a regression from r147481. Original commit message from r147481: DAGCombine for transforming 128->256 casts into a vmovaps, rather then a vxorps + vinsertf128 pair if the original vector came from a load. Fix: Unaligned loads need to generate a vmovups. rdar://10974078 llvm-svn: 152366	2012-03-09 02:00:48 +00:00
Benjamin Kramer	d42906ae81	Remove the no longer existent psp triple from a test. The test fell back to the C backend, making it useless and it started to fail on configurations that don't build the C backend. llvm-svn: 152342	2012-03-08 21:22:27 +00:00
Akira Hatanaka	f4288c9e0e	Test case for r152280, r152285 and r152290. llvm-svn: 152292	2012-03-08 03:32:42 +00:00
Rafael Espindola	4cd149ab38	Use llvm-mc instead of llc. Patch by Jack Carter. llvm-svn: 152242	2012-03-07 20:58:59 +00:00
Jakob Stoklund Olesen	47b877f5bd	Fix infinite loop in nested multiclasses. Patch by Michael Liao! llvm-svn: 152232	2012-03-07 16:39:35 +00:00
Eric Christopher	6d5c7a5141	Add the DW_AT_APPLE_runtime_class attribute to forward declarations as well as completely defined classes. This fixes rdar://10956070 llvm-svn: 152171	2012-03-07 00:15:19 +00:00
Evan Cheng	f04f2e7a52	Extend r148086 to check for [r +/- reg] address mode. This fixes queens performance regression (due to increased register pressure from overly aggressive pre-inc formation). llvm-svn: 152162	2012-03-06 23:33:32 +00:00
Eli Friedman	c397259ea6	Fix the operand ordering on aliases for shld and shrd. PR12173, part 2. llvm-svn: 152136	2012-03-06 19:58:46 +00:00
Kevin Enderby	64d11852dd	Fix a bug in the ARM disassembly of the neon VLD2 all lanes instruction. llvm-svn: 152127	2012-03-06 18:33:12 +00:00
Jakob Stoklund Olesen	d4e1cb591a	Add <imp-def> operands when reloading into physregs. When an instruction only writes sub-registers, it is still necessary to add an <imp-def> operand for the super-register. When reloading into a virtual register, rewriting will add the operand, but when loading directly into a virtual register, the <imp-def> operand is still necessary. llvm-svn: 152095	2012-03-06 02:48:17 +00:00
Lang Hames	a49054ac9c	Split fpscr into two registers: FPSCR and FPSCR_NZCV. The fpscr register contains both flags (set by FP operations/comparisons) and control bits. The control bits (FPSCR) should be reserved, since they're always available and needn't be defined before use. The flag bits (FPSCR_NZCV) should like to be unreserved so they can be hoisted by MachineCSE. This fixes PR12165. llvm-svn: 152076	2012-03-06 00:19:55 +00:00
Jim Grosbach	91314c2db6	ARM vpush/vpop assembler mnemonics accept an optional size suffix. rdar://10988114 llvm-svn: 152068	2012-03-05 23:16:31 +00:00
Eli Friedman	59cebb7902	Make sure we don't return bits outside the mask in ComputeMaskedBits. PR12189. llvm-svn: 152066	2012-03-05 23:09:40 +00:00
Jakob Stoklund Olesen	9fed2852ff	Remove a test case that no longer makes sense. This was testing the handling of sub-register coalescing followed by remat. The original problem was caused by the extra <imp-def> operands added by sub-register coalescing. Those <imp-def> operands are not added any longer, and the test case passes even when the original patch is reverted. llvm-svn: 152040	2012-03-05 19:10:13 +00:00
Sebastian Pop	e6eeed8151	updated patch for the ARM fused multiply add/sub In this update: - I assumed neon2 does not imply vfpv4, but neon and vfpv4 imply neon2. - I kept setting .fpu=neon-vfpv4 code attribute because that is what the assembler understands. Patch by Ana Pazos <apazos@codeaurora.org> llvm-svn: 152036	2012-03-05 17:39:52 +00:00
Eli Friedman	4a049305a9	Make aliases for shld and shrd match gas. PR12173. llvm-svn: 152014	2012-03-05 04:31:54 +00:00
Jakob Stoklund Olesen	fd29132e44	Use <def,undef> operands when spilling NEON bundles. MachineOperands that define part of a virtual register must have an <undef> flag if they are not intended as read-modify-write operands. The old trick of adding an <imp-def> operand doesn't work any longer. Fixes PR12177. llvm-svn: 152008	2012-03-04 18:40:30 +00:00
Duncan Sands	ccc56e1071	Nick pointed out on IRC that GVN's propagateEquality wasn't propagating equalities into phi node operands for which the equality is known to hold in the incoming basic block. That's because replaceAllDominatedUsesWith wasn't handling phi nodes correctly in general (that this didn't give wrong results was just luck: the specific way GVN uses replaceAllDominatedUsesWith precluded wrong changes to phi nodes). llvm-svn: 152006	2012-03-04 13:25:19 +00:00
Bill Wendling	88f55b45b2	Do trivial CSE of dead BBs during codegen preparation. Some BBs can become dead after codegen preparation. If we delete them here, it could help enable tail-call optimizations later on. <rdar://problem/10256573> llvm-svn: 152002	2012-03-04 10:46:01 +00:00
Jakob Stoklund Olesen	cfef07fd05	Fix RA-dependent test. llvm-svn: 151958	2012-03-03 00:26:30 +00:00
Benjamin Kramer	2a3719125f	LVI: Recognize the form instcombine canonicalizes range checks into when forming constant ranges. This could probably be made a lot smarter, but this is a common case and doesn't require LVI to scan a lot of code. With this change CVP can optimize away the "shift == 0" case in Hashing.h that only gets hit when "shift" is in a range not containing 0. llvm-svn: 151919	2012-03-02 15:34:43 +00:00
Chad Rosier	c6fad847e9	Prevent obscure and incorrect tail-call optimization. In this instance we are generating the tail-call during legalizeDAG. The 2nd floor call can't be a tail call because it clobbers %xmm1, which is defined by the first floor call. The first floor call can't be a tail-call because it's not in the tail position. The only reasonable way I could think to fix this in a target-independent manner was to check for glue logic on the copy reg. rdar://10930395 llvm-svn: 151877	2012-03-02 02:50:46 +00:00
Eric Christopher	39493f0f97	Revert "Reorder the sections being output to reduce the number of assembler" The inline table needs to be constructed ahead of time so that it doesn't try to create new strings while we're emitting everything. This reverts commit a8ff9bccb399183cdd5f1c3cec2bda763664b4b0. llvm-svn: 151864	2012-03-02 00:30:24 +00:00
Evan Cheng	31b407de17	Neuter the optimization I implemented with r107852 and r108258 which turn some floating point equality comparisons into integer ones with -ffast-math. The issue is the optimization causes +0.0 != -0.0. Now the optimization is only done when one side is known to be 0.0. The other side's sign bit is masked off for the comparison. rdar://10964603 llvm-svn: 151861	2012-03-01 23:27:13 +00:00
Eric Christopher	3d271eb540	Reorder the sections being output to reduce the number of assembler fixups that are being used to determine section offsets. Reduces the total number of fixups by 50% for a non-trivial testcase. Part of rdar://10413936 llvm-svn: 151852	2012-03-01 22:50:31 +00:00
David Meyer	7f21ecb667	[Object] Add ObjectFile::getLoadName() for retrieving the soname/installname of a shared object. llvm-svn: 151845	2012-03-01 22:19:54 +00:00
Kevin Enderby	26dad6994b	Change ARMInstPrinter::printPredicateOperand() so it will not abort if it runs into the undefined 15 condition code value. llvm-svn: 151844	2012-03-01 22:13:02 +00:00
Akira Hatanaka	75b06f4a49	Fix bugs which were introduced when support for base+index floating point loads and stores was added. - SelectAddr should return false if Parent is an unaligned f32 load or store. - Only aligned load and store nodes should be matched to select reg+imm floating point instructions. - MIPS does not have support for f64 unaligned load or store instructions. llvm-svn: 151843	2012-03-01 22:12:30 +00:00
Preston Gurd	29cb4871db	Trivial change to make the test use Use –mcpu=generic, so that the test will not fail when run on an Intel Atom processor, due to the Atom scheduler producing an instruction sequence that is different from that which is normally expected. llvm-svn: 151832	2012-03-01 19:57:20 +00:00
Chad Rosier	3bdd700004	Revert r151816 as Jim has the appropriate fix. llvm-svn: 151818	2012-03-01 17:41:19 +00:00
Chad Rosier	89b9ecae75	Fix testcases from r151807. llvm-svn: 151816	2012-03-01 17:31:30 +00:00
Jim Grosbach	e41072bbb4	Add missing triple for tests. Make darwin bots happier. llvm-svn: 151813	2012-03-01 17:30:32 +00:00
James Molloy	1038b57cac	Fix a codegen fault in which log2 or exp2 could be dead-code eliminated even though they could have sideeffects. Only allow log2/exp2 to be converted to an intrinsic if they are declared "readnone". llvm-svn: 151807	2012-03-01 14:32:18 +00:00
NAKAMURA Takumi	9d35e6f60f	llvm/test/CMakeLists.txt: Update dependencies to add llvm-readobj to "check". llvm-svn: 151795	2012-03-01 03:14:13 +00:00
David Meyer	44201a2d17	[Object] * Add begin_dynamic_table() / end_dynamic_table() private interface to ELFObjectFile. * Add begin_libraries_needed() / end_libraries_needed() interface to ObjectFile, for grabbing the list of needed libraries for a shared object or dynamic executable. * Implement this new interface completely for ELF, leave stubs for COFF and MachO. * Add 'llvm-readobj' tool for dumping ObjectFile information. llvm-svn: 151785	2012-03-01 01:36:50 +00:00
Lang Hames	6cd018d0bc	Don't redundantly copy implicit operands when rematerializing. While we're at it - don't copy vreg implicit operands while rematerializing. This fixes PR12138. llvm-svn: 151779	2012-03-01 00:41:17 +00:00
Richard Trieu	4eaabe29a7	Fix flags for test in MC/MachO/ARM/empty-function-nop.ll llvm-svn: 151778	2012-03-01 00:29:09 +00:00
Benjamin Kramer	1b5aa9f5cd	LegalizeIntegerTypes: Reorder operations in the "big shift by small amount" optimization, making the lives of later passes easier. llvm-svn: 151722	2012-02-29 13:27:00 +00:00
Duncan Sands	207ee17589	Have GVN also do condition propagation when the right-hand side is not a constant. This fixes PR1768. llvm-svn: 151713	2012-02-29 11:12:03 +00:00
Bill Wendling	690b72d2b3	Testcase for r151691. llvm-svn: 151694	2012-02-29 01:53:13 +00:00
Jim Grosbach	cb853fbdc9	ARM implement TargetInstrInfo::getNoopForMachoTarget() Without this hook, functions w/ a completely empty body (including no epilogue) will cause an MCEmitter assertion failure. For example, define internal fastcc void @empty_function() { unreachable } rdar://10947471 llvm-svn: 151673	2012-02-28 23:53:30 +00:00
David Meyer	31e23de700	In the ObjectFile interface, replace isInternal(), isAbsolute(), isGlobal(), and isWeak(), with a bitset of flags. llvm-svn: 151670	2012-02-28 23:47:53 +00:00
Rafael Espindola	646dff508a	On ELF, create relocations to the abbreviation and line sections when producing debug info for assembly files. We were already doing the right thing when producing debug info for C/C++. ELF linkers don't know dwarf, so they depend on these relocations to produce valid dwarf output. llvm-svn: 151655	2012-02-28 21:13:05 +00:00
Benjamin Kramer	daa291f4fd	LegalizeIntegerTypes: Reenable the large shift with small amount optimization. To avoid problems with zero shifts when getting the bits that move between words we use a trick: first shift the by amount-1, then do another shift by one. When amount is 0 (and size 32) we first shift by 31, then by one, instead of by 32. Also fix a latent bug that emitted the low and high words in the wrong order when shifting right. Fixes PR12113. llvm-svn: 151637	2012-02-28 17:58:00 +00:00
Daniel Dunbar	b448d31a6b	Revert r151623 "Some ARM implementaions, e.g. A-series, does return stack prediction. ...", it is breaking the Clang build during the Compiler-RT part. llvm-svn: 151630	2012-02-28 15:36:07 +00:00
Nadav Rotem	75b36e6716	Fix a bug in the code that builds SDNodes from vector GEPs. When the GEP index is a vector of pointers, the code that calculated the size of the element started from the vector type, and not the contained pointer type. As a result, instead of looking at the data element pointed by the vector, this code used the size of the vector. This works for 32bit members (on 32bit systems), but not for other types. Added code to peel the vector type and added a test. llvm-svn: 151626	2012-02-28 11:54:05 +00:00
Evan Cheng	d29a22e4b0	Some ARM implementaions, e.g. A-series, does return stack prediction. That is, the processor keeps a return addresses stack (RAS) which stores the address and the instruction execution state of the instruction after a function-call type branch instruction. Calling a "noreturn" function with normal call instructions (e.g. bl) can corrupt RAS and causes 100% return misprediction so LLVM should use a unconditional branch instead. i.e. mov lr, pc b _foo The "mov lr, pc" is issued in order to get proper backtrace. rdar://8979299 llvm-svn: 151623	2012-02-28 06:42:03 +00:00
Pete Cooper	ab5f2302dc	Reverted r152620 - DSE: Shorten memset when a later store overwrites the start of it. There were all sorts of buildbot issues llvm-svn: 151621	2012-02-28 05:06:24 +00:00
Pete Cooper	93352dcd53	DSE: Shorten memset when a later store overwrites the start of it llvm-svn: 151620	2012-02-28 04:27:10 +00:00
Akira Hatanaka	0934449dd8	Add support for floating point base register + offset register addressing mode load and store instructions. llvm-svn: 151611	2012-02-28 02:55:02 +00:00
Jakob Stoklund Olesen	c74b7b271e	Handle regmasks in MachineCSE. Don't attempt to extend physreg live ranges across calls. <rdar://problem/10942095> llvm-svn: 151610	2012-02-28 02:08:50 +00:00
Jakob Stoklund Olesen	c6377253f7	Enable ARM base pointer when calling functions with large arguments. When an outgoing call takes more than 2k of arguments on the stack, we don't allocate that call frame in the prolog, but adjust the stack pointer immediately before the call instead. This causes problems with the emergency spill slot because PEI can't track stack pointer adjustments on the second pass, and if the outgoing arguments are too big, SP can't be used to reach the emergency spill slot at all. Work around these problems by ensuring there is a base or frame pointer that can be used to access the emergency spill slot. <rdar://problem/10917166> llvm-svn: 151604	2012-02-28 01:15:01 +00:00
Michael J. Spencer	0aef1b9f18	[Object] Add {begin,end}_dynamic_symbols stubs and implementation for ELF. Add -D option to llvm-nm to dump dynamic symbols. Patch by David Meyer. llvm-svn: 151600	2012-02-28 00:40:37 +00:00
Bill Wendling	aa73b7af8d	Add back removed code. It still causes LLVM to miscompile. But not having it breaks other things. llvm-svn: 151594	2012-02-27 23:48:30 +00:00
Preston Gurd	81931b8e3b	test commit. llvm-svn: 151588	2012-02-27 23:31:51 +00:00
Eli Friedman	1ff1d1f1bc	Duncan pointed out that if the alignment isn't explicitly specified, it defaults to the ABI alignment. Given that, make this code a bit more aggressive in such cases. llvm-svn: 151584	2012-02-27 23:16:46 +00:00
Bill Wendling	42454c257c	XFAIL test until <rdar://problem/10913281> is fixed. llvm-svn: 151578	2012-02-27 22:53:42 +00:00
Jim Grosbach	02bf78f5ca	ARM BL/BLX instruction fixups should use relocations. We on the linker to resolve calls to the appropriate BL/BLX instruction to make interworking function correctly. It uses the symbol in the relocation to do that, so we need to be careful about being too clever. To enable this for ARM mode, split the BL/BLX fixup kind off from the unconditional-branch fixups. rdar://10927209 llvm-svn: 151571	2012-02-27 21:36:23 +00:00
Eli Friedman	15f56db6c0	Teach BasicAA about the LLVM IR rules that allow reading past the end of an object given sufficient alignment. Fixes PR12098. llvm-svn: 151553	2012-02-27 20:46:07 +00:00
Roman Divacky	588712f080	Test the section specification. llvm-svn: 151552	2012-02-27 20:42:19 +00:00
Roman Divacky	200acf8e6e	Reapply r151278 with fixes. MCize function entry label emission on PowerPC64 properly. llvm-svn: 151547	2012-02-27 20:20:47 +00:00
Duncan Sands	9e95178a81	When performing a conditional branch depending on the value of a comparison %cmp (eg: A==B) we already replace %cmp with "true" under the true edge, and with "false" under the false edge. This change enhances this to replace the negated compare (A!=B) with "false" under the true edge and "true" under the false edge. Reported to improve perlbench results by 1%. llvm-svn: 151517	2012-02-27 08:14:30 +00:00
Rafael Espindola	2d9b864afe	Fix this assert. IP can point to an instruction with strange dominance properties (invoke). Just assert that the instruction we return dominates the insertion point. llvm-svn: 151511	2012-02-27 02:13:03 +00:00
Craig Topper	ab46706aa9	X86 disassembler support for jcxz, jecxz, and jrcxz. Fixes PR11643. Patch by Kay Tiong Khoo. llvm-svn: 151510	2012-02-27 01:54:29 +00:00
Rafael Espindola	868ea25522	Add testcase for the previous commit. llvm-svn: 151475	2012-02-26 05:49:57 +00:00
Rafael Espindola	34b7c064cb	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Nick Lewycky	a93c874757	Reinstate the optimization from r151449 with a fix to not turn 'gep %x' into 'gep null' when the icmp predicate is unsigned (or is signed without inbounds). llvm-svn: 151467	2012-02-26 02:09:49 +00:00
Nick Lewycky	849715d31f	Roll these back to r151448 until I figure out how they're breaking MultiSource/Applications/lua. llvm-svn: 151463	2012-02-25 23:01:19 +00:00
Nick Lewycky	1636c6eaef	An argument and a local identified object (eg. a noalias call) could turn out equal if both are null. In the test, scope type %t and global @y by adding a 'gep' prefix to them. llvm-svn: 151452	2012-02-25 20:19:07 +00:00
Nick Lewycky	94be1c7d95	Teach instsimplify to be more aggressive when analyzing comparisons of pointers by using llvm::isIdentifiedObject. Also teach it to handle GEPs that have the same base pointer and constant operands. Fixes PR11238! llvm-svn: 151449	2012-02-25 19:07:42 +00:00
Hal Finkel	3aea686faa	Revert r151278, breaks static linking. Reverting this because it breaks static linking on ppc64. Specifically, it may be linkonce_odr functions that are the problem. With this patch, if you link statically, calls to some functions end up calling their descriptor addresses instead of calling to their entry points. This causes the execution to fail with SIGILL (b/c the descriptor address just has some pointers, not code). llvm-svn: 151433	2012-02-25 03:40:11 +00:00
NAKAMURA Takumi	17b6271b41	Target/X86: Fix assertion failures and warnings caused by r151382 _ftol2 lowering for i386-*-win32 targets. Patch by Joe Groff. [Joe Groff] Hi everyone. My previous patch applied as r151382 had a few problems: Clang raised a warning, and X86 LowerOperation would assert out for fptoui f64 to i32 because it improperly lowered to an illegal BUILD_PAIR. Here's a patch that addresses these issues. Let me know if any other changes are necessary. Thanks. llvm-svn: 151432	2012-02-25 03:37:25 +00:00
Akira Hatanaka	8fc9a35d3f	Add definitions of floating point multiply add/sub and negative multiply add/sub instructions. llvm-svn: 151415	2012-02-25 00:21:52 +00:00
Akira Hatanaka	3b3ee53886	Add an option to use a virtual register as the global base register instead of reserving a physical register ($gp or $28) for that purpose. This will completely eliminate loads that restore the value of $gp after every function call, if the register allocator assigns a callee-saved register, or eliminate unnecessary loads if it assigns a temporary register. example: .cpload $25 // set $gp. ... .cprestore 16 // store $gp to stack slot 16($sp). ... jalr $25 // function call. clobbers $gp. lw $gp, 16($sp) // not emitted if callee-saved reg is chosen. ... lw $2, 4($gp) ... jalr $25 // function call. lw $gp, 16($sp) // not emitted if $gp is not live after this instruction. ... llvm-svn: 151402	2012-02-24 22:34:47 +00:00
Chris Lattner	b01936f21a	fix PR12075, a regression in a recent transform I added. In unreachable code, gep chains can be infinite. Just like "stripPointerCasts", use a set to keep track of visited instructions so we don't recurse infinitely. llvm-svn: 151383	2012-02-24 19:01:58 +00:00
Michael J. Spencer	d2f0ce2674	Add WIN_FTOL_* psudo-instructions to model the unique calling convention used by the Win32 _ftol2 runtime function. Patch by Joe Groff! llvm-svn: 151382	2012-02-24 19:01:22 +00:00
Hal Finkel	784c4bf068	X11/X2 loads around indirect calls on ppc64 should not be deleted. llvm-svn: 151374	2012-02-24 17:54:01 +00:00
Hal Finkel	8c2c90c035	Don't crash when a glue node contains an internal CopyToReg This is necessary to support the existing ppc lowering code for indirect calls. Fixes PR12071. llvm-svn: 151373	2012-02-24 17:53:59 +00:00
Duncan Sands	30c1ce0834	Teach GVN that x+y is the same as y+x and that x<y is the same as y>x. llvm-svn: 151365	2012-02-24 15:16:31 +00:00
Kristof Beyls	3f16b0ead0	test commit. removing unnecessary whitespace. llvm-svn: 151363	2012-02-24 13:52:45 +00:00
NAKAMURA Takumi	d8b4183963	test/CodeGen/X86/2012-02-23-mmx-inlineasm.ll: Fixup to add -march=x86. -mcpu does not choose arch automatically, on non-x86 hosts. llvm-svn: 151362	2012-02-24 13:29:50 +00:00
Pete Cooper	135769381b	Turn avx insert intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove duplicate patterns for selecting the intrinsics llvm-svn: 151342	2012-02-24 03:51:49 +00:00
Eric Christopher	ea7403bfe2	If the Address of a variable is an argument then treat the entire variable declaration as an argument because we want that address anyhow for our debug information. This seems to fix rdar://9965111, at least we have more debug information than before and from reading the assembly it appears to be the correct location. llvm-svn: 151335	2012-02-24 01:59:08 +00:00
Jim Grosbach	4ff2fb2fbc	Thumb2 size reduction fix for tied operands of tMUL. The tied source operand of tMUL is the second source operand, not the first like every other two-address thumb instruction. Special case it in the size reduction pass to make sure we create the tMUL instruction properly. llvm-svn: 151315	2012-02-24 00:33:36 +00:00
Dan Gohman	8da4093a80	When emitting a cmp with 0 for a lowered select, mask out the high bits of the value carying the boolean condition, as their contents are undefined. This fixes rdar://10887484. llvm-svn: 151310	2012-02-24 00:09:36 +00:00
Bill Wendling	1a35321235	Allow an integer to be converted into an MMX type when it's used in an inline asm. <rdar://problem/10106006> llvm-svn: 151303	2012-02-23 23:25:25 +00:00
Michael J. Spencer	ba986d585c	Emit global ctors into .CRT$XCU instead of .ctors on Win32. Patch by Joe Groff! llvm-svn: 151289	2012-02-23 21:56:08 +00:00
Roman Divacky	35c45da372	MCize function entry label emission on PowerPC64 properly. llvm-svn: 151278	2012-02-23 20:28:39 +00:00
Kevin Enderby	4e089c2b5b	Updated the llvm-mc disassembler C API to support for the X86 target. rdar://10873652 As part of this I updated the llvm-mc disassembler C API to always call the SymbolLookUp call back even if there is no getOpInfo call back. If there is a getOpInfo call back that is tried first and then if that gets no information then the SymbolLookUp is called. I also made the code more robust by memset(3)'ing to zero the LLVMOpInfo1 struct before then setting SymbolicOp.Value before for the call to getOpInfo. And also don't use any values from the LLVMOpInfo1 struct if getOpInfo returns 0. And also don't use any of the ReferenceType or ReferenceName values from SymbolLookUp if it returns NULL. rdar://10873563 and rdar://10873683 For the X86 target also fixed bugs so the annotations get printed. Also fixed a few places in the ARM target that was not producing symbolic operands for some instructions. rdar://10878166 llvm-svn: 151267	2012-02-23 18:18:17 +00:00
Jakob Stoklund Olesen	3809cf9ffe	Make tests less sensitive to scheduling changes. llvm-svn: 151260	2012-02-23 17:19:34 +00:00
Anton Korobeynikov	fb863cd279	Fix to make sure that a comdat group gets generated correctly for a static member of instantiated C++ templates. Patch by Kristof Beyls! llvm-svn: 151250	2012-02-23 10:36:04 +00:00
Evan Cheng	9d9b58cc0d	Canonicalize (srl (bswap x), 16) to (rotr (bswap x), 16) if the high 16 bits of x are zero. This optimizes rev + lsr 16 to rev16. rdar://10750814 llvm-svn: 151230	2012-02-23 02:58:19 +00:00
Evan Cheng	d18a688213	Optimize a couple of common patterns involving conditional moves where the false value is zero. Instead of a cmov + op, issue an conditional op instead. e.g. cmp r9, r4 mov r4, #0 moveq r4, #1 orr lr, lr, r4 should be: cmp r9, r4 orreq lr, lr, #1 That is, optimize (or x, (cmov 0, y, cond)) to (or.cond x, y). Similarly extend this to xor as well as (and x, (cmov -1, y, cond)) => (and.cond x, y). It's possible to extend this to ADD and SUB but I don't think they are common. rdar://8659097 llvm-svn: 151224	2012-02-23 01:19:06 +00:00
Daniel Dunbar	cac06bf0c6	MC: Fix the MCNullStreamer which was broken in r147763. llvm-svn: 151213	2012-02-22 23:49:50 +00:00
Hal Finkel	cfc8c850f6	Allow the use of an alternate symbol for calculating a function's size. The standard function epilog includes a .size directive, but ppc64 uses an alternate local symbol to tag the actual start of each function. Until recently, binutils accepted the .size directive as: .size test1, .Ltmp0-test1 however, using this directive with recent binutils will result in the error: .size expression for XXX does not evaluate to a constant so we must use the label which actually tags the start of the function. llvm-svn: 151200	2012-02-22 21:11:47 +00:00
Michael J. Spencer	24f6d49962	Properly emit _fltused with FastISel. Refactor to share code with SDAG. Patch by Joe Groff! llvm-svn: 151183	2012-02-22 19:06:13 +00:00
David Greene	7cabd2e787	Add Foreach Loop Add some data structures to represent for loops. These will be referenced during object processing to do any needed iteration and instantiation. Add foreach keyword support to the lexer. Add a mode to indicate that we're parsing a foreach loop. This allows the value parser to early-out when processing the foreach value list. Add a routine to parse foreach iteration declarations. This is separate from ParseDeclaration because the type of the named value (the iterator) doesn't match the type of the initializer value (the value list). It also needs to add two values to the foreach record: the iterator and the value list. Add parsing support for foreach. Add the code to process foreach loops and create defs based on iterator values. Allow foreach loops to be matched at the top level. When parsing an IDValue check if it is a foreach loop iterator for one of the active loops. If so, return a VarInit for it. Add Emacs keyword support for foreach. Add VIM keyword support for foreach. Add tests to check foreach operation. Add TableGen documentation for foreach. Support foreach with multiple objects. Support non-braced foreach body with one object. Do not require types for the foreach declaration. Assume the iterator type from the iteration list element type. llvm-svn: 151164	2012-02-22 16:09:41 +00:00
Eric Christopher	9f47c92b48	Only add DW_AT_prototyped if we're working with a C-like language. Worth another 45k (1%) off of a large C++ testcase. rdar://10909458 llvm-svn: 151144	2012-02-22 08:46:21 +00:00
Rafael Espindola	23cd372dbf	Semantically revert 151015. Add a comment on why we should be able to assert the dominance once the dominates method is fixed and why we can use the builder's insertion point. Fixes pr12048. llvm-svn: 151125	2012-02-22 03:21:39 +00:00
Aaron Ballman	a76a5b7265	Adding support for Microsoft's thiscall calling convention. LLVM side of the patch. llvm-svn: 151123	2012-02-22 03:04:40 +00:00
Jakob Stoklund Olesen	4404c980b2	Remove a bad PowerPC test. This test case was way too strict, matching the entire assembly output. Every non-trivial change to the ppc backend or -O0 pipeline required the test to be updated. It should be replaced with a test of the specific vaarg feature. llvm-svn: 151105	2012-02-21 23:49:18 +00:00
Eric Christopher	f725ac3dff	Testcase for previous commit. rdar://10493979 llvm-svn: 151098	2012-02-21 22:25:56 +00:00
Eric Christopher	7b19cf8b2a	There's no need for a DW_AT_byte_size on a pointer type. Part of rdar://10493979 where it reduces by about .5% (10k) llvm-svn: 151097	2012-02-21 22:25:53 +00:00
Nick Lewycky	664d5b131f	Use the target-aware constant folder on expressions to improve the chance they'll be simple enough to simulate, and to reduce the chance we'll encounter equal but different simple pointer constants. This removes the symptoms from PR11352 but is not a full fix. A proper fix would either require a guarantee that two constant objects we simulate are folded when equal, or a different way of handling equal pointers (ie., trying a constantexpr icmp on them to see whether we know they're equal or non-equal or unsure). llvm-svn: 151093	2012-02-21 22:08:06 +00:00
Evan Cheng	9759637dc1	Proper support for a bastardized darwin-eabi hybird ABI. llvm-svn: 151083	2012-02-21 20:46:00 +00:00
Benjamin Kramer	dacc2e8edb	InstCombine: Don't transform a signed icmp of two GEPs into a signed compare of the indices. This transformation is not safe in some pathological cases (signed icmp of pointers should be an extremely rare thing, but it's valid IR!). Add an explanatory comment. Kudos to Duncan for pointing out this edge case (and not giving up explaining it until I finally got it). llvm-svn: 151055	2012-02-21 13:31:09 +00:00
NAKAMURA Takumi	0fac05f8e2	test/CodeGen/X86/2012-02-20-MachineCPBug.ll: Fix on generic(non-x86) hosts to add -mattr=+sse. llvm-svn: 151053	2012-02-21 11:56:42 +00:00
Nick Lewycky	b9cf2477b9	Check for the correct size in the invariant marker. llvm-svn: 151003	2012-02-20 23:32:26 +00:00
Evan Cheng	3bffc22fc2	Fix machine-cp by having it to check sub-register indicies. e.g. ecx = mov eax al = mov ch The second copy is not a nop because the sub-indices of ecx,ch is not the same of that of eax/al. Re-enabled machine-cp. PR11940 llvm-svn: 151002	2012-02-20 23:28:17 +00:00
Benjamin Kramer	64719820cf	Test case for r150978. llvm-svn: 150979	2012-02-20 19:00:28 +00:00
Benjamin Kramer	9ade8e4d79	InstCombine: When comparing two GEPs that were derived from the same base pointer but use different types, expand the offset calculation and to the compare on the offset if profitable. This came up in SmallVector code. llvm-svn: 150962	2012-02-20 15:07:47 +00:00
Benjamin Kramer	3d87f26b44	InstCombine: Make OptimizePointerDifference more aggressive. - Ignore pointer casts. - Also expand GEPs that aren't constantexprs when they have one use or only constant indices. - We now compile "&foo[i] - &foo[j]" into "i - j". llvm-svn: 150961	2012-02-20 14:34:57 +00:00
Chris Lattner	50ad7c3f54	fold comparisons of gep'd alloca points with null to false, implementing PR12013. We now compile the testcase to: __Z4testv: ## @_Z4testv ## BB#0: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit pushq %rbx subq $64, %rsp leaq 32(%rsp), %rbx movq %rbx, (%rsp) leaq 64(%rsp), %rax movq %rax, 16(%rsp) movl $1, 32(%rsp) leaq 36(%rsp), %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_2 ## BB#1: callq _free LBB0_2: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret instead of: __Z4testv: ## @_Z4testv ## BB#0: pushq %rbx subq $64, %rsp xorl %eax, %eax leaq (%rsp), %rbx addq $32, %rbx movq %rbx, (%rsp) movq %rbx, 8(%rsp) leaq 64(%rsp), %rcx movq %rcx, 16(%rsp) je LBB0_2 ## BB#1: movl $1, 32(%rsp) movq %rbx, %rax LBB0_2: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit addq $4, %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_4 ## BB#3: callq _free LBB0_4: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret This doesn't shrink clang noticably though. llvm-svn: 150944	2012-02-20 00:42:49 +00:00
Craig Topper	cfbfa3dcd1	Add vmfunc instruction to X86 assembler and disassembler. llvm-svn: 150899	2012-02-19 01:39:49 +00:00
Rafael Espindola	5154b9bedb	Don't skip debug instructions when looking for the insertion point of the cast. If we do, we can end up with inst1 --------------- < Insertion point dbg inst new inst instead of the desired inst1 new inst --------------- < Insertion point dbg inst Another option would be for InsertNoopCastOfTo (or its callers) to move the insertion point and we would end up with inst1 dbg inst new inst --------------- < Insertion point but that complicates the callers. This fixes PR12018 (and firefox's build). llvm-svn: 150884	2012-02-18 17:22:58 +00:00
Craig Topper	ecf21d8132	Add X86 assembler and disassembler support for AMD SVM instructions. Original patch by Kay Tiong Khoo. Few tweaks by me for code density and to reduce replication. llvm-svn: 150873	2012-02-18 08:19:49 +00:00
Eli Friedman	be89455c98	Fix a rather nasty regression from r150690: LHS != RHS does not imply LHS->stripPointerCasts() != RHS->stripPointerCasts(). llvm-svn: 150863	2012-02-18 03:29:25 +00:00
Eric Christopher	c2e76f573d	Testcase for the previous commit. llvm-svn: 150852	2012-02-18 00:05:45 +00:00
Dan Gohman	71b80f9e8c	Calls and invokes with the new clang.arc.no_objc_arc_exceptions metadata may still unwind, but only in ways that the ARC optimizer doesn't need to consider. This permits more aggressive optimization. llvm-svn: 150829	2012-02-17 18:59:53 +00:00
David Chisnall	86b0f069d6	It turns out that putting an 8-byte symbol in a 4-byte section makes Solaris ld sulk. GNU ld is perfectly happy with it, which is worrying for a whole other set of reasons... Thanks to Anton, Duncan and Rafael for helping me track this down. Pointy hat to Rafael for introducing the bug in the first place. llvm-svn: 150811	2012-02-17 16:05:50 +00:00
Nick Lewycky	a37a7e5a0f	Remove question. llvm-svn: 150809	2012-02-17 09:55:20 +00:00
Nick Lewycky	a5a53772d9	Add support for invariant.start inside the static constructor evaluator. This is useful to represent a variable that is const in the source but can't be constant in the IR because of a non-trivial constructor. If globalopt evaluates the constructor, and there was an invariant.start with no matching invariant.end possible, it will mark the global constant afterwards. llvm-svn: 150794	2012-02-17 06:59:21 +00:00
Chad Rosier	7867a0bd92	[fast-isel] Add support for returning non-legal types with no sign- or zero- entend flag. llvm-svn: 150774	2012-02-17 01:21:28 +00:00
Bill Wendling	c137296347	Use –mcpu=generic, so that the test will not fail when run on an Intel Atom processor, due to the Atom scheduler producing an instruction sequence that is different from that which is expected. Patch by Michael Spencer! llvm-svn: 150736	2012-02-16 22:42:48 +00:00
Benjamin Kramer	814de25917	Disable machine copy propagation for now. It's known to be buggy (PR11940) and introduces subtle miscompiles in many places. llvm-svn: 150703	2012-02-16 17:29:50 +00:00
Benjamin Kramer	8c809e592f	InstSimplify: Ignore pointer casts when constant folding compares between pointers. llvm-svn: 150690	2012-02-16 13:49:39 +00:00
Eli Bendersky	4afdeeb682	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and now Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Eli Friedman	18f18c7618	loop-rotate shouldn't hoist alloca instructions out of a loop. Patch by Patrik Hägglund, with slightly modified test. Issue reported by Patrik Hägglund on llvmdev. llvm-svn: 150642	2012-02-16 00:41:10 +00:00
Bill Wendling	29bf5e7b09	Remove extraneous tests. llvm-svn: 150636	2012-02-15 23:44:05 +00:00
Bill Wendling	0ea63367ec	Add a test for generating Objective-C metadata from module flags. llvm-svn: 150635	2012-02-15 23:43:37 +00:00
Bill Wendling	25933cd903	Add a test for the Objective-C garbage collection metadata stuff. llvm-svn: 150626	2012-02-15 22:44:10 +00:00
David Meyer	ce969dfbf0	For ELF, also call fixSymbolsInTLSFixups() on expressions passed to EmitValue (literal values). Previously only called on expressions in instructions. New test cases added to tls.s, tls-i386.s. Resolves PR11981. llvm-svn: 150582	2012-02-15 15:09:06 +00:00
Pete Cooper	21409dd760	Stop custom lowering forr x86 DEC64m from happening if the load in the lowered sequence has more than 1 user llvm-svn: 150537	2012-02-15 00:33:37 +00:00
Lang Hames	11ccc79191	Tighten physical register invariants: Allocatable physical registers can only be live in to a block if it is the function entry point or a landing pad. llvm-svn: 150494	2012-02-14 18:51:53 +00:00
Nadav Rotem	5da800572a	Fix PR12000. Some vector operations may use scalar operands with types that are greater than the vector element type. For example BUILD_VECTOR of type <1 x i1> with a constant i8 operand. This patch fixes the assertion. llvm-svn: 150477	2012-02-14 13:06:32 +00:00
Bill Wendling	0f9a487360	Change error tests to coincide with message changes. llvm-svn: 150467	2012-02-14 09:29:21 +00:00
Kostya Serebryany	457b375949	[asan] fix asan-vs-gvn.ll test (it did not actually check much before this change) llvm-svn: 150441	2012-02-14 00:02:35 +00:00
Andrew Trick	c1482c669a	Add simplifyLoopLatch to LoopRotate pass. This folds a simple loop tail into a loop latch. It covers the common (in fortran) case of postincrement loops. It's a "free" way to expose this type of loop to downstream loop optimizations that bail out on non-canonical loops (getLoopLatch is a heavily used check). llvm-svn: 150439	2012-02-14 00:00:23 +00:00
Devang Patel	7f07d60411	Check against umin while converting fcmp into an icmp. llvm-svn: 150425	2012-02-13 23:05:18 +00:00
Dan Gohman	20fd978e4b	Just like in regular escape analysis, loads and stores through (but not of) a block pointer do not cause the block pointer to escape. This fixes rdar://10803830. llvm-svn: 150424	2012-02-13 22:57:02 +00:00
Kostya Serebryany	5cd1e1380f	ThreadSanitizer, a race detector. First LLVM commit. Clang patch (flags) will follow shortly. The run-time library will also follow, but not immediately. llvm-svn: 150423	2012-02-13 22:50:51 +00:00
Nadav Rotem	2141a8413e	Fix a bug in DAGCombine for the optimization of BUILD_VECTOR. We cant generate a shuffle node from two vectors of different types. llvm-svn: 150383	2012-02-13 12:42:26 +00:00
Craig Topper	1487726cdf	Revert accidental commit of a pruned testcase from r150360. llvm-svn: 150361	2012-02-13 04:33:33 +00:00
Craig Topper	250c8fb194	Update CanXFormVExtractWithShuffleIntoLoad to ensure bitcasts of loads only have one use. Matches DAGCombiner and prevents vector_shuffles from reaching isel. llvm-svn: 150360	2012-02-13 04:30:38 +00:00
Pete Cooper	b1229a8866	Fixed bug when custom lowering DEC64m on x86. If the DEC node had more than one user, it was doing this lowering but leaving the original DEC node around and so decrementing twice. Fixes PR11964. llvm-svn: 150356	2012-02-13 00:10:03 +00:00
Nadav Rotem	ea4aecb3e5	This patch addresses the problem of poor code generation for the zext v8i8 -> v8i32 on AVX machines. The codegen often scalarizes ANY_EXTEND nodes. The DAGCombiner has two optimizations that can mitigate the problem. First, if all of the operands of a BUILD_VECTOR node are extracted from an ZEXT/ANYEXT nodes, then it is possible to create a new simplified BUILD_VECTOR which uses UNDEFS/ZERO values to eliminate the scalar ZEXT/ANYEXT nodes. Second, another dag combine optimization lowers BUILD_VECTOR into a shuffle vector instruction. In the case of zext v8i8->v8i32 on AVX, a value in an XMM register is to be shuffled into a wide YMM register. This patch modifes the second optimization and allows the creation of shuffle vectors even when the newly generated vector and the original vector from which we extract the values are of different types. llvm-svn: 150340	2012-02-12 15:05:31 +00:00
Anton Korobeynikov	5996573d4b	Add support for implicit TLS model used with MS VC runtime. Patch by Kai Nacke! llvm-svn: 150307	2012-02-11 17:26:53 +00:00
Bill Wendling	0dfc3d1e3e	[WIP] Initial code for module flags. Module flags are key-value pairs associated with the module. They include a 'behavior' value, indicating how module flags react when mergine two files. Normally, it's just the union of the two module flags. But if two module flags have the same key, then the resulting flags are dictated by the behaviors. Allowable behaviors are: Error Emits an error if two values disagree. Warning Emits a warning if two values disagree. Require Emits an error when the specified value is not present or doesn't have the specified value. It is an error for two (or more) llvm.module.flags with the same ID to have the Require behavior but different values. There may be multiple Require flags per ID. Override Uses the specified value if the two values disagree. It is an error for two (or more) llvm.module.flags with the same ID to have the Override behavior but different values. llvm-svn: 150300	2012-02-11 11:38:06 +00:00
Hal Finkel	56c6162a55	Update BBVectorize to use aliasesUnknownInst. This allows BBVectorize to check the "unknown instruction" list in the alias sets. This is important to prevent instruction fusing from reordering function calls. Resolves PR11920. llvm-svn: 150250	2012-02-10 15:52:40 +00:00
Duncan Sands	931ce8ee15	Fix PR11948: the result type of an icmp may be a vector of boolean - don't assume it is a boolean. llvm-svn: 150247	2012-02-10 14:31:24 +00:00
Duncan Sands	205d9394e8	Revert commit 149912 (lattner) and add a testcase that shows the problem (which is that patterns no longer match for vectors of booleans, because you only get ConstantDataVector when the vector element type is i8, i16, etc, not when it is i1). Original commit message: Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 150246	2012-02-10 14:26:42 +00:00
Andrew Trick	c3cc8fa604	RegAlloc superpass: includes phi elimination, coalescing, and scheduling. Creates a configurable regalloc pipeline. Ensure specific llc options do what they say and nothing more: -reglloc=... has no effect other than selecting the allocator pass itself. This patch introduces a new umbrella flag, "-optimize-regalloc", to enable/disable the optimizing regalloc "superpass". This allows for example testing coalscing and scheduling under -O0 or vice-versa. When a CodeGen pass requires the MachineFunction to have a particular property, we need to explicitly define that property so it can be directly queried rather than naming a specific Pass. For example, to check for SSA, use MRI->isSSA, not addRequired<PHIElimination>. CodeGen transformation passes are never "required" as an analysis ProcessImplicitDefs does not require LiveVariables. We have a plan to massively simplify some of the early passes within the regalloc superpass. llvm-svn: 150226	2012-02-10 04:10:36 +00:00
Benjamin Kramer	1a2b069bb9	GlobalOpt: Be more aggressive about elminating side-effect free static dtors. GlobalOpt runs early in the pipeline (before inlining) and complex class hierarchies often introduce bitcasts or GEPs which weren't optimized away. Teach it to ignore side-effect free instructions instead of depending on other passes to remove them. llvm-svn: 150174	2012-02-09 14:26:06 +00:00
James Molloy	85be8f7f88	Teach the MC and disassembler about SoftFail, and hook it up to UNPREDICTABLE on ARM. Wire this to tBLX in order to provide test coverage. llvm-svn: 150169	2012-02-09 10:56:31 +00:00
NAKAMURA Takumi	81f7ad5b9b	test/CodeGen/X86/atom-lea-sp.ll: Add explicit -mtriple=i686-linux. llvm-svn: 150151	2012-02-09 05:12:58 +00:00
Evan Cheng	1be96ff50e	Commit Andy Zhang's test for the lea patch. llvm-svn: 150107	2012-02-08 22:33:17 +00:00
Kostya Serebryany	2de61e1628	[asan] unpoison the stack before every noreturn call. Fixes asan issue 37. llvm part llvm-svn: 150102	2012-02-08 21:36:17 +00:00
Elena Demikhovsky	87a6e08d3a	Fixed a bug in printing "cmp" pseudo ops. > This IR code > %res = call <8 x float> @llvm.x86.avx.cmp.ps.256(<8 x float> %a0, <8 x float> %a1, i8 14) > fails with assertion: > > llc: X86ATTInstPrinter.cpp:62: void llvm::X86ATTInstPrinter::printSSECC(const llvm::MCInst, unsigned int, llvm::raw_ostream&): Assertion `0 && "Invalid ssecc argument!"' failed. > 0 llc 0x0000000001355803 > 1 llc 0x0000000001355dc9 > 2 libpthread.so.0 0x00007f79a30575d0 > 3 libc.so.6 0x00007f79a23a1945 gsignal + 53 > 4 libc.so.6 0x00007f79a23a2f21 abort + 385 > 5 libc.so.6 0x00007f79a239a810 __assert_fail + 240 > 6 llc 0x00000000011858d5 llvm::X86ATTInstPrinter::printSSECC(llvm::MCInst const, unsigned int, llvm::raw_ostream&) + 119 I added the full testing for all possible pseudo-ops of cmp. I extended X86AsmPrinter.cpp and X86IntelInstPrinter.cpp. You'l also see lines alignments (unrelated to this fix) in X86IselLowering.cpp from my previous check-in. llvm-svn: 150068	2012-02-08 08:37:26 +00:00
Chad Rosier	b70d1dfae6	[fast-isel] Add support for SUBs with non-legal types. llvm-svn: 150047	2012-02-08 02:45:44 +00:00
Chad Rosier	66b35d7220	Add comment to test case. llvm-svn: 150046	2012-02-08 02:30:12 +00:00
Chad Rosier	1ef78d6989	[fast-isel] Add support for ORs with non-legal types. llvm-svn: 150045	2012-02-08 02:29:21 +00:00
Chad Rosier	26610906f0	[fast-isel] Add support for indirect branches. llvm-svn: 150014	2012-02-07 23:56:08 +00:00
Craig Topper	a8a69356e1	Add instruction selection for 256-bit VPSHUFD and 128-bit VPERMILPS/VPERMILPD. llvm-svn: 149968	2012-02-07 06:28:42 +00:00
Chad Rosier	945ab43c4f	[fast-isel] Add support for ADDs with non-legal types. llvm-svn: 149934	2012-02-06 23:50:07 +00:00
Kostya Serebryany	f4be131943	The patch resolves the conflict between AddressSanitizer and load widening (GVN). The problem initially reported by Mozilla folks (http://code.google.com/p/address-sanitizer/issues/detail?id=20), but it also prevents us from enabling LLVM bootstrap with AddressSanitizer. llvm-svn: 149925	2012-02-06 22:48:56 +00:00
Bill Wendling	2fbed70727	The 'unwind' instruction is deprecated and will be removed, making this test obsolete. llvm-svn: 149880	2012-02-06 18:18:47 +00:00
Nick Lewycky	bad48a142a	Teach GlobalOpt to handle atomic accesses to globals. * Most of the transforms come through intact by having each transformed load or store copy the ordering and synchronization scope of the original. * The transform that turns a global only accessed in main() into an alloca (since main is non-recursive) with a store of the initial value uses an unordered store, since it's guaranteed to be the first thing to happen in main. (Threads may have started before main (!) but they can't have the address of a function local before the point in the entry block we insert our code.) * The heap-SRoA transforms are disabled in the face of atomic operations. This can probably be improved; it seems odd to have atomic accesses to an alloca that doesn't have its address taken. AnalyzeGlobal keeps track of the strongest ordering found in any use of the global. This is more information than we need right now, but it's cheap to compute and likely to be useful. llvm-svn: 149847	2012-02-05 19:56:38 +00:00
Duncan Sands	fb60d2db35	Testcase for commit 149833 (use of an uninitialized variable noticed by GCC). llvm-svn: 149840	2012-02-05 19:27:57 +00:00
Duncan Sands	eb56d51cfb	Reduce the number of dom queries made by GVN's conditional propagation logic by half: isOnlyReachableViaThisEdge was trying to be clever and handle the case of a branch to a basic block which is contained in a loop. This costs a domtree lookup and is completely useless due to GVN's position in the pass pipeline: all loops have preheaders at this point, which means it is enough for isOnlyReachableViaThisEdge to check that Dst has only one predecessor. (I checked this theoretical argument by running over the entire nightly testsuite, and indeed it is so!). llvm-svn: 149838	2012-02-05 18:25:50 +00:00
Benjamin Kramer	8e54f21216	Testing vector code without sse doesn't make much sense. Should bring arm and ppc testers back to life (they default to -mcpu=generic) llvm-svn: 149821	2012-02-05 11:19:39 +00:00
Chris Lattner	4881c9ecb0	Add a test for the miscompilation my recent ConstantDataArray patches introduced, to make sure we don't regress on it in the future. llvm-svn: 149803	2012-02-05 02:37:36 +00:00
Craig Topper	c289726019	Remove most of the intrinsics for XOP VPCMOV instruction. They all aliased to the same instruction with different types. This would be better accomplished with casts in the not yet created xopintrin.h header file. llvm-svn: 149795	2012-02-05 00:55:56 +00:00
Hal Finkel	34ae699943	Boost the effective chain depth of loads and stores. By default, boost the chain depth contribution of loads and stores. This will allow a load/store pair to vectorize even when it would not otherwise be long enough to satisfy the chain depth requirement. llvm-svn: 149761	2012-02-04 04:14:04 +00:00
Chad Rosier	ec3053c33c	[fast-isel] HandlePHINodesInSuccessorBlocks() can promite i8 and i16 types too. llvm-svn: 149730	2012-02-04 00:39:19 +00:00
Chad Rosier	cff3c98417	[fast-isel] Add support for FPToUI. Also add test cases for FPToSI. llvm-svn: 149706	2012-02-03 20:27:51 +00:00
Chad Rosier	40b3e74387	[fast-isel] Add support for selecting UIToFP. llvm-svn: 149704	2012-02-03 19:42:52 +00:00
Nadav Rotem	5c5681cf27	The type-legalizer often scalarizes code. One of the common patterns is extract-and-truncate. In this patch we optimize this pattern and convert the sequence into extract op of a narrow type. This allows the BUILD_VECTOR dag optimizations to construct efficient shuffle operations in many cases. llvm-svn: 149692	2012-02-03 13:18:25 +00:00
Akira Hatanaka	874523adc5	Add a new MachineJumpTableInfo entry type, EK_GPRel64BlockAddress, which is needed to emit a 64-bit gp-relative relocation entry. Make changes necessary for emitting jump tables which have entries with directive .gpdword. This patch does not implement the parts needed for direct object emission or JIT. llvm-svn: 149668	2012-02-03 04:33:00 +00:00
Dan Gohman	d18622bd02	Fix SSAUpdaterImpl's RecordMatchingPHI to record exactly the PHI nodes which were matched, rather than climbing up the original PHI node's operands to rediscover PHI nodes for recording, since the PHI nodes found that are not necessarily part of the matched set. This fixes rdar://10589171. llvm-svn: 149654	2012-02-03 01:07:01 +00:00
Jim Grosbach	bc7e9b3c96	Revert "Disable InstCombine unsafe folding bitcasts of calls w/ varargs." This reverts commit d0e277d272d517ca1cda368267d199f0da7cad95. llvm-svn: 149647	2012-02-03 00:00:50 +00:00
Matt Beaumont-Gay	8b5dfe05f5	Unix line endings llvm-svn: 149615	2012-02-02 19:00:49 +00:00
NAKAMURA Takumi	a7f8fe6300	Move test/CodeGen/Generic/2012-02-01-CoalescerBug.ll to CodeGen/ARM, for now. It requires TARGETS=arm. I cannot reproduce a fixed issue with other targets. llvm-svn: 149604	2012-02-02 11:44:58 +00:00
Elena Demikhovsky	7ca11b6e3f	Optimization for SIGN_EXTEND operation on AVX. Special handling was added for v4i32 -> v4i64 and v8i16 -> v8i32 extensions. llvm-svn: 149600	2012-02-02 09:10:43 +00:00
Lang Hames	004f627ed6	Set EFLAGS correctly in EmitLoweredSelect on X86. llvm-svn: 149597	2012-02-02 07:48:37 +00:00
Lang Hames	2efb52b518	PR11868. The previous loop in LiveIntervals::join would sometimes fall over if more than two adjacent ranges needed to be merged. The new version should be able to handle an arbitrary sequence of adjancent ranges. llvm-svn: 149588	2012-02-02 05:37:34 +00:00
Andrew Trick	d09b64fc25	Instruction scheduling itinerary for Intel Atom. Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT. Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches. Adds a test to verify that the scheduler is working. Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP. Patch by Preston Gurd! llvm-svn: 149558	2012-02-01 23:20:51 +00:00
Mon P Wang	7313ffe333	Avoid creating an extract element to an illegal type after LegalizeTypes has run. llvm-svn: 149548	2012-02-01 22:15:20 +00:00
Andrew Trick	b4963dd8da	VLIW specific scheduler framework that utilizes deterministic finite automaton (DFA). This new scheduler plugs into the existing selection DAG scheduling framework. It is a top-down critical path scheduler that tracks register pressure and uses a DFA for pipeline modeling. Patch by Sergei Larin! llvm-svn: 149547	2012-02-01 22:13:57 +00:00
NAKAMURA Takumi	0bb21fdfce	test/CodeGen/X86/avx-minmax.ll: Relax expressions for Win32 targets. YMM arguments are passed as indirect on Win32 x64. llvm-svn: 149505	2012-02-01 14:35:29 +00:00
Elena Demikhovsky	455db87d41	Passing AVX 256-bit structures in Win64 was wrong. Fixed Win64 calling conventions. llvm-svn: 149494	2012-02-01 10:46:14 +00:00
Elena Demikhovsky	da37eb48d8	Optimization for "truncate" operation on AVX. Truncating v4i64 -> v4i32 and v8i32 -> v8i16 may be done with set of shuffles. llvm-svn: 149485	2012-02-01 07:56:44 +00:00
Hal Finkel	8cf5de5774	Add a basic-block autovectorization pass. This is the initial checkin of the basic-block autovectorization pass along with some supporting vectorization infrastructure. Special thanks to everyone who helped review this code over the last several months (especially Tobias Grosser). llvm-svn: 149468	2012-02-01 03:51:43 +00:00
Jim Grosbach	6186319c3f	Disable InstCombine unsafe folding bitcasts of calls w/ varargs. Changing arguments from being passed as fixed to varargs is unsafe, as the ABI may require they be handled differently (stack vs. register, for example). Remove two tests which rely on the bitcast being folded into the direct call, which is exactly the transformation that's unsafe. llvm-svn: 149457	2012-02-01 00:08:17 +00:00
Kevin Enderby	cb876a7560	Fixed a crash in llvm-mc for Mach-O when a symbol difference expression uses a symbol from an assignment. In this case the symbol did not have a fragment so MCObjectWriter::IsSymbolRefDifferenceFullyResolved() should not have been calling IsSymbolRefDifferenceFullyResolvedImpl() with a NULL fragment and should just have returned false in that case. llvm-svn: 149442	2012-01-31 23:02:57 +00:00
Craig Topper	2b764de6ab	Remove pcmpgt/pcmpeq intrinsics as clang is not using them. llvm-svn: 149367	2012-01-31 06:52:44 +00:00
Bill Wendling	7761976036	Remove all references to the old EH. There was always the current EH. -- Ministry of Truth llvm-svn: 149335	2012-01-31 02:09:07 +00:00
Bill Wendling	76beba7841	Update test to new EH model. llvm-svn: 149333	2012-01-31 02:05:13 +00:00
Bill Wendling	8402993dd4	Update test to new EH model. llvm-svn: 149332	2012-01-31 02:04:20 +00:00
Chandler Carruth	865317627e	Chris's constant data sequence refactoring actually enabled printing vectors of all one bits to be printed more cleverly in the AsmPrinter. Unfortunately, the byte value for all one bits is the same with -fsigned-char as the error return of '-1'. Force this to be the unsigned byte value when returning it to avoid this problem, and update the test case for the shiny new behavior. Yay for building LLVM and Clang with -funsigned-char. Chris, please review, and let me know if there is any reason to not desire this change. It seems good on the surface, and certainly intended based on the code written. llvm-svn: 149299	2012-01-30 23:47:44 +00:00
Devang Patel	be1817e3e0	Intel syntax. Adjust special code, used to recognize cmp<comparison code>{ss,sd,ps,pd}, for intel syntax. llvm-svn: 149291	2012-01-30 22:47:12 +00:00
Devang Patel	a5bfdedb9f	Intel syntax. Support .intel_syntax directive. llvm-svn: 149270	2012-01-30 20:02:42 +00:00
Craig Topper	9a8c6c1633	Fix pattern for memory form of PSHUFD for use with FP vectors to remove bitcast to an integer vector that normal code wouldn't have. Also remove bitcasts from code that turns splat vector loads into a shuffle as it was making the broken pattern necessary. llvm-svn: 149232	2012-01-30 07:50:31 +00:00
NAKAMURA Takumi	4776d14929	CMake: Promote the testing targets out of folders on IDE. llvm-svn: 149220	2012-01-30 03:15:47 +00:00
James Molloy	b586b7c9c7	Ensure .AliasedSymbol() is called on all uses of getSymbol(). Affects ARM and MIPS ELF backends. Fixes PR11877 llvm-svn: 149180	2012-01-28 15:58:32 +00:00
Rafael Espindola	c74f450f77	Small improvement to the recursion detection logic from the previous commit. llvm-svn: 149175	2012-01-28 06:22:14 +00:00
Rafael Espindola	82e15e4544	Handle recursive variable definitions directly. This gives us better error messages and allows us to fix PR11865. llvm-svn: 149174	2012-01-28 05:57:00 +00:00
Rafael Espindola	7bddde2b49	Add r149110 back with a fix for when the vector and the int have the same width. llvm-svn: 149151	2012-01-27 23:33:07 +00:00
Rafael Espindola	7800e62486	Revert r149110 and add a testcase that was crashing since that revision. Unfortunately I also had to disable constant-pool-sharing.ll the code it tests has been updated to use the IL logic. llvm-svn: 149148	2012-01-27 22:42:48 +00:00
Devang Patel	e4725ba181	Intel Syntax: Parse mem operand with seg reg. QWORD PTR FS:[320] llvm-svn: 149142	2012-01-27 19:48:28 +00:00
Matt Beaumont-Gay	f1e0eb546a	Unix line endings llvm-svn: 149115	2012-01-27 02:31:29 +00:00
Chris Lattner	929f66cdfa	enhance constant folding to be able to constant fold bitcast of ConstantVector's to integer type. llvm-svn: 149110	2012-01-27 01:44:03 +00:00
Lang Hames	5b641086b2	Rewrite instruction operands in AdjustCopiesBackFrom. Fixes PR11861. llvm-svn: 149097	2012-01-27 00:05:42 +00:00
Jakob Stoklund Olesen	c63a45ebe6	Handle call-clobbered ymm registers on Win64. The Win64 calling convention has xmm6-15 as callee-saved while still clobbering all ymm registers. Add a YMM_HI_6_15 pseudo-register that aliases the clobbered part of the ymm registers, and mark that as call-clobbered. This allows live xmm registers across calls. This hack wouldn't be necessary with RegisterMask operands representing the call clobbers, but they are not quite operational yet. llvm-svn: 149088	2012-01-26 22:59:28 +00:00

... 3 4 5 6 7 ...

15977 Commits