llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Matt Arsenault	e3debcae62	Use type form of getIntPtrType. This should be inconsequential and is work towards removing the default address space arguments. llvm-svn: 194347	2013-11-10 04:46:57 +00:00
Nadav Rotem	537b5180e9	SimplifyCFG has a heuristic for out-of-order processors that decides when it is worthwhile to merge branches. It tries to estimate if the operands of the instruction that we want to hoist are ready. This commit marks function arguments as 'ready' because they require no calculation. This boosts libquantum and a few other workloads from the testsuite. llvm-svn: 194346	2013-11-10 04:13:31 +00:00
Matt Arsenault	4fcb58f7fb	Teach MergeFunctions about address spaces llvm-svn: 194342	2013-11-10 01:44:37 +00:00
Hal Finkel	b485e0c406	Remove dead code from LoopUnswitch LoopUnswitch's code simplification routine has logic to convert conditional branches into unconditional branches, after unswitching makes the condition constant, and then remove any blocks that this renders dead. Unfortunately, this code is dead, currently broken, and furthermore, has never been alive (at least as far back as 2006). No functionality change intended. llvm-svn: 194277	2013-11-08 19:58:21 +00:00
Michael Gottesman	04ff2468f0	[objc-arc] Convert the one directional retain/release relation assert to a conditional check + fail. Due to the previously added overflow checks, we can have a retain/release relation that is one directional. This occurs specifically when we run into an additive overflow causing us to drop state in only one direction. If that occurs, we should bail and not optimize that retain/release instead of asserting. Apologies for the size of the testcase. It is necessary to cause the additive cfg overflow to trigger. rdar://15377890 llvm-svn: 194083	2013-11-05 16:02:40 +00:00
Hal Finkel	26ff1b7453	Add a runtime unrolling parameter to the LoopUnroll pass constructor As with the other loop unrolling parameters (the unrolling threshold, partial unrolling, etc.) runtime unrolling can now also be controlled via the constructor. This will be necessary for moving non-trivial unrolling late in the pass manager (after loop vectorization). No functionality change intended. llvm-svn: 194027	2013-11-05 00:08:03 +00:00
Shuxin Yang	08d2dc8405	Remove dead code llvm-svn: 194017	2013-11-04 21:44:01 +00:00
Benjamin Kramer	9eaaead296	SLPVectorizer: Use properlyDominates to satisfy the irreflexivity of a strict weak ordering. STL debug mode checks this. llvm-svn: 194015	2013-11-04 21:34:55 +00:00
Matt Arsenault	1f521e921d	Scalarize select vector arguments when extracted. When the elements are extracted from a select on vectors or a vector select, do the select on the extracted scalars from the input if there is only one use. llvm-svn: 194013	2013-11-04 20:36:06 +00:00
Benjamin Kramer	15ebc47438	SLPVectorizer: Add a missing pair of parens. No functionality change. llvm-svn: 193958	2013-11-03 12:54:32 +00:00
Benjamin Kramer	f45bcf5480	SLPVectorizer: When CSEing generated gathers only scan blocks containing them. Instead of doing a RPO traversal of the whole function remember the blocks containing gathers (typically <= 2) and scan them in dominator-first order. The actual CSE is still quadratic, but I'm not confident that adding a scoped hash table here is worth it as we're only looking at the generated instructions and not arbitrary code. llvm-svn: 193956	2013-11-03 12:27:52 +00:00
David Majnemer	2bbbbaaeee	Revert "Inliner: Handle readonly attribute per argument when adding memcpy" This reverts commit r193356, it caused PR17781. A reduced test case covering this regression has been added to the test suite. llvm-svn: 193955	2013-11-03 12:22:13 +00:00
David Majnemer	fbb2338856	Spell "Actual" correctly llvm-svn: 193954	2013-11-03 11:09:39 +00:00
Bob Wilson	eb7943100b	Convert calls to __sinpi and __cospi into __sincospi_stret This adds a SimplifyLibCalls case which converts the special __sinpi and __cospi (float & double variants) into a __sincospi_stret where appropriate to remove duplicated work. Patch by Tim Northover llvm-svn: 193943	2013-11-03 06:48:38 +00:00
Benjamin Kramer	ae919396b6	SLPVectorizer: Remove duplicated function. llvm-svn: 193927	2013-11-02 14:46:27 +00:00
Benjamin Kramer	abc7baa1dc	LoopVectorize: Remove quadratic behavior in the local CSE. Doing this with a hash map doesn't change behavior and avoids calling isIdenticalTo O(n^2) times. This should probably eventually move into a utility class shared with EarlyCSE and the limited CSE in the SLPVectorizer. llvm-svn: 193926	2013-11-02 13:39:00 +00:00
Arnold Schwaighofer	fed0c4f8e8	LoopVectorizer: Move cse code into its own function llvm-svn: 193895	2013-11-01 23:28:54 +00:00
Arnold Schwaighofer	fba1c74b67	LoopVectorizer: Perform redundancy elimination on induction variables When the loop vectorizer was part of the SCC inliner pass manager gvn would run after the loop vectorizer followed by instcombine. This way redundancy (multiple uses) were removed and instcombine could perform scalarization on the induction variables. Having moved the loop vectorizer to later we no longer run any form of redundancy elimination before we perform instcombine. This caused vectorized induction variables to survive that did not before. On a recent iMac this helps linpack back from 6000Mflops to 7000Mflops. This should also help lpbench and paq8p. I ran a Release (without Asserts) build over the test-suite and did not see any negative impact on compile time. radar://15339680 llvm-svn: 193891	2013-11-01 22:18:19 +00:00
Benjamin Kramer	3045156cee	LoopVectorize: Look for consecutive access in GEPs with trailing zero indices If we have a pointer to a single-element struct we can still build wide loads and stores to it (if there is no padding). llvm-svn: 193860	2013-11-01 14:09:50 +00:00
Arnold Schwaighofer	5d7be45165	LoopVectorizer: If dependency checks fail try runtime checks When a dependence check fails we can still try to vectorize loops with runtime array bounds checks. This helps linpack to vectorize a loop in dgefa. And we are back to 2x of the scalar performance on a corei7-avx. radar://15339680 llvm-svn: 193853	2013-11-01 03:05:07 +00:00
Arnold Schwaighofer	fe8e481ef6	LoopVectorizer: Clear all member data structures in RuntimeCheck.reset() Clear all data structures when resetting the RuntimeCheck data structure. No test case. This was exposed by an upcoming change. llvm-svn: 193852	2013-11-01 03:05:04 +00:00
Manman Ren	bba5655c39	Do not convert "call asm" to "invoke asm" in Inliner. Given that backend does not handle "invoke asm" correctly ("invoke asm" will be handled by SelectionDAGBuilder::visitInlineAsm, which does not have the right setup for LPadToCallSiteMap) and we already made the assumption that inline asm does not throw in InstCombiner::visitCallSite, we are going to make the same assumption in Inliner to make sure we don't convert "call asm" to "invoke asm". If it becomes necessary to add support for "invoke asm" later on, we will need to modify the backend as well as remove the assumptions that inline asm does not throw. Fix rdar://15317907 llvm-svn: 193808	2013-10-31 21:56:03 +00:00
Rafael Espindola	24353f2de2	Use LTO_SYMBOL_SCOPE_DEFAULT_CAN_BE_HIDDEN instead of the "dso list". There are two ways one could implement hiding of linkonce_odr symbols in LTO: * LLVM tells the linker which symbols can be hidden if not used from native files. * The linker tells LLVM which symbols are not used from other object files, but will be put in the dso symbol table if present. GOLD's API is the second option. It was implemented almost 1:1 in llvm by passing the list down to internalize. LLVM already had partial support for the first option. It is also very similar to how ld64 handles hiding these symbols when not doing LTO. This patch then * removes the APIs for the DSO list. * marks LTO_SYMBOL_SCOPE_DEFAULT_CAN_BE_HIDDEN all linkonce_odr unnamed_addr global values and other linkonce_odr whose address is not used. * makes the gold plugin responsible for handling the API mismatch. llvm-svn: 193800	2013-10-31 20:51:58 +00:00
Rafael Espindola	afc61d382c	Merge CallGraph and BasicCallGraph. llvm-svn: 193734	2013-10-31 03:03:55 +00:00
Matt Arsenault	7af7ffc005	Teach scalarrepl about address spaces llvm-svn: 193720	2013-10-30 22:54:58 +00:00
Matt Arsenault	f608b07c6f	Fix GVN creating bitcast between address spaces llvm-svn: 193710	2013-10-30 19:05:41 +00:00
Arnold Schwaighofer	fe80e563da	ARM cost model: Account for zero cost scalar SROA instructions By vectorizing a series of srl, or, ... instructions we have obfuscated the intention so much that the backend does not know how to fold this code away. radar://15336950 llvm-svn: 193573	2013-10-29 01:33:53 +00:00
Arnold Schwaighofer	6f22639253	SLPVectorizer: Use vector type for vectorized memory operations No test case, because with the current cost model we don't see a difference. An upcoming ARM memory cost model change will expose and test this bug. radar://15332579 llvm-svn: 193572	2013-10-29 01:33:50 +00:00
Shuxin Yang	eb29e658a0	Revert r193251 : Use address-taken to disambiguate global variable and indirect memops. llvm-svn: 193489	2013-10-27 03:08:44 +00:00
Wan Xiaofei	f3100f24fa	Quick look-up for block in loop. This patch implements quick look-up for block in loop by maintaining a hash set for blocks. It improves the efficiency of loop analysis considerably; the biggest improvement is about 5-6% (458.sjeng). Below are the llc compilation times for our benchmarks before and after the patch. Benchmark | llc - trunk | llc - patched: 401.bzip2 | 0.339081 100.00% | 0.329657 102.86%; 403.gcc | 19.853966 100.00% | 19.605466 101.27%; 429.mcf | 0.049823 100.00% | 0.048451 102.83%; 433.milc | 0.514898 100.00% | 0.510217 100.92%; 444.namd | 1.109328 100.00% | 1.103481 100.53%; 445.gobmk | 4.988028 100.00% | 4.929114 101.20%; 456.hmmer | 0.843871 100.00% | 0.825865 102.18%; 458.sjeng | 0.754238 100.00% | 0.714095 105.62%; 464.h264ref | 2.9668 100.00% | 2.90612 102.09%; 471.omnetpp | 4.556533 100.00% | 4.511886 100.99%; bitmnp01 | 0.038168 100.00% | 0.0357 106.91%; idctrn01 | 0.037745 100.00% | 0.037332 101.11%; libquake2 | 3.78689 100.00% | 3.76209 100.66%; libquake_ | 2.251525 100.00% | 2.234104 100.78%; linpack | 0.033159 100.00% | 0.032788 101.13%; matrix01 | 0.045319 100.00% | 0.043497 104.19%; nbench | 0.333161 100.00% | 0.329799 101.02%; tblook01 | 0.017863 100.00% | 0.017666 101.12%; ttsprk01 | 0.054337 100.00% | 0.053057 102.41%. Reviewer: Andrew Trick <atrick@apple.com>, Hal Finkel <hfinkel@anl.gov> Approver: Andrew Trick <atrick@apple.com> Test: Pass make check-all & llvm test-suite llvm-svn: 193460	2013-10-26 03:08:02 +00:00
Andrew Trick	741b7a0cfc	Fix SCEVExpander: don't try to expand quadratic recurrences outside a loop. Partial fix for PR17459: wrong code at -O3 on x86_64-linux-gnu (affecting trunk and 3.3) When SCEV expands a recurrence outside of a loop it attempts to scale by the stride of the recurrence. Chained recurrences don't work that way. We could compute binomial coefficients, but would have to guarantee that the chained AddRec's are in a perfectly reduced form. llvm-svn: 193438	2013-10-25 21:35:56 +00:00
Rafael Espindola	37e88054ad	Handle calls and invokes in GlobalStatus. This patch teaches GlobalStatus to analyze a call that uses the global value as a callee, not as an argument. With this change internalize can handle the common use of linkonce_odr functions. This reduces the number of linkonce_odr functions in a LTO build of clang (checked with the emit-llvm gold plugin option) from 1730 to 60. llvm-svn: 193436	2013-10-25 21:29:52 +00:00
Hal Finkel	d554c99b37	LoopVectorizer: Don't attempt to vectorize extractelement instructions The loop vectorizer does not currently understand how to vectorize extractelement instructions. The existing check, which excluded all vector-valued instructions, did not catch extractelement instructions because it checked only the return value. As a result, vectorization would proceed, producing illegal instructions like this: %58 = extractelement <2 x i32> %15, i32 0 %59 = extractelement i32 %58, i32 0 where the second extractelement is illegal because its first operand is not a vector. llvm-svn: 193434	2013-10-25 20:40:15 +00:00
Tom Stellard	3863b94253	Inliner: Handle readonly attribute per argument when adding memcpy Patch by: Vincent Lejeune llvm-svn: 193356	2013-10-24 16:38:33 +00:00
Renato Golin	ae79e04f36	Mark vector loops as already vectorized Make sure we mark all loops (scalar and vector) when vectorizing, so that we don't try to vectorize them anymore. Also, set unroll to 1, since this is what we check for on early exit. llvm-svn: 193349	2013-10-24 14:50:51 +00:00
Nuno Lopes	09c3fc8dac	fix PR17635: false positive with packed structures LLVM optimizers may widen accesses to packed structures that overflow the structure itself, but should be in bounds up to the alignment of the object llvm-svn: 193317	2013-10-24 09:17:24 +00:00
Juergen Ributzka	db5b9c734f	Fix a bug in LinearFunctionTestReplace that created invalid loop exit checks. Reviewed by Andy llvm-svn: 193303	2013-10-24 05:29:56 +00:00
Andrew Trick	780bd06488	Clarify comments in genLoopLimit. llvm-svn: 193292	2013-10-24 00:43:38 +00:00
Yuchen Wu	88a414eb08	Fixed comment typo in GCOVProfiling.cpp llvm-svn: 193268	2013-10-23 20:35:00 +00:00
Shuxin Yang	45a453cafe	Use address-taken to disambiguate global variable and indirect memops. Major steps include: 1). introduces a not-addr-taken bit-field in GlobalVariable 2). GlobalOpt pass sets "not-address-taken" if it proves a global variable doesn't have its address taken. 3). AA uses this info for disambiguation. llvm-svn: 193251	2013-10-23 17:28:19 +00:00
Eric Christopher	4762bba338	Fix spelling, grammar, and match naming convention for test files. llvm-svn: 193130	2013-10-21 23:14:06 +00:00
Tom Stellard	6f3b22d6ce	SimplifyCFG: Don't duplicate calls to functions marked noduplicate v2 v2: - Use CI->cannotDuplicate() llvm-svn: 193115	2013-10-21 20:07:30 +00:00
Matt Arsenault	fcca6dd732	Use more type helper functions llvm-svn: 193109	2013-10-21 19:43:56 +00:00
Matt Arsenault	8faf1f4444	Teach SimplifyCFG about address spaces llvm-svn: 193104	2013-10-21 18:55:08 +00:00
Rafael Espindola	34304975c5	Optimize more linkonce_odr values during LTO. When a linkonce_odr value that is on the dso list is not unnamed_addr we can still look to see if anything is actually using its address. If not, it is safe to hide it. This patch implements that by moving GlobalStatus to Transforms/Utils and using it in Internalize. llvm-svn: 193090	2013-10-21 17:14:55 +00:00
Michael Gottesman	80054d1ccd	Fix the predecessor removal logic in r193045. Additionally some small comment/stylistic fixes are included as well. llvm-svn: 193068	2013-10-21 05:20:11 +00:00
Bill Wendling	ab46af5515	Don't eliminate a partially redundant load if it's in a landing pad. A landing pad can be jumped to only by the unwind edge of an invoke instruction. If we eliminate a partially redundant load in a landing pad, it will create a basic block that violates this constraint. It then leads to other problems down the line if it tries to merge that basic block with the landing pad. Avoid this by not eliminating the load in a landing pad. PR17621 llvm-svn: 193064	2013-10-21 04:09:17 +00:00
Michael Gottesman	f056facc24	Teach simplify-cfg how to correctly create covered lookup tables for switches on iN with N >= 3. One optimization simplify-cfg performs is the converting of switches to lookup tables if the switch has > 4 cases. This is done by: 1. Finding the max/min case value and calculating the switch case range. 2. Create a lookup table basic block. 3. Perform a check in the switch's BB to see if the input value is in the switch's case range. If the input value satisfies said predicate branch to the lookup table BB, otherwise branch to the switch's default destination BB using the default value as the result. The conditional check consists of subtracting the min case value of the table from any input iN value and then ensuring that said value is unsigned less than the size of the lookup table represented as an iN value. If the lookup table is a covered lookup table, the size of the table will be 2^N, which is 0 as an iN value. Thus the comparison will be an `icmp ult` of an iN value against 0, which is always false, yielding the incorrect result. This patch fixes this problem by recognizing if we have a covered lookup table and, if we do, unconditionally jumping to the lookup table BB, since the covering property of the lookup table implies that every input value can be handled by said BB. rdar://15268442 llvm-svn: 193045	2013-10-20 07:04:37 +00:00
Bill Wendling	fda67cdf91	Perform an intelligent splice of the predecessor with the single successor. If the predecessor's being spliced into a landing pad, then we need the PHIs to come first and the rest of the predecessor's code to come after the landing pad instruction. llvm-svn: 193035	2013-10-19 11:27:12 +00:00
Nadav Rotem	fd357159bc	Mark some command line flags as hidden llvm-svn: 193013	2013-10-18 23:38:13 +00:00
Rafael Espindola	e040b98ded	Rename fields of GlobalStatus to match the coding style. llvm-svn: 192910	2013-10-17 18:18:52 +00:00
Rafael Espindola	4e9662d96e	rename SafeToDestroyConstant to isSafeToDestroyConstant and clang-format. llvm-svn: 192907	2013-10-17 18:06:32 +00:00
Rafael Espindola	04822e2dd6	Simplify the interface of AnalyzeGlobal a bit and rename to analyzeGlobal. No functionality change. llvm-svn: 192906	2013-10-17 18:00:25 +00:00
Evgeniy Stepanov	bf4a5e9d07	[msan] Use zero-extension in shadow cast by default. Switch to sign-extension in r192575 caused 7% perf loss on 482.sphinx3. llvm-svn: 192882	2013-10-17 10:53:50 +00:00
Dmitry Vyukov	012cdf1364	tsan: implement no_sanitize_thread attribute If a function has no_sanitize_thread attribute, do not instrument memory accesses in it. llvm-svn: 192871	2013-10-17 07:20:06 +00:00
Arnold Schwaighofer	789187ee86	SLPVectorizer: Don't vectorize volatile memory operations radar://15231682 Reapply r192799, http://lab.llvm.org:8011/builders/lldb-x86_64-debian-clang/builds/8226 showed that the bot is still broken even with this out. llvm-svn: 192820	2013-10-16 17:52:40 +00:00
Arnold Schwaighofer	7097263371	Revert "SLPVectorizer: Don't vectorize volatile memory operations" This speculatively reverts commit 192799. It might have broken a linux buildbot. llvm-svn: 192816	2013-10-16 17:19:40 +00:00
Arnold Schwaighofer	eebda9d6cf	SLPVectorizer: Don't vectorize volatile memory operations radar://15231682 llvm-svn: 192799	2013-10-16 16:09:00 +00:00
Kostya Serebryany	2ae1ed1c8f	[asan] Optimize accesses to global arrays with constant index Summary: Given a global array G[N], which is declared in this CU and has static initializer avoid instrumenting accesses like G[i], where 'i' is a constant and 0<=i<N. Also add a bit of stats. This eliminates ~1% of instrumentations on SPEC2006 and also partially helps when asan is being run together with coverage. Reviewers: samsonov Reviewed By: samsonov CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1947 llvm-svn: 192794	2013-10-16 14:06:14 +00:00
Benjamin Kramer	a00487e169	LoopVectorize: Properly reflect PODness in comments. llvm-svn: 192717	2013-10-15 16:19:54 +00:00
Craig Topper	037594e792	Remove x86_sse42_crc32_64_8 intrinsic. It has no functional difference from x86_sse42_crc32_32_8 and was not mapped to a clang builtin. I'm not even sure why this form of the instruction is even called out explicitly in the docs. Also add AutoUpgrade support to convert it into the other intrinsic with appropriate trunc and zext. llvm-svn: 192672	2013-10-15 05:20:47 +00:00
Rafael Espindola	d9b08f1296	Remove lib/Transforms/Instrumentation/ProfilingUtils.* They were leftover from the old profiling support. Patch by Alastair Murray. llvm-svn: 192605	2013-10-14 16:46:46 +00:00
Chris Lattner	18bf2f7f32	Basic blocks typically have few predecessors. Use a SmallDenseMap to avoid a heap allocation when this is the case. llvm-svn: 192602	2013-10-14 16:05:55 +00:00
Evgeniy Stepanov	bd5e5ef8a1	[msan] Instrument x86._cvt intrinsics. Currently MSan checks that arguments of cvt intrinsics are fully initialized. That's too much to ask: some of them only operate on lower half, or even quarter, of the input register. llvm-svn: 192599	2013-10-14 15:16:25 +00:00
Evgeniy Stepanov	57dafd2630	[msan] Fix handling of scalar select of vectors. llvm-svn: 192575	2013-10-14 09:52:09 +00:00
Arnold Schwaighofer	38ec37faba	SLPVectorizer: Sort PHINodes based on their opcode Before this patch we relied on the order of phi nodes when we looked for phi nodes of the same type. This could prevent vectorization of cases where there was a phi node of a second type in between phi nodes of some type. This is important for vectorization of an internal graphics kernel. On the test suite + external on x86_64 (and on a run on armv7s) it showed no impact on either performance or compile time. radar://15024459 llvm-svn: 192537	2013-10-12 18:56:27 +00:00
Tobias Grosser	bc154d94d0	LoopVectorize: Add missing INITIALIZE_PASS_DEPENDENCY macros Contributed-by: Peter Zotov <whitequark@whitequark.org> llvm-svn: 192536	2013-10-12 18:29:15 +00:00
Renato Golin	ec7fe56cfa	Better info when debugging vectorizer llvm-svn: 192460	2013-10-11 16:14:39 +00:00
Shuxin Yang	708f0cc2e4	Fix a bug in Dead Argument Elimination. If a function seen at compile time is not necessarily the one linked to the binary being built, it is illegal to change the actual arguments passed to it. e.g. -------------------------- void foo(int lol) { // foo() has linkage satisfying isWeakForLinker() // "lol" is not used at all. } void bar(int lol2) { // xform to foo(undef) is illegal, as compiler does not know which // instance of foo() will be linked to the binary being built. foo(lol2); } ----------------------------- Such functions can be captured by isWeakForLinker(). NOTE that mayBeOverridden() is insufficient for this purpose as it doesn't include linkage types like AvailableExternallyLinkage and LinkOnceODRLinkage. Take link_odr* as an example, it indicates a set of EQUIVALENT globals that can be merged at link-time. However, the semantic of EQUIVALENT-functions includes parameters. Changing parameters breaks the assumption. Thanks to John McCall for help, especially for the explanation of subtle difference between linkage types. rdar://11546243 llvm-svn: 192302	2013-10-09 17:21:44 +00:00
Arnold Schwaighofer	bfe48b104a	LoopVectorize: External uses must use the last value in a reduction cycle Otherwise, we don't perform operations that would have been performed on the scalar version. Fixes PR17498. llvm-svn: 192133	2013-10-07 21:05:43 +00:00
Alexey Samsonov	a04533615d	Revert r191834 until we measure the effect of this on benchmarks and maybe find a better way to fix it llvm-svn: 192121	2013-10-07 19:03:24 +00:00
Hal Finkel	83df3058ed	UpdatePHINodes in BasicBlockUtils should not crash on duplicate predecessors UpdatePHINodes has an optimization to reuse an existing PHI node, where it first deletes all of its entries and then replaces them. Unfortunately, in the case where we had duplicate predecessors (which are allowed so long as the associated PHI entries have the same value), the loop removing the existing PHI entries from the to-be-reused PHI would assert (if that PHI was not the one which had the duplicates). llvm-svn: 192001	2013-10-04 23:41:05 +00:00
Arnold Schwaighofer	6fab174840	SLPVectorizer: Sort inputs to commutative binary operations Sort the operands of the other entries in the current vectorization root according to the first entry's operands opcodes. %conv0 = uitofp ... %load0 = load float ... = fmul %conv0, %load0 = fmul %load0, %conv1 = fmul %load0, %conv2 Make sure that we recursively vectorize <%conv0, %conv1, %conv2> and <%load0, %load0, %load0>. This makes it more likely to obtain vectorizable trees. We have to be careful when we sort that we don't destroy 'good' existing ordering implied by source order. radar://15080067 llvm-svn: 191977	2013-10-04 20:39:16 +00:00
Owen Anderson	5e2eba8c18	Pull fptrunc's upwards through selects when one of the select's selectands was a constant. This has a number of benefits, including producing small immediates (easier to materialize, smaller constant pools) as well as being more likely to allow the fptrunc to fuse with a preceding instruction (truncating selects are unusual). llvm-svn: 191929	2013-10-03 21:08:05 +00:00
Rafael Espindola	a88c415234	Optimize linkonce_odr unnamed_addr functions during LTO. Generalize the API so we can distinguish symbols that are needed just for a DSO symbol table from those that are used from some native .o. The symbols that are only wanted for the dso symbol table can be dropped if llvm can prove every other dso has a copy (linkonce_odr) and the address is not important (unnamed_addr). llvm-svn: 191922	2013-10-03 18:29:09 +00:00
Matt Arsenault	c8e9a77ae3	Make gep i8* X, -(ptrtoint Y) transform work with address spaces llvm-svn: 191920	2013-10-03 18:15:57 +00:00
Matt Arsenault	2c180f66a9	Don't use runtime bounds check between address spaces. Don't vectorize with a runtime check if it requires a comparison between pointers with different address spaces. The values can't be assumed to be directly comparable. Previously it would create an illegal bitcast. llvm-svn: 191862	2013-10-02 22:38:17 +00:00
Yi Jiang	09195de8f3	Apply slp vectorization on fully-vectorizable tree of height 2 llvm-svn: 191852	2013-10-02 20:20:39 +00:00
Matt Arsenault	15633246b6	Fix debug printing spacing. Fix missing newlines, missing and extra spaces in printed messages. llvm-svn: 191851	2013-10-02 20:04:29 +00:00
Matt Arsenault	26cc78f548	Fix comment grammar and capitalization. llvm-svn: 191850	2013-10-02 20:04:26 +00:00
Benjamin Kramer	4458ae839d	SLPVectorizer: Make store chain finding more aggressive with GetUnderlyingObject. This recursively strips all GEPs like the existing code. It also handles bitcasts and other operations that do not change the pointer value. llvm-svn: 191847	2013-10-02 19:06:06 +00:00
Tom Stellard	b9cbfd3959	StructurizeCFG: Add dependency on LowerSwitch pass Switch instructions were crashing the StructurizeCFG pass, and it's probably easier anyway if we don't need to handle them in this pass. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 191841	2013-10-02 17:04:59 +00:00
Chandler Carruth	ee12d58370	Remove the very substantial, largely unmaintained legacy PGO infrastructure. This was essentially work toward PGO based on a design that had several flaws, partially dating from a time when LLVM had a different architecture, and with an effort to modernize it abandoned without being completed. Since then, it has bitrotted for several years further. The result is nearly unusable, and isn't helping any of the modern PGO efforts. Instead, it is getting in the way, adding confusion about PGO in LLVM and distracting everyone with maintenance on essentially dead code. Removing it paves the way for modern efforts around PGO. Among other effects, this removes the last of the runtime libraries from LLVM. Those are being developed in the separate 'compiler-rt' project now, with somewhat different licensing specifically more appropriate for runtimes. llvm-svn: 191835	2013-10-02 15:42:23 +00:00
Alexey Samsonov	3291ccfed7	Remove "localize global" optimization Summary: As discussed in http://llvm-reviews.chandlerc.com/D1754, this optimization isn't really valid for C, and fires too rarely anyway. Reviewers: rafael, nicholas Reviewed By: nicholas CC: rnk, llvm-commits, nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D1769 llvm-svn: 191834	2013-10-02 15:31:34 +00:00
Matt Arsenault	e06d79aa83	Don't merge tiny functions. It's silly to merge functions like these: define void @foo(i32 %x) { ret void } define void @bar(i32 %x) { ret void } to get define void @bar(i32) { tail call void @foo(i32 %0) ret void } llvm-svn: 191786	2013-10-01 18:05:30 +00:00
Rafael Espindola	a279462828	Remove several unused variables. Patch by Alp Toker. llvm-svn: 191757	2013-10-01 13:32:03 +00:00
Matt Arsenault	a3e171a6c8	Fix code duplication llvm-svn: 191716	2013-10-01 00:01:14 +00:00
Matt Arsenault	bc0b70cd8d	Use right address space size in InstCombineCompares The test's output doesn't change, but this ensures this is actually hit with a different address space. llvm-svn: 191701	2013-09-30 21:11:01 +00:00
Matt Arsenault	27f5203bba	Constant fold ptrtoint + compare with address spaces llvm-svn: 191699	2013-09-30 21:06:18 +00:00
Benjamin Kramer	0b73a10aeb	BoundsChecking: Fix refacto. llvm-svn: 191676	2013-09-30 15:52:50 +00:00
Benjamin Kramer	7b5eaaacfd	Convert manual insert point restores to the new RAII object. llvm-svn: 191675	2013-09-30 15:40:17 +00:00
Benjamin Kramer	ae9249f56f	InstCombine: Replace manual fast math flag copying with the new IRBuilder RAII helper. Defines away the issue where cast<Instruction> would fail because constant folding happened. Also slightly cleaner. llvm-svn: 191674	2013-09-30 15:39:59 +00:00
Benjamin Kramer	33abdcddb3	IRBuilder: Add RAII objects to reset insertion points or fast math flags. Inspired by the object from the SLPVectorizer. This found a minor bug in the debug loc restoration in the vectorizer where the location of a following instruction was attached instead of the location from the original instruction. llvm-svn: 191673	2013-09-30 15:39:48 +00:00
Joey Gouly	b2a84bfcc1	Fix a bug in InstCombine where it attempted to cast a Value* to an Instruction* when it was actually a Constant*. There are quite a few other casts to Instruction that might have the same problem, but this is the only one I have a test case for. llvm-svn: 191668	2013-09-30 14:18:35 +00:00
Robert Wilhelm	198f21deb3	Even more spelling fixes for "instruction". llvm-svn: 191611	2013-09-28 13:42:22 +00:00
Robert Wilhelm	6b36431ffa	Fix spelling intruction -> instruction. llvm-svn: 191610	2013-09-28 11:46:15 +00:00
Matt Arsenault	f672e722f2	Use right pointer type in DebugIR llvm-svn: 191576	2013-09-27 22:26:25 +00:00
Matt Arsenault	c1629ee7a8	Use type helper functions llvm-svn: 191574	2013-09-27 22:18:51 +00:00
Matt Arsenault	0dc0668061	Fix SLPVectorizer using wrong address space for load/store llvm-svn: 191564	2013-09-27 21:24:57 +00:00
Justin Bogner	d2e08f6deb	InstCombine: Only foldSelectICmpAndOr for integer types Currently foldSelectICmpAndOr asserts if the "or" involves a vector containing several of the same power of two. We can easily avoid this by only performing the fold on integer types, like foldSelectICmpAnd does. Fixes <rdar://problem/15012516> llvm-svn: 191552	2013-09-27 20:35:39 +00:00
Justin Bogner	db7f32982e	Transforms: Use getFirstNonPHI to set the insertion point for PHIs We were previously using getFirstInsertionPt to insert PHI instructions when vectorizing, but getFirstInsertionPt also skips past landingpads, causing this to generate invalid IR. We can avoid this issue by using getFirstNonPHI instead. llvm-svn: 191526	2013-09-27 15:30:25 +00:00
Puyan Lotfi	c9af951375	First check in. Modified a comment. llvm-svn: 191491	2013-09-27 07:36:10 +00:00
Arnold Schwaighofer	9830391a8f	SLPVectorize: Put horizontal reductions feeding a store under separate flag Put them under a separate flag for experimentation. They are more likely to interfere with loop vectorization which happens later in the pass pipeline. llvm-svn: 191371	2013-09-25 14:02:32 +00:00
Evgeniy Stepanov	8387908aff	[msan] Fix -Wreturn-type warnings in non-self-hosted build. llvm-svn: 191361	2013-09-25 08:56:00 +00:00
Yi Jiang	6ba7a7b02c	set the cost of tiny trees to INT_MAX in SLP vectorizer to disable vectorization on them llvm-svn: 191314	2013-09-24 17:26:43 +00:00
Benjamin Kramer	109a525643	Push analysis passes to InstSimplify when they're around anyways. llvm-svn: 191309	2013-09-24 16:37:40 +00:00
Evgeniy Stepanov	e1fcc1bf1d	[msan] Handling of atomic load/store, atomic rmw, cmpxchg. llvm-svn: 191287	2013-09-24 11:20:27 +00:00
Arnold Schwaighofer	b1cea2cfcc	Revert "LoopVectorizer: Only allow vectorization of intrinsics." Revert 191122 - with extra checks we are allowed to vectorize math library function calls. Standard library identifiers are reserved names so functions with external linkage must not override them. However, functions with internal linkage can. Therefore, we can vectorize calls to math library functions with a check for external linkage and matching signature. This matches what we do during SelectionDAG building. llvm-svn: 191206	2013-09-23 14:54:39 +00:00
Benjamin Kramer	1308256cf8	Provide basic type safety for array_pod_sort comparators. This makes using array_pod_sort significantly safer. The implementation relies on function pointer casting but that should be safe as we're dealing with void* here. llvm-svn: 191175	2013-09-22 14:09:50 +00:00
Benjamin Kramer	7dae47d205	Drop spurious handle in comment. llvm-svn: 191172	2013-09-22 11:24:58 +00:00
Benjamin Kramer	190baf6fef	SROA: Handle casts involving vectors of pointers and integer scalars. SROA wants to convert any types of equivalent widths but it's not possible to convert vectors of pointers to an integer scalar with a single cast. As a workaround we add a bitcast to the corresponding int ptr type first. This type of cast used to be an edge case but has become common with SLP vectorization. Fixes PR17271. llvm-svn: 191143	2013-09-21 20:36:04 +00:00
Arnold Schwaighofer	4f8b0cf48b	SLPVectorizer: Fix multiline comment warning llvm-svn: 191135	2013-09-21 05:37:30 +00:00
Arnold Schwaighofer	c1f8473eb6	Reapply "SLPVectorizer: Handle more horizontal reductions (disabled)" Reapply r191108 with a fix for a memory corruption error I introduced. Of course, we can't reference the scalars that we replace by vectorizing and then call their eraseFromParent method. I only 'needed' the scalars to get the DebugLoc. Just store the DebugLoc before actually vectorizing instead. As a nice side effect, this also simplifies the interface between BoUpSLP and the HorizontalReduction class to returning a value pointer (the vectorized tree root). radar://14607682 llvm-svn: 191123	2013-09-21 01:06:00 +00:00
Nadav Rotem	a10aa3ffa1	LoopVectorizer: Only allow vectorization of intrinsics. We can't know for sure that the functions 'abs' or 'round' are the functions from libm. rdar://15012650 llvm-svn: 191122	2013-09-21 00:27:05 +00:00
Arnold Schwaighofer	c81e140238	Revert "SLPVectorizer: Handle more horizontal reductions (disabled)" This reverts commit r191108. The horizontal.ll test case fails under libgmalloc. Thanks Shuxin for pointing this out to me. llvm-svn: 191121	2013-09-21 00:06:20 +00:00
Shuxin Yang	640fbb44a5	Resurrect r191017 " GVN proceeds in the presence of dead code" plus a fix to PR17307 & 17308. The problem of r191017 is that when GVN fabricates a val-number for a dead instruction (in order to make following expr-PRE happy), it forgets to fabricate a leader-table entry for it as well. llvm-svn: 191118	2013-09-20 23:12:57 +00:00
Benjamin Kramer	89ff6bf9f0	InstCombine: Remove unused argument. No functionality change. llvm-svn: 191112	2013-09-20 22:12:42 +00:00
Arnold Schwaighofer	db78859615	SLPVectorizer: Handle more horizontal reductions (disabled) Match reductions starting at binary operation feeding into a phi. The code handles trees like r += v1 + v2 + v3 ... and r += v1 r += v2 ... and r *= v1 + v2 + ... We currently only handle associative operations (add, fadd fast). The code can now also handle reductions feeding into stores. a[i] = v1 + v2 + v3 + ... The code is currently disabled behind the flag "-slp-vectorize-hor". The cost model for most architectures is not there yet. I found one opportunity of a horizontal reduction feeding a phi in TSVC (LoopRerolling-flt) and there are several opportunities where reductions feed into stores. radar://14607682 llvm-svn: 191108	2013-09-20 21:18:20 +00:00
Joerg Sonnenberger	d4339cb110	Revert r191017, it results in segmentation faults in Qt. llvm-svn: 191104	2013-09-20 20:33:57 +00:00
Benjamin Kramer	7ea950a209	InstCombine: Canonicalize (gep i8* X, -(ptrtoint Y)) to (sub (ptrtoint X), (ptrtoint Y)) The GEP pattern is what SCEV expander emits for "ugly geps". The latter is what you get for pointer subtraction in C code. The rest of instcombine already knows how to deal with that so just canonicalize on that. llvm-svn: 191090	2013-09-20 14:38:44 +00:00
Shuxin Yang	44b8e62121	[Fast-math] Disable "(C1/X)*C2 => (C1*C2)/X" if C1/X has multiple uses. If "C1/X" has multiple uses, the only benefit of this transformation is to potentially shorten the critical path, but at the cost of introducing an additional div. The additional div may or may not incur cost depending on how div is implemented. If it is implemented using Newton–Raphson iteration, it doesn't seem to incur any cost (FIXME). However, if the div blocks the entire pipeline, that sounds pretty expensive. Let CodeGen take care of this transformation. This patch sees 6% on a benchmark. rdar://15032743 llvm-svn: 191037	2013-09-19 21:13:46 +00:00
Benjamin Kramer	b681a45715	InstCombine: Don't allow turning vector-of-pointer loads into vector-of-integer. The code below can't handle any pointers. PR17293. llvm-svn: 191036	2013-09-19 20:59:04 +00:00
Shuxin Yang	76ccefc5f8	GVN proceeds in the presence of dead code. This is how it ignores the dead code: 1) When a dead branch target, say block B, is identified, all the blocks dominated by B are dead as well. 2) The PHIs of those blocks in dominance-frontier(B) are updated such that the operands corresponding to dead predecessors are replaced by "UndefVal". Using lattice's jargon, the "UndefVal" is the "Top" in essence. A PHI node like "phi(v1 bb1, undef xx)" will be optimized into "v1" if v1 is constant, or v1 is an instruction which dominates this PHI node. 3) When analyzing the availability of a load L, all dead mem-ops which L depends on are disguised as a load which evaluates to exactly the same value as L. 4) The dead mem-ops will be materialized as "UndefVal" during code motion. llvm-svn: 191017	2013-09-19 17:22:51 +00:00
Evgeniy Stepanov	d26ac53a42	[msan] Wrap indirect functions. Adds a flag to the MemorySanitizer pass that enables runtime rewriting of indirect calls. This is part of MSanDR implementation and is needed to return control to the DynamoRIO-based helper tool on transition between instrumented and non-instrumented modules. Disabled by default. llvm-svn: 191006	2013-09-19 15:22:35 +00:00
Kostya Serebryany	9e042a92a5	[asan] call __asan_stack_malloc_N only if use-after-return detection is enabled with the run-time option llvm-svn: 190939	2013-09-18 14:07:14 +00:00
Robert Lytton	b41e4ff222	Prevent LoopVectorizer and SLPVectorizer running if the target has no vector registers. XCore target: Add XCoreTargetTransformInfo This is where getNumberOfRegisters() resides, which in turn returns the number of vector registers (=0). llvm-svn: 190936	2013-09-18 12:43:35 +00:00
Craig Topper	194d1e2a5a	Revert accidental commit I had to make to get the test case in PR17268 to still work correctly. llvm-svn: 190917	2013-09-18 04:10:17 +00:00
Craig Topper	5d022196de	Lift alignment restrictions for load/store folding on VINSERTF128/VEXTRACTF128. Fixes PR17268. llvm-svn: 190916	2013-09-18 03:55:53 +00:00
David Blaikie	8914db31b9	ifndef NDEBUG-out an asserts-only constant committed in r190863 llvm-svn: 190905	2013-09-18 00:11:27 +00:00
Quentin Colombet	1950396a9e	Revert the load slicing done in r190870. To avoid regressions with bitfield optimizations, this slicing should take place later, like ISel time. llvm-svn: 190891	2013-09-17 22:01:26 +00:00
Matt Arsenault	6fd5ad85d0	Cleanup handling of constant function casts. Some of this code is no longer necessary since int<->ptr casts no longer occur as of r187444. This also fixes handling vectors of pointers, and adds a bunch of new testcases for vectors and address spaces. llvm-svn: 190885	2013-09-17 21:10:14 +00:00
Arnold Schwaighofer	43c2040076	SLPVectorizer: Don't vectorize phi nodes that use invoke values We can't insert an insertelement after an invoke. We would have to split a critical edge. So when we see a phi node that uses an invoke we just give up. radar://14990770 llvm-svn: 190871	2013-09-17 17:03:29 +00:00
Quentin Colombet	3d996b9289	[InstCombiner] Slice a big load in two loads when the elements are next to each other in memory. The motivation was to get rid of truncate and shift right instructions that get in the way of paired load or floating point load. E.g., consider the following example: struct Complex { float real; float imm; }; When accessing a complex, llvm was generating a 64-bit load and the imm field was obtained by a trunc(lshr) sequence, resulting in poor code generation, at least for x86. The idea is to declare that two load instructions are the canonical form for loading two arithmetic types which are next to each other in memory. Two scalar loads at a constant offset from each other are pretty easy to detect for the sorts of passes that like to mess with loads. <rdar://problem/14477220> llvm-svn: 190870	2013-09-17 16:57:34 +00:00
Kostya Serebryany	f9c84976da	[asan] inline the calls to __asan_stack_free_* with small sizes. Yet another 10%-20% speedup for use-after-return llvm-svn: 190863	2013-09-17 12:14:50 +00:00
Stepan Dyatkovskiy	124d49fc91	Bugfix for PR17099: Wrong cast operation. MergeFunctions emits Bitcast instead of pointer-to-integer operation. Patch fixes MergeFunctions::writeThunk function. It replaces unconditional Bitcast creation with "Value* createCast(...)" method, that checks operand types and selects proper instruction. See unit-test as example. llvm-svn: 190859	2013-09-17 09:36:11 +00:00
Matt Arsenault	513e7539be	MemCpyOptimizer: Use max legal int size instead of pointer size If there are no legal integers, assume 1 byte. This makes more sense than using the pointer size as a guess for the maximum GPR width. It is conceivable to want to use some 64-bit pointers on a target where 64-bit integers aren't legal. llvm-svn: 190817	2013-09-16 22:43:16 +00:00
Arnold Schwaighofer	11f318e34c	Don't vectorize if there are outside loop users of the induction variable. We would have to compute the pre increment value, either by computing it on every loop iteration or by splitting the edge out of the loop and inserting a computation for it there. For now, just give up vectorizing such loops. Fixes PR17179. llvm-svn: 190790	2013-09-16 16:17:24 +00:00
Evgeniy Stepanov	f45c97a722	[msan] Check return value of main(). llvm-svn: 190782	2013-09-16 13:24:32 +00:00
Peter Collingbourne	cf3b1a2910	Implement function prefix data as an IR feature. Previous discussion: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/063909.html Differential Revision: http://llvm-reviews.chandlerc.com/D1191 llvm-svn: 190773	2013-09-16 01:08:15 +00:00
Benjamin Kramer	b6950f09cc	Replace some unnecessary vector copies with references. llvm-svn: 190770	2013-09-15 22:04:42 +00:00
Robert Wilhelm	0ba533b69c	Fix spelling. llvm-svn: 190750	2013-09-14 09:34:59 +00:00
Chandler Carruth	d47d52e219	Remove the long, long defunct IR block placement pass. This pass was based on the previous (essentially unused) profiling infrastructure and the assumption that by ordering the basic blocks at the IR level in a particular way, the correct layout would happen in the end. This sometimes worked, and mostly didn't. It also was a really naive implementation of the classical paper that dates from when branch predictors were primarily directional and when loop structure wasn't commonly available. It also didn't factor into the equation non-fallthrough branches and other machine level details. Anyways, for all of these reasons and more, I wrote MachineBlockPlacement, which completely supersedes this pass. It both uses modern profile information infrastructure, and actually works. =] llvm-svn: 190748	2013-09-14 09:28:14 +00:00
Evgeniy Stepanov	47ad94a685	[msan] Add source file:line to stack origin reports. Compiler part. llvm-svn: 190689	2013-09-13 12:54:49 +00:00
Duncan Sands	5dbc902c8f	Avoid a compiler warning about Found not being used when assertions are disabled. llvm-svn: 190668	2013-09-13 08:16:06 +00:00
Hal Finkel	fe9daed60a	Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 190542	2013-09-11 19:25:43 +00:00
Benjamin Kramer	23f676e21d	Revert "Give internal classes hidden visibility." It works with clang, but GCC has different rules so we can't make all of those hidden. This reverts commit r190534. llvm-svn: 190536	2013-09-11 18:05:11 +00:00
Benjamin Kramer	386bd314a1	Give internal classes hidden visibility. Worth 100k on a linux/x86_64 Release+Asserts clang. llvm-svn: 190534	2013-09-11 17:42:27 +00:00
Matt Arsenault	8d27006dd4	Use type form of getIntPtrType This doesn't change anything since malloc always returns address space 0. llvm-svn: 190498	2013-09-11 07:29:40 +00:00
Matt Arsenault	d35880d234	Teach loop-idiom about address space pointer sizes llvm-svn: 190491	2013-09-11 05:09:42 +00:00
Matt Arsenault	8c0c236d92	Add braces llvm-svn: 190490	2013-09-11 05:09:35 +00:00
Eli Friedman	3718173153	Get rid of unused isPodLike definitions. llvm-svn: 190461	2013-09-11 00:36:54 +00:00
Eli Friedman	e87f8feb58	Don't assert on invalid loop vectorization hint. llvm-svn: 190450	2013-09-10 23:45:25 +00:00
Eli Friedman	713110699b	Fix mistake in r190442. llvm-svn: 190446	2013-09-10 23:09:24 +00:00
Eli Friedman	89c35cdea8	Remove unused functions. llvm-svn: 190442	2013-09-10 22:42:31 +00:00
Matt Arsenault	a15663a1b4	Teach ScalarEvolution about pointer address spaces llvm-svn: 190425	2013-09-10 19:55:24 +00:00
Benjamin Kramer	06ad7ff792	LoopVectorize: PHI nodes are always at the beginning of a block, no need to scan the whole block. llvm-svn: 190422	2013-09-10 18:46:15 +00:00
Kostya Serebryany	0218bab3d0	[asan] refactor the use-after-return API so that the size class is computed at compile time instead of at run-time. llvm part llvm-svn: 190407	2013-09-10 13:16:56 +00:00
Matt Arsenault	eeee915ad0	Use StringRef::npos for StringRef instead of std::string one llvm-svn: 190375	2013-09-10 00:41:53 +00:00
Eli Friedman	f2904ff59c	Don't shrink atomic ops to bool in GlobalOpt. LLVM IR doesn't currently allow atomic bool load/store operations, and the transformation is dubious anyway because it isn't profitable on all platforms. PR17163. llvm-svn: 190357	2013-09-09 22:00:13 +00:00
Quentin Colombet	48ec8d3046	[InstCombiner] Expose opportunities to merge subtract and comparison. Several architectures use the same instruction to perform both a comparison and a subtract. The instruction selection framework does not allow considering different basic blocks to expose such fusion opportunities. Therefore, these instructions are “merged” by CSE at MI IR level. To increase the likelihood of CSE applying in such situations, we reorder the operands of the comparison, when they have the same complexity, so that they match the order of the most frequent subtract. E.g., icmp A, B ... sub B, A <rdar://problem/14514580> llvm-svn: 190352	2013-09-09 20:56:48 +00:00
Bob Wilson	6c7d3717b3	Revert patches to add case-range support for PR1255. The work on this project was left in an unfinished and inconsistent state. Hopefully someone will eventually get a chance to implement this feature, but in the meantime, it is better to put things back the way they were. I have left support in the bitcode reader to handle the case-range bitcode format, so that we do not lose bitcode compatibility with the llvm 3.3 release. This reverts the following commits: 155464, 156374, 156377, 156613, 156704, 156757, 156804, 156808, 156985, 157046, 157112, 157183, 157315, 157384, 157575, 157576, 157586, 157612, 157810, 157814, 157815, 157880, 157881, 157882, 157884, 157887, 157901, 158979, 157987, 157989, 158986, 158997, 159076, 159101, 159100, 159200, 159201, 159207, 159527, 159532, 159540, 159583, 159618, 159658, 159659, 159660, 159661, 159703, 159704, 160076, 167356, 172025, 186736 llvm-svn: 190328	2013-09-09 19:14:35 +00:00
Manman Ren	de03bcdbec	TBAA: add isTBAAVtableAccess to MDNode so clients can call the function instead of having its own implementation. The implementation of isTBAAVtableAccess is in TypeBasedAliasAnalysis.cpp since it is related to the format of TBAA metadata. The path for struct-path tbaa will be exercised by test/Instrumentation/ThreadSanitizer/read_from_global.ll, vptr_read.ll, and vptr_update.ll when struct-path tbaa is on by default. llvm-svn: 190216	2013-09-06 22:47:05 +00:00
Matt Arsenault	4c2083b14a	Use type helper functions. llvm-svn: 190113	2013-09-06 00:37:24 +00:00
Matt Arsenault	f658af2617	Teach CodeGenPrepare about address spaces llvm-svn: 190112	2013-09-06 00:18:43 +00:00
Matt Arsenault	f022cb3c1a	Consistently use dbgs() in debug printing llvm-svn: 190093	2013-09-05 19:48:28 +00:00
Rafael Espindola	5a6f92c702	Remove unused argument. llvm-svn: 190090	2013-09-05 19:15:21 +00:00
Nick Lewycky	194eee5b98	Declare missing dependency on AliasAnalysis. Patch by Liu Xin! llvm-svn: 190035	2013-09-05 08:19:58 +00:00
Rafael Espindola	2a5c049416	Rename some variables to match the style guide. I am about to patch this code, and this makes the diff far more readable. llvm-svn: 189982	2013-09-04 20:08:46 +00:00
Rafael Espindola	b52aea99b5	Small simplification given that insert of an empty range is a nop. llvm-svn: 189971	2013-09-04 18:53:21 +00:00
Rafael Espindola	77e21912e2	Refactor duplicated logic to a helper function. No functionality change. llvm-svn: 189969	2013-09-04 18:37:36 +00:00
Rafael Espindola	6e3748cf68	Remove dead code. llvm-svn: 189967	2013-09-04 18:16:02 +00:00
Rafael Espindola	357980289a	Revert "Add r159136 back now that pr13124 has been fixed." This reverts commit r189886. I found a corner case where this optimization is not valid: Say we have a "linkonce_odr unnamed_addr" in two translation units: * In TU 1 this optimization kicks in and makes it hidden. * In TU 2 it gets const merged with a constant that is not unnamed_addr, resulting in a non unnamed_addr constant with default visibility. * The static linker rules for combining visibility then produce a hidden symbol, which is incorrect from the point of view of the non unnamed_addr constant. The one place we can do this is when we know that the symbol is not used from another TU in the same shared object, i.e., during LTO. I will move it there. llvm-svn: 189954	2013-09-04 16:09:01 +00:00
Tim Northover	baf9697e72	InstCombine: allow unmasked icmps to be combined with logical ops "(icmp op i8 A, B)" is equivalent to "(icmp op i8 (A & 0xff), B)" as a degenerate case. Allowing this as a "masked" comparison when analysing "(icmp) &/| (icmp)" allows us to combine them in more cases. rdar://problem/7625728 llvm-svn: 189931	2013-09-04 11:57:17 +00:00
Tim Northover	9045305fce	InstCombine: look for masked compares with subset relation Even in cases which aren't universally optimisable like "(A & B) != 0 && (A & C) != 0", the masks can make one of the comparisons completely redundant. In this case, since we've gone to the effort of spotting masked comparisons we should combine them. rdar://problem/7625728 llvm-svn: 189930	2013-09-04 11:57:13 +00:00
Rafael Espindola	8b9c0a576e	Add r159136 back now that pr13124 has been fixed. Original message: If a constant or a function has linkonce_odr linkage and unnamed_addr, mark hidden. Being linkonce_odr guarantees that it is available in every dso that needs it. Being a constant/function with unnamed_addr guarantees that the copies don't have to be merged. llvm-svn: 189886	2013-09-03 23:34:36 +00:00
Michael Gottesman	3912825096	[objc-arc] Remove dead code from previous commit. llvm-svn: 189870	2013-09-03 22:40:56 +00:00
Michael Gottesman	81c7cd8b47	[objc-arc] Turn off the objc_retainBlock -> objc_retain optimization. The reason that I am turning off this optimization is that there is an additional case where a block can escape that has come up. Specifically, this occurs when a block is used in a scope outside of its current scope. This can cause a captured retainable object pointer whose life is preserved by the objc_retainBlock to be deallocated before the block is invoked. An example of the code needed to trigger the bug is: ---- #import <Foundation/Foundation.h> int main(int argc, const char * argv[]) { void (^somethingToDoLater)(); { NSObject *obj = [NSObject new]; somethingToDoLater = ^{ [obj self]; // Crashes here }; } NSLog(@"test."); somethingToDoLater(); return 0; } ---- In the next commit, I remove all the dead code that results from this. Once I put in the fixing commit I will bring back the tests that I deleted in this commit. rdar://14802782. rdar://14868830. llvm-svn: 189869	2013-09-03 22:40:54 +00:00
Nadav Rotem	ff7b438ada	Enable late-vectorization by default. This patch changes the default setting for the LateVectorization flag that controls where the loop-vectorizer is run. Perf gains: SingleSource/Benchmarks/Shootout/matrix -37.33% MultiSource/Benchmarks/PAQ8p/paq8p -22.83% SingleSource/Benchmarks/Linpack/linpack-pc -16.22% SingleSource/Benchmarks/Shootout-C++/ary3 -15.16% MultiSource/Benchmarks/TSVC/NodeSplitting-flt/NodeSplitting-flt -10.34% MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl -7.12% Regressions: SingleSource/Benchmarks/Misc/lowercase 15.10% MultiSource/Benchmarks/TSVC/Equivalencing-flt/Equivalencing-flt 13.18% SingleSource/Benchmarks/Shootout-C++/matrix 8.27% SingleSource/Benchmarks/CoyoteBench/lpbench 7.30% llvm-svn: 189858	2013-09-03 21:33:17 +00:00
Matt Arsenault	469d672381	Teach InstCombineLoadCast about address spaces. This is another one that doesn't matter much, but uses the right GEP index types in the first place. llvm-svn: 189854	2013-09-03 21:05:48 +00:00
Matt Arsenault	306cb38abf	Use type form of getIntPtrType in alloca visitor. This doesn't actually matter, since alloca is always 0 address space, but this is more consistent. llvm-svn: 189853	2013-09-03 21:05:15 +00:00
Yi Jiang	e1b34bf1fe	In this patch we are trying to do two things: 1) If the width of vectorization list candidate is bigger than vector reg width, we will break it down to fit the vector reg. 2) We do not vectorize the width which is not power of two. The performance result shows it will help some spec benchmarks. mesa improved 6.97% and ammp improved 1.54%. llvm-svn: 189830	2013-09-03 17:26:04 +00:00
Evgeniy Stepanov	475b9bd212	[msan] Fix handling of select with struct arguments. llvm-svn: 189796	2013-09-03 13:05:29 +00:00
Evgeniy Stepanov	af50498a8a	[msan] Fix select instrumentation. Select condition shadow was being ignored resulting in false negatives. This change OR-s sign-extended condition shadow into the result shadow. llvm-svn: 189785	2013-09-03 10:04:11 +00:00
Benjamin Kramer	4bd23de2ce	SimplifyLibCalls: When emitting an overloaded fp function check that it's available. The existing code missed some edge cases when e.g. we're going to emit sqrtf but only the availability of sqrt was checked. This happens on odd platforms like windows. llvm-svn: 189724	2013-08-31 18:19:35 +00:00
Bill Wendling	1e3b2981d1	Compulsive reformatting. llvm-svn: 189697	2013-08-30 21:07:33 +00:00
Benjamin Kramer	3a81691558	InstCombine: Check for zero shift amounts before subtracting one causing integer overflow. PR17026. Also avoid undefined shifts and shift amounts larger than 64 bits (those are always undef because we can't represent integer types that large). llvm-svn: 189672	2013-08-30 14:35:35 +00:00
Bill Wendling	9107fdb197	Random cleanup: No need to use a std::vector here, since createInternalizePass uses an ArrayRef. llvm-svn: 189632	2013-08-30 00:48:37 +00:00
Hal Finkel	198ffea54f	Revert: r189565 - Add getUnrollingPreferences to TTI Revert unintentional commit (of an unreviewed change). Original commit message: Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 189566	2013-08-29 03:33:15 +00:00
Hal Finkel	04a990355c	Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 189565	2013-08-29 03:29:57 +00:00
Nadav Rotem	2fd78a4651	Vectorizer/PassManager: I am working on moving the vectorizer out of the SCC passes. This patch moves the SLP-vectorizer and BB-vectorizer back into SCC passes for two reasons: 1. They are a kind of canonicalization. 2. The performance measurements show that it is better to keep them in. There should be no functional change if you are not enabling the LateVectorization mode. llvm-svn: 189539	2013-08-28 23:40:29 +00:00
Matt Arsenault	60f6ad8d53	Fix typo. llvm-svn: 189524	2013-08-28 22:17:26 +00:00
Hal Finkel	a22a21165f	Disable unrolling in the loop vectorizer when disabled in the pass manager When unrolling is disabled in the pass manager, the loop vectorizer should also not unroll loops. This will allow the -fno-unroll-loops option in Clang to behave as expected (even for vectorizable loops). The loop vectorizer's -force-vector-unroll option will (continue to) override the pass-manager setting (including -force-vector-unroll=0 to force use of the internal auto-selection logic). In order to test this, I added a flag to opt (-disable-loop-unrolling) to force disable unrolling through opt (the analog of -fno-unroll-loops in Clang). Also, this fixes a small bug in opt where the loop vectorizer was enabled only after the pass manager populated the queue of passes (the global_alias.ll test needed a slight update to the RUN line as a result of this fix). llvm-svn: 189499	2013-08-28 18:33:10 +00:00
Alexey Samsonov	007bc9e8d7	80 cols llvm-svn: 189473	2013-08-28 11:25:12 +00:00
Peter Collingbourne	e16a469093	DataFlowSanitizer: Implement trampolines for function pointers passed to custom functions. Differential Revision: http://llvm-reviews.chandlerc.com/D1503 llvm-svn: 189408	2013-08-27 22:09:06 +00:00
Nadav Rotem	c8417c3f79	Refactor 'vectorizeLoop' no functionality change. This patch merges LoopVectorize of InnerLoopVectorizer and InnerLoopUnroller by adding checks for VF=1. This helps in erasing the Unroller code that is almost identical to the InnerLoopVectorizer code. llvm-svn: 189391	2013-08-27 18:52:47 +00:00
Michael Gottesman	5cd63cc173	Fixed typo. Noticed by Stephen Checkoway <s@pahtak.org>. llvm-svn: 189312	2013-08-27 04:43:03 +00:00
Matt Arsenault	c32a5a3d46	Fix inserting instructions before last in bundle. The builder inserts from before the insert point, not after, so this would insert before the last instruction in the bundle instead of after it. I'm not sure if this can actually be a problem with any of the current insertions. llvm-svn: 189285	2013-08-26 23:08:37 +00:00
Nadav Rotem	7d4e24e1f4	LoopVectorize: Implement partial loop unrolling when vectorization is not profitable. This patch enables unrolling of loops when vectorization is legal but not profitable. We add a new class InnerLoopUnroller, that extends InnerLoopVectorizer and replaces some of the vector-specific logic with scalars. This patch does not introduce any runtime regressions and improves the following workloads: SingleSource/Benchmarks/Shootout/matrix -22.64% SingleSource/Benchmarks/Shootout-C++/matrix -13.06% External/SPEC/CINT2006/464_h264ref/464_h264ref -3.99% SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding -1.95% llvm-svn: 189281	2013-08-26 22:33:26 +00:00
Yi Jiang	b951785e03	test commit. Remove blank line llvm-svn: 189265	2013-08-26 18:57:55 +00:00
Matt Arsenault	49ebc53ae4	Fix unused variable in release build llvm-svn: 189264	2013-08-26 18:38:29 +00:00
Matt Arsenault	0d6c559675	Constify functions llvm-svn: 189234	2013-08-26 17:56:38 +00:00
Matt Arsenault	fe57252c78	Vectorize starting from insertelements building a vector llvm-svn: 189233	2013-08-26 17:56:35 +00:00
Matt Arsenault	c1b8722791	Check if in set on insertion instead of separately llvm-svn: 189179	2013-08-24 19:55:38 +00:00
Benjamin Kramer	02d328a0f9	Add a function object to compare the first or second component of a std::pair. Replace instances of this scattered around the code base. llvm-svn: 189169	2013-08-24 12:54:27 +00:00
Peter Collingbourne	a2ec50d21b	DataFlowSanitizer: correctly combine labels in the case where they are equal. llvm-svn: 189133	2013-08-23 18:45:06 +00:00
Evgeniy Stepanov	47f9a57504	[msan] Fix handling of va_arg overflow area on x86_64. The code was erroneously reading overflow area shadow from the TLS slot, bypassing the local copy. Reading shadow directly from TLS is wrong, because it can be overwritten by a nested vararg call, if that happens before va_start. llvm-svn: 189104	2013-08-23 12:11:00 +00:00
Richard Sandiford	b195d89bde	Turn MipsOptimizeMathLibCalls into a target-independent scalar transform ...so that it can be used for z too. Most of the code is the same. The only real change is to use TargetTransformInfo to test when a sqrt instruction is available. The pass is opt-in because at the moment it only handles sqrt. llvm-svn: 189097	2013-08-23 10:27:02 +00:00
Alexey Samsonov	e81fe60561	80 cols llvm-svn: 189091	2013-08-23 07:42:51 +00:00
Michael Gottesman	0f9b142f60	Update StripDeadDebugInfo to use DebugInfoFinder so that it is no longer stale to the point of not working and more resilient to debug info changes. The current version of StripDeadDebugInfo became stale and no longer actually worked since it was expecting an older version of debug info. This patch updates it to use DebugInfoFinder and the modern DebugInfo classes as much as possible to make it more resilient to such changes. Additionally, the only place where that was avoided (the code where we replace the old sets with the new), I call verify on the DIContextUnit implying that if the format changes and my live set changes no longer make sense an assert will be hit. In order to ensure that that occurs I have included a test case. The actual stripping of the dead debug info follows the same strategy as was used before in this class: find the live set and replace the old set in the given compile unit (which may contain dead global variables/functions) with the new live one. llvm-svn: 189078	2013-08-23 00:23:24 +00:00
Peter Collingbourne	1e7de1b7af	DataFlowSanitizer: Replace non-instrumented aliases of instrumented functions, and vice versa, with wrappers. Differential Revision: http://llvm-reviews.chandlerc.com/D1442 llvm-svn: 189054	2013-08-22 20:08:15 +00:00
Peter Collingbourne	411f259ed9	DataFlowSanitizer: Factor the wrapper builder out to buildWrapperFunction. Differential Revision: http://llvm-reviews.chandlerc.com/D1441 llvm-svn: 189053	2013-08-22 20:08:11 +00:00
Peter Collingbourne	ac1c1c4377	DataFlowSanitizer: Prefix the name of each instrumented function with "dfs$". DFSan changes the ABI of each function in the module. This makes it possible for a function with the native ABI to be called with the instrumented ABI, or vice versa, thus possibly invoking undefined behavior. A simple way of statically detecting instances of this problem is to prepend the prefix "dfs$" to the name of each instrumented-ABI function. This will not catch every such problem; in particular function pointers passed across the instrumented-native barrier cannot be used on the other side. These problems could potentially be caught dynamically. Differential Revision: http://llvm-reviews.chandlerc.com/D1373 llvm-svn: 189052	2013-08-22 20:08:08 +00:00
Chandler Carruth	e6b6740e73	Teach the SLP vectorizer the correct way to check for consecutive access using GEPs. Previously, it used a number of different heuristics for analyzing the GEPs. Several of these were conservatively correct, but failed to fall back to SCEV even when SCEV might have given a reasonable answer. One was simply incorrect in how it was formulated. There was good code already to recursively evaluate the constant offsets in GEPs, look through pointer casts, etc. I gathered this into a form code like the SLP code can use in a previous commit, which allows all of this code to become quite simple. There is some performance (compile time) concern here at first glance as we're directly attempting to walk both pointers constant GEP chains. However, a couple of thoughts: 1) The very common cases where there is a dynamic pointer, and a second pointer at a constant offset (usually a stride) from it, this code will actually not do any unnecessary work. 2) InstCombine and other passes work very hard to collapse constant GEPs, so it will be rare that we iterate here for a long time. That said, if there remain performance problems here, there are some obvious things that can improve the situation immensely. Doing a vectorizer-pass-wide memoizer for each individual layer of pointer values, their base values, and the constant offset is likely to be able to completely remove redundant work and strictly limit the scaling of the work to scrape these GEPs. Since this optimization was not done on the prior version (which would still benefit from it), I've not done it here. But if folks have benchmarks that slow down it should be straight forward for them to add. I've added a test case, but I'm not really confident of the amount of testing done for different access patterns, strides, and pointer manipulation. llvm-svn: 189007	2013-08-22 12:45:17 +00:00
Matt Arsenault	1b410a4dec	Teach LoopVectorize about address space sizes llvm-svn: 188980	2013-08-22 02:42:55 +00:00
Michael Gottesman	50d821cd72	Fixed typo. llvm-svn: 188957	2013-08-21 22:53:54 +00:00
Michael Gottesman	704d037910	Removed trailing whitespace. llvm-svn: 188956	2013-08-21 22:53:29 +00:00
Yunzhong Gao	9a8c1892ea	No functionality change. Replace "(255 & value)" with "(0xFF & value)" to improve clarity. llvm-svn: 188941	2013-08-21 22:11:15 +00:00
Matt Arsenault	95d00423a7	Teach InstCombine about address spaces llvm-svn: 188926	2013-08-21 19:53:10 +00:00
Matt Arsenault	3e8997425a	Use attribute helper function llvm-svn: 188916	2013-08-21 18:54:50 +00:00
Matt Arsenault	68d9c05b51	Fix typo llvm-svn: 188915	2013-08-21 18:54:47 +00:00
Bill Wendling	68c17b24b6	Move registering the execution of a basic block to the beginning rather than the end. There are situations which can affect the correctness (or at least expectation) of the gcov output. For instance, if a call to __gcov_flush() occurs within a block before the execution count is registered and then the program aborts in some way, then that block will not be marked as executed. This is not normally what the user expects. If we move the code that's registering when a block is executed to the beginning, we can catch these types of situations. PR16893 llvm-svn: 188849	2013-08-20 23:52:00 +00:00
Arnold Schwaighofer	276cfe784a	SLPVectorizer: Fix invalid iterator errors Update iterator when the SLP vectorizer changes the instructions in the basic block by restarting the traversal of the basic block. Patch by Yi Jiang! Fixes PR 16899. llvm-svn: 188832	2013-08-20 21:21:45 +00:00
Hal Finkel	8f395a803a	Add a llvm.copysign intrinsic This adds a llvm.copysign intrinsic; We already have Libfunc recognition for copysign (which is turned into the FCOPYSIGN SDAG node). In order to autovectorize calls to copysign in the loop vectorizer, we need a corresponding intrinsic as well. In addition to the expected changes to the language reference, the loop vectorizer, BasicTTI, and the SDAG builder (the intrinsic is transformed into an FCOPYSIGN node, just like the function call), this also adds FCOPYSIGN to a few lists in LegalizeVector{Ops,Types} so that vector copysigns can be expanded. In TargetLoweringBase::initActions, I've made the default action for FCOPYSIGN be Expand for vector types. This seems correct for all in-tree targets, and I think is the right thing to do because, previously, there was no way to generate vector-values FCOPYSIGN nodes (and most targets don't specify an action for vector-typed FCOPYSIGN). llvm-svn: 188728	2013-08-19 23:35:46 +00:00
Jakub Staszak	b4a62fef02	Use pop_back_val() instead of both back() and pop_back(). llvm-svn: 188723	2013-08-19 22:47:55 +00:00
Matt Arsenault	77bbadbfcb	Teach InstCombine visitGetElementPtr about address spaces llvm-svn: 188721	2013-08-19 22:17:40 +00:00
Matt Arsenault	2c55fe773b	Cleanup visitGetElementPtr to make address space change easier llvm-svn: 188720	2013-08-19 22:17:34 +00:00
Matt Arsenault	14a3f7be8d	commonPointerCast cleanups to make address space change easier llvm-svn: 188719	2013-08-19 22:17:18 +00:00
Matt Arsenault	22098dc46b	Revert non-test parts of r188507 Re-add the inboundsless tests I didn't add originally llvm-svn: 188710	2013-08-19 21:40:31 +00:00
Peter Collingbourne	aa48e92de9	Introduce SpecialCaseList::isIn overload for GlobalAliases. Differential Revision: http://llvm-reviews.chandlerc.com/D1437 llvm-svn: 188688	2013-08-19 19:00:35 +00:00
Michael Kuperstein	9bedf7fc8c	Adds missing TLI check for library simplification of * pow(x, 0.5) -> fabs(sqrt(x)) * pow(2.0, x) -> exp2(x) llvm-svn: 188656	2013-08-19 06:55:47 +00:00
Peter Collingbourne	ea402298e2	Remove SpecialCaseList::findCategory. It turned out that I didn't need this for DFSan. llvm-svn: 188646	2013-08-19 00:24:20 +00:00
Joerg Sonnenberger	72a32889f4	PR 16899: Do not modify the basic block using the iterator, but keep the next value. This avoids crashes due to invalidation. Patch by Joey Gouly. llvm-svn: 188605	2013-08-17 11:04:47 +00:00
Jim Grosbach	933ecf8022	InstCombine: Use isAllOnesValue() instead of explicit -1. llvm-svn: 188563	2013-08-16 17:03:36 +00:00
Jim Grosbach	72387340f5	InstCombine: Simplify if(x!=0 && x!=-1). When both constants are positive or both constants are negative, InstCombine already simplifies comparisons like this, but when it's exactly zero and -1, the operand sorting ends up reversed and the pattern fails to match. Handle that special case. Follow up for rdar://14689217 llvm-svn: 188512	2013-08-16 00:15:20 +00:00
Matt Arsenault	9594ef019c	Don't do FoldCmpLoadFromIndexedGlobal for non inbounds GEPs This path wasn't tested before without a datalayout, so add some more tests and re-run with and without one. llvm-svn: 188507	2013-08-15 23:11:07 +00:00
Matt Arsenault	66eeeddb1d	Fix spelling llvm-svn: 188506	2013-08-15 23:11:03 +00:00
Yunzhong Gao	37d3ce60e8	Fixing a corner-case bug in strchr and strrchr lib call optimizations where the input character is not converted to char before comparing with zero. The patch was discussed in this thread: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130812/184069.html llvm-svn: 188489	2013-08-15 20:58:59 +00:00
Peter Collingbourne	25f0a1d209	DataFlowSanitizer: Add a debugging feature to help us track nonzero labels. Summary: When the -dfsan-debug-nonzero-labels parameter is supplied, the code is instrumented such that when a call parameter, return value or load produces a nonzero label, the function __dfsan_nonzero_label is called. The idea is that a debugger breakpoint can be set on this function in a nominally label-free program to help identify any bugs in the instrumentation pass causing labels to be introduced. Reviewers: eugenis CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1405 llvm-svn: 188472	2013-08-15 18:51:12 +00:00
Mark Lacey	c3ab3bb4bc	Fix small typo: s/succ/Succ/ llvm-svn: 188415	2013-08-14 22:11:42 +00:00
Peter Collingbourne	8968732c46	DataFlowSanitizer: Instrumentation for memset. Differential Revision: http://llvm-reviews.chandlerc.com/D1395 llvm-svn: 188412	2013-08-14 20:51:38 +00:00
Peter Collingbourne	905e1efbe5	DataFlowSanitizer: greylist is now ABI list. This replaces the old incomplete greylist functionality with an ABI list, which can provide more detailed information about the ABI and semantics of specific functions. The pass treats every function in the "uninstrumented" category in the ABI list file as conforming to the "native" (i.e. unsanitized) ABI. Unless the ABI list contains additional categories for those functions, a call to one of those functions will produce a warning message, as the labelling behaviour of the function is unknown. The other supported categories are "functional", "discard" and "custom". - "discard" -- This function does not write to (user-accessible) memory, and its return value is unlabelled. - "functional" -- This function does not write to (user-accessible) memory, and the label of its return value is the union of the label of its arguments. - "custom" -- Instead of calling the function, a custom wrapper __dfsw_F is called, where F is the name of the function. This function may wrap the original function or provide its own implementation. Differential Revision: http://llvm-reviews.chandlerc.com/D1345 llvm-svn: 188402	2013-08-14 18:54:12 +00:00
Chandler Carruth	219c3d81d0	Fix a really terrifying but improbable bug in mem2reg. If you have seen extremely subtle miscompilations (such as a load getting replaced with the value stored below the load within a basic block) related to promoting an alloca to an SSA value, there is the dim possibility that you hit this. Please let me know if you won this unfortunate lottery. The first half of mem2reg's core logic (as it is used both in the standalone mem2reg pass and in SROA) builds up a mapping from 'Instruction *' to the index of that instruction within its basic block. This allows quickly establishing which stores dominate a particular load even for large basic blocks. We cache this information throughout the run of mem2reg over a function in order to amortize the cost of computing it. This is not in and of itself a strange pattern in LLVM. However, it introduces a very important constraint: absolutely no instruction can be deleted from the program without updating the mapping. Otherwise a newly allocated instruction might get the same pointer address, and then end up with a wrong index. Yes, LLVM routinely suffers from a *single threaded* variant of the ABA problem. Most places in LLVM don't find avoiding this an imposition because they don't both delete and create new instructions iteratively, but mem2reg loves to do this... All the time. Fortunately, the mem2reg code was really careful about updating this cache to handle this eventuality... except when it comes to the debug declare intrinsic. Oops. The fix is to invalidate that pointer in the cache when we delete it, the same as we do when deleting alloca instructions and other instructions. I've also caused the same bug in new code while working on a fix to PR16867, so this seems to be a really unfortunate pattern. Hopefully in subsequent patches the deletion of dead instructions can be consolidated sufficiently to make it less likely that we'll see future occurrences of this bug. Sorry for not having a test case, but I have literally no idea how to reliably trigger this kind of thing. It may be single-threaded, but it remains an ABA problem. It would require a really amazing number of stars to align. llvm-svn: 188367	2013-08-14 08:56:41 +00:00
Matt Arsenault	cb3b478d91	Fix always creating GEP with i32 indices Use the pointer size if datalayout is available. Use i64 if it's not, which is consistent with what other places do when the pointer size is unknown. The test doesn't really test this in a useful way since it will be transformed to that later anyway, but this now tests it for non-zero arrays and when datalayout isn't available. The cases in visitGetElementPtrInst should save an extra re-visit to the newly created GEP since it won't need to cleanup after itself. llvm-svn: 188339	2013-08-14 00:24:38 +00:00
Matt Arsenault	6d1daebfff	Use type helper functions instead of cast llvm-svn: 188338	2013-08-14 00:24:34 +00:00
Matt Arsenault	734103e561	Use array initializer, space around operator llvm-svn: 188337	2013-08-14 00:24:05 +00:00
Hal Finkel	fd36621506	BBVectorize: Add initial stores to the write set when tracking uses When computing the use set of a store, we need to add the store to the write set prior to iterating over later instructions. Otherwise, if there is a later aliasing load of that store, that load will not be tagged as a use, and bad things will happen. trackUsesOfI still adds later dependent stores of an instruction to that instruction's write set, but it never sees the original instruction, and so when tracking uses of a store, the store must be added to the write set by the caller. Fixes PR16834. llvm-svn: 188329	2013-08-13 23:34:32 +00:00
Nick Lewycky	eab287a60a	Revert r187191, which broke opt -mem2reg on the testcases included in PR16867. However, opt -O2 doesn't run mem2reg directly so nobody noticed until r188146 when SROA started sending more things directly down the PromoteMemToReg path. In order to revert r187191, I also revert dependent revisions r187296, r187322 and r188146. Fixes PR16867. Does not add the testcases from that PR, but both of them should get added for both mem2reg and sroa when this revert gets unreverted. llvm-svn: 188327	2013-08-13 22:51:58 +00:00
Dmitry Vyukov	fe91b1efa2	dfsan: fix lint warnings llvm-svn: 188293	2013-08-13 16:52:41 +00:00
Arnold Schwaighofer	dfaac373ee	Also remove logic in LateVectorize llvm-svn: 188285	2013-08-13 16:12:04 +00:00
Arnold Schwaighofer	406976609b	Remove logic that decides whether to vectorize or not depending on O-levels I have moved this logic into clang and opt. llvm-svn: 188281	2013-08-13 15:51:25 +00:00


11089 Commits