llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Adam Nemet	c3652c7273	[LAA-memchecks] Comment improvement I forgot to roll this into r231816. It was requested by Hal in D8122. llvm-svn: 231821	2015-03-10 19:12:41 +00:00
Michael Zolotukhin	5abcdaa7c0	Enable loop-rotate before loop-vectorize by default llvm-svn: 231820	2015-03-10 19:07:41 +00:00
Adam Nemet	9315b99bd9	[LAA-memchecks 3/3] Introduce pointer partitions for memchecks This is the final patch that actually introduces the new parameter of partition mapping to RuntimePointerCheck::needsChecking. Another API (LAI::getInstructionsForAccess) is also exposed that helps to map pointers to instructions because ultimately we partition instructions. The WIP version of the Loop Distribution pass in D6930 has been adapted to use all this. See for example, how InstrPartitionContainer::computePartitionSetForPointers sets up the partitions using the above API and then calls to LAI::addRuntimeCheck with the pointer partitions. llvm-svn: 231818	2015-03-10 18:54:26 +00:00
Adam Nemet	5ee2447b48	[LAA-memchecks 2/3] Move number of memcheck threshold checking to LV Now the analysis won't "fail" if the memchecks exceed the threshold. It is the transform pass' responsibility to perform the check. This allows the transform pass to further analyze/eliminate the memchecks. E.g. in Loop distribution we only need to check pointers that end up in different partitions. Note that there is a slight change of functionality here. The logic in analyzeLoop is that if dependence checking fails due to non-constant distance between the pointers, another attempt is made to prove safety of the dependences purely using run-time checks. Before this patch we could fail the loop due to exceeding the memcheck threshold after the first step, now we only check the threshold in the client after the full analysis. There is no measurable compile-time effect but I wanted to record this here. llvm-svn: 231817	2015-03-10 18:54:23 +00:00
Adam Nemet	37dc13d5c0	[LAA-memchecks 1/3] Split out NumComparisons checks. NFC The check for the number of memchecks will be moved to the client of this analysis. Besides allowing for transform-specific thresholds, this also lets Loop Distribution post-process the memchecks; Loop Distribution only needs memchecks between pointers of different partitions. The motivation for this first patch is to untangle the CanDoRT check from the NumComparison check before moving the NumComparison part. CanDoRT means that we couldn't determine the bounds for the pointer. Note that NumComparison is set independent of this flag. llvm-svn: 231816	2015-03-10 18:54:19 +00:00
Sanjay Patel	1a55477781	remove names from comments; NFC llvm-svn: 231813	2015-03-10 18:41:22 +00:00
Sanjay Patel	25d06d29cd	fix typos; NFC llvm-svn: 231812	2015-03-10 18:37:05 +00:00
Benjamin Kramer	8a6a7bc837	NVPTX: Remove copy of LLVMInitializeNVPTXAsmPrinter. If anyone is using this for some strange reason, LLVMInitializeNVPTXAsmPrinter does exactly the same thing and is what other LLVM tools are calling. llvm-svn: 231810	2015-03-10 18:19:24 +00:00
Benjamin Kramer	b12c209269	Hexagon: Remove unused InstrMapping. llvm-svn: 231809	2015-03-10 18:19:16 +00:00
Adam Nemet	4024c4c865	[LoopAccesses 3/3] Print the dependences with -analyze The dependences are now exposed through the new getInterestingDependences API so we can use that with -analyze too and fix the FIXME. This lets us remove the test that relied on -debug to check the dependences. llvm-svn: 231807	2015-03-10 17:40:43 +00:00
Adam Nemet	78939ba308	[LoopAccesses 2/3] Allow querying of interesting dependences Gather an array of interesting dependences rather than just failing after the first unsafe one and regarding the loop unsafe. Loop Distribution needs to be able to collect all dependences in order to isolate the dependence cycles into their own partition. Since the dependence checking algorithm is quadratic in terms of accesses sharing the same underlying pointer, I am applying a cut-off threshold (MaxInterestingDependence). Exceeding that, the logic reverts back to the original approach deeming the loop unsafe upon encountering the first unsafe dependence. The main idea of the patch is to split isDependent from directly answering the question whether the dep is safe for vectorization to return a dependence type which then gets mapped to the old boolean result using Dependence::isSafeForVectorization. Tested that this was compile-time neutral on SpecINT2006 LTO bitcode inputs. No assembly change on the testsuite including external. llvm-svn: 231806	2015-03-10 17:40:37 +00:00
Adam Nemet	2084b90437	[LoopAccesses 1/3] Expose MemoryDepChecker to LAA users LoopDistribution needs to query various results of the dependence analysis. This series will expose some more APIs and state of the dependence checker. This patch is a simple one to just expose the DepChecker instance. The set is compile-time neutral measured with LTO bitcode files of SpecINT2006. Also there is no assembly change on the testsuite. llvm-svn: 231805	2015-03-10 17:40:34 +00:00
Rafael Espindola	074f6e2b72	Store an optional section start label in MCSection. This makes code that uses section relative expressions (debug info) simpler and less brittle. This is still a bit awkward as the symbol is created late and has to be stored in a mutable field. I will move the symbol creation earlier in the next patch. llvm-svn: 231802	2015-03-10 16:58:10 +00:00
Sanjay Patel	83c4b90c27	remove function names from comments; NFC llvm-svn: 231801	2015-03-10 16:42:24 +00:00
Igor Laevsky	ec23b1a840	Teach lowering to correctly handle invoke statepoint and gc results tied to them. Note that we still can not lower gc.relocates for invoke statepoints. Also it extracts getCopyFromRegs helper function in SelectionDAGBuilder as we need to be able to customize type of the register exported from basic block during lowering of the gc.result. (Resubmitting this change after not being able to reproduce buildbot failure) Differential Revision: http://reviews.llvm.org/D7760 llvm-svn: 231800	2015-03-10 16:26:48 +00:00
Chad Rosier	edcfbac252	[BranchFolding] Remove MMOs during tail merge to preserve dependencies. When tail merging it may be necessary to remove MMOs from memory operations to ensure later passes (e.g., MI sched) conservatively compute dependencies. Currently, we only remove the MMO from the common tail if the MMO doesn't match with the relative instruction in the non-common tail(s). A more robust solution would be to add multiple MMOs from the duplicate MIs to the new MI. Currently ScheduleDAGInstrs.cpp ignores all MMOs on instructions with multiple MMOs, so this solution is equivalent for the time being. No test case included as this is incredibly difficult to reproduce. Patch was a collaborative effort between Ana Pazos and myself. Phabricator: http://reviews.llvm.org/D7769 llvm-svn: 231799	2015-03-10 16:22:52 +00:00
Tom Stellard	a3238a003f	R600/SI: Add _IDXEN and _BOTHEN variants for buffer_store llvm-svn: 231798	2015-03-10 16:16:51 +00:00
Tom Stellard	78a9d058b0	R600/SI: Re-order MUBUF operands to match asm strings. llvm-svn: 231797	2015-03-10 16:16:49 +00:00
Tom Stellard	f486efe507	R600/SI: Move kill flag to second instruction when splitting SMRD This fixes a machine verifier error in the salu-to-valu.ll, which would have been exposed by a future commit. llvm-svn: 231796	2015-03-10 16:16:48 +00:00
Tom Stellard	af46d08311	R600/SI: Add 32-bit encoding of v_cndmask_b32 This was done by refactoring the v_cndmask_b32 tablegen definition to inherit from VOP2Inst. llvm-svn: 231795	2015-03-10 16:16:44 +00:00
Sanjay Patel	5c62e16cdb	[X86, AVX] replace vinsertf128 intrinsics with generic shuffles We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is the sibling patch for the Clang half of this change: http://reviews.llvm.org/D8088 Differential Revision: http://reviews.llvm.org/D8086 llvm-svn: 231794	2015-03-10 16:08:36 +00:00
Benjamin Kramer	f65c49c935	Hexagon: Remove pass that does nothing at all llvm-svn: 231791	2015-03-10 15:06:38 +00:00
Rafael Espindola	8bcb85a7ac	Remove effectively dead code. Switching back and forth between sections does nothing (other than producing larger .s files). llvm-svn: 231790	2015-03-10 14:48:01 +00:00
Karthik Bhat	cc36bd3062	Fix a memory corruption in Dependency Analysis. This crash occurs due to memory corruption when trying to update dependency direction based on Constraints. This crash was observed during lnt regression of Polybench benchmark test case dynprog. Review: http://reviews.llvm.org/D8059 llvm-svn: 231788	2015-03-10 14:32:02 +00:00
Rafael Espindola	fcc1484a5d	Don't repeat names and clang-format this file. llvm-svn: 231786	2015-03-10 13:56:44 +00:00
Aaron Ballman	12f86c66e5	Removing dead code to silence warning C4060: switch statement contains no 'case' or 'default' labels; NFC. llvm-svn: 231785	2015-03-10 13:56:28 +00:00
Karthik Bhat	7b21c02d1e	Fix a crash in Dependency Analysis. This crash in Dependency analysis is because we assume here that in case of UsefulGEP both source and destination have the same number of operands which may not be true. This incorrect assumption results in crash while populating Pairs. Fix the same. This crash was observed during lnt regression for code such as- struct s{ int A[10][10]; int C[10][10][10]; } S; void dep_constraint_crash_test(int k,int N) { for( int i=0;i<N;i++) for( int j=0;j<N;j++) S.A[0][0] = S.C[0][0][k]; } Review: http://reviews.llvm.org/D8162 llvm-svn: 231784	2015-03-10 13:31:03 +00:00
Daniel Sanders	b0ddb75c07	The operand flag word used in ISD::INLINEASM is an i32 not a pointer. NFC. Summary: This is part of the work to support memory constraints that behave differently to 'm'. The subsequent patches will expand on the existing encoding (which is a 32-bit int) and as a result some flag words will no longer fit into an i16. This problem only affected the MSP430 target which appears to have 16-bit pointers. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D8168 llvm-svn: 231783	2015-03-10 10:42:59 +00:00
Yaron Keren	39596543fc	Teach raw_ostream to accept SmallString. Saves adding .str() call to any raw_ostream << SmallString usage and a small step towards making .str() consistent in the ADTs by removing one of the SmallString::str() use cases, discussion at http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20141013/240026.html I'll update the Phabricator patch http://reviews.llvm.org/D6372 for review of the Twine SmallString support, it's more complex than this one. llvm-svn: 231763	2015-03-10 07:33:23 +00:00
Owen Anderson	fa465e39c1	Fix a crash in InstCombine where we could try to truncate a switch comparison to zero width. llvm-svn: 231761	2015-03-10 06:51:39 +00:00
Owen Anderson	a93b443224	Fix a stack overflow in the assembler when checking that GEPs must be over sized types. We failed to use a marking set to properly handle recursive types, which caused us to recurse infinitely and eventually overflow the stack. llvm-svn: 231760	2015-03-10 06:34:57 +00:00
Owen Anderson	b10fb77aca	Fix an issue in the verifier where we could try to read information out of a malformed statepoint intrinsic. In this situation we would always have already flagged an error on the statepoint intrinsic, but then we carry on to parse other, related GC intrinsics, and could end up crashing during that verification when they try to access data from the malformed statepoint. llvm-svn: 231759	2015-03-10 05:58:21 +00:00
Owen Anderson	c22a907f78	Fix an infinite loop in InstCombine when an instruction with no users and side effects can be constant folded. ReplaceInstUsesWith needs to return nullptr when the input has no users, because in that case it does not mutate the program. Otherwise, we can get stuck in an infinite loop of repeatedly attempting to constant fold an instruction with no users. llvm-svn: 231755	2015-03-10 05:13:47 +00:00
Rafael Espindola	cd36ff5123	Move variable into assert to fix -Asserts builds. llvm-svn: 231753	2015-03-10 04:28:09 +00:00
Rafael Espindola	b021663a47	Remove incredibly confusing isBaseAddressKnownZero. When referring to a symbol in a dwarf section on ELF we should use .long foo instead of .long foo - .debug_something because ELF is unaware of the content of the sections and therefore needs relocations. This has nothing to do with optimizing a -0. llvm-svn: 231751	2015-03-10 04:11:52 +00:00
Rafael Espindola	9f3b858c2b	Use a better name for compile unit labels. They mark the start of a compile unit, so name them .Lcu_*. Using Section->getLabelBeginName() makes it look like they mark the start of the section. While at it, switch to createTempSymbol to avoid collisions with labels created in inline assembly. Not sure if a "don't crash" test is worth it. With this getLabelBeginName is dead, delete it. llvm-svn: 231750	2015-03-10 03:58:36 +00:00
Sanjay Patel	f6590d6612	removed function names from comments; NFC llvm-svn: 231749	2015-03-10 03:48:14 +00:00
Frederic Riss	1f956a6735	DwarfAccelTable: remove unneeded bucket terminators. Last commit fixed the handling of hash collisions, but it introduced unneeded bucket terminators in some places. The generated table was correct, it can just be a tiny bit smaller. As the previous table was correct, the test doesn't need updating. If we really wanted to test this, I could add the section size to the dwarf dump and test for a precise value there. IMO the correctness test is sufficient. llvm-svn: 231748	2015-03-10 03:47:55 +00:00
Sanjay Patel	b09f1ac09a	use range-based for loops; NFC llvm-svn: 231747	2015-03-10 03:26:39 +00:00
Craig Topper	605895c7e7	Improve and simplify EnforceSmallerThan for vector types. Explicitly compare the size of the scalar types and the whole vector size rather than just comparing enum encodings. llvm-svn: 231746	2015-03-10 03:25:07 +00:00
Craig Topper	38f7090cef	Remove extra indentation of entire function body. NFC. llvm-svn: 231745	2015-03-10 03:25:04 +00:00
Rafael Espindola	240f1f5b6b	Move label creation close to emission. NFC. llvm-svn: 231744	2015-03-10 03:11:11 +00:00
George Burgess IV	939966a7bc	Added ConstantExpr support to CFLAA. CFLAA didn't know how to properly handle ConstantExprs; it would silently ignore them. This was a problem if the ConstantExpr is, say, a GEP of a global, because CFLAA wouldn't realize that there's a global there. :) llvm-svn: 231743	2015-03-10 02:58:15 +00:00
George Burgess IV	27ecf15e8f	Added special handling for inttoptr in CFLAA. We now treat pointers given to ptrtoint and pointers retrieved from inttoptr as similar to arguments or globals (can alias anything, etc.) This solves some of the problems we were having with giving incorrect results. llvm-svn: 231741	2015-03-10 02:40:06 +00:00
Mehdi Amini	f88efe5f8a	DataLayout is mandatory, update the API to reflect it with references. Summary: Now that the DataLayout is a mandatory part of the module, let's start cleaning the codebase. This patch is a first attempt at doing that. This patch is not exactly NFC as for instance some places were passing a nullptr instead of the DataLayout, possibly just because there was a default value on the DataLayout argument to many functions in the API. Even though it is not purely NFC, there is no change in the validation. I turned as many pointers to DataLayout into references as possible, which helped figure out all the places where a nullptr could come up. I initially had a local version of this patch broken into over 30 independent commits, but some later commits were cleaning the API and touching part of the code modified in the previous commits, so it seemed cleaner without the intermediate state. Test Plan: Reviewers: echristo Subscribers: llvm-commits From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231740	2015-03-10 02:37:25 +00:00
Kostya Serebryany	1c5044f9e7	[sanitizer] fix instrumentation with -mllvm -sanitizer-coverage-block-threshold=0 to actually do something useful. llvm-svn: 231736	2015-03-10 01:58:27 +00:00
Kostya Serebryany	a4f5e50c70	[sanitizer] decrease sanitizer-coverage-block-threshold from 1000 to 500 as another horrible workaround for PR17409 llvm-svn: 231733	2015-03-10 01:11:53 +00:00
Frederic Riss	9c6539e46b	DwarfAccelTable: Fix handling of hash collisions. It turns out accelerator tables were totally broken if they contained entries with colliding hashes. The failure mode is pretty bad, as it not only impacted the colliding entries, but would basically make all the entries after the first hash collision point to the wrong place. The testcase uses the symbol names that were found to collide during a clang build. From a performance point of view, the patch adds a sort and a linear walk over each bucket's contents. While it has a measurable impact on the accelerator table emission, it's not showing up significantly in clang profiles (and I'd argue that correctness is priceless :-)). llvm-svn: 231732	2015-03-10 00:46:31 +00:00
Eric Christopher	0c42dfb65e	Temporarily revert r231726 and r231724 as they're breaking the build.: Author: Lang Hames <lhames@gmail.com> Date: Mon Mar 9 23:51:09 2015 +0000 [Orc][MCJIT][RuntimeDyld] Add header that was accidentally left out of r231724. Author: Lang Hames <lhames@gmail.com> Date: Mon Mar 9 23:44:13 2015 +0000 [Orc][MCJIT][RuntimeDyld] Add symbol flags to symbols in RuntimeDyld. Thread the new types through MCJIT and Orc. In particular, add a 'weak' flag. When plumbed through RTDyldMemoryManager, this will allow us to distinguish between weak and strong definitions and find the right ones during symbol resolution. llvm-svn: 231731	2015-03-10 00:33:27 +00:00
Eric Christopher	bf1458c86c	Remove an unused variable. llvm-svn: 231730	2015-03-10 00:33:22 +00:00
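
The reproducer quoted in commit 7b21c02d1e (llvm-svn: 231784) is flattened onto a single line in the log. Laid out as a C file it reads roughly as follows. The code is taken from the commit message; the formatting and the comments are added here, and the remark about operand counts is an interpretation of the commit text rather than something checked against the patch.

```c
/* Reproducer from commit 7b21c02d1e; layout and comments added here. */
struct s {
  int A[10][10];
  int C[10][10][10];
} S;

/* The store subscripts two array levels of S.A while the load subscripts
   three levels of S.C. Presumably this is what gives the source and
   destination GEPs different operand counts (an assumption based on the
   commit message). */
void dep_constraint_crash_test(int k, int N) {
  for (int i = 0; i < N; i++)
    for (int j = 0; j < N; j++)
      S.A[0][0] = S.C[0][0][k];
}
```

Compiling this file is harmless on its own; per the commit message, the crash occurred in Dependency Analysis during an lnt regression run on code of this shape.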

114642 Commits