llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Quentin Colombet	b7df98faee	[CodeGenPrepare] Refine the cost model provided by the promotion helper. - Use TargetLowering to check for the actual cost of each extension. - Provide a factorized method to check for the cost of an extension: TargetLowering::isExtFree. - Provide a virtual method TargetLowering::isExtFreeImpl for targets to be able to tune the cost of non-free extensions. This refactoring offers a better granularity to model what really happens on different targets. No performance changes and very few code differences. Part of <rdar://problem/19267165> llvm-svn: 231855	2015-03-10 21:48:15 +00:00
Adam Nemet	008aa6afa3	[LoopAccesses] Add debug message to indicate the result of the analysis The debug message was pretty confusing here. It only reported the situation with memchecks without the result of the dependence analysis. Now it prints whether the loop is safe from the POV of the dependence analysis and if yes, whether we need memchecks. llvm-svn: 231854	2015-03-10 21:47:39 +00:00
Rafael Espindola	ca2a87b2b9	Move a non-trivial virtual function out of line. llvm-svn: 231853	2015-03-10 21:35:16 +00:00
Colin LeMahieu	e90f8c6eac	[Hexagon] Adding frame index + add load/store patterns. llvm-svn: 231850	2015-03-10 21:24:13 +00:00
Rafael Espindola	525d4a858f	clang-format code that is about to change. llvm-svn: 231848	2015-03-10 21:16:18 +00:00
Colin LeMahieu	be732782a1	[Hexagon] Simplifying deallocret definitions. llvm-svn: 231847	2015-03-10 21:12:32 +00:00
Rafael Espindola	c62c766d41	clang-format these declarations. NFC. llvm-svn: 231846	2015-03-10 21:05:09 +00:00
Rafael Espindola	be51bac53f	Don't repeat names in comments. NFC. llvm-svn: 231845	2015-03-10 21:01:50 +00:00
Colin LeMahieu	6e54f9236b	[Hexagon] Separating InstHexagon from OpcodeHexagon. llvm-svn: 231844	2015-03-10 20:56:22 +00:00
Nemanja Ivanovic	54958e2a25	Add support for part-word atomics for PPC http://reviews.llvm.org/D8090#inline-67337 llvm-svn: 231843	2015-03-10 20:51:07 +00:00
Chris Bieneman	d24ea211dc	Add new LLVM_OPTIMIZED_TABLEGEN build setting which configures, builds and uses a release tablegen build when LLVM is configured with assertions enabled. Summary: This change leverages the cross-compiling functionality in the build system to build a release tablegen executable for use during the build. Reviewers: resistor, rnk Reviewed By: rnk Subscribers: rnk, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D7349 llvm-svn: 231842	2015-03-10 20:48:02 +00:00
Ahmed Bougacha	faad462651	[AArch64] Avoid going through GPRs for across-vector instructions. This adds new node types for each intrinsic. For instance, for addv, we have AArch64ISD::UADDV, such that: (v4i32 (uaddv ...)) is the same as (v4i32 (scalar_to_vector (i32 (int_aarch64_neon_uaddv ...)))) that is, (v4i32 (INSERT_SUBREG (v4i32 (IMPLICIT_DEF)), (i32 (int_aarch64_neon_uaddv ...)), ssub) In a combine, we transform all such across-vector-lanes intrinsics to: (i32 (extract_vector_elt (uaddv ...), 0)) This has one big advantage: by making the extract_element explicit, we enable the existing patterns for lane-aware instructions to fire. This lets us avoid needlessly going through the GPRs. Consider: uint32x4_t test_mul(uint32x4_t a, uint32x4_t b) { return vmulq_n_u32(a, vaddvq_u32(b)); } We now generate: addv.4s s1, v1 mul.4s v0, v0, v1[0] instead of the previous: addv.4s s1, v1 fmov w8, s1 dup.4s v1, w8 mul.4s v0, v1, v0 rdar://20044838 llvm-svn: 231840	2015-03-10 20:45:38 +00:00
Ahmed Bougacha	b20ce2fe64	[AArch64] Remove integer INSvi*lane patterns. NFCI. Most are redundant, and they never seem to fire. The V128 integer patterns already exist in the INS multiclass. The duplicates only fire when the vector index type isn't i64, because they accept "imm" instead of an explicit "i64", as the instruction definition patterns do. TLI::getVectorIdxTy is i64 on AArch64, so this should never happen. Also, one of them had a typo: for i64, INSvi32lane was used. I noticed because I mistakenly used an explicit i32 as the idx type, and got ins.s for an i64 vector_insert. The V64 patterns also don't seem to ever fire, as V64 vector extract/insert are legalized to V128. The equivalent float patterns are unique and useful, so keep them. No functional change intended; none exhibited on the LIT and LNT tests. llvm-svn: 231838	2015-03-10 20:37:19 +00:00
Chad Rosier	dad3b0ec71	Don't evaluate rend() on every iteration of the loop. llvm-svn: 231837	2015-03-10 20:29:59 +00:00
David Majnemer	63f263f727	LoopAccessAnalysis: Silence -Wreturn-type diagnostic from GCC llvm-svn: 231836	2015-03-10 20:23:29 +00:00
Benjamin Kramer	fd94ea49f9	Don't use LLVM_LIBRARY_VISIBILITY in cpp files. llvm-svn: 231831	2015-03-10 20:07:44 +00:00
Bruno Cardoso Lopes	e8b8714ddf	[AsmPrinter][TLOF] Reintroduce AArch64 test Follow up from r231505. Fix the non-determinism by using a MapVector and reintroduce the AArch64 testcase. Defer deleting the got candidates up to the end and remove them in a bulk, avoiding linear time removal of each element. Thanks to Renato Golin for trying it out on other platforms. llvm-svn: 231830	2015-03-10 20:05:23 +00:00
Colin LeMahieu	0b62318f64	[Hexagon] Adding nodes for PIC support. llvm-svn: 231829	2015-03-10 20:04:44 +00:00
Colin LeMahieu	4fadccc929	[Hexagon] Adding DuplexInst instruction format and duplex class defs. llvm-svn: 231828	2015-03-10 19:53:14 +00:00
Kit Barton	f514e1c5fc	Change the generation of the vmuluwm instruction to be based on the MUL opcode. Phabricator review: http://reviews.llvm.org/D8185 llvm-svn: 231827	2015-03-10 19:49:38 +00:00
Sanjay Patel	e4ff5d4d5c	remove function names from comments; NFC llvm-svn: 231826	2015-03-10 19:42:57 +00:00
Colin LeMahieu	e0c4dcc7b6	[Hexagon] Adding nodes for vector insert/extract lowering. llvm-svn: 231825	2015-03-10 19:40:03 +00:00
Colin LeMahieu	aa7e3b58f5	[Hexagon] Renaming HexagonJT to JT and adding CP for constantpool. llvm-svn: 231824	2015-03-10 19:29:53 +00:00
Adrian Prantl	aa7fb527c3	Change the datatype of DwarfExpression::Emit(Un)Signed to (u)int64_t so it matches the one used by ByteStreamer::Emit(U\|S)LEB128. llvm-svn: 231823	2015-03-10 19:23:37 +00:00
Benjamin Kramer	5b70e6e9c6	NVPTX: move NVPTXAllocaHoisting into the cpp file Also initialize without using static initialization. llvm-svn: 231822	2015-03-10 19:20:52 +00:00
Adam Nemet	c3652c7273	[LAA-memchecks] Comment improvement I forgot to roll this into r231816. It was requested by Hal in D8122. llvm-svn: 231821	2015-03-10 19:12:41 +00:00
Michael Zolotukhin	5abcdaa7c0	Enable loop-rotate before loop-vectorize by default llvm-svn: 231820	2015-03-10 19:07:41 +00:00
Adam Nemet	9315b99bd9	[LAA-memchecks 3/3] Introduce pointer partitions for memchecks This is the final patch that actually introduces the new parameter of partition mapping to RuntimePointerCheck::needsChecking. Another API (LAI::getInstructionsForAccess) is also exposed that helps to map pointers to instructions because ultimately we partition instructions. The WIP version of the Loop Distribution pass in D6930 has been adapted to use all this. See for example, how InstrPartitionContainer::computePartitionSetForPointers sets up the partitions using the above API and then calls to LAI::addRuntimeCheck with the pointer partitions. llvm-svn: 231818	2015-03-10 18:54:26 +00:00
Adam Nemet	5ee2447b48	[LAA-memchecks 2/3] Move number of memcheck threshold checking to LV Now the analysis won't "fail" if the memchecks exceed the threshold. It is the transform pass' responsibility to perform the check. This allows the transform pass to further analyze/eliminate the memchecks. E.g. in Loop distribution we only need to check pointers that end up in different partitions. Note that there is a slight change of functionality here. The logic in analyzeLoop is that if dependence checking fails due to non-constant distance between the pointers, another attempt is made to prove safety of the dependences purely using run-time checks. Before this patch we could fail the loop due to exceeding the memcheck threshold after the first step, now we only check the threshold in the client after the full analysis. There is no measurable compile-time effect but I wanted to record this here. llvm-svn: 231817	2015-03-10 18:54:23 +00:00
Adam Nemet	37dc13d5c0	[LAA-memchecks 1/3] Split out NumComparisons checks. NFC The check for the number of memchecks will be moved to the client of this analysis. Besides allowing for transform-specific thresholds, this also lets Loop Distribution post-process the memchecks; Loop Distribution only needs memchecks between pointers of different partitions. The motivation for this first patch is to untangle the CanDoRT check from the NumComparison check before moving the NumComparison part. CanDoRT means that we couldn't determine the bounds for the pointer. Note that NumComparison is set independent of this flag. llvm-svn: 231816	2015-03-10 18:54:19 +00:00
Sanjay Patel	1a55477781	remove names from comments; NFC llvm-svn: 231813	2015-03-10 18:41:22 +00:00
Sanjay Patel	25d06d29cd	fix typos; NFC llvm-svn: 231812	2015-03-10 18:37:05 +00:00
Benjamin Kramer	8a6a7bc837	NVPTX: Remove copy of LLVMInitializeNVPTXAsmPrinter. If anyone is using this for some strange reason, LLVMInitializeNVPTXAsmPrinter does exactly the same thing and is what other LLVM tools are calling. llvm-svn: 231810	2015-03-10 18:19:24 +00:00
Benjamin Kramer	b12c209269	Hexagon: Remove unused InstrMapping. llvm-svn: 231809	2015-03-10 18:19:16 +00:00
Adam Nemet	4024c4c865	[LoopAccesses 3/3] Print the dependences with -analyze The dependences are now expose through the new getInterestingDependences API so we can use that with -analyze too and fix the FIXME. This lets us remove the test that relied on -debug to check the dependences. llvm-svn: 231807	2015-03-10 17:40:43 +00:00
Adam Nemet	78939ba308	[LoopAccesses 2/3] Allow querying of interesting dependences Gather an array of interesting dependences rather than just failing after the first unsafe one and regarding the loop unsafe. Loop Distribution needs to be able to collect all dependences in order to isolate the dependence cycles into their own partition. Since the dependence checking algorithm is quadratic in terms of accesses sharing the same underlying pointer, I am applying a cut-off threshold (MaxInterestingDependence). Exceeding that, the logic reverts back to the original approach deeming the loop unsafe upon encountering the first unsafe dependence. The main idea of the patch is to split isDepedent from directly answering the question whether the dep is safe for vectorization to return a dependence type which then gets mapped to old boolean result using Dependence::isSafeForVectorization. Tested that this was compile-time neutral on SpecINT2006 LTO bitcode inputs. No assembly change on the testsuite including external. llvm-svn: 231806	2015-03-10 17:40:37 +00:00
Adam Nemet	2084b90437	[LoopAccesses 1/3] Expose MemoryDepChecker to LAA users LoopDistribution needs to query various results of the dependence analysis. This series will expose some more APIs and state of the dependence checker. This patch is a simple one to just expose the DepChecker instance. The set is compile-time neutral measured with LTO bitcode files of SpecINT2006. Also there is no assembly change on the testsuite. llvm-svn: 231805	2015-03-10 17:40:34 +00:00
Rafael Espindola	074f6e2b72	Store an optional section start label in MCSection. This makes code that uses section relative expressions (debug info) simpler and less brittle. This is still a bit awkward as the symbol is created late and has to be stored in a mutable field. I will move the symbol creation earlier in the next patch. llvm-svn: 231802	2015-03-10 16:58:10 +00:00
Sanjay Patel	83c4b90c27	remove function names from comments; NFC llvm-svn: 231801	2015-03-10 16:42:24 +00:00
Igor Laevsky	ec23b1a840	Teach lowering to correctly handle invoke statepoint and gc results tied to them. Note that we still can not lower gc.relocates for invoke statepoints. Also it extracts getCopyFromRegs helper function in SelectionDAGBuilder as we need to be able to customize type of the register exported from basic block during lowering of the gc.result. (Resubmitting this change after not being able to reproduce buildbot failure) Differential Revision: http://reviews.llvm.org/D7760 llvm-svn: 231800	2015-03-10 16:26:48 +00:00
Chad Rosier	edcfbac252	[BranchFolding] Remove MMOs during tail merge to preserve dependencies. When tail merging it may be necessary to remove MMOs from memory operations to ensures later passes (e.g., MI sched) conservatively compute dependencies. Currently, we only remove the MMO from the common tail if the MMO doesn't match with the relative instruction in the non-common tail(s). A more robust solution would be to add multiple MMOs from the duplicate MIs to the new MI. Currently ScheduleDAGInstrs.cpp ignores all MMOs on instructions with multiple MMOs, so this solution is equivalent for the time being. No test case included as this is incredibly difficult to reproduce. Patch was a collaborative effort between Ana Pazos and myself. Phabricator: http://reviews.llvm.org/D7769 llvm-svn: 231799	2015-03-10 16:22:52 +00:00
Tom Stellard	a3238a003f	R600/SI: Add _IDXEN and _BOTHEN variants for buffer_store llvm-svn: 231798	2015-03-10 16:16:51 +00:00
Tom Stellard	78a9d058b0	R600/SI: Re-order MUBUF operands to match asm strings. llvm-svn: 231797	2015-03-10 16:16:49 +00:00
Tom Stellard	f486efe507	R600/SI: Move kill flag to second instruction when splitting SMRD This fixes a machine verifier error in the salu-to-valu.ll, which would have been exposed by a future commit. llvm-svn: 231796	2015-03-10 16:16:48 +00:00
Tom Stellard	af46d08311	R600/SI: Add 32-bit encoding of v_cndmask_b32 This was done by refactoring the v_cndmask_b32 tablegen definition to use inherit from VOP2Inst. llvm-svn: 231795	2015-03-10 16:16:44 +00:00
Sanjay Patel	5c62e16cdb	[X86, AVX] replace vinsertf128 intrinsics with generic shuffles We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is the sibling patch for the Clang half of this change: http://reviews.llvm.org/D8088 Differential Revision: http://reviews.llvm.org/D8086 llvm-svn: 231794	2015-03-10 16:08:36 +00:00
Benjamin Kramer	f65c49c935	Hexagon: Remove pass that does nothing at all llvm-svn: 231791	2015-03-10 15:06:38 +00:00
Rafael Espindola	8bcb85a7ac	Remove effectively dead code. Switching back and forth between sections does nothing (other than producing larger .s files). llvm-svn: 231790	2015-03-10 14:48:01 +00:00
Karthik Bhat	cc36bd3062	Fix a memory corruption in Dependency Analysis. This crash occurs due to memory corruption when trying to update dependency direction based on Constraints. This crash was observed during lnt regression of Polybench benchmark test case dynprog. Review: http://reviews.llvm.org/D8059 llvm-svn: 231788	2015-03-10 14:32:02 +00:00
Rafael Espindola	fcc1484a5d	Don't repeat names and clang-format this file. llvm-svn: 231786	2015-03-10 13:56:44 +00:00

1 2 3 4 5 ...

114617 Commits