llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 13:33:37 +02:00

Author	SHA1	Message	Date
Chandler Carruth	93fcae17a9	[x86] Merge still more combine tests into the common file. These at least seem slightly more interesting test wise, although given how spotily we actually combine anything, I remain somewhat suspicious. llvm-svn: 218861	2014-10-02 08:02:34 +00:00
Chandler Carruth	b565b0f62c	[x86] Merge the third combining test into the generic one and add proper checks for all the ISA variants. If the SSE2 checks here terrify you, good. This is (in large part) the kind of amazingly bad code that is holding LLVM back when vectorizing on older ISAs. At the same time, these tests seem increasingly dubious to me. There are a very large number of tests and it isn't clear that they are systematically covering a specific set of functionality. Anyways, I don't want to reduce testing during the transition, I just want to consolidate it to where it is easier to manage. llvm-svn: 218860	2014-10-02 07:56:47 +00:00
Chandler Carruth	6d374cbfb0	[x86] Merge the second set of vector combining tests into a common test file. Some of these really don't make sense to test -- we're testing for the lack of combining two shuffles into one, presumably because the two would generate better shuffles in the end. But if you look at the generated code shown here, in many cases the generated code is, frankly, terrible. Or we combine any two generated shuffles back into a single instruction! I've left a FIXME to revisit these decisions. llvm-svn: 218859	2014-10-02 07:42:58 +00:00
Chandler Carruth	b89415b01c	[x86] Merge the bitwise operation shuffle combining into the common test file, adding assertions across the ISA variants for it. llvm-svn: 218858	2014-10-02 07:30:24 +00:00
Chandler Carruth	16ffa749c8	[x86] Update this test to run a full complement of the ISA extensions, and use the new grouped FileCheck patterns to match them. No interesting changes yet, but this test is now in proper form to have the other shuffle combining tests merged into it. llvm-svn: 218857	2014-10-02 07:22:26 +00:00
Chandler Carruth	d38853bc15	[x86] Minimize the parameters to this test for clarity. The test has to do with DAG combines, and so it doesn't need the new vector shuffle lowering to be effective. Also, it has a nice in-IR triple string which we should really be using rather than command line flags (unless it varies form RUN-line to RUN-line). Finally, I much prefer letting LLVM synthesize the correct datalayout string from the triple rather than baking one in here that will just become stale. llvm-svn: 218856	2014-10-02 07:17:15 +00:00
Chandler Carruth	8fff345b1f	[x86] Add a comment clarifying that this test should span all manners of generic DAG combining of shuffles relevant to x86. My plan is to fold a bunch of the other DAG combining test cases into this one, while converting them to use the nice new FileCheck assertion syntax. llvm-svn: 218855	2014-10-02 07:13:25 +00:00
Chandler Carruth	a53845cb08	[x86] Switch some of the new consolidated vector tests to use a bare-metal triple and have nice BB labels, etc. No significant change here, just tidying up to have a consistent set of OS-agnostic vector functionality here. llvm-svn: 218854	2014-10-02 06:52:19 +00:00
Lang Hames	ea94736de4	[PBQP] Update doxygen comment style to match the rest of the file. NFC. llvm-svn: 218849	2014-10-02 04:21:27 +00:00
Lang Hames	1bd1ce6089	[PBQP] Add support for graph-level metadata to the PBQP graph. This will be used in the future to attach useful information about the PBQP graph (e.g. the associated MachineFunction, pointers to regalloc passes) to the graph itself, making that information accessible to the solver. This should also allow the PBQPBuilder interface to be simplified. llvm-svn: 218848	2014-10-02 04:17:36 +00:00
Eric Christopher	53beacc954	Remove test directories with no tests. llvm-svn: 218843	2014-10-02 00:42:30 +00:00
Justin Bogner	852141fdbe	InstrProf: Simplify counting a file's regions when writing coverage (NFC) When writing a coverage mapping we iterate through the mapping regions in order of FileID, but we were then repeatedly searching from the beginning of the list to count the number of regions with a given FileID. It is simpler and more efficient to search forward from the current iterator to find the number of regions. llvm-svn: 218842	2014-10-02 00:31:00 +00:00
Chandler Carruth	3f933250d5	[x86] Improve and correct how the new vector shuffle lowering was matching and lowering 64-bit insertions. The first problem was that we weren't looking through bitcasts to discover that we could lower as insertions. Once fixed, we in turn weren't looking through bitcasts to discover that we could fold a load into the lowering. Once fixed, we weren't forming a SCALAR_TO_VECTOR node around the inserted element and instead were passing a scalar to a DAG node that expected a vector. It turns out there are some patterns that will "lower" this into the correct asm, but the rest of the X86 backend is very unhappy with such antics. This should fix a few more edge case regressions I've spotted going through the regression test suite to enable the new vector shuffle lowering. llvm-svn: 218839	2014-10-01 23:14:28 +00:00
Bob Wilson	84f3fb3ea5	PR21101: tablegen's FastISel emitter should filter out unused functions. FastISel has a fixed set of virtual functions that are overridden by the tablegen-generated code for each target. These functions are distinguished by the kinds of operands, e.g., register + immediate = "ri". The FastISel emitter has been blindly emitting functions with different combinations of operand kinds, even for combinations that are completely unused by FastISel, e.g., "fastEmit_rrr". Change to filter out functions that will be irrelevant for FastISel and do not bother generating the code for them. Also add explicit "override" keywords for the virtual functions that are overridden. llvm-svn: 218838	2014-10-01 22:44:01 +00:00
Lang Hames	3442624858	[MCJIT] Don't crash in debugging output for sections that aren't emitted. llvm-svn: 218836	2014-10-01 21:57:47 +00:00
Eric Christopher	3eb7c19a39	constify the TargetMachine argument used in the subtarget and lowering constructors. llvm-svn: 218832	2014-10-01 21:36:28 +00:00
Duncan P. N. Exon Smith	5f537dacd8	DIBuilder: Remove duplicated comments, NFC These comments already appear in the header, and some of them are out-of-date anyway. llvm-svn: 218829	2014-10-01 21:32:15 +00:00
Duncan P. N. Exon Smith	436b4ca3b4	Revert "DIBuilder: Remove dead code" This reverts commit r218820. It turns out that Adrian has an outstanding SROA patch that uses this. I've updated it to forward to `createExpression()`. llvm-svn: 218828	2014-10-01 21:32:12 +00:00
Sanjay Patel	ae0d32c510	Lower FNEG ( FABS (x) ) -> FNABS (x) [X86 codegen] PR20578 Negative FABS of either a scalar or vector should be handled the same way on x86 with SSE/AVX: a single OR instruction of the FP operand with a constant to light up the sign bit(s). http://llvm.org/bugs/show_bug.cgi?id=20578 Differential Revision: http://reviews.llvm.org/D5201 llvm-svn: 218822	2014-10-01 21:20:06 +00:00
David Blaikie	abe7833c2f	Update test name to match changes made in r218783 Addressing post commit review feedback from Justin Bogner. llvm-svn: 218821	2014-10-01 21:19:39 +00:00
Duncan P. N. Exon Smith	e70bbe654f	DIBuilder: Remove dead code I neglected to update `DIBuilder::createPieceExpression()` in r218797, which I noticed while rebasing a patch for PR17891. On closer inspection, it looks like dead code. If there are any downstream users of this, you should transition to the more general `createExpression()`. Or, we can add this back, but then it should just forward to `createExpression()`. llvm-svn: 218820	2014-10-01 21:14:20 +00:00
Chandler Carruth	efba997ca5	[x86] Merge the remaining test cases into vector-blend.ll and remove all the ISA-specific test files. llvm-svn: 218818	2014-10-01 21:07:07 +00:00
Eric Christopher	bce38d60f8	Now that the optimization level is adjusting the feature string before we hit the subtarget, remove the constructor parameter. llvm-svn: 218817	2014-10-01 21:05:35 +00:00
Chandler Carruth	17a5f1f40d	[x86] Expand the ISA coverage of our blend test in preparation for merging ISA-specific testing into this file. llvm-svn: 218816	2014-10-01 21:03:21 +00:00
Argyrios Kyrtzidis	8575c06fb9	Adds 'override' to overriding methods. NFC. llvm-svn: 218815	2014-10-01 21:00:44 +00:00
Chandler Carruth	54cba461be	[x86] Merge the interesting test cases from blend-msb.ll into vector-blend.ll and remove the former. llvm-svn: 218814	2014-10-01 20:56:57 +00:00
Chandler Carruth	eb0ef09cba	[x86] Move the AVX blend test to a generic name. I'm going to fold other blend tests into this one. llvm-svn: 218813	2014-10-01 20:52:55 +00:00
Chandler Carruth	fe43286675	[x86] Remove a test that wasn't doing anything really. We have plenty of better tests for zext of vectors at this point. llvm-svn: 218811	2014-10-01 20:50:58 +00:00
Chandler Carruth	1757cc6b66	[x86] Add a 32-bit run to the sext test, and remove a sad vec_sext.ll test file. This old test had a bunch of functions that were never even checked. =/ The only thing it really did was to make sure that we did something reasonable in 32-bit mode with SSE4.1. Adding another run line to the main vector-sext.ll test seems a better way to do that. llvm-svn: 218810	2014-10-01 20:49:54 +00:00
Chandler Carruth	d9dbba2329	[x86] Teach both sext and zext vector tests to cover a nice wide range of architectures: SSE2, SSSE3, SSE4.1, AVX, and AVX2. Unfortunately, this exposses the absolute horror of the code we generate for many of these patterns. Anyone wanting to familiarize themselves with the x86 backend and improve performance could do a lot of good sitting down and making these test cases not look so terrible. While the new vector shuffle code I'm working on well help some, it won't fix all of the crimes here. llvm-svn: 218807	2014-10-01 20:41:36 +00:00
Eric Christopher	4ce55b7a5c	Rework the PPC TargetMachine so that the non-function specific overrides happen at TargetMachine creation and not on every subtarget creation. llvm-svn: 218805	2014-10-01 20:38:26 +00:00
Eric Christopher	5245670b4d	constify TargetMachine parameter for X86TargetLowering. llvm-svn: 218804	2014-10-01 20:38:22 +00:00
Sanjay Patel	790c02f096	Make the sqrt intrinsic return undef for a negative input. As discussed here: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140609/220598.html And again here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-September/077168.html The sqrt of a negative number when using the llvm intrinsic is undefined. We should return undef rather than 0.0 to match the definition in the LLVM IR lang ref. This change should not affect any code that isn't using "no-nans-fp-math"; ie, no-nans is a requirement for generating the llvm intrinsic in place of a sqrt function call. Unfortunately, the behavior introduced by this patch will not match current gcc, xlc, icc, and possibly other compilers. The current clang/llvm behavior of returning 0.0 doesn't either. We knowingly approve of this difference with the other compilers in an attempt to flag code that is invoking undefined behavior. A front-end warning should also try to convince the user that the program will fail: http://llvm.org/bugs/show_bug.cgi?id=21093 Differential Revision: http://reviews.llvm.org/D5527 llvm-svn: 218803	2014-10-01 20:36:33 +00:00
Chandler Carruth	0c80c6833d	[x86] Sort the ISA-specific RUN lines for vector-sext.ll to go from oldest to newest. This makes more sense to me and is more consistent with other tests. llvm-svn: 218802	2014-10-01 20:32:44 +00:00
Tim Northover	f58482e671	ARM: yes it can (as of r218789) llvm-svn: 218801	2014-10-01 20:31:58 +00:00
Chandler Carruth	c6c3e58c92	[x86] Rename avx-{s,z}ext.ll to vector-{s,z}ext.ll. These tests are far and away the best sext and zext tests we have for vectors. I'm going to merge the other similar tests into them and expand the ISA coverage. llvm-svn: 218800	2014-10-01 20:30:30 +00:00
Chandler Carruth	fbc2708243	[x86] Cleanup and re-generate the checks for avx-zext.ll using the new script. llvm-svn: 218799	2014-10-01 20:27:16 +00:00
Duncan P. N. Exon Smith	51b31a68a3	DIBuilder: Encapsulate DIExpression's element type `DIExpression`'s elements are 64-bit integers that are stored as `ConstantInt`. The accessors already encapsulate the storage. This commit updates the `DIBuilder` API to also encapsulate that. llvm-svn: 218797	2014-10-01 20:26:08 +00:00
Chandler Carruth	a1f7067666	[x86] Generate the FileCheck assertions for avx-blend.ll with my new script to make them nice and predictable. This will ease updating them for the new vector shuffle lowering and seeing the delta if any. llvm-svn: 218795	2014-10-01 20:19:45 +00:00
Chandler Carruth	f4596dd7b6	[x86] Clean up and generate detailed FileCheck assertions for avx-sext.ll using my new script. Also add an AVX2 mode to this test. Part of cleaning up the test suite before enabling the new vector shuffle lowering. This also highlights some of the abysmal failures of the old shuffle lowering. Check out those 'pinsrw' and 'pextrw' sequences! llvm-svn: 218794	2014-10-01 20:19:32 +00:00
Bruno Cardoso Lopes	c2183ae743	[MemoryDepAnalysis] Fix compile time slowdown - Problem One program takes ~3min to compile under -O2. This happens after a certain function A is inlined ~700 times in a function B, inserting thousands of new BBs. This leads to 80% of the compilation time spent in GVN::processNonLocalLoad and MemoryDependenceAnalysis::getNonLocalPointerDependency, while searching for nonlocal information for basic blocks. Usually, to avoid spending a long time to process nonlocal loads, GVN bails out if it gets more than 100 deps as a result from MD->getNonLocalPointerDependency. However this only happens after all nonlocal information for BBs have been computed, which is the bottleneck in this scenario. For instance, there are 8280 times where getNonLocalPointerDependency returns deps with more than 100 bbs and from those, 600 times it returns more than 1000 blocks. - Solution Bail out early during the nonlocal info computation whenever we reach a specified threshold. This patch proposes a 100 BBs threshold, it also reduces the compile time from 3min to 23s. - Testing The test-suite presented no compile nor execution time regressions. Some numbers from my machine (x86_64 darwin): - 17s under -Oz (which avoids inlining). - 1.3s under -O1. - 2m51s under -O2 ToT *** 23s under -O2 w/ Result.size() > 100 - 1m54s under -O2 w/ Result.size() > 500 With NumResultsLimit = 100, GVN yields the same outcome as in the unlimited 3min version. http://reviews.llvm.org/D5532 rdar://problem/18188041 llvm-svn: 218792	2014-10-01 20:07:13 +00:00
Sanjay Patel	923727403c	Don't repeat function/variable name in comment. NFC. llvm-svn: 218791	2014-10-01 19:39:32 +00:00
Adam Nemet	e0d1a483d8	[X86 disasm tblegen backend] Clean up numPhysicalOperands asserts No functionality change intended. This implements Elena's idea to put the new additionalOperand outside the switch to cover all cases (http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140929/237763.html). Note only nontrivial change is in MRMSrcMemFrm. This requires an inclusive interval of [2, 4] because we have prefix-dependent optional immediate operand. llvm-svn: 218790	2014-10-01 19:28:11 +00:00
Tim Northover	64a066f76a	ARM: allow copying of CPSR when all else fails. As with x86 and AArch64, certain situations can arise where we need to spill CPSR in the middle of a calculation. These should be avoided where possible (MRS/MSR is rather expensive), which ARM is actually better at than the other two since it tries to Glue defs to uses, but as a last ditch effort, copying is better than crashing. rdar://problem/18011155 llvm-svn: 218789	2014-10-01 19:21:03 +00:00
Adrian Prantl	2b1df58ebe	Move the complex address expression out of DIVariable and into an extra argument of the llvm.dbg.declare/llvm.dbg.value intrinsics. Previously, DIVariable was a variable-length field that has an optional reference to a Metadata array consisting of a variable number of complex address expressions. In the case of OpPiece expressions this is wasting a lot of storage in IR, because when an aggregate type is, e.g., SROA'd into all of its n individual members, the IR will contain n copies of the DIVariable, all alike, only differing in the complex address reference at the end. By making the complex address into an extra argument of the dbg.value/dbg.declare intrinsics, all of the pieces can reference the same variable and the complex address expressions can be uniqued across the CU, too. Down the road, this will allow us to move other flags, such as "indirection" out of the DIVariable, too. The new intrinsics look like this: declare void @llvm.dbg.declare(metadata %storage, metadata %var, metadata %expr) declare void @llvm.dbg.value(metadata %storage, i64 %offset, metadata %var, metadata %expr) This patch adds a new LLVM-local tag to DIExpressions, so we can detect and pretty-print DIExpression metadata nodes. What this patch doesn't do: This patch does not touch the "Indirect" field in DIVariable; but moving that into the expression would be a natural next step. http://reviews.llvm.org/D4919 rdar://problem/17994491 Thanks to dblaikie and dexonsmith for reviewing this patch! Note: I accidentally committed a bogus older version of this patch previously. llvm-svn: 218787	2014-10-01 18:55:02 +00:00
Duncan P. N. Exon Smith	ee9cf8af02	LTO: Add missing target triple from r218784 llvm-svn: 218786	2014-10-01 18:49:58 +00:00
Reed Kotler	f0d9d7da37	Add fptrunc to mips fast-sel Summary: Implement conversion of 64 to 32 bit floating point numbers (fptrunc) in mips fast-isel Test Plan: fptrunc.ll checked also with 4 internal mips build bot flavors mip32r1/miprs32r2 and at -O0 and -O2 Reviewers: dsanders Reviewed By: dsanders Subscribers: rfuhler Differential Revision: http://reviews.llvm.org/D5553 llvm-svn: 218785	2014-10-01 18:47:02 +00:00
Duncan P. N. Exon Smith	236c4250e7	LTO: Ignore disabled diagnostic remarks r206400 and r209442 added remarks that are disabled by default. However, if a diagnostic handler is registered, the remarks are sent unfiltered to the handler. This is the right behaviour for clang, since it has its own filters. However, the diagnostic handler exposed in the LTO API receives only the severity and message. It doesn't have the information to filter by pass name. For LTO, disabled remarks should be filtered by the producer. I've changed `LLVMContext::setDiagnosticHandler()` to take a `bool` argument indicating whether to respect the built-in filters. This defaults to `false`, so other consumers don't have a behaviour change, but `LTOCodeGenerator::setDiagnosticHandler()` sets it to `true`. To make this behaviour testable, I added a `-use-diagnostic-handler` command-line option to `llvm-lto`. This fixes PR21108. llvm-svn: 218784	2014-10-01 18:36:03 +00:00
David Blaikie	7b8bbd0e46	Add an immovable type to test Optional<T>::emplace more rigorously after r218732. llvm-svn: 218783	2014-10-01 18:29:44 +00:00
Adrian Prantl	0959156fa3	Revert r218778 while investigating buldbot breakage. "Move the complex address expression out of DIVariable and into an extra" llvm-svn: 218782	2014-10-01 18:10:54 +00:00

1 2 3 4 5 ...

108262 Commits