llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Eric Christopher	d592833f58	Add support for ARM and AArch64 big endian objects to RelocVisitor. Patch by Charlie Turner. llvm-svn: 219109	2014-10-06 07:02:58 +00:00
Eric Christopher	f6e0f054d2	Refactor RelocVisitor to take an object. This removes some string comparisons and makes it a bit easier to check individual targets. Patch by Charlie Turner. llvm-svn: 219108	2014-10-06 06:55:55 +00:00
Eric Christopher	bd23d9f1d0	Add some tests for RelocVisitor. Patch by Charlie Turner. llvm-svn: 219107	2014-10-06 06:52:52 +00:00
Eric Christopher	faca264c55	Add subtarget caches to aarch64, arm, ppc, and x86. These will make it easier to test further changes to the code generation and optimization pipelines as those are moved to subtargets initialized with target feature and target cpu. llvm-svn: 219106	2014-10-06 06:45:36 +00:00
Yaron Keren	45037aef50	Resolve ambiguity between llvm::make_unique and std::make_unique. Introduced in r219098. llvm-svn: 219105	2014-10-06 06:39:57 +00:00
David Blaikie	16833bbcd2	DebugInfo: Sink constructImportedEntityDIE down into DwarfUnit from DwarfDebug. It was just calling a bunch of DwarfUnit functions anyway, as can be seen by the simplification of removing "TheCU" from all the function calls in the implementation. llvm-svn: 219103	2014-10-06 05:37:24 +00:00
Frederic Riss	01fe0d04e1	[dwarfdump] Print the name for referenced specification of abstract_origin DIEs. Reviewers: dblaikie, samsonov, echristo, aprantl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5466 llvm-svn: 219099	2014-10-06 03:36:31 +00:00
Frederic Riss	805a471e52	Factor the Unit section parsing into the DWARFUnitSection class. Summary: No functional change. Reviewers: dblaikie, samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5522 llvm-svn: 219098	2014-10-06 03:36:18 +00:00
Chandler Carruth	c2e5aca058	[PM] Remove an unused and rather expensive mapping from an analysis group's interface to all of the implementations of that analysis group. The groups themselves can and do manage this anyways, the pass registry needn't involve itself. llvm-svn: 219097	2014-10-06 00:30:59 +00:00
Chandler Carruth	c9860adb9f	[PM] Remove the (deeply misguided) 'unregister' functionality from the pass registry. This style of registry is somewhat questionable, but it being non-monotonic is crazy. No one is (or should be) unloading DSOs with passes and unregistering them here. I've checked with a few folks and I don't know of anyone using this functionality or any important use case where it is necessary. llvm-svn: 219096	2014-10-06 00:13:25 +00:00
Chandler Carruth	4f6ca97fd2	[cleanup] Switch to using range-based for loops in two very obvious places. llvm-svn: 219095	2014-10-06 00:06:48 +00:00
Chandler Carruth	6fb719686c	[cleanup] Fix up trailing whitespace and formatting in the pass registry code prior to hacking on it more significantly. llvm-svn: 219094	2014-10-05 23:59:03 +00:00
Owen Anderson	2aa1295ade	Give the Reassociate pass a bit more flexibility and autonomy when optimizing expressions. Particularly, it addresses cases where Reassociate breaks Subtracts but then fails to optimize combinations like I1 + -I2 where I1 and I2 have the same rank and are identical. Patch by Dmitri Shtilman. llvm-svn: 219092	2014-10-05 23:41:26 +00:00
Chandler Carruth	88628181c0	[x86] Remove the 2-addr-to-3-addr "optimization" from shufps to pshufd. This trades a (register-renamer-friendly) movaps for a floating point / integer domain cross. That is a very bad trade, even on architectures where domain crossing is relatively fast. On any chip where there is even a cycle stall, this is a Very Bad Idea. It doesn't even seem likely to cause a spill to be introduced because the reason for the copy is to destructively shuffle in place. Thanks to Ben Kramer for fixing a bug in this code that my new shuffle lowering exposed and highlighting that perhaps it should just go away. =] llvm-svn: 219090	2014-10-05 22:57:31 +00:00
Chandler Carruth	a54972693a	[x86, dag] Teach the DAG combiner to prune inputs to a vector_shuffle that are unused. This allows the combiner to delete math feeding shuffles where the math isn't actually necessary. This improves some of the vperm2x128 tests that regressed when the vector shuffle lowering started actually generating vperm instructions rather than forcibly decomposing them. Sadly, this isn't enough to get this really right because we still form a completely unnecessary permutation. To fix that, we also need to fold shuffles which just rearrange concatenated or inserted subvectors. llvm-svn: 219086	2014-10-05 19:14:34 +00:00
David Blaikie	d13e7a4d3b	Remove unused map This became unnecessary/unused in r208636 llvm-svn: 219085	2014-10-05 16:31:13 +00:00
Benjamin Kramer	fac673d6bc	X86: Don't drop half of the mask when converting 2-address shufps into 3-address pshufd. It's debatable whether this transform is useful at all, but for now make sure we don't generate invalid asm. llvm-svn: 219084	2014-10-05 16:14:29 +00:00
Elena Demikhovsky	0be5f4deeb	AVX-512-SKX: Added instruction VPMOVM2B/W/D/Q. This instruction allows broadcasting a mask vector to a data vector. llvm-svn: 219083	2014-10-05 14:11:08 +00:00
Benjamin Kramer	2b33247371	Simplify code. No functionality change. llvm-svn: 219082	2014-10-05 12:21:57 +00:00
Chandler Carruth	aa7f8c811b	[x86] Fix PR21139, one of the last remaining regressions found in the new vector shuffle lowering. This is loosely based on a patch by Marius Wachtler to the PR (thanks!). I refactored it a bit to use std::count_if and a mutable array ref but the core idea was exactly right. I also added some direct testing of this case. I believe PR21137 is now the only remaining regression. llvm-svn: 219081	2014-10-05 12:07:34 +00:00
Chandler Carruth	b8978f2ab2	[x86] Teach the new vector shuffle lowering how to lower 128-bit shuffles using AVX and AVX2 instructions. This fixes PR21138, one of the few remaining regressions impacting benchmarks from the new vector shuffle lowering. You may note that it "regresses" many of the vperm2x128 test cases -- these were actually "improved" by the naive lowering that the new shuffle lowering previously did. This regression gave me fits. I had this patch ready-to-go about an hour after flipping the switch but wasn't sure how to have the best of both worlds here and thought the correct solution might be a completely different approach to lowering these vector shuffles. I'm now convinced this is the correct lowering and the missed optimizations shown in vperm2x128 are actually due to missing target-independent DAG combines. I've even written most of the needed DAG combine and will submit it shortly, but this part is ready and should help some real-world benchmarks out. llvm-svn: 219079	2014-10-05 11:41:36 +00:00
NAKAMURA Takumi	242f8cc95b	HexagonMCCodeEmitter.cpp: Prune 2nd redundant \brief. [-Wdocumentation] llvm-svn: 219073	2014-10-05 04:54:54 +00:00
NAKAMURA Takumi	1b6b088c7b	[CMake] HexagonTests: Update LINK_COMPONENTS. llvm-svn: 219072	2014-10-05 04:54:41 +00:00
NAKAMURA Takumi	ad3b39eff9	HexagonDesc: Update LLVMBuild.txt. llvm-svn: 219071	2014-10-05 04:54:29 +00:00
Hal Finkel	4acb4516d6	[InstCombine] Simplify the logic from r219067 using ValueTracking Joerg suggested on IRC that I look at generalizing the logic from r219067 to handle more general redundancies (like removing an assume(x > 3) dominated by an assume(x > 5)). The way to do this would be to ask ValueTracking to determine the value of the i1 argument. It turns out that ValueTracking is not very good at this right now (although it does get the trivial redundancy case) because it does not understand ICmps. Nevertheless, the resulting code in InstCombine is simpler than r219067, so we might as well do it now. llvm-svn: 219070	2014-10-05 00:53:02 +00:00
Benjamin Kramer	a961a36aad	[SystemZ] Make operator bool explicit. NFC. llvm-svn: 219069	2014-10-04 22:44:35 +00:00
Benjamin Kramer	860521c88b	Make AAMDNodes ctor and operator bool (!!!) explicit, mop up bugs and weirdness exposed by it. llvm-svn: 219068	2014-10-04 22:44:29 +00:00
Hal Finkel	59318e2605	[InstCombine] Remove redundant @llvm.assume intrinsics For any @llvm.assume intrinsic, if there is another which dominates it and uses the same condition, then it is redundant and can be removed. While this does not alter the semantics of the @llvm.assume intrinsics, it makes subsequent handling more efficient (and the resulting IR easier to read). llvm-svn: 219067	2014-10-04 21:27:06 +00:00
Yaron Keren	d58cb7b41a	Solve Visual C++ warning C4805 on getAsInteger<bool>. Fix http://llvm.org/PR21158 by adding a cast to unsigned long long, so the comparison would be between two unsigned long longs instead of bool and unsigned long long. if (getAsUnsignedInteger(*this, Radix, ULLVal) || static_cast<unsigned long long>(static_cast<T>(ULLVal)) != ULLVal) llvm-svn: 219065	2014-10-04 19:58:30 +00:00
Benjamin Kramer	7db3ef45b9	Remove unnecessary copying or replace it with moves in a bunch of places. NFC. llvm-svn: 219061	2014-10-04 16:55:56 +00:00
David Blaikie	0eef1c005d	Sink DwarfDebug::updateSubprogramScopeDIE into DwarfCompileUnit This requires exposing some of the current function state from DwarfDebug. I hope there's not too much of that to expose as I go through all the functions, but it still seems nicer to expose singular data down to multiple consumers, than have consumers expose raw mapping data structures up to DwarfDebug for building subprograms. Part of a series of refactoring to allow subprograms in both the skeleton and dwo CUs under Fission. llvm-svn: 219060	2014-10-04 16:24:00 +00:00
David Blaikie	1f6267c40e	Reformatting accidentally left out of r219057 llvm-svn: 219059	2014-10-04 16:00:26 +00:00
David Blaikie	c691131ef4	Sink DwarfDebug::attachLowHighPC into DwarfCompileUnit One of many things to sink down into DwarfCompileUnit to allow handling of subprograms in both the skeleton and dwo CU under Fission. llvm-svn: 219058	2014-10-04 15:58:47 +00:00
David Blaikie	7bf1cb9a67	Move DwarfCompileUnit from DwarfUnit.h to its own header (DwarfCompileUnit.h) In preparation for sinking all the subprogram emission code down from DwarfDebug into DwarfCompileUnit, this will avoid bloating DwarfUnit.h/cpp greatly and make concerns a bit more clear/isolated. (sinking this handling down is part of the work to handle emitting minimal subprograms for -gmlt-like data into the skeleton CU under fission) llvm-svn: 219057	2014-10-04 15:49:50 +00:00
Duncan P. N. Exon Smith	ccfae8edc5	DI: Fixup global syntax in example llvm-svn: 219056	2014-10-04 15:44:01 +00:00
Duncan P. N. Exon Smith	23a107ef30	DI: Line up comments in examples llvm-svn: 219055	2014-10-04 15:35:25 +00:00
Duncan P. N. Exon Smith	a8dcc79ed3	DI: Fixup example IR from r219051 llvm-svn: 219054	2014-10-04 15:31:08 +00:00
Duncan P. N. Exon Smith	e52ab99166	DI: Prune another example llvm-svn: 219053	2014-10-04 15:30:52 +00:00
Duncan P. N. Exon Smith	9e10a6fe57	DI: Update and prune metadata examples Update a couple of the examples of debug info metadata, and prune the rest. Point to the true reference implementation in the source. llvm-svn: 219051	2014-10-04 14:56:56 +00:00
Chandler Carruth	42266f377f	[x86] Slap a triple on this test since it is poking around at the stack and calling conventions. Otherwise it's too hard to craft a usefully generic set of assertions. llvm-svn: 219047	2014-10-04 04:22:55 +00:00
Chandler Carruth	5063f25595	[x86] Enable the new vector shuffle lowering by default. Update the entire regression test suite for the new shuffles. Remove most of the old testing which was devoted to the old shuffle lowering path and is no longer relevant really. Also remove a few other random tests that only really exercised shuffles and only incidentally or without any interesting aspects to them. Benchmarking that I have done shows a few small regressions with this on LNT, zero measurable regressions on real, large applications, and for several benchmarks where the loop vectorizer fires in the hot path it shows 5% to 40% improvements for SSE2 and SSE3 code running on Sandy Bridge machines. Running on AMD machines shows even more dramatic improvements. When using newer ISA vector extensions the gains are much more modest, but the code is still better on the whole. There are a few regressions being tracked (PR21137, PR21138, PR21139) but by and large this is expected to be a win for x86 generated code performance. It is also more correct than the code it replaces. I have fuzz tested this extensively with ISA extensions up through AVX2 and found no crashes or miscompiles (yet...). The old lowering had a few miscompiles and crashers after a somewhat smaller amount of fuzz testing. There is one significant area where the new code path lags behind and that is in AVX-512 support. However, there was extremely little support for that already and so this isn't a significant step backwards and the new framework will probably make it easier to implement lowering that uses the full power of AVX-512's table-based shuffle+blend (IMO). Many thanks to Quentin, Andrea, Robert, and others for benchmarking assistance. Thanks to Adam and others for help with AVX-512. Thanks to Hal, Eric, and many others for answering my incessant questions about how the backend actually works. =] I will leave the old code path in the tree until the 3 PRs above are at least resolved to folks' satisfaction. Then I will rip it (and 1000s of lines of code) out. =] I don't expect this flag to stay around for very long. It may not survive next week. llvm-svn: 219046	2014-10-04 03:52:55 +00:00
Jingyue Wu	4a186967a9	Add fake use to suppress defined-but-unused warnings llvm-svn: 219045	2014-10-04 03:50:10 +00:00
Chandler Carruth	7001fb9ace	[x86] Fix a bug in the VZEXT DAG combine that I just made more powerful. It turns out this combine was always somewhat flawed -- there are cases where nested VZEXT nodes can't be combined: if their types have a mismatch that can be observed in the result. While none of these show up currently, once I switch to the new vector shuffle lowering a few test cases actually form such nested VZEXT nodes. I've not come up with any IR pattern that I can sensibly write to exercise this, but it will be covered by tests once I flip the switch. llvm-svn: 219044	2014-10-04 02:51:03 +00:00
Chandler Carruth	b73b4f12a1	[x86] Sink a generic combine of VZEXT nodes from the lowering to VZEXT nodes to the DAG combining of them. This will allow the combine to fire on both old vector shuffle lowering and the new vector shuffle lowering and generally seems like a cleaner design. I've trimmed down the code a bit and tried to make it and the surrounding combine fairly clean while moving it around. llvm-svn: 219042	2014-10-04 01:05:48 +00:00
Matt Arsenault	c421684bad	R600/SI: Custom lower f64 -> i64 conversions llvm-svn: 219038	2014-10-03 23:54:56 +00:00
Matt Arsenault	7b24655980	R600: Custom lower [s|u]int_to_fp for i64 -> f64 llvm-svn: 219037	2014-10-03 23:54:41 +00:00
Matt Arsenault	2456242394	R600/SI: Fix ftrunc f64 conformance failures. Re-add the tests since they were deleted at some point llvm-svn: 219036	2014-10-03 23:54:27 +00:00
Peter Collingbourne	6e844ba0a6	Remove unused ALL_BINDINGS configuration variable. llvm-svn: 219035	2014-10-03 23:03:01 +00:00
Chandler Carruth	74c4b81b56	[x86] Add a really preposterous number of patterns for matching all of the various ways in which blends can be used to do vector element insertion for lowering with the scalar math instruction forms that effectively re-blend with the high elements after performing the operation. This then allows me to bail on the element insertion lowering path when we have SSE4.1 and are going to be doing a normal blend, which in turn restores the last of the blends lost from the new vector shuffle lowering when I got it to prioritize insertion in other cases (for example when we don't have a blend instruction). Without the patterns, using blends here would have regressed sse-scalar-fp-arith.ll completely with the new vector shuffle lowering. For completeness, I've added RUN-lines with the new lowering here. This is somewhat superfluous as I'm about to flip the default, but hey, it shows that this actually significantly changed behavior. The patterns I've added are just ridiculously repetitive. Suggestions on making them better very much welcome. In particular, handling the commuted form of the v2f64 patterns is somewhat obnoxious. llvm-svn: 219033	2014-10-03 22:43:17 +00:00
Chris Bieneman	7798d68464	Converting the ErrorHandlerMutex to a ManagedStatic to avoid the static constructor and destructor. llvm-svn: 219028	2014-10-03 22:03:12 +00:00

108496 Commits