llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Oliver Stannard	8901713bd2	Teach the AArch64 backend about v4f16 and v8f16 This teaches the AArch64 backend to deal with the operations required to deal with the operations on v4f16 and v8f16 which are exposed by NEON intrinsics, plus the add, sub, mul and div operations. llvm-svn: 216555	2014-08-27 16:16:04 +00:00
Michael Zolotukhin	45d4921dc0	[SLP] Re-enable vectorization of GEP expressions (re-apply r210342 with a fix). llvm-svn: 216549	2014-08-27 15:01:18 +00:00
Evgeniy Stepanov	bde2b50f23	Clang-format over X86AsmInstrumentation.* with LLVM style. r216536 mistakenly used -style=Google instead of LLVM. llvm-svn: 216543	2014-08-27 13:11:55 +00:00
Benjamin Kramer	7c4796a444	Add an explicit cast to pacify implicit boolean conversion warnings. llvm-svn: 216539	2014-08-27 11:47:52 +00:00
Chandler Carruth	d55485e199	[x86] Fix a regression introduced with r213897 for 32-bit targets where we stopped efficiently lowering sextload using the SSE41 instructions for that operation. This is a consequence of a bad predicate I used thinking of the memory access needs. The code actually handles the cases where the predicate doesn't apply, and handles them much better. =] Simple fix and a test case added. Fixes PR20767. llvm-svn: 216538	2014-08-27 11:39:47 +00:00
Chandler Carruth	925d065888	[SDAG] Re-instate r215611 with a fix to a pesky X86 DAG combine. This combine is essentially combining target-specific nodes back into target independent nodes that it "knows" will be combined yet again by a target independent DAG combine into a different set of target-independent nodes that are legal (not custom though!) and thus "ok". This seems... deeply flawed. The crux of the problem is that we don't combine un-legalized shuffles that are introduced by legalizing other operations, and thus we don't see a very profitable combine opportunity. So the backend just forces the input to that combine to re-appear. However, for this to work, the conditions detected to re-form the unlegalized nodes must be exactly right. Previously, failing this would have caused poor code (if you're lucky) or a crasher when we failed to select instructions. After r215611 we would fall back into the legalizer. In some cases, this just "fixed" the crasher by produces bad code. But in the test case added it caused the legalizer and the dag combiner to iterate forever. The fix is to make the alignment checking in the x86 side of things match the alignment checking in the generic DAG combine exactly. This isn't really a satisfying or principled fix, but it at least make the code work as intended. It also highlights that it would be nice to detect the availability of under aligned loads for a given type rather than bailing on this optimization. I've left a FIXME to document this. Original commit message for r215611 which covers the rest of the chang: [SDAG] Fix a case where we would iteratively legalize a node during combining by replacing it with something else but not re-process the node afterward to remove it. In a truly remarkable stroke of bad luck, this would (in the test case attached) end up getting some other node combined into it without ever getting re-processed. By adding it back on to the worklist, in addition to deleting the dead nodes more quickly we also ensure that if it stops being dead for any reason it makes it back through the legalizer. Without this, the test case will end up failing during instruction selection due to an and node with a type we don't have an instruction pattern for. It took many million runs of the shuffle fuzz tester to find this. llvm-svn: 216537	2014-08-27 11:22:16 +00:00
Evgeniy Stepanov	b7f5d6f168	Clang-format over X86AsmInstrumentation.*. llvm-svn: 216536	2014-08-27 11:10:54 +00:00
Robert Khasanov	a051690c52	[SKX] Added new versions of cmp instructions in avx512_icmp_cc multiclass, added VL multiclass. Added encoding tests llvm-svn: 216532	2014-08-27 09:34:37 +00:00
Elena Demikhovsky	fec7633c50	AVX-512: Added intrinsic for VMOVSS store form with mask. llvm-svn: 216530	2014-08-27 07:38:43 +00:00
Craig Topper	43cee2f5fc	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created. llvm-svn: 216525	2014-08-27 05:25:25 +00:00
Craig Topper	d133367f2d	Fix some cases were ArrayRefs were being passed by reference. Also remove 'const' from some other ArrayRef uses since its implicitly const already. llvm-svn: 216524	2014-08-27 05:25:00 +00:00
David Majnemer	826f5eb297	InstCombine: Optimize GEP's involving ptrtoint better We supported transforming: (gep i8* X, -(ptrtoint Y)) to: (inttoptr (sub (ptrtoint X), (ptrtoint Y))) However, this only fired if 'X' had type i8*. Generalize this to support various types of different sizes. This results in much better CodeGen, especially for pointers to packed structs. llvm-svn: 216523	2014-08-27 05:16:04 +00:00
David Blaikie	4442146a4e	Remove type unit skeletons. GDB no longer needs them & this saves a heap of space. llvm-svn: 216521	2014-08-27 05:04:14 +00:00
Juergen Ributzka	069b8c9f42	[FastISel][AArch64] Fix address simplification. When a shift with extension or an add with shift and extension cannot be folded into the memory operation, then the address calculation has to be materialized separately. While doing so the code forgot to consider a possible sign-/zero- extension. This fix folds now also the sign-/zero-extension into the add or shift instruction which is used to materialize the address. This fixes rdar://problem/18141718. llvm-svn: 216511	2014-08-27 00:58:30 +00:00
Juergen Ributzka	d16dcac638	[FastISel][AArch64] Fold Sign-/Zero-Extend into the shift immediate instruction. llvm-svn: 216510	2014-08-27 00:58:26 +00:00
David Blaikie	93f40d579d	Fix a couple of debug info test cases to match the metadata schema change in r216239 Found these while testing something else. llvm-svn: 216505	2014-08-27 00:04:16 +00:00
Rafael Espindola	a2d7cc97be	Pass a std::unique_ptr<MemoryBuffer>& to getLazyBitcodeModule. By taking a reference we can do the ownership transfer in one place instead of expecting every caller to do it. llvm-svn: 216492	2014-08-26 22:00:09 +00:00
Rafael Espindola	225cf75bef	Pass a MemoryBufferRef when we can avoid taking ownership. The attached patch simplifies a few interfaces that don't need to take ownership of a buffer. For example, both parseAssembly and parseBitcodeFile will parse the entire buffer before returning. There is no need to take ownership. Using a MemoryBufferRef makes it obvious in the type signature that there is no ownership transfer. llvm-svn: 216488	2014-08-26 21:49:01 +00:00
Rafael Espindola	57a5798809	Give ExecutionEngine of top level buffers. Long term the idea if for the engine to not own the buffers, but for now this is consistent with the rest of the API. llvm-svn: 216484	2014-08-26 21:04:04 +00:00
Reid Kleckner	363b599ec0	MC: Split the x86 asm matcher implementations by dialect The existing matcher has lots of AT&T assembly dialect assumptions baked into it. In particular, the hack for resolving the size of a memory operand by appending the four most common suffixes doesn't work at all. The Intel assembly dialect mnemonic table has ambiguous entries, so we need to try matching multiple times with different operand sizes, since that's the only way to choose different instruction variants. This makes us more compatible with gas's implementation of Intel assembly syntax. MSVC assumes you want byte-sized operations for the instructions that we reject as ambiguous. Reviewed By: grosbach Differential Revision: http://reviews.llvm.org/D4747 llvm-svn: 216481	2014-08-26 20:32:34 +00:00
Joerg Sonnenberger	a73c9a0239	Revert r210342 and r210343, add test case for the crasher. PR 20642. llvm-svn: 216475	2014-08-26 19:06:41 +00:00
Joerg Sonnenberger	2e3334af4e	Convert MC command line option for fatal assembler warnings into a proper flag. llvm-svn: 216471	2014-08-26 18:39:50 +00:00
Rafael Espindola	e9da2b22ff	Invert the condition to have a single return. Thanks to David Blaikie for the suggestion. llvm-svn: 216468	2014-08-26 18:03:35 +00:00
Rafael Espindola	5d3991396a	Return a std::unique_ptr from the IRReader.h functions. NFC. llvm-svn: 216466	2014-08-26 17:29:46 +00:00
Rafael Espindola	7c0442fa71	Return a std::unique_ptr from parseInputFile and propagate. NFC. The memory management in BugPoint is fairly convoluted, so this just unwraps one layer by changing the return type of functions that always return owned Modules. llvm-svn: 216464	2014-08-26 17:19:03 +00:00
Rafael Espindola	1722847ca2	Simplify LTOModule::makeLTOModule a bit. NFC. Just call parseBitcodeFile instead of getLazyBitcodeModule followed by materializeAllPermanently. llvm-svn: 216461	2014-08-26 15:09:32 +00:00
Rafael Espindola	8cbf70d62d	Merge TempDir and system_temp_directory. We had two functions for finding the temp or cache directory. Each had a different set of smarts about OS specific APIs. With this patch system_temp_directory becomes the only way to do it. llvm-svn: 216460	2014-08-26 14:47:52 +00:00
Benjamin Kramer	7901ad422f	Silence unused function warning in Release builds. llvm-svn: 216458	2014-08-26 14:22:05 +00:00
James Molloy	b5144baff8	Change the return value of "getEnd()" from a MachineInstr* to a MachineBasicBlock::iterator. It seems on Darwin the illegal round-trip ::iterator -> MachineInstr* -> ::iterator breaks execution horribly when the iterator is not a real MachineInstr, like ::end(). llvm-svn: 216455	2014-08-26 13:41:31 +00:00
Yi Kong	7df6bd5f10	ARM: Add patterns for dbg llvm-svn: 216451	2014-08-26 12:47:26 +00:00
Dinesh Dwivedi	1b0080d8e1	This patch enables SimplifyUsingDistributiveLaws() to handle following pattens. (X >> Z) & (Y >> Z) -> (X&Y) >> Z for all shifts. (X >> Z) \| (Y >> Z) -> (X\|Y) >> Z for all shifts. (X >> Z) ^ (Y >> Z) -> (X^Y) >> Z for all shifts. These patterns were previously handled separately in visitAnd()/visitOr()/visitXor(). Differential Revision: http://reviews.llvm.org/D4951 llvm-svn: 216443	2014-08-26 08:53:32 +00:00
Bill Wendling	066ad62a26	Use 'xz' compression instead of 'gz'. llvm-svn: 216442	2014-08-26 08:11:22 +00:00
David Majnemer	635dd8cd82	InstSimplify: Fold gep X, (sub 0, ptrtoint(X)) to null Save InstCombine some work if we can perform this fold during InstSimplify. llvm-svn: 216441	2014-08-26 07:08:03 +00:00
David Majnemer	c439a36c15	InstSimplify: Simplify trivial pointer expressions like b + (e - b) consider: long long f(long long b, long long e) { return b + (e - b); } we would lower this to something like: define i64 @f(i64* %b, i64* %e) { %1 = ptrtoint i64* %e to i64 %2 = ptrtoint i64* %b to i64 %3 = sub i64 %1, %2 %4 = ashr exact i64 %3, 3 %5 = getelementptr inbounds i64* %b, i64 %4 ret i64* %5 } This should fold away to just 'e'. N.B. This adds m_SpecificInt as a convenient way to match against a particular 64-bit integer when using LLVM's match interface. llvm-svn: 216439	2014-08-26 05:55:16 +00:00
Dylan Noblesmith	70b930984e	AArch64: use std::fill instead of memset Followup based on review. llvm-svn: 216436	2014-08-26 03:33:26 +00:00
Dylan Noblesmith	8c4eed9349	Revert "AArch64: use std::vector for temp array" This reverts commit r216365. llvm-svn: 216433	2014-08-26 02:03:43 +00:00
Dylan Noblesmith	e0403e6eab	Analysis: cleanup Address review comments. llvm-svn: 216432	2014-08-26 02:03:40 +00:00
Dylan Noblesmith	24bc8991ce	Revert "Analysis: unique_ptr-ify DependenceAnalysis::collectCoeffInfo" This reverts commit r216358. llvm-svn: 216431	2014-08-26 02:03:38 +00:00
Dylan Noblesmith	dcc2327ca8	Revert "NVPTX: remove another raw delete call" This reverts commit r216364. llvm-svn: 216430	2014-08-26 02:03:35 +00:00
Dylan Noblesmith	1af83d4c81	Revert "Support/APFloat: unique_ptr-ify temp arrays" This reverts commit rr216359. llvm-svn: 216429	2014-08-26 02:03:33 +00:00
Dylan Noblesmith	36c670b2b8	Revert "Support/Path: remove raw delete" This reverts commit r216360. llvm-svn: 216428	2014-08-26 02:03:30 +00:00
Dylan Noblesmith	1e830f5d02	ExecutionEngine: address review comments llvm-svn: 216427	2014-08-26 02:03:28 +00:00
Dylan Noblesmith	3caa557cf3	CodeGen/LiveVariables: use vector::assign() Address review comments. llvm-svn: 216426	2014-08-26 02:03:25 +00:00
Reid Kleckner	4a75bd090f	musttail: Don't eliminate varargs packs if there is a forwarding call Also clean up and beef up this grep test for the feature. llvm-svn: 216425	2014-08-26 00:59:51 +00:00
Sanjay Patel	b8e4ed2941	fix typos in comments llvm-svn: 216424	2014-08-26 00:59:15 +00:00
Reid Kleckner	f92e18b173	Declare that musttail calls in variadic functions forward the ellipsis Summary: There is no functionality change here except in the way we assemble and dump musttail calls in variadic functions. There's really no need to separate out the bits for musttail and "is forwarding varargs" on call instructions. A musttail call by definition has to forward the ellipsis or it would fail verification. Reviewers: chandlerc, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4892 llvm-svn: 216423	2014-08-26 00:33:28 +00:00
Reid Kleckner	0f3d2670c2	Fix Path unittests on Windows after raw_fd_ostream changes llvm-svn: 216422	2014-08-26 00:24:23 +00:00
Reid Kleckner	04b36830cc	ArgPromotion: Don't touch variadic functions Adding, removing, or changing non-pack parameters can change the ABI classification of pack parameters. Clang and other frontends encode the classification in the IR of the call site, but the callee side determines it dynamically based on the number of registers consumed so far. Changing the prototype affects the number of registers consumed would break such code. Dead argument elimination performs a similar task and already has a similar check to avoid this problem. Patch by Thomas Jablin! llvm-svn: 216421	2014-08-25 23:58:48 +00:00
Lang Hames	d1b482e078	[MCJIT][SystemZ] Use a simpler expression for indirect relocation offsets. The expressions 'Reloc.Addend - Addend' and 'Reloc.Offset' should always be equal in this context. The latter is prefered - we want to remove the RelocationValueRef::Addend field in the future. llvm-svn: 216418	2014-08-25 23:33:48 +00:00
Rafael Espindola	557df13e6d	Fix bug in llvm::sys::argumentsFitWithinSystemLimits(). This patch fixes a subtle bug in the UNIX implementation of llvm::sys::argumentsFitWithinSystemLimits() regarding the misuse of a static variable. This bug causes our cached number that stores the system command line maximum length to be halved after each call to the function. With a sufficient number of calls to this function, it will eventually report any given command line string to be over system limits. Patch by Rafael Auler. llvm-svn: 216415	2014-08-25 22:53:21 +00:00

1 2 3 4 5 ...

107094 Commits