llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Ahmed Bougacha	6c3e1c0f56	[MC] Reset the MCInst in the matcher function before adding opcode/operands. On X86, the Intel asm parser tries to match all memory operand sizes when none is explicitly specified. For LEA, which doesn't really have a memory operand (just a pointer one), this results in multiple successful matches, one for each memory size. There's no error because it's same opcode, so really, it's just one match. However, the tablegen'd matcher function adds opcode/operands to the passed MCInst, and this results in multiple duplicated operands. This commit clears the MCInst in the tablegen'd matcher function. We sometimes clear it when the match failed, so there's no expectation of keeping the previous content anyway. Differential Revision: http://reviews.llvm.org/D6670 llvm-svn: 224347	2014-12-16 18:05:28 +00:00
Colin LeMahieu	4932546e48	[Hexagon] Adding absolute value, and negate with saturation llvm-svn: 224346	2014-12-16 17:44:49 +00:00
Sanjay Patel	8363dd3b42	combine consecutive subvector 16-byte loads into one 32-byte load This is a fix for PR21709 ( http://llvm.org/bugs/show_bug.cgi?id=21709 ). When we have 2 consecutive 16-byte loads that are merged into one 32-byte vector, we can use a single 32-byte load instead. But we don't do this for SandyBridge / IvyBridge because they have slower 32-byte memops. We also don't bother using 32-byte integer loads on a machine that only has AVX1 (btver2) because those operands would have to be split in half anyway since there is no support for 32-byte integer math ops. Differential Revision: http://reviews.llvm.org/D6492 llvm-svn: 224344	2014-12-16 16:30:01 +00:00
Colin LeMahieu	585f29d985	[Hexagon] Adding saturate and swizzle instructions. llvm-svn: 224343	2014-12-16 16:27:17 +00:00
Zoran Jovanovic	d72dae73a8	[mips][microMIPS] Implement SWP and LWP instructions Differential Revision: http://reviews.llvm.org/D5667 llvm-svn: 224338	2014-12-16 14:59:10 +00:00
Vladimir Medic	3860fde4a8	Add disassembler tests for mips4 platform. There are no functional changes. llvm-svn: 224335	2014-12-16 13:02:25 +00:00
Elena Demikhovsky	fe73fcc29b	Masked Load and Store Intrinsics in loop vectorizer. The loop vectorizer optimizes loops containing conditional memory accesses by generating masked load and store intrinsics. This decision is target dependent. http://reviews.llvm.org/D6527 llvm-svn: 224334	2014-12-16 11:50:42 +00:00
Daniel Sanders	018d1acab3	[mips] Fix arguments-struct.ll for Windows and OSX hosts. llvm-svn: 224333	2014-12-16 11:21:58 +00:00
Bradley Smith	5d5a40a0f8	[ARM] Prevent PerformVCVTCombine from combining a vmul/vcvt with 8 lanes This would result in a crash since the vcvt used does not support v8i32 types. llvm-svn: 224332	2014-12-16 10:59:27 +00:00
Duncan P. N. Exon Smith	58ed764767	IR: Stop printing 'metadata' in Metadata::print() Stop printing `metadata` in `Metadata::print()` and `Metadata::printAsOperand()`. llvm-svn: 224327	2014-12-16 07:40:31 +00:00
Duncan P. N. Exon Smith	c03e29935e	DebugInfo: Update testcase to actually check something This test was missing a `Debug Info Version` so it's `not grep` was passing vacuously. Update it to CHECK for something useful at the same time so it doesn't bitrot quite so easily in the future. llvm-svn: 224324	2014-12-16 07:08:19 +00:00
Saleem Abdulrasool	c163948b80	ARM: diagnose deprecated syntax The use of SP and PC in the register list for stores is deprecated on ARM (ARM ARM A.8.8.199): ARM deprecates the use of ARM instructions that include the SP or the PC in the list. Provide a deprecation warning from the assembler in the case that the syntax is ever seen. llvm-svn: 224319	2014-12-16 05:53:25 +00:00
Hal Finkel	04ae4c36c5	[PowerPC] Improve instruction selection bit-permuting operations (32-bit) The PowerPC backend, somewhat embarrassingly, did not generate an optimal-length sequence of instructions for a 32-bit bswap. While adding a pattern for the bswap intrinsic to fix this would not have been terribly difficult, doing so would not have addressed the real problem: we had been generating poor code for many bit-permuting operations (by which I mean things like byte-swap that permute the bits of one or more inputs around in various ways). Here are some initial steps toward solving this deficiency. Bit-permuting operations are represented, at the SDAG level, using ISD::ROTL, SHL, SRL, AND and OR (mostly with constant second operands). Looking back through these operations, we can build up a description of the bits in the resulting value in terms of bits of one or more input values (and constant zeros). For each bit, we compute the rotation amount from the original value, and then group consecutive (value, rotation factor) bits into groups. Groups sharing these attributes are then collected and sorted, and we can then instruction select the entire permutation using a combination of masked rotations (rlwinm), imm ands (andi/andis), and masked rotation inserts (rlwimi). The result is that instead of lowering an i32 bswap as: rlwinm 5, 3, 24, 16, 23 rlwinm 4, 3, 24, 0, 7 rlwimi 4, 3, 8, 8, 15 rlwimi 5, 3, 8, 24, 31 rlwimi 4, 5, 0, 16, 31 we now produce: rlwinm 4, 3, 8, 0, 31 rlwimi 4, 3, 24, 16, 23 rlwimi 4, 3, 24, 0, 7 and for the 'test6' example in the PowerPC/README.txt file: unsigned test6(unsigned x) { return ((x & 0x00FF0000) >> 16) \| ((x & 0x000000FF) << 16); } we used to produce: lis 4, 255 rlwinm 3, 3, 16, 0, 31 ori 4, 4, 255 and 3, 3, 4 and now we produce: rlwinm 4, 3, 16, 24, 31 rlwimi 4, 3, 16, 8, 15 and, as a nice bonus, this fixes the FIXME in test/CodeGen/PowerPC/rlwimi-and.ll. This commit does not include instruction-selection for i64 operations, those will come later. llvm-svn: 224318	2014-12-16 05:51:41 +00:00
Rafael Espindola	f367e92d33	Start adding thin archive support. This is just sufficient for 'ar t' to work. llvm-svn: 224307	2014-12-16 01:43:41 +00:00
Kevin Enderby	db5408dea6	Fix a bug in llvm-objdump’s -private-headers for 32-bit Mach-O files printing the section header. And add some tests for this for 32-bit files. llvm-svn: 224302	2014-12-16 01:14:45 +00:00
Adrian Prantl	33921ffabc	ARM/AArch64: Attach the FrameSetup MIFlag to CFI instructions. Debug info marks the first instruction without the FrameSetup flag as being the end of the function prologue. Any CFI instructions in the middle of the function prologue would cause debug info to end the prologue too early and worse, attach the line number of the CFI instruction, which incidentally is often 0. llvm-svn: 224294	2014-12-16 00:20:49 +00:00
Colin LeMahieu	4c0e2a35a6	[Hexagon] Adding doubleword multiplies with and without accumulation. llvm-svn: 224293	2014-12-16 00:07:24 +00:00
Colin LeMahieu	0a4e0a7b23	[Hexagon] Adding halfword to doubleword multiplies. llvm-svn: 224289	2014-12-15 23:29:37 +00:00
Colin LeMahieu	b56764d577	[Hexagon] Adding logical-logical accumulation instructions and tests. llvm-svn: 224288	2014-12-15 23:19:07 +00:00
Sanjoy Das	0cdde3ea1f	Teach ScalarEvolution to exploit min and max expressions when proving isKnownPredicate. The motivation for this change is to optimize away checks in loops like this: limit = min(t, len) for (i = 0 to limit) if (i >= len \|\| i < 0) throw_array_of_of_bounds(); a[i] = ... Differential Revision: http://reviews.llvm.org/D6635 llvm-svn: 224285	2014-12-15 22:50:15 +00:00
Simon Pilgrim	1fd72b137f	Added missing tests for X86vzmovl folding. NFC. llvm-svn: 224284	2014-12-15 22:45:48 +00:00
JF Bastien	27a63b4d77	x86: Emit LOCK prefix after DATA16 Summary: x86 allows either ordering for the LOCK and DATA16 prefixes, but using GCC+GAS leads to different code generation than using LLVM. This change matches the order that GAS emits the x86 prefixes when a semicolon isn't used in inline assembly (see tc-i386.c comment before define LOCK_PREFIX), and helps simplify tooling that operates on the instruction's byte sequence (such as NaCl's validator). This change shouldn't have any performance impact. Test Plan: ninja check Reviewers: craig.topper, jvoung Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D6630 llvm-svn: 224283	2014-12-15 22:34:58 +00:00
Colin LeMahieu	a6e921963f	[Hexagon] Adding a number of additional multiply forms with tests. llvm-svn: 224282	2014-12-15 22:10:37 +00:00
Colin LeMahieu	cb4ac18de9	[Hexagon] Adding misc multiply encodings and tests. llvm-svn: 224273	2014-12-15 21:17:03 +00:00
Colin LeMahieu	cfd931a5a2	[Hexagon] Adding doubleworld accumulating multiplies of halfwords. llvm-svn: 224267	2014-12-15 20:17:46 +00:00
Colin LeMahieu	410a9d158e	[Hexagon] Adding accumulating half word multiplies. llvm-svn: 224266	2014-12-15 20:10:28 +00:00
Colin LeMahieu	5b550fb31a	[Hexagon] Adding multiply with rnd/sat/rndsat llvm-svn: 224265	2014-12-15 20:01:59 +00:00
Ahmed Bougacha	9d970e1dc6	[X86] And also test INSERTPS shuffle mask pretty-printing. For r224260. llvm-svn: 224264	2014-12-15 19:47:35 +00:00
Colin LeMahieu	4225ddfd4f	[Hexagon] Adding encoding bits for halfword multiplies. llvm-svn: 224261	2014-12-15 19:22:07 +00:00
Duncan P. N. Exon Smith	9c5542c040	IR: Make metadata typeless in assembly Now that `Metadata` is typeless, reflect that in the assembly. These are the matching assembly changes for the metadata/value split in r223802. - Only use the `metadata` type when referencing metadata from a call intrinsic -- i.e., only when it's used as a `Value`. - Stop pretending that `ValueAsMetadata` is wrapped in an `MDNode` when referencing it from call intrinsics. So, assembly like this: define @foo(i32 %v) { call void @llvm.foo(metadata !{i32 %v}, metadata !0) call void @llvm.foo(metadata !{i32 7}, metadata !0) call void @llvm.foo(metadata !1, metadata !0) call void @llvm.foo(metadata !3, metadata !0) call void @llvm.foo(metadata !{metadata !3}, metadata !0) ret void, !bar !2 } !0 = metadata !{metadata !2} !1 = metadata !{i32* @global} !2 = metadata !{metadata !3} !3 = metadata !{} turns into this: define @foo(i32 %v) { call void @llvm.foo(metadata i32 %v, metadata !0) call void @llvm.foo(metadata i32 7, metadata !0) call void @llvm.foo(metadata i32* @global, metadata !0) call void @llvm.foo(metadata !3, metadata !0) call void @llvm.foo(metadata !{!3}, metadata !0) ret void, !bar !2 } !0 = !{!2} !1 = !{i32* @global} !2 = !{!3} !3 = !{} I wrote an upgrade script that handled almost all of the tests in llvm and many of the tests in cfe (even handling many `CHECK` lines). I've attached it (or will attach it in a moment if you're speedy) to PR21532 to help everyone update their out-of-tree testcases. This is part of PR21532. llvm-svn: 224257	2014-12-15 19:07:53 +00:00
Reid Kleckner	af63007e28	Move mips1 tests to test/MC/Disassembler/Mips/mips1 This matches the pattern of the mips2 and 3 tests, as well as our normal conventions. llvm-svn: 224254	2014-12-15 17:56:02 +00:00
Vladimir Medic	ab769bd4d1	Add disassembler tests for mips3 platform. There are no functional changes. llvm-svn: 224253	2014-12-15 16:19:34 +00:00
Vladimir Medic	e80ff03b83	Add disassembler tests for mips2 platform. There are no functional changes. llvm-svn: 224252	2014-12-15 15:58:20 +00:00
Vladimir Medic	f681904c24	This is the first in a series of patches that add missing disassembler tests for mips platform. The patches are divided per version of mips CPU to keep the patches smaller and ease the review. There are no functional changes, code is changed only if new tests reveal a bug.This patch adds disassembler tests for mips1 CPU. llvm-svn: 224251	2014-12-15 15:22:33 +00:00
Elena Demikhovsky	003ce257e6	Added a test related to 224247 revision llvm-svn: 224248	2014-12-15 14:14:10 +00:00
Michael Kuperstein	cc87d705cb	[X86] Break false dependencies before partial register updates when the source operand is in memory Adds the various "rm" instruction variants into the list of instructions that have a partial register update. Also adds all variants of SQRTSD that were missing in the original list. Differential Revision: http://reviews.llvm.org/D6620 llvm-svn: 224246	2014-12-15 13:18:21 +00:00
Suyog Sarda	c0c2a062e1	Typo Correction in Test Case. NFC. llvm-svn: 224244	2014-12-15 12:19:46 +00:00
Elena Demikhovsky	51c511a201	AVX-512: Added EXPAND instructions and intrinsics. llvm-svn: 224241	2014-12-15 10:03:52 +00:00
Hal Finkel	acf8e7a584	[PowerPC] Handle cmp op promotion for SELECT[_CC] nodes in PPCTL::DAGCombineExtBoolTrunc PPCTargetLowering::DAGCombineExtBoolTrunc contains logic to remove unwanted truncations and extensions when dealing with nodes of the form: zext(binary-ops(binary-ops(trunc(x), trunc(y)), ...) There was a FIXME in the implementation (now removed) regarding the fact that the function would abort the transformations if any of the non-output operands of a SELECT or SELECT_CC node would need to be promoted (because they were also output operands, for example). As a result, we continued to generate unnecessary zero-extends for code such as this: unsigned foo(unsigned a, unsigned b) { return (a <= b) ? a : b; } which would produce: cmplw 0, 3, 4 isel 3, 4, 3, 1 rldicl 3, 3, 0, 32 blr and now we produce: cmplw 0, 3, 4 isel 3, 4, 3, 1 blr which is better in the obvious way. llvm-svn: 224213	2014-12-14 05:53:19 +00:00
Ahmed Bougacha	88111b0889	Reapply "[ARM] Combine base-updating/post-incrementing vector load/stores." r223862 tried to also combine base-updating load/stores. r224198 reverted it, as "it created a regression on the test-suite on test MultiSource/Benchmarks/Ptrdist/anagram by scrambling the order in which the words are shown." Reapply, with a fix to ignore non-normal load/stores. Truncstores are handled elsewhere (you can actually write a pattern for those, whereas for postinc loads you can't, since they return two values), but it should be possible to also combine extloads base updates, by checking that the memory (rather than result) type is of the same size as the addend. Original commit message: We used to only combine intrinsics, and turn them into VLD1_UPD/VST1_UPD when the base pointer is incremented after the load/store. We can do the same thing for generic load/stores. Note that we can only combine the first load/store+adds pair in a sequence (as might be generated for a v16f32 load for instance), because other combines turn the base pointer addition chain (each computing the address of the next load, from the address of the last load) into independent additions (common base pointer + this load's offset). Differential Revision: http://reviews.llvm.org/D6585 llvm-svn: 224203	2014-12-13 23:22:12 +00:00
Renato Golin	3418b50014	Revert "[ARM] Combine base-updating/post-incrementing vector load/stores." This reverts commit r223862, as it created a regression on the test-suite on test MultiSource/Benchmarks/Ptrdist/anagram by scrambling the order in which the words are shown. We'll investigate the issue and re-apply when safe. llvm-svn: 224198	2014-12-13 20:23:18 +00:00
Akira Hatanaka	e6d6f49584	Rename argument strings of codegen passes to avoid collisions with command line options. This commit changes the command line arguments (PassInfo::PassArgument) of two passes, MachineFunctionPrinter and MachineScheduler, to avoid collisions with command line options that have the same argument strings. This bug manifests when the PassList construct (defined in opt.cpp) is used in a tool that links with codegen passes. To reproduce the bug, paste the following lines into llc.cpp and run llc. #include "llvm/IR/LegacyPassNameParser.h" static llvm:🆑:list<const llvm::PassInfo*, bool, llvm::PassNameParser> PassList(llvm:🆑:desc("Optimizations available:")); rdar://problem/19212448 llvm-svn: 224186	2014-12-13 04:52:04 +00:00
Hal Finkel	30da0a42c8	[PowerPC] Add a DAGToDAG peephole to remove unnecessary zero-exts On PPC64, we end up with lots of i32 -> i64 zero extensions, not only from all of the usual places, but also from the ABI, which specifies that values passed are zero extended. Almost all 32-bit PPC instructions in PPC64 mode are defined to do something to the higher-order bits, and for some instructions, that action clears those bits (thus providing a zero-extended result). This is especially common after rotate-and-mask instructions. Adding an additional instruction to zero-extend the results of these instructions is unnecessary. This PPCISelDAGToDAG peephole optimization examines these zero-extensions, and looks back through their operands to see if all instructions will implicitly zero extend their results. If so, we convert these instructions to their 64-bit variants (which is an internal change only, the actual encoding of these instructions is the same as the original 32-bit ones) and remove the unnecessary zero-extension (changing where the INSERT_SUBREG instructions are to make everything internally consistent). llvm-svn: 224169	2014-12-12 23:59:36 +00:00
David Majnemer	70ded5026c	ValueTracking: Don't recurse too deeply in computeKnownBitsFromAssume Respect the MaxDepth recursion limit, doing otherwise will trigger an assert in computeKnownBits. This fixes PR21891. llvm-svn: 224168	2014-12-12 23:59:29 +00:00
Chad Rosier	b5c6a89ee6	[ARMConstantIsland] Insert tbb/tbh optimization where previous jump table resided. llvm-svn: 224165	2014-12-12 23:27:40 +00:00
Colin LeMahieu	e750275948	[Hexagon] Adding double word add/min/minu/max/maxu instructions and tests. llvm-svn: 224153	2014-12-12 21:29:25 +00:00
Nico Weber	b1b6a861e5	Revert r224149, llvm-dsymutil was already here. I saw a failure on an internal bot, opened this file, saw it was missing, thought "aha!", tried to land, got an "file is out of date", synced, didn't see the file listed right above the line I added (cause I didn't add it in the right place) and landed. Apologies! llvm-svn: 224152	2014-12-12 21:25:07 +00:00
Colin LeMahieu	980129091a	[Hexagon] Adding J class call instructions. llvm-svn: 224150	2014-12-12 21:12:27 +00:00
Nico Weber	5b22824a0f	Add llvm-dsymutil to test/CMakeLists.txt r224134 added this and runs it from a test, but doesn't build it with test binaries. llvm-svn: 224149	2014-12-12 20:56:49 +00:00
Reid Kleckner	516fcdc230	Relax debug-map-parsing.test error message check for Windows On Windows we get the string "no such file or directory". llvm-svn: 224141	2014-12-12 18:52:07 +00:00

1 2 3 4 5 ...

27549 Commits