llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 13:11:39 +01:00

Author	SHA1	Message	Date
Daniil Fukalov	1683768871	[AMDGPU] add LDS f32 intrinsics added llvm.amdgcn.atomic.{add|min|max}.f32 intrinsics to allow generate ds_{add|min|max}[_rtn]_f32 instructions needed for OpenCL float atomics in LDS Reviewed by: arsenm Differential Revision: https://reviews.llvm.org/D37985 llvm-svn: 322656	2018-01-17 14:05:05 +00:00
Dmitry Preobrazhensky	fda42c21ab	[AMDGPU][MC][GFX9] Enable inline constants for SDWA operands See bug 35771: https://bugs.llvm.org/show_bug.cgi?id=35771 Differential Revision: https://reviews.llvm.org/D42058 Reviewers: vpykhtin, artem.tamazov, arsenm llvm-svn: 322655	2018-01-17 14:00:48 +00:00
Diana Picus	7841f95817	[ARM GlobalISel] Legalize G_FPEXT and G_FPTRUNC Mark G_FPEXT and G_FPTRUNC as legal or libcall, depending on hardware support, but only for conversions between float and double. Also add the necessary boilerplate so that the LegalizerHelper can introduce the required libcalls. This also works only for float and double, but isn't too difficult to extend when the need arises. llvm-svn: 322651	2018-01-17 13:34:10 +00:00
Ivan A. Kosarev	0fdfdf2dc8	[Transforms] Support making mutable versions of new-format TBAA access tags Differential Revision: https://reviews.llvm.org/D41565 llvm-svn: 322650	2018-01-17 13:29:54 +00:00
Benjamin Kramer	5dc3847399	[X86] Don't mutate shuffle arguments after early-out for AVX512 The match* functions have the annoying behavior of modifying their inputs. Save and restore the inputs, just in case the early out for AVX512 is hit. This is still not great and it's only a matter of time before this kind of bug happens again, but I couldn't come up with a better pattern without rewriting significant chunks of this code. Fixes PR35977. llvm-svn: 322644	2018-01-17 13:01:06 +00:00
Pavel Labath	b166b499f9	Don't emit apple accelerator tables on non-darwin targets Summary: Currently -glldb turns on emission of apple tables on all targets, but lldb is only really capable of consuming them on darwin. Furthermore, making lldb consume these tables is not straight-forward because of the differences in how the debug info is distributed on darwin vs. elf targets. The darwin debug model assumes that the debug info (along with accelerator tables) will either remain in the .o files or it will be linked into a dsym bundle by a linker that knows how to merge these tables. In the elf world, all present linkers will simply concatenate these accelerator tables into the shared object. Since the tables are not self-terminating, this renders the tables unusable, as the debugger cannot pry the individual tables apart anymore. It might theoretically be possible to make the tables work with split dwarf, as that is somewhat similar to the apple .o model, but unfortunately right now the combination of -glldb and -gsplit-dwarf produces broken object files. Until these issues are resolved there is no point in emitting the apple tables for these targets. At best, it wastes space; at worst, it breaks compilation and prevents the user from getting other benefits of -glldb. Reviewers: probinson, aprantl, dblaikie Subscribers: emaste, dim, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D41986 llvm-svn: 322633	2018-01-17 11:52:13 +00:00
Pavel Labath	ed43a0e5cd	Rewrite debugger tuning test case to not depend on apple sections Summary: In a follow-up commit I'll change the rules for emission of accelerator tables, which means we won't be able to use them as a litmus test for the debugger tuning options. Instead of sections, I base the test on the presence/absence of some debug info attributes and opcodes: LLDB - prefers DW_OP_form_tls_address and uses DW_AT_APPLE_optimized GDB - prefers DW_OP_GNU_push_tls_address and does not use the optimized attribute SCE - prefers DW_OP_form_tls_address and does not use the optimized attribute Reviewers: probinson, aprantl, dblaikie Subscribers: JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D41985 llvm-svn: 322630	2018-01-17 11:11:53 +00:00
Simon Pilgrim	9c2f4584d2	[X86][AVX] Add extra 'interleaved+lanepermute' shuffle test Possible missed opportunity to use 64-bit lane permute on AVX1 in lowerShuffleAsRepeatedMaskAndLanePermute llvm-svn: 322628	2018-01-17 10:56:54 +00:00
Andrew V. Tischenko	1fb380cedb	Allow usage of X86-prefixes as separate instrs. Differential Revision: https://reviews.llvm.org/D42102 llvm-svn: 322623	2018-01-17 10:12:06 +00:00
Sean Eveson	b01a058fa6	[MC] Fix -stack-size-section on ARM Change symbol values in the stack_size section from being 8 bytes, to being a target dependent size. Differential Revision: https://reviews.llvm.org/D42108 llvm-svn: 322619	2018-01-17 09:01:29 +00:00
Aaron Smith	26f42ee3dd	[pdbutil] Replace 0 byte PDB input with correct version to fix failing unit test llvm-svn: 322614	2018-01-17 03:48:07 +00:00
Aaron Smith	a4bf47e131	Fix pretty printing the unspecified param of a variadic function Summary: - Fix a bug in PrettyBuiltinDumper that returns "void" as the name for an unspecified builtin type. Since the unspecified param of a variadic function is considered a builtin of unspecified type in PDBs, we set "..." for its name. - Provide a method to determine if a PDBSymbolFunc is variadic in PrettyFunctionDumper since PDBSymbolFunc::getArgument() doesn't return the last unspecified-type param. - Add a pretty-func-dumper.test to test pretty dumping of variadic functions. Reviewers: zturner, llvm-commits Reviewed By: zturner Differential Revision: https://reviews.llvm.org/D41801 llvm-svn: 322608	2018-01-17 01:22:03 +00:00
Evgeniy Stepanov	5ebd4c69d4	[hwasan] Rename sized load/store callbacks to be consistent with ASan. Summary: __hwasan_load is now __hwasan_loadN. Reviewers: kcc Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D42138 llvm-svn: 322601	2018-01-16 23:15:08 +00:00
Simon Pilgrim	15dfa1bfc1	[X86][BTVER2] Fix scheduling of VCMPSD/VCMPSS instructions For some reason they don't have a trailing i like the packed equivalents. llvm-svn: 322600	2018-01-16 22:15:41 +00:00
Florian Hahn	19dae95402	[CallSiteSplitting] Pass list of (BB, Conditions) pairs to splitCallSite. This removes some duplication from splitCallSite and makes it easier to add additional code dealing with each predecessor. It also allows us to split for more than 2 predecessors, although that is not enabled for now. Reviewers: junbuml, mcrosier, davidxl, davide Reviewed By: junbuml Differential Revision: https://reviews.llvm.org/D41858 llvm-svn: 322599	2018-01-16 22:13:15 +00:00
Volkan Keles	eea46f246c	[GlobalISel][TableGen] Add support for SDNodeXForm Summary: This patch adds CustomRenderer which renders the matched operands to the specified instruction. Targets can enable the matching of SDNodeXForm by adding a definition that inherits from GICustomOperandRenderer and GISDNodeXFormEquiv as follows. def gi_imm8 : GICustomOperandRenderer<"renderImm8">, GISDNodeXFormEquiv<imm8_xform>; Custom renderer functions should be of the form: void render(MachineInstrBuilder &MIB, const MachineInstr &I); Reviewers: dsanders, ab, rovka Reviewed By: dsanders Subscribers: kristof.beyls, javed.absar, llvm-commits, mgrang, qcolombet Differential Revision: https://reviews.llvm.org/D42012 llvm-svn: 322582	2018-01-16 18:44:05 +00:00
Alexey Bataev	2364802067	[SLP] Fix for PR32164: Improve vectorization of reverse order of extract operations. Summary: Sometimes vectorization of insertelement instructions with extractelement operands may produce an extra shuffle operation, if these operands are in the reverse order. Patch tries to improve this situation by the reordering of the operands to remove this extra shuffle operation. Reviewers: mkuper, hfinkel, RKSimon, spatel Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D33954 llvm-svn: 322579	2018-01-16 18:17:01 +00:00
Simon Pilgrim	3c09cf2a76	[X86][MMX] Accept UNDEF upper bits for MOVD GR32->MMX llvm-svn: 322574	2018-01-16 17:01:31 +00:00
Petar Jovanovic	e54d2c7806	[LiveDebugValues] update kill-after-spill test with target triple Set target triple to "x86_64-unknown-linux-gnu". llvm-svn: 322568	2018-01-16 15:57:03 +00:00
Petar Jovanovic	b90909be63	[LiveDebugValues] recognize spilled reg killed in instruction after spill Current condition for spill instruction recognition in LiveDebugValues does not recognize case when register is spilled and killed in next instruction. Patch by Nikola Prica. Differential Revision: https://reviews.llvm.org/D41226 llvm-svn: 322554	2018-01-16 14:46:05 +00:00
Simon Pilgrim	d94666a84c	[X86][MMX] Improve MMX constant generation Extend the MMX zero code to take any constant with zero'd upper 32-bits llvm-svn: 322553	2018-01-16 14:21:28 +00:00
Gadi Haber	d141ef10e6	[X86][I86,I186,I286,I386,I486,PPRO, MMX]: Adding full coverage of MC encoding for the I86, I186, I286, I386, I486, PPRO and MMX isa sets.<NFC> NFC. Adding MC regressions tests to cover the I86, I186, I286, I386, I486, PPRO and MMX isa sets. This patch is part of a larger task to cover MC encoding of all X86 ISA Sets. Started in revision: https://reviews.llvm.org/D39952 Reviewers: zvi, RKSimon, AndreiGrischenko, craig.topper Differential Revision: https://reviews.llvm.org/D40879 Change-Id: I231a35861611bfd3d23c74cc59507373f021a629 llvm-svn: 322544	2018-01-16 11:33:45 +00:00
Jonas Devlieghere	54c6b81933	[DebugInfo] Unify dumping of address ranges Summary: This patch unifies the printing of address ranges as [0x0, 0x1). rdar://34822059 Reviewers: aprantl, dblaikie Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D42056 llvm-svn: 322543	2018-01-16 11:17:57 +00:00
Gadi Haber	6e5511c95c	[X86][XSAVE]: Adding full coverage of MC encoding for the XSAVE isa sets.<NFC> NFC. Adding MC regressions tests to cover the XSAVE ISA sets. This patch is part of a larger task to cover MC encoding of all X86 ISA Sets started in revision: https://reviews.llvm.org/D39952 Reviewers: zvi, RKSimon, AndreiGrischenko, craig.topper Differential Revision: https://reviews.llvm.org/D41282 Change-Id: I325bf8f421f78c80179a04fc39033366759cbe45 llvm-svn: 322537	2018-01-16 08:50:29 +00:00
George Rimar	aa78157c95	[FileCheck] - Fix possible buffer out of bounds access when parsing --check-prefix. FileCheck tool crashes when trying to parse --check-prefix argument if there is no any data after it. For example test like following would crash if there are no symbols and no EOL mark after `boom`: # REQUIRES: x86 # RUN: <skipped few lines> # RUN: llvm-readobj -t %t | FileCheck %s --check-prefix=boom Patch fixes the issue. Differential revision: https://reviews.llvm.org/D42057 llvm-svn: 322536	2018-01-16 08:09:24 +00:00
Yonghong Song	4f63cbd37f	[BPF] Teach DAG2DAG AND elimination about load intrinsics As commented on the existing code: // The Reg operand should be a virtual register, which is defined // outside the current basic block. DAG combiner has done a pretty // good job in removing truncating inside a single basic block. However, when the Reg operand comes from bpf_load_[byte | half | word] intrinsics, the generic optimizer doesn't understand their results are zero extended, so these single basic block elimination opportunities were missed. Acked-by: Jakub Kicinski <jakub.kicinski@netronome.com> Acked-by: Yonghong Song <yhs@fb.com> Signed-off-by: Jiong Wang <jiong.wang@netronome.com> llvm-svn: 322534	2018-01-16 07:27:19 +00:00
Hiroshi Inoue	98e064bb0e	[SROA] fix assertion failure This patch fixes the assertion failure in SROA reported in PR35657. PR35657 reports the assertion failure due to r319522 (splitting for non-whole-alloca slices), but this problem can happen even without r319522. The problem exists in a check for reusing an existing alloca when rewriting partitions. As the original comment said, we can reuse the existing alloca if the new alloca has the same type and offset with the existing one. But the code checks only type of the alloca and then check the offset using an assert. In a corner case with out-of-bounds access (e.g. @PR35657 function added in unit test), it is possible that the two allocas have the same type but different offsets. This patch makes the check of the offset in the if condition, and re-enables the splitting for non-whole-alloca slices. Differential Revision: https://reviews.llvm.org/D41981 llvm-svn: 322533	2018-01-16 06:23:05 +00:00
Craig Topper	c065faaccf	[X86] Make 'xchgq %rax, %rax' an alias for the 0x90 nop encoding to match gas. Previously we encoded it as 0x48 0x90. llvm-svn: 322531	2018-01-16 06:07:14 +00:00
Simon Pilgrim	a8d108687b	[X86][MMX] Add support for MMX zero vector creation As mentioned on PR35869, (and came up recently on D41517) we don't create a MMX zero register via the PXOR but instead perform a spill to stack from a XMM zero register. This patch adds support for direct MMX zero vector creation and should make it easier to add better constant vector creation in the future as well. Differential Revision: https://reviews.llvm.org/D41908 llvm-svn: 322525	2018-01-15 22:32:40 +00:00
Simon Pilgrim	9a1d5eeb3e	[X86][SSE] Add custom execution domain fixing for BLENDPD/BLENDPS/PBLENDD/PBLENDW (PR34873) Add support for custom execution domain fixing and implement support for BLENDPD/BLENDPS/PBLENDD/PBLENDW. Differential Revision: https://reviews.llvm.org/D42042 llvm-svn: 322524	2018-01-15 22:18:45 +00:00
Sanjay Patel	57be7f765d	[x86] add tests to show missed constant shrinking (PR35907); NFC llvm-svn: 322523	2018-01-15 21:57:41 +00:00
Sanjay Patel	d5efb59dec	[x86] regenerate test checks; NFC llvm-svn: 322522	2018-01-15 21:32:39 +00:00
Sanjay Patel	48d1ab9f39	[x86] regenerate test checks; NFC llvm-svn: 322521	2018-01-15 21:28:52 +00:00
Sanjay Patel	9572845598	[x86] regenerate test checks; NFC llvm-svn: 322519	2018-01-15 21:22:46 +00:00
Stanislav Mekhanoshin	b2de8bbb4a	[AMDGPU] Add HW_REG_SH_MEM_BASES symbolic name for s_getreg_b32 Differential Revision: https://reviews.llvm.org/D41617 llvm-svn: 322500	2018-01-15 18:49:15 +00:00
Krzysztof Parzyszek	d71dcfa234	[Hexagon] Rewrite LowerVECTOR_SHUFFLE for 32-/64-bit vectors The old implementation was not always correct. The new one recognizes more shuffles that match specific instructions. llvm-svn: 322498	2018-01-15 18:33:33 +00:00
Jonas Paulsson	e8a35d1b36	[SystemZ] Check for legality before doing LOAD AND TEST transformations. Since a load and test instruction treat its operands as signed, it can only replace a logical compare for EQ/NE uses. Review: Ulrich Weigand https://bugs.llvm.org/show_bug.cgi?id=35662 llvm-svn: 322488	2018-01-15 15:41:26 +00:00
Andrew V. Tischenko	99ef3709ba	Update BTVER2 sched numbers for some AVX instructions (xmm version). Differential Revision: https://reviews.llvm.org/D40067 llvm-svn: 322485	2018-01-15 14:21:11 +00:00
Benjamin Kramer	c98b7c0b21	Revert "[DAG] Elide overlapping stores" This reverts commit r322085. Internal PPC testing is still showing the same symptoms as when this patch landed the last time. llvm-svn: 322474	2018-01-15 10:57:24 +00:00
Andrei Elovikov	eebe9ed57e	[LV] Don't call recordVectorLoopValueForInductionCast for newly-created IV from a trunc. Summary: This method is supposed to be called for IVs that have casts in their use-def chains that are completely ignored after vectorization under PSE. However, for truncates of such IVs the same InductionDescriptor is used during creation/widening of both original IV based on PHINode and new IV based on TruncInst. This leads to unintended second call to recordVectorLoopValueForInductionCast with a VectorLoopVal set to the newly created IV for a trunc and causes an assert due to attempt to store new information for already existing entry in the map. This is wrong and should not be done. Fixes PR35773. Reviewers: dorit, Ayal, mssimpso Reviewed By: dorit Subscribers: RKSimon, dim, dcaballe, hsaito, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D41913 llvm-svn: 322473	2018-01-15 10:56:07 +00:00
Gadi Haber	0860369631	[X86][AVX512F_512]: Adding full coverage of MC encoding for the AVX512F 512 bits isa sets.<NFC> NFC. Adding MC regressions tests to cover the AVX512F_512 isa sets both 32 and 64 bit. This patch is part of a larger task to cover MC encoding of all X86 ISA Sets. started in revision: https://reviews.llvm.org/D39952 Reviewers: zvi, craig.topper, RKSimon, AndreiGrischenko Differential Revision: https://reviews.llvm.org/D41172 Change-Id: I46aa33dd967d63d33f67d1988ad42d8df2081e39 llvm-svn: 322471	2018-01-15 09:39:08 +00:00
Mikael Holmen	455a18e971	[GlobalsAA] Don't let dbg intrinsics affect analysis result Summary: This fixes PR35899. Debug info intrinsics shouldn't affect code generation so ignore them in GlobalsAA. Reviewers: hfinkel, aprantl Reviewed By: aprantl Subscribers: aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D41984 llvm-svn: 322470	2018-01-15 07:05:51 +00:00
Davide Italiano	1e4933df39	[BasicAA] Stop crashing when dealing with pointers > 64 bits. An alternative (and probably better) fix would be that of making `Scale` an APInt, and there's a patch floating around to do this. As we're still discussing it, at least stop crashing in the meanwhile (added bonus, we now have a regression test for this situation). Fixes PR35843. Thanks to Eli for suggesting the fix and Simon for reporting and reducing the bug. llvm-svn: 322467	2018-01-15 01:40:18 +00:00
Simon Pilgrim	9aea8c1468	[X86][SSE] Tag PR21137 test case The test was added ages ago, but we didn't comment where it came from. llvm-svn: 322465	2018-01-14 21:59:43 +00:00
Craig Topper	bcb01a0298	[X86] Add test cases for D41794. llvm-svn: 322464	2018-01-14 20:53:49 +00:00
Simon Pilgrim	fda81ea77c	[X86][SSE] Add PR22391 test case llvm-svn: 322463	2018-01-14 19:57:50 +00:00
Craig Topper	90b0c61a22	[X86] Autoupgrade kunpck intrinsics using vector operations instead of scalar operations Summary: This patch changes the kunpck intrinsic autoupgrade to use vXi1 shufflevector operations to perform vector extracts and concats. This more closely matches the definition of the kunpck instructions. Currently we rely on a DAG combine to turn the scalar shift/and/or code into a concat vectors operation. By doing it in the IR we get this for free. Reviewers: spatel, RKSimon, zvi, jina.nahias Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42018 llvm-svn: 322462	2018-01-14 19:24:10 +00:00
Simon Pilgrim	35b185fca9	[X86] Regenerate fp128 test llvm-svn: 322460	2018-01-14 19:07:41 +00:00
Simon Pilgrim	6556cecd6b	[X86][SSE] Support combining MOVLHPS undef inputs llvm-svn: 322459	2018-01-14 18:50:34 +00:00
Simon Pilgrim	73ca4978e5	[X86][SSE] Add v2f64 3u shuffle test Shows a missed opportunity to remove an unnecessary move compared to 31 shuffle mask. llvm-svn: 322458	2018-01-14 18:38:21 +00:00


50223 Commits