llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Craig Topper	1904f2268b	[X86][AVX512] Strengthen the assertions from r269001. We need VLX to use the 128/256-bit move opcodes for extended registers. llvm-svn: 269019	2016-05-10 05:28:04 +00:00
Craig Topper	601759c5c1	[X86] Add ZMM registers to the X86_INTR calling convention preserved mask when AVX512 is enabled. llvm-svn: 269018	2016-05-10 05:28:02 +00:00
Craig Topper	aa2acf8da0	[X86] Update X86_INTR calling convention to save ZMM registers instead of YMM registers when AVX512 is enabled. llvm-svn: 269017	2016-05-10 05:27:56 +00:00
Dan Gohman	13d15dbc40	[WebAssembly] Move register stackification and coloring to a late phase. Move the register stackification and coloring passes to run very late, after PEI, tail duplication, and most other passes. This means that all code emitted and expanded by those passes is now exposed to these passes. This also eliminates the need for prologue/epilogue code to be manually stackified, which significantly simplifies the code. This does require running LiveIntervals a second time. It's useful to think of these late passes not as late optimization passes, but as a domain-specific compression algorithm based on knowledge of liveness information. It's used to compress the code after all conventional optimizations are complete, which is why it uses LiveIntervals at a phase when actual optimization passes don't typically need it. Differential Revision: http://reviews.llvm.org/D20075 llvm-svn: 269012	2016-05-10 04:24:02 +00:00
Matthias Braun	556abb392a	CodeGen: Move TargetPassConfig from Passes.h to an own header; NFC Many files include Passes.h but only a fraction needs to know about the TargetPassConfig class. Move it into an own header. Also rename Passes.cpp to TargetPassConfig.cpp while we are at it. llvm-svn: 269011	2016-05-10 03:21:59 +00:00
Quentin Colombet	4c5a2694f9	[X86][AVX512] Use the proper load/store for AVX512 registers. When loading or storing AVX512 registers we were not using the AVX512 variant of the load and store for VR128 and VR256 like registers. Thus, we ended up with the wrong encoding and actually were dropping the high bits of the instruction. The result was that we load or store the wrong register. The effect is visible only when we emit the object file directly and disassemble it. Then, the output of the disassembler does not match the assembly input. This is related to llvm.org/PR27481. llvm-svn: 269001	2016-05-10 01:09:14 +00:00
Justin Lebar	e9c3e87ae5	[NVPTX] Change begin/end inline asm comments to "begin/end inline asm". Previously it was just "// inline asm", which made it tricky to read code with lots of inline assembly. llvm-svn: 268994	2016-05-10 00:31:22 +00:00
Derek Schuff	15d6ef9c78	[WebAssembly] Disable 128-bit shift libcalls Currently the signature of the functions i128(i128, i32) aka void(i32, i64, i64, i32) doesn't match the signature of the call emitted by the default lowering, void(i32, i64, i64). llvm-svn: 268991	2016-05-10 00:14:07 +00:00
Justin Bogner	2067192e3a	SDAG: Stop relying on Select's return value in SystemZ's splitLargeImmediate. NFC The call to Select on Upper here happens in an unusual order in order to defeat the constant folding that getNode() does. Add a comment explaining why we can't just move the Select to later to avoid a Handle, and wrap the call to SelectCode in a handle so we don't need its return value. This is part of the work to have Select return void instead of an SDNode *, which is in turn part of llvm.org/pr26808. llvm-svn: 268990	2016-05-09 23:54:23 +00:00
Quentin Colombet	1cf0e63b3f	[X86] Fix the AllRegs AVX calling convention. We used to list registers that were not in the AVX space. In other words, we were pushing registers that the ISA cannot encode (YMM16-YMM31). This is part of llvm.org/PR27481. llvm-svn: 268983	2016-05-09 22:37:05 +00:00
Quentin Colombet	38fc77229e	[X86] Strengthen the setting of inline asm constraints for fp regclasses. This is similar to r268953, but for floating point and vector register classes. Explanations: The setting of the inline asm constraints was implicitly relying on the order of the register classes in the file generated by tablegen. Since, we do not have any control on that order, make sure we do not depend on it anymore. llvm-svn: 268973	2016-05-09 21:24:31 +00:00
Simon Pilgrim	01729bd834	[X86][SSE] Improve cost model for i64 vector comparisons on pre-SSE42 targets As discussed on PR24888, until SSE42 we don't have access to PCMPGTQ for v2i64 comparisons, but the cost models don't reflect this, resulting in over-optimistic vectorizaton. This patch adds SSE2 'base level' costs that match what a typical target is capable of and only reduces the v2i64 costs at SSE42. Technically SSE41 provides a PCMPEQQ v2i64 equality test, but as getCmpSelInstrCost doesn't give us a way to discriminate between comparison test types we can't easily make use of this, otherwise we could split the cost of integer equality and greater-than tests to give better costings of each. Differential Revision: http://reviews.llvm.org/D20057 llvm-svn: 268972	2016-05-09 21:14:38 +00:00
Quentin Colombet	6707ab391d	[X86] Drop the 64-bit alignment for LOW32_ADDR_ACCESS register class. The only 64-bit register in that register class is RIP and it will not get spilled in the current ABIs. llvm-svn: 268963	2016-05-09 19:50:30 +00:00
Quentin Colombet	3f37fd8049	Reapply [X86] Add a new LOW32_ADDR_ACCESS_RBP register class. This reapplies commit r268796, with a fix for the setting of the inline asm constraints. I.e., "mark" LOW32_ADDR_ACCESS_RBP as a GR variant, so that the regular processing of the GR operands (setting of the subregisters) happens. Original commit log: [X86] Add a new LOW32_ADDR_ACCESS_RBP register class. ABIs like NaCl uses 32-bit addresses but have 64-bit frame. The new register class reflects those constraints when choosing a register class for a address access. llvm-svn: 268955	2016-05-09 19:01:46 +00:00
Quentin Colombet	09ecef5209	[X86] Strengthen the setting of inline asm constraints. The setting of the inline asm constraints was implicitly relying on the order of the register classes in the file generated by tablegen. Since, we do not have any control on that order, make sure we do not depend on it anymore. llvm-svn: 268953	2016-05-09 19:01:35 +00:00
Nemanja Ivanovic	286a9532e8	[Power9] Add support for -mcpu=pwr9 in the back end This patch corresponds to review: http://reviews.llvm.org/D19683 Simply adds the bits for being able to specify -mcpu=pwr9 to the back end. llvm-svn: 268950	2016-05-09 18:54:58 +00:00
Krzysztof Parzyszek	72e4e48963	[Hexagon] Treat all conditional branches as predicted (not-taken by default) llvm-svn: 268946	2016-05-09 18:22:07 +00:00
Daniel Sanders	36ce7ed6b5	[mips] Fix a partially initialized member variable that was introduced in r268896. llvm-svn: 268938	2016-05-09 17:42:04 +00:00
Simon Pilgrim	0f2ef1de0a	Fixed unused but set variable warning llvm-svn: 268931	2016-05-09 16:42:23 +00:00
Matt Arsenault	f726e32980	AMDGPU: Fold shift into cvt_f32_ubyteN llvm-svn: 268930	2016-05-09 16:29:50 +00:00
Daniel Sanders	77f004d65f	[mips] Try to fix 'truncation from FindBestPredicateResult to bool' reported by MSVC llvm-svn: 268928	2016-05-09 15:50:15 +00:00
Daniel Sanders	dd90de11d3	[mips][ias] Attempt to fix 'not all control paths return a value' reported by MSVC. llvm-svn: 268927	2016-05-09 15:37:52 +00:00
Daniel Sanders	76aeb9f378	[mips][micromips] Make getPointerRegClass() result depend on the instruction. Summary: Previously, it returned the GPR16MMRegClass for all instructions which was incorrect for instructions like lwsp/lwgp and unnecesarily restricted the permitted registers for instructions like lw32. This fixes quite a few of the -verify-machineinstrs errors reported in PR27458. I've only added -verify-machineinstrs to one test in this change since I understand there is a plan to enable the verifier by default. Reviewers: hvarga, zbuljan, zoran.jovanovic, sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: http://reviews.llvm.org/D19873 llvm-svn: 268918	2016-05-09 13:38:25 +00:00
Simon Pilgrim	3ddc1a30df	[X86][SSE] Added TODO comment to add support for AVX512 mask registers to shuffle comments This came up in discussion on D19198 llvm-svn: 268915	2016-05-09 13:30:16 +00:00
Daniel Sanders	22a6cb21be	[mips] Fix use after free and an unnecessary copy introduced in r268896. llvm-svn: 268913	2016-05-09 13:10:57 +00:00
Strahinja Petrovic	e297a43f7c	[PowerPC] fix register alignment for long double type This patch fixes register alignment for long double type in soft float mode. Before this patch alignment was 8 and this patch changes it to 4. Differential Revision: http://reviews.llvm.org/D18034 llvm-svn: 268909	2016-05-09 12:27:39 +00:00
Chris Dewhurst	9cefba091e	[Sparc][LEON] Add UMAC and SMAC instruction support for Sparc LEON subtargets This change adds SMAC (signed multiply-accumulate) and UMAC (unsigned multiply-accumulate) for LEON subtargets of the Sparc processor. The new files LeonFeatures.td and leon-instructions.ll will both be expanded in future, so I want to leave them separate as small files for this review, to be expanded in future check-ins. Note: The functions are provided only for inline-assembly provision. No DAG selection is provided. Differential Revision: http://reviews.llvm.org/D19911 llvm-svn: 268908	2016-05-09 11:55:15 +00:00
Silviu Baranga	edcf928591	[AArch64] Implement lowering of the X constraint on AArch64 Summary: This implements the lowering of the X constraint on AArch64. The default behaviour of the X constraint lowering is to restrict it to "f". This is a problem because the "f" constraint is not implemented on AArch64 and would be too restrictive anyway. Therefore, the AArch64 hook will lower this to "w" (if the operand is a floating point or vector) or "r" otherwise. The implementation is similar with the one added for ARM (r267411). This is the AArch64 side of the fix for http://llvm.org/PR26493 Reviewers: rengolin Subscribers: aemerson, rengolin, llvm-commits, t.p.northover Differential Revision: http://reviews.llvm.org/D19967 llvm-svn: 268907	2016-05-09 11:10:44 +00:00
Benjamin Kramer	8779c9c4d8	Revert "[Mips] Fix use after free." Fixes use after free but breaks tests. This reverts commit r268901. llvm-svn: 268902	2016-05-09 10:31:17 +00:00
Benjamin Kramer	5385b15220	[Mips] Fix use after free. llvm-svn: 268901	2016-05-09 10:21:56 +00:00
Daniel Sanders	828a69bd74	[mips][ias] R_MIPS_(GOT\|HI\|LO\|PC)16 and R_MIPS_GPREL32 do not need symbols. Summary: In theory, care must be taken to ensure that pairs of R_MIPS_(GOT\|HI\|LO)16 make the same decision on both relocs in the reloc pair but in practice this isn't as hard as it sounds and only limits the complexity of the predicate used. We handle all three with the same code to ensure their decisions always agree with each other. Reviewers: sdardis Subscribers: rafael, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D19016 llvm-svn: 268900	2016-05-09 10:21:14 +00:00
Zlatko Buljan	4c55849feb	[mips][microMIPS] Implement LWP and SWP instructions Differential Revision: http://reviews.llvm.org/D10640 llvm-svn: 268896	2016-05-09 08:07:28 +00:00
Craig Topper	f0db9d6006	[X86] Strengthen some type contraints for floating point round and extend. llvm-svn: 268892	2016-05-09 05:34:14 +00:00
Craig Topper	1a7c59b5bb	[AVX512] Fix up types for arguments of int_x86_avx512_mask_cvtsd2ss_round and int_x86_avx512_mask_cvtss2sd_round. Only the argument being converted should be a different type. The other 2 argument should have the same type as the result. llvm-svn: 268891	2016-05-09 05:34:12 +00:00
Craig Topper	79a42b734a	[AVX512] Add non-temporal store patterns for v16i32/v32i16/v64i8. llvm-svn: 268889	2016-05-08 23:43:17 +00:00
Craig Topper	003ac12540	[AVX512] Add missing patterns for non-temporal stores of 128/256-bit vXi8/vXi16/vXi32 when VLX is enabled. The equivalent AVX1/2 patterns are disabled by VLX. This caused regular stores to be emitted instead. llvm-svn: 268886	2016-05-08 23:08:45 +00:00
Craig Topper	b6baa777fe	[AVX512] Change predicates on some vXi16/vXi8 AVX store patterns so they stay enabled unless VLX and BWI instructions are supported." Without this we could fail instruction selection if VLX was enabled, but BWI wasn't. llvm-svn: 268885	2016-05-08 23:08:40 +00:00
Craig Topper	dd43e37f21	[AVX512] Add VLX 128/256-bit SET0 operations that encode to 128/256-bit EVEX encoded VPXORD so all 32 registers can be used. llvm-svn: 268884	2016-05-08 21:33:53 +00:00
Craig Topper	872733a7d7	[X86] Remove extra patterns that check for BUILD_VECTOR of all 0s. These are always canonicalized to v4i32/v8i32/v16i32 except for in SSE1 only when only v4f32 is supported. llvm-svn: 268880	2016-05-08 20:10:20 +00:00
David Majnemer	ed9abdfae3	[X86] Promote several single precision FP libcalls on Windows A number of libcalls don't exist in any particular lib but are, instead, defined in math.h as inline functions (even in C mode!). Don't rely on their existence when lowering @llvm.{cos,sin,floor,..}.f32, promote them instead. N.B. We had logic to handle FREM but were missing out on a number of others. This change generalizes the FREM handling. llvm-svn: 268875	2016-05-08 08:15:50 +00:00
Craig Topper	ce30d72448	[X86] Lower 256-bit vector all-zero constants to v8i32 even with AVX1 only. Either way a 256-bit VXORPS will be used. llvm-svn: 268873	2016-05-08 07:10:54 +00:00
Craig Topper	17e31eeac8	[X86] Add patterns for 256-bit non-temporal stores when only AVX1 is supported. While there, add a predicate to the SSE2 patterns to avoid an ordering dependency. llvm-svn: 268872	2016-05-08 07:10:50 +00:00
Craig Topper	85b1036796	[X86] No need to avoid selecting AVX_SET0 for 256-bit integer types when only AVX1 is supported. AVX_SET0 just expands to 256-bit VXORPS which is legal in AVX1. llvm-svn: 268871	2016-05-08 07:10:47 +00:00
Weiming Zhao	b94762d920	[ARM] Fix Scavenger assert due to underestimated stack size (re-apply r268810 as it exposed an uninitialized variable in ARM MFI. Patch 268868 should fix that.) Summary: Currently, when checking if a stack is "BigStack" or not, it doesn't count into spills and arguments. Therefore, LLVM won't reserve spill slot for this actually "BigStack". This may cause scavenger failure. Reviewers: rengolin Subscribers: vitalybuka, aemerson, rengolin, tberghammer, danalbert, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D19896 llvm-svn: 268869	2016-05-08 05:11:54 +00:00
Weiming Zhao	85513d3f4e	Fix use-of-uninitialized-value of ARMMachineFunctionInfo Summary: Explicitly initialize ArgumentStackSize to prevent the msan failure. Reviewers: rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D20051 llvm-svn: 268868	2016-05-08 05:04:47 +00:00
Craig Topper	7b7547a4dc	[X86] Fix InstAliases to not allow FARCALL32i/FARCALL16i/FARJMP32i/FARJMP16i in 64-bit mode. llvm-svn: 268863	2016-05-07 19:25:56 +00:00
Simon Pilgrim	ecb5f17d8b	[X86] Pulled out duplicate mask width calculation. NFCI. llvm-svn: 268861	2016-05-07 18:04:24 +00:00
Sanjay Patel	64251c0df0	[x86, BMI] add TLI hook for 'andn' and use it to simplify comparisons For the sake of minimalism, this patch is x86 only, but I think that at least PPC, ARM, AArch64, and Sparc probably want to do this too. We might want to generalize the hook and pattern recognition for a target like PPC that has a full assortment of negated logic ops (orc, nand). Note that http://reviews.llvm.org/D18842 will cause this transform to trigger more often. For reference, this relates to: https://llvm.org/bugs/show_bug.cgi?id=27105 https://llvm.org/bugs/show_bug.cgi?id=27202 https://llvm.org/bugs/show_bug.cgi?id=27203 https://llvm.org/bugs/show_bug.cgi?id=27328 Differential Revision: http://reviews.llvm.org/D19087 llvm-svn: 268858	2016-05-07 15:03:40 +00:00
NAKAMURA Takumi	46124f6758	MipsELFObjectWriter.cpp: Activate debug printer just for +Asserts. [-Wunused-function] llvm-svn: 268848	2016-05-07 04:51:51 +00:00
Vitaly Buka	52cd534788	Revert r268810 becase it brakes msan bot. 16802==WARNING: MemorySanitizer: use-of-uninitialized-value lib/Target/ARM/ARMFrameLowering.cpp:1632 llvm-svn: 268833	2016-05-07 01:54:00 +00:00

1 2 3 4 5 ...

37345 Commits