llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Pete Cooper	febaceff9f	Add missing load/store flags to thumb2 instructions. These were the cause of a verifier error when building 7zip with -verify-machineinstrs. Running 'make check' with the verifier triggered the same error on the test here so I've updated the test to run the verifier on one of its runs instead of adding a new one. While looking at this code, there was a stale comment that these instructions were only used for disassembly. This probably used to be the case, but they are now used in the 'ARM load / store optimization pass' too. This reapplies r242300 which was reverted in r242428 due to bot failures. Ultimately those failures were spurious and completely unrelated to this commit. I reverted this at the time because it was thought to be at fault. llvm-svn: 250969	2015-10-22 01:48:57 +00:00
Matt Arsenault	20a866624d	AMDGPU: Fix adding redundant m0 uses BuildMI already adds these since they are defined correctly now. llvm-svn: 250961	2015-10-21 22:37:51 +00:00
Matt Arsenault	34e6b29b92	AMDGPU: Fix verifier error in SIFoldOperands There may be other use operands that also need their kill flags cleared. This happens in a few tests when SIFoldOperands is moved after PeepholeOptimizer. PeepholeOptimizer rewrites cases that look like: %vreg0 = ... %vreg1 = COPY %vreg0 use %vreg1<kill> %vreg2 = COPY %vreg0 use %vreg2<kill> to use the earlier source to %vreg0 = ... use %vreg0 use %vreg0 Currently SIFoldOperands sees the copied registers, so there is only one use. So far I haven't managed to come up with a test that currently has multiple uses of a foldable VGPR -> VGPR copy. llvm-svn: 250960	2015-10-21 22:37:50 +00:00
Matt Arsenault	7b241d838d	AMDGPU: Split DiagnosticInfoUnsupported into its own file llvm-svn: 250959	2015-10-21 22:37:46 +00:00
Matt Arsenault	d1baf0fb57	AMDGPU: Simplify VOP3 operand legalization. This was checking for a variety of situations that should never happen. This saves a tiny bit of compile time. We should not be selecting instructions with invalid operands in the first place. Most of the time, for registers, copies are inserted to the correct operand register class. For VOP3, since all operand types are supported and literal constants never are, we just need to verify the constant bus requirements (all immediates should be legal inline ones). The only possibly tricky case to maybe worry about is when legalizing operands in moveToVALU with s_add_i32 and similar instructions. If the original s_add_i32 had a literal constant and we need to replace it with v_add_i32_e64 we would have an unsupported literal operand. However, I don't think we should worry about that because SIFoldOperands should handle folding literal constant operands into the SALU instructions based on the uses. At SIFoldOperands time, the legality and profitability of operand types is a bit different. llvm-svn: 250951	2015-10-21 21:51:02 +00:00
Matt Arsenault	a710403d18	AMDGPU: Fix not checking implicit operands in verifyInstruction When verifying constant bus restrictions, this wasn't catching uses in implicit operands. llvm-svn: 250948	2015-10-21 21:15:01 +00:00
Joerg Sonnenberger	314bf5d434	Drop assert that a call with struct return goes to a function with sret attribute. Clang incorrectly misses it on __muldc3 and friends and the type system doesn't include it properly either. llvm-svn: 250938	2015-10-21 20:05:01 +00:00
Sanjay Patel	e2dc1e0494	[x86] move recursive add match for LEA to helper function; NFCI llvm-svn: 250926	2015-10-21 18:56:06 +00:00
Craig Topper	dcce633156	[X86] Add AMD mwaitx, monitorx, and clzero instructions to the assembly parser and disassembler. llvm-svn: 250911	2015-10-21 17:26:45 +00:00
Daniel Sanders	0ad83683fc	[mips][mips16] Re-work the inline assembly stubs to work with IAS. NFC. Summary: Previously, we were inserting an InlineAsm statement for each line of the inline assembly. This works for GAS but it triggers prologue/epilogue emission when IAS is in use. This caused: .set noreorder .cpload $25 to be emitted as: .set push .set reorder .set noreorder .set pop .set push .set reorder .cpload $25 .set pop which led to assembler errors and caused the test to fail. The whitespace-after-comma changes included in this patch are necessary to match the output when IAS is in use. Reviewers: vkalintiris Subscribers: rkotler, llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D13653 llvm-svn: 250895	2015-10-21 12:44:14 +00:00
Daniel Sanders	6fac4ec6e5	[mips][msa] Remove copy_u.d and move copy_u.w to MSA64. Summary: The forwards compatibility strategy employed by MIPS is to consider registers to be infinitely sign-extended. Then on ISA's with a wider register, the result of existing instructions are sign-extended to register width and zero-extended counterparts are added. copy_u.w on MSA32 and copy_u.w on MSA64 violate this strategy and we have therefore corrected the MSA specs to fix this. We still keep track of sign/zero-extension during legalization but we now match copy_s.[wd] where required. No change required to clang since __builtin_msa_copy_u_[wd] will map to copy_s.[wd] where appropriate for the target. Reviewers: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13472 llvm-svn: 250887	2015-10-21 09:58:54 +00:00
Mehdi Amini	a26e394644	Do not use `dyn_cast<X>` after `isa<X>` (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 250883	2015-10-21 06:11:01 +00:00
JF Bastien	b802f22980	WebAssembly: support imports C/C++ code can declare an extern function, which will show up as an import in WebAssembly's output. It's expected that the linker will resolve these, and mark unresolved imports as call_import (I have a patch which does this in wasmate). llvm-svn: 250875	2015-10-21 02:23:09 +00:00
Krzysztof Parzyszek	4625bb8dfb	[Hexagon] Bit-based instruction simplification Analyze bit patterns of operands and values of instructions to perform various simplifications, dead/redundant code elimination, etc. llvm-svn: 250868	2015-10-20 22:57:13 +00:00
Krzysztof Parzyszek	52f05db67a	[Hexagon] Fix isNVStorable flag in .td files An upper half and a double word cannot be used as value sources in a new-value store. llvm-svn: 250867	2015-10-20 22:40:57 +00:00
Krzysztof Parzyszek	55e9c1f1f1	[Hexagon] Capture aggregate variables by reference, not value llvm-svn: 250851	2015-10-20 19:33:46 +00:00
Krzysztof Parzyszek	37affea588	[Hexagon] Do not fall-through if there is no CFG edge llvm-svn: 250850	2015-10-20 19:30:21 +00:00
Krzysztof Parzyszek	f880b062f9	[Hexagon] Use symbolic name for subregister instead of hardcoded number llvm-svn: 250849	2015-10-20 19:26:36 +00:00
Krzysztof Parzyszek	722b0fe061	[Hexagon] Change Based->Base in getBasedWithImmOffset llvm-svn: 250848	2015-10-20 19:21:05 +00:00
Krzysztof Parzyszek	68328f0930	[Hexagon] Remove the remnants of isConstExtProfitable llvm-svn: 250845	2015-10-20 19:04:53 +00:00
Jonas Paulsson	43ad90033d	[SystemZ] Use LivePhysRegs helper class in SystemZShortenInst.cpp. Don't use home brewed liveness tracking code for phys regs, since this class does the job. Reviewed by Ulrich Weigand. llvm-svn: 250829	2015-10-20 15:05:58 +00:00
Artyom Skrobov	d5f3afc063	Adding support for TargetLoweringBase::LibCall Summary: TargetLoweringBase::Expand is defined as "Try to expand this to other ops, otherwise use a libcall." For ISD::UDIV and ISD::SDIV, the choice between the two possibilities was defined in a rather convoluted way: - if DIVREM is legal, expand to DIVREM - if DIVREM has a custom lowering, expand to DIVREM - if DIVREM libcall is defined and a remainder from the same division is computed elsewhere, expand to a DIVREM libcall - else, expand to a DIV libcall This had the undesirable effect that if both DIV and DIVREM are implemented as libcalls, then ISD::UDIV and ISD::SDIV are expanded to the heavier DIVREM libcall, even when the remainder isn't used. The new code adds a new LegalizeAction, TargetLoweringBase::LibCall, so that backends can directly control whether they prefer an expansion or a conversion to a libcall. This makes the generic lowering code even more generic, allowing its reuse in a wider range of target-specific configurations. The useful effect is that ARM backend will now generate a call to __aeabi_{i,u}div rather than __aeabi_{i,u}divmod in cases where it doesn't need the remainder. There's no functional change outside the ARM backend. Reviewers: t.p.northover, rengolin Subscribers: t.p.northover, llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D13862 llvm-svn: 250826	2015-10-20 13:14:52 +00:00
Igor Breger	c385abd09d	AVX512: Implemented encoding and intrinsics for VPBROADCASTB/W/D/Q instructions. Differential Revision: http://reviews.llvm.org/D13884 llvm-svn: 250819	2015-10-20 11:56:42 +00:00
Matt Arsenault	aa9e5394b5	AMDGPU: Add MachineInstr overloads for instruction format tests llvm-svn: 250797	2015-10-20 04:35:43 +00:00
Matt Arsenault	6e5d4b912c	AMDGPU: Stop reserving v[254:255] This wasn't doing anything useful. They weren't explicitly used anywhere, and the RegScavenger ignores reserved registers. This for some reason caused a random scheduling change in the test. Getting the check lines to pass is too frustrating, and there's probably not too much value in checking the vector case's operands N times. llvm-svn: 250794	2015-10-20 03:59:58 +00:00
JF Bastien	73bf323fe1	WebAssembly: fix call/return syntax. They are now typeless, unlike other operations. llvm-svn: 250793	2015-10-20 01:26:54 +00:00
Duncan P. N. Exon Smith	a94e8688bd	MSP430: Remove implicit ilist iterator conversions, NFC llvm-svn: 250792	2015-10-20 01:18:39 +00:00
Duncan P. N. Exon Smith	4c0e7aa0f0	SystemZ: Remove implicit ilist iterator conversion, NFC llvm-svn: 250790	2015-10-20 01:12:46 +00:00
Duncan P. N. Exon Smith	1da85a4e27	XCore: Remove implicit ilist iterator conversions, NFC llvm-svn: 250788	2015-10-20 01:07:42 +00:00
Duncan P. N. Exon Smith	1767008e58	PowerPC: Remove implicit ilist iterator conversions, NFC llvm-svn: 250787	2015-10-20 01:07:37 +00:00
Duncan P. N. Exon Smith	becbc54dca	Sparc: Remove implicit ilist iterator conversions, NFC llvm-svn: 250781	2015-10-20 00:59:43 +00:00
Duncan P. N. Exon Smith	86ab7394bc	NVPTX: Remove implicit ilist iterator conversions, NFC llvm-svn: 250779	2015-10-20 00:54:09 +00:00
Duncan P. N. Exon Smith	7295da0c84	Hexagon: Remove implicit ilist iterator conversions, NFC There are two things out of the ordinary in this commit. First, I made a loop obviously "infinite" in HexagonInstrInfo.cpp. After checking if an instruction was at the beginning of a basic block (in which case, `break`), the loop decremented and checked the iterator for `nullptr` as the loop condition. This has never been possible (the prev pointers have always been circular, so even with the weird ilist/iplist implementation, this hasn't been possible), so I removed the condition. Second, in HexagonAsmPrinter.cpp there was another case of comparing a `MachineBasicBlock::instr_iterator` against `MachineBasicBlock::end()` (which returns `MachineBasicBlock::iterator`). While not incorrect, it's fragile. I switched this to `::instr_end()`. All that said, no functionality change intended here. llvm-svn: 250778	2015-10-20 00:46:39 +00:00
JF Bastien	aea5642760	WebAssembly: fix syntax for br_if. llvm-svn: 250777	2015-10-20 00:37:42 +00:00
Duncan P. N. Exon Smith	6203d21e79	Mips: Remove implicit ilist iterator conversions, NFC llvm-svn: 250769	2015-10-20 00:15:20 +00:00
Duncan P. N. Exon Smith	f81c53edb8	CppBackend: Remove implicit ilist iterator conversions, NFC Mostly just converted to range-based for loops. May have converted a couple of extra loops as a drive-by (not sure). llvm-svn: 250766	2015-10-20 00:06:41 +00:00
Duncan P. N. Exon Smith	863e2080d4	BPF: Remove implicit ilist iterator conversion, NFC llvm-svn: 250765	2015-10-20 00:02:50 +00:00
Duncan P. N. Exon Smith	aba7bc993c	ARM: Remove implicit ilist iterator conversions, NFC llvm-svn: 250759	2015-10-19 23:25:57 +00:00
Duncan P. N. Exon Smith	4be8cb04fe	X86: Remove implicit ilist iterator conversions, NFC llvm-svn: 250741	2015-10-19 21:48:29 +00:00
Krzysztof Parzyszek	62bb5a1179	[Hexagon] Remove unnecessary argument sign extends llvm-svn: 250724	2015-10-19 19:10:48 +00:00
Benjamin Kramer	64bcbe6d8e	Add missing override noticed by Clang's -Winconsistent-missing-override. llvm-svn: 250720	2015-10-19 18:41:23 +00:00
Jun Bum Lim	16578167d7	[AArch64]Merge halfword loads into a 32-bit load Convert two halfword loads into a single 32-bit word load with bitfield extract instructions. For example : ldrh w0, [x2] ldrh w1, [x2, #2] becomes ldr w0, [x2] ubfx w1, w0, #16, #16 and w0, w0, #ffff llvm-svn: 250719	2015-10-19 18:34:53 +00:00
Krzysztof Parzyszek	f11fad31d2	[Hexagon] Fix debug information for local objects - Isolate the check for the existence of a stack frame into hasFP. - Implement getFrameIndexReference for DWARF address computation. - Use getFrameIndexReference for offset computation in eliminateFrameIndex. - Preserve debug information for dynamically allocated stack objects. - Prefer FP to access local objects at -O0. - Add experimental code to skip allocframe when not strictly necessary (disabled by default). llvm-svn: 250718	2015-10-19 18:30:27 +00:00
Krzysztof Parzyszek	1d955c1e56	[Hexagon] Delay emission of CFI instructions Emit the CFI instructions after all code transformations have been done. This will avoid any interference between CFI instructions and packetization. llvm-svn: 250714	2015-10-19 17:46:01 +00:00
Benjamin Kramer	170c107663	Remove CRLF newlines. NFC. llvm-svn: 250698	2015-10-19 13:05:25 +00:00
Asiri Rathnayake	8d48a252a1	Fix mapping of @llvm.arm.ssat/usat intrinsics to ssat/usat instructions The mapping of these two intrinsics in ARMInstrInfo.td had a small omission which led to their operands not being validated/transformed before being lowered into usat and ssat instructions. This can cause incorrect instructions to be emitted. I've also added tests for the remaining two saturating arithmetic intrinsics @llvm.arm.qadd and @llvm.arm.qsub as they are missing codegen tests. llvm-svn: 250697	2015-10-19 11:44:24 +00:00
Elena Demikhovsky	2e0208e770	Removed parameter "Consecutive" from isLegalMaskedLoad() / isLegalMaskedStore(). Originally I planned to use the same interface for masked gather/scatter and set isConsecutive to "false" in this case. Now I'm implementing masked gather/scatter and see that the interface is inconvenient. I want to add interfaces isLegalMaskedGather() / isLegalMaskedScatter() instead of using the "Consecutive" parameter in the existing interfaces. Differential Revision: http://reviews.llvm.org/D13850 llvm-svn: 250686	2015-10-19 07:43:38 +00:00
Zlatko Buljan	56aeea0467	[mips][microMIPS] Implement ADDQ.PH, ADDQ_S.W, ADDQH.PH, ADDQH.W, ADDSC, ADDU.PH, ADDU_S.QB, ADDWC and ADDUH.QB instructions Differential Revision: http://reviews.llvm.org/D13130 llvm-svn: 250685	2015-10-19 07:16:26 +00:00
Zlatko Buljan	51aca82162	[mips][microMIPS] Implement ABSQ.QB, ABSQ_S.PH, ABSQ_S.W, ABSQ_S.QB, INSV, MADD, MADDU, MSUB, MSUBU, MULT and MULTU instructions Differential Revision: http://reviews.llvm.org/D13721 llvm-svn: 250683	2015-10-19 06:34:44 +00:00
Asaf Badouh	381b11d5f2	[X86][AVX512DQ] add scalar fpclass Differential Revision: http://reviews.llvm.org/D13769 llvm-svn: 250650	2015-10-18 11:04:38 +00:00


34686 Commits