llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Craig Topper	cf3121d888	[AVX512] Bring vmovq instructions names into alignment with the AVX and SSE names. Add a missing encoding to disassembler and assembler. I believe this also fixes a case where a 64-bit memory form that is documented as being unsupported in 32-bit mode was able to be selected there. llvm-svn: 256483	2015-12-28 06:11:42 +00:00
Craig Topper	78232095c9	[X86] Move address for store target from outs to ins on a couple instructions. llvm-svn: 256482	2015-12-28 06:11:39 +00:00
Craig Topper	bfdc4a3764	[X86] Add proper Uses/Defs/mayLoad flags for AAA/AAD/AAM/AAS/DAA/DAS/XLAT instructions. llvm-svn: 256481	2015-12-28 06:11:37 +00:00
Chandler Carruth	57f828fd51	[lcg] Fix a few more formatting goofs found by clang-format. NFC. llvm-svn: 256480	2015-12-28 01:54:20 +00:00
Chandler Carruth	a18e7dcea6	[lcg] Fix formatting errors found with clang-format, remove the now optional '\brief' tag and reflow some comments based on the added horizontal space. NFC. llvm-svn: 256479	2015-12-28 01:54:18 +00:00
Craig Topper	e4e0592ca3	[AVX512] Remove separate instruction and patterns for lowering ctlz_zero_undef. Change the operation for CTLZ_ZERO_UNDEF to Expand so SelectionDAG will convert them to CTLZ before lowering. llvm-svn: 256477	2015-12-27 21:33:50 +00:00
Craig Topper	ec0fd66634	[SelectionDAG] Teach LegalizeVectorOps to not unroll CTLZ_ZERO_UNDEF and CTTZ_ZERO_UNDEF if the non-ZERO_UNDEF form is legal or custom. Will be used to simplify X86 code in a follow on commit. llvm-svn: 256476	2015-12-27 21:33:47 +00:00
Craig Topper	ce5014e9fe	[AVX512] Remove alternate data type versions of VALIGND, VALIGNQ, VMOVSHDUP and VMOVSLDUP. They don't have any tests and I don't think they can be selected. If they are truly needed they should be implemented with patterns against the normal instructions and not separate instructions. llvm-svn: 256475	2015-12-27 19:45:21 +00:00
Dan Liew	d6419a3e16	[lit] Implement support of per test timeout in lit. This should work with ShTest (executed externally or internally) and GTest test formats. To set the timeout a new option ``--timeout=`` has been added which specifies the maximum run time of an individual test in seconds. By default this 0 which causes no timeout to be enforced. The timeout can also be set from a lit configuration file by modifying the ``lit_config.maxIndividualTestTime`` property. To implement a timeout we now require the psutil Python module if a timeout is requested. This dependency is confined to the newly added ``lit.util.killProcessAndChildren()``. A note has been added into the TODO document describing how we can remove the dependency on the ``pustil`` module in the future. It would be nice to remove this immediately but that is a lot more work and Daniel Dunbar believes it is better that we get a working implementation first and then improve it. To avoid breaking the existing behaviour the psutil module will not be imported if no timeout is requested. The included testcases are derived from test cases provided by Jonathan Roelofs which were in an previous attempt to add a per test timeout to lit (http://reviews.llvm.org/D6584). Thanks Jonathan! Reviewers: ddunbar, jroelofs, cmatthews, MatzeB Subscribers: cmatthews, llvm-commits Differential Revision: http://reviews.llvm.org/D14706 llvm-svn: 256471	2015-12-27 14:03:49 +00:00
Igor Breger	a848a96908	AVX512: Change VPMOVB2M DAG lowering , use CVT2MASK node instead TRUNCATE. Fix TRUNCATE lowering vector to vector i1, use LSB and not MSB. Implement VPMOVB/W/D/Q2M intrinsic. Differential Revision: http://reviews.llvm.org/D15675 llvm-svn: 256470	2015-12-27 13:56:16 +00:00
Asaf Badouh	f94cbd0492	[X86][AVX512] change broadcast to use maskable pattern Differential Revision: http://reviews.llvm.org/D15786 llvm-svn: 256469	2015-12-27 12:14:34 +00:00
Chandler Carruth	8beb86a806	[attrs] Extract the pure inference of function attributes into a standalone pass. There is no call graph or even interesting analysis for this part of function attributes -- it is literally inferring attributes based on the target library identification. As such, we can do it using a much simpler module pass that just walks the declarations. This can also happen much earlier in the pass pipeline which has benefits for any number of other passes. In the process, I've cleaned up one particular aspect of the logic which was necessary in order to separate the two passes cleanly. It now counts inferred attributes independently rather than just counting all the inferred attributes as one, and the counts are more clearly explained. The two test cases we had for this code path are both ... woefully inadequate and copies of each other. I've kept the superset test and updated it. We need more testing here, but I had to pick somewhere to stop fixing everything broken I saw here. Differential Revision: http://reviews.llvm.org/D15676 llvm-svn: 256466	2015-12-27 08:41:34 +00:00
Chandler Carruth	cf6f5436f5	[attrs] Split off the forced attributes utility into its own pass that is (by default) run much earlier than FuncitonAttrs proper. This allows forcing optnone or other widely impactful attributes. It is also a bit simpler as the force attribute behavior needs no specific iteration order. I've added the pass into the default module pass pipeline and LTO pass pipeline which mirrors where function attrs itself was being run. Differential Revision: http://reviews.llvm.org/D15668 llvm-svn: 256465	2015-12-27 08:13:45 +00:00
Craig Topper	c79efd26f5	[AVX-512] Remove alernate integer forms for VPERMILPS and VPERMILPD. There no tests for them and I don't see any way to select them anyway. If they are really needed they should be implemented as patterns and not full fledged instructions. llvm-svn: 256462	2015-12-27 06:55:08 +00:00
David Majnemer	b3f332af9b	Make the test properly constrained llvm-svn: 256460	2015-12-27 06:26:41 +00:00
NAKAMURA Takumi	908789394c	InstrProfTest.cpp: Don't assume string literals are always merged. MSC18 Debug didn't merge them. FIXME: I tweaked just to appease a builder. Almost string literals should be addressed identically there. llvm-svn: 256459	2015-12-27 06:18:57 +00:00
David Majnemer	15ba8464b4	Try to passify buildbot llvm-svn: 256458	2015-12-27 06:18:48 +00:00
NAKAMURA Takumi	76584964f3	Prune the feature "tls". No one is using it since TLS is enabled for Cygwin. llvm-svn: 256457	2015-12-27 06:14:33 +00:00
David Majnemer	38d1ffe261	[X86, Win64] Use a frame pointer if pushf is emitted A frame pointer must be used if stack pointer is modified after the prologue. LLVM will emit pushf/popf if we need to save/restore the FLAGS register, requiring us to have a frame pointer for the function. There is a small twist: this sequence might exist in user code via inline-assembly. For now, conservatively assume that such functions require a frame pointer. For real world justification, please see clang's implementation of __readeflags. This fixes PR25945. llvm-svn: 256456	2015-12-27 06:07:26 +00:00
David Majnemer	870f172298	[WinEH] Add comments explaining the EH tables This is aids in debugging WinEH, similar functionality is present for DWARF EH. llvm-svn: 256455	2015-12-27 06:07:12 +00:00
Sanjay Patel	5cb4cfb9d6	[x86] lower calls to llvm.maxnum.v4f32 using maxps This is a follow-on to: http://reviews.llvm.org/rL255700 llvm-svn: 256454	2015-12-26 21:44:55 +00:00
Craig Topper	e482a43cb8	[X86] Fix an unused variable warning in released builds. llvm-svn: 256453	2015-12-26 20:13:33 +00:00
Craig Topper	dcab6c8bcf	[X86] Add support for printing shuffle comments for AVX512 PSHUFB instructions. llvm-svn: 256452	2015-12-26 19:48:43 +00:00
Craig Topper	89319531b9	[X86] Fold some variable declarations and initializations into if statements. NFC llvm-svn: 256451	2015-12-26 19:48:37 +00:00
Benjamin Kramer	f19aafee12	Fix safepoint intrinsic signatures in test. Should bring back the bots after r256443. llvm-svn: 256450	2015-12-26 11:40:48 +00:00
Chen Li	c60ad3e1fe	[gc.statepoint] Change gc.statepoint intrinsic's return type to token type instead of i32 type Summary: This patch changes gc.statepoint intrinsic's return type to token type instead of i32 type. Using token types could prevent LLVM to merge different gc.statepoint nodes into PHI nodes and cause further problems with gc relocations. The patch also changes the way on how gc.relocate and gc.result look for their corresponding gc.statepoint on unwind path. The current implementation uses the selector value extracted from a { i8*, i32 } landingpad as a hook to find the gc.statepoint, while the patch directly uses a token type landingpad (http://reviews.llvm.org/D15405) to find the gc.statepoint. Reviewers: sanjoy, JosephTremoulet, pgavlin, igor-laevsky, mjacob Subscribers: reames, mjacob, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D15662 llvm-svn: 256443	2015-12-26 07:54:32 +00:00
Craig Topper	c1da12f13f	Add test case for r256433. "[X86] Fix shuffle decoding for variable VPERMIL to be tolerant of the Constant type not matching due to folding in the constant pool and to get VPERMILPD correct." llvm-svn: 256435	2015-12-26 04:58:05 +00:00
Craig Topper	23e36b4275	Revert r256432 "Test" This is the test case for r256433, but it got committed incorrectly in my local repo. llvm-svn: 256434	2015-12-26 04:56:51 +00:00
Craig Topper	de07308c81	[X86] Fix shuffle decoding for variable VPERMIL to be tolerant of the Constant type not matching due to folding in the constant pool and to get VPERMILPD correct. llvm-svn: 256433	2015-12-26 04:50:07 +00:00
Craig Topper	bf05e5eef9	Test llvm-svn: 256432	2015-12-26 04:50:01 +00:00
Craig Topper	38dac27e3a	[X86] Fix copy and paste typo from pasting from another Makefile to restore code. llvm-svn: 256431	2015-12-25 23:27:57 +00:00
Craig Topper	f89e5ceed0	[X86] Put back the include path to the main X86 sources in the AsmParser library to fix the bots. llvm-svn: 256430	2015-12-25 22:22:16 +00:00
Craig Topper	7e1d2e52cd	[X86] Remove X86CodeGen dependency from the AsmParser library. llvm-svn: 256429	2015-12-25 22:10:11 +00:00
Craig Topper	8d6e33b512	[X86] Move getX86SubSuperRegisterOrZero to X86MCTargetDesc.cpp so it can be used by AsmParser library without depending on X86CodeGen library. llvm-svn: 256428	2015-12-25 22:10:08 +00:00
Craig Topper	8ac345d153	Remove extra forward declarations and scrub includes for all in tree InstPrinters. NFC llvm-svn: 256427	2015-12-25 22:10:01 +00:00
Craig Topper	08678ccbc2	[X86] Move AVX512 STATIC_ROUNDING enum to X86BaseInfo.h to fix a layering violation in AsmParser. llvm-svn: 256426	2015-12-25 22:09:49 +00:00
Craig Topper	fe5f33f108	[X86] Replace MVT::SimpleValueType in the AsmParser library and getX86SubSuperRegister with just an unsigned representing size. This a is step towards fixing a layering violation so the X86 AsmParser won't depending on CodeGen types. llvm-svn: 256425	2015-12-25 22:09:45 +00:00
Craig Topper	f023f89e96	[X86] Don't pass the default value to the High argument of getX86SubSuperRegister. Most place don't care about this argument. NFC llvm-svn: 256424	2015-12-25 19:44:16 +00:00
Davide Italiano	5e3e0bcb69	[llvm-objdump] Use stderr and not stdout for fatal errors. llvm-svn: 256423	2015-12-25 18:16:45 +00:00
Craig Topper	f3eef0e344	[X86] getX86SubSuperRegisterOrZero shouldn't call getX86SubSuperRegister recursively. It should call itself instead. Otherwise it might fire an assertion when it was designed not too. llvm-svn: 256422	2015-12-25 17:07:32 +00:00
Craig Topper	be47256c20	[X86] Add missing X86II::MRM_C4, MRM_C5, etc. encodings to getMemoryOperandNo. These aren't used by any instructions, but could be someday. NFC llvm-svn: 256421	2015-12-25 17:07:30 +00:00
Craig Topper	474cc56790	[X86] Use assert instead of if and llvm_unreachable. NFC llvm-svn: 256420	2015-12-25 17:07:27 +00:00
Craig Topper	e1f71ad36d	[X86] Minor identation fixes. NFC llvm-svn: 256419	2015-12-25 17:07:24 +00:00
David Majnemer	809c69c1e5	[CodeGen] Use generic printAsOperand machinery instead of hand rolling it We already know how to properly print out basic blocks in printAsOperand, we should not roll it ourselves in AsmPrinter::EmitBasicBlockStart. No functionality change is intended. llvm-svn: 256413	2015-12-25 09:37:26 +00:00
Craig Topper	42212f3d03	[IR] Mark the Type subclass helper methods 'inline' and move their definitions to DerivedTypes.h so they can be inlined by the compiler. llvm-svn: 256406	2015-12-25 04:06:20 +00:00
Craig Topper	d426e33014	[Transforms] Use asserts instead of ifs around llvm_unreachable. NFC llvm-svn: 256405	2015-12-25 02:04:17 +00:00
Dan Gohman	9c3961aabe	[WebAssembly] Fix handling of COPY instructions in WebAssemblyRegStackify. Move RegStackify after coalescing and teach it to use LiveIntervals instead of depending on SSA form. This avoids a problem where a register in a COPY instruction is stackified and then subsequently coalesced with a register that is not stackified. This also puts it after the scheduler, which allows us to simplify the EXPR_STACK constraint, as we no longer have instructions being reordered after stackification and before coloring. llvm-svn: 256402	2015-12-25 00:31:02 +00:00
Sanjay Patel	b4b4a9aeb1	[InstCombine] transform more extract/insert pairs into shuffles (PR2109) This is an extension of the shuffle combining from r203229: http://reviews.llvm.org/rL203229 The idea is to widen a short input vector with undef elements so the existing shuffle transform for extract/insert can kick in. The motivation is to finally solve PR2109: https://llvm.org/bugs/show_bug.cgi?id=2109 For that example, the IR becomes: %1 = bitcast <2 x i32>* %P to <2 x float>* %ld1 = load <2 x float>, <2 x float>* %1, align 8 %2 = shufflevector <2 x float> %ld1, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef> %i2 = shufflevector <4 x float> %A, <4 x float> %2, <4 x i32> <i32 0, i32 1, i32 4, i32 5> ret <4 x float> %i2 And x86 SSE output improves from: movq (%rdi), %xmm1 ## xmm1 = mem[0],zero movdqa %xmm1, %xmm2 shufps $229, %xmm2, %xmm2 ## xmm2 = xmm2[1,1,2,3] shufps $48, %xmm0, %xmm1 ## xmm1 = xmm1[0,0],xmm0[3,0] shufps $132, %xmm1, %xmm0 ## xmm0 = xmm0[0,1],xmm1[0,2] shufps $32, %xmm0, %xmm2 ## xmm2 = xmm2[0,0],xmm0[2,0] shufps $36, %xmm2, %xmm0 ## xmm0 = xmm0[0,1],xmm2[2,0] retq To the almost optimal: movhpd (%rdi), %xmm0 Note: There's a tension in the existing transform related to generating arbitrary shufflevector masks. We avoid that in other places in InstCombine because we're scared that codegen can't handle strange masks, but it looks like we're ok with producing those here. I purposely chose weird insert/extract indexes for the regression tests to see the effect in these cases. For PowerPC+Altivec, AArch64, and X86+SSE/AVX, I think the codegen is equal or better for these examples. Differential Revision: http://reviews.llvm.org/D15096 llvm-svn: 256394	2015-12-24 21:17:56 +00:00
Dave Bartolomeo	f1ad63fa3f	Fix signed/unsigned warning in Line.h. llvm-svn: 256390	2015-12-24 19:17:54 +00:00
Dave Bartolomeo	41b83cb4b3	Remove unused constants from TypeTableBuilder.cpp. llvm-svn: 256389	2015-12-24 19:15:56 +00:00

1 2 3 4 5 ...

125412 Commits