llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Fangrui Song	4c12d59f85	Use the container form llvm::sort(C, ...) There are a few leftovers in rL343163 which span two lines. This commit changes these llvm::sort(C.begin(), C.end(), ...) to llvm::sort(C, ...) llvm-svn: 343426	2018-09-30 22:31:29 +00:00
Simon Pilgrim	e1dabe807a	[X86] Fix scheduler class for BTmi instructions This wasn't treated as a folded load instruction llvm-svn: 343424	2018-09-30 20:19:16 +00:00
Lang Hames	b43a7e07a5	[ORC] Extract and tidy up JITTargetMachineBuilder, add unit test. (1) Adds comments for the API. (2) Removes the setArch method: This is redundant: the setArchStr method on the triple should be used instead. (3) Turns EmulatedTLS on by default. This matches EngineBuilder's behavior. llvm-svn: 343423	2018-09-30 19:12:23 +00:00
Simon Pilgrim	64935c752c	[LLVM-MCA][X86] Add missing VCMPESTR/VCMPESTR tests llvm-svn: 343421	2018-09-30 18:19:00 +00:00
Craig Topper	688761d566	[X86] Copy memrefs when folding a load for division instruction selection. llvm-svn: 343419	2018-09-30 17:47:18 +00:00
Bjorn Pettersson	b92a8375b0	[PHIElimination] Lower a PHI node with only undef uses as IMPLICIT_DEF Summary: The lowering of PHI nodes used to detect if all inputs originated from IMPLICIT_DEF's. If so the PHI node was replaced by an IMPLICIT_DEF. Now we also consider undef uses when checking the inputs. So if all inputs are implicitly defined or undef we lower the PHI to an IMPLICIT_DEF. This makes PHIElimination::LowerPHINode more consistent as it checks both implicit and undef properties at later stages. Reviewers: MatzeB, tstellar Reviewed By: MatzeB Subscribers: jvesely, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D52558 llvm-svn: 343417	2018-09-30 17:26:58 +00:00
Bjorn Pettersson	74b9872a0d	[PHIElimination] Update the regression test for PR16508 Summary: When PR16508 was solved (in rL185363) a regression test was added as test/CodeGen/PowerPC/2013-07-01-PHIElimBug.ll. I discovered that the test case no longer reproduced the scenario from PR16508. This problem could have been amended by adding an extra RUN line with "-O1" (or possibly "-O0"), but instead I added a mir-reproducer test/CodeGen/PowerPC/2013-07-01-PHIElimBug.mir to get a reproducer that is less sensitive to changes in earlier passes (including O-level). While at it I also corrected a code comment in PHIElimination::EliminatePHINodes that has been incorrect since the related bugfix from rL185363. Reviewers: MatzeB, hfinkel Reviewed By: MatzeB Subscribers: nemanjai, jsji, llvm-commits Differential Revision: https://reviews.llvm.org/D52553 llvm-svn: 343416	2018-09-30 17:23:21 +00:00
Simon Pilgrim	9cc7b97428	[LLVM-MCA][X86] Add some AVX512 tests These are going to be necessary to check I don't mess up when I start cleaning up all the remaining vector integer overrides llvm-svn: 343414	2018-09-30 17:01:59 +00:00
Simon Pilgrim	1f8b17aca5	[X86][Btver2] Fix PCmpIStrI/PCmpIStrM schedules Missing JFPU0 pipe and double JFPU1 pipe (to match JVALU1) resources Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343413	2018-09-30 16:38:38 +00:00
Zachary Turner	3032fee0fa	[PDB] Add native support for dumping array types. llvm-svn: 343412	2018-09-30 16:19:18 +00:00
Simon Pilgrim	aa519f0231	[X86][BtVer2] Add the ability to add additional uops for folded instructions Some instructions take an extra load uop - but not consistently..... llvm-svn: 343410	2018-09-30 15:58:56 +00:00
Sanjay Patel	4dc4616ffa	[InstCombine] try to convert vector insert+extract to trunc This transform is requested for the backend in: https://bugs.llvm.org/show_bug.cgi?id=39016 ...but I figured it was worth doing in IR too, and it's probably easier to implement here, so that's this patch. In the simplest case, we are just truncating a scalar value. If the extract index doesn't correspond to the LSBs of the scalar, then we have to shift-right before the truncate. Endian-ness makes this tricky, but hopefully the ASCII-art helps visualize the transform. Differential Revision: https://reviews.llvm.org/D52439 llvm-svn: 343407	2018-09-30 14:34:01 +00:00
Sanjay Patel	733c62c1be	[InstCombine] allow lengthening of insertelement to eliminate shuffles As noted in post-commit comments for D52548, the limitation on increasing vector length can be applied by opcode. As a first step, this patch only allows insertelement to be widened because that has no logical downsides for IR and has little risk of pessimizing codegen. This may cause PR39132 to go into hiding during a full compile, but that bug is not fixed. llvm-svn: 343406	2018-09-30 13:50:42 +00:00
Simon Pilgrim	dee87a10d7	[DAG] Don't perform SINT_TO_FP<->UINT_TO_FP custom conversion after legalization The SINT_TO_FP<->UINT_TO_FP combines for non-negative integers should only occur for legal ops once LegalOperations = true No test case to hand, noticed when investigating PR38226 + PR38970 llvm-svn: 343405	2018-09-30 12:46:42 +00:00
Roman Lebedev	eb90d49956	[NFC][CodeGen][X86][AArch64] Add 64-bit constant bit field extract pattern tests llvm-svn: 343404	2018-09-30 12:42:08 +00:00
Simon Pilgrim	dd1c35c05a	[X86] Regenerate MMX coalescing test Exposes another extractelement(bitcast(scalartovector())) pattern llvm-svn: 343403	2018-09-30 09:42:04 +00:00
Zachary Turner	df0889e93a	[PDB] Fix this test for real. I was able to test this fix on an actual Windows machine so this should get the bot green again. llvm-svn: 343400	2018-09-30 03:57:49 +00:00
Craig Topper	bfed70b548	[X86] Disable BMI BEXTR in X86DAGToDAGISel::matchBEXTRFromAnd unless we're compiling for a CPU with single uop BEXTR Summary: This function turns (X >> C1) & C2 into a BMI BEXTR or TBM BEXTRI instruction. For BMI BEXTR we have to materialize an immediate into a register to feed to the BEXTR instruction. The BMI BEXTR instruction is 2 uops on Intel CPUs. It looks like on SKL it's one port 0/6 uop and one port 1/5 uop, despite what Agner's tables say. I know one of the uops is a regular shift uop so it would have to go through the port 0/6 shifter unit. So that's the same or worse execution-wise than the shift+and, which is one 0/6 uop and one 0/1/5/6 uop. The move immediate into register is an additional 0/1/5/6 uop. For now I've limited this transform to AMD CPUs which have a single uop BEXTR. It also might make sense if we can fold a load or if the and immediate is larger than 32-bits and can't be encoded as a sign extended 32-bit value or if LICM or CSE can hoist the move immediate and share it. But we'd need to look more carefully at that. In the regression I looked at it doesn't look like load folding or large immediates were occurring so the regression isn't caused by the loss of those. So we could try to be smarter here if we find a compelling case. Reviewers: RKSimon, spatel, lebedev.ri, andreadb Reviewed By: RKSimon Subscribers: llvm-commits, andreadb, RKSimon Differential Revision: https://reviews.llvm.org/D52570 llvm-svn: 343399	2018-09-30 03:01:46 +00:00
Zachary Turner	b05d5b5adb	Only dump the types we need in the test. We added support for dumping pointers but pointers to arrays won't correctly dump until we add support for dumping arrays. Instead of trying to dump everything, which this test isn't even interested in, just dump enums and typedefs. llvm-svn: 343398	2018-09-30 00:51:54 +00:00
Zachary Turner	37291b85d3	Fix some tests on Windows. I don't actually have a Windows machine at the present moment, so hopefully this fixes it. llvm-svn: 343397	2018-09-30 00:22:21 +00:00
Lang Hames	1d65c306a2	[ORC] Add partitioning support to CompileOnDemandLayer2. CompileOnDemandLayer2 now supports user-supplied partition functions (the original CompileOnDemandLayer already supported these). Partition functions are called with the list of requested global values (i.e. global values that currently have queries waiting on them) and have an opportunity to select extra global values to materialize at the same time. Also adds testing infrastructure for the new feature to lli. llvm-svn: 343396	2018-09-29 23:49:57 +00:00
Lang Hames	f18e8314d4	[ORC] Clear SymbolToDefinitionMap when materializing a MaterializationUnit. The map is inaccessible at this point, so we may as well reclaim the memory early. llvm-svn: 343395	2018-09-29 23:49:56 +00:00
Lang Hames	ed714f2dc7	Add a comment to clarify the contract for LLVMGetErrorMessage in the c-bindings for Error. llvm-svn: 343394	2018-09-29 23:49:54 +00:00
Zachary Turner	3ba331bf03	[PDB] Better native API support for pointers. We didn't properly detect when a pointer was a member pointer, and when that was the case we were not properly returning class parent info. This caused member pointers to render incorrectly in pretty mode. However, we didn't even have pretty tests for pointers in native mode, so those are also added now to ensure this. llvm-svn: 343393	2018-09-29 23:28:19 +00:00
David Bolvansky	aa72b05b81	[DAGCombiner][NFC] Tests for X div/rem Y single bit fold llvm-svn: 343392	2018-09-29 21:00:37 +00:00
Simon Pilgrim	dbb4e044b7	[X86][AVX2] Cleanup shuffle combining tests - add common prefixes llvm-svn: 343391	2018-09-29 20:34:16 +00:00
Simon Pilgrim	1a4f4d3127	[X86] SimplifyDemandedVectorEltsForTargetNode - remove identity target shuffles before simplifying inputs By removing demanded target shuffles that simplify to zero/undef/identity before simplifying its inputs we improve chances of further simplification, as only the immediate parent user of the combined is added back to the work list - this still doesn't help us if it's passed through other ops though (bitcasts....). llvm-svn: 343390	2018-09-29 18:15:26 +00:00
Craig Topper	eb9cee2f29	[X86] Add fast-isel test cases for unaligned load/store intrinsics recently added to clang This adds tests for: _mm_loadu_si16 _mm_loadu_si32 _mm_loadu_si16 _mm_storeu_si64 _mm_storeu_si32 _mm_storeu_si16 llvm-svn: 343389	2018-09-29 18:03:52 +00:00
Simon Pilgrim	9d4246e3f3	[X86][SSE] LowerScalarImmediateShift - remove 32-bit vXi64 special case handling. This is all handled generally by getTargetConstantBitsFromNode now llvm-svn: 343387	2018-09-29 17:36:22 +00:00
Simon Pilgrim	9661cf1100	Fix signed/unsigned mismatch warning. NFCI. llvm-svn: 343385	2018-09-29 17:11:19 +00:00
Simon Pilgrim	210e6fd5c3	[X86] getTargetConstantBitsFromNode - add support for rearranging constant bits via shuffles Exposed an issue that recursive calls to getTargetConstantBitsFromNode don't handle changes to EltSizeInBits yet. llvm-svn: 343384	2018-09-29 17:01:55 +00:00
Simon Pilgrim	e53b7b38bd	[X86][SSE] LowerScalarImmediateShift - use getTargetConstantBitsFromNode to get immediate data Don't just attempt to find a splat build vector. First step towards getting rid of all the 32-bit special case code. llvm-svn: 343383	2018-09-29 16:40:35 +00:00
Sanjay Patel	a4cee6174a	[InstCombine] fix formatting in vector evaluators; NFC We need to alter the functionality as shown in D52548. llvm-svn: 343379	2018-09-29 15:05:24 +00:00
Sanjay Patel	141720c73b	[InstCombine] add test for vector widening of insertelements; NFC The test shows a potential overreach with the fix from D52548. llvm-svn: 343378	2018-09-29 15:01:45 +00:00
Simon Pilgrim	20b5e7e0c4	[X86] getTargetConstantBitsFromNode - fix self-move assertions from gcc builds due to rL343375 llvm-svn: 343377	2018-09-29 14:51:09 +00:00
Simon Pilgrim	8a3c5a7763	[X86] Regenerate fma comments. llvm-svn: 343376	2018-09-29 14:31:00 +00:00
Simon Pilgrim	a32a521f91	[X86] getTargetConstantBitsFromNode - add support for peeking through ISD::EXTRACT_SUBVECTOR llvm-svn: 343375	2018-09-29 14:17:32 +00:00
Simon Pilgrim	cd6450622b	[X86][SSE] Fixed issue with v2i64 variable shifts on 32-bit targets The shift amount might have peeked through an extract_subvector, altering the number of vector elements in the 'Amt' variable - so we were incorrectly calculating the ratio when peeking through bitcasts, resulting in incorrectly detecting splats. llvm-svn: 343373	2018-09-29 13:25:22 +00:00
Heejin Ahn	60dae1e344	Fix comment indentation in addLandingPad rL343018 messed up the comment indentation while moving it. llvm-svn: 343371	2018-09-29 09:22:25 +00:00
Vitaly Buka	a3a88d2f03	[cxx2a] Fix warning triggered by r343285 llvm-svn: 343369	2018-09-29 02:17:12 +00:00
Lang Hames	3a8dd9b0c2	[ORC] Make MaterializationResponsibility::getRequestedSymbols() const. This makes it available for use in IRTransformLayer2::TransformFunction instances (since a const MaterializationResponsibility& parameter was added in r343365). llvm-svn: 343367	2018-09-28 22:03:17 +00:00
Lang Hames	c585dda426	[ORC] Add more utilities to aid debugging output. (1) A const accessor for the LLVMContext held by a ThreadSafeContext. (2) A const accessor for the ThreadSafeModules held by an IRMaterializationUnit. (3) A const MaterializationResponsibility reference to IRTransformLayer2's transform function. This makes IRTransformLayer2 useful for JIT debugging (since it can inspect JIT state through the responsibility argument) as well as program transformations. llvm-svn: 343365	2018-09-28 21:49:53 +00:00
Thomas Lively	4754140dfc	[ValueTracking] Allow select patterns to work on FP vectors Summary: This CL allows constant vectors of floats to be recognized as non-NaN and non-zero in select patterns. This change makes `matchSelectPattern` more powerful generally, but was motivated specifically because I wanted fminnan and fmaxnan to be created for vector versions of the scalar patterns they are created for. Tested with check-all on all targets. A testcase in the WebAssembly backend that tests the non-nan codepath is in an upcoming CL. Reviewers: aheejin, dschuff Subscribers: sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52324 llvm-svn: 343364	2018-09-28 21:36:43 +00:00
Robert Widmann	565733aaa9	[LLVM-C] Add an accessor for the "value type" of a global Summary: Before this, there was no reasonable way to retrieve the type of a global value (most notably, a function) that was created with the C API. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52659 llvm-svn: 343363	2018-09-28 20:54:29 +00:00
Heejin Ahn	7900ef0270	[WebAssembly] Fix memory leak on WasmEHFuncInfo Summary: WasmEHFuncInfo objects were not being properly deleted. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52582 llvm-svn: 343362	2018-09-28 20:54:04 +00:00
Eli Friedman	a0f892568b	[ARM] Fix correctness checks in promoteToConstantPool. Correctly check for relocations in the constant to promote. And don't allow promoting a constant multiple times. This partially fixes https://bugs.llvm.org//show_bug.cgi?id=32780 ; it's not a complete fix because we also need to prevent ARMConstantIslands from cloning the constant. (-arm-promote-constant is currently off by default, and it stays off with this patch. I'll look into turning it on again when all the known issues are fixed.) Differential Revision: https://reviews.llvm.org/D51472 llvm-svn: 343361	2018-09-28 20:27:31 +00:00
Eli Friedman	da1384c2fe	[ARM] Use preferred alignment for constants in promoteToConstantPool. This mostly affects IR generated by non-clang frontends because clang generally sets the alignment of globals explicitly. Fixes https://bugs.llvm.org//show_bug.cgi?id=32394 . (-arm-promote-constant is currently off by default, and it stays off with this patch. I'll look into turning it on again when all the known issues are fixed.) Differential Revision: https://reviews.llvm.org/D51469 llvm-svn: 343359	2018-09-28 20:21:51 +00:00
Lang Hames	f97e70c514	[ORC] Narrow a cast: the block guarded by the condition only handles GlobalVariables, not all GlobalValues. llvm-svn: 343358	2018-09-28 20:16:16 +00:00
Craig Topper	8805cb4f61	[X86] Add test cases for failures to use narrow test with immediate instructions when a truncate is between the CMP and the AND and the sign flag is used. The code in X86ISelDAGToDAG only looks through truncates if the sign flag isn't used, but that is overly restrictive. A future patch will improve this. llvm-svn: 343355	2018-09-28 19:06:28 +00:00
Evandro Menezes	fdd7b1d490	[AArch64] Split zero cycle feature more granularly Split the `zcz` feature into specific ones for GP and FP registers, `zcz-gp` and `zcz-fp`, respectively, while retaining the original feature option to mean both. Differential revision: https://reviews.llvm.org/D52621 llvm-svn: 343354	2018-09-28 19:05:09 +00:00
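
As an illustration of the pattern discussed in bfed70b548, the sketch below compares the plain shift-and-mask form `(X >> C1) & C2` with a software model of a bit-field extract that takes a packed control word (start in bits 7:0, length in bits 15:8, which is how the BMI BEXTR control operand is laid out). The function names `shiftAndMask`, `makeControl`, and `bextrModel` are hypothetical and do not come from the LLVM sources; this is only a model of the arithmetic, not of the instruction selection code.

```cpp
#include <cassert>
#include <cstdint>

// Plain form the commit message starts from: (X >> C1) & C2, where C2 is a
// mask covering the low `len` bits. Assumes start < 64.
constexpr std::uint64_t shiftAndMask(std::uint64_t x, unsigned start,
                                     unsigned len) {
  const std::uint64_t mask = (len >= 64) ? ~0ULL : ((1ULL << len) - 1);
  return (x >> start) & mask;
}

// Hypothetical helper: pack start and length into one control word,
// start in bits 7:0 and length in bits 15:8.
constexpr std::uint32_t makeControl(unsigned start, unsigned len) {
  return (start & 0xFFu) | ((len & 0xFFu) << 8);
}

// Hypothetical software model of a bit-field extract driven by a control
// word. The control word is the extra value that has to be materialized
// into a register before the extract can be issued.
constexpr std::uint64_t bextrModel(std::uint64_t x, std::uint32_t control) {
  const unsigned start = control & 0xFFu;
  const unsigned len = (control >> 8) & 0xFFu;
  if (start >= 64)
    return 0; // nothing left to extract
  const std::uint64_t shifted = x >> start;
  if (len >= 64)
    return shifted; // field covers all remaining bits
  return shifted & ((1ULL << len) - 1);
}

// Both forms agree on an in-range field.
static_assert(shiftAndMask(0xABCD1234ULL, 8, 8) ==
                  bextrModel(0xABCD1234ULL, makeControl(8, 8)),
              "shift+and and the extract model must agree");
```

Both forms compute the same value; the difference the commit message cares about is cost, since the extract form needs the control word in a register while the shift and the and can each take an immediate.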


169810 Commits
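
The container-form cleanup in 4c12d59f85 can be pictured with a small stand-in. The sketch below assumes a hypothetical helper, `sortRange`, that forwards a whole container to `std::sort`; it is not the LLVM implementation of `llvm::sort`, only an illustration of why the container form is shorter and harder to get wrong than spelling out both iterators.

```cpp
#include <algorithm>
#include <cassert>
#include <iterator>
#include <vector>

// Hypothetical stand-in for a container-form sort: the caller passes the
// container once instead of repeating it for begin() and end().
template <typename Container, typename Compare>
void sortRange(Container &c, Compare comp) {
  std::sort(std::begin(c), std::end(c), comp);
}

// Iterator form and container form side by side; both sort descending.
inline std::vector<int> sortedDescending(std::vector<int> values) {
  // Iterator form: std::sort(values.begin(), values.end(), comp);
  // Container form:
  sortRange(values, [](int a, int b) { return a > b; });
  return values;
}
```

A call such as `sortedDescending({3, 1, 2})` returns the elements in descending order.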