llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Craig Topper	23389924ea	[X86] Add avx512vl and avx512dq command lines to combine-pmuldq.ll to demonstrate where we fail to use pmuldq/pmuludq and use to pmullq instead. It's nice that pmullq exists, but it has higher latency and probably lower throughput than pmuldq/pmuludq. We should prefer those if we can. llvm-svn: 321436	2017-12-25 06:47:08 +00:00
Don Hinton	130f9a4a59	[cmake] Always respect existing CMAKE_REQUIRED_FLAGS when adding additional ones. Summary: Always respect existing CMAKE_REQUIRED_FLAGS when adding additional ones. This is important when cross compiling where --sysroot and -target were already added. In particular, this is needed when cross compiling from Darwin to Linux, since --sysroot is required to find headers and libraries. Cmake has a similar bug in check_include_file[_cxx] where CMAKE_REQUIRED_LIBRARIES isn't passed, which causes try_compile to fail. (please see https://gitlab.kitware.com/cmake/cmake/merge_requests/1620) Reviewers: compnerd, silvas, beanz, brad.king Reviewed By: compnerd Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D41568 llvm-svn: 321434	2017-12-25 01:23:09 +00:00
Craig Topper	18c5d6d384	[X86] Make some helper methods static functions instead. NFC llvm-svn: 321433	2017-12-25 00:54:53 +00:00
Craig Topper	08af9fe055	[X86] Use SelectionDAG::getFPExtendOrRound to simplify some code. llvm-svn: 321432	2017-12-25 00:54:51 +00:00
Simon Pilgrim	bec6592804	[X86][AVX] Add AVX1/AVX2 vmul tests llvm-svn: 321426	2017-12-24 12:51:54 +00:00
Benjamin Kramer	525d7e2ad5	Make helpers static. No functionality change. llvm-svn: 321425	2017-12-24 12:46:22 +00:00
Simon Pilgrim	7bfc03133a	[X86][X87] Mark pseudo memory fold instructions as load/sideeffects (PR21160, PR34080, PR34454). Match regular x87 memory fold instructions with load/sideeffects tags, to prevent the schedulers from re-ordering them across the fnstcw/fldcw sequences for truncating stores while they are still pseudo during the stack conversion pass. llvm-svn: 321424	2017-12-24 12:20:21 +00:00
Simon Pilgrim	cf86ce8687	[X86][X87] Renamed CHECK prefix, its not actually broken anymore just scheduled differently llvm-svn: 321423	2017-12-24 10:25:01 +00:00
Simon Pilgrim	f8ad83991f	[X86][X87] Add another test case mentioned on PR34080 Did my best to reduce this, but the X87 scheduling bug is hard to hit at the best of times... llvm-svn: 321422	2017-12-24 10:22:55 +00:00
Craig Topper	7136c64e49	[X86] Fix (v2f64 (s/uint_to_fp (v2i1))) to avoid scalarization without AVX512DQ. Previously we extended v2i1 to v2f64 and then tried to use cvtuqq2pd/cvtqq2pd, but that only works with avx512dq. So we ended up scalarizing it. Now we widen to v4i1 first and extend to v4i32. llvm-svn: 321420	2017-12-24 06:51:36 +00:00
George Rimar	1c5f654779	[MC] - Teach llvm-mc to handle comdats whose names are numbers. Currently llvm-mc ignores COMDATs whose names are numbers, for example following code: .section .foo,"G",@progbits,123,comdat would produce no COMDATs at all. Patch fixes the issue. Differential revision: https://reviews.llvm.org/D41552 llvm-svn: 321419	2017-12-24 06:13:36 +00:00
Craig Topper	bc75d9ff7b	[DAGCombiners] Don't turn ANDs to shuffles with zero so early. Give some other combines a chance to run. This moves the combine for turning ANDs into shuffle with zero out of SimplifyVBinOps and places it only in visitAND below the reassociate handling. This fixes the specific case I noticed where we failed to combine two ands with constants. llvm-svn: 321417	2017-12-24 02:05:18 +00:00
Craig Topper	7679c60646	[X86] Add assembler predicates to BITALG/VBMI2/VNNI features to be consistent with the other AVX512 ISAs. llvm-svn: 321416	2017-12-24 02:05:17 +00:00
Craig Topper	3628945622	[X86] Teach WidenMaskArithmetic to handle any constant buildvector on the RHS not just all zeros/ones. llvm-svn: 321415	2017-12-24 01:03:31 +00:00
Craig Topper	304bc9d6c9	[SelectionDAG] Teach SelectionDAG::getNode to constant fold zext/aext/sext of constant build vectors. llvm-svn: 321414	2017-12-23 20:21:29 +00:00
Florian Hahn	d971af6098	[CallSiteSplitting] Remove isOrHeader restriction. By following the single predecessors of the predecessors of the call site, we do not need to restrict the control flow. Reviewed By: junbuml, davide Differential Revision: https://reviews.llvm.org/D40729 llvm-svn: 321413	2017-12-23 20:02:26 +00:00
Craig Topper	a03424730d	[X86] Remove type restrictions from WidenMaskArithmetic. This can help AVX-512 code where mask types are legal allowing us to remove extends and truncates to/from mask types. llvm-svn: 321408	2017-12-23 18:53:05 +00:00
Craig Topper	42b3f68bf2	[X86] In WidenMaskArithmetic, make sure we check the input type of a truncate on N1. Later in the code we explicitly bypass the truncate so we should be checking its type to make sure that it's safe. llvm-svn: 321407	2017-12-23 18:53:03 +00:00
Craig Topper	aca2ae6aae	[X86] Remove unneeded EVT variable. NFC Immediately after it is created we check if its equal to another EVT. Then we inconsistently use one or the other variables in the code below. Instead do the equality check directly on the getValueType result and remove the variable. Use the origina VT variable throughout the remaining code. llvm-svn: 321406	2017-12-23 18:53:01 +00:00
Simon Pilgrim	ac253c6d81	[X86][X87] Wrap FpI_ pseudo to use PseudoI. NFCI. llvm-svn: 321405	2017-12-23 17:25:59 +00:00
Davide Italiano	adffd3064b	[SCCP] Manually fold branches on undef. This code was originally removed and replace with an assertion because believed unnecessary. It turns out there was simply no test coverage for this case, and the constant folder doesn't yet know about patterns like `br undef %label1, %label2`. Presumably at some point the constant folder might learn about these patterns, but it's a broader change. A testcase will be added to make sure this doesn't regress again in the future. Fixes PR35723. llvm-svn: 321402	2017-12-23 15:06:30 +00:00
Simon Pilgrim	c5c41fbb40	[X86] Add default InstrItinClass to PseudoI This will be used to help tidyup existing pseudos that we've added scheduling info to. llvm-svn: 321401	2017-12-23 10:47:21 +00:00
Craig Topper	daef9b5802	[X86] Pass the right VT to the getZeroExtendInReg introduced in r321398 Apparently we don't have tests for this which I didn't realize before. I'll try to fix that but wanted to fix the obvious bug. llvm-svn: 321399	2017-12-23 06:52:03 +00:00
Craig Topper	2f03b84c96	[X86] Use SelectionDAG::getZeroExtendInReg instead of implementing it manually. llvm-svn: 321398	2017-12-23 02:54:52 +00:00
Craig Topper	acac80b2f8	[SelectionDAG][X86] Don't use ->getValueType(0) after a call to getOperand to get the type of the operand. getOperand returns an SDValue that contains the node and the result number. There is no guarantee that the result number if 0. By using the -> operator we are calling SDNode::getValueType rather than SDValue::getValueType. This requires supplying a result number and we shouldn't assume it was 0. I don't have a test case. Just noticed while cleaning up some other code and saw that it occurred in other places. llvm-svn: 321397	2017-12-23 02:54:50 +00:00
Nirav Dave	787b28f8c8	[DAG] Add missing case check from findbaseoffset merge from r321389. llvm-svn: 321391	2017-12-22 22:06:56 +00:00
Nirav Dave	5c5bf3bc37	Integrate findBaseOffset address analyses to BaseIndexOffset. NFCI. BaseIndexOffset supercedes findBaseOffset analysis save only Constant Pool addresses. Migrate analysis to BaseIndexOffset. Relanding after correcting base address matching check. llvm-svn: 321389	2017-12-22 21:20:55 +00:00
Walter Lee	1392978da7	[git-llvm] Handle files ignored by svn correctly Summary: Correctly handle files ignored by svn (such as .o files, which are ignored by default) by adding "--no-ignore" flag to "svn status" and "svn add". Differential Revision: https://reviews.llvm.org/D41404 llvm-svn: 321388	2017-12-22 21:19:13 +00:00
Benjamin Kramer	18a651036a	Unbreak the build. Combining chrono with Optional is annoying. llvm-svn: 321387	2017-12-22 21:18:50 +00:00
Sam Clegg	3181a5120d	[WebAssembly] MC: Fix for address taken aliases Previously, taking the address for an alias would result in: "Symbol not found in table index space" Increase test coverage for weak aliases. This code should be more efficient too as it avoids building the `IsAddressTaken` set. Differential Revision: https://reviews.llvm.org/D41510 llvm-svn: 321384	2017-12-22 20:31:39 +00:00
Alina Sbirlea	ba7653634f	[MemorySSA] Allow reordering of loads that alias in the presence of volatile loads. Summary: Make MemorySSA allow reordering of two loads that may alias, when one is volatile. This makes MemorySSA less conservative and behaving the same as the AliasSetTracker. For more context, see D16875. LLVM language reference: "The optimizers must not change the number of volatile operations or change their order of execution relative to other volatile operations. The optimizers may change the order of volatile operations relative to non-volatile operations. This is not Java’s “volatile” and has no cross-thread synchronization behavior." Reviewers: george.burgess.iv, dberlin Subscribers: sanjoy, reames, hfinkel, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D41525 llvm-svn: 321382	2017-12-22 19:54:03 +00:00
Nirav Dave	4e214940fa	Revert "[DAG] Integrate findBaseOffset address analyses to BaseIndexOffset. NFCI." which was causing miscompilations in for some test-suite components. This reverts commit 3e9de9ff0f3162920a2a3cba51c7dc14b54b4d16. llvm-svn: 321380	2017-12-22 19:33:56 +00:00
Guozhi Wei	acd7d48f71	[SimplifyCFG] Don't do if-conversion if there is a long dependence chain If after if-conversion, most of the instructions in this new BB construct a long and slow dependence chain, it may be slower than cmp/branch, even if the branch has a high miss rate, because the control dependence is transformed into data dependence, and control dependence can be speculated, and thus, the second part can execute in parallel with the first part on modern OOO processor. This patch checks for the long dependence chain, and give up if-conversion if find one. Differential Revision: https://reviews.llvm.org/D39352 llvm-svn: 321377	2017-12-22 18:54:04 +00:00
Ben Dunbobbin	327736f089	[ThinLTO][CachePruning] explicitly disable pruning In https://reviews.llvm.org/rL321077 and https://reviews.llvm.org/D41231 I fixed a regression in the c-api which prevented the pruning from being effectively disabled. However this approach, helpfully recommended by @labath, is cleaner. It is also nice to remove the weasel words about effectively disabling from the api comments. Differential Revision: https://reviews.llvm.org/D41497 llvm-svn: 321376	2017-12-22 18:32:15 +00:00
Sanjoy Das	df40ece177	(Re-landing) Expose a TargetMachine::getTargetTransformInfo function Re-land r321234. It had to be reverted because it broke the shared library build. The shared library build broke because there was a missing LLVMBuild dependency from lib/Passes (which calls TargetMachine::getTargetIRAnalysis) to lib/Target. As far as I can tell, this problem was always there but was somehow masked before (perhaps because TargetMachine::getTargetIRAnalysis was a virtual function). Original commit message: This makes the TargetMachine interface a bit simpler. We still need the std::function in TargetIRAnalysis to avoid having to add a dependency from Analysis to Target. See discussion: http://lists.llvm.org/pipermail/llvm-dev/2017-December/119749.html I avoided adding all of the backend owners to this review since the change is simple, but let me know if you feel differently about this. Reviewers: echristo, MatzeB, hfinkel Reviewed By: hfinkel Subscribers: jholewinski, jfb, arsenm, dschuff, mcrosier, sdardis, nemanjai, nhaehnle, javed.absar, sbc100, jgravelle-google, aheejin, kbarton, llvm-commits Differential Revision: https://reviews.llvm.org/D41464 llvm-svn: 321375	2017-12-22 18:21:59 +00:00
Dmitry Preobrazhensky	db06df90f8	[AMDGPU][MC] Corrected handling of negative expressions See bug 35716: https://bugs.llvm.org/show_bug.cgi?id=35716 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D41488 llvm-svn: 321372	2017-12-22 18:03:35 +00:00
Craig Topper	e32e202b52	[SelectionDAG] Reverse the order of operands in the ISD::ADD created by TargetLowering::getVectorElementPointer so that the FrameIndex is on the left. This seems to improve X86's ability to match this into an address computation. Otherwise the other operand gets assigned to the base register and the stack pointer + frame index ends up in the index register. But index registers can't encode ESP/RSP so we end up having to move it into another register to meet the constraint. I could try to improve the address matcher in X86, but swapping the producer seemed easier. Several other places already have the operands in this order so this is at least consistent. llvm-svn: 321370	2017-12-22 17:18:13 +00:00
Craig Topper	acd88472c6	[X86] When lowering insert_vector_elt/extract_vector_elt of vXi1 with a non-constant index just use either a 128-bit type or the vXi8 type with the correct number of elements. Despite what the comment said there isn't better codegen for 512-bit vectors. The 128/256/512 bit implementation jus stores to memory and loads an element. There's no advantage to doing that with a larger size. In fact in many cases it causes a stack realignment and generates worse code. llvm-svn: 321369	2017-12-22 17:18:11 +00:00
Craig Topper	15d0e14230	[X86] Improve the printing of address mode during isel matching. Fix some inconsistent new line behavior and only print the FrameIndex when the address mode is a FrameIndexBase addressing mode. llvm-svn: 321368	2017-12-22 17:18:10 +00:00
Dmitry Preobrazhensky	e80d391b33	[AMDGPU][MC] Corrected parsing of optional operands for ds_swizzle_b32 See bug 35645: https://bugs.llvm.org/show_bug.cgi?id=35645 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D41186 llvm-svn: 321367	2017-12-22 17:13:28 +00:00
Haicheng Wu	00cf1cbef6	[InlineCost] Find more free binary operations Currently, inline cost model considers a binary operator as free only if both its operands are constants. Some simple cases are missing such as a + 0, a - a, etc. This patch modifies visitBinaryOperator() to call SimplifyBinOp() without going through simplifyInstruction() to get rid of the constant restriction. Thus, visitAnd() and visitOr() are not needed. Differential Revision: https://reviews.llvm.org/D41494 llvm-svn: 321366	2017-12-22 17:09:09 +00:00
Nirav Dave	b46f9f4b18	[DAG] Integrate findBaseOffset address analyses to BaseIndexOffset. NFCI. BaseIndexOffset supercedes findBaseOffset analysis save only Constant Pool addresses. Migrate analysis to BaseIndexOffset. llvm-svn: 321364	2017-12-22 16:59:09 +00:00
Dmitry Preobrazhensky	b8925d0036	[AMDGPU][MC] Added support of 256- and 512-bit tuples of ttmp registers See bug 35561: https://bugs.llvm.org/show_bug.cgi?id=35561 This patch also affects implementation of SGPR and VGPR registers though changes are cosmetic. Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D41437 llvm-svn: 321359	2017-12-22 15:18:06 +00:00
Simon Atanasyan	7212902956	[mips] Add test case to check that calls to mcount follow long calls / short calls options. NFC llvm-svn: 321357	2017-12-22 13:45:46 +00:00
Diana Picus	f6398cc5f8	[ARM GlobalISel] Support G_INTTOPTR and G_PTRTOINT for s32 Mark conversions between pointers and 32-bit scalars as legal, map them to the GPR and select to a simple COPY. llvm-svn: 321356	2017-12-22 13:05:51 +00:00
Diana Picus	82df1867df	[ARM GlobalISel] Support pointer constants Pointer constants are pretty rare, since we usually represent them as integer constants and then cast to pointer. One notable exception is the null pointer constant, which is represented directly as a G_CONSTANT 0 with pointer type. Mark it as legal and make sure it is selected like any other integer constant. llvm-svn: 321354	2017-12-22 11:09:18 +00:00
Sam Parker	9c30a4004c	[DAGCombine] Revert r321259 Improve ReduceLoadWidth for SRL Patch is causing an issue on the PPC64 BE santizer. llvm-svn: 321349	2017-12-22 08:36:25 +00:00
Chandler Carruth	5b662c8a1e	Rewrite the cached map used for locating the most precise DIE among inlined subroutines for a given address. This is essentially the hot path of llvm-symbolizer when extracting inlined frames during symbolization. Previously, we would read every subprogram and every inlined subroutine, building a std::map across the entire PC space to the best DIE, and then do only a handful of queries as we symbolized a backtrace. A huge fraction of the time was spent building the map itself. This patch changes it two a two-level system. First, we just build a map from PC-interval to DWARF subprograms. These are required to be disjoint and so constructing this is pretty easy. Second, we build a map just for the inlined subroutines within the subprogram containing the query address. This allows us to look at far fewer DIEs and build a much smaller set of cached maps in the llvm-symbolizer case where only a few address get symbolized during the entire run. It also builds both interval maps in a very different way. It constructs a single flat vector of pairs that maps from offset -> index. The indices point into collections of DIE objects, but can also be "tombstones" (-1) to mark gaps. In the case of subprograms, this mostly just simplifies the data structure a bit. For inlined subroutines, because we carefully split them as we build the map, we end up in many cases having no holes and not having to store both start and stop offsets. Finally, the PC ranges for the inlined subroutines are compressed into 32-bits by making them relative to the base PC of the outer subprogram. This means that if you have a single function body with over 2gb of executable code in it, we will stop mapping address past the first 2gb of that function into inlined subroutines and just give you the subprogram. This doesn't seem like a problem. ;] All of this combines to make llvm-symbolizer well over 2x faster for symbolizing backtraces out of LLVM's unittests. Death-test heavy unit tests are running >2x faster. I'm still going to look at completely disabling symbolization there, but figured while I had a good benchmark we should make symbolization a bit better. Sadly, the logic to build the flat interval map for the inlined subroutines is fairly complex. I'm not super happy about this and welcome any simplifying suggestions. Huge thanks to Dave Blaikie who helped walk me through what the various things I needed to do in DWARF to make this work. Differential Revision: https://reviews.llvm.org/D40987 llvm-svn: 321345	2017-12-22 06:41:23 +00:00
Craig Topper	e8c3eaaf3a	[X86] Add missing initialization for the HasPREFETCHWT1 subtarget variable. llvm-svn: 321340	2017-12-22 03:53:14 +00:00
Craig Topper	992ccaf509	[X86] Enable PRFCHW feature on KNL/KNM and all CPUs inherited from Broadwell. llvm-svn: 321336	2017-12-22 02:41:12 +00:00

1 2 3 4 5 ...

158351 Commits