llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Amjad Aboud	ba09d82dc0	Another try to commit 323321 (aggressive instruction combine). llvm-svn: 323416	2018-01-25 12:06:32 +00:00
George Rimar	ba9f28eca0	[LTO] - Get rid of friend 'computeDeadSymbols'. NFC. computeDeadSymbols accessed isLive() which was not public before. It does not make much sense to keep isLive() private because flags are available via flags() public member anyways. llvm-svn: 323415	2018-01-25 11:45:02 +00:00
Jonas Devlieghere	33eed2b799	[Dwarf] Add dsymutil Atom extensions. NFC This patch extends the atom types used by the Apple accelerator tables with two dsymutil extensions: - DW_ATOM_type_type_flags - DW_ATOM_qual_name_hash llvm-svn: 323414	2018-01-25 11:19:08 +00:00
Mikael Holmen	7a1f7a9d90	[GlobalOpt] Emit fragments using field offsets from struct layout Summary: When creating the debug fragments for a SRA'd struct, use the fields' offsets, taken from the struct layout, as the offsets for the resulting fragments. This fixes an issue where GlobalOpt would emit fragments with incorrect offsets for padded fields. This should solve PR36016. Patch by David Stenberg. Reviewers: aprantl Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42489 llvm-svn: 323411	2018-01-25 10:09:26 +00:00
Igor Laevsky	0fefdb08dc	[FuzzMutate] Inst deleter doesn't work with PhiNodes Differential Revision: https://reviews.llvm.org/D42412 llvm-svn: 323409	2018-01-25 09:22:18 +00:00
Eugene Leviant	449a2735e1	[IRMover] Add comment and fix test case llvm-svn: 323407	2018-01-25 08:35:52 +00:00
Craig Topper	ce1999a1e1	[X86] Expand IMUL/MUL instregexs in Intel scheduler models. Add load latency to some of them in SkylakeClient model. The regular expressions and the imul names caused some instructions to be matched by multiple regexs creating unpredictable results. This changes them all to use explicit instrs instead. While doing this I also found that some instructions in Skylake were missing load latency so I fixed that too. llvm-svn: 323406	2018-01-25 06:57:42 +00:00
Craig Topper	3a3e63c149	[X86] Expand IMUL/MUL instregexs in Znver1 scheduler to show what's actually implemented. The IMUL instruction names mixed with the prefix matching of the instregex lead to some strange matches. The worst being that several memory instructions are using the register form latency. I don't know what the right answer is, so I've left TODOs and will try to work with the AMD folks to get this cleaned up. llvm-svn: 323405	2018-01-25 06:57:39 +00:00
Don Hinton	7c9b4f6a12	[cmake] Set cmake policy CMP0068 to suppress warnings on OSX Set cmake policy CMP0068=NEW, if available, and set "CMAKE_BUILD_WITH_INSTALL_NAME_DIR=On" globally to maintain current behavior. This is needed to suppress warnings on OSX starting with cmake version 3.9.6. Differential Revision: https://reviews.llvm.org/D42463 llvm-svn: 323404	2018-01-25 04:55:18 +00:00
Craig Topper	fae7861fa0	[X86] Name the MMX phaddd instruction with 3 Ds instead of just 2. NFC llvm-svn: 323403	2018-01-25 04:45:32 +00:00
Craig Topper	cc7c2fbcc1	[X86] Remove 64/128/256 from MMX/SSE/AVX instruction names for overall consistency. NFC MMX instructions all start with MMX_ so the 64 isn't needed for disambiguation. SSE/AVX1 instructions are assumed 128-bit so we don't need to say 128. AVX2 instructions should use a Y to indicate 256-bits. llvm-svn: 323402	2018-01-25 04:45:30 +00:00
Craig Topper	0a6f713634	[X86] Remove unnecessary '_alt' and '_Int' from scheduler model regular expressions. These were treated as optional suffixes, but the regular expressions are already prefix matches so this is unnecessary. It breaks the binary search optimization in tablegen due to the top level question mark. llvm-svn: 323401	2018-01-25 04:45:28 +00:00
Aditya Nandakumar	af9f61606c	Add support for pattern matching MachineInsts. https://reviews.llvm.org/D42439 Add Instcombine like matchers for MachineInstructions. There are only globalISel matchers for now. llvm-svn: 323400	2018-01-25 02:53:06 +00:00
Lang Hames	0943412040	[ORC] Refactor the various lookupFlags methods to return the flags map via the first argument. This makes lookupFlags more consistent with lookup (which takes the query as the first argument) and composes better in practice, since lookups are usually linearly chained: Each lookupFlags can populate the result map based on the symbols not found in the previous lookup. (If the maps were returned rather than passed by reference there would have to be a merge step at the end). llvm-svn: 323398	2018-01-25 01:43:00 +00:00
Aditya Nandakumar	44de88f9fc	[GISel]: Fix modules build by including <cassert> llvm-svn: 323394	2018-01-25 01:16:14 +00:00
Lang Hames	c647b54b9d	[ORC] Try to silence compiler error at http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/17264 NFC. llvm-svn: 323393	2018-01-25 01:05:29 +00:00
Aditya Nandakumar	65202b9329	[GISel]: Implement GlobalISel combiner API. https://reviews.llvm.org/D41373 The various components are GICombinerHelper contains transformations that are common to all targets. Targets can pick and choose which transformations (at function/opcode granularity) each pass uses via configuring a GICombinerInfo. GICombiner contains some common code and it does the traversal, driving of combines, worklist management and iterating until convergence. GICombinerInfo is an interface with a virtual method called combine. The combiner info will allow targets to pick and choose (or implement their own specific combines). CombineInfos can make use of available combines in GICombineHelper to configure the transformations for a particular pass. Currently this approach allows cherry picking transformations from helpers (at function/opcode granularity) and also allows early returning on specific transformations. Targets also get to prioritize whether target specific combines run before/after the opt-in generic combines. Ideally we would like this part to be configured by both C++ and Tablegen. The CombinerInfo also has a field which indicates how to deal with IllegalOps (ie - should we allow to create them/or legalize them?). A CombinerPass would configure a CombinerInfo, create the GICombiner with the Info, and call GICombiner::combineMachineInstrs(MachineFunction&). This organization is very similar to the GISelLegalizer. llvm-svn: 323392	2018-01-25 00:41:58 +00:00
Volkan Keles	081998b7f1	[GlobalISel][TableGen] Fix the statistics for emitted patterns Collected statistics for the number of patterns emitted can be incorrect because rules can be grouped if OptimizeMatchTable is enabled. Increase the counter in RuleMatcher::emit(...) to avoid that. llvm-svn: 323391	2018-01-25 00:18:52 +00:00
Lang Hames	e2eabc82ff	[ORC] Add helpers for building orc::SymbolResolvers from legacy findSymbol-style functions/methods that return JITSymbols. lookupFlagsWithLegacyFn takes a SymbolNameSet and a legacy lookup function and returns a LookupFlagsResult. It uses the legacy lookup function to search for each symbol. If found, getFlags is called on the symbol and the flags added to the SymbolFlags map. If not found, the symbol is added to the SymbolsNotFound set. lookupWithLegacyFn takes an AsynchronousSymbolQuery, a SymbolNameSet and a legacy lookup function. Each symbol in the SymbolNameSet is searched for via the legacy lookup function. If it is found, its getAddress function is called (triggering materialization if it has not happened already) and the resulting mapping stored in the query. If it is not found the symbol is added to the unresolved symbols set which is returned at the end of the function. If an error occurs during legacy lookup or materialization it is passed to the query via setFailed and the function returns immediately. llvm-svn: 323388	2018-01-24 23:09:07 +00:00
Amara Emerson	96635ca939	[GlobalISel] Add a requires: asserts to a test. llvm-svn: 323384	2018-01-24 22:40:25 +00:00
Benjamin Kramer	96a9d2feef	[TableGen] Add a way of getting the number of generic opcodes without including modular CodeGen headers. This is a bit of a hack, but removes a cycle that broke modular builds of LLVM. Of course the cycle is still there in form of a dependency on the .def file. llvm-svn: 323383	2018-01-24 22:35:11 +00:00
Sanjay Patel	2e62e2ea1b	[InstCombine] fix datalayout in test file The only part of the datalayout that should matter for these tests is the part that specifies the legal int widths ('n*'). But there was a bug - that part of the string was not correctly separated with the expected '-' character, so we were testing as if there were no legal int widths at all. Removed the leading cruft so we have some legal ints to test with. I noticed this while testing a potential change to the way we transform shifts and sexts in D42424. llvm-svn: 323377	2018-01-24 21:36:45 +00:00
Lang Hames	6e0cc41ad1	[ORC] Add a LambdaSymbolResolver convenience class and docs for SymbolResolver. This patch adds a LambdaSymbolResolver convenience utility that can create an orc::SymbolResolver from a pair of function objects that supply the behavior for the lookupFlags and lookup methods. This class plays the same role for orc::SymbolResolver as the legacy LambdaResolver class plays for LegacyJITSymbolResolver, and will replace the latter class once all ORC APIs are migrated to orc::SymbolResolver. This patch also adds some documentation for the orc::SymbolResolver class as this was left out of the original commit. llvm-svn: 323375	2018-01-24 21:21:10 +00:00
Krzysztof Parzyszek	15306dc353	[Hexagon] Replace EmitFunctionEntryCode with a DAG preprocessing code The code in EmitFunctionEntryCode needs to know the maximum stack alignment, but it runs very early in the selection process (before lowering). The final stack alignment may change during lowering, so the code needs to be moved to where the alignment is known. llvm-svn: 323374	2018-01-24 21:19:51 +00:00
Daniel Sanders	b40cd97d7f	[globalisel] Fix long lines from r323342 They would be fixed in a later patch but they shouldn't have been introduced. llvm-svn: 323372	2018-01-24 20:43:21 +00:00
Amara Emerson	3e42041b2e	[AArch64][GlobalISel] Fall back during AArch64 isel if we have a volatile load. The tablegen imported patterns for sext(load(a)) don't check for single uses of the load or delete the original after matching. As a result two loads are left in the generated code. This particular issue will be fixed by adding support for a G_SEXTLOAD opcode in future. There are however other potential issues around this that wouldn't be fixed by a G_SEXTLOAD, so until we have a proper solution we don't try to handle volatile loads at all in the AArch64 selector. Fixes/works around PR36018. llvm-svn: 323371	2018-01-24 20:35:37 +00:00
Amara Emerson	0f8c9d286a	[GlobalISel] Don't fall back to FastISel. Apparently checking the pass structure isn't enough to ensure that we don't fall back to FastISel, as it's set up as part of the SelectionDAGISel. llvm-svn: 323369	2018-01-24 19:59:29 +00:00
Simon Pilgrim	966ca434a4	[X86][SSE] Aggressively use PMADDWD for v4i32 multiplies with 17 or more leading zeros As discussed in D41484, PMADDWD for 'zero extended' vXi32 is nearly always a better option than PMULLD: On SNB it will result in code that isn't any faster, but not any slower so we may as well keep it. On KNL it only has half the throughput, so I've disabled it on there - ideally there'd be a better way than this. Differential Revision: https://reviews.llvm.org/D42258 llvm-svn: 323367	2018-01-24 19:20:02 +00:00
Rafael Espindola	a39721e808	Simplify. NFC. Thanks to Teresa Johnson for the suggestion. llvm-svn: 323365	2018-01-24 19:11:24 +00:00
Simon Pilgrim	a1e5ae82ac	[X86][SSE] Add slow-pmulld attribute (silvermont-style) test Requested by @zvi on D42258 llvm-svn: 323364	2018-01-24 19:09:11 +00:00
Alexey Bataev	e0ff672468	Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle." This reverts commit r323348 because of the broken buildbots. llvm-svn: 323359	2018-01-24 18:36:51 +00:00
Easwaran Raman	144f3acb63	Revert "[ThinLTO] Add call edges' relative block frequency to per-module summary." Causes buildbot regressions. llvm-svn: 323358	2018-01-24 18:15:29 +00:00
Paul Robinson	76705a7388	Fix up and document controlling ccache via CMake options. Patch by Matthew Davis! Differential Revision: https://reviews.llvm.org/D41757 llvm-svn: 323357	2018-01-24 18:15:08 +00:00
Geoff Berry	0620939033	[AMDGPU] Make sure all super regs of reserved regs are marked reserved. Summary: Move reserveRegisterTuples into AMDGPURegisterInfo and use it in R600RegisterInfo::getReservedRegs and R600InstrInfo::reserveIndirectRegisters to ensure that all super registers of reserved registers are also marked as reserved. Before this change, under certain circumstances, the registers %t1_x and %t1_xyzw would be marked as reserved, but %t1_xy and %t1_xyz would not be, leading to the register allocator sometimes assigning a register to %t1_xy, which is invalid since %t1_x is reserved. Reviewers: arsenm, tstellar, MatzeB, qcolombet Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D42448 llvm-svn: 323356	2018-01-24 18:09:53 +00:00
Nicolai Haehnle	1ae074fb0f	Revert r321751, "StructurizeCFG: Fix broken backedge detection" It causes regressions in various OpenGL test suites. Keep the test cases introduced by r321751 as XFAIL, and add a test case for the regression. Change-Id: I90b4cc354f68cebe5fcef1f2422dc8fe1c6d3514 Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=36015 llvm-svn: 323355	2018-01-24 18:02:05 +00:00
Weiming Zhao	312b7968f5	[ARM] Expand long shifts for Thumb1 to __aeabi_ calls Summary: For long shifts, the inlined version takes about 20 instructions on Thumb1. To avoid the code bloat, expand to __aeabi_ calls if target is Thumb1. Reviewers: samparker Reviewed By: samparker Subscribers: samparker, aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42401 llvm-svn: 323354	2018-01-24 18:00:57 +00:00
Craig Topper	838dbb5ab9	[X86] Fix some inconsistencies in the itineraries and Sched for (V)PEXTRW/(V)PINSRW The weirdest being that PEXTRWrr was tagged as a memory operation. llvm-svn: 323353	2018-01-24 17:58:57 +00:00
Craig Topper	34467739ab	[X86] Adjust names of PINSRW/PEXTRW instructions between MMX/SSE/AVX/AVX512 for consistency and to maybe enable more regular expression compaction in the scheduler models. NFCI llvm-svn: 323352	2018-01-24 17:58:51 +00:00
Craig Topper	650a89f354	[X86] Remove '(_REV)?' from a bunch of scheduler regular expressions. NFC The regexs are treated as a prefix match already so the checking for optional text at the end provides no value. Instead it prevents the binary search optimization in tablegen from kicking in due to the top level question mark. llvm-svn: 323351	2018-01-24 17:58:42 +00:00
Easwaran Raman	e7546e2838	[ThinLTO] Add call edges' relative block frequency to per-module summary. Summary: This allows relative block frequency of call edges to be passed to the thinlink stage where it will be used to compute synthetic entry counts of functions. Reviewers: tejohnson, pcc Subscribers: mehdi_amini, llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D42212 llvm-svn: 323349	2018-01-24 17:51:23 +00:00
Alexey Bataev	d796c9f58d	[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle. Summary: If the same value is going to be vectorized several times in the same tree entry, this entry is considered to be a gather entry and cost of this gather is counted as cost of InsertElementInstrs for each gathered value. But we can consider these elements as ShuffleInstr with SK_PermuteSingle shuffle kind. Reviewers: spatel, RKSimon, mkuper, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38697 llvm-svn: 323348	2018-01-24 17:50:53 +00:00
Krzysztof Parzyszek	b5d87e74ca	[Hexagon] Run late copy propagation and dead code elimination passes llvm-svn: 323346	2018-01-24 17:48:11 +00:00
Rafael Espindola	5a7adaa761	Handle R_386_PLT32 in RuntimeDyldELF. This should fix the 32 bit buildbots. llvm-svn: 323344	2018-01-24 17:36:08 +00:00
Zvi Rackover	2888e5cce2	InstSimplify: If divisor element is undef simplify to undef Summary: If any vector divisor element is undef, we can arbitrarily choose it to be zero which would make the div/rem an undef value by definition. Reviewers: spatel, reames Reviewed By: spatel Subscribers: magabari, llvm-commits Differential Revision: https://reviews.llvm.org/D42485 llvm-svn: 323343	2018-01-24 17:22:00 +00:00
Daniel Sanders	99f8a8b118	[globalisel] Introduce LegalityQuery to better encapsulate the legalizer decisions. NFC. Summary: `getAction(const InstrAspect &) const` breaks encapsulation by exposing the smaller components that are used to decide how to legalize an instruction. This is a problem because we need to change the implementation of LegalizerInfo so that it's able to describe particular type combinations rather than just cartesian products of types. For example, declaring the following setAction({..., 0, s32}, Legal) setAction({..., 0, s64}, Legal) setAction({..., 1, s32}, Legal) setAction({..., 1, s64}, Legal) currently declares these type combinations as legal: {s32, s32} {s64, s32} {s32, s64} {s64, s64} but we currently have no means to say that, for example, {s64, s32} is not legal. Some operations such as G_INSERT/G_EXTRACT/G_MERGE_VALUES/ G_UNMERGE_VALUES have relationships between the types that are currently described incorrectly. Additionally, G_LOAD/G_STORE currently have no means to legalize non-atomics differently to atomics. The necessary information is in the MMO but we have no way to use this in the legalizer. Similarly, there is currently no way for the register type and the memory type to differ so there is no way to cleanly represent extending-load/truncating-store in a way that can't be broken by optimizers (resulting in illegal MIR). This patch introduces LegalityQuery which provides all the information needed by the legalizer to make a decision on whether something is legal and how to legalize it. Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar, volkan, reames, bogner Reviewed By: bogner Subscribers: bogner, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D42244 llvm-svn: 323342	2018-01-24 17:17:46 +00:00
Jonas Devlieghere	bab446772d	[NFC] Make magic number for DJB hash function customizable. This allows us to specify the magic number for the DJB hash function. This feature is needed by dsymutil to emit Apple types accelerator table. llvm-svn: 323341	2018-01-24 16:53:14 +00:00
Jonas Devlieghere	29cb18004b	[dsymutil] Make NonRelocatableStringPool a wrapper around DwarfStringPoolEntry. NFC This is needed in order to use our StringPool entries in the Apple accelerator tables. As this is NFC we rely on the existing tests for correctness. llvm-svn: 323339	2018-01-24 16:16:43 +00:00
Sanjay Patel	373af89ec1	[ValueTracking] add recursion depth param to matchSelectPattern We're getting bug reports: https://bugs.llvm.org/show_bug.cgi?id=35807 https://bugs.llvm.org/show_bug.cgi?id=35840 https://bugs.llvm.org/show_bug.cgi?id=36045 ...where we blow up the stack in value tracking because other passes are sending in selects that have an operand that is itself the select. We don't currently have a reliable way to avoid analyzing dead code that may take non-standard forms, so bail out when things go too far. This mimics the recursion depth limitations in other parts of value tracking. Unfortunately, this pushes the underlying problems for other passes (jump-threading, simplifycfg, correlated-propagation) into hiding. If someone wants to uncover those again, the first draft of this patch on Phab would do that (it would assert rather than bail out). Differential Revision: https://reviews.llvm.org/D42442 llvm-svn: 323331	2018-01-24 15:20:37 +00:00
Zvi Rackover	20684de509	X86 Tests: Add more sdiv combine cases. NFC Add cases with vector non-splat pow2 constant divider. llvm-svn: 323329	2018-01-24 15:02:16 +00:00
Simon Pilgrim	d5d1ff89f8	Regenerate shuffle sink test llvm-svn: 323328	2018-01-24 14:59:02 +00:00
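
Commit bab446772d above makes the magic number of the DJB hash function customizable. The idea can be sketched in Python as follows. This is a minimal illustration, not LLVM's code: it assumes the classic Bernstein form (multiply by 33, add the next byte, wrap to 32 bits) with the conventional starting value 5381, and the names `djb_hash` and `seed` are hypothetical.

```python
def djb_hash(data: bytes, seed: int = 5381) -> int:
    """Bernstein (DJB) hash over bytes, wrapped to 32 bits.

    `seed` is the "magic number": the value the hash state starts from.
    The name and default are assumptions made for this sketch.
    """
    h = seed
    for byte in data:
        # h * 33 + byte, truncated to an unsigned 32-bit value
        h = (h * 33 + byte) & 0xFFFFFFFF
    return h


# Same input, different starting values give different hashes.
default_hash = djb_hash(b"main")
custom_hash = djb_hash(b"main", seed=0)
```

Exposing the starting value as a parameter lets a caller that needs a different convention reuse the same loop instead of duplicating it.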

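Commits 650a89f354 and 0a6f713634 above remove optional suffixes such as '(_REV)?' from scheduler regular expressions because the expressions are already treated as prefix matches. The sketch below shows why the suffix is redundant under that rule. It models prefix matching with Python's `re.match`, which anchors only at the start of the string; the instruction names are made up for illustration, and this is not how tablegen itself is implemented.

```python
import re


def prefix_matches(pattern, names):
    """Return the names whose beginning matches `pattern`.

    re.match anchors at the start of the string but not at the end,
    which is the prefix-match behaviour assumed in this sketch.
    """
    regex = re.compile(pattern)
    return [name for name in names if regex.match(name)]


# Hypothetical instruction names, not taken from any real target.
names = ["FOOrr", "FOOrr_REV", "FOOrm", "BARrr"]

with_suffix = prefix_matches(r"FOOrr(_REV)?", names)
without_suffix = prefix_matches(r"FOOrr", names)

# Both patterns select the same set of names.
same_result = with_suffix == without_suffix
```

Since the shorter pattern is a plain literal, dropping the suffix also removes the top-level question mark that, according to the commit messages, prevents tablegen's binary search optimization.
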

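Commit 99f8a8b118 above explains that declaring legality per type index yields a cartesian product of types, so {s64, s32} cannot be excluded. The Python sketch below contrasts that with a rule that looks at the whole type combination at once. All names here (`per_index_legal`, `legal_per_index`, `legal_by_combination`) are hypothetical and only model the idea; they are not the LegalizerInfo or LegalityQuery interfaces.

```python
from itertools import product

# Per-index declarations, modelled on the four setAction calls in the
# commit message: each type index lists its legal types independently.
per_index_legal = {0: {"s32", "s64"}, 1: {"s32", "s64"}}


def legal_per_index(types):
    """Legal if every type is allowed at its own index (cartesian product)."""
    return all(ty in per_index_legal[idx] for idx, ty in enumerate(types))


# Every pairing of the two types is implied legal, including (s64, s32).
implied = sorted(
    combo for combo in product(["s32", "s64"], repeat=2) if legal_per_index(combo)
)

# A combination-based rule sees all types together, so it can leave out
# exactly one pairing.
legal_combinations = {("s32", "s32"), ("s32", "s64"), ("s64", "s64")}


def legal_by_combination(types):
    """Legal only if the whole tuple of types was declared."""
    return tuple(types) in legal_combinations
```

With the per-index model, `legal_per_index(("s64", "s32"))` is true and there is no way to switch it off without also losing other pairings; `legal_by_combination(("s64", "s32"))` is false while the other three combinations stay legal.
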
159321 Commits