llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Matt Arsenault	9419a1ea69	AMDGPU: Use i16 comparison instructions llvm-svn: 290348	2016-12-22 16:27:11 +00:00
Matt Arsenault	0975fb877d	AMDGPU: Fixed '!NodePtr->isKnownSentinel()' assert Caused by dereferencing end iterator when trying to const cast the iterator. Patch by Martin Sherburn llvm-svn: 290347	2016-12-22 16:06:32 +00:00
Davide Italiano	bd0298b1e1	[GVN] Initial check-in of a new global value numbering algorithm. The code has been developed by Daniel Berlin over the years, and the goal of the new implementation is to address shortcomings of the current GVN infrastructure, i.e. long compile time for large testcases, lack of phi predication, no load/store value numbering etc... The current code just implements the "core" GVN algorithm, although other pieces (load coercion, phi handling, predicate system) are already implemented in a branch out of tree. Once the core is stable, we'll start adding pieces on top of the base framework. The tests currently living in test/Transform/NewGVN are a copy of the ones in GVN, with proper `XFAIL` (missing features in NewGVN). A flag will be added in a future commit to enable NewGVN, so that interested parties can exercise this code easily. Differential Revision: https://reviews.llvm.org/D26224 llvm-svn: 290346	2016-12-22 16:03:48 +00:00
Dan Gohman	6463134ea9	[WebAssembly] Add an "explicit" keyword to a constructor. llvm-svn: 290345	2016-12-22 16:03:02 +00:00
Dan Gohman	4f1d71a323	[WebAssembly] Don't use variadic operand indices in the MCOperandInfo array. llvm-svn: 290344	2016-12-22 16:00:55 +00:00
Dan Gohman	ab195580bf	[WebAssembly] Don't fold negative load/store offsets in fast-isel. WebAssembly's load/store offsets are unsigned and don't wrap, so it's not valid to fold in a negative offset. llvm-svn: 290342	2016-12-22 15:15:10 +00:00
Sam Kolton	dc4ffc9328	[AMDGPU] Add pseudo SDWA instructions Summary: This is needed for later SDWA support in CodeGen. Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27412 llvm-svn: 290338	2016-12-22 12:57:41 +00:00
Sam Kolton	0ab0b61c0c	[AMDGPU] Disassembler: fix for disassembling v_mac_f32/16_dpp/sdwa Summary: Real instruction should copy constraints from real instruction. This allows auto-generated disassembler to correctly process tied operands. Reviewers: nhaustov, vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27847 llvm-svn: 290336	2016-12-22 11:30:48 +00:00
Ayman Musa	6f11282fa7	[X86][AVX2] Passing the appropriate memory operand class to VPMADDWD instruction. Replacing the memory operand in the ymm version of VPMADDWD from i128mem to i256mem. Differential Revision: https://reviews.llvm.org/D28024 llvm-svn: 290333	2016-12-22 08:42:46 +00:00
Chandler Carruth	8c6f59de4c	[PM] Loosen the check ever so slightly -- MSVC appears to not include a space after the comma in template arguments with our hacky type name system. llvm-svn: 290331	2016-12-22 07:53:20 +00:00
Chandler Carruth	fc56c99646	[PM] Make a couple of CHECK lines a bit more precise, NFC. I was staring at these and didn't realize these were module-layer proxies as opposed to some other layer. Justin and I have a plan to rename things to make the names themselves much easier to reason about, but I at least want the CHECK lines to be precise for now. llvm-svn: 290328	2016-12-22 07:14:35 +00:00
Chandler Carruth	816d7841f8	[PM] Remove now-dead extern template and explicit instantiation declarations. We're using a custom class here instead of the helper template, these bits just didn't get deleted when the other bits did get deleted. This was found by a really nice MSVC warning about explicitly instantiating a template where some member functions aren't defined and thus can't be instantiated. llvm-svn: 290327	2016-12-22 07:14:33 +00:00
Chandler Carruth	2ed363a464	[PM] Introduce a reasonable port of the main per-module pass pipeline from the old pass manager in the new one. I'm not trying to support (initially) the numerous options that are currently available to customize the pass pipeline. If we end up really wanting them, we can add them later, but I suspect many are no longer interesting. The simplicity of omitting them will help a lot as we sort out what the pipeline should look like in the new PM. I've also documented to the best of my ability why each pass or group of passes is used so that reading the pipeline is more helpful. In many cases I think we have some questionable choices of ordering and I've left FIXME comments in place so we know what to come back and revisit going forward. But for now, I've left it as similar to the current pipeline as I could. Lastly, I've had to comment out several places where passes are not ported to the new pass manager or where the loop pass infrastructure is not yet ready. I did at least fix a few bugs in the loop pass infrastructure uncovered by running the full pipeline, but I didn't want to go too far in this patch -- I'll come back and re-enable these as the infrastructure comes online. But I'd like to keep the comments in place because I don't want to lose track of which passes need to be enabled and where they go. One thing that seemed like a significant API improvement was to require that we don't build pipelines for O0. It seems to have no real benefit. I've also switched back to returning pass managers by value as at this API layer it feels much more natural to me for composition. But if others disagree, I'm happy to go back to an output parameter. I'm not 100% happy with the testing strategy currently, but it seems at least OK. I may come back and try to refactor or otherwise improve this in subsequent patches but I wanted to at least get a good starting point in place. Differential Revision: https://reviews.llvm.org/D28042 llvm-svn: 290325	2016-12-22 06:59:15 +00:00
Adrian Prantl	9e92e0bc4f	Fix an assertion in DwarfExpression when emitting fragments in vector registers When DwarfExpression is emitting a fragment that is located in a register and that fragment is smaller than the register, and the register must be composed from sub-registers (are you still with me?) the last DW_OP_piece operation must not be larger than the size of the fragment itself, since the last piece of the fragment could be smaller than the last subregister that is being emitted. rdar://problem/29779065 llvm-svn: 290324	2016-12-22 06:10:41 +00:00
Adrian Prantl	97f5f281ea	Refactor the DIExpression fragment query interface (NFC) ... so it becomes available to DIExpressionCursor. llvm-svn: 290322	2016-12-22 05:27:12 +00:00
Matt Arsenault	7363b816c7	DAG: Add helper for testing constant values There are helpers for testing for constant or constant build_vector, and for splat ConstantFP vectors, but not for a constantfp or non-splat ConstantFP vector. llvm-svn: 290317	2016-12-22 04:39:45 +00:00
Matt Arsenault	13f610555b	AMDGPU: Fix missing commute table entries for cmpx No tests because these aren't currently used anywhere. llvm-svn: 290316	2016-12-22 04:39:41 +00:00
Mehdi Amini	09072165d5	[ThinLTO] Save 8B per summary entry by rearranging the fields (NFC) Size goes from 72B to 64B per entry. Differential Revision: https://reviews.llvm.org/D27970 llvm-svn: 290314	2016-12-22 04:09:29 +00:00
Matt Arsenault	ee5d8d2da0	AMDGPU: Swap order of operands in fadd/fsub combine FMA is canonicalized to constant in the middle operand. Do the same so fmad matches and avoid an extra combine step. llvm-svn: 290313	2016-12-22 04:03:40 +00:00
Matt Arsenault	9d4a891569	AMDGPU: Check fast math flags in fadd/fsub combines llvm-svn: 290312	2016-12-22 04:03:35 +00:00
Matt Arsenault	dd6b858bbf	AMDGPU: Form more FMAs if fusion is allowed Extend the existing fadd/fsub->fmad combines to produce FMA if allowed. llvm-svn: 290311	2016-12-22 03:55:35 +00:00
Matt Arsenault	f4e299f829	AMDGPU: Move combines into separate functions llvm-svn: 290309	2016-12-22 03:44:42 +00:00
Matt Arsenault	5ba9667c15	AMDGPU: Enable some f32 fadd/fsub combines for f16 llvm-svn: 290308	2016-12-22 03:40:39 +00:00
Matt Arsenault	d7ec3d5ba4	AMDGPU: Implement isFMAFasterThanFMulAndFAdd for f16 llvm-svn: 290307	2016-12-22 03:21:48 +00:00
Matt Arsenault	808557202a	AMDGPU: setcc test cleanup llvm-svn: 290306	2016-12-22 03:21:45 +00:00
Matt Arsenault	5ecf306700	AMDGPU: Allow rcp and rsq usage with f16 llvm-svn: 290302	2016-12-22 03:05:44 +00:00
Matt Arsenault	a844bf67ff	AMDGPU: Custom lower f16 fdiv llvm-svn: 290301	2016-12-22 03:05:41 +00:00
Matt Arsenault	263e20ee06	AMDGPU: Implement f16 fcanonicalize llvm-svn: 290300	2016-12-22 03:05:37 +00:00
Matt Arsenault	5979870866	AMDGPU: Update isFPImmLegal for f16 I don't think this matters because ConstantFP is legal. llvm-svn: 290299	2016-12-22 03:05:30 +00:00
Peter Collingbourne	dd17eca81c	Clear the PendingTypeTests vector after moving from it. This is to put the vector into a well defined state. Apparently the state of a vector after being moved from is valid but unspecified. Found with clang-tidy. llvm-svn: 290298	2016-12-22 02:52:23 +00:00
Haicheng Wu	76f60eec7f	[AArch64] Correct the check of signed 9-bit imm in getIndexedAddressParts(). -256 is a legal indexed address part. Differential Revision: https://reviews.llvm.org/D27537 llvm-svn: 290296	2016-12-22 01:39:24 +00:00
Easwaran Raman	397ecf69ce	Pass GetAssumptionCache to InlineFunctionInfo constructor Differential revision: https://reviews.llvm.org/D28038 llvm-svn: 290295	2016-12-22 01:07:01 +00:00
David Majnemer	8347bff07a	[NVVMIntrRange] Only set range metadata if none is already present The range metadata inserted by NVVMIntrRange is pessimistic, range metadata already present could be more precise. llvm-svn: 290294	2016-12-22 00:51:59 +00:00
Adrian Prantl	1f3fc31b75	Renumber testcase metadata nodes after r290153. This patch renumbers the metadata nodes in debug info testcases after https://reviews.llvm.org/D26769. This is a separate patch because it causes so much churn. This was implemented with a python script that pipes the testcases through llvm-as - | llvm-dis - and then goes through the original and new output side-by-side to insert all comments at a close-enough location. Differential Revision: https://reviews.llvm.org/D27765 llvm-svn: 290292	2016-12-22 00:45:21 +00:00
Adrian Prantl	534ff7f919	[LLParser] Make the line field of DIMacro(File) optional. Otherwise these records do not survive roundtrips. llvm-svn: 290291	2016-12-22 00:29:00 +00:00
Adrian Prantl	6adadd607c	Legalize metadata in legacy testcases llvm-svn: 290288	2016-12-21 23:38:17 +00:00
Adrian Prantl	957c27b43e	Legalize metadata in legacy testcases llvm-svn: 290287	2016-12-21 23:36:06 +00:00
Adrian Prantl	f54db8e4ad	Legalize metadata in legacy testcases llvm-svn: 290286	2016-12-21 23:30:35 +00:00
Adrian Prantl	073dce6100	Legalize metadata in legacy testcases llvm-svn: 290285	2016-12-21 23:28:49 +00:00
Ahmed Bougacha	081c2ca61f	[GlobalISel] Add basic Selector-emitter tblgen backend. This adds a basic tablegen backend that analyzes the SelectionDAG patterns to find simple ones that are eligible for GlobalISel-emission. That's similar to FastISel, with one notable difference: we're not fed ISD opcodes, so we need to map the SDNode operators to generic opcodes. That's done using GINodeEquiv in TargetGlobalISel.td. Otherwise, this is mostly boilerplate, and lots of filtering of any kind of "complicated" pattern. On AArch64, this is sufficient to match G_ADD up to s64 (to ADDWrr/ADDXrr) and G_BR (to B). Differential Revision: https://reviews.llvm.org/D26878 llvm-svn: 290284	2016-12-21 23:26:20 +00:00
Ahmed Bougacha	d0a6918aed	[AsmWriter] Remove redundant cast<>s. NFC. llvm-svn: 290283	2016-12-21 23:26:13 +00:00
Dan Gohman	d29c16d443	[WebAssembly] Fix the opcode value for i64.rotr. llvm-svn: 290281	2016-12-21 23:09:42 +00:00
Peter Collingbourne	5ce602306f	IR: Function summary representation for type tests. Each function summary has an attached list of type identifier GUIDs. The idea is that during the regular LTO phase we would match these GUIDs to type identifiers defined by the regular LTO module and store the resolutions in a top-level "type identifier summary" (which will be implemented separately). Differential Revision: https://reviews.llvm.org/D27967 llvm-svn: 290280	2016-12-21 23:03:45 +00:00
Mike Aizatsky	d114d43c64	[sancov] skip duplicated points llvm-svn: 290278	2016-12-21 22:10:01 +00:00
Mike Aizatsky	04679cbc10	[sancov] hash prefix results in huge merge files, use shorter prefix llvm-svn: 290277	2016-12-21 22:09:57 +00:00
Haicheng Wu	4e146dfe3d	[AArch64] Remove a redundant check. NFC. The case AM.Scale == 0 is already handled by the code right above. Differential Revision: https://reviews.llvm.org/D28003 llvm-svn: 290275	2016-12-21 21:40:47 +00:00
Greg Clayton	aadbc74cdf	Add the ability for DWARFDie objects to get the parent DWARFDie. In order for the llvm DWARF parser to be used in LLDB we will need to be able to get the parent of a DIE. This patch adds that functionality by changing the DWARFDebugInfoEntry class to store a depth field instead of a sibling index. Using a depth field allows us to easily calculate the sibling and the parent without increasing the size of DWARFDebugInfoEntry. I tested llvm-dsymutil on a debug version of clang where this fully parses DWARF in over 1200 .o files to verify there was no serious regression in performance. Added a full suite of unit tests to test this functionality. Differential Revision: https://reviews.llvm.org/D27995 llvm-svn: 290274	2016-12-21 21:37:06 +00:00
Justin Bogner	a84529a444	cmake: Don't build llvm-config and tblgen concurrently in cross builds This sets USES_TERMINAL for the native llvm-config build, so that it doesn't run at the same time as builds of other native tools (namely, tablegen). Without this, if you're very unlucky with the timing it's possible to be relinking libSupport as one of the tools is linking, causing a spurious failure. The tablegen build adopted USES_TERMINAL for this same reason in r280748. llvm-svn: 290271	2016-12-21 21:19:00 +00:00
Ed Maste	f53471fe7e	Update mailing list post URL and add libunwind reference RTDyldMemoryManager.cpp describes the differing __register_frame API between libunwind and libgcc, with a mailing list posting URL. The original link was 404; replace it with what I believe is the intended post, as well as a reference to the "OS X" implementation in libunwind. Differential Revision: https://reviews.llvm.org/D27965 llvm-svn: 290269	2016-12-21 20:51:42 +00:00
Simon Pilgrim	2882ed164e	[X86][SSE] Improve lowering of vXi64 multiplies As mentioned on PR30845, we were performing our vXi64 multiplication as: AloBlo = pmuludq(a, b); AloBhi = pmuludq(a, psrlqi(b, 32)); AhiBlo = pmuludq(psrlqi(a, 32), b); return AloBlo + psllqi(AloBhi, 32)+ psllqi(AhiBlo, 32); when we could avoid one of the upper shifts with: AloBlo = pmuludq(a, b); AloBhi = pmuludq(a, psrlqi(b, 32)); AhiBlo = pmuludq(psrlqi(a, 32), b); return AloBlo + psllqi(AloBhi + AhiBlo, 32); This matches the lowering on gcc/icc. Differential Revision: https://reviews.llvm.org/D27756 llvm-svn: 290267	2016-12-21 20:00:10 +00:00
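
The vXi64 multiply rewrite described in commit 2882ed164e can be checked with a scalar sketch. This is not the code from the patch: `pmuludq` below is a hypothetical one-lane stand-in for the SSE instruction (it multiplies the low 32 bits of each operand into a 64-bit product), and the two function names are made up for illustration. It only shows that folding the two upper shifts into one gives the same 64-bit result.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical scalar model of one pmuludq lane: multiply the low
// 32 bits of each operand and return the full 64-bit product.
static uint64_t pmuludq(uint64_t a, uint64_t b) {
    return (a & 0xFFFFFFFFull) * (b & 0xFFFFFFFFull);
}

// Earlier form from the commit message: each cross product is shifted
// up by 32 separately before the final adds.
static uint64_t mul64_two_shifts(uint64_t a, uint64_t b) {
    uint64_t AloBlo = pmuludq(a, b);
    uint64_t AloBhi = pmuludq(a, b >> 32);
    uint64_t AhiBlo = pmuludq(a >> 32, b);
    return AloBlo + (AloBhi << 32) + (AhiBlo << 32);
}

// Improved form from the commit message: add the cross products first,
// then shift once. Unsigned arithmetic wraps modulo 2^64, so the
// shift distributes over the addition and the result is unchanged.
static uint64_t mul64_one_shift(uint64_t a, uint64_t b) {
    uint64_t AloBlo = pmuludq(a, b);
    uint64_t AloBhi = pmuludq(a, b >> 32);
    uint64_t AhiBlo = pmuludq(a >> 32, b);
    return AloBlo + ((AloBhi + AhiBlo) << 32);
}
```

Both functions should agree with a plain wrapping `a * b` on `uint64_t`, since the `Ahi * Bhi` term only contributes to bits 64 and above.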


142335 Commits