llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Fedor Sergeev	dd4337b392	[NewPM] teach -passes= to emit meaningful error messages Summary: All the PassBuilder::parse interfaces now return descriptive StringError instead of a plain bool. It allows to make -passes/aa-pipeline parsing errors context-specific and thus less confusing. TODO: ideally we should also make suggestions for misspelled pass names, but that requires some extensions to PassBuilder. Reviewed By: philip.pfaffe, chandlerc Differential Revision: https://reviews.llvm.org/D53246 llvm-svn: 344519	2018-10-15 15:00:18 +00:00
Aleksandar Beserminji	0dc405c302	[mips][micromips] Fix overlaping FDEs error When compiling static executable for micromips, CFI symbols are incorrectly labeled as MICROMIPS, which cause ".eh_frame_hdr refers to overlapping FDEs." error. This patch does not label CFI symbols as MICROMIPS, and FDEs do not overlap anymore. This patch also exposes another bug, which is fixed here: https://reviews.llvm.org/D52985 Differential Revision: https://reviews.llvm.org/D52987 llvm-svn: 344516	2018-10-15 14:39:12 +00:00
Aleksandar Beserminji	cb01fa1a2b	[mips][micromips] Revert "Fix overlaping FDEs error" This reverts r344511. llvm-svn: 344515	2018-10-15 14:36:48 +00:00
Simon Pilgrim	e44f188a68	[ARM][NEON] Improve vector popcnt lowering with PADDL (PR39281) As I suggested on PR39281, this patch uses PADDL pairwise addition to widen from the vXi8 CTPOP result to the target vector type. This is a blocker for moving more x86 code to generic vector CTPOP expansion (P32655 + D53258) - ARM's vXi64 CTPOP currently expands, which would generate a vXi64 MUL but ARM's custom lowering expands the general MUL case and vectors aren't well handled in LegalizeDAG - improving the CTPOP lowering was a lot easier than fixing the MUL lowering for this one case...... Differential Revision: https://reviews.llvm.org/D53257 llvm-svn: 344512	2018-10-15 13:20:41 +00:00
Aleksandar Beserminji	a021fdd743	[mips][micromips] Fix overlaping FDEs error When compiling static executable for micromips, CFI symbols are incorrectly labeled as MICROMIPS, which cause ".eh_frame_hdr refers to overlapping FDEs." error. This patch does not label CFI symbols as MICROMIPS, and FDEs do not overlap anymore. This patch also exposes another bug, which is fixed here: https://reviews.llvm.org/D52985 Differential Revision: https://reviews.llvm.org/D52987 llvm-svn: 344511	2018-10-15 12:59:17 +00:00
Nicolai Haehnle	05f1bdf7e6	AMDGPU: Test showing a scalar buffer load deficiency Change-Id: I5b64a565f22a8482aa0712488d85e45163ac3d12 llvm-svn: 344506	2018-10-15 11:37:04 +00:00
Fedor Sergeev	a32848448b	[NewPM] implement SCC printing for -print-before-all/-print-after-all Removing deficiency of initial implementation of -print-before-all/-after-all - it was effectively skipping IR printing for all the SCC passes. Now LazyCallGraph:SCC gets its IR printed. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D53270 llvm-svn: 344505	2018-10-15 10:46:35 +00:00
Chandler Carruth	cdfd07538f	[TI removal] Make `getTerminator()` return a generic `Instruction`. This removes the primary remaining API producing `TerminatorInst` which will reduce the rate at which code is introduced trying to use it and generally make it much easier to remove the remaining APIs across the codebase. Also clean up some of the stragglers that the previous mechanical update of variables missed. Users of LLVM and out-of-tree code generally will need to update any explicit variable types to handle this. Replacing `TerminatorInst` with `Instruction` (or `auto`) almost always works. Most of these edits were made in prior commits using the perl one-liner: ``` perl -i -ple 's/TerminatorInst(\b.* = .*getTerminator\(\))/Instruction\1/g' ``` This also my break some rare use cases where people overload for both `Instruction` and `TerminatorInst`, but these should be easily fixed by removing the `TerminatorInst` overload. llvm-svn: 344504	2018-10-15 10:42:50 +00:00
Chandler Carruth	7e3e101b48	[TI removal] Rework `InstVisitor` to support visiting instructions that are terminators without relying on the specific `TerminatorInst` type. This required cleaning up two users of `InstVisitor`s usage of `TerminatorInst` as well. llvm-svn: 344503	2018-10-15 10:10:54 +00:00
Chandler Carruth	1c28e7f745	[TI removal] Make variables declared as `TerminatorInst` and initialized by `getTerminator()` calls instead be declared as `Instruction`. This is the biggest remaining chunk of the usage of `getTerminator()` that insists on the narrow type and so is an easy batch of updates. Several files saw more extensive updates where this would cascade to requiring API updates within the file to use `Instruction` instead of `TerminatorInst`. All of these were trivial in nature (pervasively using `Instruction` instead just worked). llvm-svn: 344502	2018-10-15 10:04:59 +00:00
Chandler Carruth	1b568ccf5e	[TI removal] Remove `TerminatorInst` from GVN.h and GVN.cpp. This is the last interesting usage in all of LLVM's headers. The remaining usages in headers are the core typesystem bits (Core.h, instruction types, and InstVisitor) and as the return of `BasicBlock::getTerminator`. The latter is the big remaining API point that I'll remove after mass updates to user code. llvm-svn: 344501	2018-10-15 10:00:15 +00:00
Chandler Carruth	f87a26b538	[TI removal] Remove `TerminatorInst` from SparsePropagation.h and related code. This is simple as we just need to replace the type and move to the concept of visiting a "terminator" rather than a specific instruction subclass. llvm-svn: 344500	2018-10-15 09:47:26 +00:00
Chandler Carruth	fa2e7feeb3	[TI removal] Remove a dead forward declaration of TerminatorInst. NFC. llvm-svn: 344499	2018-10-15 09:34:31 +00:00
Chandler Carruth	84522b83d5	[TI removal] Remove `TerminatorInst` from BasicBlockUtils.h This requires updating a number of .cpp files to adapt to the new API. I've just systematically updated all uses of `TerminatorInst` within these files te `Instruction` so thta I won't have to touch them again in the future. llvm-svn: 344498	2018-10-15 09:34:05 +00:00
Chandler Carruth	30c1b2684f	[TI removal] Just use Instruction in the CFG printer code. NFC. llvm-svn: 344497	2018-10-15 09:33:40 +00:00
Guillaume Chatelet	0830493d09	[llvm-exegesis] Fix missing std::move. llvm-svn: 344496	2018-10-15 09:21:21 +00:00
Chandler Carruth	c5b5ebfade	[TI removal] Remove a unnecessary use of `TerminatorInst` from an IR header. NFC. Part of the removal of `TerminatorInst` from the type hierarchy. llvm-svn: 344495	2018-10-15 09:17:38 +00:00
Chandler Carruth	a93fe0b6b6	[TI removal] Remove TerminatorInst as an input parameter from all public LLVM APIs. There weren't very many. We still have the instruction visitor, and APIs with TerminatorInst as a return type or an output parameter. llvm-svn: 344494	2018-10-15 09:17:09 +00:00
Guillaume Chatelet	f8f0471ba9	[llvm-exegesis][NFC] Return many CodeTemplates instead of one. Summary: This is part one of the change where I simply changed the signature of the functions. More work need to be done to actually produce more than one CodeTemplate per instruction. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D53209 llvm-svn: 344493	2018-10-15 09:09:19 +00:00
Bjorn Pettersson	bb16824edf	[TwoAddressInstructionPass] Replace subregister uses when processing tied operands Summary: TwoAddressInstruction pass typically rewrites %1:short = foo %0.sub_lo:long as %1:short = COPY %0.sub_lo:long %1:short = foo %1:short when having tied operands. If there are extra un-tied operands that uses the same reg and subreg, such as the second and third inputs to fie here: %1:short = fie %0.sub_lo:long, %0.sub_hi:long, %0.sub_lo:long then there was a bug which replaced the register %0 also for the un-tied operand, but without changing the subregister indices. So we used to get: %1:short = COPY %0.sub_lo:long %1:short = fie %1, %1.sub_hi:short, %1.sub_lo:short With this fix we instead get: %1:short = COPY %0.sub_lo:long %1:short = fie %1, %0.sub_hi:long, %1 Reviewers: arsenm, JesperAntonsson, kparzysz, MatzeB Reviewed By: MatzeB Subscribers: bjope, kparzysz, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D36224 llvm-svn: 344492	2018-10-15 08:36:03 +00:00
Craig Topper	08fe06c745	[X86] Autogenerate checks. NFC llvm-svn: 344490	2018-10-15 05:31:24 +00:00
Lang Hames	c76b89e505	[ORC] Simplify naming for JITDylib definition generators. Renames: JITDylib's setFallbackDefinitionGenerator method to setGenerator. DynamicLibraryFallbackGenerator class to DynamicLibrarySearchGenerator. ReexportsFallbackDefinitionGenerator to ReexportsGenerator. llvm-svn: 344489	2018-10-15 05:07:54 +00:00
Craig Topper	185e5e3022	[X86] Move promotion of vector and/or/xor from legalization to DAG combine Summary: I've noticed that the bitcasts we introduce for these make computeKnownBits and computeNumSignBits not work well in LegalizeVectorOps. LegalizeVectorOps legalizes bottom up while LegalizeDAG legalizes top down. The bottom up strategy for LegalizeVectorOps means operands are legalized before their uses. So we promote and/or/xor before we legalize the operands that use them making computeKnownBits/computeNumSignBits in places like LowerTruncate suboptimal. I looked at changing LegalizeVectorOps to be top down as well, but that was more disruptive and caused some regressions. I also looked at just moving promotion of binops to LegalizeDAG, but that had a few issues one around matching AND,ANDN,OR into VSELECT because I had to create ANDN as vXi64, but the other nodes hadn't legalized yet, I didn't look too hard at fixing that. This patch seems to produce better results overall than my other attempts. We now form broadcasts of constants better in some cases. For at least some of them the AND was being introduced in LegalizeDAG, promoted to vXi64, and the BUILD_VECTOR was also legalized there. I think we got bad ordering of that. Now the promotion is out of the legalizer so we handle this better. In the longer term I think we really should evaluate whether we should be doing this promotion at all. It's really there to reduce isel pattern count, but I'm wondering if we'd be better served just eating the pattern cost or doing C++ based isel for vector and/or/xor in X86ISelDAGToDAG. The masked and/or/xor will definitely be difficult in patterns if a bitcast gets between the vselect and the and/or/xor node. That becomes a lot of permutations to cover. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53107 llvm-svn: 344487	2018-10-15 01:51:58 +00:00
Craig Topper	5159669048	[X86] Add 128 MOVDDUP to the constant pool printing in X86AsmPrinter::EmitInstruction. We use this instruction to broadcast a single 64-bit value to a v2i64/v2f64 vector. llvm-svn: 344486	2018-10-15 01:51:53 +00:00
Craig Topper	01876b1de7	[X86] Autogenerate complete checks. NFC llvm-svn: 344485	2018-10-15 01:51:50 +00:00
Simon Pilgrim	5e6be0f2c6	[InstCombine] Add PR27343 test cases llvm-svn: 344484	2018-10-14 20:14:33 +00:00
Ayal Zaks	cc44dab25d	[LV] Fix comments reported when not vectorizing single iteration loops; NFC Landing this as a separate part of https://reviews.llvm.org/D50480, being a seemingly unrelated change ([LV] Vectorizing loops of arbitrary trip count without remainder under opt for size). llvm-svn: 344483	2018-10-14 17:53:02 +00:00
Simon Pilgrim	64b9db7fa8	[X86][AVX] Enable lowerVectorShuffleAsLanePermuteAndPermute v16i16/v32i8 shuffle lowering Extends D53148 from v4f64 now that we have test coverage for v16i16/v32i8 shuffles. llvm-svn: 344481	2018-10-14 17:34:20 +00:00
Simon Pilgrim	2daeea058e	[ARM] Regenerate cttz tests Improve codegen view as part of PR32655 llvm-svn: 344479	2018-10-14 16:49:04 +00:00
Lang Hames	07ad335677	[ORC] Remove XXLayer::add methods that default to using the main JITDylib. They're not currently used and may complicate upcoming changes to add's signature and behavior. llvm-svn: 344478	2018-10-14 16:09:59 +00:00
Simon Pilgrim	9a7dcb6a22	[LegalizeDAG] Don't bother with final MUL+SRL stage for byte CTPOP. The final stage of CTPOP expansion (v = (v * 0x01010101...) >> (Len - 8)) is completely pointless for the byte (Len = 8) case as it reduces to (v = (v * 0x01...) >> 0), but annoyingly this doesn't always get optimized away. Found while investigating generic vector CTPOP expansion (PR32655). llvm-svn: 344477	2018-10-14 15:56:28 +00:00
Sanjay Patel	415665f309	[InstCombine] combine a shuffle and an extract subvector shuffle This is part of the missing IR-level folding noted in D52912. This should be ok as a canonicalization because the new shuffle mask can't be any more complicated than the existing shuffle mask. If there's some target where the shorter vector shuffle is not legal, it should just end up expanding to something like the pair of shuffles that we're starting with here. Differential Revision: https://reviews.llvm.org/D53037 llvm-svn: 344476	2018-10-14 15:25:06 +00:00
Dorit Nuzman	a3df726c55	recommit 344472 after fixing build failure on ARM and PPC. llvm-svn: 344475	2018-10-14 08:50:06 +00:00
Dorit Nuzman	70052d5053	revert 344472 due to failures. llvm-svn: 344473	2018-10-14 07:21:20 +00:00
Dorit Nuzman	c4c9199631	[IAI,LV] Add support for vectorizing predicated strided accesses using masked interleave-group The vectorizer currently does not attempt to create interleave-groups that contain predicated loads/stores; predicated strided accesses can currently be vectorized only using masked gather/scatter or scalarization. This patch makes predicated loads/stores candidates for forming interleave-groups during the Loop-Vectorizer's analysis, and adds the proper support for masked-interleave- groups to the Loop-Vectorizer's planning and transformation stages. The patch also extends the TTI API to allow querying the cost of masked interleave groups (which each target can control); Targets that support masked vector loads/ stores may choose to enable this feature and allow vectorizing predicated strided loads/stores using masked wide loads/stores and shuffles. Reviewers: Ayal, hsaito, dcaballe, fhahn, javed.absar Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D53011 llvm-svn: 344472	2018-10-14 07:06:16 +00:00
Craig Topper	102d6c0f56	[X86] Fix bad indentation. NFC llvm-svn: 344471	2018-10-14 04:01:40 +00:00
Craig Topper	27d328e6bc	[X86] Type legalize v2f32 stores by widening to v4f32, casting to v2f64, extracting f64 and storing. Summary: This is similar to what D52528 did for loads. It should match what generic type legalization does in 64-bit mode where it uses a v2i64 cast and an i64 store. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53173 llvm-svn: 344470	2018-10-14 03:36:27 +00:00
Benjamin Kramer	1fe8373aeb	Move some helpers from the global namespace into anonymous ones. llvm-svn: 344468	2018-10-13 22:18:22 +00:00
Lang Hames	f240d688c5	[ORC] During lookup, do not match against hidden symbols in other JITDylibs. This adds two arguments to the main ExecutionSession::lookup method: MatchNonExportedInJD, and MatchNonExported. These control whether and where hidden symbols should be matched when searching a list of JITDylibs. A similar effect could have been achieved by filtering search results, but this would have involved materializing symbol definitions (since materialization is triggered on lookup) only to throw the results away, among other issues. llvm-svn: 344467	2018-10-13 21:53:40 +00:00
Simon Pilgrim	dbe93508e0	[AARCH64] Regenerate popcnt tests Improve codegen view as part of PR32655 llvm-svn: 344466	2018-10-13 21:50:15 +00:00
Simon Pilgrim	9a5cbe8f3f	[ARM] Regenerate popcnt tests Improve codegen view as part of PR32655 llvm-svn: 344465	2018-10-13 21:32:49 +00:00
Simon Pilgrim	2e940224fd	Pull out repeated variables from SelectionDAGLegalize::ExpandBitCount. The CTPOP case has been changed from VT.getSizeInBits to VT.getScalarSizeInBits - but this fits in with future work for vector support (PR32655) and doesn't affect any current (scalar) uses. llvm-svn: 344461	2018-10-13 18:40:48 +00:00
Craig Topper	29759c6451	[LegalizeTypes] Prevent an assertion from PromoteIntRes_BSWAP and PromoteIntRes_BITREVERSE if the shift amount is too large for the VT returned by getShiftAmountTy Summary: getShiftAmountTy for X86 returns MVT::i8. If a BSWAP or BITREVERSE is created that requires promotion and the difference between the original VT and the promoted VT is more than 255 then we won't able to create the constant. This patch adds a check to replace the result from getShiftAmountTy to MVT::i32 if the difference won't fit. This should get legalized later when the shift is ultimately expanded since its clearly an illegal type that we're only promoting to make it a power of 2 bit width. Alternatively we could base the decision completely on the largest shift amount the promoted VT could use. Vectors should be immune here because getShiftAmountTy always returns the incoming VT for vectors. Only the scalar shift amount can be changed by the targets. Reviewers: eli.friedman, RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53232 llvm-svn: 344460	2018-10-13 17:47:20 +00:00
Thomas Lively	61113e7322	[WebAssembly][NFC] Fix signed/unsigned comparison warning llvm-svn: 344459	2018-10-13 16:58:03 +00:00
Sanjay Patel	f9612598c1	[InstCombine] fix complexity canonicalization with fake unary vector ops This is a preliminary step to avoid regressions when we add an actual 'fneg' instruction to IR. See D52934 and D53205. llvm-svn: 344458	2018-10-13 16:15:37 +00:00
Simon Pilgrim	a6a837775e	[X86][SSE] Remove most of vector CTTZ custom lowering and use LegalizeDAG instead. There is one remnant - AVX1 custom splitting of 256-bit vectors - which is due to a regression where the X86ISD::ANDNP is still performed as a YMM. I've also tightened the CTLZ or CTPOP lowering in SelectionDAGLegalize::ExpandBitCount to require a legal CTLZ - it doesn't affect existing users and fixes an issue with AVX512 codegen. llvm-svn: 344457	2018-10-13 16:11:15 +00:00
Sanjay Patel	f173d637d6	[InstCombine] add tests for operand complexity canonicalization; NFC The tests with undef vector elements demonstrate a hole in the current pattern matching. llvm-svn: 344456	2018-10-13 16:02:47 +00:00
David Bolvansky	2d0da5ff78	[NFC] Fixed duplicated test file llvm-svn: 344455	2018-10-13 15:26:13 +00:00
David Bolvansky	c5380371c0	[InstCombine] Fixed crash with aliased functions Summary: Fixes PR39177 Reviewers: spatel, jbuening Reviewed By: jbuening Subscribers: jbuening, llvm-commits Differential Revision: https://reviews.llvm.org/D53129 llvm-svn: 344454	2018-10-13 15:21:55 +00:00
Simon Pilgrim	88d4c5f25b	[X86][SSE] Begin removing vector CTTZ custom lowering and use LegalizeDAG instead. Adds CTTZ vector legalization support and begins the removal of the X86/SSE custom lowering. llvm-svn: 344453	2018-10-13 15:16:55 +00:00

1 2 3 4 5 ...

170378 Commits