llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Joel E. Denny	e3515ea4e8	[FileCheck] Don't permit overlapping CHECK-DAG That is, make CHECK-DAG skip matches that overlap the matches of any preceding consecutive CHECK-DAG directives. This change makes CHECK-DAG more consistent with other directives, and there is evidence it makes CHECK-DAG more intuitive and less error-prone. See the RFC discussion starting at: http://lists.llvm.org/pipermail/llvm-dev/2018-May/123010.html Moreover, this behavior enables CHECK-DAG groups for unordered, non-unique strings or patterns. For example, it is useful for verifying output or logs from a parallel program, such as the OpenMP runtime. This patch also implements the command-line option -allow-deprecated-dag-overlap, which reverts CHECK-DAG to the old overlapping behavior. This option should not be used in new tests. It is meant only for the existing tests that are broken by this change and that need time to update. See the following bugzilla issue for tracking of such tests: https://bugs.llvm.org/show_bug.cgi?id=37532 Patches to add -allow-deprecated-dag-overlap to those tests will follow immediately. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D47106 llvm-svn: 336830	2018-07-11 18:42:58 +00:00
Paul Semel	55e04ae62d	Revert "[llvm-objdump] Add -demangle (-C) option" This reverts commit 3a44ccd156e0edd2e89226f8ed63928e227900bb. This reverts commit d5cfc836bb5552e20507d3612d13ff66ff9e36a0. llvm-svn: 336829	2018-07-11 18:09:52 +00:00
Craig Topper	72f55d6b1c	[X86] Remove patterns for inserting a load into a zero vector. We can instead block the load folding isProfitableToFold. Then isel will emit a register->register move for the zeroing part and a separate load. The PostProcessISelDAG should be able to remove the register->register move. This saves us patterns and fixes the fact that we only had unaligned load patterns. The test changes show places where we should have been using an aligned load. llvm-svn: 336828	2018-07-11 18:09:04 +00:00
Simon Pilgrim	69f8bfb6c2	[TargetTransformInfo] Add pow2 analysis for scalar constants Add ConstantInt analysis to getOperandInfo so we get more realistic div/rem expansion costs comparable to the vector costs. llvm-svn: 336827	2018-07-11 17:51:27 +00:00
Sanjay Patel	fe6a0d2f83	[InstSimplify] add/move tests for add folds; NFC isKnownNegation() is currently proposed as part of D48754, but it could be used to make InstSimplify stronger independently of any abs() improvements. llvm-svn: 336822	2018-07-11 16:52:18 +00:00
Paul Semel	c6d2c0ed8a	Fix llvm-objdump demangle test (added triple option) llvm-svn: 336821	2018-07-11 16:31:33 +00:00
Andrea Di Biagio	6b6463f348	[X86] Fix MayLoad/HasSideEffect flag for (V)MOVLPSrm instructions. Before revision 336728, the "mayLoad" flag for instruction (V)MOVLPSrm was inferred directly from the "default" pattern associated with the instruction definition. r336728 removed special node X86Movlps, and all the patterns associated to it. Now instruction (V)MOVLPSrm doesn't have a pattern associated to it, and the 'mayLoad/hasSideEffects' flags are left unset. When the instruction info is emitted by tablegen, method CodeGenDAGPatterns::InferInstructionFlags() sees that (V)MOVLPSrm doesn't have a pattern, and flags are undefined. So, it conservatively sets the "hasSideEffects" flag for it. As a consequence, we were losing the 'mayLoad' flag, and we were gaining a 'hasSideEffect' flag in its place. This patch fixes the issue (originally reported by Michael Holmen). The mca tests show the differences in the instruction info flags. Instructions that were affected by this problem were: MOVLPSrm/VMOVLPSrm/VMOVLPSZ128rm. Differential Revision: https://reviews.llvm.org/D49182 llvm-svn: 336818	2018-07-11 15:27:50 +00:00
Paul Semel	1877687068	[llvm-objdump] Add -demangle (-C) option Differential Revision: https://reviews.llvm.org/D49043 llvm-svn: 336816	2018-07-11 15:25:39 +00:00
Simon Pilgrim	c007d602eb	[SLPVectorizer] Add initial alternate opcode support for cast instructions. (REAPPLIED) We currently only support binary instructions in the alternate opcode shuffles. This patch is an initial attempt at adding cast instructions as well, this raises several issues that we probably want to address as we continue to generalize the alternate mechanism: 1 - Duplication of cost determination - we should probably add scalar/vector costs helper functions and get BoUpSLP::getEntryCost to use them instead of determining costs directly. 2 - Support alternate instructions with the same opcode (e.g. casts with different src types) - alternate vectorization of calls with different IntrinsicIDs will require this. 3 - Allow alternates to be a different instruction type - mixing binary/cast/call etc. 4 - Allow passthrough of unsupported alternate instructions - related to PR30787/D28907 'copyable' elements. Reapplied with fix to only accept 2 different casts if they come from the same source type. Differential Revision: https://reviews.llvm.org/D49135 llvm-svn: 336812	2018-07-11 15:05:10 +00:00
Simon Pilgrim	e1a0471d84	[SLPVectorizer] Ensure alternate/passthrough doesn't vectorize sdiv with undef elts llvm-svn: 336809	2018-07-11 14:34:43 +00:00
Simon Pilgrim	ae0a7d6278	[SLPVectorizer] Add some additional alternate cast tests Initial attempt at D49135 failed as we weren't correctly handling casts with different source types. llvm-svn: 336808	2018-07-11 14:29:13 +00:00
Simon Pilgrim	4fd0b950a3	Revert rL336804: [SLPVectorizer] Add initial alternate opcode support for cast instructions. Reverting due to buildbot failures llvm-svn: 336806	2018-07-11 14:08:16 +00:00
Simon Pilgrim	17f835882b	[SLPVectorizer] Add initial alternate opcode support for cast instructions. We currently only support binary instructions in the alternate opcode shuffles. This patch is an initial attempt at adding cast instructions as well, this raises several issues that we probably want to address as we continue to generalize the alternate mechanism: 1 - Duplication of cost determination - we should probably add scalar/vector costs helper functions and get BoUpSLP::getEntryCost to use them instead of determining costs directly. 2 - Support alternate instructions with the same opcode (e.g. casts with different src types) - alternate vectorization of calls with different IntrinsicIDs will require this. 3 - Allow alternates to be a different instruction type - mixing binary/cast/call etc. 4 - Allow passthrough of unsupported alternate instructions - related to PR30787/D28907 'copyable' elements. Differential Revision: https://reviews.llvm.org/D49135 llvm-svn: 336804	2018-07-11 13:34:09 +00:00
Krzysztof Parzyszek	594edf22ad	[CodeGen] Ignore debug uses in MachineCopyPropagation Debug uses should not count as real uses, since the presence of debug information could affect the generated code. llvm-svn: 336803	2018-07-11 13:30:27 +00:00
Andrea Di Biagio	e2a4194fea	[llvm-mca] Use a different character to flag instructions with side-effects in the Instruction Info View. NFC This makes it easier to identify changes in the instruction info flags. It also helps spot potential regressions similar to the one recently introduced at r336728. Using the same character to mark MayLoad/MayStore/HasSideEffects is problematic for llvm-lit. When pattern matching substrings, llvm-lit consumes tabs and spaces. A change in position of the flag marker may not trigger a test failure. This patch only changes the character used for flag `hasSideEffects`. I didn't touch the other flags because I want to avoid spamming the mailing list with the massive diff due to the numerous tests affected by this change. In future, each instruction flag should be associated with a different character in the Instruction Info View. llvm-svn: 336797	2018-07-11 12:44:44 +00:00
Roman Lebedev	25a47d71d3	[NFC][InstCombine] Tests for x & (-1 >> y) == x -> x u<= (-1 >> y) fold https://bugs.llvm.org/show_bug.cgi?id=38123 This pattern will be produced by Implicit Integer Truncation sanitizer, https://reviews.llvm.org/D48958 https://bugs.llvm.org/show_bug.cgi?id=21530 in unsigned case, therefore it is probably a good idea to improve it. https://rise4fun.com/Alive/Rny llvm-svn: 336796	2018-07-11 12:37:12 +00:00
Sjoerd Meijer	2dddfdaff5	[ARM] ParallelDSP: multiple reduction stmts in loop This fixes an issue that we were not properly supporting multiple reduction stmts in a loop, and not generating SMLADs for these cases. The alias analysis checks were done too early, making it too conservative. Differential revision: https://reviews.llvm.org/D49125 llvm-svn: 336795	2018-07-11 12:36:25 +00:00
Jonas Devlieghere	cc7ac3cc07	Use debug-prefix-map for AT_NAME AT_NAME was being emitted before the directory paths were remapped. This ensures that all paths are remapped before anything is emitted. An additional test case has been added. Note that this only works if the replacement string is an absolute path. If not, then AT_decl_file believes the new path is a relative path, and joins that path with the compilation directory. I do not know of a good way to resolve this. Patch by: Siddhartha Bagaria (starsid) Differential revision: https://reviews.llvm.org/D49169 llvm-svn: 336793	2018-07-11 12:30:35 +00:00
Sander de Smalen	9a4aa02652	[AArch64][SVE] Asm: Support for COMPACT instruction. The compact instruction shuffles active elements of vector into lowest numbered elements and sets remaining elements to zero. e.g. compact z0.s, p0, z1.s llvm-svn: 336789	2018-07-11 11:22:26 +00:00
Simon Pilgrim	7c1ce0eb85	Fix check-prefix vs check-prefixes typo in updated test llvm-svn: 336787	2018-07-11 10:42:51 +00:00
Simon Pilgrim	4349dfb2f3	[AArch64] Regenerate SDIV tests Will make codegen diffs much easier to grok in a future patch llvm-svn: 336786	2018-07-11 10:39:50 +00:00
Roman Lebedev	119d3932c1	[NFC][InstCombine] icmp-logical.ll: add a few more tests. The @masked_and_notA_slightly_optimized and @masked_or_A will break when PR38123 is fixed: https://rise4fun.com/Alive/Rny Clearly, they aren't optimized currently. https://rise4fun.com/Alive/ERo llvm-svn: 336784	2018-07-11 10:31:12 +00:00
Sander de Smalen	e8509a8efa	[AArch64][SVE] Asm: Support for LAST(A|B) and CLAST(A|B) instructions. The LASTB and LASTA instructions extract the last active element, or element after the last active, from the source vector. The added variants are: Scalar: last(a|b) w0, p0, z0.b last(a|b) w0, p0, z0.h last(a|b) w0, p0, z0.s last(a|b) x0, p0, z0.d SIMD & FP Scalar: last(a|b) b0, p0, z0.b last(a|b) h0, p0, z0.h last(a|b) s0, p0, z0.s last(a|b) d0, p0, z0.d The CLASTB and CLASTA conditionally extract the last or element after the last active element from the source vector. The added variants are: Scalar: clast(a|b) w0, p0, w0, z0.b clast(a|b) w0, p0, w0, z0.h clast(a|b) w0, p0, w0, z0.s clast(a|b) x0, p0, x0, z0.d SIMD & FP Scalar: clast(a|b) b0, p0, b0, z0.b clast(a|b) h0, p0, h0, z0.h clast(a|b) s0, p0, s0, z0.s clast(a|b) d0, p0, d0, z0.d Vector: clast(a|b) z0.b, p0, z0.b, z1.b clast(a|b) z0.h, p0, z0.h, z1.h clast(a|b) z0.s, p0, z0.s, z1.s clast(a|b) z0.d, p0, z0.d, z1.d Please refer to the architecture specification for more details on the semantics of the added instructions. llvm-svn: 336783	2018-07-11 10:08:00 +00:00
Paul Semel	18368fef36	[llvm-readobj] Add -hex-dump (-x) option Differential Revision: https://reviews.llvm.org/D48281 llvm-svn: 336782	2018-07-11 10:00:29 +00:00
Roman Lebedev	d9b763cee8	[NFC][InstCombine] Fix extra space padding in icmp-mul-zext.ll test update_test_checks will drop it anyway, creating noise.. llvm-svn: 336781	2018-07-11 09:57:53 +00:00
Roman Lebedev	58006620b3	[NFC][InstCombine] Add variable names and regenerate icmp-logical.ll test. llvm-svn: 336780	2018-07-11 09:57:46 +00:00
Andrea Di Biagio	d8954c65cb	[llvm-mca] Add tests for partial register writes. llvm-mca doesn't know that on modern AMD processors, portions of a general purpose register are not treated independently. So, a partial register write has a false dependency on the super-register. The issue with partial register writes will be addressed by a follow-up patch. llvm-svn: 336778	2018-07-11 09:50:00 +00:00
Simon Pilgrim	196e426837	[DAGCombiner] Support non-uniform X%C -> X-(X/C)*C folds First stage in PR38057 - support non-uniform constant vectors in the combine to reuse the division-by-constant logic. We can definitely do better for srem pow2 remainders (and avoid that extra multiply....) but this at least helps keep everything on the vector unit. Differential Revision: https://reviews.llvm.org/D48975 llvm-svn: 336774	2018-07-11 09:22:42 +00:00
Simon Pilgrim	14211c3808	[DAGCombiner] Add (urem X, -1) -> select(X == -1, 0, x) fold llvm-svn: 336773	2018-07-11 09:14:37 +00:00
Simon Tatham	7aeb5f145e	[TableGen] Add a general-purpose JSON backend. The aim of this backend is to output everything TableGen knows about the record set, similarly to the default -print-records backend. But where -print-records produces output in TableGen's input syntax (convenient for humans to read), this backend produces it as structured JSON data, which is convenient for loading into standard scripting languages such as Python, in order to extract information from the data set in an automated way. The output data contains a JSON representation of the variable definitions in output 'def' records, and a few pieces of metadata such as which of those definitions are tagged with the 'field' prefix and which defs are derived from which classes. It doesn't dump out absolutely every piece of knowledge it _could_ produce, such as type information and complicated arithmetic operator nodes in abstract superclasses; the main aim is to allow consumers of this JSON dump to essentially act as new backends, and backends don't generally need to depend on that kind of data. The new backend is implemented as an EmitJSON() function similar to all of llvm-tblgen's other EmitFoo functions, except that it lives in lib/TableGen instead of utils/TableGen on the basis that I'm expecting to add it to clang-tblgen too in a future patch. To test it, I've written a Python script that loads the JSON output and tests properties of it based on comments in the .td source - more or less like FileCheck, except that the CHECK: lines have Python expressions after them instead of textual pattern matches. Reviewers: nhaehnle Reviewed By: nhaehnle Subscribers: arichardson, labath, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D46054 llvm-svn: 336771	2018-07-11 08:40:19 +00:00
Craig Topper	0b5293684d	[X86] The TEST instruction is eliminated when BSF/TZCNT is used Summary: These changes cover the PR#31399. Now the ffs(x) function is lowered to (x != 0) ? llvm.cttz(x) + 1 : 0 and it corresponds to the following llvm code: %cnt = tail call i32 @llvm.cttz.i32(i32 %v, i1 true) %tobool = icmp eq i32 %v, 0 %.op = add nuw nsw i32 %cnt, 1 %add = select i1 %tobool, i32 0, i32 %.op and x86 asm code: bsfl %edi, %ecx addl $1, %ecx testl %edi, %edi movl $0, %eax cmovnel %ecx, %eax In this case the 'test' instruction can't be eliminated because the 'add' instruction modifies the EFLAGS, namely, ZF flag that is set by the 'bsf' instruction when 'x' is zero. We now produce the following code: bsfl %edi, %ecx movl $-1, %eax cmovnel %ecx, %eax addl $1, %eax Patch by Ivan Kulagin Reviewers: davide, craig.topper, spatel, RKSimon Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48765 llvm-svn: 336768	2018-07-11 06:57:42 +00:00
Craig Topper	152eba3166	[X86] Remove some composite MOVSS/MOVSD isel patterns. These patterns looked for a MOVSS/SD followed by a scalar_to_vector. Or a scalar_to_vector followed by a load. In both cases we emitted a MOVSS/SD for the MOVSS/SD part, a REG_CLASS for the scalar_to_vector, and a MOVSS/SD for the load. But we have patterns that do each of those 3 things individually so there's no reason to build large patterns. Most of the test changes are just reorderings. The one test that had a meaningful change is pr30430.ll and it appears to be a regression. But it's doing -O0 so I think it missed a lot of opportunities and was just getting lucky before. llvm-svn: 336762	2018-07-11 04:51:40 +00:00
Sam Clegg	eb883aa8d1	[WebAssembly] Add pass to infer prototypes for prototype-less functions See https://bugs.llvm.org/show_bug.cgi?id=35385 Differential Revision: https://reviews.llvm.org/D48471 llvm-svn: 336759	2018-07-11 04:29:36 +00:00
Stefan Pintilie	23881494ba	[Power9] Add remaining __float128 builtin support for FMA round to odd Implement this as it is done on GCC: __float128 a, b, c, d; a = __builtin_fmaf128_round_to_odd (b, c, d); // generates xsmaddqpo a = __builtin_fmaf128_round_to_odd (b, c, -d); // generates xsmsubqpo a = - __builtin_fmaf128_round_to_odd (b, c, d); // generates xsnmaddqpo a = - __builtin_fmaf128_round_to_odd (b, c, -d); // generates xsnmsubqpo Differential Revision: https://reviews.llvm.org/D48218 llvm-svn: 336754	2018-07-11 01:42:22 +00:00
Chen Zheng	364af0bd9b	[test cases] add test cases for finding more abs patterns Differential Revision: https://reviews.llvm.org/D49123 llvm-svn: 336752	2018-07-11 01:07:21 +00:00
Eli Friedman	9ae1cdf437	[ARM] Treat cmn immediates as legal in isLegalICmpImmediate. The original code attempted to do this, but the std::abs() call didn't actually do anything due to implicit type conversions. Fix the type conversions, and perform the correct check for negative immediates. This probably has very little practical impact, but it's worth fixing just to avoid confusion in the future, I think. Differential Revision: https://reviews.llvm.org/D48907 llvm-svn: 336742	2018-07-10 23:44:37 +00:00
Craig Topper	b9d34784e3	[X86] Teach X86InstrInfo::commuteInstructionImpl to use MOVSD/MOVSS for BLEND under optsize when the immediate allows it. Isel currently emits movss/movsd a lot of the time and an accidental double commute turns it into a blend. Ideally we'd select blend directly in isel under optspeed and not rely on the double commute to create blend. llvm-svn: 336731	2018-07-10 22:02:23 +00:00
Teresa Johnson	867a1b8b33	[ThinLTO] Use std::map to get deterministic imports files Summary: I noticed that the .imports files emitted for distributed ThinLTO backends do not have consistent ordering. This is because StringMap iteration order is not guaranteed to be deterministic. Since we already have a std::map with this information, used when emitting the individual index files (ModuleToSummariesForIndex), use it for the imports files as well. This issue is likely causing some unnecessary rebuilds of the ThinLTO backends in our distributed build system as the imports files are inputs to those backends. Reviewers: pcc, steven_wu, mehdi_amini Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D48783 llvm-svn: 336721	2018-07-10 20:06:04 +00:00
Alexander Ivchenko	c84207f8fa	[GlobalISel][X86_64] Support for G_SITOFP The instruction selection is automatically handled by tablegen llvm-svn: 336703	2018-07-10 16:38:35 +00:00
Eugene Leviant	2b4e7c18a5	[Evaluator] Examine alias when evaluating function call This fixes PR38120 llvm-svn: 336702	2018-07-10 16:34:23 +00:00
Simon Pilgrim	6b5342cb2e	[DAGCombiner] Add special case fast paths for udiv x,1 and udiv x,-1 udiv x,-1 was going down the (slow) BuildUDIV route resulting in unnecessary shifts. llvm-svn: 336701	2018-07-10 16:33:07 +00:00
Konstantin Zhuravlyov	a7313cb243	AMDGPU: Make hidden argument metadata consistent with amdgpu-implicitarg-num-bytes attribute Differential Revision: https://reviews.llvm.org/D49096 llvm-svn: 336697	2018-07-10 16:12:51 +00:00
Sanjay Patel	6302df4087	[InstCombine] allow flag propagation when using safe constant This corresponds with the code for the single binop pattern added in rL336684. llvm-svn: 336696	2018-07-10 16:09:49 +00:00
Simon Pilgrim	185bf61e29	[X86] Add srem/udiv/urem by constant tests Match the tests in combine-sdiv.ll llvm-svn: 336694	2018-07-10 16:08:28 +00:00
Heejin Ahn	6bf9a0ff64	[WebAssembly] Add a few missing {{$}}s to a test llvm-svn: 336691	2018-07-10 16:00:43 +00:00
Konstantin Zhuravlyov	54b606a6d2	AMDGPU/NFC: Fix typo in test name hsa-metadata-enqueu-kernel.ll -> hsa-metadata-enqueue-kernel.ll llvm-svn: 336689	2018-07-10 15:54:46 +00:00
Paul Robinson	137d0a46b9	Update test to work on Windows llvm-svn: 336687	2018-07-10 15:23:10 +00:00
Sanjay Patel	1f887f3f07	[InstCombine] safely allow non-commutative binop identity constant folds This was originally intended with D48893, but as discussed there, we have to make the folds safe from producing extra poison. This should give the single binop folds the same capabilities as the existing folds for 2-binops+shuffle. LLVM binary opcode review: there are a total of 18 binops. There are 7 commutative binops (add, mul, and, or, xor, fadd, fmul) which we already fold. We're able to fold 6 more opcodes with this patch (shl, lshr, ashr, fdiv, udiv, sdiv). There are no folds for srem/urem/frem AFAIK. We don't bother with sub/fsub with constant operand 1 because those are canonicalized to add/fadd. 7 + 6 + 3 + 2 = 18. llvm-svn: 336684	2018-07-10 15:12:31 +00:00
Krzysztof Parzyszek	51bc3f9232	[Hexagon] Change .mir testcase to make sure function is not in SSA form If a machine function satisfies SSA, the IsSSA property is assumed even if the pass to be executed runs after exiting from SSA. If the pass output then does not conform to SSA, a verifier error will be flagged (with expensive checks enabled). llvm-svn: 336682	2018-07-10 14:49:54 +00:00
Paul Robinson	a03ecc4a21	Support -fdebug-prefix-map in llvm-mc. This is useful to omit the debug compilation dir when compiling assembly files with -g. Part of PR38050. Patch by Siddhartha Bagaria! Differential Revision: https://reviews.llvm.org/D48988 llvm-svn: 336680	2018-07-10 14:41:54 +00:00

54402 Commits