llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Philip Reames	a74c4e37ae	[unittest] Exercise SCEV's udiv and udiv ceiling routines The ceiling variant was recently added (due to the work towards D105216), and we're spending a lot of time trying to find optimizations for the expression. This patch brute forces the space of i8 unsigned divides and checks that we get a correct (well consistent with APInt) result for both udiv and udiv ceiling. (This is basically what I've been doing locally in a hand rolled C++ program, and I realized there no good reason not to check it in as a unit test which directly exercises the logic on constants.) Differential Revision: https://reviews.llvm.org/D106083	2021-07-15 11:55:00 -07:00
Nikita Popov	ff55ad2c7b	[Verifier] Use isIntrinsic() (NFC) Call Function::isIntrinsic() instead of manually checking the function name for an "llvm." prefix.	2021-07-15 20:30:42 +02:00
Simon Pilgrim	d5316f9a80	[InstCombine] Add select(cond,gep(gep(x,y),z),gep(x,y)) tests from PR51069	2021-07-15 19:26:24 +01:00
Sam Tebbs	d16f1096a9	[ARM][LowOverheadLoops] Make some stack spills valid for tail predication This patch makes vector spills valid for tail predication when all loads from the same stack slot are within the loop Differential Revision: https://reviews.llvm.org/D105443	2021-07-15 19:23:52 +01:00
Quinn Pham	ba35dd5a19	[PowerPC] Fix popcntb XL Compat Builtin for 32bit This patch implements the `__popcntb` XL compatibility builtin for 32bit in the frontend and backend. This patch also updates tests for `__popcntb` and other XL Compat sync related builtins. Reviewed By: #powerpc, nemanjai, amyk Differential Revision: https://reviews.llvm.org/D105360	2021-07-15 13:19:47 -05:00
Simon Pilgrim	95f4ef0879	Fix "unknown pragma 'GCC'" MSVC warning. NFCI.	2021-07-15 18:50:19 +01:00
Simon Pilgrim	66ad7fe2fe	[InstCombine] Add 3-operand gep test with different ptr and same indices	2021-07-15 18:50:19 +01:00
Philip Reames	6a45d08863	[SCEV] Fix unsound reasoning in howManyLessThans This is split from D105216, it handles only a subset of the cases in that patch. Specifically, the issue being fixed is that the code incorrectly assumed that (Start-Stide) < End implied that the backedge was taken at least once. This is not true when e.g. Start = 4, Stride = 2, and End = 3. Note that we often do produce the right backedge taken count despite the flawed reasoning. The fix chosen here is to use an alternate form of uceil (ceiling of unsigned divide) lowering which is safe when max(RHS,Start) > Start - Stride. (Note that signedness of both max expression and comparison depend on the signedness of the comparison being analyzed, and that overflow in the Start - Stride expression is allowed.) Note that this is weaker than proving the backedge is taken because it allows start - stride < end < start. Some cases which can't be proven safe are sent down the generic path, and we do end up generating less optimal expressions in a few cases. Credit for coming up with the approach goes entirely to Eli. I just split it off, tweaked the comments a bit, and did some additional testing. Differential Revision: https://reviews.llvm.org/D105942	2021-07-15 10:32:47 -07:00
Fangrui Song	35af2802d5	[test] Avoid llvm-readelf/llvm-readobj one-dash long options and deprecated aliases (e.g. --file-headers)	2021-07-15 10:26:21 -07:00
Vy Nguyen	27626de14b	[llvm-exegesis] Fix missing-headers build errors. Details: Switch all #includes to use <> because that is consistent with what happens in the cmake checks. Otherwise, we could be in the situation where cmake checks see that headers exist at <perfmon/...> but in llvm-exegesis code, we use "perfmon/...", which may not exist. Related PR/revisions: D84076, PR51017+D105615 Differential Revision: https://reviews.llvm.org/D105861	2021-07-15 13:20:25 -04:00
Arthur Eubanks	ee15f89094	Revert "[SLP]Workaround for InsertSubVector cost." This reverts commit 2eb50baf059648214cb1c624b5269978a62e86a1. Causes hangs, see comments on D105827.	2021-07-15 10:19:41 -07:00
Jessica Paquette	d52865f98f	[GlobalISel] Fix infinite loop in reassociationCanBreakAddressingModePattern It didn't update the opcode while walking through G_INTTOPTR/G_PTRTOINT. Differential Revision: https://reviews.llvm.org/D106080	2021-07-15 10:09:07 -07:00
Stanislav Mekhanoshin	5c27957af8	[AMDGPU] Refine -O0 and -O1 passes. Differential Revision: https://reviews.llvm.org/D105579	2021-07-15 09:51:54 -07:00
Fangrui Song	5aac6f60bc	[llvm-nm] Remove one-dash long options except -arch The documentation and help messages have recommended the double-dash forms for quite a while. Remove one-dash long options which are not recognized by GNU style `getopt_long`. `-arch` is kept as it is in the manpage of classic nm https://keith.github.io/xcode-man-pages/nm.1.html Note: the dyldinfo related options don't have a test. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105948	2021-07-15 09:50:37 -07:00
Andrzej Warzynski	e5843d83d7	Enable Flang by default in the test-release.sh script I've also brought this up on llvm-dev: https://lists.llvm.org/pipermail/llvm-dev/2021-July/151744.html Differential Revision: https://reviews.llvm.org/D105885	2021-07-15 17:17:49 +01:00
Nikita Popov	d58a8fbeab	[IR] Add elementtype attribute This implements the elementtype attribute specified in D105407. It just adds the attribute and the specified verifier rules, but doesn't yet make use of it anywhere. Differential Revision: https://reviews.llvm.org/D106008	2021-07-15 18:04:26 +02:00
Nikita Popov	159ef87203	[LangRef] Add elementtype attribute This adds an elementtype(<ty>) attribute, which can be used to attach an element type to a pointer typed argument. It is similar to byval/byref in purpose, but unlike those does not carry any specific semantics by itself. However, certain intrinsics may require it and interpret it in specific ways. The in-tree use cases for this that I'm currently aware of are: call ptr @llvm.preserve.array.access.index.p0.p0(ptr elementtype(%ty) %base, i32 %dim, i32 %index) call ptr @llvm.preserve.struct.access.index.p0.p0(ptr elementtype(%ty) %base, i32 %gep_index, i32 %di_index) call token @llvm.experimental.gc.statepoint.p0(i64 0, i32 0, ptr elementtype(void ()) @foo, i32 0, i32 0, i32 0, i32 0, ptr addrspace(1) %obj) Notably, the gc.statepoint case needs a function as element type, in which case the workaround of adding a separate %ty undef argument would not work, as arguments cannot be unsized. Differential Revision: https://reviews.llvm.org/D105407	2021-07-15 18:04:25 +02:00
Arthur Eubanks	edc13daf17	[InstCombine] Look through invariant group intrinsics when removing malloc Fixes some regressions with -fstrict-vtable-pointers in llvm-test-suite. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D106017	2021-07-15 09:02:40 -07:00
Philip Reames	96cc0219ad	[LV] Enable vectorization of multiple exit loops w/computable exit counts This change enables vectorization of multiple exit loops when the exit count is statically computable. That requirement - shared with the rest of LV - in turn requires each exit to be analyzeable and to dominate the latch. The majority of work to support this was done in a set of previous patches. In particular,, 72314466 avoids having multiple edges from the middle block to the exits, and 4b33b2387 which added support for non-latch single exit and multiple exits with a single exiting block. As a result, this change is basically just removing a bailout and adjusting some tests now that the prerequisite work is done and has stuck in tree for a bit. Differential Revision: https://reviews.llvm.org/D105817	2021-07-15 08:53:51 -07:00
Nikita Popov	929097793e	[AsmParser] Unify parsing of attributes Continuing on from D105780, this should be the last major bit of attribute cleanup. Currently, LLParser implements attribute parsing for functions, parameters and returns separately, enumerating all supported (and unsupported) attributes each time. This patch extracts the common parsing logic, and performs a check afterwards whether the attribute is valid in the given position. Parameters and returns are handled together, while function attributes need slightly different logic to support attribute groups. Differential Revision: https://reviews.llvm.org/D105938	2021-07-15 17:51:11 +02:00
Shilei Tian	4ef4182afa	Revert "[AbstractAttributor] Fold function calls to `__kmpc_is_spmd_exec_mode` if possible" This reverts commit 1100e4aafea233bc8bbc307c5758a7d287ad3bae.	2021-07-15 11:19:28 -04:00
Simon Pilgrim	f2c8d69df9	[DAG] Fold select(cond,binop(x,y),binop(x,z)) -> binop(x,select(cond,y,z)) Similar to the folds performed in InstCombinerImpl::foldSelectOpOp, this attempts to push a select further up to help merge a pair of binops. I'm primarily interested in select(cond,add(x,y),add(x,z)) folds to help expose pointer math (see https://bugs.llvm.org/show_bug.cgi?id=51069 etc.) but I've tried to use the more generic isBinOp(). Differential Revision: https://reviews.llvm.org/D106058	2021-07-15 16:08:30 +01:00
Simon Pilgrim	ea93bd8da3	[NVPTX] Tweak fast-math tests to avoid select(binop(x,y),binop(x,z)) fold As suggested on D106058, tweak the tests to keep the combineRepeatedFPDivisors test coverage.	2021-07-15 15:42:25 +01:00
Sander de Smalen	50816080ef	Revert "[LV] Print remark when loop cannot be vectorized due to invalid costs." This reverts commit efaf3099c8cec1954831ee28a2f75a72096f50eb. This reverts commit dc7bdc1e7121693df112f2fdb11cc6b88580ba4b. Reverting patches due to buildbot failures.	2021-07-15 15:21:57 +01:00
Nathan Sidwell	83135ce4f4	[docs] More CMAKE variable documentation This breaks out some (more) common llvm-specific variables. Controlling the subprojects and target architectures, along with clues about restricting build parallelism when linking. 'more common' is somewhat subjective, of course. Differential Revision: https://reviews.llvm.org/D105822	2021-07-15 06:56:49 -07:00
David Green	47a6aacab4	[ARM] Expand types handled in VQDMULH recognition We have a DAG combine for recognizing the sequence of nodes that make up an MVE VQDMULH, but only currently handles specifically legal types. This patch expands that to other power-2 vector types. For smaller than legal types this means any_extending the type and casting it to a legal type, using a VQDMULH where we only use some of the lanes. The result is sign extended back to the original type, to properly set the invalid lanes. Larger than legal types are split into chunks with extracts and concat back together. Differential Revision: https://reviews.llvm.org/D105814	2021-07-15 14:47:53 +01:00
Tim Northover	9bdb647741	MachO: don't emit L... private symbols in do_not_dead_strip sections. The linker can sometimes drop the do_not_dead_strip if it can't associate the atom with a symbol (the other place to specify no dead-stripping in MachO files).	2021-07-15 14:40:43 +01:00
Roman Lebedev	5a590be916	[SimplifyCFG] Rerun PHI deduplication after common code sinkinkg (PR51092) `SinkCommonCodeFromPredecessors()` doesn't itself ensure that duplicate PHI nodes aren't created. I suppose, we could teach it to do that on-the-fly (& account for the already-existing PHI nodes, & adjust costmodel), the diff will be bigger than this. The alternative is to schedule a new EarlyCSE pass invocation somewhere later in the pipeline. Clearly, we don't have any EarlyCSE runs in module optimization passline, so this pattern isn't cleaned up... That would perhaps better, but it will again have some compile time impact. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D106010	2021-07-15 16:34:34 +03:00
Simon Pilgrim	87679ebe78	[TTI] Consistently make getMinVectorRegisterBitWidth() methods const. NFCI. The underlying getMinVectorRegisterBitWidth() methods are const, but it was missed in a couple of TargetTransformInfo wrappers. Noticed while working on D103925	2021-07-15 13:27:55 +01:00
Sander de Smalen	fe1fe35455	[LV] Fix determinism for failing scalable-call.ll test. The sort function for emitting an OptRemark was not deterministic, which caused scalable-call.ll to fail on some buildbots. This patch fixes that. This patch also fixes an issue where `Instruction::comesBefore()` is called when two Instructions are in different basic blocks, which would otherwise cause an assertion failure.	2021-07-15 13:16:59 +01:00
Stephen Tozer	a972fea3c6	Reapply "[DebugInfo] Enable variadic debug value salvaging" Reapplied after previous build failures were fixed in 14b62f7e2. This reverts commit 540b4a5fb31086b6d40735e96e6ec497022107e7.	2021-07-15 12:54:51 +01:00
Fraser Cormack	033e597afe	[VP][NFC] Correct formatting in unit test	2021-07-15 12:38:47 +01:00
Simon Pilgrim	ac23ae72cc	[NVPTX] Add selp.f32 checks to select(cond,fpbinop(),fpbinop()) tests Will help show codegen diffs in an upcoming patch	2021-07-15 12:42:29 +01:00
Simon Pilgrim	446183aa1c	[InstCombine] Strip inbounds from (select C, (gep Ptr, Idx), Ptr) -> (gep Ptr, (select C, Idx, 0)) fold As discussed on rGd561b6fbdbe6, we can't guarantee that the new gep is inbounds	2021-07-15 12:19:10 +01:00
Simon Pilgrim	7c3ad31154	[MIPS] Refresh ashr test checks. NFCI.	2021-07-15 12:12:19 +01:00
Irina Dobrescu	16f19bf26e	[AArch64][GlobalISel] Optimise lowering for some vector types for min/max Differential Revision: https://reviews.llvm.org/D105696	2021-07-15 11:34:32 +01:00
Sebastian Neubauer	b992832f74	[AMDGPU] Use isMetaInstruction for instruction size Meta instructions have a size of 0. Use isMetaInstruction instead of listing them explicitly. Differential Revision: https://reviews.llvm.org/D106043	2021-07-15 12:23:11 +02:00
Ilya Leoshkevich	3a1b644428	[TSan] Add SystemZ longjmp support Implement the interceptor and stack pointer demangling. Reviewed By: dvyukov Differential Revision: https://reviews.llvm.org/D105629	2021-07-15 12:18:48 +02:00
Ilya Leoshkevich	b0ff945d9d	[TSan] Use zeroext for function parameters SystemZ ABI requires zero-extending function parameters to 64-bit. The compiler is free to optimize the code around this assumption, e.g. failing to zero-extend __tsan_atomic32_load()'s morder may cause crashes in to_mo() switch table lookup. Fix by adding zeroext attributes to TSan's FunctionCallees, similar to how it was done in commit 3bc439bdff8b ("[MSan] Add instrumentation for SystemZ"). This is a no-op on arches that don't need it. Reviewed By: dvyukov Differential Revision: https://reviews.llvm.org/D105629	2021-07-15 12:18:47 +02:00
Max Kazantsev	4863909bb0	[Test] We can benefit from pipelining of ymm load/stores This patch demonstrates a scenario when we need to load/store a single 64-byte value, which is done by 2 ymm loads and stores in AVX. The current codegen choses the following sequence: load ymm0 load ymm1 store ymm1 store ymm0 If we instead stored ymm0 before ymm1, we could execute 2nd load and 1st store in parallel.	2021-07-15 17:15:14 +07:00
Cullen Rhodes	c55c74c634	[AArch64][SME] Add outer product instructions This patch adds support for the following outer product instructions: * BFMOPA, BFMOPS, FMOPA, FMOPS, SMOPA, SMOPS, SUMOPA, SUMOPS, UMOPA, UMOPS, USMOPA, USMOPS. Depends on D105570. The reference can be found here: https://developer.arm.com/documentation/ddi0602/2021-06 Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D105571	2021-07-15 09:51:06 +00:00
Florian Mayer	f4bb75377e	[NFC] [hwasan] Split argument logic into functions. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D105971	2021-07-15 10:45:43 +01:00
Bogdan Graur	baec013412	Fixes memory sanitizer 'use-of-uninitialized-value' diagnostic. Differential Revision: https://reviews.llvm.org/D106047	2021-07-15 11:17:04 +02:00
Timm Bäder	0108938558	[llvm][tools] Hide unrelated llvm-bcanalyzer options They otherwise show up when we link against the dynamic libLLVM.so. Differential Revision: https://reviews.llvm.org/D105893	2021-07-15 10:43:15 +02:00
LLVM GN Syncbot	091d61e196	[gn build] Port b0d38ad0bc25	2021-07-15 07:50:35 +00:00
Djordje Todorovic	140c795b75	[2/2][RemoveRedundantDebugValues] Add a Pass that removes redundant DBG_VALUEs This patch adds the forward scan for finding redundant DBG_VALUEs. This analysis aims to remove redundant DBG_VALUEs by going forward in the basic block by considering the first DBG_VALUE as a valid until its first (location) operand is not clobbered/modified. For example: (1) DBG_VALUE $edi, !"var1", ... (2) <block of code that does affect $edi> (3) DBG_VALUE $edi, !"var1", ... ... in this case, we can remove (3). Differential Revision: https://reviews.llvm.org/D105280	2021-07-15 00:08:31 -07:00
Tony Tye	57b2fbab2e	[AMDGPU] Reserve AMDGPU ELF e_flags machine 0x44 Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D106034	2021-07-15 06:46:27 +00:00
Chuanqi Xu	ca13ea7edf	[Coroutines] Run coroutine passes by default This patch make coroutine passes run by default in LLVM pipeline. Now the clang and opt could handle IR inputs containing coroutine intrinsics without special options. It should be fine. On the one hand, the coroutine passes seems to be stable since there are already many projects using coroutine feature. On the other hand, the coroutine passes should do nothing for IR who doesn't contain coroutine intrinsic. Test Plan: check-llvm Reviewed by: lxfind, aeubanks Differential Revision: https://reviews.llvm.org/D105877	2021-07-15 14:33:40 +08:00
Kuter Dinel	b86b597e6c	[Attributor] AACallEdges, Add a way to ask nonasm unknown callees This patch adds a feature to AACallEdges AbstractAttribute that allows users to ask if there is a unknown callee that isn't a inline assembly. This feature is needed by some of it's users. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105992	2021-07-15 06:10:42 +03:00
Chen Zheng	90f3c2e043	[PowerPC][NFC] add testcase for update-form preparation with non-const increment	2021-07-15 02:46:24 +00:00

1 2 3 4 5 ...

218562 Commits