llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Cullen Rhodes	c640adbe73	[LV] Clamp VF hint when unsafe In the following loop the dependence distance is 2 and can only be vectorized if the vector length is no larger than this. void foo(int a, int b, int N) { #pragma clang loop vectorize(enable) vectorize_width(4) for (int i=0; i<N; ++i) { a[i + 2] = a[i] + b[i]; } } However, when specifying a VF of 4 via a loop hint this loop is vectorized. According to [1][2], loop hints are ignored if the optimization is not safe to apply. This patch introduces a check to bail of vectorization if the user specified VF is greater than the maximum feasible VF, unless explicitly forced with '-force-vector-width=X'. [1] https://llvm.org/docs/LangRef.html#llvm-loop-vectorize-and-llvm-loop-interleave [2] https://clang.llvm.org/docs/LanguageExtensions.html#extensions-for-loop-hint-optimizations Reviewed By: sdesmalen, fhahn, Meinersbur Differential Revision: https://reviews.llvm.org/D90687	2020-12-01 11:30:34 +00:00
Simon Pilgrim	32a49915a1	[InstCombine][X86] Fold addsub intrinsic to fadd/fsub depending on demanded elts (PR46277)	2020-12-01 11:27:40 +00:00
Caroline Concatto	319a0490a1	[NFC][CostModel]Extend class IntrinsicCostAttributes to use ElementCount Type This patch replaces the attribute `unsigned VF` in the class IntrinsicCostAttributes by `ElementCount VF`. This is a non-functional change to help upcoming patches to compute the cost model for scalable vector inside this class. Differential Revision: https://reviews.llvm.org/D91532	2020-12-01 11:12:51 +00:00
Florian Hahn	5168f3f070	[ConstraintElimination] Decompose GEP %ptr, SHL(). Add support the decompose a GEP with an SHL operand.	2020-12-01 10:58:36 +00:00
Kazushi (Jam) Marukawa	60cc012b65	[VE] Add vadd and vsub intrinsic instructions Add vadd and vsub intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92332	2020-12-01 19:57:22 +09:00
Simon Pilgrim	e202dff3a4	[InstCombine][X86] Add test coverage showing failure to simplify addsub intrinsics to fadd/fsub If we only use odd/even lanes then we just need fadd/fsub ops	2020-12-01 10:49:43 +00:00
Sjoerd Meijer	c00b31be29	ExtractValue instruction costs Instruction ExtractValue wasn't handled in LoopVectorizationCostModel::getInstructionCost(). As a result, it was modeled as a mul which is not really accurate. Since it is free (most of the times), this now gets a cost of 0 using getInstructionCost. This is a follow-up of D92208, that required changing this regression test. In a follow up I will look at InsertValue which also isn't handled yet. Differential Revision: https://reviews.llvm.org/D92317	2020-12-01 10:42:23 +00:00
David Green	de6fd49a53	[AArch64] Update pass pipeline test. NFC	2020-12-01 10:40:04 +00:00
David Green	a65007720f	[ARM] PREDICATE_CAST demanded bits The PREDICATE_CAST node is used to model moves between MVE predicate registers and gpr's, and eventually become a VMSR p0, rn. When moving to a predicate only the bottom 16 bits of the sources register are demanded. This adds a simple fold for that, allowing it to potentially remove instructions like uxth. Differential Revision: https://reviews.llvm.org/D92213	2020-12-01 10:32:24 +00:00
Jay Foad	54cad475e3	[AMDGPU] Simplify some generation checks. NFC.	2020-12-01 10:15:32 +00:00
Hans Wennborg	2f9caad35e	[gn build] Manually merge 40659cd	2020-12-01 11:15:05 +01:00
Georgii Rymar	59f342a322	[obj2yaml] - Teach tool to emit the "SectionHeaderTable" key and sort sections by file offset. Currently when we dump sections, we dump them in the order, which is specified in the sections header table. With that the order in the output might not match the order in the file. This patch starts sorting them by by file offsets when dumping. When the order in the section header table doesn't match the order in the file, we should emit the "SectionHeaderTable" key. This patch does it. Differential revision: https://reviews.llvm.org/D91249	2020-12-01 12:59:15 +03:00
Georgii Rymar	d41ff4ca02	[llvm-readobj][test] - Merge 2 test cases together. This merges `invalid-attr-section-size.test` and `invalid-attr-version.test` into `invalid-attributes-sec.test`. This allows to have a single place where other related test cases can be added. Differential revision: https://reviews.llvm.org/D92316	2020-12-01 12:51:07 +03:00
Georgii Rymar	f4dc2c0d44	[llvm-readobj] - Introduce `ObjDumper::reportUniqueWarning(const Twine &Msg)`. This introduces the overload for `reportUniqueWarning` which allows to avoid using `createError` in many places. Differential revision: https://reviews.llvm.org/D92371	2020-12-01 12:36:44 +03:00
Jan Svoboda	63e7cdb281	[clang][cli] Port DependencyOutput option flags to new option parsing system Depends on D91861. Reviewed By: dexonsmith Original patch by Daniel Grumberg. Differential Revision: https://reviews.llvm.org/D83694	2020-12-01 10:36:12 +01:00
Jan Svoboda	8ff798c583	[clang][cli] Port Frontend option flags to new option parsing system Depends on D91861. Reviewed By: dexonsmith Original patch by Daniel Grumberg. Differential Revision: https://reviews.llvm.org/D83697	2020-12-01 10:02:08 +01:00
Jan Svoboda	5bc4c8d4e4	[clang][cli] Split DefaultAnyOf into a default value and ImpliedByAnyOf This makes the options API composable, allows boolean flags to imply non-boolean values and makes the code more logical (IMO). Differential Revision: https://reviews.llvm.org/D91861	2020-12-01 09:50:11 +01:00
Kristof Beyls	27280e2609	collect_and_build_with_pgo.py: adapt to monorepo Differential Revision: https://reviews.llvm.org/D92328	2020-12-01 09:16:12 +01:00
Georgii Rymar	950e7e96bb	[llvm-readelf] - Switch to using from `reportWarning` to `reportUniqueWarning` in `DynRegionInfo`. This is a part of the plan we had previously to convert all calls to `reportUniqueWarning` and then rename it to just `reportWarning`. I was a bit unsure about this particular change at first, because it doesn't add a new functionality: seems it is impossible to trigger a warning duplication currently. At the same time I find the idea of the plan mentioned very reasonable. And with that we will be sure that `DynRegionInfo` can't report duplicate warnings, what looks like a nice feature for possible refactorings and further tool development. Differential revision: https://reviews.llvm.org/D92224	2020-12-01 11:09:30 +03:00
Georgii Rymar	37259d46e0	[llvm-readelf/obj] - Move unique warning handling logic to the `ObjDumper`. This moves the `reportUniqueWarning` method to the base class. My motivation is the following: I've experimented with replacing `reportWarning` calls with `reportUniqueWarning` in ELF dumper. I've found that for example for removing them from `DynRegionInfo` helper class, it is worth to pass a dumper instance to it (to be able to call dumper()->reportUniqueWarning()). The problem was that `ELFDumper<ELFT>` is a template class. I had to make `DynRegionInfo` to be templated and do lots of minor changes everywhere what did not look reasonable/nice. At the same time I guess one day other dumpers like COFF/MachO/Wasm etc might want to start using `reportUniqueWarning` API too. Then it looks reasonable to move the logic to the base class. With that the problem of passing the dumper instance will be gone. Differential revision: https://reviews.llvm.org/D92218	2020-12-01 10:53:00 +03:00
Kazu Hirata	ce90865fb0	[CodeView] Remove unused declaration collectInlineSiteChildren (NFC) The function definition was removed on Sep 7, 2016 in commit a9f4cc9510546f5728258524d344a3e03e43500b. The declaration seems to be unused since then.	2020-11-30 22:28:26 -08:00
Wei Wang	5e0aabe083	[Remarks][2/2] Expand remarks hotness threshold option support in more tools This is the #2 of 2 changes that make remarks hotness threshold option available in more tools. The changes also allow the threshold to sync with hotness threshold from profile summary with special value 'auto'. This change expands remarks hotness threshold option -fdiagnostics-hotness-threshold in clang and *-remarks-hotness-threshold in other tools to utilize hotness threshold from profile summary. Remarks hotness filtering relies on several driver options. Table below lists how different options are correlated and affect final remarks outputs: \| profile \| hotness \| threshold \| remarks printed \| \|---------\|---------\|-----------\|-----------------\| \| No \| No \| No \| All \| \| No \| No \| Yes \| None \| \| No \| Yes \| No \| All \| \| No \| Yes \| Yes \| None \| \| Yes \| No \| No \| All \| \| Yes \| No \| Yes \| None \| \| Yes \| Yes \| No \| All \| \| Yes \| Yes \| Yes \| >=threshold \| In the presence of profile summary, it is often more desirable to directly use the hotness threshold from profile summary. The new argument value 'auto' indicates threshold will be synced with hotness threshold from profile summary during compilation. The "auto" threshold relies on the availability of profile summary. In case of missing such information, no remarks will be generated. Differential Revision: https://reviews.llvm.org/D85808	2020-11-30 21:55:50 -08:00
Wei Wang	d0b74589e5	[Remarks][1/2] Expand remarks hotness threshold option support in more tools This is the #1 of 2 changes that make remarks hotness threshold option available in more tools. The changes also allow the threshold to sync with hotness threshold from profile summary with special value 'auto'. This change modifies the interface of lto::setupLLVMOptimizationRemarks() to accept remarks hotness threshold. Update all the tools that use it with remarks hotness threshold options: * lld: '--opt-remarks-hotness-threshold=' * llvm-lto2: '--pass-remarks-hotness-threshold=' * llvm-lto: '--lto-pass-remarks-hotness-threshold=' * gold plugin: '-plugin-opt=opt-remarks-hotness-threshold=' Differential Revision: https://reviews.llvm.org/D85809	2020-11-30 21:55:49 -08:00
Greg Parker	f76f6a2acc	[DSE] Remove a redundant call to getLocForWriteEx() Differential Revision: https://reviews.llvm.org/D92263	2020-11-30 21:12:24 -08:00
Craig Topper	27c403eff2	[RISCV] Rename RISCVGenSystemOperands.inc to RISCVGenSearchableTables.inc to prepare for more tables. NFC D89449 adds more tables so renaming as a pre-commit for that.	2020-11-30 20:47:58 -08:00
Hendrik Greving	56e95be0dd	Add MachineModuleInfo constructor with external MCContext Adds a constructor to MachineModuleInfo and MachineModuleInfoWapperPass that takes an external MCContext. If provided, the external context will be used throughout codegen instead of MMI's default one. This enables external drivers to take ownership of data put on the MMI's context during codegen. The internal context is used otherwise and destroyed upon finish. Differential Revision: https://reviews.llvm.org/D91313	2020-11-30 20:28:13 -08:00
Fangrui Song	90135dd4f2	[GlobalISel] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off builds	2020-11-30 18:31:42 -08:00
Fangrui Song	2b3f56b146	[GlobalISel] Fix -Wunused-variable	2020-11-30 18:25:54 -08:00
Amy Huang	b375f49096	Recommit "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" This reverts commit 1b63177a56e8cd6196778d2b90295f03e96b5800.	2020-11-30 17:36:12 -08:00
Leonard Chan	03ffcb1a94	[llvm] Fix for failing test from cf8ff75bade763b054476321dcb82dcb2e7744c7 Handle null values when handling operand changes for DSOLocalEquivalent.	2020-11-30 17:22:28 -08:00
Nico Weber	0e1884b28c	clang/test: Remove platform-linker feature By explicitly requesting the system linker with `-fuse-ld=`, the tests are able to CHECK for the system linker even with CLANG_DEFAULT_LINKER=lld. Alternative to D74704. Differential Revision: https://reviews.llvm.org/D92291	2020-11-30 20:00:46 -05:00
Amara Emerson	8126e7b46b	[AArch64][GlobalISel] Fix crash during legalization of a vector G_SELECT with scalar mask. The lowering of vector selects needs to first splat the scalar mask into a vector first. This was causing a crash when building oggenc in the test suite. Differential Revision: https://reviews.llvm.org/D91655	2020-11-30 16:37:49 -08:00
Nick Desaulniers	5788fba676	[InlineCost] prefer range-for. NFC Prefer range-for over iterators when such methods exist. Precommitted from https://reviews.llvm.org/D91816. Signed-off-by: Nick Desaulniers <ndesaulniers@google.com> Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D92350	2020-11-30 16:07:40 -08:00
Amy Huang	6760e386f1	[llvm-symbolizer] Fix native symbolization on windows for inline sites. The existing code handles this correctly and I checked that the code in NativeInlineSiteSymbol also handles this correctly, but it was wrong in the NativeFunctionSymbol code. Differential Revision: https://reviews.llvm.org/D92134	2020-11-30 14:27:35 -08:00
Florian Hahn	0cf56463f6	[ConstraintElimination] Expand GEP decomposition tests. This adjusts the tests to cover more scenarios.	2020-11-30 20:33:18 +00:00
Paul Robinson	a1da6ce5db	[FastISel] NFC: Clean up unnecessary bookkeeping Now that we flush the local value map for every instruction, we don't need any extra flushes for specific cases. Also, LastFlushPoint is not used for anything. Follow-ups to #dc35368 (D91734). Differential Revision: https://reviews.llvm.org/D92338	2020-11-30 12:27:50 -08:00
Eric Astor	54e377fc02	[ms] [llvm-ml] Test macro function invocations in arbitrary positions Differential Revision: https://reviews.llvm.org/D89741	2020-11-30 15:13:23 -05:00
Matt Arsenault	1b4ed603c0	SplitKit: Use Register	2020-11-30 15:09:33 -05:00
Mircea Trofin	6fefe6148b	[llvm][inliner] Reuse the inliner pass to implement 'always inliner' Enable performing mandatory inlinings upfront, by reusing the same logic as the full inliner, instead of the AlwaysInliner. This has the following benefits: - reduce code duplication - one inliner codebase - open the opportunity to help the full inliner by performing additional function passes after the mandatory inlinings, but before th full inliner. Performing the mandatory inlinings first simplifies the problem the full inliner needs to solve: less call sites, more contextualization, and, depending on the additional function optimization passes run between the 2 inliners, higher accuracy of cost models / decision policies. Note that this patch does not yet enable much in terms of post-always inline function optimization. Differential Revision: https://reviews.llvm.org/D91567	2020-11-30 12:03:39 -08:00
Nikita Popov	1f71c3e563	[DL] Inline getAlignmentInfo() implementation (NFC) Apart from getting the entry in the table (which is already a separate function), the remaining logic is different for all alignment types and is better combined with getAlignment(). This is a minor efficiency improvement, and should make further improvements like using separate storage for different alignment types simpler.	2020-11-30 20:56:15 +01:00
Nick Lewycky	25d19be185	Creating a named struct requires only a Context and a name, but looking up a struct by name requires a Module. The method on Module merely accesses the LLVMContextImpl and no data from the module itself, so this patch moves getTypeByName to a static method on StructType that takes a Context and a name. There's a small number of users of this function, they are all updated. This updates the C API adding a new method LLVMGetTypeByName2 that takes a context and a name. Differential Revision: https://reviews.llvm.org/D78793	2020-11-30 11:34:12 -08:00
Eric Astor	2f595b5b2e	[ms] [llvm-ml] Implement the statement expansion operator If prefaced with a %, expand text macros and macro functions in any statement. Also, prevent expanding text macros in the message of an ECHO directive unless expanded explicitly by the statement expansion operator. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D89740	2020-11-30 14:33:24 -05:00
Sanjay Patel	621f444a0d	[x86] add tests for maxnum/minnum with nnan; NFC	2020-11-30 14:30:28 -05:00
Sjoerd Meijer	4f02a2abc3	[AArch64] Enable Cortex-A55 schedmodel The model was committed in 4b8ade837e36b7f0181ce86fc23f33851d0fdd35 but not yet enabled to allow for a few fix ups. This adds a few of these fixes, and also a LLVM MCA test to check most instructions. While I do have plans to look into some more tuning, it's time to enable this as it better than using the A53 schedule. Differential Revision: https://reviews.llvm.org/D88017	2020-11-30 19:28:34 +00:00
Hongtao Yu	4171a5213d	[CSSPGO] Disabling a pseudo probe test on non-x86 platforms. Disabling a pseudo probe test on non-x86 platforms since it's not fully tested there.	2020-11-30 11:19:45 -08:00
Paul Robinson	a36c6e7cff	[FastISel] NFC: Remove obsolete -fast-isel-sink-local-values option This option is not used for anything after #dc35368 (D91734).	2020-11-30 10:55:49 -08:00
Harald van Dijk	398e3ba1d1	[X86] Zero-extend pointers to i64 for x86_64 For LP64 mode, this has no effect as pointers are already 64 bits. For ILP32 mode (x32), this extension is specified by the ABI. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D91338	2020-11-30 18:51:23 +00:00
Simon Pilgrim	d592ef7991	[InstCombine][X86] Add basic addsub intrinsic SimplifyDemandedVectorElts support (PR46277) Pass through the demanded elts mask to the source operands. The next step will be to add support for folding to add/sub if we only demand odd/even elements.	2020-11-30 18:40:16 +00:00
LLVM GN Syncbot	df643f2f12	[gn build] Port 64fa8cce225	2020-11-30 18:20:24 +00:00
Hongtao Yu	37399ba912	[CSSPGO] A Clang switch -fpseudo-probe-for-profiling for pseudo-probe instrumentation. This change introduces a new clang switch `-fpseudo-probe-for-profiling` to enable AutoFDO with pseudo instrumentation. Please refer to https://reviews.llvm.org/D86193 for the whole story. One implication from pseudo-probe instrumentation is that the profile is now sensitive to CFG changes. We perform the pseudo instrumentation very early in the pre-LTO pipeline, before any CFG transformation. This ensures that the CFG instrumented and annotated is stable and optimization-resilient. The early instrumentation also allows the inliner to duplicate probes for inlined instances. When a probe along with the other instructions of a callee function are inlined into its caller function, the GUID of the callee function goes with the probe. This allows samples collected on inlined probes to be reported for the original callee function. Reviewed By: wmi Differential Revision: https://reviews.llvm.org/D86502	2020-11-30 10:16:54 -08:00

1 2 3 4 5 ...

207539 Commits