The motivation is that the update script has at least two deviations
(emitting `<...>@GOT`/`<...>@PLT`, and not hiding pointer arithmetic)
from what pretty much all the existing checklines were generated with,
and most of the tests have still not been updated, so each time one of
the out-of-date tests is regenerated to show the effect of a code
change, there is a lot of noise. Instead of having to deal with that
each time, let's just deal with everything at once.
This has been done via:
```
cd llvm-project/llvm/test/CodeGen/X86
grep -rl "; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py" | xargs -L1 <...>/llvm-project/llvm/utils/update_llc_test_checks.py --llc-binary <...>/llvm-project/build/bin/llc
```
Not all tests were regenerated, however.
I don't like landing this change, but it's an acknowledgement of a practical reality. Despite not having well-specified semantics for inttoptr and ptrtoint involving non-integral pointer types, they are used in practice. Here's a quick summary of the current pragmatic reality:
* I happen to know that the main external user of non-integral pointers has effectively disabled the verifier rules.
* RS4GC (the lowering pass for the abstract GC machine model, which is the key motivation for non-integral pointers) even supports them. We just have all the tests use an integral pointer space to let the verifier run.
* Certain idioms (such as alignment checks for alignment N, where any relocation is guaranteed to be N-byte aligned) are fine in practice.
* As implemented, inttoptr/ptrtoint are CSEd and are not control dependent. This means that any code which intends to check a particular bit pattern at the site of use must be wrapped in an intrinsic or external function call.
This change allows them in the Verifier, and updates the LangRef to specify them as implementation dependent. This allows us to acknowledge current reality while still leaving ourselves room to punt on figuring out "good" semantics until the future.
This is a similarity visualization tool that accepts a Module and
passes it to the IRSimilarityIdentifier. The resulting SimilarityGroups
are output in a JSON file.
Tests are found in test/tools/llvm-sim and check for a missing file,
a bad module, and that the JSON is created correctly.
Reviewers: paquette, jroelofs, MaskRay
Recommit of: 15645d044bcfe2a0f63156048b302f997a717688 to fix linking
errors.
Differential Revision: https://reviews.llvm.org/D86974
Also:
- add driver test (fsanitize-use-after-return.c)
- add basic IR test (asan-use-after-return.cpp)
- (NFC) cleaned up the logic for generating the table of __asan_stack_malloc
depending on the flag.
for issue: https://github.com/google/sanitizers/issues/1394
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D104076
This was found by chance; it revealed a discrepancy between the comment (a few
lines above), the condition, and how the re-ordering of instructions is done
inside the if statement it guards. The condition always evaluated to true.
Differential Revision: https://reviews.llvm.org/D104064
Register allocation may spill virtual registers to the stack, which can
increase the alignment requirements of the stack frame. If the function
did not require stack realignment before register allocation, the
registers required to do so may not be reserved/available. This results
in a stack frame that requires realignment but cannot be realigned.
Instead, only increase the alignment of the stack if we are still able
to realign.
The register's SpillAlignment will be ignored if we can't realign, and the
backend will be responsible for emitting the correct unaligned loads and
stores. This seems to be the assumed behaviour already, e.g.
ARMBaseInstrInfo::storeRegToStackSlot and X86InstrInfo::storeRegToStackSlot
are both `canRealignStack` aware.
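For illustration, the guarded update is of roughly this shape (a minimal
sketch assuming the usual MachineFrameInfo/TargetRegisterInfo APIs; the
helper name is made up):
```
#include "llvm/CodeGen/MachineFrameInfo.h"
#include "llvm/CodeGen/MachineFunction.h"
#include "llvm/CodeGen/TargetRegisterInfo.h"
#include "llvm/CodeGen/TargetSubtargetInfo.h"

using namespace llvm;

// Sketch only: raise the frame's maximum alignment for a spill slot
// solely when the stack can still be realigned; otherwise keep the old
// alignment and let the backend emit unaligned loads/stores.
static void updateMaxAlignmentForSpill(MachineFunction &MF, Align SpillAlign) {
  const TargetRegisterInfo *TRI = MF.getSubtarget().getRegisterInfo();
  if (TRI->canRealignStack(MF))
    MF.getFrameInfo().ensureMaxAlignment(SpillAlign);
}
```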
Differential Revision: https://reviews.llvm.org/D103602
Adds the basic instrumentation needed for stack tagging.
Currently does not support stack short granules or TLS stack histories,
since a different code path is followed for the callback instrumentation
we use.
We may simply wait to support these two features until we switch to
a custom calling convention.
Patch By: xiangzhangllvm, morehouse
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D102901
The problematic code pattern in the test is based on:
https://llvm.org/PR50638
If the IfCond is itself the phi that we are trying to remove,
then the loop around line 2835 can end up with something like:
```
%cmp = select i1 %cmp, i1 false, i1 true
```
That can then lead to a use-after-free and assert (although
I'm still not seeing that locally in my release + asserts build).
I think this can only happen with unreachable code.
Differential Revision: https://reviews.llvm.org/D104063
<string> is currently the highest impact header in a clang+llvm build:
https://commondatastorage.googleapis.com/chromium-browser-clang/llvm-include-analysis.html
One of the most common places this is being included is the APInt.h header, which needs it for an old toString() implementation that returns std::string - an inefficient method compared to the SmallString versions that it actually wraps.
This patch replaces these APInt/APSInt methods with a pair of llvm::toString() helpers inside StringExtras.h, adjusts users accordingly, and removes the <string> include from APInt.h. I was hoping that more of these users could be converted to use the SmallString methods, but it appears that most end up creating a std::string anyhow. I also avoided trying to use the raw_ostream << operators, as I didn't want to lose having the integer radix explicit in the code.
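For illustration, a before/after of a typical call site (the variable and
function names here are made up, but the helper signature follows the new
StringExtras.h code):
```
#include <string>
#include "llvm/ADT/APInt.h"
#include "llvm/ADT/StringExtras.h"

std::string printDecimal(const llvm::APInt &Val) {
  // Before: std::string S = Val.toString(10, /*Signed=*/true);
  // After: the free function wraps the SmallString-based APInt::toString
  // and keeps the radix explicit at the call site.
  return llvm::toString(Val, /*Radix=*/10, /*Signed=*/true);
}
```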
Differential Revision: https://reviews.llvm.org/D103888
GCC documentation for the `wa` constraint states that:
```
wa
A VSX register (VSR), vs0…vs63. This is either an FPR (vs0…vs31 are f0…f31)
or a VR (vs32…vs63 are v0…v31).
```
This technically means that we could accept floating point parameters. In fact,
gcc itself does. The following testcase compiles and runs on all PPC platforms with GCC,
whereas clang/llc will assert:
```
#include <stdio.h>

double foo(vector double a) {
  double b, c;
  asm("xvabsdp %x0, %x2 \n"
      "xxsldwi %x1, %x0, %x0, 2 \n"
      : "+wa" (b),
        "=wa" (c)
      : "wa" (a));
  return b + c;
}

int main(void) {
  vector double a = {-3., -4.};
  double t = foo(a);
  printf("%g\n", t);
}
```
This patch allows clang/llc to build and run this testcase.
Reviewed By: nemanjai, #powerpc
Differential Revision: https://reviews.llvm.org/D103409
Re-applying this patch after bot failures. It should be fine now.
The function __multi3() is undefined on 32-bit ARM, so a call to it should
never be emitted. Instead, plain instructions need to be generated to
perform 128-bit multiplications.
Differential Revision: https://reviews.llvm.org/D103906
Added a case for CTPOP to AArch64TTIImpl::getIntrinsicInstrCost so that
the cost estimate matches the codegen in
test/CodeGen/AArch64/arm64-vpopcnt.ll
Differential Revision: https://reviews.llvm.org/D103952
This has been reported several times by the PVS-Studio team, as well as coming up in some static analysis.
getRandom() % 1 always returns 0, so we never actually tested this codepath (git blame suggests it has always been like this). Given that we have plenty of other "getRandom() & 1" uses, the typo is pretty obvious and matches the intention in the comment above. With this change we generate a nice mixture of scalar/vector condition selects of vectors.
I don't know llvm-stress that well, but I don't think we guarantee that the same seed value will always generate the same IR across versions of the program, just that the same binary will.
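In sketch form, the one-character fix (the surrounding helper is a stand-in
here, not the actual llvm-stress code):
```
#include <cstdlib>

// Stand-in for llvm-stress's random source (hypothetical).
static unsigned getRandom() { return static_cast<unsigned>(std::rand()); }

static bool pickVectorCondition() {
  // Was: getRandom() % 1, which is always 0, so the vector-condition
  // path was never taken; the low bit gives an even scalar/vector mix.
  return getRandom() & 1;
}
```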
Differential Revision: https://reviews.llvm.org/D104022
We were passing the RecurrenceDescriptor by value to most of the reduction analysis methods, despite it being rather bulky with TrackingVH members (that can be costly to copy). In all these cases we're only using the RecurrenceDescriptor for rather basic purposes (access to types/kinds etc.).
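The shape of the change, for illustration (the function names are made up;
the real changes are across the reduction analysis methods):
```
#include "llvm/Analysis/IVDescriptors.h"

using namespace llvm;

// Before: every call copies the descriptor, including its TrackingVH
// members, which must register with the value-handle machinery.
bool usesOrderedReductionByValue(RecurrenceDescriptor RdxDesc);

// After: a const reference suffices for the type/kind queries we do.
bool usesOrderedReduction(const RecurrenceDescriptor &RdxDesc);
```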
Differential Revision: https://reviews.llvm.org/D104029
This adds a function specialization pass to LLVM. Constant parameters
like function pointers and constant globals are propagated to the callee by
specializing the function.
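As a plain C++ illustration of the idea (not the pass's actual input or
output):
```
// Before specialisation: 'compute' receives a function pointer that is a
// known constant at each call site, but the indirect call blocks inlining.
int square(int x) { return x * x; }
int twice(int x) { return x + x; }

int compute(int (*f)(int), int v) { return f(v); }

int user(int v) { return compute(square, v) + compute(twice, v); }

// After specialisation, the pass effectively produces one clone per
// constant argument, each with a direct (and inlinable) call, and
// rewrites 'user' to call the clones:
int compute_square(int v) { return square(v); }
int compute_twice(int v) { return twice(v); }
```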
This is a first version with a number of limitations:
- The pass is off by default, so needs to be enabled on the command line,
- It does not handle specialization of recursive functions,
- It does not yet handle constants and constant ranges,
- Only 1 argument per function is specialised,
- The cost-model could be further looked into, and perhaps related,
- We are not yet caching analysis results.
This is based on earlier work by Matthew Simpson (D36432) and Vinay Madhusudan.
More recently this was also discussed on the list, see:
https://lists.llvm.org/pipermail/llvm-dev/2021-March/149380.html.
The motivation for this work is that function specialisation often comes up as
a reason for performance differences of generated code between LLVM and GCC,
which has this enabled by default from optimisation level -O3 and up. And
while this certainly helps a few CPU benchmark cases, it also triggers in
real-world code and is thus a generally useful transformation to have in LLVM.
Function specialisation has great potential to increase compile-times and
code-size. The summary from some investigations with this patch is:
- Compile-time increases for short compile jobs are relatively high, but the
increase in absolute numbers is still low.
- For longer compile-jobs, the extra compile time is around 1%, and very much
in line with GCC.
- It is difficult to blame one thing for the compile-time increases: it looks
like a little more time is spent everywhere, processing more functions and
instructions.
- But the function specialisation pass itself is not very expensive; it doesn't
show up very high in the profile of the optimisation passes.
The goal of this work is to reach parity with GCC, which means that eventually
we would like to get this enabled by default. But first we would like to
address some of the limitations.
Differential Revision: https://reviews.llvm.org/D93838
Relaxing the superclass constraint for VSX register classes helps reduce
32-byte spills and copies when register pressure is high.
Some of the affected test cases show more copies due to the new allocation
order. However, this patch should not be the root cause, and we may be able
to fix that elsewhere in register allocation.
Reviewed By: nemanjai
Differential Revision: https://reviews.llvm.org/D104006
When using FP to access stack objects, the scalable stack objects will
be put at the lower end of the frame. It looks like
```
|-------------------| <-- FP
| callee-saved regs |
|-------------------|
| scalar local vars |
|-------------------|
| RVV local vars |
|-------------------| <-- SP
```
If there are scalar arguments that need to be passed through memory and there
are vector objects on the stack that are accessed through FP, the outgoing
scalar arguments will overwrite the vector objects. It looks like
```
|-------------------| <-- FP
| callee-saved regs |
|-------------------|
| scalar local vars |
|-------------------| |-------------------|
| RVV local vars | | outgoing args | <- outgoing arguments
|-------------------| <-- SP |-------------------| overwrite from here.
```
In this patch, we reserve the stack space for the outgoing arguments before
function calls if FP is used for access and there are scalable vector frame
objects. It looks like
```
|-------------------| <-- FP
| callee-saved regs |
|-------------------|
| scalar local vars |
|-------------------|
| RVV local vars |
|-------------------|
| outgoing args |
|-------------------| <-- SP
```
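A minimal sketch of the approach (hasRVVFrameObject is an assumed helper
name; treat the exact code as illustrative):
```
// Sketch only: stop reserving the call frame in the fixed stack frame
// when the frame is FP-addressed and contains scalable RVV objects, so
// that SP is adjusted around each call and the outgoing arguments get
// their own space below the RVV area instead of overwriting it.
bool RISCVFrameLowering::hasReservedCallFrame(const MachineFunction &MF) const {
  return !MF.getFrameInfo().hasVarSizedObjects() &&
         !(hasFP(MF) && hasRVVFrameObject(MF));
}
```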
Differential Revision: https://reviews.llvm.org/D103622
This fixes the concern in single-element store scalarization that the
alignment of the new store may be larger than it should be. It computes the
largest valid alignment if the index is constant, and a safe one if not.
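The computation is along these lines (a sketch with illustrative names; the
actual change lives in the store scalarization code):
```
#include "llvm/IR/Constants.h"
#include "llvm/IR/DataLayout.h"
#include "llvm/IR/Instructions.h"
#include "llvm/Support/Alignment.h"

using namespace llvm;

// Sketch only: alignment for the scalar store that replaces a
// single-element update of a vector in memory.
static Align scalarStoreAlign(StoreInst &SI, const DataLayout &DL,
                              Type *EltTy, const ConstantInt *Idx) {
  Align VecAlign = SI.getAlign();
  uint64_t EltSize = DL.getTypeStoreSize(EltTy);
  // Constant index: the byte offset is known exactly, so take the
  // largest alignment common to the vector store and that offset.
  if (Idx)
    return commonAlignment(VecAlign, Idx->getZExtValue() * EltSize);
  // Unknown index: only the per-element alignment is safe.
  return commonAlignment(VecAlign, EltSize);
}
```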
Reviewed By: lebedev.ri, spatel
Differential Revision: https://reviews.llvm.org/D103419