llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Tim Northover	7c89253a7a	Recommit: Support: add llvm::thread class that supports specifying stack size. This adds a new llvm::thread class with the same interface as std::thread except there is an extra constructor that allows us to set the new thread's stack size. On Darwin even the default size is boosted to 8MB to match the main thread. It also switches all users of the older C-style `llvm_execute_on_thread` API family over to `llvm::thread` followed by either a `detach` or `join` call and removes the old API. Moved definition of DefaultStackSize into the .cpp file to hopefully fix the build on some (GCC-6?) machines.	2021-07-08 16:22:26 +01:00
Tim Northover	1b885b1ce7	Revert "Support: add llvm::thread class that supports specifying stack size." It's causing build failures because DefaultStackSize isn't defined everywhere it should be and I need time to investigate.	2021-07-08 14:59:47 +01:00
Tim Northover	43bfac999c	Support: add llvm::thread class that supports specifying stack size. This adds a new llvm::thread class with the same interface as std::thread except there is an extra constructor that allows us to set the new thread's stack size. On Darwin even the default size is boosted to 8MB to match the main thread. It also switches all users of the older C-style `llvm_execute_on_thread` API family over to `llvm::thread` followed by either a `detach` or `join` call and removes the old API.	2021-07-08 14:51:53 +01:00
xndcn	4bda00e90e	[NFC] Mark Expected<T>::assertIsChecked() as const Some const methods of Expected<T> invoke assertIsChecked(), so we should mark it as const too. Differential Revision: https://reviews.llvm.org/D105292	2021-07-08 21:30:23 +08:00
Arthur Eubanks	b3ffc2a93b	[OpaquePtr] Remove checking pointee type for byval/preallocated type These currently always require a type parameter. The bitcode reader already upgrades old bitcode without the type parameter to use the pointee type. In cases where the caller does not have byval but the callee does, we need to follow CallBase::paramHasAttr() and also look at the callee for the byval type so that CallBase::isByValArgument() and CallBase::getParamByValType() are in sync. Do the same for preallocated. While we're here add a corresponding version for inalloca since we'll need it soon. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104663	2021-07-07 14:28:55 -07:00
Tom Stellard	e3b57c8a21	unittests: Fix library dependency name Corrects the library name of LLVMAsmParser added in 2e4ec3e5d6a3bf7f61bea6898286cba64be7b764.	2021-07-07 06:56:15 -07:00
Tom Stellard	d0248bfa5e	unittests: Fix build with LLVM_LINK_LLVM_DYLIB=ON The build system was linking the PluginsTests unittest against libLLVM.so and LLVMAsmParser which was causing the test to fail with this error: LLVM ERROR: inconsistency in registered CommandLine options We need to add llvm libraries to LLVM_LINK_COMPONENTS so that they are dropped from the linker arguments when linking with LLVM_LINK_LLVM_DYLIB=ON Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D105523	2021-07-07 06:50:45 -07:00
Jacob Hegna	721423a975	Unpack the CostEstimate feature in ML inlining models. This change yields an additional 2% size reduction on an internal search binary, and an additional 0.5% size reduction on fuchsia. Differential Revision: https://reviews.llvm.org/D104751	2021-07-02 16:57:16 +00:00
Scott Linder	62d5b36f28	[ADT] Follow up to fix bug in "Add makeVisitor to STLExtras.h" Address mistakenly comparing the pointer values of two C-style strings rather than comparing their contents in the unit tests for makeVisitor, added in 6d6f35eb7b92c6dd4478834497752f4e963db16d	2021-07-01 18:24:49 +00:00
Hussain Kadhem	7eddb43fa0	[VP] Implementation of intrinsic and SDNode definitions for VP load, store, gather, scatter. This patch adds intrinsic definitions and SDNodes for predicated load/store/gather/scatter, based on the work done in D57504. Reviewed By: simoll, craig.topper Differential Revision: https://reviews.llvm.org/D99355	2021-07-01 13:34:44 +02:00
Lang Hames	6567b76038	[ORC] Add wrapper-function support methods to ExecutorProcessControl. Adds support for both synchronous and asynchronous calls to wrapper functions using SPS (Simple Packed Serialization). Also adds support for wrapping functions on the JIT side in SPS-based wrappers that can be called from the executor. These new methods simplify calls between the JIT and Executor, and will be used in upcoming ORC runtime patches to enable communication between ORC and the runtime.	2021-07-01 18:21:49 +10:00
Matt Arsenault	d665475981	GlobalISel: Use LLT in memory legality queries This enables proper lowering of non-byte sized loads. We still aren't faithfully preserving memory types everywhere, so the legality checks still only consider the size.	2021-06-30 17:44:13 -04:00
Matt Arsenault	cc12b285b6	CodeGen: Print/parse LLTs in MachineMemOperands This will currently accept the old number of bytes syntax, and convert it to a scalar. This should be removed in the near future (I think I converted all of the tests already, but likely missed a few). Not sure what the exact syntax and policy should be. We can continue printing the number of bytes for non-generic instructions to avoid test churn and only allow non-scalar types for generic instructions. This will currently print the LLT in parentheses, but accept parsing the existing integers and implicitly converting to scalar. The parentheses are a bit ugly, but the parser logic seems unable to deal without either parentheses or some keyword to indicate the start of a type.	2021-06-30 16:54:13 -04:00
Michael Liao	e142d83be3	Fix shared build.	2021-06-30 14:04:16 -04:00
Tomas Matheson	fcb829d489	[NPM] Resolve llvmGetPassPluginInfo to the plugin being loaded Dynamically loaded plugins for the new pass manager are initialised by calling llvmGetPassPluginInfo. This is defined as a weak symbol so that it is continually redefined by each plugin that is loaded. When loading a plugin from a shared library, the intention is that llvmGetPassPluginInfo will be resolved to the definition in the most recent plugin. However, using a global search for this resolution can fail in situations where multiple plugins are loaded. Currently: * If a plugin does not define llvmGetPassPluginInfo, then it will be silently resolved to the previous plugin's definition. * If loading the same plugin twice with another in between, e.g. plugin A/plugin B/plugin A, then the second load of plugin A will resolve to llvmGetPassPluginInfo in plugin B. * The previous case can also occur when a dynamic library defines both NPM and legacy plugins; the legacy plugins are loaded first and then with `-fplugin=A -fpass-plugin=B -fpass-plugin=A`: A will be loaded as a legacy plugin and define llvmGetPassPluginInfo; B will be loaded and redefine it; and finally when A is loaded as an NPM plugin it will be resolved to the definition from B. Instead of searching globally, restrict the symbol lookup to the library that is currently being loaded. Differential Revision: https://reviews.llvm.org/D104916	2021-06-30 18:11:28 +01:00
Greg Clayton	35965d72e0	Fix a case where multiple symbols with zero size would cause duplicate entries in gsym files. Symbol tables can have symbols with no size in mach-o files that were failing to get combined into a single entry. This resulted in many duplicate entries for the same address and made gsym files larger. Differential Revision: https://reviews.llvm.org/D105068	2021-06-28 18:26:26 -07:00
Scott Linder	93375cf94f	[ADT] Add makeVisitor to STLExtras.h Relands patch reverted by 61242c0addb120294211d24a97ed89837418cb36 The original patch mistakenly included unrelated tests. Adds a utility to combine multiple Callables into a single Callable. This is useful to make constructing a visitor for `std::visit`-like functions more natural; functions like this will be added in future patches. Intended to supersede https://reviews.llvm.org/D99560 by perfectly-forwarding the combined Callables. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D100670	2021-06-28 20:07:11 +00:00
Scott Linder	28e61dd3f9	Revert "[ADT] Add makeVisitor to STLExtras.h" This reverts commit 14a8aa615597ef0aa424ac9545906bf8b9865063. Mistakenly landed this before a patch it should depend on was accepted.	2021-06-28 19:51:25 +00:00
Scott Linder	a106319b70	[ADT] Add makeVisitor to STLExtras.h Adds a utility to combine multiple Callables into a single Callable. This is useful to make constructing a visitor for `std::visit`-like functions more natural; functions like this will be added in future patches. Intended to supersede https://reviews.llvm.org/D99560 by perfectly-forwarding the combined Callables. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D100670	2021-06-28 19:35:42 +00:00
Zarko Todorovski	a96f500c09	[AIX] Use less than or equal to for some alignment tests on AIX On AIX the alignment implementation has the storage aligned to the preferred alignment instead of the alignment of a type. Macro guard these tests for AIX and have them pass when the "reference alignment" is less than or equal to the alignment observed. In other words, the alignment applied is at least as strict as the required alignment. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D104786	2021-06-28 10:32:32 -04:00
Sander de Smalen	a82143ea32	Reland [GlobalISel] NFC: Have LLT::getSizeInBits/Bytes return a TypeSize. This patch relands https://reviews.llvm.org/D104454, but fixes some failing builds on Mac OS which apparently has a different definition for size_t, that caused 'ambiguous operator overload' for the implicit conversion of TypeSize to a scalar value. This reverts commit b732e6c9a8438e5204ac96c8ca76f9b11abf98ff.	2021-06-28 15:24:27 +01:00
Sander de Smalen	4d07cbe876	Revert "[GlobalISel] NFC: Have LLT::getSizeInBits/Bytes return a TypeSize." This patch seems to be causing build errors, reverting it for now. This reverts commit aeab9d9570ac8cb554aff6e1af24a471fdf5b4e5.	2021-06-25 17:37:16 +01:00
Sander de Smalen	9d34fb6e49	[GlobalISel] NFC: Have LLT::getSizeInBits/Bytes return a TypeSize. To reflect that the size may be scalable, a TypeSize is returned instead of an unsigned. In places where the result is used, it currently relies on an implicit cast of TypeSize -> uint64_t, which asserts that the type is not scalable. This patch is NFC for fixed-width vectors. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D104454	2021-06-25 17:06:50 +01:00
Sander de Smalen	5eab663b62	[GlobalISel] NFC: Change LLT::changeNumElements to LLT::changeElementCount. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D104453	2021-06-25 15:54:00 +01:00
Sander de Smalen	88c55d538f	[GlobalISel] NFC: Change LLT::scalarOrVector to take ElementCount. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D104452	2021-06-25 11:26:16 +01:00
Fangrui Song	47a9b3b42d	[OptTable] Rename PrintHelp to printHelp To be consistent with other member functions and match the coding standard.	2021-06-24 14:47:03 -07:00
Martin Storsjö	9d14adb9f6	[llvm] Rename StringRef _lower() method calls to _insensitive() This is a mechanical change. This actually also renames the similarly named methods in the SmallString class, however these methods don't seem to be used outside of the llvm subproject, so this doesn't break building of the rest of the monorepo.	2021-06-25 00:22:01 +03:00
Martin Storsjö	24c3cf43d7	[ADT] Rename StringRef case insensitive methods for clarity Rename functions with the `xx_lower()` names to `xx_insensitive()`. This was requested during the review of D104218. Test names and variables in llvm/unittests/ADT/StringRefTest.cpp that refer to "lower" are renamed to "insensitive" correspondingly. Unused function aliases with the former method names are left in place (without any deprecation attributes) for transition purposes. All references within the monorepo will be changed (with essentially mechanical changes), and then the old names will be removed in a later commit. Also remove the superfluous method names at the start of doxygen comments, for the methods that are touched here. (There are more occurrences of this left in other methods though.) Also remove duplicate doxygen comments from the implementation file. Differential Revision: https://reviews.llvm.org/D104819	2021-06-25 00:22:00 +03:00
Anirudh Prasad	6fc759537e	[AsmParser][SystemZ][z/OS] Support for emitting labels in upper case - Currently, the emitting of labels in the parsePrimaryExpr function preserves case. It just takes the identifier and emits it. - However, for HLASM, labels are case insensitive. We emit them in upper case only, to enforce case insensitivity. So we need to ensure that at the time of parsing the label we emit it in upper case (in `parseAsHLASMLabel`), and also that when we are processing a PC-relative relocatable expression we emit it in upper case (in `parsePrimaryExpr`). - To achieve this, a new MCAsmInfo attribute has been introduced which corresponding targets can override if needed. Reviewed By: abhina.sreeskantharajan, uweigand Differential Revision: https://reviews.llvm.org/D104715	2021-06-24 12:50:11 -04:00
Sander de Smalen	ac11cfc716	[GlobalISel] NFC: Change LLT::vector to take ElementCount. This also adds new interfaces for the fixed- and scalable case: * LLT::fixed_vector * LLT::scalable_vector The strategy for migrating to the new interfaces was as follows: * If the new LLT is a (modified) clone of another LLT, taking the same number of elements, then use LLT::vector(OtherTy.getElementCount()) or if the number of elements is halved/doubled, it uses .divideCoefficientBy(2) or operator*. That is because there is no reason to specifically restrict the types to 'fixed_vector'. * If the algorithm works on the number of elements (as unsigned), then just use fixed_vector. This will need to be fixed up in the future when modifying the algorithm to also work for scalable vectors, and will then need additional tests to confirm the behaviour works the same for scalable vectors. * If the test used the `/*Scalable=*/true` flag of LLT::vector, then this is replaced by LLT::scalable_vector. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D104451	2021-06-24 11:26:12 +01:00
Kuter Dinel	16d688b628	[Attributor] Derive AAFunctionReachability attribute. This attribute uses Attributor's internal 'optimistic' call graph information to answer queries about function call reachability. Functions can become reachable over time as new call edges are discovered. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D104599	2021-06-23 20:43:10 +03:00
Joseph Huber	76b50aa3c4	[OpenMP] Add new OpenMP globalization functions to library info Summary: The changes to globalization introduced in D97680 created two new functions to push / pop shareable memory on the GPU, __kmpc_alloc_shared and __kmpc_free_shared. This patch adds these new runtime functions to the library info so they can be used by the HeapToStack attributor interface. This optimization replaces malloc / free pairs with stack memory if legal. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D102087	2021-06-22 13:23:05 -04:00
Martin Storsjö	a307928fe5	[ADT] Add StringRef consume_front_lower and consume_back_lower These serve as a convenient combination of consume_front/back and startswith_lower/endswith_lower, consistent with other existing case insensitive methods named <operation>_lower. Differential Revision: https://reviews.llvm.org/D104218	2021-06-22 12:38:08 +03:00
Sander de Smalen	fd053d5ffe	[GlobalISel] Add scalable property to LLT types. This patch aims to add the scalable property to LLT. The rest of the patch-series changes the interfaces to take/return ElementCount and TypeSize, which both have the ability to represent the scalable property. The changes are mostly mechanical and aim to be non-functional changes for fixed-width vectors. For scalable vectors some unit tests have been added, but no effort has been put into making any of the GlobalISel algorithms work with scalable vectors yet. That will be left as future work. The work is split into a series of 5 patches to make reviews easier. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D104450	2021-06-22 08:43:34 +01:00
Fraser Cormack	18c509d4ea	[VP][NFCI] Address various clang-tidy warnings Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D104288	2021-06-21 10:57:42 +01:00
Jim Lin	191d405aea	[IVDescriptors] Fix comment that getUnsafeAlgebraInst has been renamed to getExactFPMathInst https://reviews.llvm.org/rG36a489d194750dc888f214240e9dec9122ca1f0e renamed the function call in the test from getUnsafeAlgebraInst to getExactFPMathInst. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D104441	2021-06-21 07:56:22 +08:00
Michael Liao	d21f701c76	[MIRPrinter] Add machine metadata support. - Distinct metadata needs generating in the codegen to attach correct AAInfo on the loads/stores after lowering, merging, and other relevant transformations. - This patch adds 'MachineModuleSlotTracker' to help assign slot numbers to these newly generated unnamed metadata nodes. - To help 'MachineModuleSlotTracker' track machine metadata, the original 'SlotTracker' is rebased from 'AbstractSlotTrackerStorage', which provides basic interfaces to create/retrieve metadata slots. In addition, once LLVM IR is processed, additional hooks are also introduced to help collect machine metadata and assign them slot numbers. - Finally, if there is any such machine metadata, 'MIRPrinter' outputs an additional 'machineMetadataNodes' field containing all the definitions of those nodes. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D103205	2021-06-19 12:48:08 -04:00
Eli Friedman	e4884552df	[ScalarEvolution] Fix pointer/int type handling converting select/phi to min/max. The old version of this code would blindly perform arithmetic without paying attention to whether the types involved were pointers or integers. This could lead to weird expressions like negating a pointer. Explicitly handle simple cases involving pointers, like "x < y ? x : y". In all other cases, coerce the operands of the comparison to integer types. This avoids the weird cases, while handling most of the interesting cases. Differential Revision: https://reviews.llvm.org/D103660	2021-06-17 14:05:12 -07:00
Tomasz Miąsko	a8b1d6117d	[Demangle] Support Rust v0 mangling scheme in llvm::demangle The llvm::demangle is currently used by llvm-objdump and llvm-readobj, so this effectively adds support for Rust v0 mangling to those applications. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D104340	2021-06-17 10:37:26 +02:00
Bjorn Pettersson	29ffba4b56	Update @llvm.powi to handle different int sizes for the exponent This can be seen as a follow up to commit 0ee439b705e82a4fe20e2, that changed the second argument of __powidf2, __powisf2 and __powitf2 in compiler-rt from si_int to int. That was to align with how those runtimes are defined in libgcc. One thing that seems to have been missing in that patch was to make sure that the rest of LLVM also handles that the argument now depends on the size of int (not using the si_int machine mode for 32-bit). When using __builtin_powi for a target with 16-bit int, clang crashed. And when emitting libcalls to those rtlib functions (typically when lowering @llvm.powi), the backend would always prepare the exponent argument as an i32, which caused miscompiles when the rtlib was compiled with 16-bit int. The solution used here is to use an overloaded type for the second argument in @llvm.powi. This way clang can use the "correct" type when lowering __builtin_powi, and then later when emitting the libcall it is assumed that the type used in @llvm.powi matches the rtlib function. One thing that needed some extra attention was that when vectorizing calls, several passes did not support that several arguments could be overloaded in the intrinsics. This patch allows overload of a scalar operand by adding hasVectorInstrinsicOverloadedScalarOpd, with an entry for powi. Differential Revision: https://reviews.llvm.org/D99439	2021-06-17 09:38:28 +02:00
Sanjay Patel	03cd031cb2	[ValueTracking] add FP intrinsics to test for propagatesPoison; NFC I'm not sure what behavior we want if the FP environment is not default (also not sure if there's a way to enumerate the full list of intrinsics programmatically), but currently these are all defaulting to 'false' (doesn't propagate).	2021-06-16 13:43:03 -04:00
Sanjay Patel	be210d2cc2	[ValueTracking] add tests for propagatesPoison with FP ops; NFC Verify that this matches the behavior in InstSimplify: D104383 / ce95200b7942 We still need to add code/tests for FP intrinsics.	2021-06-16 12:14:28 -04:00
Lang Hames	8b99b0a62e	[ORC] Fix missing function in unit test.	2021-06-15 21:39:00 +10:00
Lang Hames	e11b1aca83	[ORC] Port WrapperFunctionUtils and SimplePackedSerialization from ORC runtime. Replace the existing WrapperFunctionResult type in llvm/include/ExecutionEngine/Orc/Shared/TargetProcessControlTypes.h with a version adapted from the ORC runtime's implementation. Also introduce the SimplePackedSerialization scheme (also adapted from the ORC runtime's implementation) for wrapper functions to avoid manual serialization and deserialization for calls to runtime functions involving common types.	2021-06-15 21:13:57 +10:00
CarlosAlbertoEnciso	8355a030c0	[Debug-Info][CodeView] Fix GUID string generation for MSVC generated objects. This patch is to address https://bugs.llvm.org/show_bug.cgi?id=50459. YAML:455:28: error: GUID strings are 38 characters long The valid format for a GUID is {XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX} where X is a hex digit (0,1,2,3,4,5,6,7,8,9,A,B,C,D,E,F). The length of the individual components must be: 8, 4, 4, 4, 12. For some cases, the converted string generated by obj2yaml, does not comply with those lengths. yaml2obj checks that the GUID string must be 38 characters including the dashes and braces. Reviewed By: amccarth Differential Revision: https://reviews.llvm.org/D103089	2021-06-15 06:53:21 +01:00
zhijian	ad7e1ecf68	[AIX][XCOFF] emit vector info of traceback table. Summary: emit vector info of traceback table. Reviewers: Jason Liu,Hubert Tong Differential Revision: https://reviews.llvm.org/D93659	2021-06-14 11:15:22 -04:00
Simon Moll	91d4645488	[VP] Binary floating-point intrinsics. This patch implements vector-predicated intrinsics on IR level for fadd, fsub, fmul, fdiv and frem. These operate in the default floating-point environment. We will use constrained fp operand bundles for constrained vector-predicated fp math (D93455). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93470	2021-06-14 08:51:41 +02:00
RamNalamothu	a2306da6e0	Implement DW_CFA_LLVM_* for Heterogeneous Debugging Add support in MC/MIR for writing/parsing, and DebugInfo. This is part of the Extensions for Heterogeneous Debugging defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html Specifically the CFI instructions implemented here are defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html#cfa-definition-instructions Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D76877	2021-06-14 08:51:50 +05:30
Simon Pilgrim	e36a7e7583	StringExtrasTest.cpp - add missing newline at the end of file. NFCI.	2021-06-11 14:32:35 +01:00
Simon Pilgrim	165132af1b	[ADT] Remove APInt/APSInt toString() std::string variants <string> is currently the highest impact header in a clang+llvm build: https://commondatastorage.googleapis.com/chromium-browser-clang/llvm-include-analysis.html One of the most common places this is being included is the APInt.h header, which needs it for an old toString() implementation that returns std::string - an inefficient method compared to the SmallString versions that it actually wraps. This patch replaces these APInt/APSInt methods with a pair of llvm::toString() helpers inside StringExtras.h, adjusts users accordingly and removes the <string> from APInt.h - I was hoping that more of these users could be converted to use the SmallString methods, but it appears that most end up creating a std::string anyhow. I avoided trying to use the raw_ostream << operators as well as I didn't want to lose having the integer radix explicit in the code. Differential Revision: https://reviews.llvm.org/D103888	2021-06-11 13:19:15 +01:00
Simon Pilgrim	942f6eb554	[ADT] Consistently use StringExtrasTest for the test suite filter. NFCI. Noticed while updating D103888 - some of the tests were using "StringExtras" for the test_suite_name instead of the expected "StringExtrasTest"	2021-06-11 12:00:54 +01:00
Guillaume Chatelet	434526729b	[llvm] Make Sequence reverse-iterable This is a roll forward of D102679. This patch simplifies the implementation of Sequence and makes it compatible with llvm::reverse. It exposes the reverse iterators through rbegin/rend which prevents a dangling reference in std::reverse_iterator::operator++(). Note: Compared to D102679, this patch introduces an `asSmallVector()` member function and fixes a compilation issue with GCC 5. Differential Revision: https://reviews.llvm.org/D103948	2021-06-10 11:15:28 +00:00
Florian Hahn	f91e95602e	[SCEV] Keep common NUW flags when inlining Add operands. Currently, NoWrapFlags are dropped if we inline operands of SCEVAddExpr operands. As a consequence, we always drop flags when building expressions like `getAddExpr(A, getAddExpr(B, C, NUW), NUW)`. We should be able to retain NUW flags common among all inlined SCEVAddExpr and the original flags. Reviewed By: nikic, mkazantsev Differential Revision: https://reviews.llvm.org/D103877	2021-06-09 17:13:21 +01:00
Sterling Augustine	59049d1a79	Add Twine support for std::string_view. With Twine now ubiquitous after rG92a79dbe91413f685ab19295fc7a6297dbd6c824, it needs support for string_view when building clang with newer C++ standards. This is similar to how StringRef is handled. Differential Revision: https://reviews.llvm.org/D103935	2021-06-08 20:19:04 -07:00
Matt Arsenault	fca6ba66d2	GlobalISel: Avoid use of G_INSERT in insertParts G_INSERT legalization is incomplete and doesn't work very well. Instead try to use sequences of G_MERGE_VALUES/G_UNMERGE_VALUES padding with undef values (although this can get pretty large). For the case of load/store narrowing, this is still performing the load/stores in irregularly sized pieces. It might be cleaner to split this down into equal sized pieces, and rely on load/store merging to optimize it.	2021-06-08 14:44:24 -04:00
Mehdi Amini	ff780e6f68	Revert "[llvm] Make Sequence reverse-iterable" This reverts commit e772216e708937988c039420d2c559568f91ae27 (and fixup 7f6c878a2c035eb6325ab228d9bc2d257509d959). The build is broken with gcc5 host compiler: In file included from from mlir/lib/Dialect/Utils/StructuredOpsUtils.cpp:9: tools/mlir/include/mlir/IR/BuiltinAttributes.h.inc:424:57: error: type/value mismatch at argument 1 in template parameter list for 'template<class ItTy, class FuncTy, class FuncReturnTy> class llvm::mapped_iterator' std::function<T(ptrdiff_t)>>; ^ tools/mlir/include/mlir/IR/BuiltinAttributes.h.inc:424:57: note: expected a type, got 'decltype (seq<ptrdiff_t>(0, 0))::const_iterator'	2021-06-08 17:03:10 +00:00
Guillaume Chatelet	83dd05c1f3	[llvm] Make Sequence reverse-iterable This patch simplifies the implementation of Sequence and makes it compatible with llvm::reverse. It exposes the reverse iterators through rbegin/rend which prevents a dangling reference in std::reverse_iterator::operator++(). Differential Revision: https://reviews.llvm.org/D102679	2021-06-08 13:18:57 +00:00
Simon Moll	64d5c9acc6	[VP] getDeclarationForParams `VPIntrinsic::getDeclarationForParams` creates a vp intrinsic declaration for parameters you want to call it with. This is in preparation of a new builder class that makes emitting vp intrinsic code nearly as convenient as using a plain ir builder (aka `VectorBuilder`, to be used by D99750). Reviewed By: frasercrmck, craig.topper, vkmr Differential Revision: https://reviews.llvm.org/D102686	2021-06-08 14:21:28 +02:00
Kerry McLaughlin	d259f6577a	[CostModel] Return an invalid cost for memory ops with unsupported types Fixes getTypeConversion to return `TypeScalarizeScalableVector` when a scalable vector type cannot be legalized by widening/splitting. When this is the method of legalization found, getTypeLegalizationCost will return an Invalid cost. The getMemoryOpCost, getMaskedMemoryOpCost & getGatherScatterOpCost functions already call getTypeLegalizationCost and will now also return an Invalid cost for unsupported types. Reviewed By: sdesmalen, david-arm Differential Revision: https://reviews.llvm.org/D102515	2021-06-08 12:07:36 +01:00
Simon Pilgrim	27b7e8e2e7	ValueTrackingTest.cpp - Pass DataLayout by reference. NFCI.	2021-06-08 10:41:01 +01:00
Amir Ayupov	af66a21d42	[ELF] getRelocatedSection: remove the check for ET_REL object file getRelocatedSection interface should not check that the object file is relocatable, as executable files may have relocations preserved with `--emit-relocs` linker flag. The relocations are useful in context of post-link binary analysis for function reference identification. For example, BOLT relies on relocations to perform function reordering. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D102296	2021-06-07 13:17:00 -07:00
jasonliu	c03aed5d1b	[XCOFF][AIX] Enable tooling support for 64 bit symbol table parsing Add in the ability of parsing symbol table for 64 bit object. Reviewed By: jhenderson, DiggerLin Differential Revision: https://reviews.llvm.org/D85774	2021-06-07 17:24:13 +00:00
Nikita Popov	7503a24bcc	[LexicalScopesTest] Add missing IRBuilder.h include (NFC) This currently depends on a transitive include via TargetLowering.h.	2021-06-06 16:29:50 +02:00
Aaron Puchert	5e515eb427	Make TableGenGlobalISel an object library That's how it was originally intended but that wasn't possible because we still needed to support older CMake versions. The problem here is that the sources in TableGenGlobalISel are meant to be linked into both llvm-tblgen and TableGenTests (a unit test), but not be part of LLVM proper. So they shouldn't be an ordinary LLVM component. Because they are used in llvm-tblgen, they can't draw in the LLVM dylib dependency, but then we'd have to do the same thing in TableGenTests to make sure we don't link both a static Support library and another copy through the LLVM dylib. With an object library we're just reusing the object files and don't have to care about dependencies at all. Differential Revision: https://reviews.llvm.org/D74588	2021-06-05 15:04:33 +02:00
Scott Linder	ec59d09ea4	[ADT] Refactor enumerate unit tests Preparation for landing the tests for llvm::makeVisitor, including breaking out a "Counted" base class and explicitly testing the prvalue case as distinct from the rvalue case. Differential Revision: https://reviews.llvm.org/D103206	2021-06-04 22:29:13 +00:00
Rong Xu	559805b594	[SampleFDO] New hierarchical discriminator for FS SampleFDO (llvm-profdata part) This patch was split from https://reviews.llvm.org/D102246 [SampleFDO] New hierarchical discriminator for Flow Sensitive SampleFDO This is for llvm-profdata part of change. It sets the bit masks for the profile reader in llvm-profdata. Also add an internal option "-fs-discriminator-pass" for show and merge command to process the profile offline. This patch also moved setDiscriminatorMaskedBitFrom() to SampleProfileReader::create() to simplify the interface. Differential Revision: https://reviews.llvm.org/D103550	2021-06-04 11:22:06 -07:00
Alexey Bataev	fe6e3a2893	[OPENMP]Fix PR50129: omp cancel parallel not working as expected. Need to emit a call for __kmpc_cancel_barrier in the exit block for __kmpc_cancel function call if cancellation of the parallel block is requested. Differential Revision: https://reviews.llvm.org/D103646	2021-06-04 08:24:55 -07:00
Brendon Cahoon	f5ff020d9c	[GlobalISel] Add G_SBFX/G_UBFX to computeKnownBits Differential Revision: https://reviews.llvm.org/D102969	2021-06-03 16:01:47 -04:00
Nikita Popov	03374e8fbc	[Tests] Add missing include (NFC) Fix one more missing include in a unit test after 983565a6fe4a9f40c7caf82b65c650c20dbcc104.	2021-06-03 18:58:29 +02:00
David Spickett	b2313e8eb4	[clang][ARM] Remove arm2/3/6/7m CPU names These legacy CPUs are known to clang but not llvm. Their use was ignored by llvm and it would print a warning saying it did not recognise them. However because some of them are default CPUs for their architecture, you would get those warnings even if you didn't choose a cpu explicitly. (now those architectures will default to a "generic" CPU) Information is thin on the ground for these older chips so this is the best I could find: https://en.wikichip.org/wiki/acorn/microarchitectures/arm2 https://en.wikichip.org/wiki/acorn/microarchitectures/arm3 https://en.wikichip.org/wiki/arm_holdings/microarchitectures/arm6 https://en.wikichip.org/wiki/arm_holdings/microarchitectures/arm7 Final part of fixing https://bugs.llvm.org/show_bug.cgi?id=50454. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D103028	2021-06-03 08:55:44 +00:00
Rong Xu	f505b894a2	[SampleFDO] New hierarchical discriminator for FS SampleFDO (ProfileData part) This patch was split from https://reviews.llvm.org/D102246 [SampleFDO] New hierarchical discriminator for Flow Sensitive SampleFDO This is mainly for ProfileData part of change. It will load FS Profile when such profile is detected. For an extbinary format profile, create_llvm_prof tool will add a flag to profile summary section. For other format profiles, the users need to use an internal option (-profile-isfs) to tell the compiler that the profile uses FS discriminators. This patch also simplified the bit API used by FS discriminators. Differential Revision: https://reviews.llvm.org/D103041	2021-06-02 10:32:52 -07:00
Arthur Eubanks	2a26a5c713	[OpaquePtr] Create API to make a copy of a PointerType with some address space Some existing places use getPointerElementType() to create a copy of a pointer type with some new address space. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D103429	2021-06-01 16:52:32 -07:00
Daniel Sanders	71a22fb7f8	[globalisel][legalizer] Separate the deprecated LegalizerInfo from the current one It's still in use in a few places so we can't delete it yet but there's not many at this point. Differential Revision: https://reviews.llvm.org/D103352	2021-06-01 13:23:48 -07:00
Craig Topper	22fc6f8fbe	[VP] Make getMaskParamPos/getVectorLengthParamPos return unsigned. Lowercase function names. Parameter positions seem like they should be unsigned. While there, make function names lowercase per coding standards. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D103224	2021-05-28 11:28:47 -07:00
Mats Petersson	caa14ae743	[OpenMP]Add support for workshare loop modifier in lowering When lowering the dynamic, guided, auto and runtime types of scheduling, there is an optional monotonic or non-monotonic modifier. This patch adds support in the OMP IR Builder to pass this down to the runtime functions. Also implements tests for the variants. Differential Revision: https://reviews.llvm.org/D102008	2021-05-27 15:33:05 +01:00
Mats Petersson	ffafbe5131	Revert "[OpenMP]Add support for workshare loop modifier in lowering" This reverts commit ea4c5fb04c6d9618d451fb2d2c360dc95c6d9131.	2021-05-27 13:09:47 +01:00
Mats Petersson	ae07366301	[OpenMP]Add support for workshare loop modifier in lowering When lowering the dynamic, guided, auto and runtime types of scheduling, there is an optional monotonic or non-monotonic modifier. This patch adds support in the OMP IR Builder to pass this down to the runtime functions. Also implements tests for the variants. Differential Revision: https://reviews.llvm.org/D102008	2021-05-27 12:28:27 +01:00
Jessica Paquette	044ed9b7c9	Fix unit test after 324af79dbc6066 Needed to add in an extra parameter to calls to `libcall`.	2021-05-26 17:50:53 -07:00
Philip Reames	88bae72814	[unroll] Use value domain for symbolic execution based cost model The current full unroll cost model does a symbolic evaluation of the loop up to a fixed limit. That symbolic evaluation currently simplifies to constants, but we can generalize to arbitrary Values using the InstructionSimplify infrastructure at very low cost. By itself, this enables some simplifications, but it's mainly useful when combined with the branch simplification over in D102928. Differential Revision: https://reviews.llvm.org/D102934	2021-05-26 08:41:25 -07:00
Anirudh Prasad	f106aa368d	[SystemZ][z/OS] Validate symbol names for z/OS for printing without quotes - Currently, before printing a label in MCSymbol.cpp (MCSymbol::print), the current code "validates" the label that is to be printed. - If it fails the validation step, then it prints the label within double quotes. - However, the validation is provided as a virtual function in MCAsmInfo.h (i.e. isAcceptableChar() function). So we can override this for the AD_HLASM dialect in SystemZMCAsmInfo.cpp. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D103091	2021-05-26 10:37:09 -04:00
Tomas Matheson	79405b0d62	[MC][ELF] Emit unique sections for different flags Global values imply flags such as readable, writable, executable for the sections that they will be placed in. Currently MC places all such entries into the same section, using the first set of flags seen. This can lead to situations in LTO where a writable global is placed in the same named section as a readable global from another file, and the section may not be marked writable. D72194 ensures that mergeable globals with explicit sections are placed in separate sections with compatible entry size, by emitting the `unique` assembly syntax where appropriate. This change extends that approach to include section flags, so that globals with different section flags are emitted in separate unique sections. Differential revision: https://reviews.llvm.org/D100944	2021-05-26 11:51:29 +01:00
David Green	75476d7b62	[ARM] Add patterns for vmulh Now that vmulh can be selected, this adds the MVE patterns to make it legal and generate instructions. Differential Revision: https://reviews.llvm.org/D88011	2021-05-26 09:22:12 +01:00
Anirudh Prasad	f663aad8da	[SystemZ][z/OS] Implement getHostCPUName for z/OS - Currently, the host cpu information is not easily available on z/OS as in other platforms. - This information is stored in the Communications Vector Table (https://www.ibm.com/docs/en/zos/2.2.0?topic=information-cvt-mapping) Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D102793	2021-05-25 11:18:12 -04:00
Roman Lebedev	5d534d8259	[llvm-exegesis] Loop unrolling for loop snippet repetitor mode I really needed this, like, factually, yesterday, when verifying dependency breaking idioms for AMD Zen 3 scheduler model. Consider the following example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=duplicate Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-4a7e50.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 0.31025, per_snippet_value: 0.31025 } error: '' info: '' assembled_snippet: C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C3 ... ``` What does it tell us? So wait, it can only execute ~3 x86 AVX YMM PXOR zero-idioms per cycle? That doesn't seem right. That's even less than there are pipes supporting this type of op. Now, second example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=loop Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-2418b5.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 1.00011, per_snippet_value: 1.00011 } error: '' info: '' assembled_snippet: 49B80800000000000000C5FDEFC0C5FDEFC04983C0FF75F2C3 ... ``` Now that's just worse. Due to the looping, the throughput completely plummeted, and now we can only do a single instruction/cycle!? That's not great. 
And final example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=loop --loop-body-size=1000 Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-c402e2.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 0.167087, per_snippet_value: 0.167087 } error: '' info: '' assembled_snippet: 49B80800000000000000C5FDEFC0C5FDEFC04983C0FF75F2C3 ... ``` So if we merge the previous two approaches, i.e. duplicate this single-instruction snippet 1000x (loop-body-size/instruction count in snippet), and run a loop with 1000 iterations over that duplicated/unrolled snippet, the measured throughput goes through the roof, up to 5.9 instructions/cycle, which finally tells us that this idiom is zero-cycle! Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D102522	2021-05-25 12:08:27 +03:00
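The instructions-per-cycle figures quoted in the llvm-exegesis commit message above are the reciprocals of the reported `inverse_throughput` values. The sketch below redoes that arithmetic in Python. The helper names are hypothetical, and the relation between `--num-repetitions`, `--loop-body-size` and the loop iteration count is only a reading of the commit message, not confirmed llvm-exegesis behaviour.

```python
# Inverse throughput (cycles per instruction) as reported in the commit message.
MEASUREMENTS = {
    "duplicate": 0.31025,
    "loop": 1.00011,
    "loop with --loop-body-size=1000": 0.167087,
}


def instructions_per_cycle(inverse_throughput: float) -> float:
    """Convert cycles-per-instruction into instructions-per-cycle."""
    return 1.0 / inverse_throughput


def unrolled_loop_shape(num_repetitions: int, loop_body_size: int,
                        snippet_instructions: int = 1) -> tuple[int, int]:
    """Return (snippet copies per loop body, loop iterations).

    Assumption taken from the commit message: the snippet is duplicated
    loop_body_size / snippet_instructions times, and the loop then runs
    num_repetitions / loop_body_size times.
    """
    copies = loop_body_size // snippet_instructions
    iterations = num_repetitions // loop_body_size
    return copies, iterations


if __name__ == "__main__":
    for mode, value in MEASUREMENTS.items():
        print(f"{mode}: {instructions_per_cycle(value):.2f} instructions/cycle")
    print(unrolled_loop_shape(1_000_000, 1000))
```

The three modes come out at roughly 3.2, 1.0 and 6.0 instructions per cycle, which matches the "~3", "a single instruction/cycle" and "up to 5.9" statements in the message.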
David Spickett	a4a13012a7	[clang][ARM] Remove non-existent arm9312 CPU I cannot find documentation on this CPU, and it is not supported by the Arm Compiler 5 product either. It was likely a mistake or a different name for the "ep9312", which is an Arm based Cirrus Logic chip. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D103024	2021-05-25 08:58:24 +00:00
David Spickett	523f0589d5	[llvm][ARM] Remove non-existent arm1176j-s CPU This was removed in https://reviews.llvm.org/D52594 for clang. The one test using it has been updated to use the mpcore CPU as the linked clang change does. This is part of fixing https://bugs.llvm.org/show_bug.cgi?id=50454. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D103022	2021-05-25 08:56:55 +00:00
David Spickett	e073151ba7	[clang][ARM] Remove non-existent arm1136jz-s CPU There is an ARM1136JF-S and an ARM1136J-S but I could find no references to an ARM1136JZ-S in CPU manuals or the manual for Arm Compiler 5. See: https://developer.arm.com/documentation/ddi0211/latest/ https://developer.arm.com/documentation/dui0472/latest/ Using this CPU you get: $ ./bin/clang --target=arm-linux-gnueabihf -march=armv3m -mcpu=arm1136jz-s -c /tmp/test.c -o /tmp/test.o 'arm1136jz-s' is not a recognized processor for this target (ignoring processor) Since the llvm target does not know what it is. This is part of fixing https://bugs.llvm.org/show_bug.cgi?id=50454. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D103019	2021-05-25 08:54:59 +00:00
Lang Hames	9ee7d4ffa3	[JITLink] Suppress expect-death test in release mode.	2021-05-24 22:57:10 -07:00
Lang Hames	5ce7249a28	[JITLink] Enable creation and management of mutable block content. This patch introduces new operations on jitlink::Blocks: setMutableContent, getMutableContent and getAlreadyMutableContent. The setMutableContent method will set the block content data and size members and flag the content as mutable. The getMutableContent method will return a mutable copy of the existing content value, auto-allocating and populating a new mutable copy if the existing content is marked immutable. The getAlreadyMutableContent method asserts that the existing content is already mutable and returns it. setMutableContent should be used when updating the block with totally new content backed by mutable memory. It can be used to change the size of the block. The argument value should not be shared with any other block. getMutableContent should be used when clients want to modify the existing content and are unsure whether it is mutable yet. getAlreadyMutableContent should be used when clients want to modify the existing content and know from context that it must already be mutable. These operations reduce copy-modify-update boilerplate and unnecessary copies introduced when clients couldn't be sure whether the existing content was mutable or not.	2021-05-24 22:09:36 -07:00
David Blaikie	8e3f8bcb4e	Add a range-based wrapper for std::unique(begin, end, binary_predicate)	2021-05-24 17:26:46 -07:00
Florian Hahn	db4bd974f6	[VPlan] Add mayReadOrWriteMemory & friends. This patch adds initial implementation of mayReadOrWriteMemory, mayReadFromMemory and mayWriteToMemory to VPRecipeBase. Used by D100258.	2021-05-24 13:11:32 +01:00
Fady Ghanim	bb0b21b662	[OpenMP][OMPIRBuilder]Adding support for `omp atomic` This patch adds support for generating `omp atomic` for all different atomic clauses	2021-05-23 17:44:09 -04:00
Philipp Krones	df7a8b162e	[MC] Refactor MCObjectFileInfo initialization and allow targets to create MCObjectFileInfo This makes it possible for targets to define their own MCObjectFileInfo. This MCObjectFileInfo is then used to determine things like section alignment. This is a follow up to D101462 and prepares for the RISCV backend defining the text section alignment depending on the enabled extensions. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101921	2021-05-23 14:15:23 -07:00
Lang Hames	da54af9961	[ORC] Add more synchronization to TestLookupWithUnthreadedMaterialization. Don't run tasks until their corresponding thread has been added to the running threads vector. This is an extension to fda4300da82, which doesn't seem to have been enough to fix the synchronization issues on its own.	2021-05-22 07:59:24 -07:00
Lang Hames	52bc19513e	[ORC] Fix race condition in CoreAPIsTest. This test has been failing intermittently on some builders, probably due to a race on the WorkThreads vector. This patch should fix that.	2021-05-21 21:11:54 -07:00
Lang Hames	7aa19629c2	[ORC][C-bindings] Replace LLVMOrcJITTargetMachineBuilderDisposeTargetTriple. The implementation and intent behind freeing the triple string here is the same as LLVMGetDefaultTargetTriple (and any other owned c string returned from the C API), so we should use LLVMDisposeMessage to free the string for consistency. Patch by Mats Larsen -- thanks Mats! Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D102957	2021-05-21 17:38:06 -07:00
Arthur Eubanks	6347acc246	Revert "[NPM] Do not run function simplification pipeline unnecessarily" This reverts commit 97ab068034161fb35e5c9a7b293bf1e569cf077b. Depends on D100917, which is to be reverted.	2021-05-21 16:38:02 -07:00
Lang Hames	dfb6a9ab5f	[ORC] Use GTEST_SKIP in ORC C-API unit test. Now that gtest has been updated to 1.10 which supports GTEST_SKIP, we can use that over return; Patch by Mats Larsen. Thanks Mats! Reviewed By: lhames, ikudrin Differential Revision: https://reviews.llvm.org/D102710	2021-05-21 10:15:05 -07:00
Djordje Todorovic	88aa158bd7	Recommit: "[Debugify][Original DI] Test dbg var loc preservation" [Debugify][Original DI] Test dbg var loc preservation This is an improvement of [0]. This adds checking of original llvm.dbg.values()/declares() instructions in optimizations. We have picked a real issue that has been found with this (actually, picked one variable location missing from [1] and resolved the issue), and the result is the fix for that -- D100844. Before applying the D100844, using the options from [0] (but with this patch applied) on the compilation of GDB 7.11, the final HTML report for the debug-info issues can be found at [1] (please scroll down, and look for "Summary of Variable Location Bugs"). After applying the D100844, the numbers have improved a bit -- please take a look into [2]. [0] https://llvm.org/docs/HowToUpdateDebugInfo.html#test-original-debug-info-preservation-in-optimizations [1] https://djolertrk.github.io/di-check-before-adce-fix/ [2] https://djolertrk.github.io/di-check-after-adce-fix/ Differential Revision: https://reviews.llvm.org/D100845 The Unit test was failing because the pass from the test that modifies the IR, in its runOnFunction() didn't return 'true', so the expensive-check configuration triggered an assertion.	2021-05-21 02:04:29 -07:00
Yevgeny Rouban	ebb8c67ccd	Allow incomplete template types in unique_function arguments We can't declare unique_function that has in its arguments a reference to a template type with an incomplete argument. For instance, we can't declare unique_function<void(SmallVectorImpl<A>&)> when A is forward declared. This is because SFINAE will trigger a hard error in this case, when instantiating IsSizeLessThanThresholdT with the incomplete type. This patch specializes AdjustedParamT for references to remove this error. Committed on behalf of: @math-fehr (Fehr Mathieu) Reviewed By: DaniilSuchkov, yrouban	2021-05-21 14:09:33 +07:00

7008 Commits