llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-18 18:42:46 +02:00

Author	SHA1	Message	Date
Hans Wennborg	0bc6a17a7e	Test commit	2020-12-07 17:27:03 +01:00
Tony	0a2da89f49	[NFC][AMDGPU] AMDGPUUsage updates - Document code object V2 gfx800. - Document amdpal is supported by Linux Pro. Differential Revision: https://reviews.llvm.org/D92708	2020-12-05 02:13:17 +00:00
Sean Silva	bc224c8fad	[SmallVector] Allow SmallVector<T> This patch adds a capability to SmallVector to decide a number of inlined elements automatically. The policy is: - A minimum of 1 inlined elements, with more as long as sizeof(SmallVector<T>) <= 64. - If sizeof(T) is "too big", then trigger a static_assert: this dodges the more pathological cases This is expected to systematically improve SmallVector use in the LLVM codebase, which has historically been plagued by semi-arbitrary / cargo culted N parameters, often leading to bad outcomes due to excessive sizeof(SmallVector<T, N>). This default also makes programming more convenient by avoiding edit/rebuild cycles due to forgetting to type the N parameter. Differential Revision: https://reviews.llvm.org/D92522	2020-12-03 17:21:44 -08:00
Paul C. Anagnostopoulos	15379f2764	[TableGen] Eliminate the 'code' type Update the documentation. Rework various backends that relied on the code type. Differential Revision: https://reviews.llvm.org/D92269	2020-12-03 10:19:11 -05:00
Fangrui Song	649f05aa24	Switch from llvm::is_trivially_copyable to std::is_trivially_copyable GCC<5 did not support std::is_trivially_copyable. Now LLVM builds require 5.1 we can migrate to std::is_trivially_copyable. The Optional.h change made MSVC choke (https://buildkite.com/llvm-project/premerge-checks/builds/18587#cd1bb616-ffdc-4581-9795-b42c284196de) so I leave it out for now. Differential Revision: https://reviews.llvm.org/D92514	2020-12-02 22:02:48 -08:00
Reid Kleckner	7c87aeebfe	Revert "Use std::is_trivially_copyable", breaks MSVC build Revert "Delete llvm::is_trivially_copyable and CMake variable HAVE_STD_IS_TRIVIALLY_COPYABLE" This reverts commit 4d4bd40b578d77b8c5bc349ded405fb58c333c78. This reverts commit 557b00e0afb2dc1776f50948094ca8cc62d97be4.	2020-12-02 14:30:46 -08:00
Fangrui Song	dffdc25f75	Use std::is_trivially_copyable GCC<5 did not support std::is_trivially_copyable. Now LLVM builds require 5.1 we can migrate to std::is_trivially_copyable.	2020-12-02 09:58:07 -08:00
Bardia Mahjour	fbc2c5ae27	[LV] Epilogue Vectorization with Optimal Control Flow (Recommit) This is yet another attempt at providing support for epilogue vectorization following discussions raised in RFC http://llvm.1065342.n5.nabble.com/llvm-dev-Proposal-RFC-Epilog-loop-vectorization-tt106322.html#none and reviews D30247 and D88819. Similar to D88819, this patch achieve epilogue vectorization by executing a single vplan twice: once on the main loop and a second time on the epilogue loop (using a different VF). However it's able to handle more loops, and generates more optimal control flow for cases where the trip count is too small to execute any code in vector form. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D89566	2020-12-02 10:09:56 -05:00
David Sherwood	6d7c7dcc2b	[SVE] Add support for scalable vectors with vectorize.scalable.enable loop attribute In this patch I have added support for a new loop hint called vectorize.scalable.enable that says whether we should enable scalable vectorization or not. If a user wants to instruct the compiler to vectorize a loop with scalable vectors they can now do this as follows: br i1 %exitcond, label %for.end, label %for.body, !llvm.loop !2 ... !2 = !{!2, !3, !4} !3 = !{!"llvm.loop.vectorize.width", i32 8} !4 = !{!"llvm.loop.vectorize.scalable.enable", i1 true} Setting the hint to false simply reverts the behaviour back to the default, using fixed width vectors. Differential Revision: https://reviews.llvm.org/D88962	2020-12-02 13:23:43 +00:00
Tony	545440c6ae	[NFC][AMDGPU] Fix broken link to ClangOffloadBundler in AMDGPUUsage	2020-12-02 03:04:28 +00:00
Tony	ebb5d91fda	[NFC][AMDGPU] AMDGPU code object V4 ABI documentation - Documantation for AMDGPU code object V4. - Documentation clarification for code object V2 and V3. - Documentation for the clang-offload-bundler. - Numerous other documentation clarifications. Change-Id: I338b327cc9e75da6c987b7e081b496402a5a020e Differential Revision: https://reviews.llvm.org/D92434	2020-12-01 23:31:04 +00:00
Bardia Mahjour	b7eee47753	Revert "[LV] Epilogue Vectorization with Optimal Control Flow" This reverts commit 9c5504adceb544d9954ddb8ff3035a414f4b1423. Reverting to investigate build failure in http://lab.llvm.org:8011/#/builders/98/builds/1461/steps/9	2020-12-01 12:50:36 -05:00
Bardia Mahjour	63b138b338	[LV] Epilogue Vectorization with Optimal Control Flow This is yet another attempt at providing support for epilogue vectorization following discussions raised in RFC http://llvm.1065342.n5.nabble.com/llvm-dev-Proposal-RFC-Epilog-loop-vectorization-tt106322.html#none and reviews D30247 and D88819. Similar to D88819, this patch achieve epilogue vectorization by executing a single vplan twice: once on the main loop and a second time on the epilogue loop (using a different VF). However it's able to handle more loops, and generates more optimal control flow for cases where the trip count is too small to execute any code in vector form. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D89566	2020-12-01 12:04:29 -05:00
Amy Huang	b375f49096	Recommit "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" This reverts commit 1b63177a56e8cd6196778d2b90295f03e96b5800.	2020-11-30 17:36:12 -08:00
Juneyoung Lee	b00d8684c3	[LangRef] missing link, minor fix	2020-11-30 23:09:36 +09:00
David Spickett	5c2736b6c2	[llvm-objdump] Document --mattr=help in --help output This does the same as `--mcpu=help` but was only documented in the user guide. * Added a test for both options. * Corrected the single dash in `-mcpu=help` text. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D92305	2020-11-30 12:52:54 +00:00
Juneyoung Lee	e5f43fdeb3	[LangRef] minor fixes to poison examples and well-defined values section (NFC)	2020-11-29 20:51:25 +09:00
Juneyoung Lee	7546d005c8	[LangRef] Add poison constant This patch adds a description about the newly added poison constant to LangRef. Differential Revision: https://reviews.llvm.org/D92162	2020-11-27 10:29:52 +09:00
Marek Kurdej	542b1725df	[llvm-profgen] [docs] Fix invalid header. Add to ToC. NFC.	2020-11-26 10:45:05 +01:00
Amy Huang	2504c5bf49	Revert "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" Breaks some asan tests on the buildbot. This reverts commit c74b427cb2a90309ee0c29df21ad1ca26390263c.	2020-11-23 16:29:45 -08:00
Amy Huang	f6737ef448	[llvm-symbolizer] Switch to using native symbolizer by default on Windows llvm-symbolizer used to use the DIA SDK for symbolization on Windows; this patch switches to using native symbolization, which was implemented recently. Users can still make the symbolizer use DIA by adding the `-dia` flag in the LLVM_SYMBOLIZER_OPTS environment variable. Differential Revision: https://reviews.llvm.org/D91814	2020-11-23 15:57:08 -08:00
Paul C. Anagnostopoulos	58226c6585	[TableGen] Eliminte source location from CodeInit Step 1 in eliminating the 'code' type. Differential Revision: https://reviews.llvm.org/D91932	2020-11-23 11:30:13 -05:00
Tony	7cfcd72ff5	[NFC][AMDGPU] Document kernel descriptor - Document that the kernel descriptor defined is for code object V3. Document that it also applies to earlier code object formats for CP. - Document the deprecated bits in kernel descriptor. Differential Revision: https://reviews.llvm.org/D91458	2020-11-21 04:54:17 +00:00
wlei	dd799fc98b	[llvm-profgen][NFC]Fix build failure on different platform see titile Test Plan: ninja & ninja check-llvm Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D91897	2020-11-20 16:36:04 -08:00
wlei	563627eb7b	[CSSPGO][llvm-profgen] Disassemble text sections This stack of changes introduces `llvm-profgen` utility which generates a profile data file from given perf script data files for sample-based PGO. It’s part of(not only) the CSSPGO work. Specifically to support context-sensitive with/without pseudo probe profile, it implements a series of functionalities including perf trace parsing, instruction symbolization, LBR stack/call frame stack unwinding, pseudo probe decoding, etc. Also high throughput is achieved by multiple levels of sample aggregation and compatible format with one stop is generated at the end. Please refer to: https://groups.google.com/g/llvm-dev/c/1p1rdYbL93s for the CSSPGO RFC. This change enables disassembling the text sections to build various address maps that are potentially used by the virtual unwinder. A switch `--show-disassembly` is being added to print the disassembly code. Like the llvm-objdump tool, this change leverages existing LLVM components to parse and disassemble ELF binary files. So far X86 is supported. Test Plan: ninja check-llvm Reviewed By: wmi, wenlei Differential Revision: https://reviews.llvm.org/D89712	2020-11-20 14:26:26 -08:00
wlei	cc95d46ee1	[CSSPGO][llvm-profgen] Parse mmap events from perf script This stack of changes introduces `llvm-profgen` utility which generates a profile data file from given perf script data files for sample-based PGO. It’s part of(not only) the CSSPGO work. Specifically to support context-sensitive with/without pseudo probe profile, it implements a series of functionalities including perf trace parsing, instruction symbolization, LBR stack/call frame stack unwinding, pseudo probe decoding, etc. Also high throughput is achieved by multiple levels of sample aggregation and compatible format with one stop is generated at the end. Please refer to: https://groups.google.com/g/llvm-dev/c/1p1rdYbL93s for the CSSPGO RFC. As a starter, this change sets up an entry point by introducing PerfReader to load profiled binaries and perf traces(including perf events and perf samples). For the event, here it parses the mmap2 events from perf script to build the loader snaps, which is used to retrieve the image load address in the subsequent perf tracing parsing. As described in llvm-profgen.rst, the tool being built aims to support multiple input perf data (preprocessed by perf script) as well as multiple input binary images. It should also support dynamic reload/unload shared objects by leveraging the loader snaps being built by this change Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D89707	2020-11-20 14:26:26 -08:00
Alex Richardson	9c96f39f77	Add a default address space for globals to DataLayout This is similar to the existing alloca and program address spaces (D37052) and should be used when creating/accessing global variables. We need this in our CHERI fork of LLVM to place all globals in address space 200. This ensures that values are accessed using CHERI load/store instructions instead of the normal MIPS/RISC-V ones. The problem this is trying to fix is that most of the time the type of globals is created using a simple PointerType::getUnqual() (or ::get() with the default address-space value of 0). This does not work for us and we get assertion/compilation/instruction selection failures whenever a new call is added that uses the default value of zero. In our fork we have removed the default parameter value of zero for most address space arguments and use DL.getProgramAddressSpace() or DL.getGlobalsAddressSpace() whenever possible. If this change is accepted, I will upstream follow-up patches to use DL.getGlobalsAddressSpace() instead of relying on the default value of 0 for PointerType::get(), etc. This patch and the follow-up changes will not have any functional changes for existing backends with the default globals address space of zero. A follow-up commit will change the default globals address space for AMDGPU to 1. Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D70947	2020-11-20 15:46:52 +00:00
Pavel Iliin	2529cb73ff	[AArch64] Out-of-line atomics (-moutline-atomics) implementation. This patch implements out of line atomics for LSE deployment mechanism. Details how it works can be found in llvm/docs/Atomics.rst Options -moutline-atomics and -mno-outline-atomics to enable and disable it were added to clang driver. This is clang and llvm part of out-of-line atomics interface, library part is already supported by libgcc. Compiler-rt support is provided in separate patch. Differential Revision: https://reviews.llvm.org/D91157	2020-11-20 13:30:12 +00:00
Leonard Chan	c24d9d2b01	[llvm][IR] Add dso_local_equivalent Constant The `dso_local_equivalent` constant is a wrapper for functions that represents a value which is functionally equivalent to the global passed to this. That is, if this accepts a function, calling this constant should have the same effects as calling the function directly. This could be a direct reference to the function, the `@plt` modifier on X86/AArch64, a thunk, or anything that's equivalent to the resolved function as a call target. When lowered, the returned address must have a constant offset at link time from some other symbol defined within the same binary. The address of this value is also insignificant. The name is leveraged from `dso_local` where use of a function or variable is resolved to a symbol in the same linkage unit. In this patch: - Addition of `dso_local_equivalent` and handling it - Update Constant::needsRelocation() to strip constant inbound GEPs and take advantage of `dso_local_equivalent` for relative references This is useful for the [Relative VTables C++ ABI](https://reviews.llvm.org/D72959) which makes vtables readonly. This works by replacing the dynamic relocations for function pointers in them with static relocations that represent the offset between the vtable and virtual functions. If a function is externally defined, `dso_local_equivalent` can be used as a generic wrapper for the function to still allow for this static offset calculation to be done. See [RFC](http://lists.llvm.org/pipermail/llvm-dev/2020-August/144469.html) for more details. Differential Revision: https://reviews.llvm.org/D77248	2020-11-19 10:26:17 -08:00
Nick Desaulniers	b2b1b97849	Revert "[IR] add fn attr for no_stack_protector; prevent inlining on mismatch" This reverts commit b7926ce6d7a83cdf70c68d82bc3389c04009b841. Going with a simpler approach.	2020-11-17 17:27:14 -08:00
Florian Hahn	4864887dc5	[VPlan] Add VPDef class. This patch introduces a new VPDef class, which can be used to manage VPValues defined by recipes/VPInstructions. The idea here is to mirror VPUser for values defined by a recipe. A VPDef can produce either zero (e.g. a store recipe), one (most recipes) or multiple (VPInterleaveRecipe) result VPValues. To traverse the def-use chain from a VPDef to its users, one has to traverse the users of all values defined by a VPDef. VPValues now contain a pointer to their corresponding VPDef, if one exists. To traverse the def-use chain upwards from a VPValue, we first need to check if the VPValue is defined by a VPDef. If it does not have a VPDef, this means we have a VPValue that is not directly defined iniside the plan and we are done. If we have a VPDef, it is defined inside the region by a recipe, which is a VPUser, and the upwards def-use chain traversal continues by traversing all its operands. Note that we need to add an additional field to to VPVAlue to link them to their defs. The space increase is going to be offset by being able to remove the SubclassID field in future patches. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D90558	2020-11-17 16:18:11 +00:00
Michael Liao	f1ef8ad5ff	[InferAddrSpace] Teach to handle assumed address space. - In certain cases, a generic pointer could be assumed as a pointer to the global memory space or other spaces. With a dedicated target hook to query that address space from a given value, infer-address-space pass could infer and propagate that to all its users. Differential Revision: https://reviews.llvm.org/D91121	2020-11-16 17:06:33 -05:00
Paul C. Anagnostopoulos	af5277fb13	[TableGen] Improve a couple of descriptions in the command guide Differential Revision: https://reviews.llvm.org/D91484	2020-11-15 09:59:59 -05:00
Paul C. Anagnostopoulos	17cd31c30e	[TableGen] Add frontend/backend phase timing capability. Describe in the BackEnd Developer's Guide. Instrument a few backends. Remove an old unused timing facility. Add a null backend for timing the parser. Differential Revision: https://reviews.llvm.org/D91388	2020-11-14 10:10:29 -05:00
Nikita Popov	effaed5675	[LangRef] Clarify GEP inbounds wrapping semantics Clarify the semantics of GEP inbounds, in particular with regard to what it means for wrapping. This cleans up some confusion on when it is legal to apply nuw/nsw flags to various parts of the GEP calculation. Differential Revision: https://reviews.llvm.org/D90708	2020-11-13 17:49:41 +01:00
Paul C. Anagnostopoulos	82f80b36cd	[TableGen] Enhance the six comparison bang operators. Update the Programmer's Reference. Differential Revision: https://reviews.llvm.org/D91036	2020-11-13 09:57:27 -05:00
serge-sans-paille	ae7304bdea	llvmbuildectomy - compatibility with ocaml bindings Use exact component name in add_ocaml_library. Make expand_topologically compatible with new architecture. Fix quoting in is_llvm_target_library. Fix LLVMipo component name. Write release note.	2020-11-13 14:35:52 +01:00
Florian Hahn	041da6277f	Add !annotation metadata and remarks pass. This patch adds a new !annotation metadata kind which can be used to attach annotation strings to instructions. It also adds a new pass that emits summary remarks per function with the counts for each annotation kind. The intended uses cases for this new metadata is annotating 'interesting' instructions and the remarks should provide additional insight into transformations applied to a program. To motivate this, consider these specific questions we would like to get answered: * How many stores added for automatic variable initialization remain after optimizations? Where are they? * How many runtime checks inserted by a frontend could be eliminated? Where are the ones that did not get eliminated? Discussed on llvm-dev as part of 'RFC: Combining Annotation Metadata and Remarks' (http://lists.llvm.org/pipermail/llvm-dev/2020-November/146393.html) Reviewed By: thegameg, jdoerfert Differential Revision: https://reviews.llvm.org/D91188	2020-11-13 13:24:10 +00:00
Florian Hahn	744ac7e74e	[docs] Fix undefined reference in ORCv2 design doc. This fixes a typo introduced in 984e87923f1096c815cef900cda0926c68286ddf which caused the docs build to fail.	2020-11-13 09:44:48 +00:00
serge-sans-paille	82b6e6053d	llvmbuildectomy - replace llvm-build by plain cmake No longer rely on an external tool to build the llvm component layout. Instead, leverage the existing `add_llvm_componentlibrary` cmake function and introduce `add_llvm_component_group` to accurately describe component behavior. These function store extra properties in the created targets. These properties are processed once all components are defined to resolve library dependencies and produce the header expected by llvm-config. Differential Revision: https://reviews.llvm.org/D90848	2020-11-13 10:35:24 +01:00
Lang Hames	19bcc02c62	[docs] Fix formatting, clarify comment in ORCv2 doc	2020-11-12 13:11:01 +11:00
Lang Hames	56fbd9511b	[docs] Fix formatting in ORCv2.rst. Bold and fixed-width do not appear to mix well.	2020-11-12 11:08:58 +11:00
Lang Hames	f3031ea7aa	[docs] Update ORCv2 design doc. Fixes some formatting and wording, and adds a roadmap section.	2020-11-12 10:33:29 +11:00
Renato Golin	b0b9198036	[docs] link new support policy from developer policy Adding new paragraphs under "Introducing New Components" section to check the different levels of support we have, to help introduction of smaller set of changes without overwhelming new collaborators and potentially losing the contribution. Differential Revision: D91013	2020-11-10 19:40:57 +00:00
David Green	f1fd013f54	[Sphinx] Fix langref formatting. NFC	2020-11-10 16:47:43 +00:00
David Green	0773b05cfa	[ARM] Alter t2DoLoopStart to define lr This changes the definition of t2DoLoopStart from t2DoLoopStart rGPR to GPRlr = t2DoLoopStart rGPR This will hopefully mean that low overhead loops are more tied together, and we can more reliably generate loops without reverting or being at the whims of the register allocator. This is a fairly simple change in itself, but leads to a number of other required alterations. - The hardware loop pass, if UsePhi is set, now generates loops of the form: %start = llvm.start.loop.iterations(%N) loop: %p = phi [%start], [%dec] %dec = llvm.loop.decrement.reg(%p, 1) %c = icmp ne %dec, 0 br %c, loop, exit - For this a new llvm.start.loop.iterations intrinsic was added, identical to llvm.set.loop.iterations but produces a value as seen above, gluing the loop together more through def-use chains. - This new instrinsic conceptually produces the same output as input, which is taught to SCEV so that the checks in MVETailPredication are not affected. - Some minor changes are needed to the ARMLowOverheadLoop pass, but it has been left mostly as before. We should now more reliably be able to tell that the t2DoLoopStart is correct without having to prove it, but t2WhileLoopStart and tail-predicated loops will remain the same. - And all the tests have been updated. There are a lot of them! This patch on it's own might cause more trouble that it helps, with more tail-predicated loops being reverted, but some additional patches can hopefully improve upon that to get to something that is better overall. Differential Revision: https://reviews.llvm.org/D89881	2020-11-10 15:57:58 +00:00
Paul C. Anagnostopoulos	25ecf898fc	[TableGen] Add the !filter bang operator. Add a test. Update the Programmer's Reference. Use it in some TableGen files. Differential Revision: https://reviews.llvm.org/D91008	2020-11-09 10:56:55 -05:00
Sebastian Neubauer	7e4be9501b	[AMDGPU] Add amdgpu_gfx calling convention Add a calling convention called amdgpu_gfx for real function calls within graphics shaders. For the moment, this uses the same calling convention as other calls in amdgpu, with registers excluded for return address, stack pointer and stack buffer descriptor. Differential Revision: https://reviews.llvm.org/D88540	2020-11-09 16:51:44 +01:00
Renato Golin	5f28812127	[docs] Adding a Support Policy As discussed in the mailing list [1-4], we need a separation of support tiers when requiring support from the whole community versus a sub-community. Essentially, if a sub-community is active enough and takes maintenance into their own internal costs without affecting other parts of the community's maintenance costs, then code that is not immediately relevant to all parts (ie. not released, actively tested, etc) can still find its way into the LLVM main repository without major pain points. The main benefit is to reduce the maintenance cost that those sub-communities have outside of LLVM (for example, in duplicating common code, applying the same patches on top of multiple user repositories or downstream projects). This document outlines the components and responsibilities of the sub-communities with regards to maintenance costs and how they affect the rest of the community. It also adds an addendum on removal policies, which expand the existing "new target removal" policy into something more generic, to encompass any piece of code, scripts or documents in the repository. [1] http://lists.llvm.org/pipermail/llvm-dev/2020-October/146249.html [2] http://lists.llvm.org/pipermail/llvm-dev/2020-November/146335.html [3] http://lists.llvm.org/pipermail/llvm-dev/2020-October/146138.html [4] http://lists.llvm.org/pipermail/llvm-dev/2020-November/146298.html	2020-11-07 21:06:05 +00:00
Arnold Schwaighofer	3fe61868a9	llvm.coro.id.async lowering: Parameterize how-to restore the current's continutation context and restart the pipeline after splitting The `llvm.coro.suspend.async` intrinsic takes a function pointer as its argument that describes how-to restore the current continuation's context from the context argument of the continuation function. Before we assumed that the current context can be restored by loading from the context arguments first pointer field (`first_arg->caller_context`). This allows for defining suspension points that reuse the current context for example. Also: llvm.coro.id.async lowering: Add llvm.coro.preprare.async intrinsic Blocks inlining until after the async coroutine was split. Also, change the async function pointer's context size position struct async_function_pointer { uint32_t relative_function_pointer_to_async_impl; uint32_t context_size; } And make the position of the `async context` argument configurable. The position is specified by the `llvm.coro.id.async` intrinsic. rdar://70097093 Differential Revision: https://reviews.llvm.org/D90783	2020-11-06 06:22:46 -08:00
Paul C. Anagnostopoulos	af8abf220b	[TableGen] Clarify text and fix errors in the Programmer's Reference Differential Revision: https://reviews.llvm.org/D90881	2020-11-06 08:56:29 -05:00
Paul C. Anagnostopoulos	a49fe8891e	[TableGen] Clean up documentation toctrees; clarify two paragraphs. Differential Revision: https://reviews.llvm.org/D90804	2020-11-05 16:19:18 -05:00
Paul C. Anagnostopoulos	0830936d52	[TableGen] Add true and false literals to represent booleans Update the Programmer's Reference document. Add a test. Update a couple of tests with an improved error message. Differential Revision: https://reviews.llvm.org/D90635	2020-11-05 09:07:21 -05:00
Atmn Patel	f818c9012a	[LangRef] Adds llvm.loop.mustprogress loop metadata This patch adds the llvm.loop.mustprogress loop metadata. This is to be added to loops where the frontend language requires that the loop makes observable interactions with the environment. This is the loop-level equivalent to the function attribute `mustprogress` defined in D86233. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D88464	2020-11-04 22:32:50 -05:00
Arnold Schwaighofer	d90984c1dd	Start of an llvm.coro.async implementation This patch adds the `async` lowering of coroutines. This will be used by the Swift frontend to lower async functions. In contrast to the `retcon` lowering the frontend needs to be in control over control-flow at suspend points as execution might be suspended at these points. This is very much work in progress and the implementation will change as it evolves with the frontend. As such the documentation is lacking detail as some of it might change. rdar://70097093 Reapply with fix for memory sanitizer failure and sphinx failure. Differential Revision: https://reviews.llvm.org/D90612	2020-11-04 10:29:21 -08:00
Arnold Schwaighofer	c8e9566a32	Revert "Start of an llvm.coro.async implementation" This reverts commit ea606cced0583d1dbd4c44680601d1d4e9a56e58. This patch causes memory sanitizer failures sanitizer-x86_64-linux-fast.	2020-11-04 08:26:20 -08:00
Arnold Schwaighofer	3e8facdd39	Start of an llvm.coro.async implementation This patch adds the `async` lowering of coroutines. This will be used by the Swift frontend to lower async functions. In contrast to the `retcon` lowering the frontend needs to be in control over control-flow at suspend points as execution might be suspended at these points. This is very much work in progress and the implementation will change as it evolves with the frontend. As such the documentation is lacking detail as some of it might change. rdar://70097093 Differential Revision: https://reviews.llvm.org/D90612	2020-11-04 07:32:29 -08:00
Paul C. Anagnostopoulos	9295b21984	[TableGen] Add !interleave operator to concatenate a list of values with delimiters Add a test. Use it in some TableGen files. Differential Revision: https://reviews.llvm.org/D90469	2020-11-04 09:23:54 -05:00
Fangrui Song	955aec52ea	[docs] Fix docs-llvm-html after recent TableGen changes D90617	2020-11-03 13:43:24 -08:00
Tony	05efac7ee2	[NFC][AMDGPU] Minor editorial improvements to AMDGPUUsage.rst Differential Revision: https://reviews.llvm.org/D90661	2020-11-03 16:56:01 +00:00
Tim Renouf	83e3834a8d	[AMDGPU] Add gfx1033 target Differential Revision: https://reviews.llvm.org/D90447 Change-Id: If2650fc7f31bbdd49c76e74a9ca8e3734d769761	2020-11-03 16:27:48 +00:00
Tim Renouf	2a63696860	[AMDGPU] Add gfx90c target This differentiates the Ryzen 4000/4300/4500/4700 series APUs that were previously included in gfx909. Differential Revision: https://reviews.llvm.org/D90419 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-11-03 16:27:43 +00:00
Mircea Trofin	6041d3461b	[Docs][FileCheck] Small fix.	2020-11-03 07:08:51 -08:00
Tony	da9521a27f	[NFC][AMDGPU] Restructure the AMDGPU memory model description Separate the AMDGPU memory model description into separate sections for each architecture. Differential Revision: https://reviews.llvm.org/D90548	2020-11-02 21:32:20 +00:00
Atmn Patel	4adb5d4209	[Coroutines][Docs] Remove frame packing as a TODO This has already been done by @rjmccall in D76526 (49e5a97ec363), and 9514c048d89e. We should remove this from the docs. Differential Revision: https://reviews.llvm.org/D90550	2020-11-02 15:57:04 -05:00
Mircea Trofin	f2bff76dd3	[FileCheck] Added documentation for --allow-unused-prefixes Differential Revision: https://reviews.llvm.org/D90621	2020-11-02 12:15:45 -08:00
Paul C. Anagnostopoulos	e028635778	[TableGen] Fix a couple of minor issues regarding the paste operator. Update the documentation to fully describe it. Differential Revision: https://reviews.llvm.org/D90617	2020-11-02 12:21:54 -05:00
Caroline Concatto	d88ee71498	Revert "[AArch64][AsmParser] Remove 'x31' alias for 'sp/xzr' register." This reverts commit 8b281bfaf35d00d42c2993fd5a80d749cc21f45e.	2020-11-02 08:15:50 +00:00
Caroline Concatto	19fb2444af	[AArch64][AsmParser] Remove 'x31' alias for 'sp/xzr' register. Only the aliases 'xzr' and 'sp' exist for the physical register x31. The reason for wanting to remove the alias 'x31' is because it allows users to write invalid asm that is not accepted by the GNU assembler. Is there any objection to removing this alias? Or do we want to keep this for compatibility with existing code that uses w31/x31? Differential Revision: https://reviews.llvm.org/D90153	2020-11-02 07:57:05 +00:00
Liu, Chen3	0f29f1e458	[X86] Support Intel avxvnni This patch mainly made the following changes: 1. Support AVX-VNNI instructions; 2. Introduce ExplicitVEXPrefix flag so that vpdpbusd/vpdpbusds/vpdpbusds/vpdpbusds instructions only use vex-encoding when user explicity add {vex} prefix. Differential Revision: https://reviews.llvm.org/D89105	2020-10-31 12:39:51 +08:00
Tony	2aebcb4378	[NFC][AMDGPU] Minor cleanup to AMDGPU memory model table Differential Revision: https://reviews.llvm.org/D90509	2020-10-30 22:50:22 +00:00
Scott Linder	3d87386ebf	[NFC][AMDGPU] Resize Memory Model columns in AMDGPUUsage.rst Make all of the "AMDGPU Machine Code GFX*" columns in the Memory Model table a consistent width of 32-characters. Best viewed with something like --word-diff Differential Revision: https://reviews.llvm.org/D89977	2020-10-29 23:07:03 +00:00
Scott Linder	e2577bcfdc	[AMDGPU] Update Memory Model in AMDGPUUsage.rst Mostly NFC, but some changes are "bug fixes" rather than just e.g. formatting changes or typo corrections. - Fix typo "competing" -> "completing". - Document why waintcnt is added to stores and not loads for sequentially consistent ordering. - Lowercase some mentions of `buffer_gl{0,1}_inv`. - Make mentions of `*cnt(0)` consistently include the `(0)` count. - Remove some mentions of instructions for incorrect address spaces. For example, remove mention of `flat_load` from `load atomic acquire workgroup global`. - Re-flow some text to get all the target columns to fit in a 32-character wide column. Makes a future NFC patch to make these columns both 32-character wide more straightforward. Modified cherry-pick of patch by Tony Tye Reviewed By: t-tye Differential Revision: https://reviews.llvm.org/D89596	2020-10-29 23:07:03 +00:00
Stefanos Baziotis	d9f0b9e2f4	[LCSSA] Doc for special treatment of PHIs Differential Revision: https://reviews.llvm.org/D89739	2020-10-29 22:50:07 +02:00
Nikita Popov	261a4b21fb	[CodeGen] Fix neutral value of vecreduce fadd in tests (NFC) The neutral value is -0.0, not 0.0. This doesn't matter for "fast" reductions due to nsz, but does matter for reassoc-only and seq reductions. Change tests to mostly use -0.0 where the neutral value was intended, and add some additional test coverage in some places. Also update LangRef to use the right value.	2020-10-29 21:26:14 +01:00
Tony	4b2fceb859	[AMDGPU] Update AMD GPU documentation - AMDGPUUsage.rst: Correct AMD GPU DWARF address space table address sizes which are in bits and not bytes. - clang/.../Options.td: Improve description of AMD GPU options. - Re-generate ClangComamndLineReference.rst from clang/.../Options.td . Differential Revision: https://reviews.llvm.org/D90364	2020-10-29 20:12:47 +00:00
Mehdi Amini	5e84d47808	Make the post-commit review expectations more explicit with respect to revert See http://lists.llvm.org/pipermail/llvm-dev/2016-March/096529.html for context. Reviewed By: silvas, rengolin, echristo, dexonsmith, gribozavr2 Differential Revision: https://reviews.llvm.org/D89995	2020-10-28 23:29:29 +00:00
Paul C. Anagnostopoulos	961a515ed0	[TableGen] [AMDGPU] Add !sub operator for subtraction Use it in the AMDGPU target to eliminate !add(value1, !mul(value2, -1)) Differential Revision: https://reviews.llvm.org/D90107	2020-10-28 12:27:53 -04:00
Paul C. Anagnostopoulos	2be44969ba	[TableGen] Command description file requires a hyphen in document title.	2020-10-28 09:31:31 -04:00
Paul C. Anagnostopoulos	f49b986f20	[TableGen] Update xxx-tblgen command document. Add a few cross-references among TableGen documents. Differential Revision: https://reviews.llvm.org/D90186 Add cross-references between TableGen documents.	2020-10-28 09:08:13 -04:00
Clement Courbet	6f015a2381	[llvm-exegesis][doc] Remove old FIXME. This was fixed in a previous commit, the previous line in the documentation explains how to proceed.	2020-10-28 10:53:23 +01:00
Clement Courbet	e9e3b95b0c	[llvm-exegesis] Update doc. We don't need an external script to scan all opcodes anymore, just use `-opcode-index=-1`.	2020-10-28 08:42:38 +01:00
Johannes Doerfert	bf9703343b	[LangRef] Clarify `dereferenceable` -> `nonnull` implication If `null_pointer_is_valid` is present, `dereferenceable` does not imply `nonnull`, make it clear. Came up in D17993. Reviewed By: aqjune Differential Revision: https://reviews.llvm.org/D89417	2020-10-27 19:12:53 -05:00
Georgii Rymar	4708221cf7	[llvm-readelf] - Implement --section-details option. --section-details/-t is a GNU readelf option that produce an output that is an alternative to --sections. Differential revision: https://reviews.llvm.org/D89304	2020-10-27 13:29:39 +03:00
Vedant Kumar	54baa09ec4	[cmake] Add LLVM_UBSAN_FLAGS, to allow overriding UBSan flags Allow overriding the default set of flags used to enable UBSan when building llvm. This can be used to test new checks or opt out of certain checks. Differential Revision: https://reviews.llvm.org/D89439	2020-10-26 15:48:19 -07:00
Benjamin Kramer	1f13ddec12	[X86] Add a stub for Intel's alderlake. No scheduling, no autodetection.	2020-10-24 19:01:22 +02:00
Tony	a12729cc2b	[AMDGPU] Cleanup AMDGPUUsage.rst - Layout and typo improvements. - Add memory spaces section. - reStructure syntax fixes. Differential Revision: https://reviews.llvm.org/D90002	2020-10-24 06:21:27 +00:00
Artur Pilipenko	31af2fa7ed	GC-parseable element atomic memcpy/memmove This change introduces a GC parseable lowering for element atomic memcpy/memmove intrinsics. This way runtime can provide an implementation which can take a safepoint during copy operation. See "GC-parseable element atomic memcpy/memmove" thread on llvm-dev for the background and details: https://groups.google.com/g/llvm-dev/c/NnENHzmX-b8/m/3PyN8Y2pCAAJ Differential Revision: https://reviews.llvm.org/D88861	2020-10-23 14:06:09 -07:00
Nick Desaulniers	e95a065d26	[IR] add fn attr for no_stack_protector; prevent inlining on mismatch It's currently ambiguous in IR whether the source language explicitly did not want a stack a stack protector (in C, via function attribute no_stack_protector) or doesn't care for any given function. It's common for code that manipulates the stack via inline assembly or that has to set up its own stack canary (such as the Linux kernel) would like to avoid stack protectors in certain functions. In this case, we've been bitten by numerous bugs where a callee with a stack protector is inlined into an __attribute__((__no_stack_protector__)) caller, which generally breaks the caller's assumptions about not having a stack protector. LTO exacerbates the issue. While developers can avoid this by putting all no_stack_protector functions in one translation unit together and compiling those with -fno-stack-protector, it's generally not very ergonomic or as ergonomic as a function attribute, and still doesn't work for LTO. See also: https://lore.kernel.org/linux-pm/20200915172658.1432732-1-rkir@google.com/ https://lore.kernel.org/lkml/20200918201436.2932360-30-samitolvanen@google.com/T/#u Typically, when inlining a callee into a caller, the caller will be upgraded in its level of stack protection (see adjustCallerSSPLevel()). By adding an explicit attribute in the IR when the function attribute is used in the source language, we can now identify such cases and prevent inlining. Block inlining when the callee and caller differ in the case that one contains `nossp` when the other has `ssp`, `sspstrong`, or `sspreq`. Fixes pr/47479. Reviewed By: void Differential Revision: https://reviews.llvm.org/D87956	2020-10-23 11:55:39 -07:00
Paul C. Anagnostopoulos	a16b7dbf16	[TableGen] Change !getop and !setop to !getdagop and !setdagop. Differential Revision: https://reviews.llvm.org/D89814	2020-10-23 10:36:05 -04:00
Nick Desaulniers	9d3871429a	BitCodeFormat: update doc on new byref and mustprogress attrs; NFC Forked from review of: https://reviews.llvm.org/D87956	2020-10-22 16:29:56 -07:00
Tom Stellard	5454af6727	HowToReleaseLLVM: Clean up document and remove references to SVN Reviewed By: hans Differential Revision: https://reviews.llvm.org/D80395	2020-10-22 11:34:03 -07:00
Paul C. Anagnostopoulos	8a7a44c2c6	[TableGen] Update documents to make them more complete Differential Revision: https://reviews.llvm.org/D89962	2020-10-22 13:19:19 -04:00
Arthur Eubanks	9d7ca40d8d	Revert "[Docs] Clarify that FunctionPasses can't add/remove declarations" This reverts commit 710676cf3a3c6f6ddf2f18e24cac017d20dac1ff.	2020-10-22 09:49:42 -07:00
Arthur Eubanks	4381b544ab	[Docs] Clarify that FunctionPasses can't add/remove declarations In preparation for potential future concurrency, a FunctionPass shouldn't modify anything at the module level that other FunctionPasses can also modify. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D89890	2020-10-22 09:03:42 -07:00
Paul C. Anagnostopoulos	e57a8ed671	[TableGen] Continue improving the comments for the data structures. Differential Revision: https://reviews.llvm.org/D89901	2020-10-22 10:00:49 -04:00
Tianqing Wang	e6283a5b5d	[X86] Add User Interrupts(UINTR) instructions For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89301	2020-10-22 17:33:07 +08:00
Wang, Pengfei	a413347201	[X86] Add clang release notes for HRESET and minor change for llvm release notes. (NFC)	2020-10-21 15:59:42 +08:00
Konrad Kleine	310ff80e10	[doc] Apply buildbot worker terminology change: slave->worker Recently [1], there was an upgrade to the version of buildbot being deployed. The new setup will still work with old buildslaves but I thought it might be a good idea to update the documentation to reflect, that you now can use a newer buildbot version to when setting up your worker (formely known as slave). The upgrade from buildbot 0.8.5 to 2.8.5 went a long with a transition to a new "worker" terminology [2] which is also reflected by this change. [1]: http://lists.llvm.org/pipermail/llvm-dev/2020-October/145629.html [2]: http://docs.buildbot.net/0.9.12/manual/worker-transition.html Reviewed By: gkistanova Differential Revision: https://reviews.llvm.org/D89230	2020-10-20 06:43:09 -04:00
Artur Pilipenko	12e6efee22	Adding new Azul representative to security group Adding myself as a new Azul representative to security group. Differential Revision: https://reviews.llvm.org/D89287	2020-10-19 22:41:19 -07:00
Atmn Patel	cbe95c4921	[LangRef] Define mustprogress attribute LLVM IR currently assumes some form of forward progress. This form is not explicitly defined anywhere, and is the cause of miscompilations in most languages that are not C++11 or later. This implicit forward progress guarantee can not be opted out of on a function level nor on a loop level. Languages such as C (C11 and later), C++ (pre-C++11), and Rust have different forward progress requirements and this needs to be evident in the IR. Specifically, C11 and onwards (6.8.5, Paragraph 6) states that "An iteration statement whose controlling expression is not a constant expression, that performs no input/output operations, does not access volatile objects, and performs no synchronization or atomic operations in its body, controlling expression, or (in the case of for statement) its expression-3, may be assumed by the implementation to terminate." C++11 and onwards does not have this assumption, and instead assumes that every thread must make progress as defined in [intro.progress] when it comes to scheduling. This was initially brought up in [0] as a bug, a solution was presented in [1] which is the current workaround, and the predecessor to this change was [2]. After defining a notion of forward progress for IR, there are two options to address this: 1) Set the default to assuming Forward Progress and provide an opt-out for functions and an opt-in for loops. 2) Set the default to not assuming Forward Progress and provide an opt-in for functions, and an opt-in for loops. Option 2) has been selected because only C++11 and onwards have a forward progress requirement and it makes sense for them to opt-into it via the defined `mustprogress` function attribute. The `mustprogress` function attribute indicates that the function is required to make forward progress as defined. This is sharply in contrast to the status quo where this is implicitly assumed. In addition, `willreturn` implies `mustprogress`. The background for why this definition was chosen is in [3] and for why the option was chosen is in [4] and the corresponding thread(s). The implementation is in D85393, the clang patch is in D86841, the LoopDeletion patch is in D86844, the Inliner patches are in D87180 and D87262, and there will be more incoming. [0] https://bugs.llvm.org/show_bug.cgi?id=965#c25 [1] https://lists.llvm.org/pipermail/llvm-dev/2017-October/118558.html [2] https://reviews.llvm.org/D65718 [3] https://lists.llvm.org/pipermail/llvm-dev/2020-September/144919.html [4] https://lists.llvm.org/pipermail/llvm-dev/2020-September/145023.html Reviewed By: jdoerfert, efriedma, nikic Differential Revision: https://reviews.llvm.org/D86233	2020-10-19 13:34:27 -04:00
Paul C. Anagnostopoulos	40d033740a	[TableGen] Enhance !empty and !size to handle strings and DAGs. Fix bug in the type checking for !empty, !head, !size, !tail.	2020-10-19 09:22:20 -04:00
Sam Parker	1e873328c1	[LangRef] Correct return type llvm.test.set.loop.iterations.* The langref description for llvm.test.set.loop.iterations.* were missing the i1 return type. Differential Revision: https://reviews.llvm.org/D89564 Patch by: Janek van Oirschot	2020-10-19 12:56:38 +01:00
Lang Hames	18259b0a87	[ORC][examples] Update Kaleidoscope and BuildingAJIT tutorial series to OrcV2. This patch updates the Kaleidoscope and BuildingAJIT tutorial series (chapter 1-4) to OrcV2. Chapter 5 of the BuildingAJIT series is removed -- it will be re-instated once we have in-tree support for out-of-process JITing. This patch only updates the tutorial code, not the text. Patches welcome for that, otherwise I will try to update it in a few weeks.	2020-10-18 21:03:04 -07:00
Paul C. Anagnostopoulos	a667a4a9f8	[TableGen] Change Programmer's Reference to use "DAG argument" rather than "operand". Differential Revision: https://reviews.llvm.org/D89624	2020-10-18 10:50:14 -04:00
Juneyoung Lee	e7de338270	Add support for !noundef metatdata on loads This patch adds metadata !noundef and makes load instructions can optionally have it. A load with !noundef always return a well-defined value (has no undef bit or isn't poison). If the loaded value isn't well defined, the behavior is undefined. This metadata can be used to encode the assumption from C/C++ that certain reads of variables should have well-defined values. It is helpful for optimizing freeze instructions away, because freeze can be removed when its operand has well-defined value, and showing that a load from arbitrary location is well-defined is usually hard otherwise. The same information can be encoded with llvm.assume with operand bundle; using metadata is chosen because I wasn't sure whether code motion can be freely done when llvm.assume is inserted from clang instead. The existing codebase already is stripping unknown metadata when doing code motion, so using metadata is UB-safe as well. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D89050	2020-10-17 13:50:10 +09:00
Juneyoung Lee	59a2a236c5	[LangRef] Rename the names of metadata in load/store's syntax (NFC) Discussed in D89050	2020-10-17 13:30:02 +09:00
Alok Kumar Sharma	b846ffc438	[DebugInfo] Support for DWARF operator DW_OP_over LLVM rejects DWARF operator DW_OP_over. This DWARF operator is needed for Flang to support assumed rank array. Summary: Currently LLVM rejects DWARF operator DW_OP_over. Below error is produced when llvm finds this operator. [..] invalid expression !DIExpression(151, 20, 16, 48, 30, 35, 80, 34, 6) warning: ignoring invalid debug info in over.ll [..] There were some parts missing in support of this operator, which are now completed. Testing -added a unit testcase -check-debuginfo -check-llvm Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D89208	2020-10-17 08:42:28 +05:30
Stanislav Mekhanoshin	9c088650a5	[AMDGPU] Fix gfx1032 description in AMDGPUUsage.rst. NFC. Differential Revision: https://reviews.llvm.org/D89565	2020-10-16 13:29:20 -07:00
Vinicius Tinti	3f47e2d686	[llvm-objdump] Implement --prefix option The prefix given to --prefix will be added to GNU absolute paths when used with --source option (source interleaved with the disassembly). This matches GNU's objdump behavior. GNU and C++17 rules for absolute paths are different. Differential Revision: https://reviews.llvm.org/D85024 Fixes PR46368. Differential Revision: https://reviews.llvm.org/D85024	2020-10-16 17:50:42 +01:00
Matt Arsenault	e3bfefd3cc	Reapply "OpaquePtr: Add type to sret attribute" This reverts commit eb9f7c28e5fe6d75fed3587023e17f2997c8024b. Previously this was incorrectly handling linking of the contained type, so this merges the fixes from D88973.	2020-10-16 11:05:02 -04:00
Stanislav Mekhanoshin	86aeb69232	[AMDGPU] gfx1032 target Differential Revision: https://reviews.llvm.org/D89487	2020-10-15 12:41:18 -07:00
Paul C. Anagnostopoulos	15f7b61423	[TableGen] Add the !not and !xor operators. Update the TableGen Programmer's Reference.	2020-10-15 10:12:59 -04:00
Konstantin Zhuravlyov	5f87057393	AMDGPU: Update AMDHSA code object version handling Differential Revision: https://reviews.llvm.org/D89076	2020-10-14 13:04:27 -04:00
Scott Linder	c5455cf70e	[DebugInfo][docs] Document DILabel in LangRef Add some minimal documentation for DILabel, originally introduced in D45024. Update the name and semantics of the `variables:` field in the documentation for `DISubprogram`; the field is now called `retainedNodes:` and is a heterogeneous list of `DILocalVariable` and `DILabel`. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D89082	2020-10-13 18:26:41 +00:00
Paul C. Anagnostopoulos	25ff50b32f	[TableGen] Add new section to the TableGen Programmer's Reference. Fix typos in it and the TableGen Backend Developer's Guide.	2020-10-13 09:59:13 -04:00
Pietro Albini	70207b468c	Add expected response time and escalation path to the security docs Following up on the discussion within the group during the roundtable at the 2020 LLVM Developers Meeting, this commit adds to the security docs: * How long we expect acknowledging security reports will take * The escalation path the reporter can follow if they get no response A temporary line inviting reporters to directly follow the escalation path while the mailing list is being setup is also added. Differential Revision: https://reviews.llvm.org/D89068	2020-10-13 10:57:06 +02:00
Tobias Hieta	34a683def6	[llvm-install-name-tool] Add -delete_all_rpaths option This diff adds an option to remove all rpaths from a Mach-O binary. Test plan: make check-all Differential revision: https://reviews.llvm.org/D88674	2020-10-13 00:45:57 -07:00
Wang, Pengfei	4ae5349aa4	[X86] Add HRESET instruction. For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89102	2020-10-13 08:47:26 +08:00
Paul C. Anagnostopoulos	69c806cfe8	[TableGen] Add overload of RecordKeeper::getAllDerivedDefinitions() and use in PseudoLowering backend. Now the two getAllDerivedDefinitions() use StringRef and Arrayref. Use all_of() in getAllDerivedDefinitions().	2020-10-12 16:40:09 -04:00
Tony	c338b755d2	[AMDGPU] Correct processor names for gfx1010 and gfx1011 Change-Id: Ie409f86876b0437d0b0405aff42872963708d926 Differential Revision: https://reviews.llvm.org/D89259	2020-10-12 20:16:12 +00:00
Fangrui Song	7466a25f11	[X86] Support -march=x86-64-v[234] PR47686. These micro-architecture levels are defined in the x86-64 psABI: https://gitlab.com/x86-psABIs/x86-64-ABI/-/commit/77566eb03bc6a326811cb7e9 GCC 11 will support these levels. Note, -mtune=x86-64-v[234] are invalid and __builtin_cpu_is cannot be used on them. Reviewed By: craig.topper, RKSimon Differential Revision: https://reviews.llvm.org/D89197	2020-10-12 10:29:46 -07:00
Philip Reames	f5a55066e4	Step down from security group Resigning from security group as Azul representative as I have left Azul. Previously communicated via email with security group. Differential Revision: https://reviews.llvm.org/D88933	2020-10-10 09:48:02 -07:00
Tim Renouf	13991476f1	[AMDGPU] Add gfx602, gfx705, gfx805 targets At AMD, in an internal audit of our code, we found some corner cases where we were not quite differentiating targets enough for some old hardware. This commit is part of fixing that by adding three new targets: * The "Oland" and "Hainan" variants of gfx601 are now split out into gfx602. LLPC (in the GPUOpen driver) and other front-ends could use that to avoid using the shaderZExport workaround on gfx602. * One variant of gfx703 is now split out into gfx705. LLPC and other front-ends could use that to avoid using the shaderSpiCsRegAllocFragmentation workaround on gfx705. * The "TongaPro" variant of gfx802 is now split out into gfx805. TongaPro has a faster 64-bit shift than its former friends in gfx802, and a subtarget feature could be set up for that to take advantage of it. This commit does not make that change; it just adds the target. V2: Add clang changes. Put TargetParser list in order. V3: AMDGCNGPUs table in TargetParser.cpp needs to be in GPUKind order, so fix the GPUKind order. Differential Revision: https://reviews.llvm.org/D88916 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-10-10 17:22:22 +01:00
Alok Kumar Sharma	0a8029e199	[DebugInfo] Support for DWARF attribute DW_AT_rank This patch adds support for DWARF attribute DW_AT_rank. Summary: Fortran assumed rank arrays have dynamic rank. DWARF attribute DW_AT_rank is needed to support that. Testing: unit test cases added (hand-written) check llvm check debug-info Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D89141	2020-10-10 17:51:12 +05:30
Zi Xuan Wu	e3b36cdf43	[CSKY 1/n] Add basic stub or infra of csky backend This patch introduce files that just enough for lib/Target/CSKY to compile. Notably a basic CSKYTargetMachine and CSKYTargetInfo. Differential Revision: https://reviews.llvm.org/D88466	2020-10-10 10:44:08 +08:00
Rahman Lavaee	194be1c7dd	Introduce and use a new section type for the bb_addr_map section. This patch lets the bb_addr_map (renamed to __llvm_bb_addr_map) section use a special section type (SHT_LLVM_BB_ADDR_MAP) instead of SHT_PROGBITS. This would help parsers, dumpers and other tools to use the sh_type ELF field to identify this section rather than relying on string comparison on the section name. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D88199	2020-10-08 11:13:19 -07:00
Amara Emerson	bbd25a9a88	[GlobalISel] Add G_VECREDUCE_* opcodes for vector reductions. These mirror the IR and SelectionDAG intrinsics & nodes. Opcodes added: G_VECREDUCE_SEQ_FADD G_VECREDUCE_SEQ_FMUL G_VECREDUCE_FADD G_VECREDUCE_FMUL G_VECREDUCE_FMAX G_VECREDUCE_FMIN G_VECREDUCE_ADD G_VECREDUCE_MUL G_VECREDUCE_AND G_VECREDUCE_OR G_VECREDUCE_XOR G_VECREDUCE_SMAX G_VECREDUCE_SMIN G_VECREDUCE_UMAX G_VECREDUCE_UMIN Differential Revision: https://reviews.llvm.org/D88750	2020-10-08 10:33:19 -07:00
Luqman Aden	59baf6c915	[llvm-readobj] Add --coff-tls-directory flag to print TLS Directory & test. Akin to dumpbin's /TLS option, this will print out the TLS directory, if present, in the image. Example output: ``` > llvm-readobj --coff-tls-directory test.exe File: test.exe Format: COFF-x86-64 Arch: x86_64 AddressSize: 64bit TLSDirectory { StartAddressOfRawData: 0x140004000 EndAddressOfRawData: 0x140004040 AddressOfIndex: 0x140002000 AddressOfCallBacks: 0x0 SizeOfZeroFill: 0x0 Characteristics [ (0x0) ] } ``` Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D88635	2020-10-08 01:53:15 -07:00
Serge Guelton	2e4356ed42	Update documentation and implementation of stage3 build Have the build work out of the box by forcing an LLD build. That way, we don't require an external LTO-aware linker, as we build one. Also remove reference to the seemingly dead builder. Differential Revision: https://reviews.llvm.org/D88990	2020-10-08 07:55:37 +02:00
Amara Emerson	59c2440372	[llvm][mlir] Promote the experimental reduction intrinsics to be first class intrinsics. This change renames the intrinsics to not have "experimental" in the name. The autoupgrader will handle legacy intrinsics. Relevant ML thread: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140729.html Differential Revision: https://reviews.llvm.org/D88787	2020-10-07 10:36:44 -07:00
Duncan P. N. Exon Smith	8fbfc2b8c5	docs: Emphasize ArrayRef over SmallVectorImpl The section on SmallVector has a note about preferring SmallVectorImpl for APIs but doesn't mention ArrayRef. Although ArrayRef is discussed elsewhere, let's re-emphasize here. Differential Revision: https://reviews.llvm.org/D49881	2020-10-06 18:13:52 -04:00
Michael Kruse	a5187b0c04	[docs] Revise loop terminology reference. Motivated by D88183, this seeks to clarify the current loop nomenclature with added illustrations, examples for possibly unexpected situations (infinite loops not part of the "parent" loop, logical loops sharing the same header, ...), and clarification on what other sources may consider a loop. The current document also has multiple errors that are fixed here. Some selected errors: * Loops a defined as strongly-connected components. A component a partition of all nodes, i.e. a subloop can never be a component. That is, the document as it currently is only covers top-level loops, even it also uses the term SCC for subloops. * "a block can be the header of two separate loops at the same time" (it is considered a single loop by LoopInfo) * "execute before some interesting event happens" (some interesting event is not well-defined) Reviewed By: baziotis, Whitney Differential Revision: https://reviews.llvm.org/D88408	2020-10-05 10:28:04 -05:00
Paul C. Anagnostopoulos	6ec902749c	[TableGen] New backend to print detailed records. Pertinent lints are fixed.	2020-10-02 10:22:13 -04:00
Chris Lattner	bff360bbf9	We don't need two different ways to get commit access, just simplify the policy here so that old SVN users and new contributors do the same thing.	2020-09-30 22:36:44 -07:00
Vedant Kumar	32d1049161	[docs] Recommend dropLocation() over setDebugLoc(DebugLoc())	2020-09-29 17:07:14 -07:00
Tres Popp	59b6daf823	Revert "OpaquePtr: Add type to sret attribute" This reverts commit 55c4ff91bd820d72014f63dcf7f3d5a0d3397986. Issues were introduced as discussed in https://reviews.llvm.org/D88241 where this change made previous bugs in the linker and BitCodeWriter visible.	2020-09-29 10:31:04 +02:00
Arthur Eubanks	ee468fc3e5	[Docs][NewPM] Add note about required passes Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D88342	2020-09-28 21:45:14 -07:00
Paul C. Anagnostopoulos	2483a36836	[TableGen] Add/edit Doxygen comments to match "TableGen Backend Developer's Guide."	2020-09-26 09:09:22 -04:00
Juneyoung Lee	68fbee82fd	[LangRef] Clarify the behavior of memory access instructions when pointers/sizes aren't well-defined This is a patch to LangRef that clarifies the behavior of load/store/memset/memcpy/memmove when the pointers or sizes are not well-defined as well. MSan detects a case when e.g., only lower bits of address are garbage when `-msan-check-access-address` is enabled, and it does not directly conflict with this patch because a C program should not use a pointer with undef bits and reasonable optimizations do not convert a well-defined pointer into a pointer with undef bits. This patch contains a definition of a well-defined value as well. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D87994	2020-09-26 08:13:27 +09:00
Matt Arsenault	0ec533bb8a	OpaquePtr: Add type to sret attribute Make the corresponding change that was made for byval in b7141207a483d39b99c2b4da4eb3bb591eca9e1a. Like byval, this requires a bulk update of the test IR tests to include the type before this can be mandatory.	2020-09-25 14:07:30 -04:00
Ian Levesque	919066e494	[xray] Function coverage groups Add the ability to selectively instrument a subset of functions by dividing the functions into N logical groups and then selecting a group to cover. By selecting different groups over time you could cover the entire application incrementally with lower overhead than instrumenting the entire application at once. Differential Revision: https://reviews.llvm.org/D87953	2020-09-24 22:09:53 -04:00
Stefanos Baziotis	490925f87c	[LoopTerminology][NFC] Fix formatting typo	2020-09-23 22:53:05 +03:00
Mehdi Amini	1d8405c45e	Document the `--verbatim` flag from arc to update the description for a phabricator revision	2020-09-23 18:01:10 +00:00
Mehdi Amini	be10725056	Update Phabricator doc to remove the warning on "arc land": tags a properly handled server side now	2020-09-23 18:01:09 +00:00
SuJunda (Junda Su)	c1d4b9d633	[docs][llvm] Fix typos I don't have commit access. Please help me commit it. Thanks : ) Reviewed By: Paul-C-Anagnostopoulos Differential Revision: https://reviews.llvm.org/D88139	2020-09-23 10:19:02 -04:00
Florian Hahn	64c733fe5d	[VPlan] Disconnect VPValue and VPUser. This refactors VPuser to not inherit from VPValue to facilitate introducing operations that introduce multiple VPValues (e.g. VPInterleaveRecipe). Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D84679	2020-09-23 14:44:31 +01:00
antonio-cortes-perez	a3cd2113f4	[NFC][docs] Fix link. The rendered html was (no hyperlink was generated): (see Getting Started <GettingStarted.html#git-pre-push-hook>) Now, it is (with proper hyperlink): (see Git pre-push hook) Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D88116	2020-09-22 23:40:03 +00:00
Paul C. Anagnostopoulos	54f163f8a4	Two patches to fix the broken build. One to fix a C++ compiler warning. One to allow Sphinx to find a new document.	2020-09-22 16:00:31 -04:00
Paul C. Anagnostopoulos	82dfae475d	Version 0.5 of the new "TableGen Backend Developer's Guide." Files modified to take comments into account. MLIR documentation updated for new TableGen documentation files.	2020-09-22 14:01:52 -04:00
antonio-cortes-perez	0eca672f50	[docs] Update ExtendingLLVM.rst Updated file paths and function signatures in section "Adding a new type". Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D88049	2020-09-21 16:49:48 -07:00
Alexander Shaposhnikov	29027aaeab	[llvm-install-name-tool] Update the command-line guide	2020-09-17 13:44:26 -07:00
Paul C. Anagnostopoulos	3fb53046bc	Add section with details about DAGs.	2020-09-16 09:27:28 -04:00
Han Seoul-Oh	30955c0ef8	[doc] Fix broken link	2020-09-15 09:58:08 +02:00
Xun Li	c84dc6693c	[Coroutines] Fix a typo in documentation In the example, the variable that's crossing suspend point was referred wrongly, fix it. Differential Revision: https://reviews.llvm.org/D83563	2020-09-14 18:56:57 -07:00
Arthur Eubanks	75d5f2cf7a	Reland [docs][NewPM] Add docs for writing NPM passes As to not conflict with the legacy PM example passes under llvm/lib/Transforms/Hello, this is under HelloNew. This makes the CMakeLists.txt and general directory structure less confusing for people following the example. Much of the doc structure was taken from WritinAnLLVMPass.rst. This adds a HelloWorld pass which simply prints out each function name. More will follow after this, e.g. passes over different units of IR, analyses. https://llvm.org/docs/WritingAnLLVMPass.html contains a lot more. Relanded with missing "Support" dependency in LLVMBuild.txt. Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D86979	2020-09-14 16:06:19 -07:00
Arthur Eubanks	0afd1bce59	Revert "[docs][NewPM] Add docs for writing NPM passes" This reverts commit c2590de30df23ef0db39b496cdec62a83a61fbfa. Breaks shared libs build	2020-09-14 15:55:17 -07:00
Lang Hames	16aabf22d4	[docs] Update OrcV1 removal timeline.	2020-09-14 14:23:20 -07:00
Arthur Eubanks	73dd3483e1	[docs][NewPM] Add docs for writing NPM passes As to not conflict with the legacy PM example passes under llvm/lib/Transforms/Hello, this is under HelloNew. This makes the CMakeLists.txt and general directory structure less confusing for people following the example. Much of the doc structure was taken from WritinAnLLVMPass.rst. This adds a HelloWorld pass which simply prints out each function name. More will follow after this, e.g. passes over different units of IR, analyses. https://llvm.org/docs/WritingAnLLVMPass.html contains a lot more. Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D86979	2020-09-14 13:26:03 -07:00
Balazs Benics	140d801153	[analyzer][docs][NFC] Document the ento namespace in the llvm/Lexicon Document the `ento` namespace in the Lexicon according to @nicolas17 on the mailing list (http://lists.llvm.org/pipermail/cfe-dev/2020-August/066577.html). The analyzer lived at different namespaces at different times. Originally lived at the `GR` aka. (Graph Reachability) namespace [7], later it moved under the `ento` namespace [9]. The Static Analyzer's code lived at many other places as well: `Analysis` -[2]-> `Checker` -[5]-> `GR` -[10]> `entoSA` -[11]-> `StaticAnalyzer` The relevant code motion, refactor commits, cfe-dev mailing in chronological order: 1) 2008-03-15 Make a major restructuring of the clang tree: introduce a ... 7a51313d8a0a358bb92eb5dbf8fd846b7c48e7fe 2) 2010-01-25 Split libAnalysis into two libraries: libAnalysis and libChecker d6b8708643219776b1f0f41df32c5eccf065ed5b 3) 2010-12-21 Reorganization of Checker files http://lists.llvm.org/pipermail/cfe-dev/2010-December/012694.html 4) 2010-12-22 Refactoring: include/clang/Checker -> include/clang/GR 8d602a8aa8e6697509465d8a5473fc41cb1a382e 5) 2010-12-22 Refactoring: lib/Checker -> lib/GR 2ff5ab1516e48c2fff0138f953d887b5e695214b 6) 2010-12-22 Refactoring: Move checkers into lib/GR/Checkers and their own a700e976b658860418bc145ec0bdacd4f1db3264 7) 2010-12-22 Refactoring: Move stuff into namespace 'GR' ca08fba4141f1d3ae6193b3c81fb6ba8fb10d7dc 8) 2010-12-22 Refactoring: Drop the 'GR' prefix. 1696f508e2fe95793ca8bb70d78b88023b6b8625 9) 2010-12-23 Rename static analyzer namespace 'GR' to 'ento' 98857c986078c6e6a10910628dbabf75ae735b76 10) 2010-12-23 Rename headers: 'clang/GR' 'clang/EntoSA' and update Makefile ef33f0996c6a625767690395f3cfb41afb84db5a 11) 2010-12-23 Chris Lattner has strong opinions about directory d99bd55a5e092774214ba31fc5a871bfc31e711c 12) 2010-12-24 Remove the EntoSA directories. 9d6af5328e3a61641a125b17125952fa1a6bf11d Reviewed By: Szelethus,martong,ASDenysPetrov,xazax.hun Differential Revision: https://reviews.llvm.org/D86446	2020-09-14 08:43:56 +02:00
Dave Lee	2be90dca65	[docs] Document LLVM_EXTERNALIZE_DEBUGINFO CMake option Add `LLVM_EXTERNALIZE_DEBUGINFO` to CMake.rst. This should help make dSYM generation more discoverable. Differential Revision: https://reviews.llvm.org/D87591	2020-09-13 21:39:27 -07:00
Sanjay Patel	2c86671523	[Intrinsics] define semantics for experimental fmax/fmin vector reductions As discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140729.html This is hopefully the final remaining showstopper before we can remove the 'experimental' from the reduction intrinsics. No behavior was specified for the FP min/max reductions, so we have a mess of different interpretations. There are a few potential options for the semantics of these max/min ops. I think this is the simplest based on current behavior/implementation: make the reductions inherit from the existing llvm.maxnum/minnum intrinsics. These correspond to libm fmax/fmin, and those are similar to the (now deprecated?) IEEE-754 maxNum/minNum functions (NaNs are treated as missing data). So the default expansion creates calls to libm functions. Another option would be to inherit from llvm.maximum/minimum (NaNs propagate), but most targets just crash in codegen when given those nodes because no default expansion was ever implemented AFAICT. We could also just assume 'nnan' semantics by default (we are already assuming 'nsz' semantics in the maxnum/minnum intrinsics), but some targets (AArch64, PowerPC) support the more defined behavior, so it doesn't make much sense to not allow a tighter spec. Fast-math-flags (nnan) can be used to loosen the semantics. (Note that D67507 was proposed to update the LangRef to acknowledge the more recent IEEE-754 2019 standard, but that patch seems to have stalled. If we do update based on the new standard, the reduction instructions can seamlessly inherit from whatever updates are made to the max/min intrinsics.) x86 sees a regression here on 'nnan' tests because we have underlying, longstanding bugs in FMF creation/propagation. Those need to be fixed apart from this change (for example: https://llvm.org/PR35538). The expansion sequence before this patch may not have been correct. Differential Revision: https://reviews.llvm.org/D87391	2020-09-12 09:10:28 -04:00
YangZhihui	86cfd2d991	[docs] Fix typos Differential Revision: https://reviews.llvm.org/D87356	2020-09-11 17:58:07 +02:00
YangZhihui	e04c95e2e2	Fix typo in dsymutil.rst Differential revision: https://reviews.llvm.org/D87438	2020-09-10 09:46:10 -07:00
Guillaume Chatelet	1b36883d0d	Fix broken link for Sphinx installation	2020-09-10 12:27:49 +00:00
Tony	06c11c6b12	[AMDGPU] Correct gfx1031 XNACK setting documentation - gfx1031 does not support XNACK. Differential Revision: https://reviews.llvm.org/D87198	2020-09-09 19:43:02 +00:00
Nate Voorhies	81112063c3	Insert missing bracket in docs. Body of unrolled loop was missing opening bracket. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D87329	2020-09-08 15:20:39 -07:00
Paul C. Anagnostopoulos	94da56feaf	fix typos; improve a couple of descriptions; add release note	2020-09-08 15:48:18 -04:00
Paul C. Anagnostopoulos	88dabfb171	Add detailed reference for the SearchableTables backend.	2020-09-08 13:48:12 -04:00
Florian Hahn	6276196b86	[LangRef] Adjust guarantee for llvm.memcpy to also allow equal arguments. This adjusts the description of `llvm.memcpy` to also allow operands to be equal. This is in line with what Clang currently expects. This change is intended to be temporary and followed by re-introduce a variant with the non-overlapping guarantee for cases where we can actually ensure that property in the front-end. See the links below for more details: http://lists.llvm.org/pipermail/cfe-dev/2020-August/066614.html and PR11763. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D86815	2020-09-05 19:18:23 +01:00
Yang Zhihui	50256eeb51	Fix typos in doc LangRef.rst Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D87077	2020-09-04 05:17:31 -07:00
JF Bastien	5da192d70b	Step down from security group Propose Ahmed as a replacement. He's fixed many security issues in LLVM for Apple in the last few years, as such he'll fit the "Individual contributors" description. Differential Revision: https://reviews.llvm.org/D86742	2020-09-03 08:44:27 -07:00
Michael Kruse	292847821f	[LangRef] Fix condition for when a loop is considered parallel. The wording before this patch applies to llvm.mem.parallel_loop_access, not access groups. Reviewed By: mppf, hfinkel Differential Revision: https://reviews.llvm.org/D83781	2020-09-01 15:41:59 -05:00
Arthur Eubanks	81eaf47f84	[Bindings] Add LLVMAddInstructionSimplifyPass Reviewed By: sroland Differential Revision: https://reviews.llvm.org/D86764	2020-09-01 12:38:49 -07:00
Hans Wennborg	df821d52a8	First commit on the release/11.x branch.	2020-09-01 11:44:02 -07:00
Arthur Eubanks	8e53f78912	[docs] Fix indentation in FileCheck.rst Fixes C:\src\llvm-project\llvm\docs\CommandGuide\FileCheck.rst:745:Bullet list ends without a blank line; unexpected unindent.	2020-08-31 13:20:04 -07:00
Alexandre Ganea	4be21d8696	Fix sphinx documentation after a6a37a2fcd2a8048a75bd0d8280497ed89d73224	2020-08-31 08:06:13 -04:00
Thomas Preud'homme	14ecb9c6d9	[FileCheck] Add precision to format specifier Add printf-style precision specifier to pad numbers to a given number of digits when matching them if the value is smaller than the given precision. This works on both empty numeric expression (e.g. variable definition from input) and when matching a numeric expression. The syntax is as follows: [[#%.<precision><format specifier>, ...] where <format specifier> is optional and ... can be a variable definition or not with an empty expression or not. In the absence of a precision specifier, a variable definition will accept leading zeros. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D81667	2020-08-30 19:40:57 +01:00
Juneyoung Lee	d99f7f85b1	[LangRef] Apply a missing comment from D86189	2020-08-30 14:56:17 +09:00
Juneyoung Lee	f38cacc4bc	[LangRef] State that storing an aggregate fills padding with undef This patch makes LangRef be explicit about the value of padding when storing an aggregate. It states that when an aggregate is stored into memory, padding is filled with undef. Here is a clue that supports this change (edited to reflect the discussion from llvm-dev): - IPSCCP ignores padding and directly stores a constant aggregate if possible. It loses the data stored in the padding. https://godbolt.org/z/xzenYs Memcpyopt ignores (the preexisting value of) padding when copying an aggregate or storing a constant: https://godbolt.org/z/hY6ndd / https://godbolt.org/z/3WMP5a The two items below are not relevant with this patch because Clang lowers load/store of individual field of struct into load/stores of the corresponding pointer with a primitive type. Also, when copy is needed, it uses memcpy instead of load/store of an aggregate, as discussed in the llvm-dev. However, this patch is still valid (as discussed) because it is needed to explain the two optimizations above. - According to C17, the value of padding bytes when storing values in structures or unions is unspecified. - I updated Alive2 and it did not find any problematic transformation from LLVM unit tests and while running translation validation of a few C programs. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D86189	2020-08-30 14:53:20 +09:00
JF Bastien	3fa9e34bdc	Add an unsigned shift base sanitizer It's not undefined behavior for an unsigned left shift to overflow (i.e. to shift bits out), but it has been the source of bugs and exploits in certain codebases in the past. As we do in other parts of UBSan, this patch adds a dynamic checker which acts beyond UBSan and checks other sources of errors. The option is enabled as part of -fsanitize=integer. The flag is named: -fsanitize=unsigned-shift-base This matches shift-base and shift-exponent flags. <rdar://problem/46129047> Differential Revision: https://reviews.llvm.org/D86000	2020-08-27 19:50:10 -07:00
Alexandre Ganea	1c64f56c35	[Support] On Windows, add optional support for {rpmalloc\|snmalloc\|mimalloc} This patch optionally replaces the CRT allocator (i.e., malloc and free) with rpmalloc (mixed public domain licence/MIT licence) or snmalloc (MIT licence) or mimalloc (MIT licence). Please note that the source code for these allocators must be available outside of LLVM's tree. To enable, use `cmake ... -DLLVM_INTEGRATED_CRT_ALLOC=D:/git/rpmalloc -DLLVM_USE_CRT_RELEASE=MT` where `D:/git/rpmalloc` has already been git clone'd from `https://github.com/mjansson/rpmalloc`. The same applies to snmalloc and mimalloc. When enabled, the allocator will be embeded (statically linked) into the LLVM tools & libraries. This currently only works with the static CRT (/MT), although using the dynamic CRT (/MD) could potentially work as well in the future. When enabled, this changes the memory stack from: new/delete -> MS VC++ CRT malloc/free -> HeapAlloc -> VirtualAlloc to: new/delete -> {rpmalloc\|snmalloc\|mimalloc} -> VirtualAlloc The goal of this patch is to bypass the application's global heap - which is thread-safe thus inducing locking - and instead take advantage of a modern lock-free, thread cache, allocator. On a 6-core Xeon Skylake we observe a 2.5x decrease in execution time when linking a large scale application with LLD and ThinLTO (12 min 20 sec -> 5 min 34 sec), when all hardware threads are being used (using LLD's flag /opt:lldltojobs=all). On a dual 36-core Xeon Skylake with all hardware threads used, we observe a 24x decrease in execution time (1 h 2 min -> 2 min 38 sec) when linking a large application with LLD and ThinLTO. Clang build times also see a decrease in the range 5-10% depending on the configuration. Differential Revision: https://reviews.llvm.org/D71786	2020-08-27 11:09:46 -04:00
Sjoerd Meijer	7ef96b82c5	Follow up of rGca243b07276a: fixed a typo. NFC.	2020-08-27 10:53:41 +01:00
Sjoerd Meijer	fe5ce52722	[LangRef] get.active.lane.mask can produce poison value We had already specified that second argument `n` of this intrinsic is `n > 0`, but now add to this that the result is a poison value if this is not the case. Differential Revision: https://reviews.llvm.org/D86637	2020-08-27 08:57:35 +01:00
Craig Topper	7a5934b151	[X86] Update release notes for -mtune support.	2020-08-26 16:16:56 -07:00
Arthur Eubanks	d74ec65308	[ConstProp] Remove ConstantPropagation As discussed in http://lists.llvm.org/pipermail/llvm-dev/2020-July/143801.html. Currently no users outside of unit tests. Replace all instances in tests of -constprop with -instsimplify. Notable changes in tests: * vscale.ll - @llvm.sadd.sat.nxv16i8 is evaluated by instsimplify, use a fake intrinsic instead * InsertElement.ll - insertelement undef is removed by instsimplify in @insertelement_undef llvm/test/Transforms/ConstProp moved to llvm/test/Transforms/InstSimplify/ConstProp Reviewed By: lattner, nikic Differential Revision: https://reviews.llvm.org/D85159	2020-08-26 15:51:30 -07:00
Juneyoung Lee	caa42f61a0	[LangRef] Memset/memcpy/memmove can take undef/poison pointer if the size is 0 According to the current LangRef, Memset/memcpy/memmove can take a null/dangling pointer if the size is zero. (Relevant thread: http://lists.llvm.org/pipermail/llvm-dev/2017-July/115665.html ) This patch expands it and allows the functions to take undef/poison pointers too. This required the updates in the align attribute since it isn't specified what is the alignment of undef/poison pointers. This patch states that their alignment is 1. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D86643	2020-08-27 06:19:28 +09:00
Craig Topper	452ef84095	[X86] Add assembler support for .d32 and .d8 mnemonic suffixes to control displacement size. This is an older syntax than the {disp32} and {disp8} pseudo prefixes that were added a few weeks ago. We can reuse most of the support for that to support .d32 and .d8 as well.	2020-08-26 10:45:50 -07:00
Shoaib Meenai	ae601115d9	[llvm-libtool-darwin] Address post-commit feedback Address James Henderson's comments on https://reviews.llvm.org/D86359.	2020-08-25 15:04:23 -07:00
Craig Topper	037bbf9042	[X86] Mention -march=sapphirerapids in the release notes. This was just added in e02d081f2b60b61eb60ef6a49b1a9f907e432d4c.	2020-08-25 11:57:34 -07:00
Sjoerd Meijer	ac0a363b76	[LangRef] Revise semantics of intrinsic get.active.lane.mask A first version of get.active.lane.mask was committed in rG7fb8a40e5220. One of the main purposes and uses of this intrinsic is to communicate information from the middle-end to the back-end, but its current definition and semantics make this actually very difficult. The intrinsic was defined as: @llvm.get.active.lane.mask(%IV, %BTC) where %BTC is the Backedge-Taken Count (variable names are different in the LangRef spec). This allows to implicitly communicate the loop tripcount, which can be reconstructed by calculating BTC + 1. But it has been very difficult to prove that calculating BTC + 1 is safe and doesn't overflow. We need complicated range and SCEV analysis, and thus the problem is that this intrinsic isn't really doing what it was supposed to solve. Examples of the overflow checks that are required in the (ARM) back-end are D79175 and D86074, which aren't even complete/correct yet. To solve this problem, we are revising the definitions/semantics for get.active.lane.mask to avoid all the complicated overflow analysis. This means that instead of communicating the BTC, we are now using the loop tripcount. Now using LangRef's variable names, its semantics is changed from: icmp ule (%base + i), %n to: icmp ult (%base + i), %n with %n > 0 and corresponding to the loop tripcount. The intrinsic signature remains the same. Differential Revision: https://reviews.llvm.org/D86147	2020-08-25 16:23:51 +01:00
Yang Zhihui	cd03b849a7	[FileCheck][docs] Fix word errors ouput -> output Reviewed By: thopre Differential Revision: https://reviews.llvm.org/D86504	2020-08-25 09:53:52 +01:00
vnalamot	8a056d6164	[AMDGPU, docs] Fix typos Reviewed By: t-tye, Flakebi Differential Revision: https://reviews.llvm.org/D86340	2020-08-25 00:00:23 +05:30
Sourabh Singh Tomar	9241b8151b	[DebugInfo][flang]Added support for representing Fortran assumed length strings This patch adds support for representing Fortran `character(n)`. Primarily patch is based out of D54114 with appropriate modifications. Test case IR is generated using our downstream classic-flang. We're in process of upstreaming flang PR's but classic-flang has dependencies on llvm, so this has to get in first. Patch includes functional test case for both IR and corresponding dwarf, furthermore it has been manually tested as well using GDB. Source snippet: ``` program assumedLength call sub('Hello') call sub('Goodbye') contains subroutine sub(string) implicit none character(len=), intent(in) :: string print , string end subroutine sub end program assumedLength ``` GDB: ``` (gdb) ptype string type = character (5) (gdb) p string $1 = 'Hello' ``` Reviewed By: aprantl, schweitz Differential Revision: https://reviews.llvm.org/D86305	2020-08-22 10:13:40 +05:30
Paul C. Anagnostopoulos	0f17e8fd6b	Replace TableGen range piece punctuator with '...' The TableGen range piece punctuator is currently '-' (e.g., {0-9}), which interacts oddly with the fact that an integer literal's sign is part of the literal. This patch replaces the '-' with the new punctuator '...'. The '-' punctuator is deprecated. Differential Revision: https://reviews.llvm.org/D85585 Change-Id: I3d53d14e23f878b142d8f84590dd465a0fb6c09c	2020-08-21 23:33:57 +02:00
Paul C. Anagnostopoulos	a24cdd8596	New TableGen Programmer's Reference document This new TableGen Programmer's Reference document replaces the current Language Introduction and Language Reference documents. It brings all the TableGen reference information into one document. As an experiment, I numbered the sections in the document. See what you think about that. Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D85838 (changes by Nicolai Hähnle <nicolai.haehnle@amd.com>: - fixed build error due to toctree in docs/LangRef/index.rst - fixed reference to ProgRef) Change-Id: Ifbdfa39768b8a460aae2873103d31c7b347aff00	2020-08-21 23:18:32 +02:00
Dmitry Preobrazhensky	50947152c2	[AMDGPU][MC][NFC][DOC] Updated AMD GPU assembler syntax description. Summary of changes: - added description of MTBUF instructions and format modifier; - described limitations of f16 inline constants when used with integer operands; - updated description of gfx9+ flat global addressing modes; - v_accvgpr_write_b32 src0 corrections (gfx908); - minor bugfixing and improvements.	2020-08-21 14:25:14 +03:00
Tony	67e779a7ea	[AMDGPU] Correct DWARF register defintions - Rename AMDGPU SCC DWARF register to STATUS since the scalar condition code is a bit within the STATUS register. - Correct bit size of the VCC_64 register to 64 which is the size in wave64 mode. Differential Revision: https://reviews.llvm.org/D86259	2020-08-20 01:15:04 +00:00
Florian Hahn	7dd87cb5e7	[docs] Clarify ENABLE_MODULES uses Clang Header Modules. Suggested post-commit by @dblaikie, thanks!	2020-08-19 17:38:34 +01:00
madhur13490	cc99379745	[NFC] Fix typo in AMDGPU doc Reviewed By: t-tye, arsenm Differential Revision: https://reviews.llvm.org/D86206	2020-08-19 14:33:26 +00:00
Hongtao Yu	029f749bc7	[llvm-objdump] Attempt to fix html doc generation issue. https://reviews.llvm.org/D84191 caused a html doc build issue with the changes in `llvm-objdump.rst`. It looks like a blank line is missing from the `code-block` directives. Test Plan: Differential Revision: https://reviews.llvm.org/D86123	2020-08-17 18:06:22 -07:00
Hongtao Yu	43bf988191	[llvm-objdump] Symbolize binary addresses for low-noisy asm diff. When diffing disassembly dump of two binaries, I see lots of noises from mismatched jump target addresses and global data references, which unnecessarily causes diffs on every function, making it impractical. I'm trying to symbolize the raw binary addresses to minimize the diff noise. In this change, a local branch target is modeled as a label and the branch target operand will simply be printed as a label. Local labels are collected by a separate pre-decoding pass beforehand. A global data memory operand will be printed as a global symbol instead of the raw data address. Unfortunately, due to the way the disassembler is set up and to be less intrusive, a global symbol is always printed as the last operand of a memory access instruction. This is less than ideal but is probably acceptable from checking code quality point of view since on most targets an instruction can have at most one memory operand. So far only the X86 disassemblers are supported. Test Plan: llvm-objdump -d --x86-asm-syntax=intel --no-show-raw-insn --no-leading-addr : ``` Disassembly of section .text: <_start>: push rax mov dword ptr [rsp + 4], 0 mov dword ptr [rsp], 0 mov eax, dword ptr [rsp] cmp eax, dword ptr [rip + 4112] # 202182 <g> jge 0x20117e <_start+0x25> call 0x201158 <foo> inc dword ptr [rsp] jmp 0x201169 <_start+0x10> xor eax, eax pop rcx ret ``` llvm-objdump -d --symbolize-operands --x86-asm-syntax=intel --no-show-raw-insn --no-leading-addr : ``` Disassembly of section .text: <_start>: push rax mov dword ptr [rsp + 4], 0 mov dword ptr [rsp], 0 <L1>: mov eax, dword ptr [rsp] cmp eax, dword ptr <g> jge <L0> call <foo> inc dword ptr [rsp] jmp <L1> <L0>: xor eax, eax pop rcx ret ``` Note that the jump instructions like `jge 0x20117e <_start+0x25>` without this work is printed as a real target address and an offset from the leading symbol. With a change in the optimizer that adds/deletes an instruction, the address and offset may shift for targets placed after the instruction. This will be a problem when diffing the disassembly from two optimizers where there are unnecessary false positives due to such branch target address changes. With `--symbolize-operand`, a label is printed for a branch target instead to reduce the false positives. Similarly, the disassemble of PC-relative global variable references is also prone to instruction insertion/deletion. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D84191	2020-08-17 16:55:12 -07:00
Matt Arsenault	4a4ddffaf1	GlobalISel: Make type for lower action more consistently optional Some of the lower implementations were relying on this, however the type was not set depending on which form .lower* helper form you were using. For instance, if you used an unconditonal lower(), the type was never set. Most of the lower actions do not benefit from a type parameter, and just expand in terms of the original operation's types. However, some lowerings could benefit from an additional type hint to combine a promotion and an expansion. An example of this is for add/sub sat. The DAG integer legalization tries to use smarter expansions directly when promoting the integer type, and doesn't always produce the same instruction with a wider type. Treat this as an optional hint argument, that only means something for specific lower actions. It may be useful to generalize this mechanism to pass a full list of type indexes and desired types, but I haven't run into a case like that yet.	2020-08-17 16:24:55 -04:00
Philip Reames	87fc3812c0	Remove inline gc arguments from statepoints The "gc-live" operand bundles were recently added, and all tests have been updated to use that format. A migration period was provided, though it's worth noting these intrinsics are experimental, so formally there is no compatibile requirement. This is an extension to a96fc46. "gc-live" hadn't been implemented at the point that patch was initially posted.	2020-08-14 19:44:24 -07:00
Philip Reames	340c7ffdef	Remove deopt and gc transition arguments from gc.statepoint intrinsic (Forgot to land this a couple of weeks back.) In a recent series of changes, I've introduced support for using the respective operand bundle kinds on the statepoint. At the moment, code supports either/or, but there's no need to keep the old support around. For the moment, I am simply changing the specification and verifier to require zero length argument sets in the intrinsic. The intrinsic itself is experimental. Given that, there's no forward serialization needed. The in tree uses and generation have already been updated to use the new operand bundle based forms, the only folks broken by the change will be those with frontends generating statepoints directly and the updates should be easy. Why not go ahead and just remove the arguments entirely? Well, I plan to. But while working on this I've found that almost all of the arguments to the statepoint can be expressed via operand bundles or attributes. Given that, I'm planning a radical simplification of the arguments and figured I'd do one update not several small ones. Differential Revision: https://reviews.llvm.org/D80892	2020-08-14 16:07:40 -07:00
Sameer Arora	50c525de69	[llvm-libtool-darwin] Add support for -l and -L Add support for passing in libraries via `-l` and `-L` options to `llvm-libtool-darwin`. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D85540	2020-08-14 11:44:17 -07:00
Sameer Arora	7aa42c2f0c	[llvm-libtool-darwin] Add support for -arch_only Add support for -arch_only option for llvm-libtool-darwin. This diff also adds support for accepting universal files as input and flattening them to create the required static library. Supports input universal files contaning both Mach-O object files or archives. Differences from cctools' libtool: - `-arch_only` can be specified multiple times - archives containing universal files are considered invalid (libtool allows such archives) Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D84770	2020-08-13 11:08:46 -07:00
Sameer Arora	5e902e2307	[llvm-install-name-tool] Add more documentation Add documentation for the remaining options of `llvm-install-name-tool`. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D85655	2020-08-13 10:47:47 -07:00
Sebastian Neubauer	f6c931c6b8	[AMDGPU] Fix typo. NFC	2020-08-13 10:41:48 +02:00
Jay Foad	928c1dd7ef	[GlobalISel] Add G_ABS This is equivalent to the new llvm.abs intrinsic added by D84125 with is_int_min_poison=0. Differential Revision: https://reviews.llvm.org/D85718	2020-08-11 16:34:37 +01:00
Dávid Bolvanský	27e916d19d	[Docs] Fixed missing closing quote character	2020-08-11 11:21:15 +02:00
Fangrui Song	d551fbf23e	[llvm-symbolizer] Add back --version and add a -v alias The switch from llvm::cl to OptTable (D83530) dropped --version, which is needed by some users. This patch also adds a -v alias, which is available in GNU addr2line. The version dumping is similar to llvm-objcopy --version (exotic): ``` llvm-symbolizer LLVM (http://llvm.org/): LLVM version 12.0.0git Optimized build with assertions. Default target: x86_64-unknown-linux-gnu Host CPU: skylake-avx512 ``` Reviewed By: dyung, jhenderson Differential Revision: https://reviews.llvm.org/D85624	2020-08-10 08:21:43 -07:00
Kazu Hirata	fc9284d5ff	[docs] Fix typos	2020-08-09 19:31:49 -07:00
Sameer Arora	629a98ce69	[llvm-libtool-darwin] Add support for -D and -U options Add support for `-D` and `-U` options for llvm-libtool-darwin. `-D` allows for using zero for timestamps and UIDs/GIDs. `-U` allows for using actual timestamps and UIDs/GIDs. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D84209	2020-08-07 14:44:32 -07:00
Sameer Arora	a753e1d6dd	[llvm-libtool-darwin] Add support for -filelist option Add support for `-filelist` option for llvm-libtool-darwin. `-filelist` option allows for passing in a file containing a list of filenames. Reviewed by jhenderson, smeenai Differential Revision: https://reviews.llvm.org/D84206	2020-08-07 14:29:24 -07:00
Sameer Arora	218d68982d	[FileCheck] Add docs for --allow-empty This diff adds documentation for `allow-empty` flag under FileCheck docs. Reviewed by jhenderson, smeenai, thopre Differential Revision: https://reviews.llvm.org/D83682	2020-08-07 13:27:57 -07:00
Sameer Arora	8d6c074d8c	[llvm-install-name-tool] Adds docs for llvm-install-name-tool Adding documentation for llvm-install-name-tool. Reviewed by smeenai, Ktwu Differential Revision: https://reviews.llvm.org/D81944	2020-08-07 12:51:58 -07:00
Bevin Hansson	7c243aea4b	[Intrinsic] Add sshl.sat/ushl.sat, saturated shift intrinsics. Summary: This patch adds two intrinsics, llvm.sshl.sat and llvm.ushl.sat, which perform signed and unsigned saturating left shift, respectively. These are useful for implementing the Embedded-C fixed point support in Clang, originally discussed in http://lists.llvm.org/pipermail/llvm-dev/2018-August/125433.html and http://lists.llvm.org/pipermail/cfe-dev/2018-May/058019.html Reviewers: leonardchan, craig.topper, bjope, jdoerfert Subscribers: hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83216	2020-08-07 15:09:24 +02:00
Bevin Hansson	71fc113f30	[LangRef] Minor fixes to intrinsic headers and descriptions. NFC.	2020-08-07 15:09:24 +02:00
Nico Weber	20496ec614	fix doc typo to cycle bots	2020-08-06 21:02:41 -04:00
Tony	9357771d35	[AMDGPU] Correct missing sram-ecc target feature for gfx906 Differential Revision: https://reviews.llvm.org/D85476	2020-08-06 22:12:25 +00:00
Stanislav Mekhanoshin	d688e1d62e	[AMDGPU] gfx1031 target Differential Revision: https://reviews.llvm.org/D85337	2020-08-05 12:36:26 -07:00
Matt Morehouse	1a65e8de72	Revert "Add libFuzzer shared object build output" This reverts commit 98d91aecb26a51225242332e73ed454c0f6cac5e since it breaks on platforms without libstdc++.	2020-08-05 12:11:24 -07:00
Matt Morehouse	31a4f48675	Add libFuzzer shared object build output This change adds a CMake rule to produce shared object versions of libFuzzer (no-main). Like the static library versions, these shared libraries have a copy of libc++ statically linked in. For i386 we don't link with libc++ since i386 does not support mixing position- independent and non-position-independent code in the same library. Patch By: IanPudney Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D84947	2020-08-05 09:03:22 -07:00
Hans Wennborg	c3fca9996d	Bump forgotten version nbr in llvm/docs/conf.py	2020-08-05 17:11:59 +02:00
Jordan Rupprecht	8003b49524	[docs] Document pattern of using CHECK-SAME to skip irrelevant lines This came up during the review for D67656. It's nice but also subtle, so documenting it as an idiom will make tests easier to understand. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D68061	2020-08-05 11:03:56 +01:00
Florian Hahn	3ac0102ef0	[docs] Mention LLVM_ENABLE_MODULES.	2020-08-04 16:59:39 +01:00
Fangrui Song	f6b39f50d1	[llvm-symbolizer] Switch command line parsing from llvm::cl to OptTable for the advantage outlined by D83639 ([OptTable] Support grouped short options) Some behavior changes: * -i={0,false} is removed. Use --no-inlines instead. * --demangle={0,false} is removed. Use --no-demangle instead * -untag-addresses={0,false} is removed. Use --no-untag-addresses instead Added a higher level API OptTable::parseArgs which handles optional initial options populated from an environment variable, expands response files recursively, and parses options. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83530	2020-08-04 08:53:15 -07:00
Simon Pilgrim	1832d64356	Fix sphinx "Title underline too short" warning	2020-08-04 16:36:00 +01:00
Simon Pilgrim	0fc57afbbe	Fix sphinx indentation warning to stop newline in byref section html output.	2020-08-04 16:12:50 +01:00
Simon Pilgrim	5f8db00af1	Fix sphinx indentation warning. Don't double indent and make it clear we're referting to the latency mode.	2020-08-04 15:57:46 +01:00
Fangrui Song	5554745259	Add test utility 'split-file' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. This patch has explored several alternatives. The current semantics are similar to what @dblaikie proposed. `split-file filename output` splits the input file into multiple parts separated by regex `^(.\|//)--- filename` and write each part to the file `output/filename` (`filename` can include path separators). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` # RUN: split-file %s %t # RUN: llvm-mc %t/asm -o %t.o # RUN: ld.lld -T %t/lds %t.o -o %t This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and don't have to open another file. # asm ... # lds ... ``` Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: split-file %s %t // RUN: llc < %t/1.ll \| FileCheck %s --check-prefix=CASE1 // RUN: llc < %t/2.ll \| FileCheck %s --check-prefix=CASE2 Combing tests prudently can improve readability. For example, when testing parsing errors if the recovery mechanism isn't possible, grouping the tests in one file can more readily see test coverage/strategy. //--- 1.ll ... //--- 2.ll ... ``` Since this is a new utility, there is no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson, lattner Differential Revision: https://reviews.llvm.org/D83834	2020-08-03 20:42:09 -07:00
Florian Hahn	5eea1c2f70	Recommit "[IPConstProp] Remove and move tests to SCCP." This reverts commit 59d6e814ce0e7b40b7cc3ab136b9af2ffab9c6f8. The cause for the revert (3 clang tests running opt -ipconstprop) was fixed by removing those lines.	2020-08-02 22:23:54 +01:00
Fangrui Song	bf5334b827	[Support][CommandLine] Delete unused llvm:🆑:ParseEnvrironmentOptions The function was added in 2003. It is not used and can be emulated with ParseCommandLineOptions.	2020-07-31 10:48:09 -07:00
Mircea Trofin	629123ee54	[doc] Describe the header guard style clang-tidy's llvm-header-guard rule references the LLVM style - where it's missing. Differential Revision: https://reviews.llvm.org/D84989	2020-07-30 16:08:07 -07:00
Florian Hahn	1316430704	Revert "[IPConstProp] Remove and move tests to SCCP." This reverts commit e77624a3be942c7abba48942b3a8da3462070a3f. Looks like some clang tests manually invoke -ipconstprop via opt.....	2020-07-30 13:06:54 +01:00
Florian Hahn	ce3655671a	[IPConstProp] Remove and move tests to SCCP. As far as I know, ipconstprop has not been used in years and ipsccp has been used instead. This has the potential for confusion and sometimes leads people to spend time finding & reporting bugs as well as updating it to work with the latest API changes. This patch moves the tests over to SCCP. There's one functional difference I am aware of: ipconstprop propagates for each call-site individually, so for functions that are called with different constant arguments it can sometimes produce better results than ipsccp (at much higher compile-time cost).But IPSCCP can be thought to do so as well for internal functions and as mentioned earlier, the pass seems unused in practice (and there are no plans on working towards enabling it anytime). Also discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2020-July/143773.html Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D84447	2020-07-30 12:36:27 +01:00
Tony	66a4f3eb09	[AMDGPU] Fix DWARF extensions User Guide table of contents	2020-07-30 05:10:21 +00:00
Tony	57dd67ea2c	[AMDGPU] DWARF proposal changes - Clarify that these are extensions to DWARF 5 and not as yet a proposal. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 05:07:09 +00:00
Tony	a3714ce03c	[AMDGPU] DWARF proposal changes for expression context - Clarify what context is used in DWARF expression evaluation. - Define location descriptions to fully resolve the context and so include the context in their result. - As a consequence of location descriptions being fully resoved, change address spaces so only a swizzled and unswizzled private address space is defined. The lane is now part of the location description context. - Clarify how call frame information is used to fully resolve expressions that specify registers. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 01:59:22 +00:00
Varun Gandhi	830a8aba04	[docs] [lit] Add a more helpful description for lit.py's -s flag. Reviewed By: yln Differential Revision: https://reviews.llvm.org/D82808	2020-07-28 14:36:03 -07:00
Fangrui Song	8a55d3cc1c	Revert D83834 "Add test utility 'extract'" This reverts commit d054c7ee2e9f4f98af7f22a5b00a941eb919bd59. There are discussions about the utility name, its functionality and user interface. Revert before we reach consensus.	2020-07-28 13:26:33 -07:00
Arthur Eubanks	8a584da153	[FunctionAttrs] Rename functionattrs -> function-attrs To match NewPM pass name, and also for readability. Also rename rpo-functionattrs -> rpo-function-attrs while we're here. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D84694	2020-07-28 09:09:13 -07:00
Jinsong Ji	a3d207d6bc	Re-land "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit bf544fa1c3cb80f24d85e84559fb11193846259f. Fixed the typo in PPCInstrInfo.cpp.	2020-07-28 14:00:11 +00:00
Wei Mi	51d4708437	Supplement instr profile with sample profile. PGO profile is usually more precise than sample profile. However, PGO profile needs to be collected from loadtest and loadtest may not be representative enough to the production workload. Sample profile collected from production can be used as a supplement -- for functions cold in loadtest but warm/hot in production, we can scale up the related function in PGO profile if the function is warm or hot in sample profile. The implementation contains changes in compiler side and llvm-profdata side. Given an instr profile and a sample profile, for a function cold in PGO profile but warm/hot in sample profile, llvm-profdata will either mark all the counters in the profile to be -1 or scale up the max count in the function to be above hot threshold, depending on the zero counter ratio in the profile. The assumption is if there are too many counters being zero in the function profile, the profile is more likely to cause harm than good, then llvm-profdata will mark all the counters to be -1 indicating the function is hot but the profile is unaccountable. In compiler side, if a function profile with all -1 counters is seen, the function entry count will be set to be above hot threshold but its internal profile will be dropped. In the long run, it may be useful to let compiler support using PGO profile and sample profile at the same time, but that requires more careful design and more substantial changes to make two profiles work seamlessly. The patch here serves as a simple intermediate solution. Differential Revision: https://reviews.llvm.org/D81981	2020-07-27 20:17:40 -07:00
Jinsong Ji	89408b2ab3	Revert "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit adffce71538e219aab4eeb024819baa7687262ff. This is breaking test-suite, revert while investigation.	2020-07-27 21:07:00 +00:00
Jinsong Ji	2d65e976a4	[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support Per RFC http://lists.llvm.org/pipermail/llvm-dev/2020-April/141295.html no one is making use of QPX/A2Q/BGQ/BGP CNK anymore. This patch remove the support of QPX/A2Q in llvm, BGQ/BGP in clang, CNK support in openmp/polly. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D83915	2020-07-27 19:24:39 +00:00
Matt Morehouse	964edac32c	Replace fuzzer::FuzzerDriver's INTERFACE marking with new LLVMRunFuzzerDriver. This adds a new extern "C" function that serves the same purpose. This removes the need for external users to depend on internal headers in order to use this feature. It also standardizes the interface in a way that other fuzzing engines will be able to match. Patch By: IanPudney Reviewed By: kcc Differential Revision: https://reviews.llvm.org/D84561	2020-07-27 18:38:04 +00:00
Vy Nguyen	0724050861	Reland [llvm-exegesis] Add benchmark latency option on X86 that uses LBR for more precise measurements. Starting with Skylake, the LBR contains the precise number of cycles between the two consecutive branches. Making use of this will hopefully make the measurements more precise than the existing methods of using RDTSC. Differential Revision: https://reviews.llvm.org/D77422 New change: check for existence of field `cycles` in perf_branch_entry before enabling this mode. This should prevent compilation errors when building for older kernel whose headers don't support it.	2020-07-27 12:38:05 -04:00
Afanasyev Ivan	2f03196312	[Docs] remove unused arguments in documentation examples on vectorization passes Reviewers: nadav, tyler.nowicki Reviewed By: nadav Differential Revision: https://reviews.llvm.org/D83851	2020-07-27 10:20:26 +01:00

... 3 4 5 6 7 ...

8661 Commits