llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 10:42:39 +01:00

Author	SHA1	Message	Date
Matt Arsenault	2e74ccd44a	GlobalISel: Update documentation	2021-05-05 17:35:02 -04:00
Pooja Yadav	dfdd7885ea	[docs] Update the llvm/example section Added details about the llvm/example section. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D101284	2021-05-05 21:33:14 +05:30
Sushma Unnibhavi	53774cc7a7	[DOCS] Added example for G_EXTRACT and G_INSERT Reviewed By: xgupta, gargaroff Differential Revision: https://reviews.llvm.org/D101227	2021-05-05 15:47:35 +05:30
Fangrui Song	49266e7300	[llvm-objdump] Add -M {att,intel} & deprecate --x86-asm-syntax={att,intel} The internal `cl::opt` option --x86-asm-syntax sets the AsmParser and AsmWriter dialect. The option is used by llc and llvm-mc tests to set the AsmWriter dialect. This patch adds -M {att,intel} as GNU objdump compatible aliases (PR43413). Note: the dialect is initialized when the MCAsmInfo is constructed. `MCInstPrinter::applyTargetSpecificCLOption` is called too late and its MCAsmInfo reference is const, so changing the `cl::opt` in `MCInstPrinter::applyTargetSpecificCLOption` is not an option, at least without large amount of refactoring. Reviewed By: hoy, jhenderson, thakis Differential Revision: https://reviews.llvm.org/D101695	2021-05-05 00:20:41 -07:00
Alina Sbirlea	f3aac64c1f	Add cal entry for MemorySSA syncs.	2021-05-04 12:56:06 -07:00
Alina Sbirlea	91efc4eb34	Add monthly MemorySSA sync.	2021-05-04 11:23:36 -07:00
Arthur Eubanks	d13aef09f1	[docs] Fix some wording	2021-05-04 10:21:38 -07:00
gbreynoo	5291a705c1	[llvm-objdump] Remove --cfg option from command guide The llvm-objdump command guide has the option --cfg which was removed from the tool by 888320e9fa5eb33194c066f68d50f1e73c5fff5e in 2014. This change updates the command guide to reflect this. Differential Revision: https://reviews.llvm.org/D101648	2021-05-04 16:42:13 +01:00
Fraser Cormack	1c9a07e9b0	[LangRef] Fix a typo in the vector-type memory layout section	2021-05-04 15:40:53 +01:00
Arthur Eubanks	0883647021	[docs][NewPM] Add section on analyses Reviewed By: asbirlea, ychen Differential Revision: https://reviews.llvm.org/D100912	2021-05-03 10:15:02 -07:00
Christian Kühnel	dd38cc31c5	[doc] typo fixes as proposed by @FlashSheridan in https://reviews.llvm.org/rG7f9717b922d4	2021-05-03 10:59:51 +02:00
Nick Desaulniers	b45152915e	[llvm-objdump] add -v alias for --version Used by the Linux kernel's CONFIG_X86_DECODER_SELFTEST. Link: https://github.com/ClangBuiltLinux/linux/issues/1130 Reviewed By: MaskRay, jhenderson, rupprecht Differential Revision: https://reviews.llvm.org/D101483	2021-04-30 11:26:36 -07:00
Pooja Yadav	b5839a5bcf	[docs]Added llvm/bindings section Added information about language bindings provided by LLVM. Reviewed By: xgupta, gandhi21299 Differential Revision: https://reviews.llvm.org/D101295	2021-04-30 19:05:22 +05:30
Jonas Devlieghere	ffe6466b17	[dsymutil] Add flag to force a static variable to keep its enclosing function Add a flag to change dsymutil's behavior and force a static variable to keep its enclosing function. The test shows a situation where that could be useful. I'm not convinced this behavior makes sense as a default, which is why it's behind a flag. rdar://74918374 Differential revision: https://reviews.llvm.org/D101337	2021-04-28 11:33:04 -07:00
Paul C. Anagnostopoulos	545a4d4a19	[TableGen] Add the !find bang operator !find searches a source string for a target string and returns the position. Differential Revision: https://reviews.llvm.org/D101318	2021-04-28 09:51:00 -04:00
Ahmed Bougacha	4a04d21325	[docs] Replace Apple representative to security group. Differential Revision: https://reviews.llvm.org/D100864	2021-04-27 11:00:49 -07:00
Christian Kühnel	6d2a6bfee4	[doc] added documentation for pre-merge testing fixes https://github.com/google/llvm-premerge-checks/issues/275 Differential Revision: https://reviews.llvm.org/D100936	2021-04-27 16:53:16 +02:00
Pooja Yadav	b7bdf617bd	[Docs] Updated LLVM_TARGETS_TO_BUILD section in GettingStarted.rst Updated LLVM_TARGETS_TO_BUILD under https://llvm.org/docs/GettingStarted.html#local-llvm-configuration. Differential Revision: https://reviews.llvm.org/D101101	2021-04-24 00:31:43 +05:30
Paul C. Anagnostopoulos	3b840f40c3	[TableGen] [docs] Improve BNF for the 'multiclass' statement [NFC]	2021-04-23 12:05:52 -04:00
Paul C. Anagnostopoulos	3d33c078f3	[TableGen] [docs] Improve description of NAME in Programmer's Reference Also use "parent class" consistently and add a note about the term. Differential Revision: https://reviews.llvm.org/D100867	2021-04-23 09:49:17 -04:00
Thomas Preud'homme	71007d77ca	[doc] Clarify constrained fcmps behavior Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D101053	2021-04-23 11:55:20 +01:00
Fangrui Song	c83fe04e08	[IR][sanitizer] Add module flag "frame-pointer" and set it for cc1 -mframe-pointer={non-leaf,all} The Linux kernel objtool diagnostic `call without frame pointer save/setup` arise in multiple instrumentation passes (asan/tsan/gcov). With the mechanism introduced in D100251, it's trivial to respect the command line -m[no-]omit-leaf-frame-pointer/-f[no-]omit-frame-pointer, so let's do it. Fix: https://github.com/ClangBuiltLinux/linux/issues/1236 (tsan) Fix: https://github.com/ClangBuiltLinux/linux/issues/1238 (asan) Also document the function attribute "frame-pointer" which is long overdue. Differential Revision: https://reviews.llvm.org/D101016	2021-04-22 18:07:30 -07:00
Keith Smiley	82ce0102d8	llvm-objdump: add --rpaths to macho support This prints the rpaths for the given binary Reviewed By: kastiglione Differential Revision: https://reviews.llvm.org/D100681	2021-04-22 16:01:10 -07:00
Evgeniy Brevnov	d5e146fe48	Wordsmith the semantics of invariant.load Don't phrase the semantics in terms of the optimizer. Instead have a more straightforward execution based semantic. Reviewed By: ebrevnov Differential Revision: https://reviews.llvm.org/D63439	2021-04-22 10:06:13 +07:00
Christian Kühnel	a6b448f809	[NFC] fixed link in documentation	2021-04-21 10:17:03 +02:00
Christian Kühnel	271e47c106	added section on CI system Add documentation for working with the CI systems. This is based on the discussion in the Infrastructure Working Group: https://github.com/ChristianKuehnel/iwg-workspace/issues/37 Differential Revision: https://reviews.llvm.org/D97389	2021-04-21 09:59:41 +02:00
David Sherwood	1ea67899d6	[Docs] Fix formatting issue for llvm.experimental.stepvector in LangRef The llvm.experimental.stepvector section was missing the '^^^' line underneath the intrinsic name.	2021-04-21 08:42:40 +01:00
Nico Weber	74a701ab05	[llvm-objdump] Add an llvm-otool tool This implements an LLVM tool that's flag- and output-compatible with macOS's `otool` -- except for bugs, but from testing with both `otool` and `xcrun otool-classic`, llvm-otool matches vanilla otool's behavior very well already. It's not 100% perfect, but it's a very solid start. This uses the same approach as llvm-objcopy: llvm-objdump uses a different OptTable when it's invoked as llvm-otool. This is possible thanks to D100433. Differential Revision: https://reviews.llvm.org/D100583	2021-04-20 08:24:58 -04:00
Luo, Yuanke	c35d11aa02	[X86][AMX] Add description of x86_amx to LangRef. Differential Revision: https://reviews.llvm.org/D100032	2021-04-20 14:29:17 +08:00
xgupta	e9c576eeca	[Docs] Mention LLVM_EXPERIMENTAL_TARGETS_TO_BUILD variable in CMake.rst Beginners might not aware of this variable and wanted to try a new experimental target. Although this variable mention in Writing a Backend Documentation. But it becomes easy to search when listed in cmake.rst doc where most variables are listed. Reviewed By: myhsu Differential Revision: https://reviews.llvm.org/D100729	2021-04-20 09:27:57 +05:30
Paul C. Anagnostopoulos	0d02a45ced	[TableGen] Add support for the 'assert' statement in multiclasses This is step 3 of adding the 'assert' statement. Differential Revision: https://reviews.llvm.org/D99751	2021-04-19 09:01:42 -04:00
xgupta	79c9c7433b	[Docs] Correct Boehm collector weblink in GarbageCollection.rst	2021-04-18 17:30:17 +05:30
Philip Reames	d5252f4ce6	[nofree] Restrict semantics to memory visible to caller This patch clarifies the semantics of the nofree function attribute to make clear that it provides an "as if" semantic. That is, a nofree function is guaranteed not to free memory which existed before the call, but might allocate and then deallocate that same memory within the lifetime of the callee. This is the result of the discussion on llvm-dev under the thread "Ambiguity in the nofree function attribute". The most important part of this change is the LangRef wording. The rest is minor comment changes to emphasize the new semantics where code was accidentally consistent, and fix one place which wasn't consistent. That one place is currently narrowly used as it is primarily part of the ongoing (and not yet enabled) deref-at-point semantics work. Differential Revision: https://reviews.llvm.org/D100141	2021-04-16 11:38:55 -07:00
Kristof Beyls	734cc1c60a	[docs] Add Pointer Authentication call info	2021-04-16 15:18:21 +02:00
Simon Moll	d32b153555	[docs] Add vector predication call Add the syncup call to the table Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100474	2021-04-16 10:49:34 +02:00
Juneyoung Lee	bc62ce57fa	[LangRef] formatting	2021-04-16 10:41:30 +09:00
Juneyoung Lee	2f6006376d	[LangRef] fix unexepcted unindent errror	2021-04-16 09:58:55 +09:00
Juneyoung Lee	4fba55a0b5	[LangRef] clarify the semantics of nocapture This patch clarifies the semantics of nocapture attribute. A 'Pointer Capture' subsection is added to describe the semantics of pointer capture first. For the nocapture example with two same pointer arguments, it is consistent with the semantics that Alive2 used to run lit tests. Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D97924	2021-04-16 09:48:42 +09:00
Jon Roelofs	87c0b7c256	s/setGenerator/addGenerator/ in the JIT docs. NFC	2021-04-15 15:54:28 -07:00
Momchil Velikov	d98e321d12	[clang][AArch64] Correctly align HFA arguments when passed on the stack When we pass a AArch64 Homogeneous Floating-Point Aggregate (HFA) argument with increased alignment requirements, for example struct S { __attribute__ ((__aligned__(16))) double v[4]; }; Clang uses `[4 x double]` for the parameter, which is passed on the stack at alignment 8, whereas it should be at alignment 16, following Rule C.4 in AAPCS (https://github.com/ARM-software/abi-aa/blob/master/aapcs64/aapcs64.rst#642parameter-passing-rules) Currently we don't have a way to express in LLVM IR the alignment requirements of the function arguments. The align attribute is applicable to pointers only, and only for some special ways of passing arguments (e..g byval). When implementing AAPCS32/AAPCS64, clang resorts to dubious hacks of coercing to types, which naturally have the needed alignment. We don't have enough types to cover all the cases, though. This patch introduces a new use of the stackalign attribute to control stack slot alignment, when and if an argument is passed in memory. The attribute align is left as an optimizer hint - it still applies to pointer types only and pertains to the content of the pointer, whereas the alignment of the pointer itself is determined by the stackalign attribute. For byval arguments, the stackalign attribute assumes the role, previously perfomed by align, falling back to align if stackalign` is absent. On the clang side, when passing arguments using the "direct" style (cf. `ABIArgInfo::Kind`), now we can optionally specify an alignment, which is emitted as the new `stackalign` attribute. Patch by Momchil Velikov and Lucas Prates. Differential Revision: https://reviews.llvm.org/D98794	2021-04-15 22:58:14 +01:00
Paul C. Anagnostopoulos	b1e2c33e83	[TableGen] [docs] Correct a reference in the TableGen Overview document Differential Revision: https://reviews.llvm.org/D100382	2021-04-15 09:25:09 -04:00
Kostya Kortchinsky	6d9d112a9c	[docs][scudo] Update Scudo documentation Update the Scudo document to align with the standalone version. Add some more verbiage about the various component of the allocator, rework a bit everything. The build instructions have been updated. The options and their default values have been updated, and the `mallopt` ones have been added. Differential Revision: https://reviews.llvm.org/D100230	2021-04-13 08:41:56 -07:00
Gulfem Savrun Yeniceri	c650eac142	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-04-13 01:29:41 +00:00
Kristof Beyls	d157c20ece	[docs] Add Windows/COFF call info	2021-04-12 17:11:25 +02:00
Sushma Unnibhavi	86f9f72f44	Typo fix Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D100254	2021-04-11 12:24:27 +05:30
Sushma Unnibhavi	2e031186a6	Missing syntax highlighting for LLVM IR in Langref Added syntax highlighting Differential Revision: https://reviews.llvm.org/D100125	2021-04-11 12:19:58 +05:30
Paul C. Anagnostopoulos	2a3388364d	[TableGen] [docs] Change title of tblgen.rst to fix man page filename	2021-04-09 09:37:56 -04:00
Konstantin Zhuravlyov	ec8823f099	AMDGPU: Add gfx90c support to code object v2 for backwards compatibility Differential Revision: https://reviews.llvm.org/D100126	2021-04-08 16:42:43 -04:00
Paul C. Anagnostopoulos	16f0e00ff4	Revert "[TableGen] Add support for the 'assert' statement in multiclasses" This reverts commit 3b9a15d910a8c748b1444333a4a3905a996528bc.	2021-04-08 13:58:58 -04:00
Paul C. Anagnostopoulos	282eb5170a	[TableGen] Add support for the 'assert' statement in multiclasses	2021-04-08 08:36:03 -04:00
Philip Reames	b81ddb9786	[docs] Document our norms around reverts This has come up a few times recently, and I was surprised to notice that we don't have anything in the docs. This patch deliberately sticks to stuff that is uncontroversial in the community. Everything herein is thought to be widely agreed to by a large majority of the community. A few things were noted and removed in review which failed this standard, if you spot anything else, please point it out. Differential Revision: https://reviews.llvm.org/D99305	2021-04-07 21:02:19 -07:00
Tony Tye	2ebd0d6fb1	[NFC][AMDGPU] Correct indentation in AMDGPUUsage.rst Correct indentation that results in rST syntax error.	2021-04-08 01:00:13 +00:00
Tony Tye	36036d99fe	[AMDGPU] Update gfx90a memory model support Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D100070	2021-04-07 22:17:58 +00:00
Paul C. Anagnostopoulos	8c9238aaa1	[TableGen] [docs] Correct a couple of mistakes; use 'true' and 'false' in examples Differential Revision: https://reviews.llvm.org/D99800	2021-04-05 09:15:58 -04:00
Nikita Popov	5794a3620e	[FastISel] Remove kill tracking This is a followup to D98145: As far as I know, tracking of kill flags in FastISel is just a compile-time optimization. However, I'm not actually seeing any compile-time regression when removing the tracking. This probably used to be more important in the past, before FastRA was switched to allocate instructions in reverse order, which means that it discovers kills as a matter of course. As such, the kill tracking doesn't really seem to serve a purpose anymore, and just adds additional complexity and potential for errors. This patch removes it entirely. The primary changes are dropping the hasTrivialKill() method and removing the kill arguments from the emitFast methods. The rest is mechanical fixup. Differential Revision: https://reviews.llvm.org/D98294	2021-04-03 15:50:13 +02:00
Paul C. Anagnostopoulos	f346298b33	[TableGen] [Docs] Add lldb-tblgen to command guide; add 4 guide stubs Differential Revision: https://reviews.llvm.org/D99605	2021-04-02 09:52:16 -04:00
Tony	99c3eaf30c	[NFC][AMDGPU] Add product names for gfx908 and gfx10 processors Reviewed By: msearles Differential Revision: https://reviews.llvm.org/D99781	2021-04-02 00:58:11 +00:00
Jon Roelofs	abd11b2935	[docs] Fix up dead clang-format links after monorepo move. NFC	2021-03-30 14:29:35 -07:00
oToToT	2752ef7d4d	[Docs] Update googletest docs link. The documentation link of Google Test on GitHub have been moved to the top-level docs directory. Thus, the original link is invalid now. Reviewed By: Pavel Labath Differential Revision: https://reviews.llvm.org/D99559	2021-03-30 23:20:23 +08:00
Krasimir Georgiev	72bca5f483	Revert "[Passes] Add relative lookup table converter pass" This reverts commit 5178ffc7cf92527557ae16e86d0fa90d538c2a19. Compiling `llvm-profdata` with a compiler build from this produces a crashing binary.	2021-03-30 14:13:37 +02:00
Nuno Lopes	695aa28456	[docs] remove references to checking out svn repos	2021-03-30 10:00:31 +01:00
Tim Renouf	2d9a94e442	[AMDGPU] Update AMDGPU PAL usage documentation Change-Id: I65f3edcfe5063551cad5aab0da1374c3a6ccd3a2	2021-03-30 08:33:18 +01:00
Gulfem Savrun Yeniceri	abf79b4a39	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-29 21:53:32 +00:00
Paul C. Anagnostopoulos	128e39dc70	[TableGen] Add support for the 'assert' statement in class definitions. Differential Revision: https://reviews.llvm.org/D99275	2021-03-29 09:20:29 -04:00
Matt Arsenault	efe6c99b52	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 07e46367baeca96d84b03fa215b41775f69d5989.	2021-03-29 08:55:30 -04:00
Oliver Stannard	0914bea32c	Revert "Reapply "OpaquePtr: Turn inalloca into a type attribute"" Reverting because test 'Bindings/Go/go.test' is failing on most buildbots. This reverts commit fc9df309917e57de704f3ce4372138a8d4a23d7a.	2021-03-29 11:32:22 +01:00
Matt Arsenault	403cadc380	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 20d5c42e0ef5d252b434bcb610b04f1cb79fe771.	2021-03-28 13:35:21 -04:00
Nico Weber	755e1b95c9	Revert "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 4fefed65637ec46c8c2edad6b07b5569ac61e9e5. Broke check-clang everywhere.	2021-03-28 13:02:52 -04:00
Zakk Chen	56db174a0b	[RISCV][Clang] Update new overloading rules for RVV intrinsics. RVV intrinsics has new overloading rule, please see `82aac7dad4` Changed: 1. Rename `generic` to `overloaded` because the new rule is not using C11 generic. 2. Change HasGeneric to HasNoMaskedOverloaded because all masked operations support overloading api. 3. Add more overloaded tests due to overloading rule changed. Differential Revision: https://reviews.llvm.org/D99189	2021-03-28 09:04:35 -07:00
Matt Arsenault	9b63996812	OpaquePtr: Turn inalloca into a type attribute I think byval/sret and the others are close to being able to rip out the code to support the missing type case. A lot of this code is shared with inalloca, so catch this up to the others so that can happen.	2021-03-28 11:12:23 -04:00
George Burgess IV	f2c39381e1	docs: Adding Google representative to the security group This adds me as a Google representative for the LLVM security group. This was proposed, discussed, and voted on in the differential revision linked below; please see it for more information. Differential Revision: https://reviews.llvm.org/D99232	2021-03-26 18:55:37 -07:00
Tony	382c4642c9	[NFC][AMDGPU] Corrections to AMD GPU initial kernel launch documentation Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D99223	2021-03-26 02:05:45 +00:00
Amara Emerson	61b28d4f51	[GlobalISel] Add G_ROTR and G_ROTL opcodes for rotates. Differential Revision: https://reviews.llvm.org/D99383	2021-03-25 17:23:30 -07:00
Djordje Todorovic	29c99e8499	[Debugify] Expose original debug info preservation check as CC1 option In order to test the preservation of the original Debug Info metadata in your projects, a front end option could be very useful, since users usually report that a concrete entity (e.g. variable x, or function fn2()) is missing debug info. The [0] is an example of running the utility on GDB Project. This depends on: D82546 and D82545. Differential Revision: https://reviews.llvm.org/D82547	2021-03-25 05:29:42 -07:00
Gulfem Savrun Yeniceri	54e2d4cdab	Revert "[Passes] Add relative lookup table converter pass" This reverts commit 5fd001a5ffbad403053c4a06bf4b2b76dc52bba8 because it broke clang-with-thin-lto-ubuntu bot.	2021-03-24 18:59:33 +00:00
Gulfem Savrun Yeniceri	93b265f8c0	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-24 17:31:18 +00:00
Vinicius Tinti	046d087d1d	[llvm-objdump] Implement --prefix-strip option The option `--prefix-strip` is only used when `--prefix` is not empty. It removes N initial directories from absolute paths before adding the prefix. This matches GNU's objdump behavior. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D96679	2021-03-24 13:22:35 +00:00
Andrew Savonichev	182b0cd903	[MCA] Disable RCU for InOrderIssueStage This is a follow-up for: D98604 [MCA] Ensure that writes occur in-order When instructions are aligned by the order of writes, they retire in-order naturally. There is no need for an RCU, so it is disabled. Differential Revision: https://reviews.llvm.org/D98628	2021-03-24 13:54:04 +03:00
Bruno Cardoso Lopes	054affa82b	[CGAtomic] Lift stronger requirements on cmpxch and support acquire failure mode - Fix `emitAtomicCmpXchgFailureSet` to support release/acquire (succ/fail) memory order. - Remove stronger checks for cmpxch. Effectively, this addresses http://wg21.link/p0418 Differential Revision: https://reviews.llvm.org/D98995	2021-03-23 16:45:37 -07:00
Tony	da027401bd	[NFC][AMDGPU] Reserve AMD GPU ELF machine number 0x41 Reviewed By: foad Differential Revision: https://reviews.llvm.org/D99196	2021-03-23 17:53:02 +00:00
Fraser Cormack	7850d98674	[LangRef] Fix typos in the vector-type memory layout section Reviewed By: bjope Differential Revision: https://reviews.llvm.org/D99163	2021-03-23 12:28:50 +00:00
David Sherwood	42a72164a2	[IR][SVE] Add new llvm.experimental.stepvector intrinsic This patch adds a new llvm.experimental.stepvector intrinsic, which takes no arguments and returns a linear integer sequence of values of the form <0, 1, ...>. It is primarily intended for scalable vectors, although it will work for fixed width vectors too. It is intended that later patches will make use of this new intrinsic when vectorising induction variables, currently only supported for fixed width. I've added a new CreateStepVector method to the IRBuilder, which will generate a call to this intrinsic for scalable vectors and fall back on creating a ConstantVector for fixed width. For scalable vectors this intrinsic is lowered to a new ISD node called STEP_VECTOR, which takes a single constant integer argument as the step. During lowering this argument is set to a value of 1. The reason for this additional argument at the codegen level is because in future patches we will introduce various generic DAG combines such as mul step_vector(1), 2 -> step_vector(2) add step_vector(1), step_vector(1) -> step_vector(2) shl step_vector(1), 1 -> step_vector(2) etc. that encourage a canonical format for all targets. This hopefully means all other targets supporting scalable vectors can benefit from this too. I've added cost model tests for both fixed width and scalable vectors: llvm/test/Analysis/CostModel/AArch64/neon-stepvector.ll llvm/test/Analysis/CostModel/AArch64/sve-stepvector.ll as well as codegen lowering tests for fixed width and scalable vectors: llvm/test/CodeGen/AArch64/neon-stepvector.ll llvm/test/CodeGen/AArch64/sve-stepvector.ll See this thread for discussion of the intrinsic: https://lists.llvm.org/pipermail/llvm-dev/2021-January/147943.html	2021-03-23 10:43:35 +00:00
Tony	7be40b4abf	[AMDGPU] Reserve ELF code Reserve AMD GPU ELF machine code 0x040. Minor AMDGPUUsage format consistency change. Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D99122	2021-03-23 04:30:38 +00:00
Gulfem Savrun Yeniceri	61bfb34ac2	Revert "[Passes] Add relative lookup table converter pass" This reverts commit 78a65cd945d006ff02f9d24d9cc20a302ed93b08 which caused buildbot failures.	2021-03-23 00:43:16 +00:00
Gulfem Savrun Yeniceri	947cc1dce8	[doc] Fix typo in rel lookup table converter pass Add additonal hypens to match the title size that was introduced in 78a65cd.	2021-03-22 23:25:06 +00:00
Gulfem Savrun Yeniceri	59cc51764b	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-22 22:09:02 +00:00
Bradley Smith	839304c777	[IR] Add vscale_range IR function attribute This attribute represents the minimum and maximum values vscale can take. For now this attribute is not hooked up to anything during codegen, this will be added in the future when such codegen is considered stable. Additionally hook up the -msve-vector-bits=<x> clang option to emit this attribute. Differential Revision: https://reviews.llvm.org/D98030	2021-03-22 12:05:06 +00:00
Kristof Beyls	917f15dbfc	[docs] GettingInvolved: split out flang and openmp meeting series Split out the flang and openmp meeting series, as each has a separate canonical page where the information is maintained. As part of that, also call out the alias analysis series separately as it doesn't seem to be relevant for just flang. Differential Revision: https://reviews.llvm.org/D99012	2021-03-22 09:25:57 +01:00
Jessica Paquette	ae291b6dfb	[GlobalISel] Add G_SBFX + G_UBFX (bitfield extraction opcodes) There is a bunch of similar bitfield extraction code throughout *ISelDAGToDAG. E.g, ARMISelDAGToDAG, AArch64ISelDAGToDAG, and AMDGPUISelDAGToDAG all contain code that matches a bitfield extract from an and + right shift. Rather than duplicating code in the same way, this adds two opcodes: - G_UBFX (unsigned bitfield extract) - G_SBFX (signed bitfield extract) They work like this ``` %x = G_UBFX %y, %lsb, %width ``` Where `lsb` and `width` are - The least-significant bit of the extraction - The width of the extraction This will extract `width` bits from `%y`, starting at `lsb`. G_UBFX zero-extends the result, while G_SBFX sign-extends the result. This should allow us to use the combiner to match the bitfield extraction patterns rather than duplicating pattern-matching code in each target. Differential Revision: https://reviews.llvm.org/D98464	2021-03-19 14:37:19 -07:00
Bjorn Pettersson	13603c344c	[LangRef] Describe memory layout for vectors types There are a couple of caveats when it comes to how vectors are stored to memory, and thereby also how bitcast between vector and integer types work, in LLVM IR. Specially in relation to endianess. This patch is an attempt to document such things. Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D94964	2021-03-19 19:00:37 +01:00
Christian Kühnel	b512b2d48c	propose Chocolately as package manager Installing the Unix tools on Windows is quite painful. To make things easier, I explained how to use a package manager or a Docker image. Note: This still uses the GNUWin tools as explained on this page. Once we replace these with something else, we would also need to update the installation commands. Differential Revision: https://reviews.llvm.org/D97387	2021-03-19 16:15:18 +01:00
Paul C. Anagnostopoulos	f8fbe9eb04	[TableGen] Improve handling of template arguments This requires changes to TableGen files and some C++ files due to incompatible multiclass template arguments that slipped through before the improved handling.	2021-03-19 09:57:53 -04:00
Jeroen Dobbelaere	13605b24cd	Support intrinsic overloading on unnamed types This patch adds support for intrinsic overloading on unnamed types. This fixes PR38117 and PR48340 and will also be needed for the Full Restrict Patches (D68484). The main problem is that the intrinsic overloading name mangling is using 's_s' for unnamed types. This can result in identical intrinsic mangled names for different function prototypes. This patch changes this by adding a '.XXXXX' to the intrinsic mangled name when at least one of the types is based on an unnamed type, ensuring that we get a unique name. Implementation details: - The mapping is created on demand and kept in Module. - It also checks for existing clashes and recycles potentially existing prototypes and declarations. - Because of extra data in Module, Intrinsic::getName needs an extra Module* argument and, for speed, an optional FunctionType* argument. - I still kept the original two-argument 'Intrinsic::getName' around which keeps the original behavior (providing the base name). -- Main reason is that I did not want to change the LLVMIntrinsicGetName version, as I don't know how acceptable such a change is -- The current situation already has a limitation. So that should not get worse with this patch. - Intrinsic::getDeclaration and the verifier are now using the new version. Other notes: - As far as I see, this should not suffer from stability issues. The count is only added for prototypes depending on at least one anonymous struct - The initial count starts from 0 for each intrinsic mangled name. - In case of name clashes, existing prototypes are remembered and reused when that makes sense. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D91250	2021-03-19 14:34:25 +01:00
Kristof Beyls	9d15a1dd6c	[docs] Add calendar info for SVE sync-ups	2021-03-19 10:27:34 +01:00
Kristof Beyls	1c7cdd5117	[docs] Document regular LLVM sync-ups This documents current regular LLVM sync-ups that are happening in the Getting Involved section. I hope this gives a bit more visibility to regular sync-ups that are happening in the LLVM community, documenting another way communication in the community happens. Of course the downside is that this is another location that sync-up metadata needs to be maintained. That being said, the structure as proposed means that no changes are needed once a new sync-up is added, apart from maybe removing the entry once it becomes clear that that particular sync-up series is completely cancelled. Documenting a few pointers on how current sync-ups happen may also encourage others to organize useful sync-ups on specific topics. I've started with adding the sync-ups I'm aware of. There's a good chance I've missed some. If most sync-ups end up having a public google calendar, we could also create and maintain a public google calendar that shows all events happening in the LLVM community, including dev meetings, sync-ups, socials, etc - assuming that would be valuable. Differential Revision: https://reviews.llvm.org/D98797	2021-03-18 18:32:27 +01:00
Vaivaswatha Nagaraj	062bb2a6ef	[Docs] Mention linking to reviews page when committing Differential Revision: https://reviews.llvm.org/D98695	2021-03-16 23:04:22 +05:30
Fangrui Song	d3961f8ad2	[llvm-nm] Add --format=just-symbols and make --just-symbol-name its alias https://sourceware.org/bugzilla/show_bug.cgi?id=27487 binutils will have --format=just-symbols/-j as well. Arbitrarily prefer `-j` to `--format=sysv`. Previously `--format=sysv -j` prints in the sysv format while `-j` takes precedence over other formats. Differential Revision: https://reviews.llvm.org/D98569	2021-03-16 10:07:01 -07:00
David Zarzycki	643090aa23	[lit] Sort test start times based on prior test timing data Lit as it exists today has three hacks that allow users to run tests earlier: 1) An entire test suite can set the `is_early` boolean. 2) A very recently introduced "early_tests" feature. 3) The `--incremental` flag forces failing tests to run first. All of these approaches have problems. 1) The `is_early` feature was until very recently undocumented. Nevertheless it still lacks testing and is a imprecise way of optimizing test starting times. 2) The `early_tests` feature requires manual updates and doesn't scale. 3) `--incremental` is undocumented, untested, and it requires modifying the source file system by "touching" the file. This "touch" based approach is arguably a hack because it confuses editors (because it looks like the test was modified behind the back of the editor) and "touching" the test source file doesn't work if the test suite is read only from the perspective of `lit` (via advanced filesystem/build tricks). This patch attempts to simplify and address all of the above problems. This patch formalizes, documents, tests, and defaults lit to recording the execution time of tests and then reordering all tests during the next execution. By reordering the tests, high core count machines run faster, sometimes significantly so. This patch also always runs failing tests first, which is a positive user experience win for those that didn't know about the hidden `--incremental` flag. Finally, if users want, they can _optionally_ commit the test timing data (or a subset thereof) back to the repository to accelerate bots and first-time runs of the test suite. Reviewed By: jhenderson, yln Differential Revision: https://reviews.llvm.org/D98179	2021-03-16 05:23:04 -04:00
Thomas Preud'homme	0911193c17	[FileCheck] Add support for hex alternate form in FileCheck Add printf-style alternate form flag to prefix hex number with 0x when present. This works on both empty numeric expression (e.g. variable definition from input) and when matching a numeric expression. The syntax is as follows: [[#%#<precision specifier><format specifier>, ...] where <precision specifier> and <format specifier> are optional and ... can be a variable definition or not with an empty expression or not. This feature was requested in https://reviews.llvm.org/D81144#2075532 for llvm/test/MC/ELF/gen-dwarf64.s Reviewed By: jdenny Differential Revision: https://reviews.llvm.org/D97845	2021-03-12 18:14:17 +00:00
David Green	18fc27f084	[ARM] Improve WLS lowering Recently we improved the lowering of low overhead loops and tail predicated loops, but concentrated first on the DLS do style loops. This extends those improvements over to the WLS while loops, improving the chance of lowering them successfully. To do this the lowering has to change a little as the instructions are terminators that produce a value - something that needs to be treated carefully. Lowering starts at the Hardware Loop pass, inserting a new llvm.test.start.loop.iterations that produces both an i1 to control the loop entry and an i32 similar to the llvm.start.loop.iterations intrinsic added for do loops. This feeds into the loop phi, properly gluing the values together: %wls = call { i32, i1 } @llvm.test.start.loop.iterations.i32(i32 %div) %wls0 = extractvalue { i32, i1 } %wls, 0 %wls1 = extractvalue { i32, i1 } %wls, 1 br i1 %wls1, label %loop.ph, label %loop.exit ... loop: %lsr.iv = phi i32 [ %wls0, %loop.ph ], [ %iv.next, %loop ] .. %iv.next = call i32 @llvm.loop.decrement.reg.i32(i32 %lsr.iv, i32 1) %cmp = icmp ne i32 %iv.next, 0 br i1 %cmp, label %loop, label %loop.exit The llvm.test.start.loop.iterations need to be lowered through ISel lowering as a pair of WLS and WLSSETUP nodes, which each get converted to t2WhileLoopSetup and t2WhileLoopStart Pseudos. This helps prevent t2WhileLoopStart from being a terminator that produces a value, something difficult to control at that stage in the pipeline. Instead the t2WhileLoopSetup produces the value of LR (essentially acting as a lr = subs rn, 0), t2WhileLoopStart consumes that lr value (the Bcc). These are then converted into a single t2WhileLoopStartLR at the same point as t2DoLoopStartTP and t2LoopEndDec. Otherwise we revert the loop to prevent them from progressing further in the pipeline. The t2WhileLoopStartLR is a single instruction that takes a GPR and produces LR, similar to the WLS instruction. %1:gprlr = t2WhileLoopStartLR %0:rgpr, %bb.3 t2B %bb.1 ... bb.2.loop: %2:gprlr = PHI %1:gprlr, %bb.1, %3:gprlr, %bb.2 ... %3:gprlr = t2LoopEndDec %2:gprlr, %bb.2 t2B %bb.3 The t2WhileLoopStartLR can then be treated similar to the other low overhead loop pseudos, eventually being lowered to a WLS providing the branches are within range. Differential Revision: https://reviews.llvm.org/D97729	2021-03-11 17:56:19 +00:00
Djordje Todorovic	1e88deac13	[Debugify][OriginalDIMode] Export the report into JSON file By using the original-di check with debugify in the combination with the llvm/utils/llvm-original-di-preservation.py it becomes very user friendly tool. An example of the HTML page with the issues related to debug info can be found at [0]. [0] https://djolertrk.github.io/di-checker-html-report-example/ Differential Revision: https://reviews.llvm.org/D82546	2021-03-11 01:11:13 -08:00
Zakk Chen	da49c75c2f	[Clang][RISCV] Add custom TableGen backend for riscv-vector intrinsics. Demonstrate how to generate vadd/vfadd intrinsic functions 1. add -gen-riscv-vector-builtins for clang builtins. 2. add -gen-riscv-vector-builtin-codegen for clang codegen. 3. add -gen-riscv-vector-header for riscv_vector.h. It also generates ifdef directives with extension checking, base on D94403. 4. add -gen-riscv-vector-generic-header for riscv_vector_generic.h. Generate overloading version Header for generic api. https://github.com/riscv/rvv-intrinsic-doc/blob/master/rvv-intrinsic-rfc.md#c11-generic-interface 5. update tblgen doc for riscv related options. riscv_vector.td also defines some unused type transformers for vadd, because I think it could demonstrate how tranfer type work and we need them for the whole intrinsic functions implementation in the future. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: jrtc27, craig.topper, HsiangKai, Jim, Paul-C-Anagnostopoulos Differential Revision: https://reviews.llvm.org/D95016	2021-03-10 18:43:43 -08:00
Christudasan Devadasan	aa8030a6bf	GlobalISel: Try to combine G_[SU]DIV and G_[SU]REM It is good to have a combined `divrem` instruction when the `div` and `rem` are computed from identical input operands. Some targets can lower them through a single expansion that computes both division and remainder. It effectively reduces the number of instructions than individually expanding them. Reviewed By: arsenm, paquette Differential Revision: https://reviews.llvm.org/D96013	2021-03-10 18:46:07 +05:30
Yao Zhao	47a5a13f53	[xray] Fix xray document spelling fix a couple of words spelling Reviewed By: dberris Differential Revision: https://reviews.llvm.org/D96658	2021-03-10 16:03:55 +11:00
Cullen Rhodes	6682076a17	[IR] Introduce llvm.experimental.vector.splice intrinsic This patch introduces a new intrinsic @llvm.experimental.vector.splice that constructs a vector of the same type as the two input vectors, based on a immediate where the sign of the immediate distinguishes two variants. A positive immediate specifies an index into the first vector and a negative immediate specifies the number of trailing elements to extract from the first vector. For example: @llvm.experimental.vector.splice(<A,B,C,D>, <E,F,G,H>, 1) ==> <B, C, D, E> ; index @llvm.experimental.vector.splice(<A,B,C,D>, <E,F,G,H>, -3) ==> <B, C, D, E> ; trailing element count These intrinsics support both fixed and scalable vectors, where the former is lowered to a shufflevector to maintain existing behaviour, although while marked as experimental the recommended way to express this operation for fixed-width vectors is to use shufflevector. For scalable vectors where it is not possible to express a shufflevector mask for this operation, a new ISD node has been implemented. This is one of the named shufflevector intrinsics proposed on the mailing-list in the RFC at [1]. Patch by Paul Walker and Cullen Rhodes. [1] https://lists.llvm.org/pipermail/llvm-dev/2020-November/146864.html Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D94708	2021-03-09 10:44:22 +00:00
Alexander Shaposhnikov	4904bec77b	[docs] Fix llvm-objcopy.rst Adjust the title underline, NFC.	2021-03-08 19:06:32 -08:00
Alexander Shaposhnikov	98aa5107b5	[llvm-objcopy][MachO] Add support for --keep-undefined This diff introduces --keep-undefined in llvm-objcopy/llvm-strip for Mach-O which makes the tools preserve undefined symbols. Test plan: make check-all Differential revision: https://reviews.llvm.org/D97040	2021-03-08 18:57:25 -08:00
Alexander Shaposhnikov	97b192e85f	[llvm-objdump][MachO] Add support for dumping function starts Add support for dumping function starts for Mach-O binaries. Test plan: make check-all Differential revision: https://reviews.llvm.org/D97027	2021-03-08 18:44:44 -08:00
Juneyoung Lee	8d4adfa7a5	[LangRef] mention that the lifetime intrinsics' description in LangRef isn't everything This is a minor patch that addresses concerns about lifetime in D94002. We need to mention that what's written in LangRef isn't everything about lifetime.start/end and its semantics depends on the stack coloring algorithm's pattern matching of a stack pointer. If the stack coloring algorithm cannot conclude that a pointer is a stack-allocated object, the pointer is conservatively considered as a non-stack one because stack coloring won't take this lifetime into account while assigning addresses. A reference from alloca to lifetime.start/end is added as well. Differential Revision: https://reviews.llvm.org/D98112	2021-03-09 11:33:36 +09:00
Ben Dunbobbin	761a3287bd	Reland: [Docs][Windows Itanium] Add a How-To document for Windows Itanium. This is a basic How-To that describes: - What Windows Itanium is. - How to assemble a build environment. Differential Revision: https://reviews.llvm.org/D89518	2021-03-09 01:36:34 +00:00
Tony	a686697492	[NFC][AMDGPU] Correct typo in DWARF Extensions For Heterogeneous Debugging A note in the defintion of DW_OP_piece had an incomplete sentence. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D98157	2021-03-09 00:23:23 +00:00
Rahman Lavaee	2c790c2f9f	[llvm-readelf] Support dumping the BB address map section with --bb-addr-map. This patch lets llvm-readelf dump the content of the BB address map section in the following format: ``` Function { At: <address> BB entries [ { Offset: <offset> Size: <size> Metadata: <metadata> }, ... ] } ... ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D95511	2021-03-08 16:20:11 -08:00
Ben Dunbobbin	00a8ffd3fa	Revert "[Docs][Windows Itanium] Add a How-To document for Windows Itanium." This reverts commit 5a91d23ddfb2effd471b919241d1ef80bf1a4c9d. Markup was incorrect.	2021-03-08 23:57:27 +00:00
Ben Dunbobbin	70e9df61a5	[Docs][Windows Itanium] Add a How-To document for Windows Itanium. This is a basic How-To that describes: - What Windows Itanium is. - How to assemble a build environment. Differential Revision: https://reviews.llvm.org/D89518	2021-03-08 23:48:51 +00:00
Keith Smiley	30b780a55a	llvm-nm: add flag to suppress no symbols warning This spelling matches binutils https://sourceware.org/bugzilla/show_bug.cgi?id=27408 Differential Revision: https://reviews.llvm.org/D83152	2021-03-07 16:20:13 -08:00
Tony	4f31275f73	[NFC][AMDGPU] DWARF Extensions For Heterogeneous Debugging clarifications Clarify that the base type endianity is used when creating implicit location storage. Remove duplicate definition of the generic type. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D98137	2021-03-07 18:34:17 +00:00
Tony	b0eb76b4ee	[NFC][AMDGPU]DWARF Extensions For Heterogeneous Debugging generic type endianity In "DWARF Extensions For Heterogeneous Debugging" document that the DWARF generic type has a target architecture defined endianity. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D98126	2021-03-07 04:51:05 +00:00
Juneyoung Lee	5caecc36c3	[LangRef] dos2unix (NFC)	2021-03-06 18:44:40 +09:00
gbtozers	c52cf11f42	[DebugInfo] Add DIArgList MD to store multple values in DbgVariableIntrinsics This patch adds a new metadata node, DIArgList, which contains a list of SSA values. This node is in many ways similar in function to the existing ValueAsMetadata node, with the difference being that it tracks a list instead of a single value. Internally, it uses ValueAsMetadata to track the individual values, but there is also a reasonable amount of DIArgList-specific value-tracking logic on top of that. Similar to ValueAsMetadata, it is a special case in parsing and printing due to the fact that it requires a function state (as it may reference function-local values). This patch should not result in any immediate functional change; it allows for DIArgLists to be parsed and printed, but debug variable intrinsics do not yet recognize them as a valid argument (outside of parsing). Differential Revision: https://reviews.llvm.org/D88175	2021-03-05 17:02:24 +00:00
Stephen Tozer	e0cb677eb6	Reapply "[DebugInfo] Add new instruction and DIExpression operator for variadic debug values" Rewrites test to use correct architecture triple; fixes incorrect reference in SourceLevelDebugging doc; simplifies `spillReg` behaviour so as to not be dependent on changes elsewhere in the patch stack. This reverts commit d2000b45d033c06dc7973f59909a0ad12887ff51.	2021-03-05 12:32:05 +00:00
Juneyoung Lee	5f3a69dfff	[LangRef] lifetime intrinsics: don't use word 'offset' from Philip's comments	2021-03-05 12:53:13 +09:00
Philip Reames	5a592bf8e6	[docs] Remove some stale wording from gc.relocate description We dropped support for the non-bundle form a while back, but I apparently missed updating one place in the docs.	2021-03-04 15:18:11 -08:00
Philip Reames	42428235cc	[docs] Move statepoint related intrinsics into main LangRef	2021-03-04 15:13:27 -08:00
Akira Hatanaka	4055195f29	[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR This reapplies ed4718eccb12bd42214ca4fb17d196d49561c0c7, which was reverted because it was causing a miscompile. The bug that was causing the miscompile has been fixed in 75805dce5ff874676f3559c069fcd6737838f5c0. Original commit message: Background: This fixes a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.attachedcall" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if claimRV is attached to the call since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since the ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if retainRV is attached to the call and does nothing if claimRV is attached to it. - SCCP refrains from replacing the return value of a call with a constant value if the call has the operand bundle. This ensures the call always has at least one user (the call to @llvm.objc.clang.arc.noop.use). - This patch also fixes a bug in replaceUsesOfNonProtoConstant where multiple operand bundles of the same kind were being added to a call. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-03-04 11:22:30 -08:00
Stephen Tozer	977ffc2c60	Revert "[DebugInfo] Add new instruction and DIExpression operator for variadic debug values" This reverts commit d07f106f4a48b6e941266525b6f7177834d7b74e.	2021-03-04 11:59:21 +00:00
gbtozers	7cf2776667	[DebugInfo] Add new instruction and DIExpression operator for variadic debug values This patch adds a new instruction that can represent variadic debug values, DBG_VALUE_VAR. This patch alone covers the addition of the instruction and a set of basic code changes in MachineInstr and a few adjacent areas, but does not correctly handle variadic debug values outside of these areas, nor does it generate them at any point. The new instruction is similar to the existing DBG_VALUE instruction, with the following differences: the operands are in a different order, any number of values may be used in the instruction following the Variable and Expression operands (these are referred to in code as “debug operands”) and are indexed from 0 so that getDebugOperand(X) == getOperand(X+2), and the Expression in a DBG_VALUE_VAR must use the DW_OP_LLVM_arg operator to pass arguments into the expression. The new DW_OP_LLVM_arg operator is only valid in expressions appearing in a DBG_VALUE_VAR; it takes a single argument and pushes the debug operand at the index given by the argument onto the Expression stack. For example the sub-expression `DW_OP_LLVM_arg, 0` has the meaning “Push the debug operand at index 0 onto the expression stack.” Differential Revision: https://reviews.llvm.org/D82363	2021-03-04 11:45:35 +00:00
Andrew Savonichev	064cc1a22c	[MCA] Add support for in-order CPUs This patch adds a pipeline to support in-order CPUs such as ARM Cortex-A55. In-order pipeline implements a simplified version of Dispatch, Scheduler and Execute stages as a single stage. Entry and Retire stages are common for both in-order and out-of-order pipelines. Differential Revision: https://reviews.llvm.org/D94928	2021-03-04 14:08:19 +03:00
James Henderson	6bdae7560c	[llvm-objcopy][llvm-strip] Improve --discard-all documentation and help The help text and documentation for the --discard-all option failed to mention that the option also causes the removal of debug sections. This change fixes both for both llvm-objcopy and llvm-strip. Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D97662	2021-03-04 10:25:35 +00:00
Juneyoung Lee	4267f32aaf	[LangRef] remove links to lifetime since use marker intro already has a link	2021-03-04 17:19:23 +09:00
Juneyoung Lee	c2621fae37	[LangRef] fix more undefined label errors	2021-03-04 17:09:03 +09:00
Johannes Doerfert	97088b4db9	[Docs] Remove `no-aa` from the alias analysis documentation The `no-aa` pass has been removed with 7b560d40bddf. Differential Revision: https://reviews.llvm.org/D95416	2021-03-04 00:35:52 -06:00
Wang, Pengfei	2e2e287013	Add Windows ehcont section support (/guard:ehcont). Add option /guard:ehcont Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D96709	2021-03-04 11:47:29 +08:00
Juneyoung Lee	09d28f9368	[LangRef] fix undefined label	2021-03-04 10:12:57 +09:00
Juneyoung Lee	f861c629c1	[LangRef] Make lifetime intrinsic's semantics consistent with StackColoring's comment This patch is an update to LangRef by describing lifetime intrinsics' behavior by following the description of MIR's LIFETIME_START/LIFETIME_END markers at StackColoring.cpp (`eb44682d67/llvm/lib/CodeGen/StackColoring.cpp (L163)`) and the discussion in llvm-dev. In order to explicitly define the meaning of an object lifetime, I added 'Object Lifetime' subsection. Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D94002	2021-03-04 09:58:06 +09:00
Xun Li	a7cf9dd738	[LICM][Coroutine] Don't sink stores from loops with coro.suspend instructions See pr46990(https://bugs.llvm.org/show_bug.cgi?id=46990). LICM should not sink store instructions to loop exit blocks which cross coro.suspend intrinsics. This breaks semantic of coro.suspend intrinsic which return to caller directly. Also this leads to use-after-free if the coroutine is freed before control returns to the caller in multithread environment. This patch disable promotion by check whether loop contains coro.suspend intrinsics. This is a resubmit of D86190. Disabling LICM for loops with coroutine suspension is a better option not only for correctness purpose but also for performance purpose. In most cases LICM sinks memory operations. In the case of coroutine, sinking memory operation out of the loop does not improve performance since coroutien needs to get data from the frame anyway. In fact LICM would hurt coroutine performance since it adds more entries to the frame. Differential Revision: https://reviews.llvm.org/D96928	2021-03-03 15:21:57 -08:00
Hans Wennborg	c2ea8c4219	Revert "[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR" This caused miscompiles of Chromium tests for iOS due clobbering of live registers. See discussion on the code review for details. > Background: > > This fixes a longstanding problem where llvm breaks ARC's autorelease > optimization (see the link below) by separating calls from the marker > instructions or retainRV/claimRV calls. The backend changes are in > https://reviews.llvm.org/D92569. > > https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue > > What this patch does to fix the problem: > > - The front-end adds operand bundle "clang.arc.attachedcall" to calls, > which indicates the call is implicitly followed by a marker > instruction and an implicit retainRV/claimRV call that consumes the > call result. In addition, it emits a call to > @llvm.objc.clang.arc.noop.use, which consumes the call result, to > prevent the middle-end passes from changing the return type of the > called function. This is currently done only when the target is arm64 > and the optimization level is higher than -O0. > > - ARC optimizer temporarily emits retainRV/claimRV calls after the calls > with the operand bundle in the IR and removes the inserted calls after > processing the function. > > - ARC contract pass emits retainRV/claimRV calls after the call with the > operand bundle. It doesn't remove the operand bundle on the call since > the backend needs it to emit the marker instruction. The retainRV and > claimRV calls are emitted late in the pipeline to prevent optimization > passes from transforming the IR in a way that makes it harder for the > ARC middle-end passes to figure out the def-use relationship between > the call and the retainRV/claimRV calls (which is the cause of > PR31925). > > - The function inliner removes an autoreleaseRV call in the callee if > nothing in the callee prevents it from being paired up with the > retainRV/claimRV call in the caller. It then inserts a release call if > claimRV is attached to the call since autoreleaseRV+claimRV is > equivalent to a release. If it cannot find an autoreleaseRV call, it > tries to transfer the operand bundle to a function call in the callee. > This is important since the ARC optimizer can remove the autoreleaseRV > returning the callee result, which makes it impossible to pair it up > with the retainRV/claimRV call in the caller. If that fails, it simply > emits a retain call in the IR if retainRV is attached to the call and > does nothing if claimRV is attached to it. > > - SCCP refrains from replacing the return value of a call with a > constant value if the call has the operand bundle. This ensures the > call always has at least one user (the call to > @llvm.objc.clang.arc.noop.use). > > - This patch also fixes a bug in replaceUsesOfNonProtoConstant where > multiple operand bundles of the same kind were being added to a call. > > Future work: > > - Use the operand bundle on x86-64. > > - Fix the auto upgrader to convert call+retainRV/claimRV pairs into > calls with the operand bundles. > > rdar://71443534 > > Differential Revision: https://reviews.llvm.org/D92808 This reverts commit ed4718eccb12bd42214ca4fb17d196d49561c0c7.	2021-03-03 15:51:40 +01:00
Stefan Gränitz	e9a5668ec5	[docs][JITLink] Fix a typo (NFC)	2021-03-02 15:07:36 +01:00
Tony Tye	e33a5d6364	[NFC][AMDGPU] Document the AMDGPU target feature defaults Document the default for the XNACK and SRAMECC target features for code object V2-V3 and V4. Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D97598	2021-02-27 18:28:15 +00:00
Kazu Hirata	9d2bb4e874	[llvm] Fix typos in documentation (NFC)	2021-02-27 10:09:23 -08:00
Arthur Eubanks	33f6c8a3f1	[docs] Add documentation on using the new pass manager And clarify in the "writing a pass" docs that both the legacy and new PMs are being used for the codegen/optimization pipelines. Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D97515	2021-02-26 15:28:19 -08:00
Stefan Gränitz	66b9e5f6f9	[docs][JITLink] Few typo fixes in JITLink design/API doc	2021-02-26 12:56:42 +01:00
Nico Weber	d8f7d5b6a7	[arm builtin crosscompile docs] add COMPILER_RT_BUILD_MEMPROF=OFF Reported by artok on irc, thanks!	2021-02-25 10:44:52 -05:00
Nico Weber	6de0baf5a6	[arm builtin crosscompile docs] alphabetize flags, no behavior change	2021-02-25 10:44:16 -05:00
Lang Hames	c85460437c	[docs][JITLink] Reintroduce JITLink design/API doc with fixes and improvements. This document was originally introduced in ab4648504b2, and was reverted in 912bc4980e9 while I investigated a number of shpinx bot errors. This commit reintroduces the document with fixes for those errors, as well as some improvements to the wording and formatting.	2021-02-25 15:27:59 +11:00
Joel E. Denny	eaa132dc46	[lit] Add --ignore-fail For some build configurations, `check-all` calls lit multiple times to run multiple lit test suites. Most recently, I've found this to be true when configuring openmp as part of `LLVM_ENABLE_RUNTIMES`, but this is not the first time. If one test suite fails, none of the remaining test suites run, so you cannot determine if your patch has broken them. It can then be frustrating to try to determine which `check-` targets will run the remaining tests without getting stuck on the failing tests. When such cases arise, it is probably best to adjust the cmake configuration for `check-all` to run all test suites as part of one lit invocation. Because that fix will likely not be implemented and land immediately, this patch introduces `--ignore-fail` to serve as a workaround for developers trying to see test results until it does land: ``` $ LIT_OPTS=--ignore-fail ninja check-all ``` One problem with `--ignore-fail` is that it makes it challenging to detect test failures in a script, perhaps in CI. This problem should serve as motivation to actually fix the cmake configuration instead of continuing to use `--ignore-fail` indefinitely. Reviewed By: jhenderson, thopre Differential Revision: https://reviews.llvm.org/D96371	2021-02-24 13:10:27 -05:00
Lang Hames	4e02a88925	[docs][JITLink] Remove the JITLink doc for now. I'll reinstate and continue investigation tomorrow.	2021-02-24 22:32:18 +11:00
Lang Hames	9a46912f53	[docs][JITLink] Yet more experiments to try to understand sphinx error.	2021-02-24 22:22:48 +11:00
Lang Hames	1022a61830	[docs][JITLink] More experiments to try to understand sphinx error.	2021-02-24 22:22:47 +11:00
Lang Hames	296cd8fcdb	[docs][JITLink] Make ``ObjectLinkingLayer`` not a paragraph start. More experiments as I try to placate sphinx.	2021-02-24 22:04:14 +11:00
Lang Hames	972cb5e278	[docs][JITLink] Return to `` for inline literals. Also awkwardly reformat text to test whether the error is occurring on the line with the '::', or the previous one.	2021-02-24 21:55:49 +11:00
Lang Hames	bceb5d6608	[docs][JITLink] Try explicit literal blocks for monospace list elements.	2021-02-24 21:50:27 +11:00
Lang Hames	eeb1165ed0	[docs][JITLink] More attempted fixes for formatting issues in the JITLink doc. Try using the literal domain for `ObjectLinkingLayer::Plugin` and literal blocks for multi-line method names.	2021-02-24 21:41:27 +11:00
Lang Hames	009afd2bc6	[docs][JITLink] Sphinx does not like '::' in monotype. Try using a cpp domain expr instead.	2021-02-24 21:23:10 +11:00
Lang Hames	e6ec15b603	[docs][JITLink] Add a JITLink design and API document.	2021-02-24 21:04:35 +11:00
xgupta	4ad2cda5ee	[Docs] Mention clone depth feature of git in LLVM getting started The current size of the llvm-project repository exceeds 1 GB. A shallow clone can save a lot of space and time. Some developers might not aware of this feature. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D97118	2021-02-24 10:56:10 +05:30
Lang Hames	e7e4125b9a	Revert "[docs][ORC] Fix section title and reference." This reverts commit 6e1affe71c79a1cb5ea9d805ff7baae5cba59c0e, which caused an error on the Sphinx doc bot.	2021-02-24 07:27:39 +11:00
Lang Hames	e4f5ded240	[docs][ORC] Fix section title and reference.	2021-02-23 17:38:51 +11:00
Sanjay Patel	d83af3b33a	[LangRef] fix typo in assume bundle description; NFC	2021-02-22 09:30:49 -05:00
Dmitry Preobrazhensky	d8fddd2027	[AMDGPU][MC] Corrected bound_ctrl for compatibility with sp3 Enabled "bound_ctrl:1" and disabled "bound_ctrl:-1" syntax. Corrected printer to output "bound_ctrl:1" instead of "bound_ctrl:0". See bug 35397 for detailed issue description. Differential Revision: https://reviews.llvm.org/D97048	2021-02-22 14:59:40 +03:00
David Zarzycki	81606a10c8	[lit] Add --xfail and --filter-out (inverse of --filter) In semi-automated environments, XFAILing or filtering out known regressions without actually committing changes or temporarily modifying the test suite can be quite useful. Reviewed By: yln Differential Revision: https://reviews.llvm.org/D96662	2021-02-20 05:43:29 -05:00
Djordje Todorovic	ff9317c291	[docs] Fix the GlobalISel/GenericOpcode.rst This couses docs build to fail. Introduced with D96890.	2021-02-19 10:31:31 +01:00
Djordje Todorovic	38dcbfd498	Reland "[Debugify] Make the debugify aware of the original (-g) Debug Info" As discussed on the RFC [0], I am sharing the set of patches that enables checking of original Debug Info metadata preservation in optimizations. The proof-of-concept/proposal can be found at [1]. The implementation from the [1] was full of duplicated code, so this set of patches tries to merge this approach into the existing debugify utility. For example, the utility pass in the original-debuginfo-check mode could be invoked as follows: $ opt -verify-debuginfo-preserve -pass-to-test sample.ll Since this is very initial stage of the implementation, there is a space for improvements such as: - Add support for the new pass manager - Add support for metadata other than DILocations and DISubprograms [0] https://groups.google.com/forum/#!msg/llvm-dev/QOyF-38YPlE/G213uiuwCAAJ [1] https://github.com/djolertrk/llvm-di-checker Differential Revision: https://reviews.llvm.org/D82545 The test that was failing is now forced to use the old PM.	2021-02-18 23:29:22 -08:00
Konstantin Zhuravlyov	7646090a8e	AMDGPU/ELF: Sort MACHs by value and add missing reserved MACHs - Sort MACHs by its value - Add missing reserved MACHs - EF_AMDGPU_MACH_AMDGCN_RESERVED_0X3D - EF_AMDGPU_MACH_AMDGCN_RESERVED_0X3E Differential Revision: https://reviews.llvm.org/D97010	2021-02-18 20:46:27 -05:00
Petr Hosek	d548975423	[Coverage] Store compilation dir separately in coverage mapping We currently always store absolute filenames in coverage mapping. This is problematic for several reasons. It poses a problem for distributed compilation as source location might vary across machines. We are also duplicating the path prefix potentially wasting space. This change modifies how we store filenames in coverage mapping. Rather than absolute paths, it stores the compilation directory and file paths as given to the compiler, either relative or absolute. Later when reading the coverage mapping information, we recombine relative paths with the working directory. This approach is similar to handling ofDW_AT_comp_dir in DWARF. Finally, we also provide a new option, -fprofile-compilation-dir akin to -fdebug-compilation-dir which can be used to manually override the compilation directory which is useful in distributed compilation cases. Differential Revision: https://reviews.llvm.org/D95753	2021-02-18 14:34:39 -08:00
Petr Hosek	8ce86e4200	Revert "[Coverage] Store compilation dir separately in coverage mapping" This reverts commit 97ec8fa5bb07e3f5bf25ddcb216b545cd3d03b65 since the test is failing on some bots.	2021-02-18 12:50:24 -08:00
Petr Hosek	fd85f23c36	[Coverage] Store compilation dir separately in coverage mapping We currently always store absolute filenames in coverage mapping. This is problematic for several reasons. It poses a problem for distributed compilation as source location might vary across machines. We are also duplicating the path prefix potentially wasting space. This change modifies how we store filenames in coverage mapping. Rather than absolute paths, it stores the compilation directory and file paths as given to the compiler, either relative or absolute. Later when reading the coverage mapping information, we recombine relative paths with the working directory. This approach is similar to handling ofDW_AT_comp_dir in DWARF. Finally, we also provide a new option, -fprofile-compilation-dir akin to -fdebug-compilation-dir which can be used to manually override the compilation directory which is useful in distributed compilation cases. Differential Revision: https://reviews.llvm.org/D95753	2021-02-18 12:27:42 -08:00
Paul C. Anagnostopoulos	5e8a4a10a2	Revert "[TableGen] Improve algorithms for processing template arguments" This reverts commit e589207d5aaee6cbf1d7c7de8867a17727d14aca.	2021-02-18 09:26:26 -05:00
Paul C. Anagnostopoulos	31c49eba49	[TableGen] Improve algorithms for processing template arguments Rework template argument checking so that all arguments are type-checked and cast if necessary. Add a test. Differential Revision: https://reviews.llvm.org/D96416	2021-02-18 09:15:26 -05:00
Djordje Todorovic	78cda129f4	Revert "[Debugify] Make the debugify aware of the original (-g) Debug Info" This reverts rG8ee7c7e02953. One test is failing, I'll reland this as soon as possible.	2021-02-18 02:04:27 -08:00
Djordje Todorovic	74ad5c12ad	[Debugify] Make the debugify aware of the original (-g) Debug Info As discussed on the RFC [0], I am sharing the set of patches that enables checking of original Debug Info metadata preservation in optimizations. The proof-of-concept/proposal can be found at [1]. The implementation from the [1] was full of duplicated code, so this set of patches tries to merge this approach into the existing debugify utility. For example, the utility pass in the original-debuginfo-check mode could be invoked as follows: $ opt -verify-debuginfo-preserve -pass-to-test sample.ll Since this is very initial stage of the implementation, there is a space for improvements such as: - Add support for the new pass manager - Add support for metadata other than DILocations and DISubprograms [0] https://groups.google.com/forum/#!msg/llvm-dev/QOyF-38YPlE/G213uiuwCAAJ [1] https://github.com/djolertrk/llvm-di-checker Differential Revision: https://reviews.llvm.org/D82545	2021-02-18 01:52:16 -08:00
Stanislav Mekhanoshin	f1c6dbc4d5	[AMDGPU] gfx90a support Differential Revision: https://reviews.llvm.org/D96906	2021-02-17 16:01:32 -08:00
Jessica Paquette	840095b1e1	[GlobalISel] Add G_ASSERT_SEXT This adds a G_ASSERT_SEXT opcode, similar to G_ASSERT_ZEXT. This instruction signifies that an operation was already sign extended from a smaller type. This is useful for functions with sign-extended parameters. E.g. ``` define void @foo(i16 signext %x) { ... } ``` This adds verifier, regbankselect, and instruction selection support for G_ASSERT_SEXT equivalent to G_ASSERT_ZEXT. Differential Revision: https://reviews.llvm.org/D96890	2021-02-17 13:10:34 -08:00
David Zarzycki	4d99d51ea2	[lit] Add "early_tests" config option With enough cores, the slowest tests can significantly change the total testing time if they happen to run late. With this change, a test suite can improve performance (for high-end systems) by listing just a few of the slowest tests up front. Reviewed By: jdenny, jhenderson Differential Revision: https://reviews.llvm.org/D96594	2021-02-17 06:32:04 -05:00
Thomas Preud'homme	b894f8c766	Add lit config for dir with standalone tests Some test systems do not use lit for test discovery but only for its substitution and test selection because they use another way of managing test collections, e.g. CTest. This forces those tests to be invoked with lit --no-indirectly-run-check. When a mix of lit version is in use, it requires to detect the availability of that option. This commit provides a new config option standalone_tests to signal a directory made of tests meant to run as standalone. When this option is set, lit skips test discovery and the indirectly run check. It also adds the missing documentation for --no-indirectly-run-check. Reviewed By: jdenny Differential Revision: https://reviews.llvm.org/D94766	2021-02-17 10:38:58 +00:00
Alexander Shaposhnikov	dc7498891d	[llvm-libtool] Emit warnings for files without symbols 1. Emit warnings for files without symbols. 2. Add -no_warning_for_no_symbols. Test plan: make check-all Differential revision: https://reviews.llvm.org/D95843	2021-02-16 17:52:12 -08:00
Fangrui Song	3296757d5e	[llvm-objcopy] Delete --build-id-link-{dir,input,output} The few options are niche. They solved a problem which was traditionally solved with more shell commands (`llvm-readelf -n` fetches the Build ID. Then `ln` is used to hard link the file to a directory derived from the Build ID.) Due to limitation, they are no longer used by Fuchsia and they don't appear to be used elsewhere (checked with Google Search and Debian Code Search). So delete them without a transition period. Announcement: https://lists.llvm.org/pipermail/llvm-dev/2021-February/148446.html Differential Revision: https://reviews.llvm.org/D96310	2021-02-15 11:17:32 -08:00
Caroline Concatto	8ca25dcbed	[LangRef] Increase size of title underline for experimental.vector.reverse	2021-02-15 15:19:26 +00:00
Caroline Concatto	9ea32f75fa	[CodeGen][SelectionDAG]Add new intrinsic experimental.vector.reverse This patch adds a new intrinsic experimental.vector.reduce that takes a single vector and returns a vector of matching type but with the original lane order reversed. For example: ``` vector.reverse(<A,B,C,D>) ==> <D,C,B,A> ``` The new intrinsic supports fixed and scalable vectors types. The fixed-width vector relies on shufflevector to maintain existing behaviour. Scalable vector uses the new ISD node - VECTOR_REVERSE. This new intrinsic is one of the named shufflevector intrinsics proposed on the mailing-list in the RFC at [1]. Patch by Paul Walker (@paulwalker-arm). [1] https://lists.llvm.org/pipermail/llvm-dev/2020-November/146864.html Differential Revision: https://reviews.llvm.org/D94883	2021-02-15 13:39:43 +00:00
Juneyoung Lee	b74bacff63	[LangRef] Update memory access ops to raise UB if ptrs are not well defined In the past, it was stated in D87994 that it is allowed to dereference a pointer that is partially undefined if all of its possible representations fit into a dereferenceable range. The motivation of the direction was to make a range analysis helpful for assuring dereferenceability. Even if a range analysis concludes that its offset is within bounds, the offset could still be partially undefined; to utilize the range analysis, this relaxation was necessary. https://groups.google.com/g/llvm-dev/c/2Qk4fOHUoAE/m/KcvYMEgOAgAJ has more context about this. However, this is currently blocking another optimization, which is annotating the noundef attribute for library functions' arguments. D95122 is the patch. Currently, there are quite a few library functions which cannot have noundef attached to its pointer argument because it can be transformed from load/store. For example, MemCpyOpt can convert stores into memset: ``` store p, i32 0 store (p+1), i32 0 // Since currently it is allowed for store to have partially undefined pointer.. -> memset(p, 0, 8) // memset cannot guarantee that its ptr argument is noundef. ``` A bigger problem is that this makes unclear which library functions are allowed to have 'noundef' and which functions aren't (e.g., strlen). This makes annotating noundef almost impossible for this kind of functions. This patch proposes that all memory operations should have well-defined pointers. For memset/memcpy, it is semantically equivalent to running a loop until the size is met (and branching on undef is UB), so the size is also updated to be well-defined. Strictly speaking, this again violates the implication of dereferenceability from range analysis result. However, I think this is okay for the following reasons: 1. It seems the existing analyses in the LLVM main repo does not have conflicting implementation with the new proposal. `isDereferenceableAndAlignedPointer` works only when the GEP offset is constant, and `isDereferenceableAndAlignedInLoop` is also fine. 2. A possible miscompilation happens only when the source has a pointer with a partially undefined offset (it's okay with poison because there is no 'partially poison' value). But, at least I'm not aware of a language using LLVM as backend that has a well-defined program while allowing partially undefined pointers. There might be such a language that I'm not aware of, but improving the performance of the mainstream languages like C and Rust is more important IMHO. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D95238	2021-02-13 14:13:19 +09:00
Vedant Kumar	97f4d2cb51	[docs/Coverage] Document -show-region-summary As a drive-by, fix the section in the clang docs about the number of statistics visible in a report.	2021-02-12 12:05:45 -08:00
Akira Hatanaka	078c441e76	[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR Background: This fixes a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.attachedcall" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if claimRV is attached to the call since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since the ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if retainRV is attached to the call and does nothing if claimRV is attached to it. - SCCP refrains from replacing the return value of a call with a constant value if the call has the operand bundle. This ensures the call always has at least one user (the call to @llvm.objc.clang.arc.noop.use). - This patch also fixes a bug in replaceUsesOfNonProtoConstant where multiple operand bundles of the same kind were being added to a call. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-02-12 09:51:57 -08:00
Sjoerd Meijer	dbbe3c7ed5	[MIRLangRef] Document MachineOperand comments Late follow-up of D74306 to document MachineOperand comments in MIRLangRef. Differential Revision: https://reviews.llvm.org/D96518	2021-02-12 10:15:47 +00:00
Kristof Beyls	ceb25a320b	[DeveloperPolicy] Specify LLVM's license more clearly. Before, the first mention of LLVM's license on the developer policy page stated that LLVM's license is Apache 2. This patch makes that more accurate by mentioning the LLVM exception this first time the LLVM license is discussed on that page, i.e. Apache-2.0 with LLVM-exception. Technically, the correct SPDX identifier for LLVM's license is 'Apache-2.0 WITH LLVM-exception', but I thought that writing the 'WITH' in lower case made the paragraph easier to read without reducing clarity. Differential Revision: https://reviews.llvm.org/D96482	2021-02-12 09:16:43 +00:00
Guillaume Chatelet	92238fe870	Fix incorrect indentation in LangRef.rst	2021-02-11 20:47:43 +00:00
Guillaume Chatelet	d7077079f8	Fix incorrect indentation in LangRef.rst	2021-02-11 20:34:19 +00:00
Guillaume Chatelet	d83a14c77d	Encode alignment attribute for `cmpxchg` This is a follow up patch to D83136 adding the align attribute to `cmpxchg`. See also D83465 for `atomicrmw`. Differential Revision: https://reviews.llvm.org/D87443	2021-02-11 15:17:50 -05:00
Guillaume Chatelet	1d36d2af13	Encode alignment attribute for `atomicrmw` This is a follow up patch to D83136 adding the align attribute to `atomicwmw`. Differential Revision: https://reviews.llvm.org/D83465	2021-02-11 15:17:37 -05:00
xgupta	0a80a6fb48	[Draft] [examples] Move llvm/examples/OCaml-Kaleidoscope/ to llvm-archive	2021-02-11 06:52:24 +05:30
Arthur Eubanks	edd40c02bf	[docs] Make clearer in WritingAnLLVMPass that the legacy PM isn't the default Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D96452	2021-02-10 15:26:25 -08:00
Nico Weber	7e84b293f2	Revert "[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly" This reverts commit 4a64d8fe392449b205e59031aad5424968cf7446. Makes clang crash when buildling trivial iOS programs, see comment after https://reviews.llvm.org/D92808#2551401	2021-02-09 11:06:32 -05:00
Zakk Chen	a9bf54fece	[Docs] Fix Typo	2021-02-08 23:45:32 -08:00
Akira Hatanaka	52b9cd0c02	[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly emitting retainRV or claimRV calls in the IR This reapplies 3fe3946d9a958b7af6130241996d9cfcecf559d4 without the changes made to lib/IR/AutoUpgrade.cpp, which was violating layering. Original commit message: Background: This patch makes changes to the front-end and middle-end that are needed to fix a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.rv" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if the call is annotated with claimRV since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if the implicit call is a call to retainRV and does nothing if it's a call to claimRV. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls annotated with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-02-05 06:09:42 -08:00
Akira Hatanaka	9b804d4398	Revert "[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly" This reverts commit 3fe3946d9a958b7af6130241996d9cfcecf559d4. The commit violates layering by including a header from Analysis in lib/IR/AutoUpgrade.cpp.	2021-02-05 06:00:05 -08:00
Akira Hatanaka	b1b35f383e	[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly emitting retainRV or claimRV calls in the IR Background: This patch makes changes to the front-end and middle-end that are needed to fix a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.rv" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if the call is annotated with claimRV since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if the implicit call is a call to retainRV and does nothing if it's a call to claimRV. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls annotated with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-02-05 05:55:18 -08:00
Adrian Prantl	81c0c6d2aa	Remove overzealous verifier check on DW_OP_LLVM_entry_value and improve the documentation Based on the comments in the code, the idea is that AsmPrinter is unable to produce entry value blocks of arbitrary length, such as DW_OP_entry_value [DW_OP_reg5 DW_OP_lit1 DW_OP_plus]. But the way the Verifier check is written it also disallows DW_OP_entry_value [DW_OP_reg5] DW_OP_lit1 DW_OP_plus which seems to overshoot the target. Note that this patch does not change any of the safety guards in LiveDebugValues — there is zero behavior change for clang. It just allows us to legalize more complex expressions in future patches. rdar://73907559 Differential Revision: https://reviews.llvm.org/D95990	2021-02-04 10:58:35 -08:00
Arthur Eubanks	213c28c3fc	[NewPM][HelloWorld] Move HelloWorld to Utils To prevent creating a new component, which creates a new library. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D95907	2021-02-03 12:59:40 -08:00
Abhina Sreeskantharajan	e6f220185d	[test] Use host platform specific error message substitution in lit tests - continued On z/OS, other error messages are not matched correctly in lit tests. ``` EDC5121I Invalid argument. EDC5111I Permission denied. ``` This patch adds a lit substitution to fix it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D95808	2021-02-03 09:53:22 -05:00
Arthur Eubanks	67b774212d	[Docs] Update HowToSubmitABug With the new PM imminent, bugpoint will diverge from opt, meaning it may not reproduce a crash with the same arguments passed to opt. We need to specify alternatives to bugpoint for reducing crashes. I looked at the rest of the document to see if anything could be improved. Major highlights: * Run -Xclang -disable-llvm-passes instead of -O0 for skipping IR passes * Mention the files that clang dumps on a crash * Remove outdated reference to `delta` and plug `creduce` instead * Mention llvm-reduce on top of bugpoint * Mention --print-before-all --print-module-scope * Mention sanitizers in addition to valgrind * Mention opt-bisect for miscompiles Reviewed By: fhahn, MaskRay Differential Revision: https://reviews.llvm.org/D95578	2021-02-02 21:19:20 -08:00
Sebastian Neubauer	32c7642ef3	[AMDGPU] Clarify calling conv about inactive lanes So far, it was not specified what happens with the VGPRs of inactive lanes when functions are called. This patch explicitely mentions that the VGPR values of inactive lanes need to be preserved for all registers. This describes the current behavior, as only active lanes of registers are saved to scratch. Also, as the multi-lane nature of VGPRs is not properly modeled, we cannot determine the live VGPRs from inactive lanes at calls. So we cannot save them, even if we intended to do so. Differential Revision: https://reviews.llvm.org/D95610	2021-02-02 10:15:09 +01:00
xgupta	ddd3adad6f	[Branch-Rename] Fix some links According to the [[ https://foundation.llvm.org/docs/branch-rename/ \| status of branch rename ]], the master branch of the LLVM repository is removed on 28 Jan 2021. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95766	2021-02-01 16:43:21 +05:30

... 2 3 4 5 6 ...

8878 Commits