llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Kristof Beyls	d157c20ece	[docs] Add Windows/COFF call info	2021-04-12 17:11:25 +02:00
Sushma Unnibhavi	86f9f72f44	Typo fix Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D100254	2021-04-11 12:24:27 +05:30
Sushma Unnibhavi	2e031186a6	Missing syntax highlighting for LLVM IR in Langref Added syntax highlighting Differential Revision: https://reviews.llvm.org/D100125	2021-04-11 12:19:58 +05:30
Paul C. Anagnostopoulos	2a3388364d	[TableGen] [docs] Change title of tblgen.rst to fix man page filename	2021-04-09 09:37:56 -04:00
Konstantin Zhuravlyov	ec8823f099	AMDGPU: Add gfx90c support to code object v2 for backwards compatibility Differential Revision: https://reviews.llvm.org/D100126	2021-04-08 16:42:43 -04:00
Paul C. Anagnostopoulos	16f0e00ff4	Revert "[TableGen] Add support for the 'assert' statement in multiclasses" This reverts commit 3b9a15d910a8c748b1444333a4a3905a996528bc.	2021-04-08 13:58:58 -04:00
Paul C. Anagnostopoulos	282eb5170a	[TableGen] Add support for the 'assert' statement in multiclasses	2021-04-08 08:36:03 -04:00
Philip Reames	b81ddb9786	[docs] Document our norms around reverts This has come up a few times recently, and I was surprised to notice that we don't have anything in the docs. This patch deliberately sticks to stuff that is uncontroversial in the community. Everything herein is thought to be widely agreed to by a large majority of the community. A few things were noted and removed in review which failed this standard, if you spot anything else, please point it out. Differential Revision: https://reviews.llvm.org/D99305	2021-04-07 21:02:19 -07:00
Tony Tye	2ebd0d6fb1	[NFC][AMDGPU] Correct indentation in AMDGPUUsage.rst Correct indentation that results in rST syntax error.	2021-04-08 01:00:13 +00:00
Tony Tye	36036d99fe	[AMDGPU] Update gfx90a memory model support Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D100070	2021-04-07 22:17:58 +00:00
Paul C. Anagnostopoulos	8c9238aaa1	[TableGen] [docs] Correct a couple of mistakes; use 'true' and 'false' in examples Differential Revision: https://reviews.llvm.org/D99800	2021-04-05 09:15:58 -04:00
Nikita Popov	5794a3620e	[FastISel] Remove kill tracking This is a followup to D98145: As far as I know, tracking of kill flags in FastISel is just a compile-time optimization. However, I'm not actually seeing any compile-time regression when removing the tracking. This probably used to be more important in the past, before FastRA was switched to allocate instructions in reverse order, which means that it discovers kills as a matter of course. As such, the kill tracking doesn't really seem to serve a purpose anymore, and just adds additional complexity and potential for errors. This patch removes it entirely. The primary changes are dropping the hasTrivialKill() method and removing the kill arguments from the emitFast methods. The rest is mechanical fixup. Differential Revision: https://reviews.llvm.org/D98294	2021-04-03 15:50:13 +02:00
Paul C. Anagnostopoulos	f346298b33	[TableGen] [Docs] Add lldb-tblgen to command guide; add 4 guide stubs Differential Revision: https://reviews.llvm.org/D99605	2021-04-02 09:52:16 -04:00
Tony	99c3eaf30c	[NFC][AMDGPU] Add product names for gfx908 and gfx10 processors Reviewed By: msearles Differential Revision: https://reviews.llvm.org/D99781	2021-04-02 00:58:11 +00:00
Jon Roelofs	abd11b2935	[docs] Fix up dead clang-format links after monorepo move. NFC	2021-03-30 14:29:35 -07:00
oToToT	2752ef7d4d	[Docs] Update googletest docs link. The documentation link of Google Test on GitHub have been moved to the top-level docs directory. Thus, the original link is invalid now. Reviewed By: Pavel Labath Differential Revision: https://reviews.llvm.org/D99559	2021-03-30 23:20:23 +08:00
Krasimir Georgiev	72bca5f483	Revert "[Passes] Add relative lookup table converter pass" This reverts commit 5178ffc7cf92527557ae16e86d0fa90d538c2a19. Compiling `llvm-profdata` with a compiler build from this produces a crashing binary.	2021-03-30 14:13:37 +02:00
Nuno Lopes	695aa28456	[docs] remove references to checking out svn repos	2021-03-30 10:00:31 +01:00
Tim Renouf	2d9a94e442	[AMDGPU] Update AMDGPU PAL usage documentation Change-Id: I65f3edcfe5063551cad5aab0da1374c3a6ccd3a2	2021-03-30 08:33:18 +01:00
Gulfem Savrun Yeniceri	abf79b4a39	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-29 21:53:32 +00:00
Paul C. Anagnostopoulos	128e39dc70	[TableGen] Add support for the 'assert' statement in class definitions. Differential Revision: https://reviews.llvm.org/D99275	2021-03-29 09:20:29 -04:00
Matt Arsenault	efe6c99b52	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 07e46367baeca96d84b03fa215b41775f69d5989.	2021-03-29 08:55:30 -04:00
Oliver Stannard	0914bea32c	Revert "Reapply "OpaquePtr: Turn inalloca into a type attribute"" Reverting because test 'Bindings/Go/go.test' is failing on most buildbots. This reverts commit fc9df309917e57de704f3ce4372138a8d4a23d7a.	2021-03-29 11:32:22 +01:00
Matt Arsenault	403cadc380	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 20d5c42e0ef5d252b434bcb610b04f1cb79fe771.	2021-03-28 13:35:21 -04:00
Nico Weber	755e1b95c9	Revert "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 4fefed65637ec46c8c2edad6b07b5569ac61e9e5. Broke check-clang everywhere.	2021-03-28 13:02:52 -04:00
Zakk Chen	56db174a0b	[RISCV][Clang] Update new overloading rules for RVV intrinsics. RVV intrinsics has new overloading rule, please see `82aac7dad4` Changed: 1. Rename `generic` to `overloaded` because the new rule is not using C11 generic. 2. Change HasGeneric to HasNoMaskedOverloaded because all masked operations support overloading api. 3. Add more overloaded tests due to overloading rule changed. Differential Revision: https://reviews.llvm.org/D99189	2021-03-28 09:04:35 -07:00
Matt Arsenault	9b63996812	OpaquePtr: Turn inalloca into a type attribute I think byval/sret and the others are close to being able to rip out the code to support the missing type case. A lot of this code is shared with inalloca, so catch this up to the others so that can happen.	2021-03-28 11:12:23 -04:00
George Burgess IV	f2c39381e1	docs: Adding Google representative to the security group This adds me as a Google representative for the LLVM security group. This was proposed, discussed, and voted on in the differential revision linked below; please see it for more information. Differential Revision: https://reviews.llvm.org/D99232	2021-03-26 18:55:37 -07:00
Tony	382c4642c9	[NFC][AMDGPU] Corrections to AMD GPU initial kernel launch documentation Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D99223	2021-03-26 02:05:45 +00:00
Amara Emerson	61b28d4f51	[GlobalISel] Add G_ROTR and G_ROTL opcodes for rotates. Differential Revision: https://reviews.llvm.org/D99383	2021-03-25 17:23:30 -07:00
Djordje Todorovic	29c99e8499	[Debugify] Expose original debug info preservation check as CC1 option In order to test the preservation of the original Debug Info metadata in your projects, a front end option could be very useful, since users usually report that a concrete entity (e.g. variable x, or function fn2()) is missing debug info. The [0] is an example of running the utility on GDB Project. This depends on: D82546 and D82545. Differential Revision: https://reviews.llvm.org/D82547	2021-03-25 05:29:42 -07:00
Gulfem Savrun Yeniceri	54e2d4cdab	Revert "[Passes] Add relative lookup table converter pass" This reverts commit 5fd001a5ffbad403053c4a06bf4b2b76dc52bba8 because it broke clang-with-thin-lto-ubuntu bot.	2021-03-24 18:59:33 +00:00
Gulfem Savrun Yeniceri	93b265f8c0	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-24 17:31:18 +00:00
Vinicius Tinti	046d087d1d	[llvm-objdump] Implement --prefix-strip option The option `--prefix-strip` is only used when `--prefix` is not empty. It removes N initial directories from absolute paths before adding the prefix. This matches GNU's objdump behavior. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D96679	2021-03-24 13:22:35 +00:00
Andrew Savonichev	182b0cd903	[MCA] Disable RCU for InOrderIssueStage This is a follow-up for: D98604 [MCA] Ensure that writes occur in-order When instructions are aligned by the order of writes, they retire in-order naturally. There is no need for an RCU, so it is disabled. Differential Revision: https://reviews.llvm.org/D98628	2021-03-24 13:54:04 +03:00
Bruno Cardoso Lopes	054affa82b	[CGAtomic] Lift stronger requirements on cmpxch and support acquire failure mode - Fix `emitAtomicCmpXchgFailureSet` to support release/acquire (succ/fail) memory order. - Remove stronger checks for cmpxch. Effectively, this addresses http://wg21.link/p0418 Differential Revision: https://reviews.llvm.org/D98995	2021-03-23 16:45:37 -07:00
Tony	da027401bd	[NFC][AMDGPU] Reserve AMD GPU ELF machine number 0x41 Reviewed By: foad Differential Revision: https://reviews.llvm.org/D99196	2021-03-23 17:53:02 +00:00
Fraser Cormack	7850d98674	[LangRef] Fix typos in the vector-type memory layout section Reviewed By: bjope Differential Revision: https://reviews.llvm.org/D99163	2021-03-23 12:28:50 +00:00
David Sherwood	42a72164a2	[IR][SVE] Add new llvm.experimental.stepvector intrinsic This patch adds a new llvm.experimental.stepvector intrinsic, which takes no arguments and returns a linear integer sequence of values of the form <0, 1, ...>. It is primarily intended for scalable vectors, although it will work for fixed width vectors too. It is intended that later patches will make use of this new intrinsic when vectorising induction variables, currently only supported for fixed width. I've added a new CreateStepVector method to the IRBuilder, which will generate a call to this intrinsic for scalable vectors and fall back on creating a ConstantVector for fixed width. For scalable vectors this intrinsic is lowered to a new ISD node called STEP_VECTOR, which takes a single constant integer argument as the step. During lowering this argument is set to a value of 1. The reason for this additional argument at the codegen level is because in future patches we will introduce various generic DAG combines such as mul step_vector(1), 2 -> step_vector(2) add step_vector(1), step_vector(1) -> step_vector(2) shl step_vector(1), 1 -> step_vector(2) etc. that encourage a canonical format for all targets. This hopefully means all other targets supporting scalable vectors can benefit from this too. I've added cost model tests for both fixed width and scalable vectors: llvm/test/Analysis/CostModel/AArch64/neon-stepvector.ll llvm/test/Analysis/CostModel/AArch64/sve-stepvector.ll as well as codegen lowering tests for fixed width and scalable vectors: llvm/test/CodeGen/AArch64/neon-stepvector.ll llvm/test/CodeGen/AArch64/sve-stepvector.ll See this thread for discussion of the intrinsic: https://lists.llvm.org/pipermail/llvm-dev/2021-January/147943.html	2021-03-23 10:43:35 +00:00
Tony	7be40b4abf	[AMDGPU] Reserve ELF code Reserve AMD GPU ELF machine code 0x040. Minor AMDGPUUsage format consistency change. Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D99122	2021-03-23 04:30:38 +00:00
Gulfem Savrun Yeniceri	61bfb34ac2	Revert "[Passes] Add relative lookup table converter pass" This reverts commit 78a65cd945d006ff02f9d24d9cc20a302ed93b08 which caused buildbot failures.	2021-03-23 00:43:16 +00:00
Gulfem Savrun Yeniceri	947cc1dce8	[doc] Fix typo in rel lookup table converter pass Add additonal hypens to match the title size that was introduced in 78a65cd.	2021-03-22 23:25:06 +00:00
Gulfem Savrun Yeniceri	59cc51764b	[Passes] Add relative lookup table converter pass Lookup tables generate non PIC-friendly code, which requires dynamic relocation as described in: https://bugs.llvm.org/show_bug.cgi?id=45244 This patch adds a new pass that converts lookup tables to relative lookup tables to make them PIC-friendly. Differential Revision: https://reviews.llvm.org/D94355	2021-03-22 22:09:02 +00:00
Bradley Smith	839304c777	[IR] Add vscale_range IR function attribute This attribute represents the minimum and maximum values vscale can take. For now this attribute is not hooked up to anything during codegen, this will be added in the future when such codegen is considered stable. Additionally hook up the -msve-vector-bits=<x> clang option to emit this attribute. Differential Revision: https://reviews.llvm.org/D98030	2021-03-22 12:05:06 +00:00
Kristof Beyls	917f15dbfc	[docs] GettingInvolved: split out flang and openmp meeting series Split out the flang and openmp meeting series, as each has a separate canonical page where the information is maintained. As part of that, also call out the alias analysis series separately as it doesn't seem to be relevant for just flang. Differential Revision: https://reviews.llvm.org/D99012	2021-03-22 09:25:57 +01:00
Jessica Paquette	ae291b6dfb	[GlobalISel] Add G_SBFX + G_UBFX (bitfield extraction opcodes) There is a bunch of similar bitfield extraction code throughout *ISelDAGToDAG. E.g, ARMISelDAGToDAG, AArch64ISelDAGToDAG, and AMDGPUISelDAGToDAG all contain code that matches a bitfield extract from an and + right shift. Rather than duplicating code in the same way, this adds two opcodes: - G_UBFX (unsigned bitfield extract) - G_SBFX (signed bitfield extract) They work like this ``` %x = G_UBFX %y, %lsb, %width ``` Where `lsb` and `width` are - The least-significant bit of the extraction - The width of the extraction This will extract `width` bits from `%y`, starting at `lsb`. G_UBFX zero-extends the result, while G_SBFX sign-extends the result. This should allow us to use the combiner to match the bitfield extraction patterns rather than duplicating pattern-matching code in each target. Differential Revision: https://reviews.llvm.org/D98464	2021-03-19 14:37:19 -07:00
Bjorn Pettersson	13603c344c	[LangRef] Describe memory layout for vectors types There are a couple of caveats when it comes to how vectors are stored to memory, and thereby also how bitcast between vector and integer types work, in LLVM IR. Specially in relation to endianess. This patch is an attempt to document such things. Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D94964	2021-03-19 19:00:37 +01:00
Christian Kühnel	b512b2d48c	propose Chocolately as package manager Installing the Unix tools on Windows is quite painful. To make things easier, I explained how to use a package manager or a Docker image. Note: This still uses the GNUWin tools as explained on this page. Once we replace these with something else, we would also need to update the installation commands. Differential Revision: https://reviews.llvm.org/D97387	2021-03-19 16:15:18 +01:00
Paul C. Anagnostopoulos	f8fbe9eb04	[TableGen] Improve handling of template arguments This requires changes to TableGen files and some C++ files due to incompatible multiclass template arguments that slipped through before the improved handling.	2021-03-19 09:57:53 -04:00
Jeroen Dobbelaere	13605b24cd	Support intrinsic overloading on unnamed types This patch adds support for intrinsic overloading on unnamed types. This fixes PR38117 and PR48340 and will also be needed for the Full Restrict Patches (D68484). The main problem is that the intrinsic overloading name mangling is using 's_s' for unnamed types. This can result in identical intrinsic mangled names for different function prototypes. This patch changes this by adding a '.XXXXX' to the intrinsic mangled name when at least one of the types is based on an unnamed type, ensuring that we get a unique name. Implementation details: - The mapping is created on demand and kept in Module. - It also checks for existing clashes and recycles potentially existing prototypes and declarations. - Because of extra data in Module, Intrinsic::getName needs an extra Module* argument and, for speed, an optional FunctionType* argument. - I still kept the original two-argument 'Intrinsic::getName' around which keeps the original behavior (providing the base name). -- Main reason is that I did not want to change the LLVMIntrinsicGetName version, as I don't know how acceptable such a change is -- The current situation already has a limitation. So that should not get worse with this patch. - Intrinsic::getDeclaration and the verifier are now using the new version. Other notes: - As far as I see, this should not suffer from stability issues. The count is only added for prototypes depending on at least one anonymous struct - The initial count starts from 0 for each intrinsic mangled name. - In case of name clashes, existing prototypes are remembered and reused when that makes sense. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D91250	2021-03-19 14:34:25 +01:00
Kristof Beyls	9d15a1dd6c	[docs] Add calendar info for SVE sync-ups	2021-03-19 10:27:34 +01:00
Kristof Beyls	1c7cdd5117	[docs] Document regular LLVM sync-ups This documents current regular LLVM sync-ups that are happening in the Getting Involved section. I hope this gives a bit more visibility to regular sync-ups that are happening in the LLVM community, documenting another way communication in the community happens. Of course the downside is that this is another location that sync-up metadata needs to be maintained. That being said, the structure as proposed means that no changes are needed once a new sync-up is added, apart from maybe removing the entry once it becomes clear that that particular sync-up series is completely cancelled. Documenting a few pointers on how current sync-ups happen may also encourage others to organize useful sync-ups on specific topics. I've started with adding the sync-ups I'm aware of. There's a good chance I've missed some. If most sync-ups end up having a public google calendar, we could also create and maintain a public google calendar that shows all events happening in the LLVM community, including dev meetings, sync-ups, socials, etc - assuming that would be valuable. Differential Revision: https://reviews.llvm.org/D98797	2021-03-18 18:32:27 +01:00
Vaivaswatha Nagaraj	062bb2a6ef	[Docs] Mention linking to reviews page when committing Differential Revision: https://reviews.llvm.org/D98695	2021-03-16 23:04:22 +05:30
Fangrui Song	d3961f8ad2	[llvm-nm] Add --format=just-symbols and make --just-symbol-name its alias https://sourceware.org/bugzilla/show_bug.cgi?id=27487 binutils will have --format=just-symbols/-j as well. Arbitrarily prefer `-j` to `--format=sysv`. Previously `--format=sysv -j` prints in the sysv format while `-j` takes precedence over other formats. Differential Revision: https://reviews.llvm.org/D98569	2021-03-16 10:07:01 -07:00
David Zarzycki	643090aa23	[lit] Sort test start times based on prior test timing data Lit as it exists today has three hacks that allow users to run tests earlier: 1) An entire test suite can set the `is_early` boolean. 2) A very recently introduced "early_tests" feature. 3) The `--incremental` flag forces failing tests to run first. All of these approaches have problems. 1) The `is_early` feature was until very recently undocumented. Nevertheless it still lacks testing and is a imprecise way of optimizing test starting times. 2) The `early_tests` feature requires manual updates and doesn't scale. 3) `--incremental` is undocumented, untested, and it requires modifying the source file system by "touching" the file. This "touch" based approach is arguably a hack because it confuses editors (because it looks like the test was modified behind the back of the editor) and "touching" the test source file doesn't work if the test suite is read only from the perspective of `lit` (via advanced filesystem/build tricks). This patch attempts to simplify and address all of the above problems. This patch formalizes, documents, tests, and defaults lit to recording the execution time of tests and then reordering all tests during the next execution. By reordering the tests, high core count machines run faster, sometimes significantly so. This patch also always runs failing tests first, which is a positive user experience win for those that didn't know about the hidden `--incremental` flag. Finally, if users want, they can _optionally_ commit the test timing data (or a subset thereof) back to the repository to accelerate bots and first-time runs of the test suite. Reviewed By: jhenderson, yln Differential Revision: https://reviews.llvm.org/D98179	2021-03-16 05:23:04 -04:00
Thomas Preud'homme	0911193c17	[FileCheck] Add support for hex alternate form in FileCheck Add printf-style alternate form flag to prefix hex number with 0x when present. This works on both empty numeric expression (e.g. variable definition from input) and when matching a numeric expression. The syntax is as follows: [[#%#<precision specifier><format specifier>, ...] where <precision specifier> and <format specifier> are optional and ... can be a variable definition or not with an empty expression or not. This feature was requested in https://reviews.llvm.org/D81144#2075532 for llvm/test/MC/ELF/gen-dwarf64.s Reviewed By: jdenny Differential Revision: https://reviews.llvm.org/D97845	2021-03-12 18:14:17 +00:00
David Green	18fc27f084	[ARM] Improve WLS lowering Recently we improved the lowering of low overhead loops and tail predicated loops, but concentrated first on the DLS do style loops. This extends those improvements over to the WLS while loops, improving the chance of lowering them successfully. To do this the lowering has to change a little as the instructions are terminators that produce a value - something that needs to be treated carefully. Lowering starts at the Hardware Loop pass, inserting a new llvm.test.start.loop.iterations that produces both an i1 to control the loop entry and an i32 similar to the llvm.start.loop.iterations intrinsic added for do loops. This feeds into the loop phi, properly gluing the values together: %wls = call { i32, i1 } @llvm.test.start.loop.iterations.i32(i32 %div) %wls0 = extractvalue { i32, i1 } %wls, 0 %wls1 = extractvalue { i32, i1 } %wls, 1 br i1 %wls1, label %loop.ph, label %loop.exit ... loop: %lsr.iv = phi i32 [ %wls0, %loop.ph ], [ %iv.next, %loop ] .. %iv.next = call i32 @llvm.loop.decrement.reg.i32(i32 %lsr.iv, i32 1) %cmp = icmp ne i32 %iv.next, 0 br i1 %cmp, label %loop, label %loop.exit The llvm.test.start.loop.iterations need to be lowered through ISel lowering as a pair of WLS and WLSSETUP nodes, which each get converted to t2WhileLoopSetup and t2WhileLoopStart Pseudos. This helps prevent t2WhileLoopStart from being a terminator that produces a value, something difficult to control at that stage in the pipeline. Instead the t2WhileLoopSetup produces the value of LR (essentially acting as a lr = subs rn, 0), t2WhileLoopStart consumes that lr value (the Bcc). These are then converted into a single t2WhileLoopStartLR at the same point as t2DoLoopStartTP and t2LoopEndDec. Otherwise we revert the loop to prevent them from progressing further in the pipeline. The t2WhileLoopStartLR is a single instruction that takes a GPR and produces LR, similar to the WLS instruction. %1:gprlr = t2WhileLoopStartLR %0:rgpr, %bb.3 t2B %bb.1 ... bb.2.loop: %2:gprlr = PHI %1:gprlr, %bb.1, %3:gprlr, %bb.2 ... %3:gprlr = t2LoopEndDec %2:gprlr, %bb.2 t2B %bb.3 The t2WhileLoopStartLR can then be treated similar to the other low overhead loop pseudos, eventually being lowered to a WLS providing the branches are within range. Differential Revision: https://reviews.llvm.org/D97729	2021-03-11 17:56:19 +00:00
Djordje Todorovic	1e88deac13	[Debugify][OriginalDIMode] Export the report into JSON file By using the original-di check with debugify in the combination with the llvm/utils/llvm-original-di-preservation.py it becomes very user friendly tool. An example of the HTML page with the issues related to debug info can be found at [0]. [0] https://djolertrk.github.io/di-checker-html-report-example/ Differential Revision: https://reviews.llvm.org/D82546	2021-03-11 01:11:13 -08:00
Zakk Chen	da49c75c2f	[Clang][RISCV] Add custom TableGen backend for riscv-vector intrinsics. Demonstrate how to generate vadd/vfadd intrinsic functions 1. add -gen-riscv-vector-builtins for clang builtins. 2. add -gen-riscv-vector-builtin-codegen for clang codegen. 3. add -gen-riscv-vector-header for riscv_vector.h. It also generates ifdef directives with extension checking, base on D94403. 4. add -gen-riscv-vector-generic-header for riscv_vector_generic.h. Generate overloading version Header for generic api. https://github.com/riscv/rvv-intrinsic-doc/blob/master/rvv-intrinsic-rfc.md#c11-generic-interface 5. update tblgen doc for riscv related options. riscv_vector.td also defines some unused type transformers for vadd, because I think it could demonstrate how tranfer type work and we need them for the whole intrinsic functions implementation in the future. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: jrtc27, craig.topper, HsiangKai, Jim, Paul-C-Anagnostopoulos Differential Revision: https://reviews.llvm.org/D95016	2021-03-10 18:43:43 -08:00
Christudasan Devadasan	aa8030a6bf	GlobalISel: Try to combine G_[SU]DIV and G_[SU]REM It is good to have a combined `divrem` instruction when the `div` and `rem` are computed from identical input operands. Some targets can lower them through a single expansion that computes both division and remainder. It effectively reduces the number of instructions than individually expanding them. Reviewed By: arsenm, paquette Differential Revision: https://reviews.llvm.org/D96013	2021-03-10 18:46:07 +05:30
Yao Zhao	47a5a13f53	[xray] Fix xray document spelling fix a couple of words spelling Reviewed By: dberris Differential Revision: https://reviews.llvm.org/D96658	2021-03-10 16:03:55 +11:00
Cullen Rhodes	6682076a17	[IR] Introduce llvm.experimental.vector.splice intrinsic This patch introduces a new intrinsic @llvm.experimental.vector.splice that constructs a vector of the same type as the two input vectors, based on a immediate where the sign of the immediate distinguishes two variants. A positive immediate specifies an index into the first vector and a negative immediate specifies the number of trailing elements to extract from the first vector. For example: @llvm.experimental.vector.splice(<A,B,C,D>, <E,F,G,H>, 1) ==> <B, C, D, E> ; index @llvm.experimental.vector.splice(<A,B,C,D>, <E,F,G,H>, -3) ==> <B, C, D, E> ; trailing element count These intrinsics support both fixed and scalable vectors, where the former is lowered to a shufflevector to maintain existing behaviour, although while marked as experimental the recommended way to express this operation for fixed-width vectors is to use shufflevector. For scalable vectors where it is not possible to express a shufflevector mask for this operation, a new ISD node has been implemented. This is one of the named shufflevector intrinsics proposed on the mailing-list in the RFC at [1]. Patch by Paul Walker and Cullen Rhodes. [1] https://lists.llvm.org/pipermail/llvm-dev/2020-November/146864.html Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D94708	2021-03-09 10:44:22 +00:00
Alexander Shaposhnikov	4904bec77b	[docs] Fix llvm-objcopy.rst Adjust the title underline, NFC.	2021-03-08 19:06:32 -08:00
Alexander Shaposhnikov	98aa5107b5	[llvm-objcopy][MachO] Add support for --keep-undefined This diff introduces --keep-undefined in llvm-objcopy/llvm-strip for Mach-O which makes the tools preserve undefined symbols. Test plan: make check-all Differential revision: https://reviews.llvm.org/D97040	2021-03-08 18:57:25 -08:00
Alexander Shaposhnikov	97b192e85f	[llvm-objdump][MachO] Add support for dumping function starts Add support for dumping function starts for Mach-O binaries. Test plan: make check-all Differential revision: https://reviews.llvm.org/D97027	2021-03-08 18:44:44 -08:00
Juneyoung Lee	8d4adfa7a5	[LangRef] mention that the lifetime intrinsics' description in LangRef isn't everything This is a minor patch that addresses concerns about lifetime in D94002. We need to mention that what's written in LangRef isn't everything about lifetime.start/end and its semantics depends on the stack coloring algorithm's pattern matching of a stack pointer. If the stack coloring algorithm cannot conclude that a pointer is a stack-allocated object, the pointer is conservatively considered as a non-stack one because stack coloring won't take this lifetime into account while assigning addresses. A reference from alloca to lifetime.start/end is added as well. Differential Revision: https://reviews.llvm.org/D98112	2021-03-09 11:33:36 +09:00
Ben Dunbobbin	761a3287bd	Reland: [Docs][Windows Itanium] Add a How-To document for Windows Itanium. This is a basic How-To that describes: - What Windows Itanium is. - How to assemble a build environment. Differential Revision: https://reviews.llvm.org/D89518	2021-03-09 01:36:34 +00:00
Tony	a686697492	[NFC][AMDGPU] Correct typo in DWARF Extensions For Heterogeneous Debugging A note in the defintion of DW_OP_piece had an incomplete sentence. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D98157	2021-03-09 00:23:23 +00:00
Rahman Lavaee	2c790c2f9f	[llvm-readelf] Support dumping the BB address map section with --bb-addr-map. This patch lets llvm-readelf dump the content of the BB address map section in the following format: ``` Function { At: <address> BB entries [ { Offset: <offset> Size: <size> Metadata: <metadata> }, ... ] } ... ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D95511	2021-03-08 16:20:11 -08:00
Ben Dunbobbin	00a8ffd3fa	Revert "[Docs][Windows Itanium] Add a How-To document for Windows Itanium." This reverts commit 5a91d23ddfb2effd471b919241d1ef80bf1a4c9d. Markup was incorrect.	2021-03-08 23:57:27 +00:00
Ben Dunbobbin	70e9df61a5	[Docs][Windows Itanium] Add a How-To document for Windows Itanium. This is a basic How-To that describes: - What Windows Itanium is. - How to assemble a build environment. Differential Revision: https://reviews.llvm.org/D89518	2021-03-08 23:48:51 +00:00
Keith Smiley	30b780a55a	llvm-nm: add flag to suppress no symbols warning This spelling matches binutils https://sourceware.org/bugzilla/show_bug.cgi?id=27408 Differential Revision: https://reviews.llvm.org/D83152	2021-03-07 16:20:13 -08:00
Tony	4f31275f73	[NFC][AMDGPU] DWARF Extensions For Heterogeneous Debugging clarifications Clarify that the base type endianity is used when creating implicit location storage. Remove duplicate definition of the generic type. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D98137	2021-03-07 18:34:17 +00:00
Tony	b0eb76b4ee	[NFC][AMDGPU]DWARF Extensions For Heterogeneous Debugging generic type endianity In "DWARF Extensions For Heterogeneous Debugging" document that the DWARF generic type has a target architecture defined endianity. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D98126	2021-03-07 04:51:05 +00:00
Juneyoung Lee	5caecc36c3	[LangRef] dos2unix (NFC)	2021-03-06 18:44:40 +09:00
gbtozers	c52cf11f42	[DebugInfo] Add DIArgList MD to store multple values in DbgVariableIntrinsics This patch adds a new metadata node, DIArgList, which contains a list of SSA values. This node is in many ways similar in function to the existing ValueAsMetadata node, with the difference being that it tracks a list instead of a single value. Internally, it uses ValueAsMetadata to track the individual values, but there is also a reasonable amount of DIArgList-specific value-tracking logic on top of that. Similar to ValueAsMetadata, it is a special case in parsing and printing due to the fact that it requires a function state (as it may reference function-local values). This patch should not result in any immediate functional change; it allows for DIArgLists to be parsed and printed, but debug variable intrinsics do not yet recognize them as a valid argument (outside of parsing). Differential Revision: https://reviews.llvm.org/D88175	2021-03-05 17:02:24 +00:00
Stephen Tozer	e0cb677eb6	Reapply "[DebugInfo] Add new instruction and DIExpression operator for variadic debug values" Rewrites test to use correct architecture triple; fixes incorrect reference in SourceLevelDebugging doc; simplifies `spillReg` behaviour so as to not be dependent on changes elsewhere in the patch stack. This reverts commit d2000b45d033c06dc7973f59909a0ad12887ff51.	2021-03-05 12:32:05 +00:00
Juneyoung Lee	5f3a69dfff	[LangRef] lifetime intrinsics: don't use word 'offset' from Philip's comments	2021-03-05 12:53:13 +09:00
Philip Reames	5a592bf8e6	[docs] Remove some stale wording from gc.relocate description We dropped support for the non-bundle form a while back, but I apparently missed updating one place in the docs.	2021-03-04 15:18:11 -08:00
Philip Reames	42428235cc	[docs] Move statepoint related intrinsics into main LangRef	2021-03-04 15:13:27 -08:00
Akira Hatanaka	4055195f29	[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR This reapplies ed4718eccb12bd42214ca4fb17d196d49561c0c7, which was reverted because it was causing a miscompile. The bug that was causing the miscompile has been fixed in 75805dce5ff874676f3559c069fcd6737838f5c0. Original commit message: Background: This fixes a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.attachedcall" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if claimRV is attached to the call since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since the ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if retainRV is attached to the call and does nothing if claimRV is attached to it. - SCCP refrains from replacing the return value of a call with a constant value if the call has the operand bundle. This ensures the call always has at least one user (the call to @llvm.objc.clang.arc.noop.use). - This patch also fixes a bug in replaceUsesOfNonProtoConstant where multiple operand bundles of the same kind were being added to a call. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-03-04 11:22:30 -08:00
Stephen Tozer	977ffc2c60	Revert "[DebugInfo] Add new instruction and DIExpression operator for variadic debug values" This reverts commit d07f106f4a48b6e941266525b6f7177834d7b74e.	2021-03-04 11:59:21 +00:00
gbtozers	7cf2776667	[DebugInfo] Add new instruction and DIExpression operator for variadic debug values This patch adds a new instruction that can represent variadic debug values, DBG_VALUE_VAR. This patch alone covers the addition of the instruction and a set of basic code changes in MachineInstr and a few adjacent areas, but does not correctly handle variadic debug values outside of these areas, nor does it generate them at any point. The new instruction is similar to the existing DBG_VALUE instruction, with the following differences: the operands are in a different order, any number of values may be used in the instruction following the Variable and Expression operands (these are referred to in code as “debug operands”) and are indexed from 0 so that getDebugOperand(X) == getOperand(X+2), and the Expression in a DBG_VALUE_VAR must use the DW_OP_LLVM_arg operator to pass arguments into the expression. The new DW_OP_LLVM_arg operator is only valid in expressions appearing in a DBG_VALUE_VAR; it takes a single argument and pushes the debug operand at the index given by the argument onto the Expression stack. For example the sub-expression `DW_OP_LLVM_arg, 0` has the meaning “Push the debug operand at index 0 onto the expression stack.” Differential Revision: https://reviews.llvm.org/D82363	2021-03-04 11:45:35 +00:00
Andrew Savonichev	064cc1a22c	[MCA] Add support for in-order CPUs This patch adds a pipeline to support in-order CPUs such as ARM Cortex-A55. In-order pipeline implements a simplified version of Dispatch, Scheduler and Execute stages as a single stage. Entry and Retire stages are common for both in-order and out-of-order pipelines. Differential Revision: https://reviews.llvm.org/D94928	2021-03-04 14:08:19 +03:00
James Henderson	6bdae7560c	[llvm-objcopy][llvm-strip] Improve --discard-all documentation and help The help text and documentation for the --discard-all option failed to mention that the option also causes the removal of debug sections. This change fixes both for both llvm-objcopy and llvm-strip. Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D97662	2021-03-04 10:25:35 +00:00
Juneyoung Lee	4267f32aaf	[LangRef] remove links to lifetime since use marker intro already has a link	2021-03-04 17:19:23 +09:00
Juneyoung Lee	c2621fae37	[LangRef] fix more undefined label errors	2021-03-04 17:09:03 +09:00
Johannes Doerfert	97088b4db9	[Docs] Remove `no-aa` from the alias analysis documentation The `no-aa` pass has been removed with 7b560d40bddf. Differential Revision: https://reviews.llvm.org/D95416	2021-03-04 00:35:52 -06:00
Wang, Pengfei	2e2e287013	Add Windows ehcont section support (/guard:ehcont). Add option /guard:ehcont Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D96709	2021-03-04 11:47:29 +08:00
Juneyoung Lee	09d28f9368	[LangRef] fix undefined label	2021-03-04 10:12:57 +09:00
Juneyoung Lee	f861c629c1	[LangRef] Make lifetime intrinsic's semantics consistent with StackColoring's comment This patch is an update to LangRef by describing lifetime intrinsics' behavior by following the description of MIR's LIFETIME_START/LIFETIME_END markers at StackColoring.cpp (`eb44682d67/llvm/lib/CodeGen/StackColoring.cpp (L163)`) and the discussion in llvm-dev. In order to explicitly define the meaning of an object lifetime, I added 'Object Lifetime' subsection. Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D94002	2021-03-04 09:58:06 +09:00
Xun Li	a7cf9dd738	[LICM][Coroutine] Don't sink stores from loops with coro.suspend instructions See pr46990(https://bugs.llvm.org/show_bug.cgi?id=46990). LICM should not sink store instructions to loop exit blocks which cross coro.suspend intrinsics. This breaks semantic of coro.suspend intrinsic which return to caller directly. Also this leads to use-after-free if the coroutine is freed before control returns to the caller in multithread environment. This patch disable promotion by check whether loop contains coro.suspend intrinsics. This is a resubmit of D86190. Disabling LICM for loops with coroutine suspension is a better option not only for correctness purpose but also for performance purpose. In most cases LICM sinks memory operations. In the case of coroutine, sinking memory operation out of the loop does not improve performance since coroutien needs to get data from the frame anyway. In fact LICM would hurt coroutine performance since it adds more entries to the frame. Differential Revision: https://reviews.llvm.org/D96928	2021-03-03 15:21:57 -08:00
Hans Wennborg	c2ea8c4219	Revert "[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR" This caused miscompiles of Chromium tests for iOS due clobbering of live registers. See discussion on the code review for details. > Background: > > This fixes a longstanding problem where llvm breaks ARC's autorelease > optimization (see the link below) by separating calls from the marker > instructions or retainRV/claimRV calls. The backend changes are in > https://reviews.llvm.org/D92569. > > https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue > > What this patch does to fix the problem: > > - The front-end adds operand bundle "clang.arc.attachedcall" to calls, > which indicates the call is implicitly followed by a marker > instruction and an implicit retainRV/claimRV call that consumes the > call result. In addition, it emits a call to > @llvm.objc.clang.arc.noop.use, which consumes the call result, to > prevent the middle-end passes from changing the return type of the > called function. This is currently done only when the target is arm64 > and the optimization level is higher than -O0. > > - ARC optimizer temporarily emits retainRV/claimRV calls after the calls > with the operand bundle in the IR and removes the inserted calls after > processing the function. > > - ARC contract pass emits retainRV/claimRV calls after the call with the > operand bundle. It doesn't remove the operand bundle on the call since > the backend needs it to emit the marker instruction. The retainRV and > claimRV calls are emitted late in the pipeline to prevent optimization > passes from transforming the IR in a way that makes it harder for the > ARC middle-end passes to figure out the def-use relationship between > the call and the retainRV/claimRV calls (which is the cause of > PR31925). > > - The function inliner removes an autoreleaseRV call in the callee if > nothing in the callee prevents it from being paired up with the > retainRV/claimRV call in the caller. It then inserts a release call if > claimRV is attached to the call since autoreleaseRV+claimRV is > equivalent to a release. If it cannot find an autoreleaseRV call, it > tries to transfer the operand bundle to a function call in the callee. > This is important since the ARC optimizer can remove the autoreleaseRV > returning the callee result, which makes it impossible to pair it up > with the retainRV/claimRV call in the caller. If that fails, it simply > emits a retain call in the IR if retainRV is attached to the call and > does nothing if claimRV is attached to it. > > - SCCP refrains from replacing the return value of a call with a > constant value if the call has the operand bundle. This ensures the > call always has at least one user (the call to > @llvm.objc.clang.arc.noop.use). > > - This patch also fixes a bug in replaceUsesOfNonProtoConstant where > multiple operand bundles of the same kind were being added to a call. > > Future work: > > - Use the operand bundle on x86-64. > > - Fix the auto upgrader to convert call+retainRV/claimRV pairs into > calls with the operand bundles. > > rdar://71443534 > > Differential Revision: https://reviews.llvm.org/D92808 This reverts commit ed4718eccb12bd42214ca4fb17d196d49561c0c7.	2021-03-03 15:51:40 +01:00
Stefan Gränitz	e9a5668ec5	[docs][JITLink] Fix a typo (NFC)	2021-03-02 15:07:36 +01:00
Tony Tye	e33a5d6364	[NFC][AMDGPU] Document the AMDGPU target feature defaults Document the default for the XNACK and SRAMECC target features for code object V2-V3 and V4. Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D97598	2021-02-27 18:28:15 +00:00
Kazu Hirata	9d2bb4e874	[llvm] Fix typos in documentation (NFC)	2021-02-27 10:09:23 -08:00
Arthur Eubanks	33f6c8a3f1	[docs] Add documentation on using the new pass manager And clarify in the "writing a pass" docs that both the legacy and new PMs are being used for the codegen/optimization pipelines. Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D97515	2021-02-26 15:28:19 -08:00
Stefan Gränitz	66b9e5f6f9	[docs][JITLink] Few typo fixes in JITLink design/API doc	2021-02-26 12:56:42 +01:00
Nico Weber	d8f7d5b6a7	[arm builtin crosscompile docs] add COMPILER_RT_BUILD_MEMPROF=OFF Reported by artok on irc, thanks!	2021-02-25 10:44:52 -05:00
Nico Weber	6de0baf5a6	[arm builtin crosscompile docs] alphabetize flags, no behavior change	2021-02-25 10:44:16 -05:00

1 2 3 4 5 ...

8735 Commits