llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Sergey Dmitriev	0e229fe66b	Revert "[llvm-link] use file magic when deciding if input should be loaded as archive" This reverts commit 55f8c2fdfbc5eda1be946e97ecffa2dea44a883e.	2020-12-02 16:53:57 -08:00
Xun Li	5243362b90	Small improvements to Intrinsic::getName While I was adding a new intrinsic instruction (not overloaded), I accidentally used CreateUnaryIntrinsic to create the intrinsics, which turns out to be passing the type list to getName, and ended up naming the intrinsics function with type suffix, which leads to weird bugs later on. It took me a long time to debug. It seems a good idea to add an assertion in getName so that it fails if types are passed but it's not an overloaded function. Also, the overloaded version of getName is less efficient because it creates an std::string. We should avoid calling it if we know that there are no types provided. Differential Revision: https://reviews.llvm.org/D92523	2020-12-02 16:49:12 -08:00
Sergey Dmitriev	f1419e8887	[llvm-link] use file magic when deciding if input should be loaded as archive llvm-link should not rely on the '.a' file extension when deciding if input file should be loaded as archive. Archives may have other extensions (f.e. .lib) or no extensions at all. This patch changes llvm-link to use llvm::file_magic to check if input file is an archive. Reviewed By: RaviNarayanaswamy Differential Revision: https://reviews.llvm.org/D92376	2020-12-02 16:29:41 -08:00
Duncan P. N. Exon Smith	9f6a612ccc	ADT: Rely on std::aligned_union_t for math in AlignedCharArrayUnion, NFC Instead of computing the alignment and size of the `char` buffer in `AlignedCharArrayUnion`, rely on the math in `std::aligned_union_t`. Because some users of this rely on the `buffer` field existing with a type convertible to `char *`, we can't change the field type, but we can still avoid duplicating the logic. A potential follow up would be to delete `AlignedCharArrayUnion` after updating its users to use `std::aligned_union_t` directly; or if we like our template parameters better, could update users to stop peeking inside and then replace the definition with: ``` template <class T, class... Ts> using AlignedCharArrayUnion = std::aligned_union_t<1, T, Ts...>; ``` Differential Revision: https://reviews.llvm.org/D92500	2020-12-02 15:56:12 -08:00
Mircea Trofin	611de20466	[NFC][MC] TargetRegisterInfo::getSubReg is a MCRegister. Typing the API appropriately. Differential Revision: https://reviews.llvm.org/D92341	2020-12-02 15:46:38 -08:00
Duncan P. N. Exon Smith	402de92d98	ADT: Remove redundant `alignas` from IntervalMap, NFC `AlignedArrayCharUnion` is now using `alignas`, which is properly supported now by all the host toolchains we support. As a result, the extra `alignas` on `IntervalMap` isn't needed anymore. This is effectively a revert of 379daa29744cd96b0a87ed0d4a010fa4bc47ce73. Differential Revision: https://reviews.llvm.org/D92509	2020-12-02 14:33:20 -08:00
Reid Kleckner	7c87aeebfe	Revert "Use std::is_trivially_copyable", breaks MSVC build Revert "Delete llvm::is_trivially_copyable and CMake variable HAVE_STD_IS_TRIVIALLY_COPYABLE" This reverts commit 4d4bd40b578d77b8c5bc349ded405fb58c333c78. This reverts commit 557b00e0afb2dc1776f50948094ca8cc62d97be4.	2020-12-02 14:30:46 -08:00
Florian Hahn	fad4c5768d	[ConstraintElimination] Make sure arguments of std::pow match. This should fix a build failure on some systems, e.g. solaris11-sparcv9 http://lab.llvm.org:8014/#/builders/22	2020-12-02 22:23:26 +00:00
Harald van Dijk	cca089bd44	[X86] Add TLS_(base_)addrX32 for X32 mode LLVM has TLS_(base_)addr32 for 32-bit TLS addresses in 32-bit mode, and TLS_(base_)addr64 for 64-bit TLS addresses in 64-bit mode. x32 mode wants 32-bit TLS addresses in 64-bit mode, which were not yet handled. This adds TLS_(base_)addrX32 as copies of TLS_(base_)addr64, except that they use tls32(base)addr rather than tls64(base)addr, and then restricts TLS_(base_)addr64 to 64-bit LP64 mode, TLS_(base_)addrX32 to 64-bit ILP32 mode. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D92346	2020-12-02 22:20:36 +00:00
H.J. Lu	ced4c140a0	Use PC-relative address for x32 TLS address Since x32 supports PC-relative address, it shouldn't use EBX for TLS address. Instead of checking N.getValueType(), we should check Subtarget->is32Bit(). This fixes PR 22676. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D16474	2020-12-02 22:20:36 +00:00
LLVM GN Syncbot	eec32dd187	[gn build] Port 24d4291ca70	2020-12-02 21:52:41 +00:00
Hongtao Yu	4cefe8e200	[CSSPGO] Pseudo probes for function calls. An indirect call site needs to be probed for its potential call targets. With CSSPGO a direct call also needs a probe so that a calling context can be represented by a stack of callsite probes. Unlike pseudo probes for basic blocks that are in form of standalone intrinsic call instructions, pseudo probes for callsites have to be attached to the call instruction, thus a separate instruction would not work. One possible way of attaching a probe to a call instruction is to use a special metadata that carries information about the probe. The special metadata will have to make its way through the optimization pipeline down to object emission. This requires additional efforts to maintain the metadata in various places. Given that the `!dbg` metadata is a first-class metadata and has all essential support in place , leveraging the `!dbg` metadata as a channel to encode pseudo probe information is probably the easiest solution. With the requirement of not inflating `!dbg` metadata that is allocated for almost every instruction, we found that the 32-bit DWARF discriminator field which mainly serves AutoFDO can be reused for pseudo probes. DWARF discriminators distinguish identical source locations between instructions and with pseudo probes such support is not required. In this change we are using the discriminator field to encode the ID and type of a callsite probe and the encoded value will be unpacked and consumed right before object emission. When a callsite is inlined, the callsite discriminator field will go with the inlined instructions. The `!dbg` metadata of an inlined instruction is in form of a scope stack. The top of the stack is the instruction's original `!dbg` metadata and the bottom of the stack is for the original callsite of the top-level inliner. 
Except for the top of the stack, all other elements of the stack actually refer to the nested inlined callsites whose discriminator field (which actually represents a callsite probe) can be used together to represent the inline context of an inlined PseudoProbeInst or CallInst. To avoid collision with the baseline AutoFDO in various places that handles dwarf discriminators where a check against the `-pseudo-probe-for-profiling` switch is not available, a special encoding scheme is used to tell apart a pseudo probe discriminator from a regular discriminator. For the regular discriminator, if all lowest 3 bits are non-zero, it means the discriminator is basically empty and all higher 29 bits can be reserved for pseudo probe use. Callsite pseudo probes are inserted in `SampleProfileProbePass` and a target-independent MIR pass `PseudoProbeInserter` is added to unpack the probe ID/type from `!dbg`. Note that with this work the switch -debug-info-for-profiling will not work with -pseudo-probe-for-profiling anymore. They cannot be used at the same time. Reviewed By: wmi Differential Revision: https://reviews.llvm.org/D91756	2020-12-02 13:45:20 -08:00
Jianzhou Zhao	1191fc6062	[dfsan] Rename CachedCombinedShadow to be CachedShadow At D92261, this type will be used to cache both combined shadow and converted shadow values. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D92458	2020-12-02 21:39:16 +00:00
Jianzhou Zhao	38f4d125f9	[dfsan] Test loading global ptrs This covers a branch in the loadShadow method. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D92460	2020-12-02 21:35:41 +00:00
Jianzhou Zhao	7ce5dd535d	[dfsan] Add a test case for phi	2020-12-02 21:29:44 +00:00
Fangrui Song	355a5fca7d	[ThinLTO][test] Fix X86/nossp.ll after D91816	2020-12-02 13:13:58 -08:00
jasonliu	59a39cf74c	[XCOFF][AIX] Alternative path in EHStreamer for platforms that do not have uleb128 support Summary: Not all system assemblers support the `.uleb128 label2 - label1` form. When the target does not support this form, we have to take an alternative manual calculation to get the offsets from them. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D92058	2020-12-02 20:03:15 +00:00
Nick Desaulniers	5d27c8ae50	[Inline] prevent inlining on stack protector mismatch It's common for code that manipulates the stack via inline assembly or that has to set up its own stack canary (such as the Linux kernel) to want to avoid stack protectors in certain functions. In this case, we've been bitten by numerous bugs where a callee with a stack protector is inlined into an __attribute__((no_stack_protector)) caller, which generally breaks the caller's assumptions about not having a stack protector. LTO exacerbates the issue. While developers can avoid this by putting all no_stack_protector functions in one translation unit together and compiling those with -fno-stack-protector, it's generally not very ergonomic or as ergonomic as a function attribute, and still doesn't work for LTO. See also: https://lore.kernel.org/linux-pm/20200915172658.1432732-1-rkir@google.com/ https://lore.kernel.org/lkml/20200918201436.2932360-30-samitolvanen@google.com/T/#u SSP attributes can be ordered by strength. Weakest to strongest, they are: ssp, sspstrong, sspreq. Callees with differing SSP attributes may be inlined into each other, and the strongest attribute will be applied to the caller. (No change) After this change: * A callee with no SSP attributes will no longer be inlined into a caller with SSP attributes. * The reverse is also true: a callee with an SSP attribute will not be inlined into a caller with no SSP attributes. * The alwaysinline attribute overrides these rules. Functions that get synthesized by the compiler may not get inlined as a result if they are not created with the same stack protector function attribute as their callers. Alternative approach to https://reviews.llvm.org/D87956. Fixes pr/47479. Signed-off-by: Nick Desaulniers <ndesaulniers@google.com> Reviewed By: rnk, MaskRay Differential Revision: https://reviews.llvm.org/D91816	2020-12-02 11:00:16 -08:00
LLVM GN Syncbot	c8016d5fbf	[gn build] Port a65d8c5d720	2020-12-02 18:50:30 +00:00
jasonliu	60c6f78bef	[XCOFF][AIX] Generate LSDA data and compact unwind section on AIX Summary: AIX uses the existing EH infrastructure in clang and llvm. The major differences would be 1. AIX does not have CFI instructions. 2. AIX uses a new personality routine, named __xlcxx_personality_v1. It doesn't use the GCC personality routine, because the interoperability is not there yet on AIX. 3. AIX does not use eh_frame sections. Instead, it would use an eh_info section (compact unwind section) to store the information about personality routine and LSDA data address. Reviewed By: daltenty, hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D91455	2020-12-02 18:42:44 +00:00
Sanjay Patel	06d5ff1803	[JumpThreading][VectorUtils] avoid infinite loop on unreachable IR https://llvm.org/PR48362 It's possible that we could stub this out sooner somewhere within JumpThreading, but I'm not sure how to do that, and then we would still have potential danger in other callers. I can't find a way to trigger this using 'instsimplify', however, because that already has a bailout on unreachable blocks.	2020-12-02 13:39:33 -05:00
Simon Pilgrim	2b85962be2	[LoopVectorize] Fix optimal-epilog-vectorization-limitations.ll test on non-debug build bots Add "REQUIRES: asserts" as the test uses the "--debug-only" switch Should fix the clang-with-thin-lto-ubuntu buildbot failure	2020-12-02 18:00:42 +00:00
Simon Pilgrim	294d591070	[Thumb2] Regenerate predicated-liveout-unknown-lanes.ll test Helps to reduce diff in D90113	2020-12-02 18:00:42 +00:00
Simon Pilgrim	42a8d060f0	[PowerPC] Regenerate cmpb tests Helps to reduce diff in D90113	2020-12-02 18:00:41 +00:00
Fangrui Song	3effbaa68f	Delete llvm::is_trivially_copyable and CMake variable HAVE_STD_IS_TRIVIALLY_COPYABLE GCC<5 did not support std::is_trivially_copyable. Now LLVM builds require 5.1 we can delete llvm::is_trivially_copyable after the users have been migrated to std::is_trivially_copyable.	2020-12-02 09:58:08 -08:00
Fangrui Song	dffdc25f75	Use std::is_trivially_copyable GCC<5 did not support std::is_trivially_copyable. Now LLVM builds require 5.1 we can migrate to std::is_trivially_copyable.	2020-12-02 09:58:07 -08:00
Simon Pilgrim	b68e123805	[X86] EltsFromConsecutiveLoads - remove old FIXME comment. NFC. It's unlikely an undef element in a zero vector will be any use.	2020-12-02 17:21:41 +00:00
Simon Pilgrim	4eb782cbfc	[LSR][X86] Replace -march with -mtriples Fixes build on gnux32 hosts	2020-12-02 17:05:15 +00:00
Simon Pilgrim	54f386f68a	[X86] combineX86ShufflesRecursively - remove old FIXME comment. NFC. It's unlikely an undef element in a zero vector will be any use, and SimplifyDemandedVectorElts now calls combineX86ShufflesRecursively so it's unlikely we actually have a dependency on these specific elements.	2020-12-02 16:29:38 +00:00
Simon Pilgrim	2f80a929a6	[X86] Regenerate 32-bit merge-consecutive-loads tests Avoid use of X32 check prefix - we try to only use that for gnux32 triple tests	2020-12-02 16:29:38 +00:00
Simon Pilgrim	ab9ea3e2e8	[X86] EltsFromConsecutiveLoads - pull out repeated NumLoadedElts. NFCI.	2020-12-02 16:29:37 +00:00
Michael Liao	48788ed811	Remove `-Wunused-result` and `-Wpedantic` warnings from GCC. NFC.	2020-12-02 10:53:59 -05:00
Bardia Mahjour	fbc2c5ae27	[LV] Epilogue Vectorization with Optimal Control Flow (Recommit) This is yet another attempt at providing support for epilogue vectorization following discussions raised in RFC http://llvm.1065342.n5.nabble.com/llvm-dev-Proposal-RFC-Epilog-loop-vectorization-tt106322.html#none and reviews D30247 and D88819. Similar to D88819, this patch achieves epilogue vectorization by executing a single vplan twice: once on the main loop and a second time on the epilogue loop (using a different VF). However, it's able to handle more loops, and generates more optimal control flow for cases where the trip count is too small to execute any code in vector form. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D89566	2020-12-02 10:09:56 -05:00
Sanjay Patel	1b9dd18234	[SLP] use 'match' for binop/select; NFC This might be a small improvement in readability, but the real motivation is to make it easier to adapt the code to deal with intrinsics like 'maxnum' and/or integer min/max. There is potentially help in doing that with D92086, but we might also just add specialized wrappers here to deal with the expected patterns.	2020-12-02 09:04:08 -05:00
Alex Zinenko	0085eeb3aa	[OpenMPIRBuilder] forward arguments as pointers to outlined function OpenMPIRBuilder::createParallel outlines the body region of the parallel construct into a new function that accepts any value previously defined outside the region as a function argument. This function is called back by OpenMP runtime function __kmpc_fork_call, which expects trailing arguments to be pointers. If the region uses a value that is not of a pointer type, e.g. a struct, the produced code would be invalid. In such cases, make createParallel emit IR that stores the value on stack and pass the pointer to the outlined function instead. The outlined function then loads the value back and uses as normal. Reviewed By: jdoerfert, llitchev Differential Revision: https://reviews.llvm.org/D92189	2020-12-02 14:59:41 +01:00
Hans Wennborg	f8c8b1a8b3	[ThinLTO] Import symver directives for imported symbols (PR48214) When importing symbols from another module, also import any corresponding symver directives. Differential revision: https://reviews.llvm.org/D92335	2020-12-02 14:56:43 +01:00
Hans Wennborg	3bed264463	Simplify append to module inline asm string in IRLinker::run() This also removes the empty extra "module asm" that would be created, and updates the test to reflect that while making it more explicit. Broken out from https://reviews.llvm.org/D92335	2020-12-02 14:56:43 +01:00
Kazushi (Jam) Marukawa	a87022c5f8	[VE] Add vand, vor, and vxor intrinsic instructions Add vand, vor, and vxor intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92454	2020-12-02 22:52:54 +09:00
Anirudh Prasad	46dd095887	[SystemZ] Adding extra extended mnemonics for SystemZ target This patch consists of the addition of some common additional extended mnemonics to the SystemZ target. - These are jnop, jct, jctg, jas, jasl, jxh, jxhg, jxle, jxleg, bru, brul, br, brl. - These mnemonics and the instructions they map to are defined here, Chapter 4 - Branching with extended mnemonic codes. - Except for jnop (which is a variant of brc 0, label), every other mnemonic is marked as a MnemonicAlias since there is already a "defined" instruction with the same encoding and/or condition mask values. - brc 0, label doesn't have a defined extended mnemonic, thus jnop is defined as an InstAlias. Furthermore, the applyMnemonicAliases function is called in the overridden parseInstruction function in SystemZAsmParser.cpp to ensure any mnemonic aliases are applied before any further processing on the instruction is done. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D92185	2020-12-02 08:25:31 -05:00
David Sherwood	6d7c7dcc2b	[SVE] Add support for scalable vectors with vectorize.scalable.enable loop attribute In this patch I have added support for a new loop hint called vectorize.scalable.enable that says whether we should enable scalable vectorization or not. If a user wants to instruct the compiler to vectorize a loop with scalable vectors they can now do this as follows: br i1 %exitcond, label %for.end, label %for.body, !llvm.loop !2 ... !2 = !{!2, !3, !4} !3 = !{!"llvm.loop.vectorize.width", i32 8} !4 = !{!"llvm.loop.vectorize.scalable.enable", i1 true} Setting the hint to false simply reverts the behaviour back to the default, using fixed width vectors. Differential Revision: https://reviews.llvm.org/D88962	2020-12-02 13:23:43 +00:00
Georgii Rymar	ae44f6b6df	[llvm-readobj, libSupport] - Refine the implementation of the code that dumps build attributes. This implementation of `ELFDumper<ELFT>::printAttributes()` in llvm-readobj has issues: 1) It crashes when the content of the attribute section is empty. 2) It uses `unwrapOrError` and `reportWarning` calls, though ideally we want to use `reportUniqueWarning`. 3) It contains a TODO about redundant format version check. `lib/Support/ELFAttributeParser.cpp` uses a hardcoded constant instead of the named constant. This patch fixes all these issues. Differential revision: https://reviews.llvm.org/D92318	2020-12-02 13:51:32 +03:00
Cullen Rhodes	1b33c95080	[InstructionsTest] NFC: Replace VectorType::get(.., .., true) with ScalableVectorType::get Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D92467	2020-12-02 10:50:05 +00:00
Jay Foad	8f6490d6da	[AMDGPU] Stop adding an implicit def of vcc_hi for wave32 This doesn't seem to be needed for anything. Differential Revision: https://reviews.llvm.org/D92400	2020-12-02 10:11:42 +00:00
Georgii Rymar	c4ac5014a0	[llvm-readelf/obj] - Lowercase the warning message reported. Our warnings/errors reported are using lowercase normally. This addresses one of review comments from D92382.	2020-12-02 13:09:47 +03:00
Georgii Rymar	b7a191f814	[llvm-readelf/obj] - Report unique warnings in `parseDynamicTable`. This makes the warnings reported to be unique and adds test cases. Differential revision: https://reviews.llvm.org/D92382	2020-12-02 12:52:42 +03:00
David Green	9ac48f4618	[Intrinsics] Re-remove experimental_vector_reduce intrinsics These were re-added by fbfb1c790982277eaa5134c2b6aa001e97fe828d but should not have been. This removes the old experimental versions of the reduction intrinsics again, leaving the new non experimental ones. Differential Revision: https://reviews.llvm.org/D92411	2020-12-02 09:22:41 +00:00
Qiu Chaofan	a779ff600d	[PowerPC] Fix FLT_ROUNDS_ on little endian In lowering of FLT_ROUNDS_, FPSCR content will be moved into FP register and then GPR, and then truncated into word. For subtargets without direct move support, it will store and then load. The load address needs adjustment (+4) only on big-endian targets. This patch fixes it on using generic opcodes on little-endian and subtargets with direct-move. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D91845	2020-12-02 17:16:32 +08:00
Georgii Rymar	6340cbbc6f	[llvm-readelf/obj] - Refine the error message about the broken string table. This: 1) Changes `reportWarning` to `reportUniqueWarning` (no-op here). 2) Adds more context to the message. 3) Merges `broken-dynsym-link.test` into `dyn-symbols.test`, adds more testing. Differential revision: https://reviews.llvm.org/D92380	2020-12-02 12:06:16 +03:00
Max Kazantsev	babe0d225d	[Test] One CodeGen test showing missing opportunity on move elimination	2020-12-02 13:16:34 +07:00
Max Kazantsev	d9df073176	[Test] One more IndVars test	2020-12-02 13:16:34 +07:00
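
Commit f1419e8887 above (reverted by 0e229fe66b) describes deciding whether an input is an archive from its file magic rather than from a '.a' extension. The sketch below illustrates that idea only; it is not the llvm-link code and does not use llvm::file_magic. It assumes the standard 8-byte ar magic strings "!<arch>\n" (regular archive) and "!<thin>\n" (thin archive). The helper names looksLikeArchive and fileLooksLikeArchive are hypothetical.

```cpp
#include <cstddef>
#include <cstring>
#include <fstream>
#include <string>

// Hypothetical helper (not an LLVM API): true if the buffer begins with the
// 8-byte magic of a regular or thin ar archive.
bool looksLikeArchive(const std::string &Bytes) {
  static const char Regular[] = "!<arch>\n";
  static const char Thin[] = "!<thin>\n";
  const std::size_t MagicLen = 8;
  if (Bytes.size() < MagicLen)
    return false;
  return std::memcmp(Bytes.data(), Regular, MagicLen) == 0 ||
         std::memcmp(Bytes.data(), Thin, MagicLen) == 0;
}

// Hypothetical helper: classify a file by its first 8 bytes, ignoring its
// extension. Unreadable or too-short files are treated as non-archives.
bool fileLooksLikeArchive(const std::string &Path) {
  std::ifstream In(Path, std::ios::binary);
  if (!In)
    return false;
  char Buf[8];
  In.read(Buf, sizeof(Buf));
  return looksLikeArchive(
      std::string(Buf, static_cast<std::size_t>(In.gcount())));
}
```

With a content check like this, a file named libfoo.lib or one with no extension at all is still recognised as an archive, which is the case the commit message calls out.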


207649 Commits