llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Johannes Doerfert	fb9c302f30	[Attributor] Add initial AAIsDead for arguments We usually will ask for liveness of an argument anyway so we ended up lazily creating the attribute anyway. However, that is not always the case and even if it is we should go the eager route here. Various tests show how this can improve the outcome. One test exposed a problem with type mismatches between argument and call site argument, a fix is included. For liveness various more tests were added as well.	2020-02-19 21:39:45 -06:00
Lang Hames	57e72ad661	[examples] Fix the SpeculativeJIT example for 85fb997659b.	2020-02-19 19:06:15 -08:00
Johannes Doerfert	1f256edf28	[Attributor] Allow multiple uses of a casted function pointer If a function pointer is casted into a different type the resulting expression can be a constant. If so, it can be used multiple times which cannot be handled by the AbstractCallSite constructor alone. Instead, we follow the cast expression uses now explicitly during the call site traversal.	2020-02-19 20:43:38 -06:00
Sourabh Singh Tomar	d56dedce6a	[DebugInfo][NFCI]: Removed an exclamation mark from error message.	2020-02-20 07:49:08 +05:30
Igor Kudrin	9e850ddd9b	[DebugInfo] Remove a misleading comment for llvm::dwarf::FDE. The comment described a linked CIE to be acquired lazily. That is not true and looks like it was never true. Differential Revision: https://reviews.llvm.org/D74761	2020-02-20 09:12:05 +07:00
Igor Kudrin	36cf70c05c	[DebugInfo] Read CIE pointer as a relocatable value. The CIE pointer field of an FDE record contains an offset to a corresponding CIE record. In object files, this value comes with relocation because the value has to be fixed when a linker combines the final section from multiple sources. In most object files there is only one CIE record at offset 0 of the .debug_frame section, so reading a relocated or a raw value makes no difference. However, in partially linked object files there are multiple CIE records and the relocations should be applied to recover the right offset value. Differential Revision: https://reviews.llvm.org/D74612	2020-02-20 09:12:05 +07:00
Nico Weber	f9df0f69e3	[gn build] (manually) partially (?) merge 7ff1f55a1219	2020-02-19 21:09:44 -05:00
Sam Clegg	96c9f4b05a	[WebAssembly] Use llvm::Optional to store optional symbol attributes. NFC. The changes the in-memory representation of wasm symbols such that their optional ImportName and ImportModule use llvm::Optional. ImportName is set whenever WASM_SYMBOL_EXPLICIT_NAME flag is set. ImportModule (for imports) is currently always set since it defaults to "env". In the future we can possibly extent to binary format distingish import which have explit module names. Tags: #llvm Differential Revision: https://reviews.llvm.org/D74109	2020-02-19 17:25:33 -08:00
Greg Clayton	76f7457508	Add an Offset field to the SourceLocation for LookupResult objects. Summary: The Offset provides the offset within the function in a SourceLocation struct. This allows us to show the byte offset within a function. We also track offsets within inline functions as well. Updated the lookup tests to verify the offset for functions and inline functions. 0x1000: main + 32 @ /tmp/main.cpp:45 Reviewers: labath, aadsm, serhiy.redko, jankratochvil, xiaobai, wallace, aprantl, JDevlieghere Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74680	2020-02-19 16:12:32 -08:00
Thomas Lively	8c7beab605	[WebAssembly] Fix memory bug introduced in 52861809994c Summary: The instruction at `DefI` can sometimes be destroyed by `rematerializeCheapDef`, so it should not be used after calling that function. The fix is to use `Insert` instead when examining additional multivalue stackifications. `Insert` is the address of the new defining instruction after all moves and rematerializations have taken place. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74875	2020-02-19 15:07:45 -08:00
LLVM GN Syncbot	1a870bddc6	[gn build] Port 85fb997659b	2020-02-19 22:58:29 +00:00
Lang Hames	896a3b9d4c	[JITLink] Fix testcase for main JITDylib rename in 85fb997659b.	2020-02-19 14:58:13 -08:00
Matt Arsenault	93f691bf87	AMDGPU: Enable integer division bypass We probably want this, and I've meant to turn this on for a long time. SC actually emits a special case to early-out for a 1 denominator, which perhaps should also be considered.	2020-02-19 17:50:19 -05:00
Matt Arsenault	5ca9fc991d	AMDGPU/GlobalISel: Remove outdated comment	2020-02-19 17:32:25 -05:00
Matt Arsenault	e4884a1a12	AMDGPU/GlobalISel: Cleanup min/max RegBankSelect tests Use common check prefix, although update_mir_test_checks makes this unnecessarily annoying. Also make sure to have uses in case that ever ends up mattering.	2020-02-19 17:32:25 -05:00
Lang Hames	117fadbabd	[ORC] Fix a missing move.	2020-02-19 14:27:31 -08:00
Lang Hames	48e17a271a	[ORC] Qualify nullptr_t.	2020-02-19 14:25:53 -08:00
Lang Hames	900dc7edc7	[ORC] Add generic initializer/deinitializer support. Initializers and deinitializers are used to implement C++ static constructors and destructors, runtime registration for some languages (e.g. with the Objective-C runtime for Objective-C/C++ code) and other tasks that would typically be performed when a shared-object/dylib is loaded or unloaded by a statically compiled program. MCJIT and ORC have historically provided limited support for discovering and running initializers/deinitializers by scanning the llvm.global_ctors and llvm.global_dtors variables and recording the functions to be run. This approach suffers from several drawbacks: (1) It only works for IR inputs, not for object files (including cached JIT'd objects). (2) It only works for initializers described by llvm.global_ctors and llvm.global_dtors, however not all initializers are described in this way (Objective-C, for example, describes initializers via specially named metadata sections). (3) To make the initializer/deinitializer functions described by llvm.global_ctors and llvm.global_dtors searchable they must be promoted to extern linkage, polluting the JIT symbol table (extra care must be taken to ensure this promotion does not result in symbol name clashes). This patch introduces several interdependent changes to ORCv2 to support the construction of new initialization schemes, and includes an implementation of a backwards-compatible llvm.global_ctor/llvm.global_dtor scanning scheme, and a MachO specific scheme that handles Objective-C runtime registration (if the Objective-C runtime is available) enabling execution of LLVM IR compiled from Objective-C and Swift. The major changes included in this patch are: (1) The MaterializationUnit and MaterializationResponsibility classes are extended to describe an optional "initializer" symbol for the module (see the getInitializerSymbol method on each class). The presence or absence of this symbol indicates whether the module contains any initializers or deinitializers. The initializer symbol otherwise behaves like any other: searching for it triggers materialization. (2) A new Platform interface is introduced in llvm/ExecutionEngine/Orc/Core.h which provides the following callback interface: - Error setupJITDylib(JITDylib &JD): Can be used to install standard symbols in JITDylibs upon creation. E.g. __dso_handle. - Error notifyAdding(JITDylib &JD, const MaterializationUnit &MU): Generally used to record initializer symbols. - Error notifyRemoving(JITDylib &JD, VModuleKey K): Used to notify a platform that a module is being removed. Platform implementations can use these callbacks to track outstanding initializers and implement a platform-specific approach for executing them. For example, the MachOPlatform installs a plugin in the JIT linker to scan for both __mod_inits sections (for C++ static constructors) and ObjC metadata sections. If discovered, these are processed in the usual platform order: Objective-C registration is carried out first, then static initializers are executed, ensuring that calls to Objective-C from static initializers will be safe. This patch updates LLJIT to use the new scheme for initialization. Two LLJIT::PlatformSupport classes are implemented: A GenericIR platform and a MachO platform. The GenericIR platform implements a modified version of the previous llvm.global-ctor scraping scheme to provide support for Windows and Linux. LLJIT's MachO platform uses the MachOPlatform class to provide MachO specific initialization as described above. Reviewers: sgraenitz, dblaikie Subscribers: mgorny, hiraditya, mgrang, ributzka, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74300	2020-02-19 13:59:32 -08:00
Stanislav Mekhanoshin	de50517235	[AMDGPU] Fix DS_WRITE_B32 patterns It uses VGPR_32.RegTypes which includes 16 bit types. As a result DS_WRITE_B32 may be generated for "store i16" which is a bug. The only reason we do not hit it now is relative patterns complexity and sorting. Should DS_WRITE_B16 pattern complexity become higher and the bug appears. Differential Revision: https://reviews.llvm.org/D74868	2020-02-19 13:42:16 -08:00
Tony	491ebe17c8	[AMDGPU] AMDGPUUsage define call convention ABI Reviewers: scott.linder, arsenm, b-sumner Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74861	2020-02-19 15:56:19 -05:00
Michael Kruse	71dc77c1eb	[IndVarSimply] Fix assert/release build difference. In builds with assertions enabled (!NDEBUG), IndVarSimplify does an additional query to ScalarEvolution which may change future SCEV queries since it fills the internal cache differently. The result is actually only used with the -verify-indvars command line option. We fix the issue by only calling SE->getBackedgeTakenCount(L) if -verify-indvars is enabled such that only -verify-indvars shows the behavior, but not debug builds themselves. Also add a remark to the description of -verify-indvars about this behavior. Fixes llvm.org/PR44815 Differential Revision: https://reviews.llvm.org/D74810	2020-02-19 14:36:22 -06:00
Tony	f119c347fa	[AMDGPU] Update AMDGPUUsage with DWARF proposal Summary: - Add AMDGPU DWARF proposal. - Add references for gfx10 ISA and SemVer. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, aprantl, dstuttard, tpr, jfb, dmgreen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70523	2020-02-19 15:30:53 -05:00
Sanjay Patel	2dc6c3489f	[x86] add test for uint->fp with unsafe-fp-math (PR43609); NFC	2020-02-19 15:18:52 -05:00
Krzysztof Parzyszek	c28d8cf19b	[Hexagon] Change HVX vector predicate types from v512/1024i1 to v64/128i1 This commit removes the artificial types <512 x i1> and <1024 x i1> from HVX intrinsics, and makes v512i1 and v1024i1 no longer legal on Hexagon. It may cause existing bitcode files to become invalid. * Converting between vector predicates and vector registers must be done explicitly via vandvrt/vandqrt instructions (their intrinsics), i.e. (for 64-byte mode): %Q = call <64 x i1> @llvm.hexagon.V6.vandvrt(<16 x i32> %V, i32 -1) %V = call <16 x i32> @llvm.hexagon.V6.vandqrt(<64 x i1> %Q, i32 -1) The conversion intrinsics are: declare <64 x i1> @llvm.hexagon.V6.vandvrt(<16 x i32>, i32) declare <128 x i1> @llvm.hexagon.V6.vandvrt.128B(<32 x i32>, i32) declare <16 x i32> @llvm.hexagon.V6.vandqrt(<64 x i1>, i32) declare <32 x i32> @llvm.hexagon.V6.vandqrt.128B(<128 x i1>, i32) They are all pure. * Vector predicate values cannot be loaded/stored directly. This directly reflects the architecture restriction. Loading and storing or vector predicates must be done indirectly via vector registers and explicit conversions via vandvrt/vandqrt instructions.	2020-02-19 14:14:56 -06:00
Nikita Popov	453ec55b92	Reapply [IRBuilder] Always respect inserter/folder Some IRBuilder methods that were originally defined on IRBuilderBase do not respect custom IRBuilder inserters/folders, because those were not accessible prior to D73835. Fix this by making use of existing (and now accessible) IRBuilder methods, which will handle inserters/folders correctly. There are some changes in OpenMP and Instrumentation tests, where bitcasts now get constant folded. I've also highlighted one InstCombine test which now finishes in two rather than three iterations, thanks to new instructions being inserted into the worklist. Differential Revision: https://reviews.llvm.org/D74787	2020-02-19 20:51:38 +01:00
Bill Wendling	c708922cd0	Include static prof data when collecting loop BBs Summary: If the programmer adds static profile data to a branch---i.e. uses "__builtin_expect()" or similar---then we should honor it. Otherwise, "__builtin_expect()" is ignored in crucial situations. So we trust that the programmer knows what they're doing until proven wrong. Subscribers: hiraditya, JDevlieghere, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74809	2020-02-19 11:33:48 -08:00
Simon Pilgrim	d794c53322	[AMDGPU] Regenerate immediate constant tests	2020-02-19 18:58:44 +00:00
Simon Pilgrim	c00ec168b5	[UpdateTestChecks] Add support for '.' in ir function names Will let us regenerate from amdgpu float constant tests	2020-02-19 18:58:44 +00:00
Louis Dionne	879ac1565d	[CMake] Only detect the linker once in AddLLVM.cmake Summary: Otherwise, the build output contains a bunch of "Linker detection: <xxx>" lines that are really redundant. We also make redundant calls to the linker, although that is a smaller concern. Reviewers: smeenai Subscribers: mgorny, fedor.sergeev, jkorous, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68648	2020-02-19 13:53:38 -05:00
Bardia Mahjour	d2570e78cd	[DDG] Data Dependence Graph - Graph Simplification Summary: This is the last functional patch affecting the representation of DDG. Here we try to simplify the DDG to reduce the number of nodes and edges by iteratively merging pairs of nodes that satisfy the following conditions, until no such pair can be identified. A pair of nodes consisting of a and b can be merged if: 1. the only edge from a is a def-use edge to b and 2. the only edge to b is a def-use edge from a and 3. there is no cyclic edge from b to a and 4. all instructions in a and b belong to the same basic block and 5. both a and b are simple (single or multi instruction) nodes. These criteria allow us to fold many uninteresting def-use edges that commonly exist in the graph while avoiding the risk of introducing dependencies that didn't exist before. Authored By: bmahjour Reviewer: Meinersbur, fhahn, myhsu, xtian, dmgreen, kbarton, jdoerfert Reviewed By: Meinersbur Subscribers: ychen, arphaman, simoll, a.elovikov, mgorny, hiraditya, jfb, wuzish, llvm-commits, jsji, Whitney, etiotto, ppc-slack Tags: #llvm Differential Revision: https://reviews.llvm.org/D72350	2020-02-19 13:41:51 -05:00
Florian Hahn	05986e5408	Revert "[PatternMatch] Match XOR variant of unsigned-add overflow check." This reverts commit e01a3d49c224d6f8a7afc01205b05b9deaa07afa. and commit a6a585b8030b6e8d4c50c71f54a6addb21995fe0. This causes a failure on GreenDragon: http://lab.llvm.org:8080/green/view/LLDB/job/lldb-cmake/9597	2020-02-19 19:37:08 +01:00
Tyker	797dfd61ae	[AssumeBundle] Add documentation for the operand bundles of an llvm.assume Summary: Operand bundles on an llvm.assume allows representing assumptions that an attribute holds for a certain value at a certain position. Operand bundles enable assumptions that are either hard or impossible to represent as a boolean argument of an llvm.assume. Reviewers: jdoerfert, fhahn, nlopes, reames, regehr, efriedma Reviewed By: jdoerfert Subscribers: lebedev.ri, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74209	2020-02-19 18:53:15 +01:00
Jonas Paulsson	9dcacebd4c	[ValueTracking] Improve isKnownNonNaN() to recognize zero splats. isKnownNonNaN() could not recognize a zero splat because that is a ConstantAggregateZero which is-a ConstantData but not a ConstantDataVector. Patch makes a ConstantAggregateZero return true. Review: Thomas Lively Differential Revision: https://reviews.llvm.org/D74263	2020-02-19 09:35:36 -08:00
Nico Weber	b49202c11d	[gn build] use \bfoo\b instead of \<foo\> in sync script \<foo\> is more correct, but since we use shell=True on Windows, the < and > get interpreted as redirection operators. Rather than adding cmd escaping, just use \bfoo\b, which is Good Enough Often Enough.	2020-02-19 12:32:02 -05:00
LLVM GN Syncbot	1580d3f4b6	[gn build] Port a54d81f5979	2020-02-19 17:28:29 +00:00
Nico Weber	83f9342471	[gn build] Set up include_dirs for a54d81f597 (first checker in a subdir)	2020-02-19 12:24:01 -05:00
Craig Topper	d3629d756d	[X86] Add DCI.isBeforeLegalize() check to the v64i1 constant splitting code in combineStore. We only need to split after type legalization. If we're before we can just use a wide store and type legalization will split it. Add a v128i1 test to exercise it post type legalization.	2020-02-19 09:18:16 -08:00
Stanislav Mekhanoshin	62c0ed15d8	[AMDGPU] Fix assumption about LaneBitmask content Yet another assumption about an actual LaneBitmask content is fixed. Differential Revision: https://reviews.llvm.org/D74805	2020-02-19 09:07:11 -08:00
Nikita Popov	ad15d536fe	Revert "[IRBuilder] Always respect inserter/folder" This reverts commit f12fb2d99b8dd0dbef1c79f1d401200150f2d0bd. I missed some changes in instrumentation test cases.	2020-02-19 17:51:55 +01:00
Nikita Popov	b55118d592	[InstCombine] Fix removal from deferred instructions Make sure we don't skip the Deferred.remove() call if the instruction is not in the worklist. Both of those are separate. We don't have any cases where deferred instructions get removed right now, but may cause problems in the future.	2020-02-19 17:48:28 +01:00
Nikita Popov	e70b7afc11	[IRBuilder] Always respect inserter/folder Some IRBuilder methods that were originally defined on IRBuilderBase do not respect custom IRBuilder inserters/folders, because those were not accessible prior to D73835. Fix this by making use of existing (and now accessible) IRBuilder methods, which will handle inserters/folders correctly. There are some changes in OpenMP tests, where bitcasts now get constant folded. I've also highlighted one InstCombine test which now finishes in two rather than three iterations, thanks to new instructions being inserted into the worklist. Differential Revision: https://reviews.llvm.org/D74787	2020-02-19 17:44:43 +01:00
Simon Pilgrim	2a62d779bd	[SystemZ] Regenerate risbg tests. NFCI. Pre-commit for some upcoming SimplifyDemandedBits bitrotate handling.	2020-02-19 16:39:28 +00:00
Mikhail Maltsev	bf9a57c53c	[ARM,MVE] Fix predicate types of some intrinsics Summary: Some predicated MVE intrinsics return a vector with element size different from the input vector element size. In this case the predicate must type correspond to the output vector type. The following intrinsics use the incorrect predicate type: * llvm.arm.mve.mull.int.predicated * llvm.arm.mve.mull.poly.predicated * llvm.arm.mve.vshll.imm.predicated This patch fixes the issue. Reviewers: simon_tatham, dmgreen, ostannard, MarkMurrayARM Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74838	2020-02-19 16:24:54 +00:00
Cameron McInally	65ad000134	[AArch64][SVE] Add initial backend support for FP splat_vector Differential Revision: https://reviews.llvm.org/D74632	2020-02-19 10:19:11 -06:00
Nico Weber	d3b8bade8b	[gn build] revert e8e078c8bf7987 Now that I've updated ancient goma clients on the bots, this should work. (Internal goma bug: b/139410332, fixed months ago.)	2020-02-19 11:11:25 -05:00
Jay Foad	92cac194bd	[AMDGPU][ConstantFolding] Fold llvm.amdgcn.fmul.legacy intrinsic Reviewers: arsenm, rampitec, nhaehnle Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74835	2020-02-19 16:01:30 +00:00
Stefan Pintilie	6d0e277aa7	[Hexagon][NFC] Rename VK_Hexagon_PCREL to VK_PCREL On PowerPC we will soon need to use pcrel to indicate PC Relative addressing. Renamed the Hexagon specific variant kind to a non target specific VK so that it can be used on both Hexagon and PowerPC. Differential Revision: https://reviews.llvm.org/D74788	2020-02-19 09:52:58 -06:00
Krzysztof Parzyszek	f9d00e5265	Add <128 x i1> as an intrinsic type	2020-02-19 09:38:13 -06:00
Florian Hahn	0c4f1f800d	[CGP] Adjust CodeGen tests after e01a3d49c22	2020-02-19 16:05:00 +01:00
Florian Hahn	aaa658ee7c	[PatternMatch] Match XOR variant of unsigned-add overflow check. Instcombine folds (a + b <u a) to (a ^ -1 <u b) and that does not match the expected pattern in CodeGenPerpare via UAddWithOverflow. This causes a regression over Clang 7 on both X86 and AArch64: https://gcc.godbolt.org/z/juhXYV This patch extends UAddWithOverflow to also catch the XOR case, if the XOR is only used in the ICMP. This covers just a single case, but I'd like to make sure I am not missing anything before tackling the other cases. Reviewers: nikic, RKSimon, lebedev.ri, spatel Reviewed By: nikic, lebedev.ri Differential Revision: https://reviews.llvm.org/D74228	2020-02-19 15:25:18 +01:00

... 3 4 5 6 7 ...

192419 Commits