llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Sanjay Patel	6cb1ff7fe4	[Utils][x86] add an option to reduce scrubbing of shuffles with memops I was drafting a patch that would increase broadcast load usage, but our shuffle scrubbing makes it impossible to see if the memory operand offset was getting created correctly. I'm proposing to make that an option (defaulted to 'off' for now to reduce regression test churn). The updated files provide examples of tests where we can now verify that the pointer offset for a loaded memory operand is correct. We still have stack and constant scrubbing that can obscure the operand even if we don't scrub the entire instruction. Differential Revision: https://reviews.llvm.org/D74775	2020-02-20 09:33:05 -05:00
Sebastian Neubauer	8b6d8bc210	[AMDGPU] Don’t marke the .note section as ALLOC Marking a section as ALLOC tells the ELF loader to load the section into memory. As we do not want to load the notes into VRAM, the flag should not be there. Differential Revision: https://reviews.llvm.org/D74600	2020-02-20 15:14:48 +01:00
Simon Pilgrim	8741b93e56	Regenerate rotate test. NFC.	2020-02-20 13:54:43 +00:00
Djordje Todorovic	e67d8322ba	Revert "Reland "[DebugInfo] Enable the debug entry values feature by default"" This reverts commit rGfaff707db82d. A failure found on an ARM 2-stage buildbot. The investigation is needed.	2020-02-20 14:41:39 +01:00
Andrzej Warzynski	ccc0dc6d63	[AArch64][SVE] Re-arrange definitions in AArch64SVEInstrInfo.td (NFC) Re-arrange definitions related to loads and stores so that they are grouped together. This patch implements only non-functional changes.	2020-02-20 12:41:16 +00:00
Simon Pilgrim	84096bbb77	[AMDGPU] simplifyI24 - replace GetDemandedBits with SimplifyMultipleUseDemandedBits GetDemandedBits mostly just calls SimplifyMultipleUseDemandedBits now, but it does a very blunt constant simplification that SimplifyMultipleUseDemandedBits avoids. If we need to demand bits from constants we should handle this through ShrinkDemandedConstant/targetShrinkDemandedConstant. @arsenm confirmed that the sign extended immediates are better for code size. Differential Revision: https://reviews.llvm.org/D74857	2020-02-20 12:03:08 +00:00
dfukalov	e81c6d4c67	SpeculativeExecution: fixed ingoring free execution Summary: After updating cost model in AMDGPU target (47a5c36b37f0) the pass started to ignore some BBs since they got all instructions estimated as free. Reviewers: arsenm, chandlerc, nhaehnle Reviewed By: nhaehnle Subscribers: jvesely, wdng, nhaehnle, tpr, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74825	2020-02-20 14:45:02 +03:00
Mikhail Maltsev	29b9c00369	[ARM,MVE] Add vqdmull[b,t]q intrinsic families Summary: This patch adds two families of ACLE intrinsics: vqdmullbq and vqdmulltq (including vector-vector and vector-scalar variants) and the corresponding LLVM IR intrinsics llvm.arm.mve.vqdmull and llvm.arm.mve.vqdmull.predicated. Reviewers: simon_tatham, MarkMurrayARM, dmgreen, ostannard Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74845	2020-02-20 10:51:19 +00:00
serge-sans-paille	c4fa124875	[NFC] Remove ar/ranlib test noise during cmake step At least on RHEL, ar outputs on stderr a message similar to .../bin/ar: creating t.a Which creates noise during the cmake step. Get rid of it.	2020-02-20 11:26:26 +01:00
Johannes Doerfert	8fcd2f1bf2	[Attributor] Make sure abstract attributes are properly initialized	2020-02-20 02:46:40 -06:00
Johannes Doerfert	dec8fd7748	[Attributor][NFC] Refactor interface	2020-02-20 02:46:40 -06:00
Johannes Doerfert	2d214fefe5	[Attributor][NFC] Prepare some tests to be used with update test script	2020-02-20 02:44:05 -06:00
serge-sans-paille	cbe51c16a5	Fix compiler extension in standalone mode Use a dedicated cmake file to store the extension configured within LLVM. That way, a standalone build of clang can load this cmake file and get all the configured standalone extensions. This patch is related to https://reviews.llvm.org/D74602 Differential Revision: https://reviews.llvm.org/D74757	2020-02-20 07:19:04 +01:00
Hideto Ueno	26e1302f3c	[MustExecute] Add backward exploration for must-be-executed-context Summary: As mentioned in D71974, it is useful for must-be-executed-context to explore CFG backwardly. This patch is ported from parts of D64975. We use a dominator tree to find the previous context if a dominator tree is available. Reviewers: jdoerfert, hfinkel, baziotis, sstefan1 Reviewed By: jdoerfert Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74817	2020-02-20 14:49:30 +09:00
Johannes Doerfert	735b538753	[Attributor][NFC] Improve the debug output & add a TODO	2020-02-19 23:46:08 -06:00
Johannes Doerfert	2fca20aa6c	[Attributor][NFC] Add more memory_location tests	2020-02-19 23:46:08 -06:00
Johannes Doerfert	e7cf000a53	[Attributor] Use existing `returned` information better We can look through calls with `returned` argument attributes when we collect subsuming positions. This allows us to get existing attributes from more places.	2020-02-19 23:46:07 -06:00
Johannes Doerfert	2d3bc0f805	[Attributor][FIX] Avoid setting wrong load/store alignments	2020-02-19 23:46:07 -06:00
Matt Arsenault	3e62f38540	TableGen: Fix logic for default operands This was checking for default operands in the current DAG instruction, rather than the correct result operand list. I'm not entirly sure how this managed to work before, but was failing for me when multiple default operands were overridden.	2020-02-19 23:41:07 -05:00
Johannes Doerfert	c91da0566b	[Attributor] Generalize `getAssumedConstantInt` interface We are often interested in an assumed constant and sometimes it has to be an integer constant. Before we only looked for the latter, now we can ask for either.	2020-02-19 22:33:51 -06:00
Johannes Doerfert	2d32597cfc	[Attributor][FIX] Do not create new calls edge we cannot handle If we propagate function pointers across function boundaries we can create new call edges. These need to be represented in the CG if we run as a CGSCC pass. In the new pass manager that is currently not handled by the CallGraphUpdater so we need to prevent the situation for now.	2020-02-19 22:33:51 -06:00
Johannes Doerfert	fb9c302f30	[Attributor] Add initial AAIsDead for arguments We usually will ask for liveness of an argument anyway so we ended up lazily creating the attribute anyway. However, that is not always the case and even if it is we should go the eager route here. Various tests show how this can improve the outcome. One test exposed a problem with type mismatches between argument and call site argument, a fix is included. For liveness various more tests were added as well.	2020-02-19 21:39:45 -06:00
Lang Hames	57e72ad661	[examples] Fix the SpeculativeJIT example for 85fb997659b.	2020-02-19 19:06:15 -08:00
Johannes Doerfert	1f256edf28	[Attributor] Allow multiple uses of a casted function pointer If a function pointer is casted into a different type the resulting expression can be a constant. If so, it can be used multiple times which cannot be handled by the AbstractCallSite constructor alone. Instead, we follow the cast expression uses now explicitly during the call site traversal.	2020-02-19 20:43:38 -06:00
Sourabh Singh Tomar	d56dedce6a	[DebugInfo][NFCI]: Removed an exclamation mark from error message.	2020-02-20 07:49:08 +05:30
Igor Kudrin	9e850ddd9b	[DebugInfo] Remove a misleading comment for llvm::dwarf::FDE. The comment described a linked CIE to be acquired lazily. That is not true and looks like it was never true. Differential Revision: https://reviews.llvm.org/D74761	2020-02-20 09:12:05 +07:00
Igor Kudrin	36cf70c05c	[DebugInfo] Read CIE pointer as a relocatable value. The CIE pointer field of an FDE record contains an offset to a corresponding CIE record. In object files, this value comes with relocation because the value has to be fixed when a linker combines the final section from multiple sources. In most object files there is only one CIE record at offset 0 of the .debug_frame section, so reading a relocated or a raw value makes no difference. However, in partially linked object files there are multiple CIE records and the relocations should be applied to recover the right offset value. Differential Revision: https://reviews.llvm.org/D74612	2020-02-20 09:12:05 +07:00
Nico Weber	f9df0f69e3	[gn build] (manually) partially (?) merge 7ff1f55a1219	2020-02-19 21:09:44 -05:00
Sam Clegg	96c9f4b05a	[WebAssembly] Use llvm::Optional to store optional symbol attributes. NFC. The changes the in-memory representation of wasm symbols such that their optional ImportName and ImportModule use llvm::Optional. ImportName is set whenever WASM_SYMBOL_EXPLICIT_NAME flag is set. ImportModule (for imports) is currently always set since it defaults to "env". In the future we can possibly extent to binary format distingish import which have explit module names. Tags: #llvm Differential Revision: https://reviews.llvm.org/D74109	2020-02-19 17:25:33 -08:00
Greg Clayton	76f7457508	Add an Offset field to the SourceLocation for LookupResult objects. Summary: The Offset provides the offset within the function in a SourceLocation struct. This allows us to show the byte offset within a function. We also track offsets within inline functions as well. Updated the lookup tests to verify the offset for functions and inline functions. 0x1000: main + 32 @ /tmp/main.cpp:45 Reviewers: labath, aadsm, serhiy.redko, jankratochvil, xiaobai, wallace, aprantl, JDevlieghere Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74680	2020-02-19 16:12:32 -08:00
Thomas Lively	8c7beab605	[WebAssembly] Fix memory bug introduced in 52861809994c Summary: The instruction at `DefI` can sometimes be destroyed by `rematerializeCheapDef`, so it should not be used after calling that function. The fix is to use `Insert` instead when examining additional multivalue stackifications. `Insert` is the address of the new defining instruction after all moves and rematerializations have taken place. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74875	2020-02-19 15:07:45 -08:00
LLVM GN Syncbot	1a870bddc6	[gn build] Port 85fb997659b	2020-02-19 22:58:29 +00:00
Lang Hames	896a3b9d4c	[JITLink] Fix testcase for main JITDylib rename in 85fb997659b.	2020-02-19 14:58:13 -08:00
Matt Arsenault	93f691bf87	AMDGPU: Enable integer division bypass We probably want this, and I've meant to turn this on for a long time. SC actually emits a special case to early-out for a 1 denominator, which perhaps should also be considered.	2020-02-19 17:50:19 -05:00
Matt Arsenault	5ca9fc991d	AMDGPU/GlobalISel: Remove outdated comment	2020-02-19 17:32:25 -05:00
Matt Arsenault	e4884a1a12	AMDGPU/GlobalISel: Cleanup min/max RegBankSelect tests Use common check prefix, although update_mir_test_checks makes this unnecessarily annoying. Also make sure to have uses in case that ever ends up mattering.	2020-02-19 17:32:25 -05:00
Lang Hames	117fadbabd	[ORC] Fix a missing move.	2020-02-19 14:27:31 -08:00
Lang Hames	48e17a271a	[ORC] Qualify nullptr_t.	2020-02-19 14:25:53 -08:00
Lang Hames	900dc7edc7	[ORC] Add generic initializer/deinitializer support. Initializers and deinitializers are used to implement C++ static constructors and destructors, runtime registration for some languages (e.g. with the Objective-C runtime for Objective-C/C++ code) and other tasks that would typically be performed when a shared-object/dylib is loaded or unloaded by a statically compiled program. MCJIT and ORC have historically provided limited support for discovering and running initializers/deinitializers by scanning the llvm.global_ctors and llvm.global_dtors variables and recording the functions to be run. This approach suffers from several drawbacks: (1) It only works for IR inputs, not for object files (including cached JIT'd objects). (2) It only works for initializers described by llvm.global_ctors and llvm.global_dtors, however not all initializers are described in this way (Objective-C, for example, describes initializers via specially named metadata sections). (3) To make the initializer/deinitializer functions described by llvm.global_ctors and llvm.global_dtors searchable they must be promoted to extern linkage, polluting the JIT symbol table (extra care must be taken to ensure this promotion does not result in symbol name clashes). This patch introduces several interdependent changes to ORCv2 to support the construction of new initialization schemes, and includes an implementation of a backwards-compatible llvm.global_ctor/llvm.global_dtor scanning scheme, and a MachO specific scheme that handles Objective-C runtime registration (if the Objective-C runtime is available) enabling execution of LLVM IR compiled from Objective-C and Swift. The major changes included in this patch are: (1) The MaterializationUnit and MaterializationResponsibility classes are extended to describe an optional "initializer" symbol for the module (see the getInitializerSymbol method on each class). The presence or absence of this symbol indicates whether the module contains any initializers or deinitializers. The initializer symbol otherwise behaves like any other: searching for it triggers materialization. (2) A new Platform interface is introduced in llvm/ExecutionEngine/Orc/Core.h which provides the following callback interface: - Error setupJITDylib(JITDylib &JD): Can be used to install standard symbols in JITDylibs upon creation. E.g. __dso_handle. - Error notifyAdding(JITDylib &JD, const MaterializationUnit &MU): Generally used to record initializer symbols. - Error notifyRemoving(JITDylib &JD, VModuleKey K): Used to notify a platform that a module is being removed. Platform implementations can use these callbacks to track outstanding initializers and implement a platform-specific approach for executing them. For example, the MachOPlatform installs a plugin in the JIT linker to scan for both __mod_inits sections (for C++ static constructors) and ObjC metadata sections. If discovered, these are processed in the usual platform order: Objective-C registration is carried out first, then static initializers are executed, ensuring that calls to Objective-C from static initializers will be safe. This patch updates LLJIT to use the new scheme for initialization. Two LLJIT::PlatformSupport classes are implemented: A GenericIR platform and a MachO platform. The GenericIR platform implements a modified version of the previous llvm.global-ctor scraping scheme to provide support for Windows and Linux. LLJIT's MachO platform uses the MachOPlatform class to provide MachO specific initialization as described above. Reviewers: sgraenitz, dblaikie Subscribers: mgorny, hiraditya, mgrang, ributzka, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74300	2020-02-19 13:59:32 -08:00
Stanislav Mekhanoshin	de50517235	[AMDGPU] Fix DS_WRITE_B32 patterns It uses VGPR_32.RegTypes which includes 16 bit types. As a result DS_WRITE_B32 may be generated for "store i16" which is a bug. The only reason we do not hit it now is relative patterns complexity and sorting. Should DS_WRITE_B16 pattern complexity become higher and the bug appears. Differential Revision: https://reviews.llvm.org/D74868	2020-02-19 13:42:16 -08:00
Tony	491ebe17c8	[AMDGPU] AMDGPUUsage define call convention ABI Reviewers: scott.linder, arsenm, b-sumner Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74861	2020-02-19 15:56:19 -05:00
Michael Kruse	71dc77c1eb	[IndVarSimply] Fix assert/release build difference. In builds with assertions enabled (!NDEBUG), IndVarSimplify does an additional query to ScalarEvolution which may change future SCEV queries since it fills the internal cache differently. The result is actually only used with the -verify-indvars command line option. We fix the issue by only calling SE->getBackedgeTakenCount(L) if -verify-indvars is enabled such that only -verify-indvars shows the behavior, but not debug builds themselves. Also add a remark to the description of -verify-indvars about this behavior. Fixes llvm.org/PR44815 Differential Revision: https://reviews.llvm.org/D74810	2020-02-19 14:36:22 -06:00
Tony	f119c347fa	[AMDGPU] Update AMDGPUUsage with DWARF proposal Summary: - Add AMDGPU DWARF proposal. - Add references for gfx10 ISA and SemVer. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, aprantl, dstuttard, tpr, jfb, dmgreen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70523	2020-02-19 15:30:53 -05:00
Sanjay Patel	2dc6c3489f	[x86] add test for uint->fp with unsafe-fp-math (PR43609); NFC	2020-02-19 15:18:52 -05:00
Krzysztof Parzyszek	c28d8cf19b	[Hexagon] Change HVX vector predicate types from v512/1024i1 to v64/128i1 This commit removes the artificial types <512 x i1> and <1024 x i1> from HVX intrinsics, and makes v512i1 and v1024i1 no longer legal on Hexagon. It may cause existing bitcode files to become invalid. * Converting between vector predicates and vector registers must be done explicitly via vandvrt/vandqrt instructions (their intrinsics), i.e. (for 64-byte mode): %Q = call <64 x i1> @llvm.hexagon.V6.vandvrt(<16 x i32> %V, i32 -1) %V = call <16 x i32> @llvm.hexagon.V6.vandqrt(<64 x i1> %Q, i32 -1) The conversion intrinsics are: declare <64 x i1> @llvm.hexagon.V6.vandvrt(<16 x i32>, i32) declare <128 x i1> @llvm.hexagon.V6.vandvrt.128B(<32 x i32>, i32) declare <16 x i32> @llvm.hexagon.V6.vandqrt(<64 x i1>, i32) declare <32 x i32> @llvm.hexagon.V6.vandqrt.128B(<128 x i1>, i32) They are all pure. * Vector predicate values cannot be loaded/stored directly. This directly reflects the architecture restriction. Loading and storing or vector predicates must be done indirectly via vector registers and explicit conversions via vandvrt/vandqrt instructions.	2020-02-19 14:14:56 -06:00
Nikita Popov	453ec55b92	Reapply [IRBuilder] Always respect inserter/folder Some IRBuilder methods that were originally defined on IRBuilderBase do not respect custom IRBuilder inserters/folders, because those were not accessible prior to D73835. Fix this by making use of existing (and now accessible) IRBuilder methods, which will handle inserters/folders correctly. There are some changes in OpenMP and Instrumentation tests, where bitcasts now get constant folded. I've also highlighted one InstCombine test which now finishes in two rather than three iterations, thanks to new instructions being inserted into the worklist. Differential Revision: https://reviews.llvm.org/D74787	2020-02-19 20:51:38 +01:00
Bill Wendling	c708922cd0	Include static prof data when collecting loop BBs Summary: If the programmer adds static profile data to a branch---i.e. uses "__builtin_expect()" or similar---then we should honor it. Otherwise, "__builtin_expect()" is ignored in crucial situations. So we trust that the programmer knows what they're doing until proven wrong. Subscribers: hiraditya, JDevlieghere, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74809	2020-02-19 11:33:48 -08:00
Simon Pilgrim	d794c53322	[AMDGPU] Regenerate immediate constant tests	2020-02-19 18:58:44 +00:00
Simon Pilgrim	c00ec168b5	[UpdateTestChecks] Add support for '.' in ir function names Will let us regenerate from amdgpu float constant tests	2020-02-19 18:58:44 +00:00
Louis Dionne	879ac1565d	[CMake] Only detect the linker once in AddLLVM.cmake Summary: Otherwise, the build output contains a bunch of "Linker detection: <xxx>" lines that are really redundant. We also make redundant calls to the linker, although that is a smaller concern. Reviewers: smeenai Subscribers: mgorny, fedor.sergeev, jkorous, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68648	2020-02-19 13:53:38 -05:00

1 2 3 4 5 ...

192240 Commits