llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 12:41:49 +01:00

Author	SHA1	Message	Date
Hiroshi Yamauchi	9896d761e8	[ThinLTO] Compute the basic block count across modules. Summary: Count the per-module number of basic blocks when the module summary is computed and sum them up during Thin LTO indexing. This is used to estimate the working set size under the partial sample PGO. This is split off of D79831. Reviewers: davidxl, espindola Subscribers: emaste, inglorion, hiraditya, MaskRay, steven_wu, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80403	2020-05-28 10:33:05 -07:00
Philip Reames	546a9c07c3	Default to generating statepoints with deopt and gc-transition bundles if needed Continues from D80598. The key point of the change is to default to using operand bundles instead of the inline length prefix argument lists for statepoint nodes. An important subtlety to note is that the presence of a bundle has semantic meaning, even if it is empty. As such, we need to make a somewhat deeper change to the interface than is first obvious. Existing code treats statepoint deopt arguments and the deopt bundle operands differently during inlining. The former is ignored (resulting in caller state being dropped), the latter is merged. We can't preserve the old behaviour for calls with deopt fed to RS4GC and then inlining, but we can avoid the no-deopt case changing. At least in internal testing, that seems to be the important one. (I'd argue the "stop merging after RS4GC" behaviour for the former was always "unexpected", but that the behaviour for non-deopt calls actually makes sense.) Differential Revision: https://reviews.llvm.org/D80674	2020-05-28 10:14:23 -07:00
Hiroshi Yamauchi	f331ef22e8	[PGO] Guard the memcmp/bcmp size value profiling instrumentation behind flag. Summary: Follow up D79751 and put the instrumentation / value collection side (in addition to the optimization side) behind the flag as well. Reviewers: davidxl Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80646	2020-05-28 10:07:04 -07:00
Craig Topper	eccf8034a4	[X86] Add 'avx512vp2intersect' to getHostCPUFeatures.	2020-05-28 09:57:17 -07:00
LLVM GN Syncbot	08b2f1da9b	[gn build] Port 7cfdff7b4a6	2020-05-28 16:49:43 +00:00
Nikita Popov	2d0f3c3ccb	[SDAG] Don't require LazyBlockFrequencyInfo at optnone While LazyBlockFrequencyInfo itself is lazy, the dominator tree and loop info analyses it requires are not. Drop the dependency on this pass in SelectionDAGIsel at O0. This makes for a ~0.6% O0 compile-time improvement. Differential Revision: https://reviews.llvm.org/D80387	2020-05-28 18:48:33 +02:00
Sidharth Baveja	d7febd5baa	Create utility function to Merge Adjacent Basic Blocks Summary: The following code from /llvm/lib/Transforms/Utils/LoopUnrollAndJam.cpp can be used by other transformations: while (!MergeBlocks.empty()) { BasicBlock *BB = *MergeBlocks.begin(); BranchInst *Term = dyn_cast<BranchInst>(BB->getTerminator()); if (Term && Term->isUnconditional() && L->contains(Term->getSuccessor(0))) { BasicBlock *Dest = Term->getSuccessor(0); BasicBlock *Fold = Dest->getUniquePredecessor(); if (MergeBlockIntoPredecessor(Dest, &DTU, LI)) { // Don't remove BB and add Fold as they are the same BB assert(Fold == BB); (void)Fold; MergeBlocks.erase(Dest); } else MergeBlocks.erase(BB); } else MergeBlocks.erase(BB); } Hence it should be separated into its own utility function. Authored By: sidbav Reviewer: Whitney, Meinersbur, asbirlea, dmgreen, etiotto Reviewed By: asbirlea Subscribers: hiraditya, zzheng, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D80583	2020-05-28 16:44:37 +00:00
Vy Nguyen	86e59df733	[llvm-exegesis] Make a few counter methods virtual to allow targets to provide target-specific support. Misc: Also include errno in failure message. Differential Revision: https://reviews.llvm.org/D80610	2020-05-28 12:38:25 -04:00
Adrian Prantl	d02e7da40e	Make VE.def a textual header	2020-05-28 09:35:19 -07:00
alex-t	90d04d1194	[AMDGPU] Reject moving PHI to VALU if the only VGPR input originated from move immediate Summary: A PHI's result register class is set to VGPR or SGPR depending on the cross-block value divergence. In some cases a uniform PHI needs to be converted to return VGPR to prevent an odd number of moves of values from VGPR to SGPR and back. A PHI should certainly return VGPR if it has at least one VGPR input. This change adds the exception. We don't want to convert a uniform PHI to VGPRs in case the only VGPR input is a VGPR to SGPR COPY and the definition of the source VGPR in this COPY is a move immediate. bb.0: %0:vgpr_32 = V_MOV_B32_e32 0, implicit $exec %2:sreg_32 = ..... bb.1: %3:sreg_32 = PHI %1, %bb.3, %2, %bb.1 S_BRANCH %bb.3 bb.3: %1:sreg_32 = COPY %0 S_BRANCH %bb.2 Reviewers: rampitec Reviewed By: rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80434	2020-05-28 19:25:51 +03:00
Jean-Michel Gorius	d3e7b3c0e9	[x86] Propagate memory operands during call frame optimization Summary: Propagate memory operands when folding load instructions into instructions that directly operate on memory. The original revision has been split. See D80140 for the other part of the changes. Reviewers: craig.topper, rnk, lebedev.ri, efriedma Reviewed By: craig.topper Subscribers: lebedev.ri, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80062	2020-05-28 17:45:53 +02:00
Matt Arsenault	e803204f39	AMDGPU: Add missing test for s_denorm_mode scheduling Forgot to add this file to 1a9e0d7092145e33175f628f4cdd28acf0d17100	2020-05-28 11:07:22 -04:00
Matt Arsenault	8d88eb100a	AMDGPU: Make S_DENORM_MODE not be a scheduling boundary Now that the mode register uses/defs should be properly modeled, we don't need to treat the FP mode switch as an arbitrary side effect.	2020-05-28 10:39:33 -04:00
Simon Pilgrim	5b2222b080	SymbolicFile.h - removed unused FileSystem.h include. NFC. Exposes a number of implicit dependencies that needs fixing in source files and XCOFFObjectFile.h.	2020-05-28 15:26:31 +01:00
Matt Arsenault	f681d242fd	InferAddressSpaces: Handle ptrmask intrinsic This one is slightly odd since it counts as an address expression, which previously could never fail. Allow the existing TTI hook to return the value to use, and re-use it for handling how to handle ptrmask. Handles the no-op addrspacecasts for AMDGPU. We could probably do something better based on analysis of the mask value based on the address space, but leave that for now.	2020-05-28 10:04:02 -04:00
Matt Arsenault	d32b831da3	AMDGPU: Add baseline test for ptrmask infer address space	2020-05-28 10:04:02 -04:00
Simon Pilgrim	d8062a1a74	FileOutputBuffer.h - remove unused includes. NFC. Move dependent includes down to source files where necessary.	2020-05-28 14:38:12 +01:00
Simon Pilgrim	6687c8f843	WithColor.h - reduce unnecessary includes to forward declarations. NFC.	2020-05-28 14:38:11 +01:00
Simon Pilgrim	777eac6f23	[X86][SSE] Peek through MOVMSK source sign bits using SimplifyMultipleUseDemandedBits Allows SimplifyDemandedBitsForTargetNode to peek through multi-use ops where MOVMSK only demands the signbit of each vector element.	2020-05-28 13:42:24 +01:00
Alok Kumar Sharma	07d5d3cc6f	Fixed bot failure after d20bf5a7258d4b6a7 There was a failure on the Windows bot due to a format mismatch (hex vs. decimal) on different platforms even though the meaning of the output is the same. For example on X86 linux => DW_OP_plus_uconst 0x70, DW_OP_deref, DW_OP_lit4, DW_OP_mul ^ on X86 Windows-gnu => DW_AT_location (DW_OP_fbreg +112, DW_OP_deref, DW_OP_lit4, DW_OP_mul) : error: CHECK-SAME: expected string not found in input ; CHECK-SAME: DW_OP_plus_uconst 0x70, DW_OP_deref, DW_OP_lit4, DW_OP_mul ^ <stdin>:28:17: note: scanning from here DW_AT_location (DW_OP_fbreg +112, DW_OP_deref, DW_OP_lit4, DW_OP_mul) ^ <stdin>:28:18: note: possible intended match here DW_AT_location (DW_OP_fbreg +112, DW_OP_deref, DW_OP_lit4, DW_OP_mul) Now the test is limited to x86 using REQUIRED and -mtriple. http://45.33.8.238/win/16214/step_11.txt	2020-05-28 18:01:38 +05:30
Dmitry Preobrazhensky	0a188f55c6	[AMDGPU][MC][GFX908] Corrected src0 of v_accvgpr_write to accept only VGPRs and inline constants. This change disables use of special SGPR registers like scc, vccz, execz, etc as operands of v_accvgpr_write. See bug 45414: https://bugs.llvm.org/show_bug.cgi?id=45414 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D80530	2020-05-28 15:10:55 +03:00
Simon Pilgrim	2d65366850	Fix MSVC signed/unsigned comparison warnings. NFC.	2020-05-28 13:07:06 +01:00
Simon Pilgrim	9a9c4806d2	DWARFDebugMacro.h - remove unnecessary WithColor.h include. NFC.	2020-05-28 13:03:18 +01:00
Simon Pilgrim	20177dfe57	llvm-dwarfdump.h - remove unnecessary WithColor.h include. NFC.	2020-05-28 13:03:18 +01:00
Dmitry Preobrazhensky	edc91f55de	[AMDGPU][MC] Corrected v_writelane_b32 to fix a decoding bug Corrected vdst_in to match vdst operand type. See bug 45193: https://bugs.llvm.org/show_bug.cgi?id=45193 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D80636	2020-05-28 14:43:49 +03:00
Dmitry Preobrazhensky	2e7cdb8278	[AMDGPU][MC][DISASSEMBLER] Corrected decoder to consume each code fragment only once Summary: disabled disassembly of successfully decoded fragments of code. See detailed bug description: https://bugs.llvm.org/show_bug.cgi?id=46101 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D80637	2020-05-28 14:20:18 +03:00
Georgii Rymar	248c471022	[yaml2obj] - Implement the "SectionHeaderTable" tag. With the "SectionHeaderTable" it is now possible to reorder entries in the section header table. It also allows to stop emitting the table. Differential revision: https://reviews.llvm.org/D80002	2020-05-28 13:42:43 +03:00
Florian Hahn	81012eebda	[AArch64] Precommit new fp extraction/insertion test.	2020-05-28 11:13:47 +01:00
Alok Kumar Sharma	254b7710b8	Fixed bot failure after d20bf5a7258d4b6a7 There were some bot failures due to an unused function `rotateSign` left in code. http://lab.llvm.org:8011/builders/clang-ppc64le-rhel/builds/3731 error: unused function 'rotateSign' [-Werror,-Wunused-function] static uint64_t rotateSign(int64_t I)	2020-05-28 15:42:04 +05:30
Cullen Rhodes	f463d29a62	[AArch64][SVE] Add support for spilling/filling ZPR2/3/4 Summary: This patch enables the register allocator to spill/fill lists of 2, 3 and 4 SVE vectors registers to/from the stack. This is implemented with pseudo instructions that get expanded to individual LDR_ZXI/STR_ZXI instructions in AArch64ExpandPseudoInsts. Patch by Sander de Smalen. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D75988	2020-05-28 10:02:57 +00:00
Victor Campos	c43ac664bc	[ARM] Improve codegen of volatile load/store of i64 Summary: Instead of generating two i32 instructions for each load or store of a volatile i64 value (two LDRs or STRs), now emit LDRD/STRD. These improvements cover architectures implementing ARMv5TE or Thumb-2. The code generation explicitly deviates from using the register-offset variant of LDRD/STRD. In this variant, the register allocated to the register-offset cannot be reused in any of the remaining operands. Such restriction seems to be non-trivial to implement in LLVM, thus it is left as a to-do. Differential Revision: https://reviews.llvm.org/D70072	2020-05-28 10:52:43 +01:00
Thomas Preud'homme	959d70cd6e	FileCheck [10/12]: Add support for signed numeric values Summary: This patch is part of a patch series to add support for FileCheck numeric expressions. This specific patch adds support for signed numeric values, thus allowing negative numeric values. As such, the patch adds a new class to represent a signed or unsigned value and adds the logic for type promotion and type conversion in numeric expressions mixing signed and unsigned values. It also adds the %d format specifier to represent signed values. Finally, it also adds underflow and overflow detection when performing a binary operation. Copyright: - Linaro (changes up to diff 183612 of revision D55940) - GraphCore (changes in later versions of revision D55940 and in new revision created off D55940) Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson Reviewed By: jhenderson, arichardson Subscribers: MaskRay, hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, kristina, hfinkel, rogfer01, JonChesterfield Tags: #llvm Differential Revision: https://reviews.llvm.org/D60390	2020-05-28 10:44:21 +01:00
Cullen Rhodes	c219487139	[TableGen] Fix non-standard escape warnings for braces in InstAlias Summary: TableGen interprets braces ('{}') in the asm string of instruction aliases as variants but when defining aliases with literal braces they have to be escaped to prevent them being removed. Braces are escaped with '\\', for example: def FooBraces : InstAlias<"foo \\{$imm\\}", (foo IntOperand:$imm)>; Although when TableGen is emitting the assembly writer (-gen-asm-writer) the AsmString that gets emitted is: AsmString = "foo \{$\x01\}"; In c/c++ braces don't need to be escaped which causes compilation warnings: warning: use of non-standard escape character '\{' [-Wpedantic] This patch fixes the issue by unescaping the flattened alias asm string in the asm writer, by replacing '\{\}' with '{}'. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D79991	2020-05-28 09:36:24 +00:00
Sander de Smalen	1563809d70	[CodeGen] Specify meaning of ISD opcodes for scalable vectors This patch contains changes to the description of EXTRACT_SUBVECTOR, INSERT_SUBVECTOR, INSERT_VECTOR_ELT, EXTRACT_VECTOR_ELT and CONCAT_VECTORS to specify their behaviour for scalable vectors. For EXTRACT_SUBVECTOR it specifies that the IDX is scaled by the same runtime scaling as the extracted (or inserted) vector. This definition is the most natural extension to EXTRACT_SUBVECTOR for scalable vectors, as most use-cases that work on fixed-width types will have the same meaning for scalable types. For legalization for example, it is common to split the vector operation to operate on the LO and HI halves of a vector. For a fixed width vector <16 x i8> this would be expressed with: v16i8 %res = EXTRACT_SUBVECTOR v32i8 %v, i32 16 For a scalable vector, this would similarly be expressed as: nxv16i8 %res = EXTRACT_SUBVECTOR nxv32i8 %V, i32 16 By extending the meaning of IDX for scalable vectors, most existing optimisations on EXTRACT/INSERT_SUBVECTOR work for scalable vectors without any changes. This definition also allows extracting a fixed-width subvector from a scalable vector, which is useful to e.g. extract the low N lanes of a scalable vector. This patch is not NFC because it sets the meaning of these nodes for scalable vectors, which future patches will build upon. Reviewers: efriedma, ctetreau, rogfer01, craig.topper Reviewed By: efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D79806	2020-05-28 09:29:15 +01:00
Alok Kumar Sharma	6077c65472	[DebugInfo] Upgrade DISubrange to support Fortran dynamic arrays This patch upgrades DISubrange to support Fortran requirements. Summary: Below are the updates/addition of fields. lowerBound - Now accepts signed integer or DIVariable or DIExpression, earlier it accepted only signed integer. upperBound - This field is now added and accepts signed integer or DIVariable or DIExpression. stride - This field is now added and accepts signed integer or DIVariable or DIExpression. This is required to describe bounds of array which are known at runtime. Testing: unit test cases added (hand-written) check clang check llvm check debug-info Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D80197	2020-05-28 13:46:41 +05:30
LLVM GN Syncbot	a688a48908	[gn build] Port 5921782f744	2020-05-28 08:08:39 +00:00
Kazushi (Jam) Marukawa	3398257fcd	[VE] Implements minimum MC layer for VE (3/4) Summary: Define ELF binary code for VE and modify code that should use this new code. Depends on D79544. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D79545	2020-05-28 10:07:48 +02:00
Sjoerd Meijer	69e0e4ce9a	[HardwareLoops] LangRef Intrinsic descriptions The HardwareLoop intrinsics were missing and not described in LangRef. This adds these descriptions/definitions. Differential Revision: https://reviews.llvm.org/D80316	2020-05-28 08:36:04 +01:00
Kazu Hirata	030d6689c5	[JumpThreading] Use emplace_back instead of push_back (NFC) Summary: This patch replaces push_back with emplace_back where appropriate. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80688	2020-05-27 22:31:23 -07:00
Sourabh Singh Tomar	ec4b82df7d	[docs] Release notes for DIModule metadata Updated the release notes for the changes in the DIModule metadata. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D80614	2020-05-28 10:17:40 +05:30
Vitaly Buka	a5c0d86ca2	[NFC,StackSafety] Add StackSafetyGlobalInfo class	2020-05-27 20:07:12 -07:00
LLVM GN Syncbot	ff792d0b41	[gn build] Port 660cda572d6	2020-05-28 02:47:12 +00:00
Philip Reames	3ef66ae8ae	[Statepoint] Reduce scope of usage of ImmutableStatepoint Can't quite fully remove it yet as some more items need to be sunk into the GCStatepointInst class from the wrapper, but we can at least reduce scope.	2020-05-27 18:57:42 -07:00
Xing GUO	6ee8cacb58	[ObjectYAML][MachO] Add error handling in MachOEmitter. Currently, `yaml2macho` doesn't support error handling. This patch helps improve it. Differential Revision: https://reviews.llvm.org/D80535	2020-05-28 09:54:46 +08:00
Philip Reames	0fc6f7056b	[Statepoint] Replace uses of isX functions with idiomatic isa<X> Now that all of the statepoint related routines have classes with isa support, let's clean up. I'm leaving the (dead) utilities in tree for a few days so that I can do the same cleanup downstream without breakage.	2020-05-27 18:32:28 -07:00
Philip Reames	ea75ead8b0	Sink first bit of functionality from Statepoint to GCStatepointInst Starting with the obvious stuff. I initially tried to include the inline operand sequences too, but managed to get code which confused me. Since several parts of those are being entirely removed in the near future, I may defer that portion until the cleanup is done.	2020-05-27 18:32:28 -07:00
Vitaly Buka	e9294ed2fc	[NFC,StackSafety] Cleanup alloca size calculation	2020-05-27 17:47:02 -07:00
Philip Reames	bce8a58b24	Introduce a GCStatepointInst type analogous to IntrinsicInst subclasses Back when we had CallSite, we implemented the current Statepoint/ImmutableStatepoint structure in analogous manner. Now that CallSite has been removed, the structure used for statepoints looks decidedly out of place. gc.statepoint is one of the small handful of intrinsics which are invokable. Because of this, it can't subclass IntrinsicInst as is idiomatic. This change simply introduces the GCStatepointInst class, restructures the existing Statepoint/ImmutableStatepoint types to wrap it. I will be landing a series of changes to sink functionality into GCStatepointInst and updating callers to be more idiomatic.	2020-05-27 17:25:13 -07:00
Fangrui Song	48475697c0	[gn build] Add MLAnalysisTests after D80579	2020-05-27 17:21:05 -07:00
Mircea Trofin	e93714327a	[llvm][NFC] ProfileSummaryInfo - const-ify APIs Follow-up from https://reviews.llvm.org/D79920	2020-05-27 17:14:41 -07:00


197370 Commits