llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 12:41:49 +01:00

Author	SHA1	Message	Date
Jay Foad	6c1703b074	[SDA] Don't stop divergence propagation at the IPD. Summary: This fixes B42473 and B42706. This patch makes the SDA propagate branch divergence until the end of the RPO traversal. Before, the SyncDependenceAnalysis propagated divergence only until the IPD in rpo order. RPO is incompatible with post dominance in the presence of loops. This made the SDA crash because blocks were missed in the propagation. Reviewers: foad, nhaehnle Reviewed By: foad Subscribers: jvesely, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65274 llvm-svn: 372223	2019-09-18 13:40:22 +00:00
Simon Atanasyan	6a1f00145f	[mips] Pass "xgot" flag as a subtarget feature We need "xgot" flag in the MipsAsmParser to implement correct expansion of some pseudo instructions in case of using 32-bit GOT (XGOT). MipsAsmParser does not have reference to MipsSubtarget but has a reference to "feature bit set". llvm-svn: 372220	2019-09-18 12:24:57 +00:00
Simon Atanasyan	1124aeadf7	[mips] Mark tests for lw/sw expansion in PIC by a separate "check prefix". NFC That simplify adding XGOT tests later. llvm-svn: 372219	2019-09-18 12:24:30 +00:00
Simon Atanasyan	fa04b58fc7	[mips] Reduce code duplication in the `loadAndAddSymbolAddress`. NFC llvm-svn: 372218	2019-09-18 12:24:23 +00:00
Simon Pilgrim	5b325c0c9d	Fix -Wdocumentation warning. NFCI. llvm-svn: 372215	2019-09-18 11:22:22 +00:00
Simon Pilgrim	9a5abcd236	Fix -Wdocumentation "empty paragraph passed to '\brief'" warning. NFCI. llvm-svn: 372214	2019-09-18 10:41:20 +00:00
Simon Pilgrim	cfa33bb88f	Fix -Wdocumentation "@returns in a void function" warning. NFCI. llvm-svn: 372212	2019-09-18 10:39:16 +00:00
Simon Pilgrim	710428e863	Fix -Wdocumentation "Unknown param" warning. NFCI. llvm-svn: 372211	2019-09-18 10:37:53 +00:00
Russell Gallop	eddfe913c8	[cmake] Changes to get Windows self-host working with PGO Fixes quoting of profile arguments to work on Windows Suppresses adding profile arguments to linker flags when using lld-link Avoids -fprofile-instr-use being added to rc.exe flags Removes duplicated adding of -fprofile-instr-use to linker flags (since r355541) Move handling LLVM_PROFDATA_FILE to HandleLLVMOptions.cmake Differential Revision: https://reviews.llvm.org/D62063 llvm-svn: 372209	2019-09-18 09:43:13 +00:00
Tim Renouf	13fa3ce1bc	[AMDGPU] Allow FP inline constant in v_madak_f16 and v_fmaak_f16 Differential Revision: https://reviews.llvm.org/D67680 Change-Id: Ic38f47cb2079c2c1070a441b5943854844d80a7c llvm-svn: 372208	2019-09-18 09:32:06 +00:00
Guillaume Chatelet	08358b1ba1	[Alignment] Add a None() member function Summary: This will allow writing `if(A != llvm::Align::None())` which is clearer than `if(A > llvm::Align(1))` This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67697 llvm-svn: 372207	2019-09-18 09:24:40 +00:00
Sander de Smalen	30475093d7	[AArch64][DebugInfo] Do not recompute CalleeSavedStackSize This patch fixes a bug exposed by D65653 where a subsequent invocation of `determineCalleeSaves` ends up with a different size for the callee save area, leading to different frame-offsets in debug information. In the invocation by PEI, `determineCalleeSaves` tries to determine whether it needs to spill an extra callee-saved register to get an emergency spill slot. To do this, it calls 'estimateStackSize' and manually adds the size of the callee-saves to this. PEI then allocates the spill objects for the callee saves and the remaining frame layout is calculated accordingly. A second invocation in LiveDebugValues causes estimateStackSize to return the size of the stack frame including the callee-saves. Given that the size of the callee-saves is added to this, these callee-saves are counted twice, which leads `determineCalleeSaves` to believe the stack has become big enough to require spilling an extra callee-save as emergency spillslot. It then updates CalleeSavedStackSize with a larger value. Since CalleeSavedStackSize is used in the calculation of the frame offset in getFrameIndexReference, this leads to incorrect offsets for variables/locals when this information is recalculated after PEI. Reviewers: omjavaid, eli.friedman, thegameg, efriedma Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D66935 llvm-svn: 372204	2019-09-18 09:02:44 +00:00
Ilya Biryukov	93a7fe8d08	Revert "r372201: [Support] Replace function with function_ref in writeFileAtomically. NFC" function_ref causes calls to the function to be ambiguous, breaking compilation. Reverting for now. llvm-svn: 372202	2019-09-18 08:47:09 +00:00
Ilya Biryukov	5e10e7bc8c	[Support] Replace function with function_ref in writeFileAtomically. NFC Summary: The latter is slightly more efficient and communicates the intent of the API: writeFileAtomically does not own or copy the callback, it merely calls it at some point. Reviewers: jkorous Reviewed By: jkorous Subscribers: hiraditya, dexonsmith, jfb, llvm-commits, cfe-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67584 llvm-svn: 372201	2019-09-18 08:31:28 +00:00
Craig Topper	a5385636e7	[X86] Break non-power of 2 vXi1 vectors into scalars for argument passing with avx512. This generates worse code, but matches what is done for avx2 and prevents crashes when more arguments are passed than we have registers for. llvm-svn: 372200	2019-09-18 06:06:11 +00:00
Craig Topper	043fb9113b	[X86] Add test case for passing a v17i1 vector with avx512 llvm-svn: 372199	2019-09-18 06:06:07 +00:00
Yonghong Song	284d1a3fdb	[BPF] Permit all user instructed offset relocatiions Currently, not all user specified relocations (with clang intrinsic __builtin_preserve_access_index()) will turn into relocations. In the current implementation, a __builtin_preserve_access_index() chain is turned into relocation only if the result of the clang intrinsic is used in a function call or a nonzero offset computation of getelementptr. For all other cases, the relocatiion request is ignored and the __builtin_preserve_access_index() is turned into regular getelementptr instructions. The main reason is to mimic bpf_probe_read() requirement. But there are other use cases where relocatable offset is generated but not used for bpf_probe_read(). This patch relaxed previous constraints when to generate relocations. Now, all user __builtin_preserve_access_index() will have relocations generated. Differential Revision: https://reviews.llvm.org/D67688 llvm-svn: 372198	2019-09-18 03:49:07 +00:00
Craig Topper	2ad85d1f47	[X86] Prevent assertion when calling a function that returns double with -mno-sse2 on x86-64. As seen in the most recent updates to PR10498 llvm-svn: 372197	2019-09-18 01:57:46 +00:00
Francis Visoiu Mistrih	e75b4debb9	[Remarks] Allow the RemarkStreamer to be used directly with a stream The filename in the RemarkStreamer should be optional to allow clients to stream remarks to memory or to existing streams. This introduces a new overload of `setupOptimizationRemarks`, and avoids enforcing the presence of a filename at different places. llvm-svn: 372195	2019-09-18 01:04:45 +00:00
Teresa Johnson	023768353b	[PGO] Change hardcoded thresholds for cold/inlinehint to use summary Summary: The PGO counter reading will add cold and inlinehint (hot) attributes to functions that are very cold or hot. This was using hardcoded thresholds, instead of the profile summary cutoffs which are used in other hot/cold detection and are more dynamic and adaptable. Switch to using the summary-based cold/hot detection. The hardcoded limits were causing some code that had a medium level of hotness (per the summary) to be incorrectly marked with a cold attribute, blocking inlining. Reviewers: davidxl Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67673 llvm-svn: 372189	2019-09-17 23:12:13 +00:00
Eli Friedman	1031830a08	[ARM] VFPv2 only supports 16 D registers. r361845 changed the way we handle "D16" vs. "D32" targets; there used to be a negative "d16" which removed instructions from the instruction set, and now there's a "d32" feature which adds instructions to the instruction set. This is good, but there was an oversight in the implementation: the behavior of VFPv2 was changed. In particular, the "vfp2" feature was changed to imply "d32". This is wrong: VFPv2 only supports 16 D registers. In practice, this means if you specify -mfpu=vfpv2, the compiler will generate illegal instructions. This patch gets rid of "vfp2d16" and "vfp2d16sp", and fixes "vfp2" and "vfp2sp" so they don't imply "d32". Differential Revision: https://reviews.llvm.org/D67375 llvm-svn: 372186	2019-09-17 21:42:38 +00:00
Reid Kleckner	5f03c263a9	[PGO] Don't use comdat groups for counters & data on COFF For COFF, a comdat group is really a symbol marked IMAGE_COMDAT_SELECT_ANY and zero or more other symbols marked IMAGE_COMDAT_SELECT_ASSOCIATIVE. Typically the associative symbols in the group are not external and are not referenced by other TUs, they are things like debug info, C++ dynamic initializers, or other section registration schemes. The Visual C++ linker reports a duplicate symbol error for symbols marked IMAGE_COMDAT_SELECT_ASSOCIATIVE even if they would be discarded after handling the leader symbol. Fixes coverage-inline.cpp in check-profile after r372020. llvm-svn: 372182	2019-09-17 21:10:49 +00:00
Jinsong Ji	a033158ef6	Reland "[docs][Bugpoint]Add notes about multiple crashes" Fix the warning. Bugpoint.rst:124:Mismatch: both interpreted text role prefix and reference suffix. Note that the line no here is wrong and misleading, the problem is in line 128, not 124. llvm-svn: 372181	2019-09-17 21:09:41 +00:00
Greg Clayton	5ff58013a9	Fix buildbots. MSVC doesn't correctly capture constexpr in lambdas, and other builds warn if you do, others will error out if you do. Avoid lambdas. llvm-svn: 372179	2019-09-17 20:31:01 +00:00
Jessica Paquette	a00d53313e	[AArch64][GlobalISel] Support -tailcallopt This adds support for `-tailcallopt` tail calls to CallLowering. This piggy-backs off the changes from D67577, since doing it without a bit of refactoring gets extremely ugly. Support is basically ported from AArch64ISelLowering. The main difference here is that tail calls in `-tailcallopt` change the ABI, so there's some extra bookkeeping for the stack. Show that we are correctly lowering these by updating tail-call.ll. Also show that we don't do anything strange in general by updating fastcc-reserved.ll, which passes `-tailcallopt`, but doesn't emit any tail calls. Differential Revision: https://reviews.llvm.org/D67580 llvm-svn: 372177	2019-09-17 20:24:23 +00:00
GN Sync Bot	02bf2ff5f8	gn build: Merge r372168 llvm-svn: 372173	2019-09-17 19:41:36 +00:00
Roman Lebedev	0395b18dd2	AArch64CallLowering::lowerCall(): fix build by not passing InArgs into lowerTailCall() llvm-svn: 372172	2019-09-17 19:37:07 +00:00
Roman Lebedev	07cce5d638	[NFC][InstCombine] dropRedundantMaskingOfLeftShiftInput(): some NFC diff shaving llvm-svn: 372171	2019-09-17 19:32:26 +00:00
Roman Lebedev	74ebe31eb0	[NFC][InstCombine] More tests for "Dropping pointless masking before left shift" (PR42563) While we already fold that pattern if the sum of shift amounts is not smaller than bitwidth, there's painfully obvious generalization: https://rise4fun.com/Alive/F5R I.e. the "sub of shift amounts" tells us how many bits will be left in the output. If it's less than bitwidth, we simply need to apply a mask, which is constant. llvm-svn: 372170	2019-09-17 19:32:11 +00:00
Bardia Mahjour	2d2a856f02	Revert "Data Dependence Graph Basics" This reverts commit c98ec60993a7aa65073692b62f6d728b36e68ccd, which broke the sphinx-docs build. llvm-svn: 372168	2019-09-17 19:22:01 +00:00
Simon Pilgrim	a07ba2c360	NVPTXAsmPrinter - Don't dereference a dyn_cast result. NFCI. llvm-svn: 372166	2019-09-17 19:16:00 +00:00
Simon Pilgrim	ebdb3552e6	WasmEmitter - Don't dereference a dyn_cast result. NFCI. llvm-svn: 372165	2019-09-17 19:14:11 +00:00
Jessica Paquette	c2540f58f7	[AArch64][GlobalISel][NFC] Refactor tail call lowering code When you begin implementing -tailcallopt, this gets somewhat hairy. Refactor the call lowering code so that the tail call lowering stuff gets its own function. Differential Revision: https://reviews.llvm.org/D67577 llvm-svn: 372164	2019-09-17 19:08:44 +00:00
GN Sync Bot	88acb8cbf4	gn build: Merge r372162 llvm-svn: 372163	2019-09-17 19:00:41 +00:00
Bardia Mahjour	3f478f329a	Data Dependence Graph Basics Summary: This is the first patch in a series of patches that will implement data dependence graph in LLVM. Many of the ideas used in this implementation are based on the following paper: D. J. Kuck, R. H. Kuhn, D. A. Padua, B. Leasure, and M. Wolfe (1981). DEPENDENCE GRAPHS AND COMPILER OPTIMIZATIONS. This patch contains support for a basic DDGs containing only atomic nodes (one node for each instruction). The edges are two fold: def-use edges and memory-dependence edges. The implementation takes a list of basic-blocks and only considers dependencies among instructions in those basic blocks. Any dependencies coming into or going out of instructions that do not belong to those basic blocks are ignored. The algorithm for building the graph involves the following steps in order: 1. For each instruction in the range of basic blocks to consider, create an atomic node in the resulting graph. 2. For each node in the graph establish def-use edges to/from other nodes in the graph. 3. For each pair of nodes containing memory instruction(s) create memory edges between them. This part of the algorithm goes through the instructions in lexicographical order and creates edges in reverse order if the sink of the dependence occurs before the source of it. Authored By: bmahjour Reviewer: Meinersbur, fhahn, myhsu, xtian, dmgreen, kbarton, jdoerfert Reviewed By: Meinersbur, fhahn, myhsu Subscribers: ychen, arphaman, simoll, a.elovikov, mgorny, hiraditya, jfb, wuzish, llvm-commits, jsji, Whitney, etiotto Tag: #llvm Differential Revision: https://reviews.llvm.org/D65350 llvm-svn: 372162	2019-09-17 18:55:44 +00:00
Jinsong Ji	13a8a7aaf6	[docs][Bugpoint] Revert 5584ead50 a5aa3353 No sure why there are still warnings, revert while I investigate. llvm-svn: 372161	2019-09-17 18:39:04 +00:00
Jinsong Ji	96ff72efbe	[docs][Bugpoint] Fix build break. Bugpoint.rst:124: WARNING: Mismatch: both interpreted text role prefix and reference suffix. llvm-svn: 372160	2019-09-17 18:23:06 +00:00
Craig Topper	1123deea41	[X86] Use APInt::operator<<= and APInt::lshrInPlace. NFC llvm-svn: 372159	2019-09-17 18:19:06 +00:00
Craig Topper	848c5cfc3a	[SimplifyDemandedBits] Use APInt::intersects to instead of ANDing and comparing to 0 separately. NFC llvm-svn: 372158	2019-09-17 18:19:02 +00:00
Jinsong Ji	f3b3429a72	[docs][Bugpoint]Add notes about multiple crashes Summary: When reducing case for a CodeGenCrash, bugpoint may generate a new reduced testcase that exposes/causes another crash or break something due to limitation. Bugpoint does not distiguish different crashes currently, so when this happens, bugpoint will go on reducing for the new crash, or just abort, we can't get the case reduced for the origial crash. An advice is added into usage doc to connect to recommend checking error message with scripts and `-compile-command`. Reviewers: modocache, bogner, sebpop, reames, vsk, MatzeB Reviewed By: vsk Subscribers: mehdi_amini, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66832 llvm-svn: 372157	2019-09-17 18:10:09 +00:00
Craig Topper	4c6063ee26	[X86] Simplify b2b KSHIFTL+KSHIFTR using demanded elts. llvm-svn: 372155	2019-09-17 18:02:56 +00:00
Craig Topper	3771a78ae4	[X86] Call SimplifyDemandedVectorElts on KSHIFTL/KSHIFTR nodes during DAG combine. llvm-svn: 372154	2019-09-17 18:02:52 +00:00
Craig Topper	89ae93e6d6	[X86] Simplify some code in LowerBUILD_VECTORvXi1. NFCI The case were Immediate is 0 and HasConstElts is true should never happen since that would mean the constant elts were all zero. But we check for all zero build vector earlier. So just use HasConstElts and blindly take Immediate without checking if its 0. Move the code that bitcasts and extract the immediate into the the HasConstElts case since the other code just creates an undef with the right type. No casting needed. llvm-svn: 372153	2019-09-17 18:02:46 +00:00
Stanislav Mekhanoshin	4dfc345a12	[AMDGPU] Added MI bit IsDOT NFC, needed for future commit. Differential Revision: https://reviews.llvm.org/D67669 llvm-svn: 372151	2019-09-17 17:56:13 +00:00
GN Sync Bot	194feb7c63	gn build: Merge r372149 llvm-svn: 372150	2019-09-17 17:51:27 +00:00
Greg Clayton	989d8f9a89	GSYM: Add the llvm::gsym::Header header class with tests This patch adds the llvm::gsym::Header class which appears at the start of a stand alone GSYM file, or in the first bytes of the GSYM data in a GSYM section within a file. Added encode and decode methods with full error handling and full tests. Differential Revision: https://reviews.llvm.org/D67666 llvm-svn: 372149	2019-09-17 17:46:13 +00:00
Simon Pilgrim	0ca68847de	[TableGen] CodeGenMapTable - Don't dereference a dyn_cast result. NFCI. The static analyzer is warning about potential null dereferences of dyn_cast<> results - in these cases we can safely use cast<> directly as we know that these cases should all be the correct type, which is why its working atm and anyway cast<> will assert if they aren't. llvm-svn: 372146	2019-09-17 17:32:15 +00:00
Simon Pilgrim	f18077e396	[ARM][AsmParser] Don't dereference a dyn_cast result. NFCI. The static analyzer is warning about potential null dereferences of dyn_cast<> results - in these cases we can safely use cast<> directly as we know that these cases should all be the correct type, which is why its working atm and anyway cast<> will assert if they aren't. llvm-svn: 372145	2019-09-17 17:26:14 +00:00
Simon Pilgrim	884f30e411	Fix MSVC lambda capture warnings. NFCI. llvm-svn: 372144	2019-09-17 17:24:55 +00:00
David Bolvansky	5b5fcd24ff	Reland "[SLC] Preserve attrs for strncpy(x, "", y) -> memset(align 1 x, '\0', y)" llvm-svn: 372142	2019-09-17 17:12:24 +00:00

1 2 3 4 5 ...

185060 Commits