llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Mark de Wever	0c2c90b2f6	[Tools] Fixes -Wrange-loop-analysis warnings This avoids new warnings due to D68912 adds -Wrange-loop-analysis to -Wall. Differential Revision: https://reviews.llvm.org/D71808	2019-12-22 19:11:17 +01:00
Mark de Wever	fe7ca9d333	[TableGen] Fixes -Wrange-loop-analysis warnings This avoids new warnings due to D68912 adds -Wrange-loop-analysis to -Wall. Differential Revision: https://reviews.llvm.org/D71807	2019-12-22 18:58:32 +01:00
Philip Reames	f0044adba8	[Test] Add examples of problematic assembler auto-padding This is in the context of the automatic padding work for the jcc erratum mitigation. These are example cases we need to not pad for correctness. Exact mechanism to suppress is still TBD, but saving the tests which have come up.	2019-12-22 09:01:04 -08:00
Sanjay Patel	0b82d3c468	[InstCombine] enhance fold for copysign with known sign arg This is another optimization suggested in PRPR44153: https://bugs.llvm.org/show_bug.cgi?id=44153	2019-12-22 10:07:01 -05:00
Eric Astor	00e36b584c	[ms] [X86] Use "P" modifier on operands to call instructions in inline X86 assembly. Summary: This is documented as the appropriate template modifier for call operands. Fixes PR44272, and adds a regression test. Also adds support for operand modifiers in Intel-style inline assembly. Reviewers: rnk Reviewed By: rnk Subscribers: merge_guards_bot, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71677	2019-12-22 09:16:34 -05:00
Sanjay Patel	bc27e215ac	[AArch64] match splat of bitcasted extract subvector to DUPLANE This is another potential regression exposed by D63815. Here we peek through a bitcast to find an extract subvector and scale the splat offset based on that: splat (bitcast (extract X, C)), LaneC --> duplane (bitcast X), LaneC' Differential Revision: https://reviews.llvm.org/D71672	2019-12-22 08:37:03 -05:00
David Blaikie	8197755242	DebugInfo: Remove out of date comment	2019-12-21 23:13:26 -08:00
LLVM GN Syncbot	4e17d93dee	[gn build] Port 7376d9eb389	2019-12-22 02:15:02 +00:00
Nico Weber	587ab43da0	[gn build] fixup after c3d13d9c56	2019-12-21 21:14:26 -05:00
Nico Weber	1361644edc	[gn build] fold Basic:version into Basic This now defines HAVE_VCS_VERSION_INC for all files in Basic, but now the BUILD.gn file has only a single "sources" field again, and the automerger requires that. Having the automerger work for clang/lib/Basic is a very nice to have, and the downside seems tiny.	2019-12-21 21:10:02 -05:00
Simon Pilgrim	64e7c37e84	Fix "result of 32-bit shift implicitly converted to 64 bits" warning. NFC.	2019-12-21 17:45:30 +00:00
Simon Pilgrim	94093a9d23	Fix Wpedantic 'extra semicolon' warning. NFC.	2019-12-21 17:32:00 +00:00
Sanjay Patel	0440e33c6b	[InstCombine] check alloc size in bitcast of geps fold (PR44321) We missed a constraint in D44833 when folding a bitcast into a GEP with vector/array types. If the alloc sizes specified by the datalayout don't match, this could miscompile as shown in: https://bugs.llvm.org/show_bug.cgi?id=44321 Differential Revision: https://reviews.llvm.org/D71771	2019-12-21 10:31:21 -05:00
Sanjay Patel	a6e1f3d2ea	[SimplifyLibCalls] require fast-math-flags for pow(X, -0.5) transforms As discussed in PR44330: https://bugs.llvm.org/show_bug.cgi?id=44330 ...the transform from pow(X, -0.5) libcall/intrinsic to reciprocal square root can result in small deviations from the expected result due to differences in the pow() implementation and/or the extra rounding step from the division. This patch proposes to allow that difference with either the 'approximate functions' or 'reassociate' FMF: http://llvm.org/docs/LangRef.html#fast-math-flags In practice, this likely means that the code is compiled with all of 'fast' (-ffast-math), but I have preserved the existing specializations for -0.0/-INF that enable generating safe code if those special values are allowed simultaneously with allowing approximation/reassociation. The question about whether a similar restriction is needed for the non-reciprocal case -- pow(X, 0.5) -- is deferred. That transform is allowed without FMF currently, and this patch does not change that behavior. Differential Revision: https://reviews.llvm.org/D71706	2019-12-21 10:00:53 -05:00
Florian Hahn	1dcaf28381	[AArch64] Respect reserved registers while renaming in LdSt opt. We cannot pick reserved registers as rename registers. Fixes https://bugs.llvm.org/show_bug.cgi?id=44358	2019-12-21 15:10:07 +01:00
Matt Arsenault	133fc88f3b	AMDGPU: Fix repeated word in comment	2019-12-21 04:57:35 -05:00
Matt Arsenault	e75a9647dd	Mips: Make test resistant to future changes This seems to have been relying on extra spills being inserted in these blocks to increase the code size to trigger branch relaxation. This broke when these spills were avoided. Add some asm to pad the size of the blocks to make it not matter.	2019-12-21 04:56:20 -05:00
Matt Arsenault	3f41ea1bc1	AMDGPU/GlobalISel: Fix misuse of div_scale intrinsics Confusingly, the intrinsic operands do not match the instruction/custom node. The order is shuffled, and the 3rd operand is an immediate to select operands. I'm not 100% sure I did this right, but fdiv still doesn't select end to end and it will be easier to tell when it does. This at least avoids an assertion in RegBankSelect and allows hitting the fallback on selection.	2019-12-21 04:55:36 -05:00
Matt Arsenault	0a495d2b92	AMDGPU/GlobalISel: Fix missing scc imp-def on scalar and/or/xor	2019-12-21 04:55:36 -05:00
Matt Arsenault	a6c9caf2a3	AMDGPU/GlobalISel: Simplify code This can directly access the register bank, and doesn't need to get it through the ID.	2019-12-21 04:55:36 -05:00
Lang Hames	24ee3b266f	[ORC] De-register eh-frames in the RTDyldObjectLinkingLayer destructor. This matches the behavior of the legacy layer, which automatically deregistered frames.	2019-12-20 21:10:49 -08:00
Nico Weber	6cb56ea0a6	fix another doc typo to cycle bots	2019-12-20 21:59:51 -05:00
Michael Trent	fc29f2e437	Constrain the macho-stabs test added in f72d001e099 to run on systems configured with an x86 backend. Summary: This fixes a failure on the Builder clang-cmake-armv7-quick bot. Reviewers: lhames, jhenderson Reviewed By: lhames Subscribers: kristof.beyls, rupprecht, seiya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71792	2019-12-20 17:40:37 -08:00
Philip Reames	e43909be99	Add a set of tests with basic coverage of the recently added boundary align feature. There are tests in the included patch, but the duplication is non obvious, so I'm starting with basic coverage beore cleaning up a few of them	2019-12-20 17:19:23 -08:00
Peter Collingbourne	bb2d372edc	gn build: Silence mismatched-new-delete warning in scudo C++ wrapper tests. These tests are deliberately mismatching new and delete, so the warnings are just noise. Differential Revision: https://reviews.llvm.org/D71783	2019-12-20 16:53:13 -08:00
Jessica Paquette	522b2762d7	[NFC][MachineOutliner] Rewrite setSuffixIndices to be iterative Having this function be recursive could use up way too much stack space. Rewrite it as an iterative traversal in the tree instead to prevent this. Fixes PR44344.	2019-12-20 16:12:37 -08:00
Craig Topper	2287f0c506	[X86] Add test cases for missing propagation of fpexcept flag on strict fp operations. NFC The flag is being lost during type legalization or lowering. This covers some of the cases. I'm sure there are many missing.	2019-12-20 15:55:13 -08:00
Petr Hosek	384d8ce0d5	[llvm-symbolizer] Prefix invocations in test with env This addresses an issue introduced in dedad08 and is needed to make sure this test works properly on Windows.	2019-12-20 15:52:14 -08:00
Vedant Kumar	65fb265c86	[DWARF] Defer creating declaration DIEs until we prepare call site info It isn't necessary to create DIEs for all of the declaration subprograms in a CU's retainedTypes list. We can defer creating these subprograms until we need to prepare a call site tag that refers to one. This cleanup was mentioned in passing in D70350.	2019-12-20 15:26:31 -08:00
Vedant Kumar	a496d0518f	Reland: [DWARF] Allow cross-CU references of subprogram definitions This allows a call site tag in CU A to reference a callee DIE in CU B without resorting to creating an incomplete duplicate DIE for the callee inside of CU A. We already allow cross-CU references of subprogram declarations, so it doesn't seem like definitions ought to be special. This improves entry value evaluation and tail call frame synthesis in the LTO setting. During LTO, it's common for cross-module inlining to produce a call in some CU A where the callee resides in a different CU, and there is no declaration subprogram for the callee anywhere. In this case llvm would (unnecessarily, I think) emit an empty DW_TAG_subprogram in order to fill in the call site tag. That empty 'definition' defeats entry value evaluation etc., because the debugger can't figure out what it means. As a follow-up, maybe we could add a DWARF verifier check that a DW_TAG_subprogram at least has a DW_AT_name attribute. Update: Reland with a fix to create a declaration DIE when the declaration is missing from the CU's retainedTypes list. The declaration is left out of the retainedTypes list in two cases: 1) Re-compiling pre-r266445 bitcode (in which declarations weren't added to the retainedTypes list), and 2) Doing LTO function importing (which doesn't update the retainedTypes list). It's possible to handle (1) and (2) by modifying the retainedTypes list (in AutoUpgrade, or in the LTO importing logic resp.), but I don't see an advantage to doing it this way, as it would cause more DWARF to be emitted compared to creating the declaration DIEs lazily. Tested with a stage2 ThinLTO+RelWithDebInfo build of clang, and with a ReleaseLTO-g build of the test suite. rdar://46577651, rdar://57855316, rdar://57840415 Differential Revision: https://reviews.llvm.org/D70350	2019-12-20 15:26:31 -08:00
Michael Trent	d317d94f80	llvm-objdump should ignore Mach-O stab symbols for disassembly. Summary: llvm-objdump will commonly error out when disassembling a Mach-O binary with stab symbols, or when printing a Mach-O symbol table that includesstab symbols. That is because the Mach-O N_OSO symbol has been modified to include the bottom 8-bit value of the Mach-O's cpusubtype value in the section field. In general, one cannot blindly assume a stab symbol's section field is valid unless one has actually consulted the specification for the specific stab. Since objdump mostly just walks the symbol table to get mnemonics for code disassembly it's best for objdump to just ignore stab symbols. llvm-nm will do a more complete and correct job of displaying Mach-O symbol table contents. Reviewers: pete, lhames, ab, thegameg, jhenderson, MaskRay Reviewed By: thegameg, MaskRay Subscribers: MaskRay, rupprecht, seiya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71394	2019-12-20 15:20:53 -08:00
Yury Delendik	6939b086bb	[WebAssembly] Use TargetIndex operands in DbgValue to track WebAssembly operands locations Extends DWARF expression language to express locals/globals locations. (via target-index operands atm) (possible variants are: non-virtual registers or address spaces) The WebAssemblyExplicitLocals can replace virtual registers to targertindex operand type at the time when WebAssembly backend introduces {get,set,tee}_local instead of corresponding virtual registers. Reviewed By: aprantl, dschuff Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D52634	2019-12-20 14:39:05 -08:00
Sam Clegg	28833b4485	Fix name of InitLibcalls() function in comment Differential Revision: https://reviews.llvm.org/D71781	2019-12-20 14:35:05 -08:00
Jakub Kuderski	300700e70f	[InstCombine] Improve infinite loop detection Summary: This patch limits the default number of iterations performed by InstCombine. It also exposes a new option that allows to specify how many iterations is considered getting stuck in an infinite loop. Based on experiments performed on real-world C++ programs, InstCombine seems to perform at most ~8-20 iterations, so treating 1000 iterations as an infinite loop seems like a safe choice. See D71145 for details. The two limits can be specified via command line options. Reviewers: spatel, lebedev.ri, nikic, xbolva00, grosser Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71673	2019-12-20 16:15:04 -05:00
Adrian Prantl	4dc53df55f	Rename DW_AT_LLVM_isysroot to DW_AT_LLVM_sysroot This is a purely cosmetic change that is NFC in terms of the binary output. I bugs me that I called the attribute DW_AT_LLVM_isysroot since the "i" is an artifact of GCC command line option syntax (-isysroot is in the category of -i options) and doesn't carry any useful information otherwise. This attribute only appears in Clang module debug info. Differential Revision: https://reviews.llvm.org/D71722	2019-12-20 13:11:17 -08:00
Bill Wendling	c9e75264ee	Add parentheses to silence warning	2019-12-20 12:52:36 -08:00
Petr Hosek	dcf1bd6c94	[llvm-symbolizer] Support reading options from environment llvm-symbolizer is used by sanitizers to symbolize errors discovered by sanitizer, but there's no way to pass options to llvm-symbolizer since the tool is invoked directly by the sanitizer runtime. Therefore, we don't have a way to pass options needed to find debug symbols such as -dsym-hint or -debug-file-directory. This change enables reading options from the LLVM_SYMBOLIZER_OPTS in addition to command line which can be used to pass those additional options to llvm-symbolizer invocations made by sanitizer runtime. Differential Revision: https://reviews.llvm.org/D71668	2019-12-20 12:47:27 -08:00
LLVM GN Syncbot	113b26aae4	[gn build] Port 82923c71efa	2019-12-20 20:35:49 +00:00
Philip Reames	26f411e5f6	More style cleanups following rG14fc20ca6282 [NFC] Demote member functions to static functions where possible Use early continue/early return to reduce nesting Clarify comments slightly. Reuse previously define expression in one case.	2019-12-20 12:19:33 -08:00
Philip Reames	2d3e7f7066	Fix a memory leak introduced w/the instruction padding support in rG14fc20ca6282 Should have caught this in review, but only noticed when addressing post commit style items. We were creating a new instance of the X86MCInstrInfo class, and then never reclaiming the memory. This wasn't even conditional on the new off by default flags, so it was an unconditional leak.	2019-12-20 12:04:07 -08:00
Philip Reames	1f8262a87a	Comment and adjust style in the newly introduced MCBoundaryAlignFragment infrastructure. More to follow.	2019-12-20 12:04:07 -08:00
Philip Reames	e0244827fe	Align branches within 32-Byte boundary (NOP padding) WARNING: If you're looking at this patch because you're looking for a full performace mitigation of the Intel JCC Erratum, this is not it! This is a preliminary patch on the patch towards mitigating the performance regressions caused by Intel's microcode update for Jump Conditional Code Erratum. For context, see: https://www.intel.com/content/www/us/en/support/articles/000055650.html The patch adds the required assembler infrastructure and command line options needed to exercise the logic for INTERNAL TESTING. These are NOT public flags, and should not be used for anything other than LLVM's own testing/debugging purposes. They are likely to change both in spelling and meaning. WARNING: This patch is knowingly incorrect in some cornercases. We need, and do not yet provide, a mechanism to selective enable/disable the padding. Conversation on this will continue in parellel with work on extending this infrastructure to support prefix padding. The goal here is to have the assembler align specific instructions such that they neither cross or end at a 32 byte boundary. The impacted instructions are: a. Conditional jump. b. Fused conditional jump. c. Unconditional jump. d. Indirect jump. e. Ret. f. Call. The new options for llvm-mc are: -x86-align-branch-boundary=NUM aligns branches within NUM byte boundary. -x86-align-branch=TYPE[+TYPE...] specifies types of branches to align. A new MCFragment type, MCBoundaryAlignFragment, is added, which may emit NOP to align the fused/unfused branch. alignBranchesBegin inserts MCBoundaryAlignFragment before instructions, alignBranchesEnd marks the end of the branch to be aligned, relaxBoundaryAlign grows or shrinks sizes of NOP to align the target branch. Nop padding is disabled when the instruction may be rewritten by the linker, such as TLS Call. Process Note: I am landing a patch by skan as it has been LGTMed, and continuing to iterate on the review is simply slowing us down at this point. We can and will continue to iterate in tree. Patch By: skan Differential Revision: https://reviews.llvm.org/D70157	2019-12-20 11:35:50 -08:00
Fangrui Song	19fb620829	[PPC32] Emit R_PPC_PLTREL24 for calls to dso_local ifunc static void *ifunc(void) __attribute__((ifunc("resolver"))); void foo() { ifunc(); } The relocation produced by the ifunc() call: 1. gcc -msecure-plt -fPIC => R_PPC_PLTREL24 r_addend=0x8000 2. gcc -msecure-plt -PIE => R_PPC_PLTREL24 r_addend=0x8000 3. clang -msecure-plt -fPIC => R_PPC_PLTREL24 r_addend=0x8000 4. clang -msecure-plt -fPIE => R_PPC_REL24 4 is incorrect. The R_PPC_REL24 needs a call stub due to ifunc. If this relocation is mixed with other R_PPC_PLTREL24(r_addend=0x8000) in a function, both GNU ld and lld (after D71621 fix) may produce a wrong result. This patch fixes 4 to use R_PPC_PLTREL24, which matches GCC. Both GNU ld and lld (after D71621) will be happy. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D71649	2019-12-20 11:32:02 -08:00
Craig Topper	54c65a9a69	[X86] Fix a KNL miscompile caused by combineSetCC swapping LHS/RHS variables before a later use. The setcc operands are copied into LHS and RHS variables at the top of the function. We also capture the condition code. A later piece of code swaps the operands and changing the CC variable as part of a canonicalization to make some other checks simpler. But we might not make the transform we canonicalized for. So we continue on through the function where we can use the swapped LHS/RHS variables and access the original condition code operand instead of the modified CC variable. This leads to a setcc being created with the original condition code, but with swapped operands. To mitigate this, this patch does a couple things. The LHS/RHS/CC variables are made const to keep them from being modified like this again. The transform that needs the swap now uses temporary copies of the variables. And the transform that used the original condition code operand has been altered to use the CC variable we cached originally. Either of these changes are enough to fix the issue, but doing both to make this code very safe. I also considered rewriting the swap code in some way to check both permutations without explicitly swapping or needing temporary variables, but held off on that. Differential Revision: https://reviews.llvm.org/D71736	2019-12-20 11:24:45 -08:00
Danilo Carvalho Grael	ab60031c47	[AArch64][SVE] Replace integer immediate intrinsics with splat vector variant Summary: Replace the integer immediate intrisics with splat vector variants so they can be applied as optimizations for the C/C++ intrinsics. Reviewers: sdesmalen, huntergr, rengolin, efriedma, c-rhodes, mgudim, kmclaughlin Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits, amehsan Tags: #llvm Differential Revision: https://reviews.llvm.org/D71614	2019-12-20 13:52:19 -05:00
Jonas Paulsson	fa76304be1	[SystemZ] Add a mapping from "select register" to "load on condition" (2-addr). The SELR(Mux) instructions can be converted to two-address form as LOCR(Mux) instructions whenever one of the sources are the same reg as dest. By adding this mapping in getTwoOperandOpcode(), we get: - Two-address hints in getRegAllocationHints() for select register instructions. - No need anymore for special handling in SystemZShortenInst.cpp - shortenSelect() removed. The two-address hints are now added before the GRX32 hints, which should be preferred. Review: Ulrich Weigand https://reviews.llvm.org/D68870	2019-12-20 10:44:58 -08:00
Evgenii Stepanov	809f567590	llvm-symbolizer: support DW_FORM_loclistx locations. Summary: With -gdwarf-5 local variable locations are emitted as DW_FORM_loclistx form instead of the regular DW_FORM_sec_offset. Teach DWARFDie::getLocations to understand the new format and use it in llvm-symbolizer "FRAME" command. Reviewers: pcc, jdoerfert Subscribers: srhines, aprantl, hiraditya, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70756	2019-12-20 10:36:14 -08:00
Jordan Rupprecht	a9ab626582	Temporarily revert "Reapply [LVI] Normalize pointer behavior" and "[LVI] Restructure caching" This reverts commits 7e18aeba5062cd4324a9efb7bc25c9dbc4a34c2c (D70376) 21fbd5587cdfa11dabb3aeb0ead2d3d5fd0b490d (D69914) due to increased memory usage.	2019-12-20 10:25:57 -08:00
Jonas Paulsson	61f35cd2aa	[SystemZ] Bugfix and improve the handling of CC values. It was recently discovered that the handling of CC values was actually broken since overflow was not properly handled ('nsw' flag not checked for). Add and sub instructions now have a new target specific instruction flag named SystemZII::CCIfNoSignedWrap. It means that the CC result can be used instead of a compare with 0, but only if the instruction has the 'nsw' flag set. This patch also adds the improvements of conversion to logical instructions and the analyzing of add with immediates, to be able to eliminate more compares. Review: Ulrich Weigand https://reviews.llvm.org/D66868	2019-12-20 10:20:23 -08:00
Victor Campos	bbb6f6b671	Revert "[ARM] Improve codegen of volatile load/store of i64" This reverts commit bbcf1c3496ce2bd1ed87e8fb15ad896e279633ce.	2019-12-20 18:08:24 +00:00

1 2 3 4 5 ...

189196 Commits