llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Florian Hahn	1eeb9c02e5	[LV] Allow assume calls in predicated blocks. The assume intrinsic is intentionally marked as may reading/writing memory, to avoid passes moving them around. When flattening the CFG for predicated blocks, we have to drop the assume calls, as they are control-flow dependent. There are some cases where we can do better (when control flow is preserved), but that is follow-up work. Fixes PR43620. Reviewers: hsaito, rengolin, dcaballe, Ayal Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D68814	2020-01-16 10:11:35 +00:00
Florian Hahn	6c45b4f8b1	[LV] Make X86/assume.ll X86 independent (NFC). The test does not check anything X86 specific. This is a preparation for the D68814.	2020-01-16 10:01:35 +00:00
LLVM GN Syncbot	24c5598549	[gn build] Port ed181efa175	2020-01-16 09:55:55 +00:00
Sameer Sahasrabuddhe	6a0b5d46f8	[HIP][AMDGPU] expand printf when compiling HIP to AMDGPU Summary: This change implements the expansion in two parts: - Add a utility function emitAMDGPUPrintfCall() in LLVM. - Invoke the above function from Clang CodeGen, when processing a HIP program for the AMDGPU target. The printf expansion has undefined behaviour if the format string is not a compile-time constant. As a sufficient condition, the HIP ToolChain now emits -Werror=format-nonliteral. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D71365	2020-01-16 15:15:38 +05:30
Kazushi (Jam) Marukawa	a4acca0462	[VE] i64 arguments, return values and constants Summary: Support for i64 arguments (in register), return values and constants along with tests. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D72776	2020-01-16 10:09:50 +01:00
Igor Kudrin	f87be52260	[DebugInfo] Simplify the constructor of DWARFDebugAranges::Range. NFC. This removes the default values of the arguments. The only caller, DWARFDebugAranges::construct(), provides all three parameters. Differential Revision: https://reviews.llvm.org/D72757	2020-01-16 13:08:30 +07:00
Craig Topper	da61486b44	[LegalizeDAG][TargetLowering] Move vXi64/i64->vXf32/f32 uint_to_fp legalizing code from TargetLowering::expandUINT_TO_FP back to LegalizeDAG. This was moved in October 2018, but we don't appear to be using this for vectors on any in tree target. Moving it back simplifies D72794 so we can share the code for i32->f32.	2020-01-15 22:04:50 -08:00
LLVM GN Syncbot	e4adbf252b	[gn build] Port 8fdafb7dced	2020-01-16 04:13:31 +00:00
Liu, Chen3	9217c910b5	Insert wait instruction after X87 instructions which could raise float-point exception. This patch also modify some mayRaiseFPException flag which set in D68854. Differential Revision: https://reviews.llvm.org/D72750	2020-01-16 12:12:51 +08:00
Matt Arsenault	61c8d1a930	Set some fast math attributes in setFunctionAttributes This will provide a more consistent view to codegen for these attributes. The current system is somewhat awkward, and the fields in TargetOptions are reset based on the command line flag if the attribute isn't set. By forcing these attributes with the flag, there can never be an inconsistency in the behavior if code directly inspects the attribute on the function without considering the command line flags.	2020-01-15 22:23:18 -05:00
Wei Mi	866992fbb3	[SampleFDO] Fix invalid branch profile generated by indirect call promotion. Suppose an inline instance has hot total sample count but 0 entry count, and it is an indirect call target. If the indirect call has no other call target and inline instance associated with it and it is promoted, currently the conditional branch generated by indirect call promotion will have invalid branch profile which is !{!"branch_weights", i32 0, i32 0} -- because the entry count of the promoted target is 0 and the total entry count of all targets is also 0. This caused a SEGV in Control Height Reduction and may cause problem in other passes. Function entry count of an inline instance is computed by a heuristic -- using either the sample of the starting line or starting inner inline instance. The patch changes the heuristic a little bit so that when total sample count is larger than 0, the computed entry count will be at least 1. Then the new branch profile will be !{!"branch_weights", i32 1, i32 0}. Differential Revision: https://reviews.llvm.org/D72790	2020-01-15 18:36:06 -08:00
Craig Topper	1990a5b605	[X86] When handling i64->f32 sint_to_fp on 32-bit targets only bitcast to f64 if sse2 is enabled. The code is trying to copy the i64 value to an xmm register to use a 64-bit store so that the 64-bit fild can benefit from store forwarding. But this trick only works if f64 is going to be stored in an XMM register. If we only have SSE1 then only float is in xmm register. So this trick just causes 2 stores i32 stores, an f64 load into the x87, an f64 from x87, and a 64-bit fild. So we end up with an extra stack temporary and still didn't get store forwarding. We might be able to use v2f32 here instead, but I didn't check. I just wanted the code to make sense. Found by inspection as I continue to stare too hard at our int_to_fp conversions.	2020-01-15 18:26:28 -08:00
Craig Topper	d2622ad06a	[X86] Add 32-bit mode sse1 command line to scalar-int-to-fp.ll. NFC	2020-01-15 18:26:27 -08:00
Matt Arsenault	b1cd77c7fd	llc: Don't overwrite frame-pointer attribute Continue making command line flags with matching attribute behavior consistent.	2020-01-15 20:56:46 -05:00
Yuanfang Chen	b1c09bbef0	Revert "[Support] make report_fatal_error `abort` instead of `exit`" This reverts commit 647c3f4e47de8a850ffcaa897db68702d8d2459a. Got bots failure from sanitizer-windows and maybe others.	2020-01-15 17:52:25 -08:00
Yuanfang Chen	725cd0da61	[Support] make report_fatal_error `abort` instead of `exit` Summary: This patch could be treated as a rebase of D33960. It also fixes PR35547. A fix for `llvm/test/Other/close-stderr.ll` is proposed in D68164. Seems the consensus is that the test is passing by chance and I'm not sure how important it is for us. So it is removed like in D33960 for now. The rest of the test fixes are just adding `--crash` flag to `not` tool. ** The reason it fixes PR35547 is `exit` does cleanup including calling class destructor whereas `abort` does not do any cleanup. In multithreading environment such as ThinLTO or JIT, threads may share states which mostly are ManagedStatic<>. If faulting thread tearing down a class when another thread is using it, there are chances of memory corruption. This is bad 1. It will stop error reporting like pretty stack printer; 2. The memory corruption is distracting and nondeterministic in terms of error message, and corruption type (depending one the timing, it could be double free, heap free after use, etc.). Reviewers: rnk, chandlerc, zturner, sepavloff, MaskRay, espindola Reviewed By: rnk, MaskRay Subscribers: wuzish, jholewinski, qcolombet, dschuff, jyknight, emaste, sdardis, nemanjai, jvesely, nhaehnle, sbc100, arichardson, jgravelle-google, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, jrtc27, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, jsji, lenary, s.egerton, pzheng, cfe-commits, MaskRay, filcab, davide, MatzeB, mehdi_amini, hiraditya, steven_wu, dexonsmith, rupprecht, seiya, llvm-commits Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D67847	2020-01-15 17:05:13 -08:00
Matt Arsenault	a391b3aed6	llc: Change behavior of -mattr with existing attribute Append this to the existing target-features attribute on the function. Some flags ignore existing attributes, and some overwrite them. Move towards consistently respecting existing attributes if present. Since target features act as a state machine on their own, append to the function attribute. The backend default added feature list, function attributes, and -mattr will all be appended together, and the later features can individually toggle the earlier settings.	2020-01-15 19:46:01 -05:00
Stanislav Mekhanoshin	33946d17b3	Process BUNDLE in tail duplication When tail duplication estimates a size of tail it uses instruction count. Account for a number of instrictions in a bundle too. Differential Revision: https://reviews.llvm.org/D72783	2020-01-15 15:46:57 -08:00
Vedant Kumar	e7370b6a19	[CodeExtractor] Transfer debug info to extracted function After extracting, fix up debug info in both the old and new functions by 1) Pointing line locations and debug intrinsics to the new subprogram scope, and 2) Deleting intrinsics which point to values outside of the new function. Depends on https://reviews.llvm.org/D72795. Testing: check-llvm, check-clang, a build of LNT in the `-Os -g` config with "-mllvm -hot-cold-split=1" set, and end-to-end debugging of a toy program which undergoes splitting to verify that lldb can find variables, single step, etc. in extracted code. rdar://45507940 Differential Revision: https://reviews.llvm.org/D72801	2020-01-15 15:38:36 -08:00
Matt Arsenault	6b6ac6c3eb	AMDGPU/GlobalISel: Select exp with patterns This does produce slightly different code. Now a unique IMPLICIT_DEF is emitted for each of the implicit_def operands, rather than reusing the same one.	2020-01-15 18:33:15 -05:00
Matt Arsenault	5127b285b9	AMDGPU: Remove custom node for exports I'm mildly worried about potentially reordering exp/exp_done with IntrWriteMem on the intrinsic. Requires hacking out the illegal type on SI, so manually select that case during lowering.	2020-01-15 18:33:15 -05:00
Matt Arsenault	2f408e3893	GlobalISel: Handle more cases of G_SEXT narrowing This now develops the same problem G_ZEXT/G_ANYEXT have where the requested type is assumed to be the source type. This will be fixed separately by creating intermediate merges.	2020-01-15 18:33:15 -05:00
Brian Gesiak	17e5fc71fe	[IR] Module's NamedMD table needn't be 'void ' Summary: In July 21 2010 `llvm::NamedMDNode` was refactored such that it would no longer subclass `llvm::Value`: https://github.com/llvm/llvm-project/commit/2637cc1a38d7336ea30caf As part of this change, a map type from metadata names to their named metadata, `llvm::MDSymbolTable`, was deleted. In its place, the type of member `llvm::Module::NamedMDSymTab` was changed, from `llvm::MDSymbolTable` to `void `. The underlying memory allocations for this pointer were changed to `new StringMap<NamedMDNode >()`. However, as far as I can tell, there's no need for obscuring the underlying type being pointed to by the `void `, and no need for static casts from `void ` to `StringMap`. In fact, I don't think there's a need for explicit calls to `new` and `delete` at all. This commit changes `NamedMDSymTab` from a pointer to a reference, which automatically couples its lifetime with the lifetime of its owning `llvm::Module` instance, thus removing the explicit calls to `new` and `delete` in the `llvm::Module` constructor and destructor. It also changes the type from `void ` to a newly defined `NamedMDSymTabType`, and removes the static casts. Test Plan: An ASAN-enabled build and run of `check-all` succeeds with this change (aside from some tests that always fail for me in ASAN for some reason, such as `check-clang` `SemaTemplate/stack-exhaustion.cpp`). Reviewers: aprantl, dblaikie, chandlerc, pcc, echristo Reviewed By: dblaikie Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72812	2020-01-15 18:27:25 -05:00
Vedant Kumar	78c74c90d1	[test] Move call-site-entry-linking.test into test/tools/dsymutil/X86 This should fix a failure on the clang-cmake-armv7-quick bot.	2020-01-15 14:19:55 -08:00
Vedant Kumar	17ac0304c7	DWARF: Simplify the way the return PC is attached to call site tags, NFC This cleanup was suggested by Djordje in D72489.	2020-01-15 14:16:21 -08:00
Fedor Sergeev	83c723ebe8	[BasicBlock] add helper getPostdominatingDeoptimizeCall It appears to be rather useful when analyzing Loops with multiple deoptimizing exits, perhaps merged ones. For now it is used in LoopPredication, will be adding more uses in other loop passes. Reviewers: asbirlea, fhahn, skatkov, spatel, reames Reviewed By: reames Tags: #llvm Differential Revision: https://reviews.llvm.org/D72754	2020-01-16 01:15:57 +03:00
Jinsong Ji	1d416d3cac	[MachineScheduler][NFC] Don't swap when we can't cluster https://reviews.llvm.org/D72706 tried to reduce reordering due to mem op clustering. This patch avoid doing the swap when we can't cluster. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D72800	2020-01-15 21:55:31 +00:00
Mircea Trofin	53ae48b077	[NFC] Refactor InlineResult for readability Summary: InlineResult is used both in APIs assessing whether a call site is inlinable (e.g. llvm::isInlineViable) as well as in the function inlining utility (llvm::InlineFunction). It means slightly different things (can/should inlining happen, vs did it happen), and the implicit casting may introduce ambiguity (casting from 'false' in InlineFunction will default a message about hight costs, which is incorrect here). The change renames the type to a more generic name, and disables implicit constructors. Reviewers: eraman, davidxl Reviewed By: davidxl Subscribers: kerbowa, arsenm, jvesely, nhaehnle, eraman, hiraditya, haicheng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72744	2020-01-15 13:34:20 -08:00
Zhongduo Lin	d02332a857	[NFC][IndVarSimplify] remove duplicate code in widenWithVariantLoadUseCodegen. Summary: Duplicate code in widenWithVariantLoadUseCodegen is removed and also use assert to check unknown extension type as it should be filtered out by the pre condition check before calling this function. Reviewers: az, sanjoy, sebpop, efriedma, javed.absar, sanjoy.google Reviewed By: efriedma Subscribers: hiraditya, llvm-commits, amehsan Tags: #llvm Differential Revision: https://reviews.llvm.org/D72652	2020-01-15 16:27:58 -05:00
Vedant Kumar	60cebe7843	DebugInfo: Factor out logic to update locations in MD_loop metadata, NFC Factor out the logic needed to update debug locations contained within MD_loop metadata. This refactor is preparation for a future change that also needs to rewrite MD_loop metadata. rdar://45507940	2020-01-15 13:02:36 -08:00
Vedant Kumar	025d4791c0	[DWARF] Emit DW_AT_call_return_pc as an address This reverts D53469, which changed llvm's DWARF emission to emit DW_AT_call_return_pc as a function-local offset. Such an encoding is not compatible with post-link block re-ordering tools and isn't standards- compliant. In addition to reverting back to the original DW_AT_call_return_pc encoding, teach lldb how to fix up DW_AT_call_return_pc when the address comes from an object file pointed-to by a debug map. While doing this I noticed that lldb's support for tail calls that cross a DSO/object file boundary wasn't covered, so I added tests for that. This latter case exercises the newly added return PC fixup. The dsymutil changes in this patch were originally included in D49887: the associated test should be sufficient to test DW_AT_call_return_pc encoding purely on the llvm side. Differential Revision: https://reviews.llvm.org/D72489	2020-01-15 13:02:23 -08:00
Lang Hames	284b455103	[docs][ORC] Update the laziness section of the ORCv2 design doc. This updates the discussion of lazy reexports, fixes a TBD for a usage example, and adds a reference to the fully worked lazy reexports example that was added in e9e26c01cd865da678b1af6ba5f417c713956a66.	2020-01-15 11:32:20 -08:00
Nico Weber	31a3e4d169	[gn build] re-run "gn format" with trunk gn	2020-01-15 13:37:38 -05:00
Nico Weber	d9e1114f21	[gn build] add multi-line forcing comments in more places	2020-01-15 13:36:42 -05:00
Nico Weber	800e88dfdc	[gn build] make "gn format" comment slightly more concise	2020-01-15 13:36:42 -05:00
Craig Topper	a25a7f9567	[Mips] Add FileCheck to a test that just tested for a crash. I believe the generated code here can suffer from double rounding. So I wanted to capture the existing codegen so we can make decisions about how to fix it.	2020-01-15 10:29:56 -08:00
Lang Hames	afe0b073a6	[ORC] Set setCloneToNewContextOnEmit on LLJIT's transform layer when needed. Based on Don Hinton's patch in https://reviews.llvm.org/D72406. This feature was accidentally left out of e9e26c01cd865da678b1af6ba5f417c713956a66, and would have pessimized concurrent compilation in the default case. Thanks for spotting this Don!	2020-01-15 10:22:57 -08:00
Amara Emerson	b3af3376ee	Revert "Revert rG6078f2fedcac5797ac39ee5ef3fd7a35ef1202d5 - "[AArch64][GlobalISel]: Support @llvm.{return,frame}address selection."" The original change wasn't constraining the operand regclasses which broke EXPENSIVE_CHECKS.	2020-01-15 10:13:11 -08:00
Nico Weber	fbe5badf52	[gn build] Reformat all build files Ran `git ls-files '.gn' '.gni' \| xargs llvm/utils/gn/gn.py format`. The motivation is to reformat them with trunk gn again right after. Trunk gn changed formatting of some single-element lists.	2020-01-15 12:59:45 -05:00
Mark Murray	724b6dac82	[ARM][MVE][Intrinsics] Add VMINAQ, VMINNMAQ, VMAXAQ, VMAXNMAQ intrinsics. Summary: Add VMINAQ, VMINNMAQ, VMAXAQ, VMAXNMAQ intrinsics and unit tests. Reviewers: simon_tatham, miyuki, dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72761	2020-01-15 17:20:15 +00:00
Teresa Johnson	947a98e6a0	Fix bot by adjusting wildcard matching I noticed one bot failure due to 24a00ef2404104e9ca6fbd7eb523a8a340be9d99 because the wildcard matching was not working as intended, fixed it to act similar to other checks of CGSCCToFunctionPassAdaptor.	2020-01-15 08:37:15 -08:00
evgeny	eb9d6d122a	[ThinLTO] Always import constants This patch imports constant variables even when they can't be internalized (which results in promotion). This offers some extra constant folding opportunities. Differential revision: https://reviews.llvm.org/D70404	2020-01-15 19:29:01 +03:00
Arkady Shlykov	c37dacad6a	[Loop Peeling] Add possibility to enable peeling on loop nests. Summary: Current peeling implementation bails out in case of loop nests. The patch introduces a field in TargetTransformInfo structure that certain targets can use to relax the constraints if it's profitable (disabled by default). Also additional option is added to enable peeling manually for experimenting and testing purposes. Reviewers: fhahn, lebedev.ri, xbolva00 Reviewed By: xbolva00 Subscribers: xbolva00, hiraditya, zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D70304	2020-01-15 08:25:21 -08:00
Sanjay Patel	fd9e6d0724	[InstCombine] reassociate fsub+fsub into fsub+fadd As discussed in the motivating PR44509: https://bugs.llvm.org/show_bug.cgi?id=44509 ...we can end up with worse code using fast-math than without. This is because the reassociate pass greedily transforms fsub into fneg/fadd and apparently (based on the regression tests seen here) expects instcombine to clean that up if it wasn't profitable. But we were missing this fold: (X - Y) - Z --> X - (Y + Z) There's another, more specific case that I think we should handle as shown in the "fake" fneg test (but missed with a real fneg), but that's another patch. That may be tricky to get right without conflicting with existing transforms for fneg. Differential Revision: https://reviews.llvm.org/D72521	2020-01-15 11:14:13 -05:00
Lang Hames	7f70888b46	[ORC] Simplify use of lazyReexports with LLJIT. This patch makes the target triple available via the LLJIT interface, and moves the IRTransformLayer from LLLazyJIT down into LLJIT. Together these changes make it easier to use the lazyReexports utility with LLJIT, and to apply IR transforms to code as it is compiled in LLJIT (rather than requiring transforms to be applied manually before code is added). An code example is added in llvm/examples/LLJITExamples/LLJITWithLazyReexports	2020-01-15 08:02:53 -08:00
Lang Hames	bbe6d35d70	[ORC] Update lazyReexports to support aliases with different symbol names. A bug in the existing implementation meant that lazyReexports would not work if the aliased name differed from the alias's name, i.e. all lazy reexports had to be of the form (lib1, name) -> (lib2, name). This patch fixes the issue by capturing the alias's name in the NotifyResolved callback. To simplify this capture, and the LazyCallThroughManager code in general, the NotifyResolved callback is updated to use llvm::unique_function rather than a custom class. No test case yet: This can only be tested at runtime, and the only in-tree client (lli) always uses aliases with matching names. I will add a new LLJIT example shortly that will directly test the lazyReexports API and the non-trivial alias use case.	2020-01-15 08:02:53 -08:00
Hubert Tong	04d6bb1ef0	DWARFDebugLine.cpp: Format unknown line number standard opcodes Summary: This patch implements `formatv()` formatting for `dwarf::LineNumberOps` and makes use of it for the `llvm-dwarfdump --debug-line` dump. Previously, unknown line number standard opcodes would lead to undefined behaviour. The code would attempt to format the data pointer of an empty `StringRef` (a null pointer) using `%s`. According to the description for `format()`, use of that interface carries the "risk of `printf`". Passing a null pointer in place of an array to a C library function results in undefined behaviour. Reviewers: jhenderson, daltenty, stevewan Reviewed By: jhenderson Subscribers: aprantl, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72369	2020-01-15 10:45:50 -05:00
Hubert Tong	e4d966f35b	[CMake] Enable -qfuncsect when building with IBM XL Summary: The IBM XL compiler uses `-qfuncsect` for `-ffunction-sections`. The comment about sanitizers and `-f[no-]function-sections` is corrected also, as it is pertinent to this patch. The sanitizer-related use of `-fno-function-sections` is associated with powerpc64le, a target for which there is an IBM XL compiler, so that use is updated in this patch to apply `-qnofuncsect` in case a build using the XL compiler is viable on that platform. This patch has been verified with the XL compiler on AIX only. Reviewers: daltenty, stevewan Reviewed By: daltenty Subscribers: mgorny, steven.zhang, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D72335	2020-01-15 10:45:50 -05:00
Georgii Rymar	24781a9350	[llvm-readobj][test] - Cleanup SHT_RELR sections testing. After recent changes (D71872) in yaml2obj, it is possible so cleanup testing of the SHT_RELR sections. Differential revision: https://reviews.llvm.org/D71874	2020-01-15 18:40:01 +03:00
Teresa Johnson	a9daf65b1c	Restore "[ThinLTO] Add additional ThinLTO pipeline testing with new PM" This restores 2af97be8027a0823b88d4b6a07fc5eedb440bc1f (reverted at 6288f86e870c7bb7fe47cc138320b9eb34c93941), with all the fixes I had applied at the time, along with a new fix for non-determinism in the ordering of a couple of passes due to being accessed as parameters on the same call. I've also added --dump-input=fail to the new tests so I can more thoroughly fix any additional failures.	2020-01-15 07:33:08 -08:00

... 4 5 6 7 8 ...

190418 Commits