llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Mehdi Amini	c143a7b981	Run GlobalOpt before emitting the bitcode for ThinLTO This is motivated by reducing the size of the IR and thus reduce compile time. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267385	2016-04-25 08:47:49 +00:00
Mehdi Amini	6153f5124e	ThinLTO: Move createNameAnonFunctionPass insertion in PassManagerBuilder (NFC) It is just code motion, but makes more sense this way. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267384	2016-04-25 08:47:37 +00:00
Igor Breger	367c01cee7	fix comments related to Differential Revision: http://reviews.llvm.org/D17913 llvm-svn: 267383	2016-04-25 08:30:28 +00:00
Michael Zuckerman	4cc752e542	Fixing wrong mask size error. From __mmask8 to __mmask16. Was reviewed over the shoulder by AsafBadouh. Connected to review http://reviews.llvm.org/D19195. llvm-svn: 267379	2016-04-25 05:27:51 +00:00
Davide Italiano	03f6669dd8	[Support/ELFRelocs] Add R_386_GOT32X. The new relocation recently defined in the Intel386 psABI was still missing from this file. A subsequent commit will add support for GOT32X in MC, together with a test. llvm-svn: 267378	2016-04-25 04:38:08 +00:00
Craig Topper	8e26afd797	[X86] Replace a SmallVector used to pass 2 values to an ArrayRef parameter with a fixed size array. NFC llvm-svn: 267377	2016-04-25 04:30:29 +00:00
Junmo Park	df2e2c23ce	Minor code cleanups. NFC. llvm-svn: 267375	2016-04-25 01:40:54 +00:00
Craig Topper	65fd322915	[X86] Add a complete set of tests for all operand sizes of cttz/ctlz with and without zero undef being lowered to bsf/bsr. llvm-svn: 267373	2016-04-25 01:01:15 +00:00
Adrian Prantl	0510e5ff3d	Verifier: Verify that each inlinable callsite of a debug-info-bearing function in a debug-info-bearing function has a debug location attached to it. Failure to do so causes an "!dbg attachment points at wrong subprogram for function" assertion failure when the inliner sets up inline scope info. rdar://problem/25878916 This reaplies r267320 without changes after fixing an issue in the OpenMP IR generator in clang. llvm-svn: 267370	2016-04-24 22:23:13 +00:00
Rafael Espindola	feeb4744f9	Also check the IR. llvm-svn: 267367	2016-04-24 21:42:56 +00:00
Rafael Espindola	5305051507	Add a test for how we handle protected visibility. llvm-svn: 267366	2016-04-24 21:30:18 +00:00
Simon Pilgrim	386d31614a	[X86][AVX] Added PR24935 test case llvm-svn: 267362	2016-04-24 20:30:48 +00:00
Saleem Abdulrasool	13a27b5a96	ARM: fix __chkstk Frame Setup on WoA This corrects the MI annotations for the stack adjustment following the __chkstk invocation. We were marking the original SP usage as a Def rather than Kill. The (new) assigned value is the definition, the original reference is killed. Adjust the ISelLowering to mark Kills and FrameSetup as well. This partially resolves PR27480. llvm-svn: 267361	2016-04-24 20:12:48 +00:00
Simon Pilgrim	498be3d330	Tweak comments to make it clear that these combines are for SSE scalar instructions. llvm-svn: 267360	2016-04-24 19:31:56 +00:00
Simon Pilgrim	4943bd650f	[InstCombine][SSE] Reduce DIVSS/DIVSD to FDIV if only first element is required As discussed on D19318, if we only demand the first element of a DIVSS/DIVSD intrinsic, then reduce to a FDIV call. This matches the existing FADD/FSUB/FMUL patterns. llvm-svn: 267359	2016-04-24 18:35:59 +00:00
Simon Pilgrim	1b73eef3b4	[InstCombine][SSE] Demanded vector elements for scalar intrinsics (Part 2 of 2) Split from D17490. This patch improves support for determining the demanded vector elements through SSE scalar intrinsics: 1 - demanded vector element support for unary and some extra binary scalar intrinsics (RCP/RSQRT/SQRT/FRCZ and ADD/CMP/DIV/ROUND). 2 - addss/addsd get simplified to a fadd call if we aren't interested in the pass through elements 3 - if we don't need the lowest element of a scalar operation then just use the first argument (the pass through elements) directly We can add support for propagating demanded elements through any equivalent packed SSE intrinsics in a future patch (these wouldn't use the pass through patterns). Differential Revision: http://reviews.llvm.org/D19318 llvm-svn: 267357	2016-04-24 18:23:14 +00:00
Simon Pilgrim	cd433f92ba	[InstCombine][SSE] Demanded vector elements for scalar intrinsics (Part 1 of 2) This patch improves support for determining the demanded vector elements through SSE scalar intrinsics: 1 - recognise that we only need the lowest element of the second input for binary scalar operations (and all the elements of the first input) 2 - recognise that the roundss/roundsd intrinsics use the lowest element of the second input and the remaining elements from the first input Differential Revision: http://reviews.llvm.org/D17490 llvm-svn: 267356	2016-04-24 18:12:42 +00:00
Simon Pilgrim	ef7f370991	[InstCombine] Avoid updating argument demanded elements in separate passes. As discussed on D17490, we should attempt to update an intrinsic's arguments demanded elements in one pass if we can. llvm-svn: 267355	2016-04-24 17:57:27 +00:00
Nick Lewycky	e703258f62	Fix typo in comment. NFC llvm-svn: 267354	2016-04-24 17:55:57 +00:00
Nick Lewycky	de389b61df	Remove emacs mode markers from .cpp files. NFC .cpp files are unambiguously C++, you only need the mode markers on .h files. llvm-svn: 267353	2016-04-24 17:55:41 +00:00
Simon Pilgrim	5db46debc1	[X86][InstCombine] Tidyup VPERMILVAR -> shufflevector conversion to helper function. NFCI. llvm-svn: 267352	2016-04-24 17:23:46 +00:00
Simon Pilgrim	398bcf079b	[X86][InstCombine] Tidyup PSHUFB -> shufflevector conversion to helper function. NFCI. llvm-svn: 267351	2016-04-24 17:00:34 +00:00
Simon Pilgrim	713ecd864c	[X86][SSE] getTargetShuffleMaskIndices - dropped (unused) UNDEF handling We aren't currently making use of this in any successful mask decode and its actually incorrect as it inserts the wrong number of SM_SentinelUndef mask elements. llvm-svn: 267350	2016-04-24 16:49:53 +00:00
Simon Pilgrim	81b1ce90a3	[X86][SSE] Use range loop. NFCI. llvm-svn: 267349	2016-04-24 16:33:35 +00:00
Craig Topper	043de2291d	[Lanai] Use EVT::getEVTString() to print a type as a string instead of an enum encoding value. llvm-svn: 267348	2016-04-24 16:30:51 +00:00
Simon Pilgrim	30bc5e06bf	[X86][SSE] Added SSSE3/AVX/AVX2 BITREVERSE tests Codegen is pretty bad at the moment but could use PSHUFB quite efficiently llvm-svn: 267347	2016-04-24 15:45:06 +00:00
Simon Pilgrim	9549c9fa45	[X86][XOP] Fixed VPPERM permute op decoding (PR27472). Fixed issue with VPPERM target shuffle mask decoding that was incorrectly masking off the 3-bit permute op with a 2-bit mask. llvm-svn: 267346	2016-04-24 15:05:04 +00:00
Duncan P. N. Exon Smith	e1c776018e	BitcodeReader: Delay metadata parsing until reading a function body There's hardly any functionality change here. Instead of calling materializeMetadata on the first call to materialize(GlobalValue*), wait until the first one that's actually going to do something. Noticed by inspection; I don't have a concrete case where this makes a difference. Added an assertion in materializeMetadata to be sure this (or a future change) doesn't delay materializeMetadata after function-level metadata. llvm-svn: 267345	2016-04-24 15:04:28 +00:00
Teresa Johnson	d10e6055f6	[ThinLTO] Remove GlobalValueInfo class from index Summary: Remove the GlobalValueInfo and change the ModuleSummaryIndex to directly reference summary objects. The info structure was there to support lazy parsing of the combined index summary objects, which is no longer needed and not supported. Reviewers: joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19462 llvm-svn: 267344	2016-04-24 14:57:11 +00:00
Simon Pilgrim	1211a431ae	[X86][SSE] Improved support for decoding target shuffle masks through bitcasts Reused the ability to split constants of a type wider than the shuffle mask to work with masks generated from scalar constants transfered to xmm. This fixes an issue preventing PSHUFB target shuffle masks decoding rematerialized scalar constants and also exposes the XOP VPPERM bug described in PR27472. llvm-svn: 267343	2016-04-24 14:53:54 +00:00
Duncan P. N. Exon Smith	9a22e0f76e	ModuleSummaryIndex: Avoid enum bitfields for MSVC portability Enum bitfields have crazy portability issues with MSVC. Use unsigned instead of LinkageTypes here in the ModuleSummaryIndex to address Takumi's concerns from r267335. llvm-svn: 267342	2016-04-24 14:25:37 +00:00
Duncan P. N. Exon Smith	14807bb76b	Revert "Declare GlobalValue::LinkageTypes based on unsigned." This reverts commit r267335. The build has been broken for hours because of it: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_build/23352/ The correct fix is avoid using any enum in a bitfield. llvm-svn: 267341	2016-04-24 14:13:17 +00:00
Marcin Koscielnicki	6b999dbbc0	[SystemZ] [SSP] Add support for LOAD_STACK_GUARD. This fixes PR22248 on s390x. The previous attempt at this was D19101, which was before LOAD_STACK_GUARD existed. Compared to the previous version, this always emits a rather ugly block of 4 instructions, involving a thread pointer load that can't be shared with other potential users. However, this is necessary for SSP - spilling the guard value (or thread pointer used to load it) is counter to the goal, since it could be overwritten along with the frame it protects. Differential Revision: http://reviews.llvm.org/D19363 llvm-svn: 267340	2016-04-24 13:57:49 +00:00
Simon Pilgrim	79561eb759	[X86][SSE] Demonstrate issue with decoding shuffle masks that have been lowered as rematerialized constants on scalar unit Found whilst investigating PR27472 llvm-svn: 267339	2016-04-24 13:45:30 +00:00
Aaron Ballman	b2fd283fa7	Silence two C4806 warnings ('\|': unsafe operation: no value of type 'bool' promoted to type 'const unsigned int' can equal the given constant). The fact that they trigger with this code seems like it may be a bug, but the warning itself is still generally useful enough to retain it for now. llvm-svn: 267337	2016-04-24 13:03:20 +00:00
NAKAMURA Takumi	c8db88cdfd	Declare GlobalValue::LinkageTypes based on unsigned. Or, "LinkageTypes Linkage : 4;" might be sign-extended on msc. llvm-svn: 267335	2016-04-24 10:11:45 +00:00
NAKAMURA Takumi	013ddcd61c	llvm/test/tools/gold/X86/thinlto.ll: Possible fix corresponding to r267318. llvm-svn: 267334	2016-04-24 08:02:00 +00:00
Duncan P. N. Exon Smith	edafeb956c	BitcodeReader: Fix some holes in upgrade from r267296 Add tests for some missing cases to bitcode upgrade in r267296. - DICompositeType with an 'elements:' field, which will cause it to be involved in a cycle after the upgrade. - A DIDerivedType that references a class in 'extraData:'. I updated test/Bitcode/dityperefs-3.8.ll with the missing cases and regenerated test/Bitcode/dityperefs-3.8.ll.bc. llvm-svn: 267332	2016-04-24 06:52:01 +00:00
Craig Topper	87f8c97986	[X86] Merge LowerCTLZ and LowerCTLZ_ZERO_UNDEF into a single function that branches internally for the one difference, allowing the rest of the code to be common. NFC llvm-svn: 267331	2016-04-24 06:27:39 +00:00
Craig Topper	5e5f4bfbbd	[X86] Node need to check if AVX512 is supported when lowering vector CTLZ. The CTLZ operation is only Custom for vectors if AVX512 is enabled so if a vector gets here AVX512 is implied. NFC llvm-svn: 267330	2016-04-24 06:27:35 +00:00
Mehdi Amini	ae8cf7ca98	Add "hasSection" flag in the Summary Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19405 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267329	2016-04-24 05:31:43 +00:00
Gerolf Hoflehner	f601ce3709	[MachineCombiner] Support for floating-point FMA on ARM64 (re-commit r267098) The original patch caused crashes because it could derefence a null pointer for SelectionDAGTargetInfo for targets that do not define it. Evaluates fmul+fadd -> fmadd combines and similar code sequences in the machine combiner. It adds support for float and double similar to the existing integer implementation. The key features are: - DAGCombiner checks whether it should combine greedily or let the machine combiner do the evaluation. This is only supported on ARM64. - It gives preference to throughput over latency: the heuristic used is to combine always in loops. The targets decides whether the machine combiner should optimize for throughput or latency. - Supports for fmadd, f(n)msub, fmla, fmls patterns - On by default at O3 ffast-math llvm-svn: 267328	2016-04-24 05:14:01 +00:00
Craig Topper	7cda2fdad5	[X86] Remove isel patterns for selecting tzcnt/lzcnt from cmove/ne+cttz/ctlz. These are folded by DAG combine now. llvm-svn: 267326	2016-04-24 04:38:34 +00:00
Craig Topper	370802b1a1	[CodeGen] Teach DAG combine to fold select_cc seteq X, 0, sizeof(X), ctlz_zero_undef(X) -> ctlz(X). InstCombine already does this for IR and X86 pattern matches this during isel. A follow up commit will remove the X86 patterns to allow this to be tested. llvm-svn: 267325	2016-04-24 04:38:32 +00:00
Craig Topper	3b13c64dcf	Fix an assertion that can never fire because the condition ANDed with the string is just true or 1. llvm-svn: 267324	2016-04-24 04:38:29 +00:00
Adrian Prantl	72cad9d8f8	Revert "Verifier: Verify that each inlinable callsite of a debug-info-bearing function" This reverts commit r267320 while investigating an OpenMP buildbot failure. llvm-svn: 267322	2016-04-24 03:47:37 +00:00
Adrian Prantl	dcfa52d905	Verifier: Verify that each inlinable callsite of a debug-info-bearing function in a debug-info-bearing function has a debug location attached to it. Failure to do so causes an "!dbg attachment points at wrong subprogram for function" assertion failure when the inliner sets up inline scope info. rdar://problem/25878916 llvm-svn: 267320	2016-04-24 03:23:02 +00:00
Mehdi Amini	9e35f9ec46	Reorganize GlobalValueSummary with a "Flags" bitfield. Right now it only contains the LinkageType, but will be extended with "hasSection", "isOptSize", "hasInlineAssembly", etc. Differential Revision: http://reviews.llvm.org/D19404 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267319	2016-04-24 03:18:18 +00:00
Mehdi Amini	e000bb6ae8	Add a version field in the bitcode for the summary Differential Revision: http://reviews.llvm.org/D19456 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267318	2016-04-24 03:18:11 +00:00
Mehdi Amini	c68b6482c1	Add an internalization step to the ThinLTOCodeGenerator Keeping as much as possible internal/private is known to help the optimizer. Let's try to benefit from this in ThinLTO. Note: this is early work, but is enough to build clang (and all the LLVM tools). I still need to write some lit-tests... Differential Revision: http://reviews.llvm.org/D19103 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267317	2016-04-24 03:18:01 +00:00

1 2 3 4 5 ...

130666 Commits