llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Eric Christopher	4de79a10e1	Make an X86 specific directory and put the recent X86 tti specific inlining test into it. llvm-svn: 241223	2015-07-02 01:36:31 +00:00
Eric Christopher	5168f3454a	Implement TargetTransformInfo::hasCompatibleFunctionAttributes for X86. This checks subtarget feature compatibility for inlining by verifying that the callee is a strict subset of the caller's features. This includes the cpu as part of the subtarget we can get via the incoming functions as the backend takes CPUs as feature sets. This allows us to inline things like: int foo() { return baz(); } int __attribute__((target("sse4.2"))) bar() { return foo(); } so that generic code can be inlined into specialized functions. llvm-svn: 241221	2015-07-02 01:11:50 +00:00
Eric Christopher	99d0bcb7d3	Add a routine to TargetTransformInfo that will allow targets to look at the attributes on a function to determine whether or not to allow inlining. llvm-svn: 241220	2015-07-02 01:11:47 +00:00
JF Bastien	8ef54c36dd	WebAssembly: start instructions Summary: * Add 64-bit address space feature. * Rename SIMD feature to SIMD128. * Handle single-thread model with an IR pass (same way ARM does). * Rename generic processor to MVP, to follow design's lead. * Add bleeding-edge processors, with all features included. * Fix a few DEBUG_TYPE to match other backends. Test Plan: ninja check Reviewers: sunfish Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D10880 llvm-svn: 241211	2015-07-01 23:41:25 +00:00
Quentin Colombet	fe7967bf11	[TwoAddressInstructionPass] Try 3 Addr Conversion After Commuting. TwoAddressInstructionPass stops after a successful commuting but 3 Addr conversion might be good for some cases. Consider: int foo(int a, int b) { return a + b; } Before this commit, we emit: addl %esi, %edi movl %edi, %eax ret After this commit, we try 3 Addr conversion: leal (%rsi,%rdi), %eax ret Patch by Volkan Keles <vkeles@apple.com>! Differential Revision: http://reviews.llvm.org/D10851 llvm-svn: 241206	2015-07-01 23:12:13 +00:00
Pawel Bylica	87cf433a3d	Change APInt comparison with uint64_t. Summary: This patch changes the way APInt is compared with a value of type uint64_t. Before the uint64_t value was truncated to the size of APInt before comparison. Now the comparison takes into account full 64-bit precision. Test Plan: Unit tests added. No regressions. Self-hosted check-all done as well. Reviewers: chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10655 llvm-svn: 241204	2015-07-01 22:56:43 +00:00
Matthias Braun	e7b061b51a	Test for specific output in lit test llvm-svn: 241200	2015-07-01 22:34:59 +00:00
Alexey Samsonov	0c5bae62d1	[LoopVectorize] Use ReplaceInstWithInst() helper where appropriate. This is mostly an NFC, which increases code readability (instead of saving old terminator, generating new one in front of old, and deleting old, we just call a function). However, it would additionaly copy the debug location from old instruction to replacement, which would help PR23837. llvm-svn: 241197	2015-07-01 22:18:30 +00:00
Pete Cooper	c7d19030e2	Pack MCSymbol::Flags in to the bitfield with other members. NFC. All file formats only needed 16-bits right now which is enough to fit in to the padding with other fields. This reduces the size of MCSymbol to 24-bytes on a 64-bit system. The layout is now 0 \| class llvm::MCSymbol 0 \| class llvm::PointerIntPair SectionOrFragmentAndHasName 0 \| intptr_t Value \| [sizeof=8, dsize=8, align=8 \| nvsize=8, nvalign=8] 8 \| unsigned int IsTemporary 8 \| unsigned int IsRedefinable 8 \| unsigned int IsUsed 8 \| _Bool IsRegistered 8 \| unsigned int IsExternal 8 \| unsigned int IsPrivateExtern 8 \| unsigned int Kind 9 \| unsigned int IsUsedInReloc 9 \| unsigned int SymbolContents 9 \| unsigned int CommonAlignLog2 10 \| uint32_t Flags 12 \| uint32_t Index 16 \| union 16 \| uint64_t Offset 16 \| uint64_t CommonSize 16 \| const class llvm::MCExpr * Value \| [sizeof=8, dsize=8, align=8 \| nvsize=8, nvalign=8] \| [sizeof=24, dsize=24, align=8 \| nvsize=24, nvalign=8] llvm-svn: 241196	2015-07-01 21:57:51 +00:00
Dan Gohman	3b5aac894a	[WebAssembly] Define separate Target instances for 32-bit and 64-bit. llvm-svn: 241193	2015-07-01 21:42:34 +00:00
Jingyue Wu	5f36b4cd05	[NVPTX] expand extload/truncstore for vectors of floats Summary: According to PTX ISA: For convenience, ld, st, and cvt instructions permit source and destination data operands to be wider than the instruction-type size, so that narrow values may be loaded, stored, and converted using regular-width registers. For example, 8-bit or 16-bit values may be held directly in 32-bit or 64-bit registers when being loaded, stored, or converted to other types and sizes. The operand type checking rules are relaxed for bit-size and integer (signed and unsigned) instruction types; floating-point instruction types still require that the operand type-size matches exactly, unless the operand is of bit-size type. So, the ISA does not support load with extending/store with truncatation for floating numbers. This is reflected in setting the loadext/truncstore actions to expand in the code for floating numbers, but vectors of floating numbers are not taken care of. As a result, loading a vector of floats followed by a fp_extend may be combined by DAGCombiner to a extload, and the extload may be lowered to NVPTXISD::LoadV2 with extending information. However, NVPTXISD::LoadV2 does not perform extending, and no extending instructions are inserted. Finally, PTX instructions with mismatched types are generated, like ld.v2.f32 {%fd3, %fd4}, [%rd2] This patch adds the correct actions for vectors of floats, so DAGCombiner would not create loads with extending, and correct code is generated. Patched by Gang Hu. Test Plan: Test case attached. Reviewers: jingyue Reviewed By: jingyue Subscribers: llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D10876 llvm-svn: 241191	2015-07-01 21:32:42 +00:00
Pete Cooper	c374818b5f	Encode MCSymbol alignment as log2(align). Given that alignments are always powers of 2, just encode it this way. This matches how we encode alignment on IR GlobalValue's for example. This compresses the CommonAlign member down to 5 bits which allows it to pack better with the surrounding fields. Reviewed by Duncan Exon Smith. llvm-svn: 241189	2015-07-01 21:07:03 +00:00
Reid Kleckner	de3cecdd83	[WinEH] Use llvm.x86.seh.recoverfp in WinEHPrepare Don't pattern match for frontend outlined finally calls on non-x64 platforms. The 32-bit runtime uses a different funclet prototype. Now, the frontend is pre-outlining the finally bodies so that it ends up doing most of the heavy lifting for variable capturing. We're just outlining the callsite, and adapting the frameaddress(0) call to line up the frame pointer recovery. llvm-svn: 241186	2015-07-01 20:59:25 +00:00
Jingyue Wu	bf15de754a	[NVPTX] Move NVPTXPeephole after NVPTXPrologEpilogPass Summary: Offset of frame index is calculated by NVPTXPrologEpilogPass. Before that the correct offset of stack objects cannot be obtained, which leads to wrong offset if there are more than 2 frame objects. This patch move NVPTXPeephole after NVPTXPrologEpilogPass. Because the frame index is already replaced by %VRFrame in NVPTXPrologEpilogPass, we check VRFrame register instead, and try to remove the VRFrame if there is no usage after NVPTXPeephole pass. Patched by Xuetian Weng. Test Plan: Strengthened test/CodeGen/NVPTX/local-stack-frame.ll to check the offset calculation based on SP and SPL. Reviewers: jholewinski, jingyue Reviewed By: jingyue Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10853 llvm-svn: 241185	2015-07-01 20:08:06 +00:00
Bill Schmidt	2cbe4a84c5	[PPC64LE] Enable missing lxvdsx optimization, and related swap optimization When adding little-endian vector support for PowerPC last year, I inadvertently disabled an optimization that recognizes a load-splat idiom and generates the lxvdsx instruction. This patch moves the offending logic so lxvdsx is once again generated. This pattern is frequently generated by the vectorizer for scalar loads of an effective constant. Previously the lxvdsx instruction was wrongly listed as lane-sensitive for the VSX swap optimization (since both doublewords are identical, swaps are safe). This patch fixes this as well, so that vectorized code using lxvdsx can now have swaps removed from the computation. There is an existing test (@test50) in test/CodeGen/PowerPC/vsx.ll that checks for the missing optimization. However, vsx.ll was only being tested for POWER7 with big-endian code generation. I've added a little-endian RUN statement and expected LE code generation for all the tests in vsx.ll to give us a bit better VSX coverage, including what's needed for this patch. llvm-svn: 241183	2015-07-01 19:40:07 +00:00
Sanjay Patel	09ab09a461	add a cl::opt override for TargetLoweringBase's JumpIsExpensive This patch is not intended to change existing codegen behavior for any target. It just exposes the JumpIsExpensive setting on the command-line to allow for easier testing and emergency overrides. Also, change the existing regression test to use FileCheck, explicitly specify the jump-is-expensive option, and use more precise checks. Differential Revision: http://reviews.llvm.org/D10846 llvm-svn: 241179	2015-07-01 18:10:20 +00:00
Jonathan Roelofs	d560481f7a	Disallow in-source builds (as we already do for the cmake build). http://reviews.llvm.org/D10614 llvm-svn: 241178	2015-07-01 18:09:21 +00:00
David Blaikie	154167e051	Revert "[DWARF] Fix debug info generation for function static variables, typedefs, and records" Caused PR24008 This reverts commit 37cb5f1c2db9f42d29f26b215585f56bb64ae4f5. llvm-svn: 241176	2015-07-01 18:07:16 +00:00
Sanjay Patel	b294bc94ef	fix formatting; NFC llvm-svn: 241175	2015-07-01 17:58:53 +00:00
Sanjay Patel	30c5c88ab0	fix typos in comment; NFC llvm-svn: 241174	2015-07-01 17:55:07 +00:00
Matthias Braun	dcd259d33d	LivePhysRegs: Add support to add pristine registers when populating with live-in/live-out registers. Differential Revision: http://reviews.llvm.org/D10139 llvm-svn: 241172	2015-07-01 17:17:17 +00:00
Reid Kleckner	ff8213aeba	[SEH] Don't assert if the parent function lacks a personality The EH code might have been deleted as unreachable and the personality pruned while the filter is still present. Currently I'm hitting this at -O0 due to the clang bug PR24009. llvm-svn: 241170	2015-07-01 16:45:47 +00:00
Benjamin Kramer	49252b40d4	[AsmPrinter] Hide implementation details NFC. llvm-svn: 241169	2015-07-01 16:18:16 +00:00
Arnaud A. de Grandmaison	9f66584696	[AArch64] Implement add/adds/sub/subs/cmp/cmn with negative immediate aliases This patch teaches the AsmParser to accept add/adds/sub/subs/cmp/cmn with a negative immediate operand and convert them as shown: add Rd, Rn, -imm -> sub Rd, Rn, imm sub Rd, Rn, -imm -> add Rd, Rn, imm adds Rd, Rn, -imm -> subs Rd, Rn, imm subs Rd, Rn, -imm -> adds Rd, Rn, imm cmp Rn, -imm -> cmn Rn, imm cmn Rn, -imm -> cmp Rn, imm Those instructions are an alternate syntax available to assembly coders, and are needed in order to support code already compiling with some other assemblers (gas). They are documented in the "ARMv8 Instruction Set Overview", in the "Arithmetic (immediate)" section. This makes llvm-mc a programmer-friendly assembler ! This also fixes PR20978: "Assembly handling of adding negative numbers not as smart as gas". llvm-svn: 241166	2015-07-01 15:05:58 +00:00
Benjamin Kramer	8ac96a89b0	[SDAG] Give InstrEmitter hidden visibility NFC. llvm-svn: 241165	2015-07-01 14:55:10 +00:00
Benjamin Kramer	dcff586d10	[CodeGen] Reduce visibility of implementation details NFC. llvm-svn: 241164	2015-07-01 14:47:39 +00:00
James Y Knight	4771453f17	[Sparc] Rearrange SparcInstrInfo, no change. Move some instructions into order of sections in the spec, as the rest already were. Differential Revision: http://reviews.llvm.org/D9102 llvm-svn: 241163	2015-07-01 14:38:07 +00:00
Michael Kuperstein	2f57b2f3e7	Test committed in r241153 is more target-specific than I thought. Moving the (original, x86-only) test to the X86 directory. llvm-svn: 241162	2015-07-01 13:45:25 +00:00
Scott Douglass	a5d4043494	Expand Phabricator docs slightly llvm-svn: 241161	2015-07-01 13:41:18 +00:00
Igor Breger	cdff3524c0	AVX-512: Implemented missing encoding for FMA scalar instructions Added tests for encoding Differential Revision: http://reviews.llvm.org/D10865 llvm-svn: 241159	2015-07-01 13:24:28 +00:00
Michael Kuperstein	588a0d1157	Fix non-target-specific test not to use the x86 triple. llvm-svn: 241158	2015-07-01 13:05:57 +00:00
Rafael Espindola	2aa69908b2	Return ErrorOr from getSection. This also improves the logic of what is an error: * getSection(uint_32): only return an error if the index is out of bounds. The index 0 corresponds to a perfectly valid entry. * getSection(Elf_Sym): Returns null for symbols that normally don't have sections and error for out of bound indexes. In many places this just moves the report_fatal_error up the stack, but those can then be fixed in smaller patches. llvm-svn: 241156	2015-07-01 12:56:27 +00:00
Michael Kuperstein	1d95d15e94	[DWARF] Fix debug info generation for function static variables, typedefs, and records Function static variables, typedefs and records (class, struct or union) declared inside a lexical scope were associated with the function as their parent scope, rather than the lexical scope they are defined or declared in. This fixes PR19238 Patch by: amjad.aboud@intel.com Differential Revision: http://reviews.llvm.org/D9758 llvm-svn: 241153	2015-07-01 12:33:11 +00:00
Michael Kuperstein	5ab1a3193f	[X86] Avoid over-relaxation of 8-bit immediates in integer arithmetic instructions. Only consider an instruction a candidate for relaxation if the last operand of the instruction is an expression. We previously checked whether any operand is an expression, which is useless, since for all instructions concerned, the only operand that may be affected by relaxation is the last one. In addition, this removes the check for having RIP as an argument, since it was plain wrong - even when one of the arguments is RIP, relaxation may still be needed. This fixes PR9807. Patch by: david.l.kreitzer@intel.com Differential Revision: http://reviews.llvm.org/D10766 llvm-svn: 241152	2015-07-01 10:54:42 +00:00
NAKAMURA Takumi	87d6ebc814	Revert part of r241149, "Fix PR23872: Integrated assembler error message when using .type directive with @ in AArch32 assembly." The test should be split among targets. llvm/test/MC/ELF/ is assumed as X86. llvm-svn: 241151	2015-07-01 10:28:09 +00:00
Zoran Jovanovic	0f96f10936	[mips][microMIPS] Implement SLL and NOP instructions http://reviews.llvm.org/D10474 llvm-svn: 241150	2015-07-01 09:54:51 +00:00
Gabor Ballabas	f27e3515e1	Fix PR23872: Integrated assembler error message when using .type directive with @ in AArch32 assembly. The AArch32 assembler parses the '@' as a comment symbol, so the error message shouldn't suggest that '@<type>' is a valid replacement when assembling for AArch32 target. Differential Revision: http://reviews.llvm.org/D10651 llvm-svn: 241149	2015-07-01 08:58:49 +00:00
David Majnemer	194197c127	[LoopUnroll] Use undef for phis with no value live We would create a phi node with a zero initialized operand instead of undef in the case where no value was originally available. This was problematic for x86_mmx which has no null value. llvm-svn: 241143	2015-07-01 05:38:07 +00:00
David Majnemer	e129f33667	[SCCP] Turn loads of null into undef instead of zero initialized values Surprisingly, this is a correctness issue: the mmx type exists for calling convention purposes, LLVM doesn't have a zero representation for them. This partially fixes PR23999. llvm-svn: 241142	2015-07-01 05:37:57 +00:00
Jingyue Wu	add2634803	[NaryReassociate] enhances nsw by leveraging @llvm.assume Summary: nsw are flaky and can often be removed by optimizations. This patch enhances nsw by leveraging @llvm.assume in the IR. Specifically, NaryReassociate now understands that assume(a + b >= 0) && assume(a >= 0) ==> a +nsw b As a result, it can split more sext(a + b) into sext(a) + sext(b) for CSE. Test Plan: nary-gep.ll Reviewers: broune, meheff Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10822 llvm-svn: 241139	2015-07-01 03:38:49 +00:00
JF Bastien	fb8500ea43	Getting started docs: https, and check signature Summary: Download should be over https, not insecure ftp at least for the signature and key files. The signature should also get verified. Test Plan: None Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10845 llvm-svn: 241138	2015-07-01 03:32:08 +00:00
Alexey Samsonov	9e9ff8b9bc	[SanitizerCoverage] Don't add instrumentation to unreachable blocks. llvm-svn: 241127	2015-06-30 23:11:45 +00:00
Mark Heffernan	89eed2bb9e	Fix several typos in LangRef.rst related to loop unrolling metadata. llvm-svn: 241126	2015-06-30 22:48:51 +00:00
Reid Kleckner	8011644f25	[SEH] Add new intrinsics for recovering and restoring parent frames The incoming EBP value established by the runtime is actually a pointer to the end of the EH registration object, and not the true parent function frame pointer. Clang doesn't need llvm.x86.seh.exceptioninfo anymore because we know that the exception info pointer is at a fixed offset from this incoming EBP. The llvm.x86.seh.recoverfp intrinsic takes an EBP value provided by the EH runtime and returns a pointer that is usable with llvm.framerecover. The llvm.x86.seh.restoreframe intrinsic is inserted by the 32-bit specific preparation pass in blocks targetted by the EH runtime. It re-establishes any physical registers used by the parent function to address the stack, such as the frame, base, and stack pointers. Neither of these intrinsics correctly handle stack realignment prologues yet, but it's possible to add that later. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D10848 llvm-svn: 241125	2015-06-30 22:46:59 +00:00
Alexey Samsonov	8074d98668	[IRBuilder] Delete unused constructor and SetInsertPoint overload. llvm-svn: 241124	2015-06-30 22:38:22 +00:00
Alexey Samsonov	8f755821c9	Fix memory leak in unittest added in r241101. llvm-svn: 241123	2015-06-30 22:17:29 +00:00
David Majnemer	cd7100d557	[Cloning] Teach CloneModule about personality functions CloneModule didn't take into account that it needed to remap the value using values in the module. This fixes PR23992. llvm-svn: 241122	2015-06-30 22:14:01 +00:00
Jingyue Wu	4759e1d16f	[NVPTX] cleanups and refacotring in NVPTXFrameLowering.cpp Summary: NFC Test Plan: no regression Reviewers: wengxt Reviewed By: wengxt Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10849 llvm-svn: 241118	2015-06-30 21:28:31 +00:00
Sanjoy Das	d0dd6a0ba1	[FaultMaps] Let the frontend pre-select implicit null check candidates. Summary: This change introduces a !make.implicit metadata that allows the frontend to pre-select the set of explicit null checks that will be considered for transformation into implicit null checks. The reason for not using profiling data instead of !make.implicit is explained in the change to `FaultMaps.rst`. Reviewers: atrick, reames, pgavlin, JosephTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10824 llvm-svn: 241116	2015-06-30 21:22:32 +00:00
Pete Cooper	3c754aaf59	Pack MCSymbol::HasName in to a spare bit in the section/fragment union. This is part of an effort to pack the average MCSymbol down to 24 bytes. The HasName bit was pushing the size of the bitfield over to another word, so this change uses a PointerIntPair to fit in it to unused bits of a PointerUnion. Reviewed by Rafael Espíndola llvm-svn: 241115	2015-06-30 20:54:21 +00:00

... 4 5 6 7 8 ...

119110 Commits