llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-29 23:12:55 +01:00

Author	SHA1	Message	Date
Andrea Di Biagio	de7985179f	[X86] Always prefer to lower a VECTOR_SHUFFLE into a BLENDI instead of SHUFP (or VPERM2X128). This patch teaches method 'LowerVECTOR_SHUFFLE' to give higher precedence to the check for 'isBlendMask'; the idea is that, when possible, we should firstly check if a shuffle performs a blend, and in case, try to lower it into a BLENDI instead of selecting a SHUFP or (worse) a VPERM2X128. In general: - AVX VBLENDPS/D always have better latency and throughput than VPERM2F128; - BLENDPS/D instructions tend to always have better 'reciprocal throughput' than the equivalent SHUFPS/D; - Both BLENDPS/D and SHUFPS/D are often decoded into the same number of m-ops; however, a m-op obtained from a BLENDPS/D can be scheduled to more than one execution port. This patch: - Moves the check for 'isBlendMask' immediately before the check for 'isSHUFPMask' within method 'LowerVECTOR_SHUFFLE'; - Updates existing tests for sse/avx shuffle/blend instructions to verify that we select (v)blendps/d when possible (instead of (v)shufps/d or vperm2f128). llvm-svn: 211720	2014-06-25 17:41:58 +00:00
Juergen Ributzka	15c30a5d46	Fix indentation. llvm-svn: 211717	2014-06-25 16:49:37 +00:00
Rafael Espindola	4cfe49663b	Fix the build. llvm-svn: 211715	2014-06-25 15:47:36 +00:00
Rafael Espindola	196881cf29	Move expression visitation logic up to MCStreamer. Remove the duplicate from MCRecordStreamer. No functionality change. llvm-svn: 211714	2014-06-25 15:45:33 +00:00
Eli Bendersky	90bda3bffd	Add some test files for r211710. llvm-svn: 211711	2014-06-25 15:41:39 +00:00
Eli Bendersky	def2619060	Rename loop unrolling and loop vectorizer metadata to have a common prefix. [LLVM part] These patches rename the loop unrolling and loop vectorizer metadata such that they have a common 'llvm.loop.' prefix. Metadata name changes: llvm.vectorizer.* => llvm.loop.vectorizer.* llvm.loopunroll.* => llvm.loop.unroll.* This was a suggestion from an earlier review (http://reviews.llvm.org/D4090) which added the loop unrolling metadata. Patch by Mark Heffernan. llvm-svn: 211710	2014-06-25 15:41:00 +00:00
JF Bastien	00d9135fdd	Fix = delete in MSVC build from r211705 MSVC doesn't support = delete yet, use LLVM_DELETED_FUNCTION instead. Related to: http://reviews.llvm.org/D3390 llvm-svn: 211709	2014-06-25 15:38:02 +00:00
Rafael Espindola	0e71694bfa	Simplify the visitation of target expressions. No functionality change. llvm-svn: 211707	2014-06-25 15:29:54 +00:00
JF Bastien	80d81f27d3	Random Number Generator (llvm) Provides an abstraction for a random number generator (RNG) that produces a stream of pseudo-random numbers. The current implementation uses C++11 facilities and is therefore not cryptographically secure. The RNG is salted with the text of the current command line invocation. In addition, a user may specify a seed (reproducible builds). In clang, the seed can be set via -frandom-seed=X In the back end, the seed can be set via -rng-seed=X This is the llvm part of the patch. clang part: D3391 URL: http://reviews.llvm.org/D3390 Author: yln I'm landing this for the second time, it broke Windows bots the first time around. llvm-svn: 211705	2014-06-25 15:21:42 +00:00
Rafael Espindola	d7f079b49c	Simplify AddValueSymbols. No functionality change. llvm-svn: 211701	2014-06-25 14:42:14 +00:00
Evgeniy Stepanov	cda29aab74	[msan] Fix bad interaction between with-calls mode and chained origin tracking. Origin history should only be recorded for uninitialized values, because it is meaningless otherwise. This change moves __msan_chain_origin to the runtime library side and makes it conditional on the corresponding shadow value. Previous code was correct, but _very_ inefficient. llvm-svn: 211700	2014-06-25 14:41:57 +00:00
Rafael Espindola	6eca6c6b38	Don't leak a file descriptor. llvm-svn: 211699	2014-06-25 14:35:59 +00:00
Logan Chien	b6044edb18	Code cleanup. llvm-svn: 211697	2014-06-25 13:46:17 +00:00
Chandler Carruth	4a4a94b49b	Add Polly to the ignored trees. llvm-svn: 211695	2014-06-25 13:13:36 +00:00
Chandler Carruth	2262038283	[x86] Add intrinsics for the pshufd, pshuflw, and pshufhw instructions. llvm-svn: 211694	2014-06-25 13:12:54 +00:00
NAKAMURA Takumi	c5a2c81f7e	Re-apply r211399, "Generate native unwind info on Win64" with a fix to ignore SEH pseudo ops in X86 JIT emitter. -- This patch enables LLVM to emit Win64-native unwind info rather than DWARF CFI. It handles all corner cases (I hope), including stack realignment. Because the unwind info is not flexible enough to describe stack frames with a gap of unknown size in the middle, such as the one caused by stack realignment, I modified register spilling code to place all spills into the fixed frame slots, so that they can be accessed relative to the frame pointer. Patch by Vadim Chugunov! Reviewed By: rnk Differential Revision: http://reviews.llvm.org/D4081 llvm-svn: 211691	2014-06-25 12:41:52 +00:00
NAKAMURA Takumi	35a44c8eda	Reformat. llvm-svn: 211689	2014-06-25 12:40:56 +00:00
Andrea Di Biagio	7f4e676911	[X86] Add target combine rule to select ADDSUB instructions from a build_vector This patch teaches the backend how to combine a build_vector that implements an 'addsub' between packed float vectors into a sequence of vector add and vector sub followed by a VSELECT. The new VSELECT is expected to be lowered into a BLENDI. At ISel stage, the sequence 'vector add + vector sub + BLENDI' is pattern-matched against ISel patterns added at r211427 to select 'addsub' instructions. Added three more ISel patterns for ADDSUB. Added test sse3-avx-addsub-2.ll to verify that we correctly emit 'addsub' instructions. llvm-svn: 211679	2014-06-25 10:02:21 +00:00
Evgeniy Stepanov	5dad33bfe8	Factor out part of LICM::sink into a helper function. llvm-svn: 211678	2014-06-25 09:17:21 +00:00
Alexey Volkov	ba8b63c17a	Fix unresolved symbols when loading gold plugin Differential Revision: http://reviews.llvm.org/D4275 llvm-svn: 211675	2014-06-25 08:04:58 +00:00
Evgeniy Stepanov	aeb724c213	[LICM] Don't create more than one copy of an instruction per loop exit block when sinking. Fixes exponential compilation complexity in PR19835, caused by LICM::sink not handling the following pattern well: f = op g e = op f, g d = op e c = op d, e b = op c a = op b, c When an instruction with N uses is sunk, each of its operands gets N new uses (all of them - phi nodes). In the example above, if a had 1 use, c would have 2, e would have 4, and g would have 8. llvm-svn: 211673	2014-06-25 07:54:58 +00:00
Rafael Espindola	bbdd74d0fb	Fix another asserting method in the null streamer. llvm-svn: 211668	2014-06-25 05:37:58 +00:00
Rafael Espindola	f314cd4f5f	Fix a regression from r211653. The method was empty in the null streamer but I mistakenly replaced it with the aborting one in MCStreamer. llvm-svn: 211666	2014-06-25 05:31:22 +00:00
NAKAMURA Takumi	fdee168275	MCNullStreamer.cpp: Roll back a few empty methods that have been marked as unreachable in MCStreamer.cpp. void EmitCOFFSecRel32(MCSymbol const Symbol) override {} void EmitGPRel32Value(const MCExpr Value) override {} It should fix crash like "llc -mtriple=i686-cygwin -filetype=null". llvm-svn: 211664	2014-06-25 04:34:36 +00:00
NAKAMURA Takumi	dc31d81b4f	CodeGen/X86/pr20088.ll: Add -march=x86-64, or llc fails due to non-x86 default target. llvm-svn: 211659	2014-06-25 03:05:47 +00:00
Alp Toker	af677c39a3	Use SourceMgr::getMemoryBuffer() in a couple of places Cleanup only. llvm-svn: 211656	2014-06-25 00:41:15 +00:00
Rafael Espindola	44abada33b	Move some trivial methods up to MCStreamer. This saves some duplicated boilerplate in RecordStreamer and NullStreamer. llvm-svn: 211653	2014-06-25 00:27:53 +00:00
Lang Hames	85285a01f4	[RuntimeDyld] Adds the necessary hooks to MCJIT to be able to debug generated MachO files using the GDB JIT debugging interface. Patch by Keno Fischer. Thanks Keno! llvm-svn: 211652	2014-06-25 00:20:53 +00:00
Rafael Espindola	c718fdc4f6	Simplify the handling of .cfi_endproc. No functionality change. llvm-svn: 211651	2014-06-25 00:13:59 +00:00
Rafael Espindola	00c3a7ea4c	Simplify EmitLabel. All the "real" streamers were already calling to MCStreamer::EmitLabel to do part of the work. llvm-svn: 211646	2014-06-24 23:54:40 +00:00
Juergen Ributzka	236ea1c61a	[FastISel][X86] Fold XALU condition into branch and compare. Optimize the codegen of select and branch instructions to directly use the EFLAGS from the {s\|u}{add\|sub\|mul}.with.overflow intrinsics. llvm-svn: 211645	2014-06-24 23:51:21 +00:00
Tom Stellard	86f1137544	R600/SI: Use a ComplexPattern for MUBUF stores Now that non-leaf ComplexPatterns are allowed we can fold all the MUBUF store patterns into the instruction definition. We will also be able to reuse this new ComplexPattern for MUBUF loads and atomic operations. llvm-svn: 211644	2014-06-24 23:33:07 +00:00
Tom Stellard	840992bb71	R600: Promote i64 stores to v2i32 Now we need only one 64-bit pattern for stores. llvm-svn: 211643	2014-06-24 23:33:04 +00:00
NAKAMURA Takumi	bd0dc4812d	ldr-pseudo-obj-errors.s: Fix silly copypasto. llvm-svn: 211642	2014-06-24 23:18:07 +00:00
NAKAMURA Takumi	cd2bcef166	llvm/test/MC/AArch64/ldr-pseudo-obj-errors.s: Add -triple=aarch64-linux. AArch64 is unaware of PECOFF for now. FIXME: This should pass for also targeting aarch64-darwin. llvm-svn: 211640	2014-06-24 23:11:42 +00:00
Rafael Espindola	c0fea93ce8	Print a=b as an assignment. In assembly the expression a=b is parsed as an assignment, so it should be printed as one. This remove a truly horrible hack for producing a label with "a=.". It would be used by codegen but would never be reached by the asm parser. Sorry I missed this when it was first committed. llvm-svn: 211639	2014-06-24 22:45:16 +00:00
Matt Arsenault	37d6d91b5b	R600: Fix inconsistency in rsq instructions. R600 was using a clamped version of rsq, but SI was not. Add a new rsq_clamped intrinsic and use them consistently. It's unclear to me from the documentation what behavior the R600 instructions have, so I assume they have the legacy behavior described by the SI documents. For R600, use RECIPSQRT_IEEE for both llvm.AMDGPU.rsq.legacy and llvm.AMDGPU.rsq. R600 also has RECIPSQRT_FF, which I'm not sure how it fits in here. llvm-svn: 211637	2014-06-24 22:13:39 +00:00
Sanjay Patel	def1964051	fixed a few typos in comments llvm-svn: 211634	2014-06-24 21:11:51 +00:00
David Blaikie	ad1b24c1a0	Fix up scoping in a few tests (and delete one that validates unnecessary behavior). Most of this is just tests that were silently succeeding in spite of schema changes I made over a year ago. Cleaning them up as they lead to failures in a change I'm working on/will come soon. test/DebugInfo/2010-01-19-DbgScope.ll was removed as it tested miscoping where a DebugLoc described a location not in the current function. The test case doesn't describe why this is a valid situation and should be supported, so I'm removing it and shortly going to commit changes that make this firmly unsupported/assert-fail. llvm-svn: 211628	2014-06-24 20:10:27 +00:00
Bill Schmidt	bfe90f8c83	[PPC64] Fix PR20071 (fctiduz generated for targets lacking that instruction) PR20071 identifies a problem in PowerPC's fast-isel implementation for floating-point conversion to integer. The fctiduz instruction was added in Power ISA 2.06 (i.e., Power7 and later). However, this instruction is being generated regardless of which 64-bit PowerPC target is selected. The intent is for fast-isel to punt to DAG selection when this instruction is not available. This patch implements that change. For testing purposes, the existing fast-isel-conversion.ll test adds a RUN line for -mcpu=970 and tests for the expected code generation. Additionally, the existing test fast-isel-conversion-p5.ll was found to be incorrectly expecting the unavailable instruction to be generated. I've removed these test variants since we have adequate coverage in fast-isel-conversion.ll. llvm-svn: 211627	2014-06-24 20:05:18 +00:00
Robert Khasanov	c1b0024016	vpblend intrinsics combines as shifts intrinsics due to absence return stmt between them Fix PR20088 Differential Revision: http://reviews.llvm.org/D4277 llvm-svn: 211617	2014-06-24 18:08:04 +00:00
Matt Arsenault	11e06d5cd5	R600: Remove DIV_INF This corresponded to an amdil instruction which there is a 2 instruction equivalent for. llvm-svn: 211616	2014-06-24 17:42:16 +00:00
Matt Arsenault	7819e41b84	R600/SI: Move pattern to instruction definition llvm-svn: 211614	2014-06-24 17:17:06 +00:00
Weiming Zhao	f75734dbe4	Fix test case in r211605/r211533 The test case in "Fix PR20056: Implement pseudo LDR <reg>, =<literal/label> for AArch64" should only work with Linux. llvm-svn: 211613	2014-06-24 17:05:43 +00:00
Diego Novillo	51b3abee30	Add new debug kind LocTrackingOnly. Summary: This new debug emission kind supports emitting line location information in all instructions, but stops code generation from emitting debug info to the final output. This mode is useful when the backend wants to track source locations during code generation, but it does not want to produce debug info. This is currently used by optimization remarks (-pass-remarks, -pass-remarks-missed and -pass-remarks-analysis). To prevent debug info emission, DIBuilder never inserts the annotation 'llvm.dbg.cu' when LocTrackingOnly is enabled. Reviewers: echristo, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4234 llvm-svn: 211609	2014-06-24 17:02:03 +00:00
Weiming Zhao	23e9a680c2	Resubmit commit r211533 "Fix PR20056: Implement pseudo LDR <reg>, =<literal/label> for AArch64" Missed files are added in this commit. llvm-svn: 211605	2014-06-24 16:21:38 +00:00
David Majnemer	02a115bee2	CodeGen: Avoid multiple strlen calls Use a StringRef to hold our section prefix. This avoids multiple calls to strlen. llvm-svn: 211602	2014-06-24 16:01:53 +00:00
Christian Pirker	4deae9a4a4	ARM: Fix TPsoft for Thumb mode Reviewed at http://reviews.llvm.org/D4230 llvm-svn: 211601	2014-06-24 15:45:59 +00:00
Rafael Espindola	c24f075f54	Replace two release calls with std::move. I missed this on the previous commit. llvm-svn: 211597	2014-06-24 14:25:17 +00:00
Rafael Espindola	3df1c115ec	Pass a unique_ptr<MemoryBuffer> to the constructors in the Binary hierarchy. Once the objects are constructed, they own the buffer. Passing a unique_ptr makes that clear. llvm-svn: 211595	2014-06-24 13:56:32 +00:00

1 2 3 4 5 ...

104873 Commits