llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-27 22:12:47 +01:00

Author	SHA1	Message	Date
Craig Topper	19cca98c0c	Make RecordKeeper::addClass/addDef take unique_ptrs instead of creating one internally. llvm-svn: 222948	2014-11-29 05:52:51 +00:00
Craig Topper	2d10991762	Use unique_ptr to remove some explicit deletes on some error case returns. At least one spot of weird ownership passing that needs some future cleanup. llvm-svn: 222947	2014-11-29 05:31:10 +00:00
Duncan P. N. Exon Smith	57cead164b	DebugIR: Delete -debug-ir llvm-svn: 222945	2014-11-29 03:15:47 +00:00
Matt Arsenault	5c2fb0e261	R600/SI: Fix assertion on sign extend of 3 vectors This was trying to create an MVT with 3x vectors which created an invalid EVT llvm-svn: 222942	2014-11-28 22:51:38 +00:00
Duncan P. N. Exon Smith	73ce6dbb2b	Revert "Masked Vector Load and Store Intrinsics." This reverts commit r222632 (and follow-up r222636), which caused a host of LNT failures on an internal bot. I'll respond to the commit on the list with a reproduction of one of the failures. Conflicts: lib/Target/X86/X86TargetTransformInfo.cpp llvm-svn: 222936	2014-11-28 21:29:14 +00:00
David Majnemer	87a17a0975	InstCombine: FoldOrOfICmps harder We may be in a situation where the icmps might not be near each other in a tree of or instructions. Try to dig out related compare instructions and see if they combine. N.B. This won't fire on deep trees of compares because rewritting the tree might end up creating a net increase of IR. We may have to resort to something more sophisticated if this is a real problem. llvm-svn: 222928	2014-11-28 19:58:29 +00:00
Bruno Cardoso Lopes	a072cee758	[LICM] Store sink and indirectbr instructions Loop simplify skips exit-block insertion when exits contain indirectbr instructions. This leads to an assertion in LICM when trying to sink stores out of non-dedicated loop exits containing indirectbr instructions. This patch fix this issue by re-checking for dedicated exits in LICM prior to store sink attempts. Differential Revision: http://reviews.llvm.org/D6414 rdar://problem/18943047 llvm-svn: 222927	2014-11-28 19:47:46 +00:00
Bruno Cardoso Lopes	02319ec9b1	[SwitchLowering] Handle multiple destinations on condensed case stmts Switch cases statements with sequential values that branch to the same destination BB may often be handled together in a single new source BB. In this scenario we need to remove remaining incoming values from PHI instructions in the destination BB, as to match the number of source branches. Differential Revision: http://reviews.llvm.org/D6415 rdar://problem/19040894 llvm-svn: 222926	2014-11-28 19:47:33 +00:00
Sanjay Patel	019be40c8c	Enable FeatureFastUAMem for btver2 Allow unaligned 16-byte memop codegen for btver2. No functional changes for any other subtargets. Replace the existing supposed small memcpy test with an actual test of a small memcpy. The previous test wasn't using FileCheck either. This patch should allow us to close PR21541 ( http://llvm.org/bugs/show_bug.cgi?id=21541 ). Differential Revision: http://reviews.llvm.org/D6360 llvm-svn: 222925	2014-11-28 18:40:18 +00:00
Rafael Espindola	880240ae91	Add back r222727 with a fix. The original patch would fail when: * A dst opaque type (%A) is matched with a src type (%A). * A src opaque (%E) type is then speculatively matched with %A and the speculation fails afterward. * When rolling back the speculation we would cancel the source %A to dest %A mapping. The fix is to keep an explicit list of which resolutions are speculative. Original message: Fix overly aggressive type merging. If we find out that two types are not isomorphic, we learn nothing about opaque sub types in both the source and destination. llvm-svn: 222923	2014-11-28 16:41:24 +00:00
Rafael Espindola	6128226dc5	Add an assert and use a range loop. NFC. llvm-svn: 222922	2014-11-28 16:26:14 +00:00
Charlie Turner	dc3f8f7176	Fix wrong encoding of MRSBanked. Patch by Matthew Wahab. Change-Id: Ia2a001ca2760028ea360fe77b56f203a219eefbc llvm-svn: 222920	2014-11-28 15:01:06 +00:00
Evgeniy Stepanov	118b7804cb	[msan] Fix origin propagation for select of floats. MSan does not assign origin for instrumentation temps (i.e. the ones that do not come from the application code), but "select" instrumentation erroneously tried to use one of those. https://code.google.com/p/memory-sanitizer/issues/detail?id=78 llvm-svn: 222918	2014-11-28 11:17:58 +00:00
Ankur Garg	0bf088f50d	Removed extra line from a comment to test first commit. NFC. llvm-svn: 222916	2014-11-28 10:38:18 +00:00
Craig Topper	c8e385cf52	Add missing 'override' keyword. llvm-svn: 222911	2014-11-28 03:58:26 +00:00
Tim Northover	98dae1ec10	Stop using ArrayRef of a const type. I think this is what the GCC bots are complaining about. llvm-svn: 222905	2014-11-27 21:29:20 +00:00
Tim Northover	e8b34aaff0	AArch64: treat [N x Ty] as a block during procedure calls. The AAPCS treats small structs and homogeneous floating (or vector) aggregates specially, and guarantees they either get passed as a contiguous block of registers, or prevent any future use of those registers and get passed on the stack. This concept can fit quite neatly into LLVM's own type system, mapping an HFA to [N x float] and so on, and small structs to [N x i64]. Doing so allows front-ends to emit AAPCS compliant code without having to duplicate the register counting logic. llvm-svn: 222903	2014-11-27 21:02:42 +00:00
Zoran Jovanovic	15712f82b0	[mips][microMIPS] Implement SWM16 and LWM16 instructions Differential Revision: http://reviews.llvm.org/D5579 llvm-svn: 222901	2014-11-27 18:28:59 +00:00
Jozef Kolek	c85a4a5656	[mips][microMIPS] Implement BREAK16 and SDBBP16 instructions Patch by Radovan Obradovic. Differential Revision: http://reviews.llvm.org/D5048 llvm-svn: 222900	2014-11-27 18:18:42 +00:00
Daniel Sanders	d7d5d4cdb1	[mips] Add synci instruction. Patch by Amaury Pouly Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6421 llvm-svn: 222899	2014-11-27 17:28:10 +00:00
Rafael Espindola	fe0669d419	Commit back the correct bits of r222760 (was r222538). I also added a test. Original message: Allow FDE references outside the +/-2GB range supported by PC relative offsets for code models other than small/medium. For JIT application, memory layout is less controlled and can result in truncations otherwise. Patch from Akos Kiss. Differential Revision: http://reviews.llvm.org/D6079 llvm-svn: 222897	2014-11-27 17:13:56 +00:00
Rafael Espindola	008818ce29	Revert "Reapply 222538 and update tests to explicitly request small code model and PIC:" This reverts commit r222760. It changed our behaviour on PIC so we don't match gas anymore. It also included lots of unnecessary changes to tests. If those changes are desirable, there should be an independent discussion as they are out of scope for that patch. I will recommit the other bits. llvm-svn: 222896	2014-11-27 17:13:51 +00:00
Duncan P. N. Exon Smith	bcb91dbf3f	Revert "Fix overly aggressive type merging." This reverts commit r222727, which causes LTO bootstrap failures. Last passing @ r222698: http://lab.llvm.org:8080/green/job/clang-Rlto_master_build/532/ First failing @ r222843: http://lab.llvm.org:8080/green/job/clang-Rlto_master_build/533/ Internal bootstraps pointed at a much narrower range: r222725 is passing, and r222731 is failing. LTO crashes while handling libclang.dylib: http://lab.llvm.org:8080/green/job/clang-Rlto_master_build/533/consoleFull#-158682280549ba4694-19c4-4d7e-bec5-911270d8a58c GEP is not of right type for indices! %InfoObj.i.i = getelementptr inbounds %"class.llvm::OnDiskIterableChainedHashTable"* %.lcssa, i64 0, i32 0, i32 4, !dbg !123627 %"class.clang::serialization::reader::ASTIdentifierLookupTrait" = type { %"class.clang::ASTReader.31859", %"class.clang::serialization::ModuleFile.31870", %"class.clang::IdentifierInfo"* }LLVM ERROR: Broken function found, compilation aborted! clang: error: linker command failed with exit code 1 (use -v to see invocation) Looks like the new algorithm doesn't merge types aggressively enough. llvm-svn: 222895	2014-11-27 17:01:10 +00:00
Erik Eckstein	b81721adb5	reinstate r222872: Peephole optimization in switch table lookup: reuse the guarding table comparison if possible. Fixed missing dominance check. Original commit message: This optimization tries to reuse the generated compare instruction, if there is a comparison against the default value after the switch. Example: if (idx < tablesize) r = table[idx]; // table does not contain default_value else r = default_value; if (r != default_value) ... Is optimized to: cond = idx < tablesize; if (cond) r = table[idx]; else r = default_value; if (cond) ... Jump threading will then eliminate the second if(cond). llvm-svn: 222891	2014-11-27 15:13:14 +00:00
Evgeniy Stepanov	a93bac024f	[msan] Remove indirect call wrapping code. This functionality was only used in MSanDR, which is deprecated. llvm-svn: 222889	2014-11-27 14:54:02 +00:00
Jozef Kolek	90243462d4	[mips][microMIPS] Implement disassembler support for 16-bit instructions LI16, ADDIUR1SP, ADDIUR2 and ADDIUS5 Differential Revision: http://reviews.llvm.org/D6419 llvm-svn: 222887	2014-11-27 14:41:44 +00:00
Charlie Turner	e4a6c7fd89	Stop uppercasing build attribute data. The string data for string-valued build attributes were being unconditionally uppercased. There is no mention in the ARM ABI addenda about case conventions, so it's technically implementation defined as to whether the data are capitialised in some way or not. However, there are good reasons not to captialise the data. * It's less work. * Some vendors may legitimately have case-sensitive checks for these attributes which would fail on LLVM generated object files. * There could be locale issues with uppercasing. The original reasons for uppercasing appear to have stemmed from an old codesourcery toolchain behaviour, see http://comments.gmane.org/gmane.comp.compilers.llvm.cvs/87133 This patch makes the object file emitted no longer captialise string data, it encodes as seen in the assembly source. Change-Id: Ibe20dd6e60d2773d57ff72a78470839033aa5538 llvm-svn: 222882	2014-11-27 12:13:56 +00:00
Erik Eckstein	6a5635f3ac	Revert "Peephole optimization in switch table lookup: reuse the guarding table comparison if possible." It is breaking the clang bootstrag. llvm-svn: 222877	2014-11-27 10:59:08 +00:00
Erik Eckstein	15d20044f2	Peephole optimization in switch table lookup: reuse the guarding table comparison if possible. This optimization tries to reuse the generated compare instruction, if there is a comparison against the default value after the switch. Example: if (idx < tablesize) r = table[idx]; // table does not contain default_value else r = default_value; if (r != default_value) ... Is optimized to: cond = idx < tablesize; if (cond) r = table[idx]; else r = default_value; if (cond) ... \endcode Jump threading will then eliminate the second if(cond). llvm-svn: 222872	2014-11-27 08:33:51 +00:00
David Majnemer	9698d3487d	InstCombine: Restore optimizations lost in r210006 This restores our ability to optimize: (X & C) == 0 ? X ^ C : X into X \| C (X & C) != 0 ? X ^ C : X into X & ~C llvm-svn: 222871	2014-11-27 07:25:21 +00:00
NAKAMURA Takumi	86d230cb48	Add LLVMObject to LLVMExecutionEngine. llvm-svn: 222869	2014-11-27 06:36:22 +00:00
David Majnemer	d9ae958b9b	InstSimplify: Restore optimizations lost in r210006 This restores our ability to optimize: (X & C) ? X & ~C : X into X & ~C (X & C) ? X : X & ~C into X (X & C) ? X \| C : X into X (X & C) ? X : X \| C into X \| C llvm-svn: 222868	2014-11-27 06:32:46 +00:00
Lang Hames	3f06ba3b2f	[MCJIT] Remove the local symbol table from RuntimeDlyd - it's not needed. All symbols have to be stored in the global symbol to enable cross-rtdyld-instance linking, so the local symbol table content is redundant. llvm-svn: 222867	2014-11-27 05:40:13 +00:00
Lang Hames	63b80f5e8f	[MCJIT] Replace JITEventListener::anchor (temporarily removed in r222861), and move GDBRegistrationListener into ExecutionEngine to avoid layering violation. llvm-svn: 222864	2014-11-27 01:41:16 +00:00
Lang Hames	8af711e954	[MCJIT] Remove JITEventListener's anchor until I can determine the right place to put it. This should unbreak the Mips bots. llvm-svn: 222861	2014-11-27 00:15:28 +00:00
Lang Hames	7d4e7a1733	[MCJIT] Move get-any-symbol-load-address logic out of RuntimeDyld and into RuntimeDyldChecker. RuntimeDyld instances should only provide lookup for locally defined symbols. llvm-svn: 222859	2014-11-27 00:12:28 +00:00
David Majnemer	7e9b94486e	Revert "Added inst combine transforms for single bit tests from Chris's note" This reverts commit r210006, it miscompiled libapr which is used in who knows how many projects. A test has been added to ensure that we don't regress again. I'll work on a rewrite of what the optimization was trying to do later. llvm-svn: 222856	2014-11-26 23:00:38 +00:00
Rui Ueyama	ab9d8be85c	Object/COFF: Fix off-by-one error for object having lots of relocations llvm-objdump printed out an error message for this off-by-one error, but because it always exits with 0 whether or not it found an error, the test (llvm-objdump/coff-many-relocs.test) succeeded. I made llvm-objdump exit with EXIT_FAILURE when an error is found. llvm-svn: 222852	2014-11-26 22:17:25 +00:00
Matt Arsenault	796e0c24e7	R600/SI: Use ZeroOrNegativeOneBooleanContent This sort of doesn't matter since the setcc type is i1, but this previously was using the default UndefinedBooleanContent. This makes it more consistent with R600. This enables more optimizations which typically give up on UndefinedBooleanContent. For example, there is already a special case target DAG combine for setcc + sext which can be eliminated in favor of what the generic DAG combiner can do if it assumes boolean values are sign extended. Since -1 is an inline immediate, using it is basically free and the backend already uses it when a boolean value is needed in a wider type. llvm-svn: 222850	2014-11-26 21:23:15 +00:00
Colin LeMahieu	0872710917	[Hexagon] Adding cmp* immediate form instructions. llvm-svn: 222849	2014-11-26 19:43:12 +00:00
Jozef Kolek	ecfa20e7f7	[mips][microMIPS] Implement disassembler support for 16-bit instructions LBU16, LHU16, LW16, SB16, SH16 and SW16 Differential Revision: http://reviews.llvm.org/D6405 llvm-svn: 222847	2014-11-26 18:56:38 +00:00
Colin LeMahieu	1f8cdbb85d	[Hexagon] Adding and64, or64, and xor64 instructions. llvm-svn: 222846	2014-11-26 18:55:59 +00:00
Matt Arsenault	b4b33cf9de	R600/SI: Create e64 versions of and/or/xor in SILowerI1Copies This fixes moving boolean constants into registers before operating on them. They get permuted and shrunk down to e32 anyway later. This is a temporary fix until the patch that removes these pseudos is committed. llvm-svn: 222844	2014-11-26 18:18:28 +00:00
Lang Hames	b0f5c0e15c	[MCJIT] Fix missing return statement. llvm-svn: 222841	2014-11-26 17:21:41 +00:00
Lang Hames	787f3dbf64	[MCJIT] Reapply r222828 and r222810-r222812 with fix for MSVC move-op issues. llvm-svn: 222840	2014-11-26 16:54:40 +00:00
Aaron Ballman	18c93c9af6	Reverting r222828 and r222810-r222812 as they broke the build on Windows. http://bb.pgr.jp/builders/ninja-clang-i686-msc17-R/builds/11753 llvm-svn: 222833	2014-11-26 15:27:39 +00:00
Aaron Ballman	011e7b3c54	Removing a spurious semicolon; NFC llvm-svn: 222830	2014-11-26 13:55:55 +00:00
Evgeniy Stepanov	f853256b9e	Add missing "override". Fixes compilation failure in r222810. llvm-svn: 222828	2014-11-26 12:26:03 +00:00
Will Newton	da953f13b9	Update AArch64 ELF relocations to ABI 1.0 This mostly entails adding relocations, however there are a couple of changes to existing relocations: 1. R_AARCH64_NONE is defined to be zero rather than 256 R_AARCH64_NONE has been defined to be zero for a long time elsewhere e.g. binutils and glibc since the submission of the AArch64 port in 2012 so this is required for compatibility. 2. R_AARCH64_TLSDESC_ADR_PAGE renamed to R_AARCH64_TLSDESC_ADR_PAGE21 I don't think there is any way for relocation names to leak out of LLVM so this should not break anything. Tested with check-all with no regressions. llvm-svn: 222821	2014-11-26 10:49:18 +00:00
Elena Demikhovsky	868b76ae69	AVX-512: Scalar ERI intrinsics including SAE mode and memory operand. Added AVX512_maskable_scalar template, that should cover all scalar instructions in the future. The main difference between AVX512_maskable_scalar<> and AVX512_maskable<> is using X86select instead of vselect. I need it, because I can't create vselect node for MVT::i1 mask for scalar instruction. http://reviews.llvm.org/D6378 llvm-svn: 222820	2014-11-26 10:46:49 +00:00
Lang Hames	df26c1b59a	[MCJIT] Re-enable GDB registration (temporarily disabled in r222811), but check that we actually have an object to register first. For MachO objects, RuntimeDyld::LoadedObjectInfo::getObjectForDebug returns an empty OwningBinary<ObjectFile> which was causing crashes in the GDB registration code. llvm-svn: 222812	2014-11-26 07:39:03 +00:00
Lang Hames	fdbbc5eefc	[MCJIT] Temporarily disable automatic JIT debugger registration. The RuntimeDyld cleanup patch r222810 turned on GDB registration for MachO objects. I expected this to be harmless, but it seems to have broken on MacsOS. Temporarily disabling debugger registration while I dig in to what's gone wrong. llvm-svn: 222811	2014-11-26 07:25:26 +00:00
Lang Hames	ad18dbbb5b	[MCJIT] Clean up RuntimeDyld's quirky object-ownership/modification scheme. Previously, when loading an object file, RuntimeDyld (1) took ownership of the ObjectFile instance (and associated MemoryBuffer), (2) potentially modified the object in-place, and (3) returned an ObjectImage that managed ownership of the now-modified object and provided some convenience methods. This scheme accreted over several years as features were tacked on to RuntimeDyld, and was both unintuitive and unsafe (See e.g. http://llvm.org/PR20722). This patch fixes the issue by removing all ownership and in-place modification of object files from RuntimeDyld. Existing behavior, including debugger registration, is preserved. Noteworthy changes include: (1) ObjectFile instances are now passed to RuntimeDyld by const-ref. (2) The ObjectImage and ObjectBuffer classes have been removed entirely, they existed to model ownership within RuntimeDyld, and so are no longer needed. (3) RuntimeDyld::loadObject now returns an instance of a new class, RuntimeDyld::LoadedObjectInfo, which can be used to construct a modified object suitable for registration with the debugger, following the existing debugger registration scheme. (4) The JITRegistrar class has been removed, and the GDBRegistrar class has been re-written as a JITEventListener. This should fix http://llvm.org/PR20722 . llvm-svn: 222810	2014-11-26 06:53:26 +00:00
Craig Topper	0734168db8	Replace neverHasSideEffects=1 with hasSideEffects=0 in all .td files. llvm-svn: 222801	2014-11-26 00:46:26 +00:00
Simon Pilgrim	0e1c44b939	[X86][SSE] Improvements to byte shift shuffle matching Since (v)pslldq / (v)psrldq instructions resolve to a single input argument it is useful to match it much earlier than we currently do - this prevents more complicated shuffles (notably insertion into a zero vector) matching before it. Differential Revision: http://reviews.llvm.org/D6409 llvm-svn: 222796	2014-11-25 22:34:59 +00:00
Colin LeMahieu	535f692186	[Hexagon] Adding add64 and sub64 instructions. llvm-svn: 222795	2014-11-25 22:15:44 +00:00
Colin LeMahieu	732a0febbc	Reverting 222792 llvm-svn: 222793	2014-11-25 21:39:57 +00:00
Colin LeMahieu	bf565f821c	[Hexagon] Adding compare with immediate instructions. llvm-svn: 222792	2014-11-25 21:30:28 +00:00
Colin LeMahieu	dbc4cca085	[Hexagon] Adding NOP encoding bits. llvm-svn: 222791	2014-11-25 21:23:07 +00:00
Matt Arsenault	36581167b0	R600/SI: Only use one DEBUG() llvm-svn: 222789	2014-11-25 21:03:22 +00:00
Cameron McInally	c32dadfa69	[AVX512] Add 512b integer shift by variable intrinsics and patterns. llvm-svn: 222786	2014-11-25 20:41:51 +00:00
Colin LeMahieu	9b0719975d	[Hexagon] Adding C2_mux instruction. llvm-svn: 222784	2014-11-25 20:20:09 +00:00
Craig Topper	3ac6865792	Remove space before tab in all AVX512 mnemonic strings. llvm-svn: 222778	2014-11-25 20:11:23 +00:00
Colin LeMahieu	4a42d4abd9	[Hexagon] Replacing cmp* instructions with ones that contain encoding bits. llvm-svn: 222771	2014-11-25 18:20:52 +00:00
Hans Wennborg	333795bf71	LazyValueInfo: Actually re-visit partially solved block-values in solveBlockValue() If solveBlockValue() needs results from predecessors that are not already computed, it returns false with the intention of resuming when the dependencies have been resolved. However, the computation would never be resumed since an 'overdefined' result had been placed in the cache, preventing any further computation. The point of placing the 'overdefined' result in the cache seems to have been to break cycles, but we can check for that when inserting work items in the BlockValue stack instead. This makes the "stop and resume" mechanism of solveBlockValue() work as intended, unlocking more analysis. Using this patch shaves 120 KB off a 64-bit Chromium build on Linux. I benchmarked compiling bzip2.c at -O2 but couldn't measure any difference in compile time. Tests by Jiangning Liu from r215343 / PR21238, Pete Cooper, and me. Differential Revision: http://reviews.llvm.org/D6397 llvm-svn: 222768	2014-11-25 17:23:05 +00:00
Rafael Espindola	6791ceb3c6	Set the body of a new struct as soon as it is created. This changes the order in which different types are passed to get, but one order is not inherently better than the other. The main motivation is that this simplifies linkDefinedTypeBodies now that it is only linking "real" opaque types. It is also means that we only have to call it once and that we don't need getImpl. A small change in behavior is that we don't copy type names when resolving opaque types. This is an improvement IMHO, but it can be added back if desired. A test is included with the new behavior. llvm-svn: 222764	2014-11-25 15:33:40 +00:00
Evgeniy Stepanov	930e5bcfb5	[msan] Annotate zlib functions for MemorySanitizer. Mark destination buffer in zlib::compress and zlib::decompress as fully initialized. When building LLVM with system zlib and MemorySanitizer instrumentation, MSan does not observe memory writes in zlib code and erroneously considers zlib output buffers as uninitialized, resulting in false use-of-uninitialized memory reports. This change helps MSan understand the state of that memory and prevents such reports. llvm-svn: 222763	2014-11-25 15:24:07 +00:00
Rafael Espindola	004800f30f	Misc style fixes. NFC. This just reduces the noise in the next patch. llvm-svn: 222761	2014-11-25 14:35:53 +00:00
Joerg Sonnenberger	829c958371	Reapply 222538 and update tests to explicitly request small code model and PIC: Allow FDE references outside the +/-2GB range supported by PC relative offsets for code models other than small/medium. For JIT application, memory layout is less controlled and can result in truncations otherwise. Patch from Akos Kiss. Differential Revision: http://reviews.llvm.org/D6079 llvm-svn: 222760	2014-11-25 13:37:55 +00:00
Rafael Espindola	4e69143a49	Remove a bit of duplicated code. Exactly the same checks are present in areTypesIsomorphic. This might have been a premature performance optimization. I cannot reproduce any slowdown with this patch. llvm-svn: 222758	2014-11-25 13:19:46 +00:00
Chandler Carruth	5a24aaefeb	Revert r222746: That commit did not update any tests and caused two R600 tests to start failing. Original commit log: R600/SI: Disable commutativity for MIN/MAX_LEGACY llvm-svn: 222753	2014-11-25 10:50:41 +00:00
Zoran Jovanovic	c3664f6f8a	[mips][micromips] Use call instructions with short delay slots Differential Revision: http://reviews.llvm.org/D6338 llvm-svn: 222752	2014-11-25 10:50:00 +00:00
Chandler Carruth	6264bc6537	[InstCombine] Change LLVM To canonicalize toward the value type being stored rather than the pointer type. This change is analogous to r220138 which changed the canonicalization for loads. The rationale is the same: memory does not have a type, operations (and thus the values they produce) have a type. We should match that type as closely as possible rather than reading some form of semantics into the pointer type. With this change, loads and stores should no longer be made with nonsensical types for the values that tehy load and store. This is particularly important when trying to match specific loaded and stored types in the process of doing other instcombines, which is what led me down this twisty maze of miscanonicalization. I've put quite some effort into looking through IR to find places where LLVM's optimizer was being unreasonably conservative in the face of mismatched load and store types, however it is possible (let's say, likely!) I have missed some. If you see regressions here, or from r220138, the likely cause is some part of LLVM failing to cope with load and store types differing. Test cases appreciated, it is important that we root all of these out of LLVM. llvm-svn: 222748	2014-11-25 10:09:51 +00:00
Marek Olsak	c593ef1ab1	R600/SI: Disable commutativity for MIN/MAX_LEGACY llvm-svn: 222746	2014-11-25 09:49:23 +00:00
Chandler Carruth	7feb19d89c	Revert r220349 to re-instate r220277 with a fix for PR21330 -- quite clearly only exactly equal width ptrtoint and inttoptr casts are no-op casts, it says so right there in the langref. Make the code agree. Original log from r220277: Teach the load analysis to allow finding available values which require inttoptr or ptrtoint cast provided there is datalayout available. Eventually, the datalayout can just be required but in practice it will always be there today. To go with the ability to expose available values requiring a ptrtoint or inttoptr cast, helpers are added to perform one of these three casts. These smarts are necessary to finish canonicalizing loads and stores to the operational type requirements without regressing fundamental combines. I've added some test cases. These should actually improve as the load combining and store combining improves, but they may fundamentally be highlighting some missing combines for select in addition to exercising the specific added logic to load analysis. llvm-svn: 222739	2014-11-25 08:20:27 +00:00
Matt Arsenault	14d278bdec	R600/SI: Fix allocating flat_scr_lo / flat_scr_hi Only the super register flat_scr was marked as reserved, so in some cases with high register usage it would still try to allocate the subregisters. llvm-svn: 222737	2014-11-25 07:53:06 +00:00
David Majnemer	3f7ae9c4d6	COFF: Add back an assertion that is superseded by r222124 llvm-svn: 222735	2014-11-25 07:43:14 +00:00
Rafael Espindola	4b6af9d891	Use a range loop. NFC. llvm-svn: 222730	2014-11-25 06:16:27 +00:00
Rafael Espindola	81e0387f70	Style fix: don't indent inside a namemespace. llvm-svn: 222729	2014-11-25 06:11:24 +00:00
Rafael Espindola	edb5344434	Remove a nested anonymous namespace. llvm-svn: 222728	2014-11-25 06:07:51 +00:00
Rafael Espindola	34bccf8e6c	Fix overly aggressive type merging. If we find out that two types are not isomorphic, we learn nothing about opaque sub types in both the source and destination. llvm-svn: 222727	2014-11-25 05:59:24 +00:00
Rafael Espindola	906b92c9d0	Link the type of aliases. They are not more or less "well typed" than GlobalVariables. llvm-svn: 222725	2014-11-25 04:43:59 +00:00
Rafael Espindola	2f4773263d	Don't repeat name in comment or duplicate comment. NFC. llvm-svn: 222724	2014-11-25 04:28:31 +00:00
Rafael Espindola	654b3a4863	Use range loops. NFC. llvm-svn: 222723	2014-11-25 04:26:19 +00:00
Juergen Ributzka	c90ddb75a2	[FastISel][AArch64] Fix and extend the tbz/tbnz pattern matching. The pattern matching failed to recognize all instances of "-1", because when comparing against "-1" we didn't use an APInt of the same bitwidth. This commit fixes this and also adds inverse versions of the conditon to catch more cases. llvm-svn: 222722	2014-11-25 04:16:15 +00:00
David Majnemer	4e5c0f46f5	InstSimplify: Handle some simple tautological comparisons This handles cases where we are comparing a masked value against itself. The analysis could be further improved by making it recursive but such expense is not currently justified. llvm-svn: 222716	2014-11-25 02:55:48 +00:00
David Blaikie	de835e646d	Revert "unique_ptrify LLVMContextImpl::CAZConstants" Missed the complexities of how these elements are destroyed. This reverts commit r222714. llvm-svn: 222715	2014-11-25 02:26:22 +00:00
David Blaikie	14dc3c5b02	unique_ptrify LLVMContextImpl::CAZConstants llvm-svn: 222714	2014-11-25 02:13:54 +00:00
Hal Finkel	d19aa4cdf8	[PowerPC] Add the 'attn' instruction The attn instruction is not part of the Power ISA, but is documented in the A2 user manual, and is accepted by the GNU assembler for the A2 and the POWER4+. Reported as part of PR21650. llvm-svn: 222712	2014-11-25 00:30:11 +00:00
Hal Finkel	515f6e50f5	[PowerPC] Implement combineRepeatedFPDivisors This does not matter on newer cores (where we can use reciprocal estimates in fast-math mode anyway), but for older cores this allows us to generate better fast-math code where we have multiple FDIVs with a common divisor. llvm-svn: 222710	2014-11-24 23:45:21 +00:00
Philip Reames	f22f53238c	Factor check for the assume intrinsic out of checks in computeKnownBitsFromAssume We were matching against the assume intrinsic in every check. Since we know that it must be an assume, this is just wasted work. Somewhat surprisingly, matching an intrinsic id is actually relatively expensive. It devolves to a string construction and comparison in Function::isIntrinsic. I originally spotted this because it showed up in a performance profile of my compiler. I've since discovered a separate issue which seems to be the actual root cause, but this is minor perf goodness regardless. I'm likely to follow up with another change to factor out the comparison matching. There's no need to match the compare instruction in every single one of the tests. Differential Revision: http://reviews.llvm.org/D6312 llvm-svn: 222709	2014-11-24 23:44:28 +00:00
Philip Reames	83a1682665	Incorporate review comments from r221742 This change implements the comment and style changes Sean requested during post commit review with r221742. Sorry for the delay. llvm-svn: 222707	2014-11-24 23:24:24 +00:00
Matt Arsenault	454e837bd2	Bug 21610: Canonicalize min/max fcmp selects to use ordered comparisons llvm-svn: 222705	2014-11-24 23:15:18 +00:00
Rafael Espindola	ed91a36cd8	Remove the unused FindUsedTypes pass. It was dead since r134829. llvm-svn: 222684	2014-11-24 20:53:26 +00:00
Rafael Espindola	5cee6ee598	Add and use Type::subtypes. NFC. llvm-svn: 222682	2014-11-24 20:44:36 +00:00
Chad Rosier	17cb0c630f	[AArch64] Fix clobber computation in A57LoadBalancing pass. Extremely difficult to reproduce, so no test case included. PR21637 llvm-svn: 222677	2014-11-24 18:57:58 +00:00
Colin LeMahieu	c450f360d3	Removing unused variable. llvm-svn: 222676	2014-11-24 18:55:32 +00:00
Kostya Serebryany	cb8f8175c9	[asan/coverage] change the way asan coverage instrumentation is done: instead of setting the guard to 1 in the generated code, pass the pointer to guard to __sanitizer_cov and set it there. No user-visible functionality change expected llvm-svn: 222675	2014-11-24 18:49:53 +00:00
Ulrich Weigand	24b899d017	[PowerPC] Fix PR 21652 - copy st_other bits on symbol assignment When processing an assignment in the integrated assembler that sets a symbol to the value of another symbol, we need to copy the st_other bits that encode the local entry point offset. Modeled after MipsTargetELFStreamer::emitAssignment handling of the ELF::STO_MIPS_MICROMIPS flag. llvm-svn: 222672	2014-11-24 18:09:47 +00:00
Paul Robinson	a98fe04f95	More long path name support on Windows, this time in program execution. Allows long paths for the executable and redirected stdin/stdout/stderr. Addresses PR21563. llvm-svn: 222671	2014-11-24 18:05:29 +00:00
Colin LeMahieu	1dc5a7ca7c	[Hexagon] Adding asrh instruction, removing unused multiclasses. llvm-svn: 222670	2014-11-24 18:04:42 +00:00
Colin LeMahieu	b3868ebb81	[Hexagon] Adding aslh instruction. llvm-svn: 222668	2014-11-24 17:44:19 +00:00
Colin LeMahieu	e1cd9ff6b5	[Hexagon] Adding zxth instruction. llvm-svn: 222662	2014-11-24 17:11:34 +00:00
Colin LeMahieu	80e59674e9	[Hexagon] Adding zxtb instruction. llvm-svn: 222660	2014-11-24 16:48:43 +00:00
David Majnemer	291966cd3b	InstCombine: Don't create an unused instruction We would create an instruction but not inserting it. Not inserting the unused instruction would lead us to verification failure. This fixes PR21653. llvm-svn: 222659	2014-11-24 16:41:13 +00:00
Jozef Kolek	a4e87d7a74	[mips][microMIPS] Fix JRADDIUSP instruction Fix JRADDIUSP instruction, remove delay slot flag because this instruction doesn't have delay slot. Differential Revision: http://reviews.llvm.org/D6365 llvm-svn: 222658	2014-11-24 16:14:10 +00:00
Jozef Kolek	dd0dbf282b	[mips][microMIPS] Implement LBU16, LHU16, LW16, SB16, SH16 and SW16 instructions Differential Revision: http://reviews.llvm.org/D5122 llvm-svn: 222653	2014-11-24 14:39:13 +00:00
Jozef Kolek	95061e7de1	[mips][microMIPS] Implement 16-bit instructions registers including ZERO instead of S0 Implement microMIPS 16-bit instructions register set: $0, $2-$7 and $17. Differential Revision: http://reviews.llvm.org/D5780 llvm-svn: 222652	2014-11-24 14:25:53 +00:00
Aaron Ballman	4f29ba5173	Removing a variable that is initialized but never read. The original author has been alerted to the warning, in case this variable is meant to be used. Fixes -Werror builds in the meantime. llvm-svn: 222649	2014-11-24 14:03:16 +00:00
Jozef Kolek	0d926f8fc9	[mips][microMIPS] Implement disassembler support for 16-bit instructions With the help of new method readInstruction16() two bytes are read and decodeInstruction() is called with DecoderTableMicroMips16, if this fails four bytes are read and decodeInstruction() is called with DecoderTableMicroMips32. Differential Revision: http://reviews.llvm.org/D6149 llvm-svn: 222648	2014-11-24 13:29:59 +00:00
Andrea Di Biagio	3646b17160	[X86] Improved target specific combine on VSELECT dag nodes. This patch teaches function 'transformVSELECTtoBlendVECTOR_SHUFFLE' how to convert VSELECT dag nodes to shuffles on targets that do not have SSE4.1. On pre-SSE4.1 targets, we can still perform blend operations using movss/movsd. Also, removed a target specific combine that performed a premature lowering of VSELECT nodes to target specific MOVSS/MOVSD nodes. llvm-svn: 222647	2014-11-24 12:23:15 +00:00
David Majnemer	2445c6caf5	InstCombine: Don't assume DataLayout is always available We tried to get the result of DataLayout::getLargestLegalIntTypeSize but we didn't have a DataLayout. This resulted in opt crashing. This fixes PR21651. llvm-svn: 222645	2014-11-24 07:26:20 +00:00
Elena Demikhovsky	25f6c9047c	Converted back to Unix format (after my last commit 222632) llvm-svn: 222636	2014-11-23 15:21:53 +00:00
Michael Kuperstein	b12b19a24a	[X86] Fixes bug in build_vector v4x32 lowering r222375 made some improvements to build_vector lowering of v4x32 and v4xf32 into an insertps, but it missed a case where: 1. A single extracted element is used twice. 2. The lower of the two non-zero indexes should be preserved, and the higher should be used for the dest mask. This caused a crash, since the source value for the insertps ends-up uninitialized. Differential Revision: http://reviews.llvm.org/D6377 llvm-svn: 222635	2014-11-23 13:09:06 +00:00
Craig Topper	22f2dfbc9f	Add missing override keywords. llvm-svn: 222634	2014-11-23 09:40:13 +00:00
Elena Demikhovsky	36a2243ab7	Masked Vector Load and Store Intrinsics. Introduced new target-independent intrinsics in order to support masked vector loads and stores. The loop vectorizer optimizes loops containing conditional memory accesses by generating these intrinsics for existing targets AVX2 and AVX-512. The vectorizer asks the target about availability of masked vector loads and stores. Added SDNodes for masked operations and lowering patterns for X86 code generator. Examples: <16 x i32> @llvm.masked.load.v16i32(i8* %addr, <16 x i32> %passthru, i32 4 /* align /, <16 x i1> %mask) declare void @llvm.masked.store.v8f64(i8 %addr, <8 x double> %value, i32 4, <8 x i1> %mask) Scalarizer for other targets (not AVX2/AVX-512) will be done in a separate patch. http://reviews.llvm.org/D6191 llvm-svn: 222632	2014-11-23 08:07:43 +00:00
Matt Arsenault	1b03538afe	R600: Fix extloads of i1 on R600/Evergreen llvm-svn: 222631	2014-11-23 02:57:54 +00:00
Matt Arsenault	417f5ceb20	R600: Fix assert on copy of an i1 on pre-SI i1 is not a legal type on Evergreen, so this combine proceeded and tried to produce a bitcast between i1 and i8. llvm-svn: 222630	2014-11-23 02:57:52 +00:00
David Majnemer	0b413925f3	InstCombine: Propagate exact for (sdiv X, Pow2) -> (udiv X, Pow2) llvm-svn: 222625	2014-11-22 20:00:41 +00:00
David Majnemer	ba33e07fad	InstCombine: Propagate exact for (sdiv X, Y) -> (udiv X, Y) llvm-svn: 222624	2014-11-22 20:00:38 +00:00
David Majnemer	e3d9e29780	InstCombine: Propagate exact for (sdiv -X, C) -> (sdiv X, -C) llvm-svn: 222623	2014-11-22 20:00:34 +00:00
Simon Pilgrim	815bb3182b	Tidied up target triple OS detection. NFC Use Triple::isOS*() helper functions where possible. llvm-svn: 222622	2014-11-22 19:12:10 +00:00
David Majnemer	23e1540ef9	InstCombine: Propagate exact in (udiv (lshr X,C1),C2) -> (udiv x,C1<<C2) llvm-svn: 222620	2014-11-22 18:16:54 +00:00
Chandler Carruth	3e70f9f348	[x86] Teach the vector shuffle yet another step of canonicalization. No functionality changed yet, but this will prevent subsequent patches from having to handle permutations of various interleaved shuffle patterns. llvm-svn: 222614	2014-11-22 09:18:53 +00:00
David Majnemer	1847177b9b	InstCombine: Propagate NSW/NUW for X*(1<<Y) -> X<<Y llvm-svn: 222613	2014-11-22 08:57:02 +00:00
David Majnemer	3c7153d5d6	InstCombine: Propagate NSW for -X * -Y -> X * Y llvm-svn: 222612	2014-11-22 07:25:19 +00:00
David Majnemer	26583aff1f	InstSimplify: Simplify (sub 0, X) -> X if it's NUW This is a generalization of the X - (0 - Y) -> X transform. llvm-svn: 222611	2014-11-22 07:15:16 +00:00
David Majnemer	6b5df7ef8d	InstCombine: Silence a parenthesis warning llvm-svn: 222609	2014-11-22 06:09:28 +00:00
David Majnemer	c405b87f53	InstCombine: Preserve nsw when folding X*(2^C) -> X << C llvm-svn: 222606	2014-11-22 04:52:55 +00:00
David Majnemer	96d9c67b69	InstCombine: Preserve nsw/nuw for ((X << C2)C1) -> (X (C1 << C2)) llvm-svn: 222605	2014-11-22 04:52:52 +00:00
David Majnemer	6191590b23	InstCombine: Preserve nsw for (mul %V, -1) -> (sub 0, %V) llvm-svn: 222604	2014-11-22 04:52:38 +00:00
Gerolf Hoflehner	cb87bd4853	[InstCombine] Re-commit of r218721 (Optimize icmp-select-icmp sequence) Fixes the self-host fail. Note that this commit activates dominator analysis in the combiner by default (like the original commit did). llvm-svn: 222590	2014-11-21 23:36:44 +00:00
Joerg Sonnenberger	00a4fe60d0	Fix transformation of add with pc argument to adr for non-immediate arguments. llvm-svn: 222587	2014-11-21 22:39:34 +00:00
Kostya Serebryany	ec6bd28ded	[asan] remove old experimental code llvm-svn: 222586	2014-11-21 22:34:29 +00:00
Tom Stellard	cfd2fce8a1	R600/SI: Add an s_mov_b32 to patterns which use the M0RegClass We need to use a s_mov_b32 rather than a copy, so that CSE will eliminate redundant moves to the m0 register. llvm-svn: 222584	2014-11-21 22:31:46 +00:00
Tom Stellard	a112fe4e40	R600/SI: Emit s_mov_b32 m0, -1 before every DS instruction This s_mov_b32 will write to a virtual register from the M0Reg class and all the ds instructions now take an extra M0Reg explicit argument. This change is necessary to prevent issues with the scheduler mixing together instructions that expect different values in the m0 registers. llvm-svn: 222583	2014-11-21 22:31:44 +00:00
Tom Stellard	484f10138e	R600/SI: Add SIFoldOperands pass This pass attempts to fold the source operands of mov and copy instructions into their uses. llvm-svn: 222581	2014-11-21 22:06:37 +00:00
Jozef Kolek	52fa965cf8	[mips][microMIPS] This patch implements functionality in MIPS delay slot filler such as if delay slot filler have to put NOP instruction into the delay slot of microMIPS BEQ or BNE instruction which uses the register $0, then instead of emitting NOP this instruction is replaced by the corresponding microMIPS compact branch instruction, i.e. BEQZC or BNEZC. Differential Revision: http://reviews.llvm.org/D3566 llvm-svn: 222580	2014-11-21 22:04:35 +00:00
Tom Stellard	b76305ec11	R600/SI: Mark s_mov_b32 and s_mov_b64 as rematerializable llvm-svn: 222579	2014-11-21 22:00:16 +00:00
Colin LeMahieu	4986bc53c5	[Hexagon] Adding sxth instruction. llvm-svn: 222577	2014-11-21 21:54:59 +00:00
Colin LeMahieu	9a7b747bf6	[Hexagon] Adding sxtb instruction. Renaming some identically named classes that will be removed after converting referencing defs. llvm-svn: 222575	2014-11-21 21:35:52 +00:00
Kostya Serebryany	c172ff4b3e	[asan] add statistic counter to dynamic alloca instrumentation llvm-svn: 222573	2014-11-21 21:25:18 +00:00
Colin LeMahieu	6e2ce8815f	[Hexagon] Removing SUB_rr and replacing with A2_sub. llvm-svn: 222571	2014-11-21 21:19:18 +00:00
Tim Northover	42401484d7	Remove duplication of relocation names in lib/Object/ELFYAML.cpp We can now use the ELF relocation .def files to create the mapping of relocation numbers to names and avoid having to duplicate the list of relocations. Patch by Will Newton. llvm-svn: 222567	2014-11-21 20:16:09 +00:00
Tim Northover	a8336c7a53	Remove duplication of relocation names in lib/Object/ELF.cpp We can now use the ELF relocation .def files to create the mapping of relocation numbers to names and avoid having to duplicate the list of relocations. Patch by Will Newton. llvm-svn: 222566	2014-11-21 20:16:07 +00:00
Manman Ren	8be1069f3f	Debug Info: revert r222195, r222210 and r222239. This is no longer needed after David's fix at r222377 + r222485. rdar://18958417 llvm-svn: 222563	2014-11-21 19:55:23 +00:00
Roman Divacky	de854ff9cd	Disable header duplication at -Oz in loop-rotate pass. llvm-svn: 222562	2014-11-21 19:53:24 +00:00
Manman Ren	ff5753e1f2	Debug Info: add an assertion that the context field of a global variable can not be a DIType with identifier. This makes sure that there is no need to use DIScopeRef for global variable's context. rdar://18958417 llvm-svn: 222561	2014-11-21 19:47:48 +00:00
Manman Ren	ea36e798d4	[Objective-C] Support a new special module flag that will be put into the objc_imageinfo struct. rdar://17954668 llvm-svn: 222558	2014-11-21 19:24:55 +00:00
Hans Wennborg	ffb28ee503	LazyValueInfo: range'ify some for-loops. No functional change. llvm-svn: 222557	2014-11-21 19:07:46 +00:00
Rafael Espindola	f10986a833	Add params() to FunctionType. NFC. While at it, also use makeArrayRef in elements(). llvm-svn: 222556	2014-11-21 19:03:35 +00:00
Sanjay Patel	5d493f5d01	Don't repeat class/function/variable names in comments. NFC. llvm-svn: 222555	2014-11-21 18:58:38 +00:00
Hans Wennborg	e827b4f0ff	LazyValueInfo: fix some typos and indentation, etc. NFC. llvm-svn: 222554	2014-11-21 18:58:23 +00:00
Rafael Espindola	798ac6c06b	Add and use a helper elements() to StructType. NFC. llvm-svn: 222553	2014-11-21 18:53:05 +00:00
Matthias Braun	cfe609e473	Allow multiple -debug-only args Debug output is shown if any of the -debug-only arguments match. llvm-svn: 222547	2014-11-21 18:06:09 +00:00
Sanjay Patel	e65f60a9c9	Less space; NFC llvm-svn: 222546	2014-11-21 18:05:59 +00:00
Sanjay Patel	776e5485fb	Add a feature flag for slow 32-byte unaligned memory accesses [x86]. This patch adds a feature flag to avoid unaligned 32-byte load/store AVX codegen for Sandy Bridge and Ivy Bridge. There is no functionality change intended for those chips. Previously, the absence of AVX2 was being used as a proxy to detect this feature. But that hindered codegen for AVX-enabled AMD chips such as btver2 that do not have the 32-byte unaligned access slowdown. Performance measurements are included in PR21541 ( http://llvm.org/bugs/show_bug.cgi?id=21541 ). Differential Revision: http://reviews.llvm.org/D6355 llvm-svn: 222544	2014-11-21 17:40:04 +00:00
Duncan P. N. Exon Smith	924cca4044	Revert "Allow FDE references outside the +/-2GB range supported by PC relative offsets for code models other than small/medium. For JIT application, memory layout is less controlled and can result in truncations otherwise." This reverts commit r222538. It's causing test failures for CFI, at least on Darwin: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental/1189/ http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_check/1391/ Note that the previous incremental build was on r222537, and the CFI tests weren't failing: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental/1188/ llvm-svn: 222542	2014-11-21 17:21:18 +00:00
Chandler Carruth	5e598c0342	[x86] Restructure the checking patterns for v16 and v32 avx2 vector shuffle lowering to allow much better blend matching. Specifically, with the new structure the code seems clearer to me and we correctly can hit the cases where merging two 128-bit lanes is a clear win and can be shuffled cheaply afterward. llvm-svn: 222539	2014-11-21 14:53:03 +00:00
Joerg Sonnenberger	2047b62087	Allow FDE references outside the +/-2GB range supported by PC relative offsets for code models other than small/medium. For JIT application, memory layout is less controlled and can result in truncations otherwise. Patch from Akos Kiss. Differential Revision: http://reviews.llvm.org/D6079 llvm-svn: 222538	2014-11-21 14:42:43 +00:00
Chandler Carruth	7491f1f32f	[x86] Make the previous logic significantly less conservative and get a bunch more improvements. Non-lane-crossing is fine, the key is that lane merging only makes sense for single-input shuffles. Not sure why I got so turned around here. The code all works, I was just using the wrong model for it. This only updates v4 and v8 lowering. The v16 and v32 lowering requires restructuring the entire check sequence. llvm-svn: 222537	2014-11-21 14:33:24 +00:00
Andrea Di Biagio	0a8cf1ad5a	[DAG] Teach how to turn a build_vector into a shuffle if some of the operands are zero. Before this patch, the DAGCombiner only tried to convert build_vector dag nodes into shuffles if all operands were either extract_vector_elt or undef. This patch improves that logic and teaches the DAGCombiner how to deal with build_vector dag nodes where one or more operands are zero. A build_vector dag node with some zero operands is turned into a shuffle only if the resulting shuffle mask is legal for the target. llvm-svn: 222536	2014-11-21 14:32:06 +00:00
Chandler Carruth	8387bec088	[x86] Teach the x86 vector shuffle lowering to detect mergable 128-bit lanes. By special casing these we can often either reduce the total number of shuffles significantly or reduce the number of (high latency on Haswell) AVX2 shuffles that potentially cross 128-bit lanes. Even when these don't actually cross lanes, they have much higher latency to support that. Doing two of them and a blend is worse than doing a single insert across the 128-bit lanes to blend and then doing a single interleaved shuffle. While this seems like a narrow case, it kept cropping up on me and the difference is huge as you can see in many of the test cases. I first hit this trying to perfectly fix the interleaving shuffle patterns used by Halide for AVX2. llvm-svn: 222533	2014-11-21 13:56:05 +00:00
Andrea Di Biagio	9c99df5e6c	[DAG] Refactor the shuffle combining logic in DAGCombiner. NFC. This patch simplifies the logic that combines a pair of shuffle nodes into a single shuffle if there is a legal mask. Also added comments to better describe the algorithm. No functional change intended. llvm-svn: 222522	2014-11-21 11:33:07 +00:00
Alexey Volkov	235268b4ed	[X86] For Silvermont CPU use 16-bit division instead of 64-bit for small positive numbers Differential Revision: http://reviews.llvm.org/D5938 llvm-svn: 222521	2014-11-21 11:19:34 +00:00
Yury Gribov	cb671c0b2c	[asan] Add new hidden compile-time flag asan-instrument-allocas to sanitize variable-sized dynamic allocas. Patch by Max Ostapenko. Reviewed at http://reviews.llvm.org/D6055 llvm-svn: 222519	2014-11-21 10:29:50 +00:00
NAKAMURA Takumi	3cd455da60	Add LLVMScalarOpts to LLVMPowerPCCodeGen. llvm-svn: 222516	2014-11-21 09:14:45 +00:00
Hao Liu	9cb82be410	DAGCombiner: Allow the DAGCombiner to combine multiple FDIVs with the same divisor info FMULs by the reciprocal. E.g., ( a / D; b / D ) -> ( recip = 1.0 / D; a * recip; b * recip) A hook is added to allow the target to control whether it needs to do such combine. Reviewed in http://reviews.llvm.org/D6334 llvm-svn: 222510	2014-11-21 06:39:58 +00:00
Craig Topper	45dffff5e4	Remove a bunch of unnecessary typecasts to 'const TargetRegisterClass *' llvm-svn: 222509	2014-11-21 05:58:21 +00:00
Hal Finkel	ac26448a5c	[PPC] Use SeparateConstOffsetFromGEP This mirrors r222331, which enabled SeparateConstOffsetFromGEP on AArch64, in the PowerPC backend. Yields, on a POWER7 machine, a 30% speedup on SingleSource/Benchmarks/Shootout/nestedloop (this might just be from LICM, there is a store moved out of the inner loop) and a potential speedup on MultiSource/Benchmarks/mediabench/mpeg2/mpeg2dec/mpeg2decode. Regardless, it makes some code look cleaner, and synchronizing the backends in this regard seems like a generally good thing. llvm-svn: 222504	2014-11-21 04:35:51 +00:00
Richard Trieu	df554c2abd	Add accessor marcos to ConstantPlaceHolder, similar to those in the base class. llvm-svn: 222502	2014-11-21 02:42:08 +00:00
David Majnemer	0f2c44c562	This Reassociate change unintentionally slipped in r222499 llvm-svn: 222500	2014-11-21 02:37:38 +00:00
David Majnemer	8a561be3da	SROA: The alloca type isn't a candidate promotion type for vectors The alloca's type is irrelevant, only those types which are used in a load or store of the exact size of the slice should be considered. This manifested as an assertion failure when we compared the various types: we had a size mismatch. This fixes PR21480. llvm-svn: 222499	2014-11-21 02:34:55 +00:00
Lang Hames	54c1ec218d	[MCJIT] Remove JITEventListener::NotifyFreeingMachineCode. This method is dead now that the old JIT has been removed. llvm-svn: 222494	2014-11-21 01:57:09 +00:00
Zachary Turner	9766a420be	Add curly braces to workaround an MSVC bug. MSVC can't parse this pattern for range-based for loops. llvm-svn: 222491	2014-11-21 01:19:09 +00:00
Quentin Colombet	8fea50c066	[X86] Do not custom lower UINT_TO_FP when the target type does not match the custom lowering. <rdar://problem/19026326> llvm-svn: 222489	2014-11-21 00:47:19 +00:00
Adrian Prantl	3fbd902da2	Verifier: Check that all instructions have their parent pointers set up correctly. This helps with catching problems caused by IRBuilder abuse such as the one fixed in CFE r222487. llvm-svn: 222488	2014-11-21 00:39:43 +00:00
Reid Kleckner	dbf3d8a5a4	Fix more instances of -Wsentinel on Windows with s/NULL/nullptr/ Follow up to r221940, where I must not have caught em all. NFC llvm-svn: 222481	2014-11-20 23:51:47 +00:00
Reid Kleckner	6a21619ebc	Add out of line virtual destructors to all LLVMTargetMachine subclasses These recently all grew a unique_ptr<TargetLoweringObjectFile> member in r221878. When anyone calls a virtual method of a class, clang-cl requires all virtual methods to be semantically valid. This includes the implicit virtual destructor, which triggers instantiation of the unique_ptr destructor, which fails because the type being deleted is incomplete. This is just part of the ongoing saga of PR20337, which is affecting Blink as well. Because the MSVC ABI doesn't have key functions, we end up referencing the vtable and implicit destructor on any virtual call through a class. We don't actually end up emitting the dtor, so it'd be good if we could avoid this unneeded type completion work. llvm-svn: 222480	2014-11-20 23:37:18 +00:00
Mehdi Amini	fe9410a47b	Update Makefile following directory removal in r222466 llvm-svn: 222475	2014-11-20 22:48:24 +00:00
Mehdi Amini	0e5577d057	SimplifyCFG: Refactor GatherConstantCompares() result in a struct Code seems cleaner and easier to understand this way This is basically r222416, after fixes for MSVC lack of standard support, and a few cleaning (got rid of a warning). Thanks Nakamura Takumi and Nico Weber for the MSVC fixes. llvm-svn: 222472	2014-11-20 22:40:25 +00:00
Colin LeMahieu	ff9ce82394	[Hexagon] [NFC] Merging InstPrinter directory in to MCTargetDesc since they have a circular dependency. llvm-svn: 222458	2014-11-20 21:56:35 +00:00
Lang Hames	c2ac93c9a0	[MCJIT] Remove JITEventListener::NotifyFunctionEmitted - this method is dead now that the legacy JIT has been removed. llvm-svn: 222453	2014-11-20 21:16:16 +00:00
Michael Zolotukhin	7e8ae7cad7	Fix a trip-count overflow issue in LoopUnroll. Currently LoopUnroll generates a prologue loop before the main loop body to execute first N%UnrollFactor iterations. Also, this loop is used if trip-count can overflow - it's determined by a runtime check. However, we've been mistakenly optimizing this loop to a linear code for UnrollFactor = 2, not taking into account that it also serves as a safe version of the loop if its trip-count overflows. llvm-svn: 222451	2014-11-20 20:19:55 +00:00
Saleem Abdulrasool	0de13e90eb	X86: use the correct alloca symbol for Windows Itanium Windows itanium targets the MSVCRT, and the stack probe symbol is provided by MSVCRT. This corrects the emission of stack probes on i686-windows-itanium. llvm-svn: 222439	2014-11-20 18:01:26 +00:00
Frederic Riss	2dc59ac07d	Make DWARFAcceleratorTable::dump() const. As dump() methods should be. To allow that, do not store the DWARFFormValue objects used for the dump in the header data. Per Alexey's suggestion! llvm-svn: 222436	2014-11-20 16:21:11 +00:00
Frederic Riss	38a8c3bf9f	Add missing copyright headers. llvm-svn: 222435	2014-11-20 16:21:06 +00:00
Frederic Riss	dc8c6cbba0	Do not create a replaceable Variables MDNode for function forward decls. These fields would need to be explicitly deleted before we RAUW the temporary node anyway (this was done in cfe commit r222373). Instead, do not create these useless nodes in the first place. llvm-svn: 222434	2014-11-20 15:52:34 +00:00
Timur Iskhodzhanov	acdb11d1ac	Revert r222416, r222422, r222426: the former revision had problems and fixing them introduced bugs llvm-svn: 222428	2014-11-20 12:36:43 +00:00
Timur Iskhodzhanov	e067365308	Fix a typo llvm-svn: 222426	2014-11-20 11:48:58 +00:00
NAKAMURA Takumi	ac9ac4332a	SimplifyCFG.cpp: Tweak to let msc17 compliant. - Use LLVM_DELETED_FUNCTION. - Don't use member initializers. - Don't use initializer list. llvm-svn: 222422	2014-11-20 08:59:02 +00:00
Mehdi Amini	af75a5fdde	SimplifyCFG: Refactor GatherConstantCompares() result in a struct Code seems cleaner and easier to understand this way llvm-svn: 222416	2014-11-20 06:51:02 +00:00
Jyoti Allur	0aaf89456e	[ELF] Prevent ARM ELF object writer from generating deprecated relocation code R_ARM_PLT32 llvm-svn: 222414	2014-11-20 05:58:11 +00:00
Craig Topper	0f032e3130	Fix a typo in a comment. llvm-svn: 222412	2014-11-20 05:22:37 +00:00
Alexey Samsonov	b78ee81286	Remove support for undocumented SpecialCaseList entries. "global-init", "global-init-src" and "global-init-type" were originally used to blacklist entities in ASan init-order checker. However, they were never documented, and later were replaced by "=init" category. Old blacklist entries should be converted as follows: * global-init:foo -> global:foo=init * global-init-src:bar -> src:bar=init * global-init-type:baz -> type:baz=init llvm-svn: 222401	2014-11-20 01:27:19 +00:00
Colin LeMahieu	384b462d47	[Hexagon] Adding A2_xor instruction with IR selection pattern and test. llvm-svn: 222399	2014-11-19 23:22:23 +00:00
Chad Rosier	eafdf66096	Revert "[Reassociate] As the expression tree is rewritten make sure the operands are" This reverts commit r222142. This is causing/exposing an execution-time regression in spec2006/gcc and coremark on AArch64/A57/Ofast. Conflicts: test/Transforms/Reassociate/optional-flags.ll llvm-svn: 222398	2014-11-19 23:21:20 +00:00
Colin LeMahieu	35e8a8aa73	[Hexagon] Adding A2_or instruction with IR selection pattern and test. llvm-svn: 222396	2014-11-19 22:58:04 +00:00
Nico Weber	e2c0ab0b48	Try to fix MSVS build after r222384. No intended behavior change. llvm-svn: 222386	2014-11-19 21:16:11 +00:00
Mehdi Amini	6f7a6c456e	SimplifyCFG: turn recursive GatherConstantCompares into iterative A long sequence of \|\| or && could lead to a stack explosion. llvm-svn: 222384	2014-11-19 20:09:11 +00:00
Matthias Braun	6ce466d916	RegisterCoalescer: Improve debug messages - Show "Considering..." message after flipping so you actually see the final destination vreg as destination. - Add a message on final join, so you can grep for "Success" messages to obtain a list of which register got merged with which. llvm-svn: 222382	2014-11-19 19:46:17 +00:00
Matthias Braun	50dcec92ed	Add a print and verify pass after the RegisterCoalescer llvm-svn: 222381	2014-11-19 19:46:15 +00:00
Matthias Braun	e700647af2	MachineVerifier: Report register for bad liveranges llvm-svn: 222380	2014-11-19 19:46:13 +00:00
Matthias Braun	314ef39016	Introduce register dump helper llvm-svn: 222379	2014-11-19 19:46:11 +00:00
David Majnemer	a30df875ae	AliasSet: Simplify mergeSetIn No functional change intended. llvm-svn: 222376	2014-11-19 19:36:18 +00:00
Andrea Di Biagio	b770dd344e	[X86] Improved lowering of v4x32 build_vector dag nodes. This patch improves the lowering of v4f32 and v4i32 build_vector dag nodes that are known to have at least two non-zero elements. With this patch, a build_vector that performs a blend with zero is converted into a shuffle. This is done to let the shuffle legalizer expand the dag node in a optimal way. For example, if we know that a build_vector performs a blend with zero, we can try to lower it as a movq/blend instead of always selecting an insertps. This patch also improves the logic that lowers a build_vector into a insertps with zero masking. See for example the extra test cases added to test sse41.ll. Differential Revision: http://reviews.llvm.org/D6311 llvm-svn: 222375	2014-11-19 19:34:29 +00:00
Lang Hames	2082c9d610	[ADT] Fix PR20728 - Incorrect APFloat::fusedMultiplyAdd results for x86_fp80. As detailed at http://llvm.org/PR20728, due to an internal overflow in APFloat::multiplySignificand the APFloat::fusedMultiplyAdd method can return incorrect results for x87DoubleExtended (x86_fp80) values. This commonly manifests as incorrect constant folding of libm fmal calls on x86. E.g. fmal(1.0L, 1.0L, 3.0L) == 0.0L (should be 4.0L) This patch fixes PR20728 by adding an extra bit to the significand for intermediate results of APFloat::multiplySignificand, avoiding the overflow. llvm-svn: 222374	2014-11-19 19:15:41 +00:00
Tom Stellard	271c0a936e	R600/SI: Make SIInstrInfo::isOperandLegal() more strict A register operand that has a common sub-class with its instruction's defined register class is not always legal. For example, SReg_32 and M0Reg both have a common sub-class, but we can't use an SReg_32 in instructions that expect a M0Reg. This prevents the llvm.SI.sendmsg.ll test from failing when the fold operand pass is added. llvm-svn: 222368	2014-11-19 16:58:49 +00:00
Zoran Jovanovic	ebf19d975c	[mips][micromips] Implement SWM32 and LWM32 instructions Differential Revision: http://reviews.llvm.org/D5519 llvm-svn: 222367	2014-11-19 16:44:02 +00:00
Suyog Sarda	e6a1f30c00	Vectorize a reduction chain feeding into a 'return' statement. e.x return (a[0]+b[0]) + (a[1]+b[1]) Differential Revision: http://reviews.llvm.org/D6227 llvm-svn: 222364	2014-11-19 16:07:38 +00:00
Jozef Kolek	2b6a42be6d	[mips][microMIPS] Fix opcodes of MFHC1 and MTHC1 instructions. Differential Revision: http://reviews.llvm.org/D6169 llvm-svn: 222355	2014-11-19 13:37:51 +00:00
Arnaud A. de Grandmaison	fdfed29d10	Fix tail recursion elimination When the BasicBlock containing the return instrution has a PHI with 2 incoming values, FoldReturnIntoUncondBranch will remove the no longer used incoming value and remove the no longer needed phi as well. This leaves us with a BB that no longer has a PHI, but the subsequent call to FoldReturnIntoUncondBranch from FoldReturnAndProcessPred will not remove the return instruction (which still uses the result of the call instruction). This prevents EliminateRecursiveTailCall to remove the value, as it is still being used in a basicblock which has no predecessors. The basicblock can not be erased on the spot, because its iterator is still being used in runTRE. This issue was exposed when removing the threshold on size for lifetime marker insertion for named temporaries in clang. The testcase is a much reduced version of peelOffOuterExpr(const Expr, const ExplodedNode ) from clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp. llvm-svn: 222354	2014-11-19 13:32:51 +00:00
Jozef Kolek	d19675f448	[mips][microMIPS] Implement CodeGen support for 16-bit instruction ADDIUR2. Differential Revision: http://reviews.llvm.org/D5800 llvm-svn: 222352	2014-11-19 13:23:58 +00:00
Jozef Kolek	9fbf00198c	[mips][microMIPS] Implement CodeGen support for ADDIUS5 instruction. Differential Revision: http://reviews.llvm.org/D5799 llvm-svn: 222351	2014-11-19 13:11:09 +00:00
Jozef Kolek	0de52b5b97	[mips][microMIPS] Implement LWXS instruction. Differential Revision: http://reviews.llvm.org/D5407 llvm-svn: 222348	2014-11-19 11:39:12 +00:00
Jozef Kolek	e466cd5b54	[mips][microMIPS] Implement SDBBP and RDHWR instructions. Differential Revision: http://reviews.llvm.org/D5240 llvm-svn: 222347	2014-11-19 11:25:50 +00:00
Simon Pilgrim	e5f972f1c1	[X86][SSE] pslldq/psrldq byte shifts/rotation for SSE2 This patch builds on http://reviews.llvm.org/D5598 to perform byte rotation shuffles (lowerVectorShuffleAsByteRotate) on pre-SSSE3 (palignr) targets - pre-SSSE3 is only enabled on i8 and i16 vector targets where it is a more definite performance gain. I've also added a separate byte shift shuffle (lowerVectorShuffleAsByteShift) that makes use of the ability of the SLLDQ/SRLDQ instructions to implicitly shift in zero bytes to avoid the need to create a zero register if we had used palignr. Differential Revision: http://reviews.llvm.org/D5699 llvm-svn: 222340	2014-11-19 10:06:49 +00:00
David Majnemer	074041b4ec	AliasSetTracker: UnknownInsts should contribute to the refcount AliasSetTracker::addUnknown may create an AliasSet devoid of pointers just to contain an instruction if no suitable AliasSet already exists. It will then AliasSet::addUnknownInst and we will be done. However, it's possible for addUnknown to choose an existing AliasSet to addUnknownInst. If this were to occur, we are in a bit of a pickle: removing pointers from the AliasSet can cause the entire AliasSet to become destroyed, taking our unknown instructions out with them. Instead, keep track whether or not our AliasSet has any unknown instructions. This fixes PR21582. llvm-svn: 222338	2014-11-19 09:41:05 +00:00
David Blaikie	60e6c80905	Update SetVector to rely on the underlying set's insert to return a pair<iterator, bool> This is to be consistent with StringSet and ultimately with the standard library's associative container insert function. This lead to updating SmallSet::insert to return pair<iterator, bool>, and then to update SmallPtrSet::insert to return pair<iterator, bool>, and then to update all the existing users of those functions... llvm-svn: 222334	2014-11-19 07:49:26 +00:00
Hao Liu	f7e0bd2878	[AArch64] Disable useAA for Cortex-A57. Using AA during CodeGen is very useful for in-order cores. It is less useful for ooo cores. Also I find enabling useAA for Cortex-A57 may generate worse code for some test cases. If useAA in codegen is improved and benefical for ooo cores, we can enable it again. llvm-svn: 222333	2014-11-19 06:48:56 +00:00
Hao Liu	00d285aca3	[AArch64] Enable SeparateConstOffsetFromGEP, EarlyCSE and LICM passes on AArch64 backend. SeparateConstOffsetFromGEP can gives more optimizaiton opportunities related to GEPs, which benefits EarlyCSE and LICM. By enabling these passes we can have better address calculations and generate a better addressing mode. Some SPEC 2006 benchmarks (astar, gobmk, namd) have obvious improvements on Cortex-A57. Reviewed in http://reviews.llvm.org/D5864. llvm-svn: 222331	2014-11-19 06:39:53 +00:00
Hao Liu	a3e7d1ff7e	[SeparateConstOffsetFromGEP] Allow SeparateConstOffsetFromGEP pass to lower GEPs. If LowerGEP is enabled, it can lower a GEP with multiple indices into GEPs with a single index or arithmetic operations. Lowering GEPs can always extract structure indices. Lowering GEPs can also give use more optimization opportunities. It can benefit passes like CSE, LICM and CGP. Reviewed in http://reviews.llvm.org/D5864 llvm-svn: 222328	2014-11-19 06:24:44 +00:00
David Blaikie	7499cbae4c	Remove StringMap::GetOrCreateValue in favor of StringMap::insert Having two ways to do this doesn't seem terribly helpful and consistently using the insert version (which we already has) seems like it'll make the code easier to understand to anyone working with standard data structures. (I also updated many references to the Entry's key and value to use first() and second instead of getKey{Data,Length,} and get/setValue - for similar consistency) Also removes the GetOrCreateValue functions so there's less surface area to StringMap to fix/improve/change/accommodate move semantics, etc. llvm-svn: 222319	2014-11-19 05:49:42 +00:00
Rui Ueyama	520b1e8263	llvm-readobj: fix off-by-one error in COFFDumper It printed out base relocation table header as table entry. This patch also makes llvm-readobj to not skip ABSOLUTE entries becuase it was confusing. llvm-svn: 222299	2014-11-19 02:07:10 +00:00
Weiming Zhao	c7ce2ee93f	[Aarch64] Customer lowering of CTPOP to SIMD should check for NEON availability llvm-svn: 222292	2014-11-19 00:29:14 +00:00
Kostya Serebryany	52d047bc0f	[asan] add experimental basic-block tracing to asan-coverage; also fix -fsanitize-coverage=3 which was broken by r221718 llvm-svn: 222290	2014-11-19 00:22:58 +00:00
Rui Ueyama	2b5655092a	llvm-readobj: teach it how to dump COFF base relocation table llvm-svn: 222289	2014-11-19 00:18:07 +00:00
Kostya Serebryany	d66807fbdc	Introduce llvm::SplitAllCriticalEdges Summary: move the code from BreakCriticalEdges::runOnFunction() into a separate utility function llvm::SplitAllCriticalEdges() so that it can be used independently. No functionality change intended. Test Plan: check-llvm Reviewers: nlewycky Reviewed By: nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6313 llvm-svn: 222288	2014-11-19 00:17:31 +00:00
Manman Ren	df5625ea3a	Revert r222039 because of bot failure. http://lab.llvm.org:8080/green/job/clang-Rlto_master/298/ Hopefully, bot will be green. If not, we will re-submit the commit. llvm-svn: 222287	2014-11-19 00:13:26 +00:00
Matt Arsenault	73f4bd8758	R600/SI: Implement areMemAccessesTriviallyDisjoint This partially makes up for not having address spaces used for alias analysis in some simple cases. This is not yet enabled by default so shouldn't change anything yet. llvm-svn: 222286	2014-11-19 00:01:31 +00:00
Matt Arsenault	1843dcc2b6	R600/SI: Set hasSideEffects = 0 on load and store instructions. Assuming unmodeled side effects interferes with some scheduling opportunities. Don't put it in the base class of DS instructions since there are a few weird effecting, non load/store instructions there. llvm-svn: 222285	2014-11-18 23:57:33 +00:00
Simon Pilgrim	daabed160f	[X86][AVX] 256-bit vector stack unaligned load/stores identification Under many circumstances the stack is not 32-byte aligned, resulting in the use of the vmovups/vmovupd/vmovdqu instructions when inserting ymm reloads/spills. This minor patch adds these instructions to the isFrameLoadOpcode/isFrameStoreOpcode helpers so that they can be correctly identified and not be treated as folded reloads/spills. This has also been noticed by http://llvm.org/bugs/show_bug.cgi?id=18846 where it was causing redundant spills - I've added a reduced test case at test/CodeGen/X86/pr18846.ll Differential Revision: http://reviews.llvm.org/D6252 llvm-svn: 222281	2014-11-18 23:38:19 +00:00
Colin LeMahieu	f815ca1b0b	[Hexagon] Adding A2_and instruction. llvm-svn: 222274	2014-11-18 22:45:47 +00:00
Chad Rosier	ccf41a5c21	[FastISel][AArch64] Also allow folding of sign-/zero-extend and arithmetic shift-right for booleans (i1). Arithmetic shift-right immediate with sign-/zero-extensions also works for boolean values. Update the assert and the test cases to reflect that fact. llvm-svn: 222272	2014-11-18 22:41:49 +00:00
Chad Rosier	7153154a79	[FastISel][AArch64] Also allow folding of sign-/zero-extend and logical shift-right for booleans (i1). Logical shift-right immediate with sign-/zero-extensions also works for boolean values. Update the assert and the test cases to reflect that fact. llvm-svn: 222270	2014-11-18 22:38:42 +00:00
David Majnemer	e6cc1061cc	InstCombine: Fix another infinite loop caused by visitFPTrunc We would attempt to replace an frem's operand with the same operand. This would cause InstCombine to think real work was done, causing InstCombine to enter an infinite loop. This fixes the second part of PR21576. llvm-svn: 222265	2014-11-18 22:06:45 +00:00
Colin LeMahieu	1d74e8f01b	[Hexagon] Adding A2_sub instruction Renaming test files. llvm-svn: 222263	2014-11-18 21:51:51 +00:00
David Majnemer	0c67e78132	Revert "Revert r222040 because of bot failure." This reverts commit r222203, reverting r222040 didn't end up turning the bot green. llvm-svn: 222261	2014-11-18 21:30:02 +00:00
Juergen Ributzka	b3791ee3a7	[FastISel][AArch64] Follow-up fix for "Fix shift-immediate emission for "zero" shifts." Shifts also perform sign-/zero-extends to larger types, which requires us to emit an integer extend instead of a simple COPY. Related to PR21594. llvm-svn: 222257	2014-11-18 21:20:17 +00:00
Matt Arsenault	84d2214a94	R600/SI: Move SIFixSGPRCopies to inst selector passes This should expose more of the actually used VALU instructions to the machine optimization passes. This also should help getting i1 handling into a better state. For not entirly understood reasons, this fixes the split-scalar-i64-add.ll test where a 64-bit add would only partially be moved to the VALU resulting in use of undefined VCC. llvm-svn: 222256	2014-11-18 21:06:58 +00:00
Juergen Ributzka	fe3cb34d8e	[AArch64] Don't optimize all compare instructions. "optimizeCompareInstr" converts compares (cmp/cmn) into plain sub/add instructions when the flags are not used anymore. This conversion is valid for most instructions, but not all. Some instructions that don't set the flags (e.g. sub with immediate) can set the SP, whereas the flag setting version uses the same encoding for the "zero" register. Update the code to also check for the return register before performing the optimization to make sure that a cmp doesn't suddenly turn into a sub that sets the stack pointer. I don't have a test case for this, because it isn't easy to trigger. llvm-svn: 222255	2014-11-18 21:02:40 +00:00
Owen Anderson	4490a7cce1	Fix an incorrect chain operand when expanding INSERT_VECTOR operations through the stack. Patch by Daniil Troshkov! llvm-svn: 222254	2014-11-18 20:50:19 +00:00
Tom Stellard	962ccd7f85	R600/SI: Make sure resource descriptors are always stored in SGPRs llvm-svn: 222253	2014-11-18 20:39:39 +00:00
Colin LeMahieu	f7ca7f6c70	[Hexagon] Converting from ADD_rr to A2_add which has encoding bits. Adding test to show correct instruction selection and encoding. llvm-svn: 222249	2014-11-18 20:28:11 +00:00
Chad Rosier	dc822c8d6f	[Reassociate] Rename local variable to not use same name as a member variable. NFC. llvm-svn: 222248	2014-11-18 20:21:54 +00:00
Juergen Ributzka	a2005be2b4	[FastISel][AArch64] Fix shift-immediate emission for "zero" shifts. This change emits a COPY for a shift-immediate with a "zero" shift value. This fixes PR21594 where we emitted a shift instruction with an incorrect immediate operand. llvm-svn: 222247	2014-11-18 19:58:59 +00:00
Jozef Kolek	d5cccacaae	Test commit to verify that commit access works. llvm-svn: 222244	2014-11-18 19:20:34 +00:00
Philip Reames	33e827b222	Tweak EarlyCSE to recognize series of dead stores EarlyCSE is giving up on the current instruction immediately when it recognizes that the current instruction makes a previous store trivially dead. There's no reason to do this. Once the previous store has been deleted, it's perfectly legal to remember the value of the current store (for value forwarding) and the fact the store occurred (it could be dead too!). Reviewed by: Hal Differential Revision: http://reviews.llvm.org/D6301 llvm-svn: 222241	2014-11-18 17:46:32 +00:00
David Majnemer	a43009a5dc	InstCombine: Fold away tautological masked compares It is impossible for (x & INT_MAX) == 0 && x == INT_MAX to ever be true. While this sort of reasoning should normally live in InstSimplify, the machinery that derives this result is not trivial to split out. llvm-svn: 222230	2014-11-18 09:31:41 +00:00
David Majnemer	fdefc8c778	InstCombine: Clean up foldLogOpOfMaskedICmps No functional change intended. llvm-svn: 222229	2014-11-18 09:31:36 +00:00
Frederic Riss	f1bfe6e383	Allow DwarfCompileUnit::constructImportedEntityDIE to instanciate a GlobalVariable DIE. Usually global variables are in a retain list and instanciated before any call to constructImportedEntityDIE is made. This isn't true for forward declarations though. The testcase for this change is generated by a clang patched to emit such forward declarations (patch at http://reviews.llvm.org/D6173 which will land soon). The updated testcase tests more than just global variables, it now tests every type of 'using' clause we support. llvm-svn: 222217	2014-11-18 02:46:11 +00:00
Hans Wennborg	aaf76f8d38	SimplifyCFG: Range'ify some for-loops. No functional change. llvm-svn: 222215	2014-11-18 02:37:11 +00:00
David Majnemer	35be2aba4c	IndVarSimplify: Allow LFTR to fire more often I added a pessimization in r217102 to prevent miscompiles when the incremented induction variable was used in a comparison; it would be poison. Try to use the incremented induction variable more often when we can be sure that the increment won't end in poison. Differential Revision: http://reviews.llvm.org/D6222 llvm-svn: 222213	2014-11-18 02:20:58 +00:00
Duncan P. N. Exon Smith	07ee25e184	IR: Sink MDNode::Hash down to GenericMDNode::Hash Part of PR21532. llvm-svn: 222212	2014-11-18 02:20:29 +00:00
Duncan P. N. Exon Smith	c050fbe211	IR: Move MDNode operands from the back to the front Having the operands at the back prevents subclasses from safely adding fields. Move them to the front. Instead of replicating the custom `malloc()`, `free()` and `DestroyFlag` logic that was there before, overload `new` and `delete`. I added calls to a new `GenericMDNode::dropAllReferences()` in `LLVMContextImpl::~LLVMContextImpl()`. There's a maze of callbacks happening during teardown, and this resolves them before we enter the destructors. Part of PR21532. llvm-svn: 222211	2014-11-18 01:56:14 +00:00
Michael J. Spencer	968894e67c	Fix covered switch warning llvm-svn: 222209	2014-11-18 01:26:46 +00:00
Michael J. Spencer	312f54fa4a	Support ELF files of unknown type. llvm-svn: 222208	2014-11-18 01:14:25 +00:00
Duncan P. N. Exon Smith	99bd43a493	IR: Split MDNode into GenericMDNode and MDNodeFwdDecl Split `MDNode` into two classes: - `GenericMDNode`, which is uniquable (and for now, always starts uniqued). Once `Metadata` is split from the `Value` hierarchy, this class will lose the ability to RAUW itself. - `MDNodeFwdDecl`, which is used for the "temporary" interface, is never uniqued, and isn't managed by `LLVMContext` at all. I've left most of the guts in `MDNode` for now, but I'll incrementally move things to the right places (or delete the functionality, as appropriate). Part of PR21532. llvm-svn: 222205	2014-11-18 00:37:17 +00:00
Manman Ren	3d4f707d60	Revert r222040 because of bot failure. http://lab.llvm.org:8080/green/job/clang-Rlto_master/298/ Hopefully, bot will be green. llvm-svn: 222203	2014-11-18 00:33:22 +00:00
Manman Ren	9b65b2864d	Debug Info: In DIBuilder, the context field of a global variable is updated to use DIScopeRef. A paired commit at clang will follow to show cases where we will use an identifer for the context of a global variable. rdar://18958417 llvm-svn: 222195	2014-11-18 00:29:08 +00:00
Duncan P. N. Exon Smith	41818ec794	IR: Simplify uniquing for MDNode Change uniquing from a `FoldingSet` to a `DenseSet` with custom `DenseMapInfo`. Unfortunately, this doesn't save any memory, since `DenseSet<T>` is a simple wrapper for `DenseMap<T, char>`, but I'll come back to fix that later. I used the name `GenericDenseMapInfo` to the custom `DenseMapInfo` since I'll be splitting `MDNode` into two classes soon: `MDNodeFwdDecl` for temporaries, and `GenericMDNode` for everything else. I also added a non-debug-info reduced version of a type-uniquing test that started failing on an earlier draft of this patch. Part of PR21532. llvm-svn: 222191	2014-11-17 23:28:21 +00:00
Reid Kleckner	0ae02a3892	Revert "ADT: correctly report isMSVCEnvironment for windows itanium" This reverts commit r222180. llvm-svn: 222188	2014-11-17 22:55:59 +00:00
Saleem Abdulrasool	c8bc5eb9ab	ADT: correctly report isMSVCEnvironment for windows itanium The itanium environment on Windows uses MSVC and is a MSVC environment. Report this correctly. llvm-svn: 222180	2014-11-17 22:13:26 +00:00
Matt Arsenault	0f208ea195	R600/SI: Don't copy flags when extracting subreg This was resulting in use of a register after a kill. For some reason this showed up as a problem in many tests when moving the SIFixSGPRCopies pass closer to instruction selection. llvm-svn: 222175	2014-11-17 21:11:37 +00:00
Matt Arsenault	76c97dc14a	R600/SI: Assume SIFixSGPRCopies makes changes I'm not sure if this was breaking anything. llvm-svn: 222174	2014-11-17 21:11:34 +00:00
Rafael Espindola	fbe022fed3	Factor common code it Linker::init. The TypeFinder was not being used in one of the constructors. llvm-svn: 222172	2014-11-17 20:51:01 +00:00
Rafael Espindola	78bd4d36bf	Pass a reference to ValueEnumerator. NFC. This will just make it easier to use std::unique_ptr in a caller. llvm-svn: 222170	2014-11-17 20:06:27 +00:00
Juergen Ributzka	05cff0a244	[SimplifyCFG] Make the value type of the hole check bitmask a power-of-2. When converting a switch to a lookup table we might have to generate a bitmaks to encode and check for holes in the original switch statement. The type of this mask depends on the number of switch statements, which can result in illegal types for pretty much all architectures. To avoid unnecessary type legalization and help FastISel this commit increases the size of the bitmask to next power-of-2 value when necessary. This fixes rdar://problem/18984639. llvm-svn: 222168	2014-11-17 19:39:56 +00:00
Chad Rosier	2db8cbf601	[Reassociate] As the expression tree is rewritten make sure the operands are emitted in canonical form. llvm-svn: 222142	2014-11-17 16:33:50 +00:00
Alexey Volkov	3cc8e8e28b	[X86] Use ADD/SUB instead of INC/DEC for Haswell and Broadwell CPUs Differential Revision: http://reviews.llvm.org/D5934 llvm-svn: 222141	2014-11-17 16:17:51 +00:00
Chad Rosier	fc00bbc305	[Reassociate] Canonicalize constants to RHS operand. Fix a thinko where the RHS was already a constant. llvm-svn: 222139	2014-11-17 15:52:51 +00:00
Renato Golin	b92ad16856	Fix ARM triple parsing The triple parser should only accept existing architecture names when the triple starts with armv, armebv, thumbv or thumbebv. Patch by Gabor Ballabas. llvm-svn: 222129	2014-11-17 14:08:57 +00:00
David Majnemer	b5ae33d9e3	ScalarEvolution: Construct SCEVDivision's Derived type instead of itself SCEVDivision::divide constructed an object of SCEVDivision<Derived> instead of Derived. divide would call visit which would cast the SCEVDivision<Derived> to type Derived. As it happens, SCEVDivision<Derived> and Derived currently have the same layout but this is fragile and grounds for UB. Instead, just construct Derived. No functional change intended. llvm-svn: 222126	2014-11-17 11:27:45 +00:00
Oliver Stannard	93a823bc7a	[Thumb1] Re-write emitThumbRegPlusImmediate This was motivated by a bug which caused code like this to be miscompiled: declare void @take_ptr(i8) define void @test() { %addr1.32 = alloca i8 %addr2.32 = alloca i32, i32 1028 call void @take_ptr(i8 %addr1) ret void } This was emitting the following assembly to get the value of %addr1: add r0, sp, #1020 add r0, r0, #8 However, "add r0, r0, #8" is not a valid Thumb1 instruction, and this could not be assembled. The generated object file contained this, resulting in r0 holding SP+8 rather tha SP+1028: add r0, sp, #1020 add r0, sp, #8 This function looked like it could have caused miscompilations for other combinations of registers and offsets (though I don't think it is currently called with these), and the heuristic it used did not match the emitted code in all cases. llvm-svn: 222125	2014-11-17 11:18:10 +00:00
David Majnemer	903d9c0ab0	Object, COFF: Tighten the object file parser We were a little lax in a few areas: - We pretended that import libraries were like any old COFF file, they are not. In fact, they aren't really COFF files at all, we should probably grow some specialized functionality to handle them smarter. - Our symbol iterators were more than happy to attempt to go past the end of the symbol table if you had a symbol with a bad list of auxiliary symbols. llvm-svn: 222124	2014-11-17 11:17:17 +00:00
Oliver Stannard	2efad103c3	Fix optimisations of SELECT_CC which assumed result is boolean Some optimisations in DAGCombiner cause miscompilations for targets that use TargetLowering::UndefinedBooleanContent, because they assume that the results of a SELECT_CC node are boolean values, and can be safely ANDed, ORed and XORed. These optimisations are only valid for targets that use ZeroOrOneBooleanContent or ZeroOrNegativeOneBooleanContent. This is a follow-up to D6210/r221693. llvm-svn: 222123	2014-11-17 10:49:31 +00:00
Yaron Keren	6469f7bd2d	silence gcc 4.9.1 warning in /llvm/lib/Support/Windows/Path.inc:564:39: warning: suggest parentheses around assignment used as truth value [-Wparentheses] if (ec = widenPath(path, path_utf16)) llvm-svn: 222122	2014-11-17 09:29:33 +00:00
Erik Eckstein	49292b906e	Optimize switch lookup tables with linear mapping. This is a simple optimization for switch table lookup: It computes the output value directly with an (optional) mul and add if there is a linear mapping between index and output. Example: int f1(int x) { switch (x) { case 0: return 10; case 1: return 11; case 2: return 12; case 3: return 13; } return 0; } generates: define i32 @f1(i32 %x) #0 { entry: %0 = icmp ult i32 %x, 4 br i1 %0, label %switch.lookup, label %return switch.lookup: %switch.offset = add i32 %x, 10 ret i32 %switch.offset return: ret i32 0 } llvm-svn: 222121	2014-11-17 09:13:57 +00:00
Craig Topper	bc3d6e1d6d	Add missing semicolon from r222118. llvm-svn: 222119	2014-11-17 05:58:26 +00:00
Craig Topper	5b6e56da60	Move register class name strings to a single array in MCRegisterInfo to reduce static table size and number of relocation entries. Indices into the table are stored in each MCRegisterClass instead of a pointer. A new method, getRegClassName, is added to MCRegisterInfo and TargetRegisterInfo to lookup the string in the table. llvm-svn: 222118	2014-11-17 05:50:14 +00:00
Rafael Espindola	7382f6d0d4	Add back r222061 with a fix. This adds back r222061, but now calls initializePAEvalPass from the correct library to avoid link problems. Original message: Don't make assumptions about the name of private global variables. Private variables are can be renamed, so it is not reliable to make decisions on the name. The name is also dropped by the assembler before getting to the linker, so using the name causes a disconnect between how llvm makes a decision (var name) and how the linker makes a decision (section it is in). This patch changes one case where we were looking at the variable name to use the section instead. Test tuning by Michael Gottesman. llvm-svn: 222117	2014-11-17 02:28:27 +00:00
Craig Topper	2aa3e27053	Replace a couple asserts with static_asserts. llvm-svn: 222114	2014-11-17 00:26:50 +00:00
Craig Topper	5121b0369b	Convert some EVTs to MVTs where only a SimpleValueType is needed. llvm-svn: 222109	2014-11-16 21:17:18 +00:00
David Majnemer	f2223dee20	ScalarEvolution: Introduce SCEVSDivision and SCEVUDivision It turns out that not all users of SCEVDivision want the same signedness. Let the users determine which operation they'd like by explicitly choosing SCEVUDivision or SCEVSDivision. findArrayDimensions and computeAccessFunctions will use SCEVSDivision while HowFarToZero will use SCEVUDivision. llvm-svn: 222104	2014-11-16 20:35:19 +00:00
Jingyue Wu	230717d66d	[DependenceAnalysis] Allow subscripts of different types Summary: Several places in DependenceAnalysis assumes both SCEVs in a subscript pair share the same integer type. For instance, isKnownPredicate calls SE->getMinusSCEV(X, Y) which asserts X and Y share the same type. However, DependenceAnalysis fails to ensure this assumption when producing a subscript pair, causing tests such as NonCanonicalizedSubscript to crash. With this patch, DependenceAnalysis runs unifySubscriptType before producing any subscript pair, ensuring the assumption. Test Plan: Added NonCanonicalizedSubscript.ll on which DependenceAnalysis before the fix crashed because subscripts have different types. Reviewers: spop, sebpop, jingyue Reviewed By: jingyue Subscribers: eliben, meheff, llvm-commits Differential Revision: http://reviews.llvm.org/D6289 llvm-svn: 222100	2014-11-16 16:52:44 +00:00
Craig Topper	eb88940e3d	[x86] Remove two redundant isel patterns. They equivalent already exists in the instruction pattern. llvm-svn: 222094	2014-11-16 09:24:16 +00:00
David Majnemer	7e29d637c6	ScalarEvolution: HowFarToZero was wrongly using signed division HowFarToZero was supposed to use unsigned division in order to calculate the backedge taken count. However, SCEVDivision::divide performs signed division. Unless I am mistaken, no users of SCEVDivision actually want signed arithmetic: switch to udiv and urem. This fixes PR21578. llvm-svn: 222093	2014-11-16 07:30:35 +00:00
David Majnemer	12827e608f	InstSimplify: Optimize ICmpInst xform that uses computeKnownBits A few things: - computeKnownBits is relatively expensive, let's delay its use as long as we can. - Don't create two APInt values just to run computeKnownBits on a ConstantInt, we already know the exact value! - Avoid creating a temporary APInt value in order to calculate unary negation. llvm-svn: 222092	2014-11-16 02:20:08 +00:00
Andrea Di Biagio	5475bc1d1b	[DAG] Improved target independent vector shuffle folding logic. This patch teaches the DAGCombiner how to combine shuffles according to rules: shuffle(shuffle(A, Undef, M0), B, M1) -> shuffle(B, A, M2) shuffle(shuffle(A, B, M0), B, M1) -> shuffle(B, A, M2) shuffle(shuffle(A, B, M0), A, M1) -> shuffle(B, A, M2) llvm-svn: 222090	2014-11-15 22:56:25 +00:00
Simon Pilgrim	5e170d7652	[X86][SSE] Improve legal SHUFP and PSHUFD shuffle matching Updated X86TargetLowering::isShuffleMaskLegal to match SHUFP masks with commuted inputs and PSHUFD masks that reference the second input. As part of this I've refactored isPSHUFDMask to work in a more general manner and allow it to match against either the first or second input vector. Differential Revision: http://reviews.llvm.org/D6287 llvm-svn: 222087	2014-11-15 21:13:05 +00:00
Matt Arsenault	1298bf6e9c	R600: Permute operands when selecting legacy min/max This gets the correct NaN behavior based on the compare type the hardware uses. This now passes the new piglit test I have for this on SI. Add stricter tests for the operand order. llvm-svn: 222079	2014-11-15 05:02:57 +00:00
Reid Kleckner	30a587b9ae	Revert "Don't make assumptions about the name of private global variables." This reverts commit r222061. It's causing linker errors. llvm-svn: 222077	2014-11-15 02:03:53 +00:00
Tom Stellard	d8a0a4cc2b	R600: Fix 64-bit integer division This fixes a failure in one of the oclconform tests. Patch by: Jan Vesely llvm-svn: 222073	2014-11-15 01:07:57 +00:00
Tom Stellard	573a5f6172	R600: Factor i64 UDIVREM lowering into its own fuction This is so it could potentially be used by SI. However, the current implementation does not always produce correct results, so the IntegerDivisionPass is being used instead. llvm-svn: 222072	2014-11-15 01:07:53 +00:00
Duncan P. N. Exon Smith	8508167a68	DIBuilder: Use Constant instead of Value Make explicit the requirement that most IR values in `DIBuilder` are `Constant`. This requires a follow-up change in clang. Part of PR21532. llvm-svn: 222070	2014-11-15 00:23:49 +00:00
Duncan P. N. Exon Smith	323f78513c	DIBuilder: Change private helper function to static, NFC llvm-svn: 222068	2014-11-15 00:05:04 +00:00
Duncan P. N. Exon Smith	500a28518d	DI: Use Metadata for DITypeRef and DIScopeRef Now that `MDString` and `MDNode` have a common base class, use it. Note that it's not useful to assume subclasses of `Metadata` must be one or the other since we'll be adding more subclasses soon enough. Part of PR21532. llvm-svn: 222064	2014-11-14 23:55:03 +00:00
Reid Kleckner	1bf13cbc83	Rename EH related stuff to be more precise Summary: The current "WinEH" exception handling type is more about Itanium-style LSDA tables layered on top of the Windows native unwind info format instead of .eh_frame tables or EHABI unwind info. Use the name "ItaniumWinEH" to better reflect the hybrid nature of the design. Also rename isExceptionHandlingDWARF to usesItaniumLSDAForExceptions, since the LSDA is part of the Itanium C++ ABI document, and not the DWARF standard. Reviewers: echristo Subscribers: llvm-commits, compnerd Differential Revision: http://reviews.llvm.org/D6279 llvm-svn: 222062	2014-11-14 23:31:07 +00:00
Rafael Espindola	c01b31682e	Don't make assumptions about the name of private global variables. Private variables are can be renamed, so it is not reliable to make decisions on the name. The name is also dropped by the assembler before getting to the linker, so using the name causes a disconnect between how llvm makes a decision (var name) and how the linker makes a decision (section it is in). This patch changes one case where we were looking at the variable name to use the section instead. Test tuning by Michael Gottesman. llvm-svn: 222061	2014-11-14 23:17:47 +00:00
Tim Northover	f511e3a633	ARM: refactor .cfi_def_cfa_offset emission. We use to track quite a few "adjusted" offsets through the FrameLowering code to account for changes in the prologue instructions as we went and allow the emission of correct CFA annotations. However, we were missing a couple of cases and the code was almost impenetrable. It's easier to just add any stack-adjusting instruction to a list and emit them together. llvm-svn: 222057	2014-11-14 22:45:33 +00:00

... 4 5 6 7 8 ...

74766 Commits