llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
NAKAMURA Takumi	fa94088102	OrcJIT: Try to appease msc18 to add move constructor in FullyPartitionedModule . llvm-svn: 229492	2015-02-17 12:52:58 +00:00
Manuel Klimek	d758fc017a	Fix problem with uninitialized bool found by asan. llvm-svn: 229490	2015-02-17 12:42:14 +00:00
Justin Bogner	892964de44	Revert "InstrProf: Add unit tests for the profile reader and writer" This added API to the InstrProfWriter to write to a string so I could write unittests without using temp files. This doesn't really work, since the format has tighter alignment requirements than a char. This reverts r229478 and its follow-up, r229481. llvm-svn: 229483	2015-02-17 09:21:43 +00:00
Justin Bogner	5a941203c6	Re-apply "InstrProf: Add unit tests for the profile reader and writer" Add these tests again, but use va_list instead of initializer lists. This reverts r229456, reapplying r229455. llvm-svn: 229478	2015-02-17 07:50:59 +00:00
Jonas Paulsson	3770814b38	[PBQP] NDEBUG guards added around code needed for assert. wasConservativelyAllocatable() is only called to assert that a conservatively allocatable node wasn't forced to spill. llvm-svn: 229477	2015-02-17 07:45:06 +00:00
Hal Finkel	c9890f4fe1	[BDCE] Add a bit-tracking DCE pass BDCE is a bit-tracking dead code elimination pass. It is based on ADCE (the "aggressive DCE" pass), with the added capability to track dead bits of integer valued instructions and remove those instructions when all of the bits are dead. Currently, it does not actually do this all-bits-dead removal, but rather replaces the instruction's uses with a constant zero, and lets instcombine (and the later run of ADCE) do the rest. Because we essentially get a run of ADCE "for free" while tracking the dead bits, we also do what ADCE does and removes actually-dead instructions as well (this includes instructions newly trivially dead because all bits were dead, but not all such instructions can be removed). The motivation for this is a case like: int __attribute__((const)) foo(int i); int bar(int x) { x \|= (4 & foo(5)); x \|= (8 & foo(3)); x \|= (16 & foo(2)); x \|= (32 & foo(1)); x \|= (64 & foo(0)); x \|= (128& foo(4)); return x >> 4; } As it turns out, if you order the bit-field insertions so that all of the dead ones come last, then instcombine will remove them. However, if you pick some other order (such as the one above), the fact that some of the calls to foo() are useless is not locally obvious, and we don't remove them (without this pass). I did a quick compile-time overhead check using sqlite from the test suite (Release+Asserts). BDCE took ~0.4% of the compilation time (making it about twice as expensive as ADCE). I've not looked at why yet, but we eliminate instructions due to having all-dead bits in: External/SPEC/CFP2006/447.dealII/447.dealII External/SPEC/CINT2006/400.perlbench/400.perlbench External/SPEC/CINT2006/403.gcc/403.gcc MultiSource/Applications/ClamAV/clamscan MultiSource/Benchmarks/7zip/7zip-benchmark llvm-svn: 229462	2015-02-17 01:36:59 +00:00
Lang Hames	126ce3b498	[Orc] Update the Orc indirection utils and refactor the CompileOnDemand layer. This patch replaces most of the Orc indirection utils API with a new class: JITCompileCallbackManager, which creates and manages JIT callbacks. Exposing this functionality directly allows the user to create callbacks that are associated with user supplied compilation actions. For example, you can create a callback to lazyily IR-gen something from an AST. (A kaleidoscope example demonstrating this will be committed shortly). This patch also refactors the CompileOnDemand layer to use the JITCompileCallbackManager API. llvm-svn: 229461	2015-02-17 01:18:38 +00:00
Duncan P. N. Exon Smith	b89af4c667	AsmPrinter: Stop creating DebugLocs While looking at a heap profile of a clang LTO bootstrap with -g, I noticed that 2.2% of memory in an `llvm-lto` of clang is from calling `DebugLoc::get()` in `collectVariableInfo()` (accounting for ~40% of memory used for `MDLocation`s). I suspect this was introduced by r226736, whose goal was to prevent uniquing of `DebugLoc`s (goal achieved, if so). There's no reason we need a `DebugLoc` here at all -- it was just being used for (in)convenient API -- so the fix is to pass the scope and inlined-at directly to `LexicalScopes::findInlinedScope()`. llvm-svn: 229459	2015-02-17 00:02:27 +00:00
Justin Bogner	7e3c504766	Revert "InstrProf: Add unit tests for the profile reader and writer" Looks like the bots don't like my initializer lists. This reverts r229455 llvm-svn: 229456	2015-02-16 23:31:07 +00:00
Justin Bogner	116967eecd	InstrProf: Add unit tests for the profile reader and writer This required some minor API to be added to these types to avoid needing temp files. Also, I've used initializer lists in the tests, as MSVC 2013 claims to support them. I'll redo this without them if the bots complain. llvm-svn: 229455	2015-02-16 23:27:48 +00:00
Simon Atanasyan	bfe1d906dc	[Mips] Add .MIPS.options section descriptor kinds enumeration No functional changes. llvm-svn: 229452	2015-02-16 22:59:29 +00:00
Lang Hames	cc93cf5453	[Orc] Add an emitAndFinalize method to the ObjectLinkingLayer, IRCompileLayer and LazyEmittingLayer of Orc. This method allows you to immediately emit and finalize a module. It is required by an upcoming refactor of the indirection utils and the compile-on-demand layer. I've filed http://llvm.org/PR22608 to write unit tests for this and other Orc APIs. llvm-svn: 229451	2015-02-16 22:36:25 +00:00
Justin Bogner	bb1f720b6c	InstrProf: Use ErrorOr for IndexedInstrProfReader::create (NFC) The other InstrProfReader::create factories were updated to return ErrorOr in r221120, and it's odd for these APIs not to match. llvm-svn: 229433	2015-02-16 21:28:58 +00:00
Craig Topper	3f6d0f923d	[X86] Remove x86.avx2.psll.dq.bs and x86.avx2.psrl.dq.bs intrinsics. llvm-svn: 229430	2015-02-16 20:51:59 +00:00
Andrew Trick	e7964c82c7	AArch64: Safely handle the incoming sret call argument. This adds a safe interface to the machine independent InputArg struct for accessing the index of the original (IR-level) argument. When a non-native return type is lowered, we generate the hidden machine-level sret argument on-the-fly. Before this fix, we were representing this argument as OrigArgIndex == 0, which is an outright lie. In particular this crashed in the AArch64 backend where we actually try to access the type of the original argument. Now we use a sentinel value for machine arguments that have no original argument index. AArch64, ARM, Mips, and PPC now check for this case before accessing the original argument. Fixes <rdar://19792160> Null pointer assertion in AArch64TargetLowering llvm-svn: 229413	2015-02-16 18:10:47 +00:00
Jonas Paulsson	e4359975ab	[PBQP] Improve the assert for conservatively allocatables. Remember if the node ever was in this state instead of checking just the final state. Reviewed by Arnaud de Grandmaison. llvm-svn: 229400	2015-02-16 15:39:26 +00:00
Chandler Carruth	160432e384	Switch our index sequence away from template aliases and just use classes. We can't use template aliases because on MSVC they don't appear to work correctly in the common usage such as Format.h. Many thanks to Zach for doing all the testing and debugging here. I just slotted the fix into the code. llvm-svn: 229362	2015-02-16 08:22:35 +00:00
David Majnemer	2b452a1df4	IR: Properly return nullptr when getAggregateElement is out-of-bounds We didn't properly handle the out-of-bounds case for ConstantAggregateZero and UndefValue. This would manifest as a crash when the constant folder was asked to fold a load of a constant global whose struct type has no operands. This fixes PR22595. llvm-svn: 229352	2015-02-16 04:02:09 +00:00
Craig Topper	916052961d	[X86] Remove gcc builtins for AVX2 psll_dq and psrl_dq intrinsics. Clang no longer needs them. llvm-svn: 229347	2015-02-16 00:42:36 +00:00
Benjamin Kramer	d21608f72f	MinGW's snprintf is not exposed through std::. llvm-svn: 229342	2015-02-15 23:17:20 +00:00
Aaron Ballman	0b45511a2e	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229340	2015-02-15 22:54:22 +00:00
Benjamin Kramer	99efbfba33	Format: Modernize using variadic templates. Introduces a subset of C++14 integer sequences in STLExtras. This is just enough to support unpacking a std::tuple into the arguments of snprintf, we can add more of it when it's actually needed. Also removes an ancient macro hack that leaks a macro into the global namespace. Clean up users that made use of the convenient hack. llvm-svn: 229337	2015-02-15 22:15:41 +00:00
Aaron Ballman	0e19b5d670	Removing LLVM_EXPLICIT, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229335	2015-02-15 22:00:20 +00:00
Aaron Ballman	f48307f858	Since MSVC 1800 is our lowest common denominator, we don't need an explicit check for it in these macros any longer; NFC. llvm-svn: 229333	2015-02-15 21:21:52 +00:00
Benjamin Kramer	1256b8262c	CommandLine: Use variadic templates to simplify opt constructors. llvm-svn: 229332	2015-02-15 21:11:25 +00:00
Zachary Turner	e9cd3c4989	llvm-pdbdump: Add flags controlling the type of values to dump. llvm-svn: 229330	2015-02-15 20:27:53 +00:00
Benjamin Kramer	7ce6e93852	FoldingSet: Replace faux variadics with real variadics. NFC. llvm-svn: 229328	2015-02-15 20:12:17 +00:00
Benjamin Kramer	c1b3bb4e32	Remove LLVM_HAS_VARIADIC_TEMPLATES and all the faux variadic workarounds guarded by it. We no longer support compilers without variadic template support. llvm-svn: 229324	2015-02-15 19:34:28 +00:00
Benjamin Kramer	4579422ac9	Update the docs to require at least MSVC 2013. llvm-svn: 229323	2015-02-15 19:34:17 +00:00
Arnaud A. de Grandmaison	1c6984dc94	[PBQP] Assert conservativelly allocatable nodes are spilled by choice. llvm-svn: 229302	2015-02-15 10:35:31 +00:00
Ramkumar Ramachandra	af4f23c6ae	InstCombine: propagate deref via new addDereferenceableAttr The "dereferenceable" attribute cannot be added via .addAttribute(), since it also expects a size in bytes. AttrBuilder#addAttribute or AttributeSet#addAttribute is wrapped by classes Function, InvokeInst, and CallInst. Add corresponding wrappers to AttrBuilder#addDereferenceableAttr. Having done this, propagate the dereferenceable attribute via gc.relocate, adding a test to exercise it. Note that -datalayout is required during execution over and above -instcombine, because InstCombine only optionally requires DataLayoutPass. Differential Revision: http://reviews.llvm.org/D7510 llvm-svn: 229265	2015-02-14 19:37:54 +00:00
Richard Smith	a95b03186b	[modules] Try harder to stop DebugInfo/PDB/DIA being built if not available. llvm-svn: 229243	2015-02-14 05:54:56 +00:00
Zachary Turner	ee319cd955	llvm-pdbdump: Only dump whitelisted global symbols. Dumping the global scope contains a lot of very uninteresting things and is generally polluted with a lot of random junk. Furthermore, it dumps values unsorted, making it hard to read. This patch dumps known interesting types only, and as a side effect sorts the list by symbol type. llvm-svn: 229232	2015-02-14 03:54:28 +00:00
Duncan P. N. Exon Smith	33685ffe5d	CodeGen: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) Also, add `Function::getFnStackAlignment()`, and canonicalize: getAttributes().getStackAlignment(AttributeSet::FunctionIndex) => getFnStackAlignment() llvm-svn: 229208	2015-02-14 01:44:41 +00:00
Matthias Braun	cad0e84d8e	Revert "On ELF, put PIC jump tables in a non executable section." This reverts commit r228939. The commit broke something in the output of exception handling tables on darwin x86-64. llvm-svn: 229203	2015-02-14 01:16:54 +00:00
Richard Smith	571f7ccf5d	[modules] Split off a separate module for DebugInfo/PDB/DIA so that its headers don't get included on systems where the DIA SDK is unavailable. llvm-svn: 229200	2015-02-14 00:47:20 +00:00
Reid Kleckner	56669ab852	Unify the two EH personality classification routines I wrote We only need one. llvm-svn: 229193	2015-02-14 00:21:02 +00:00
Frederic Riss	f729d83709	DWARFUnit: Add a couple of helpers to access the DIE array. To be used in dsymutil (or any other client that wants to take advantage of the fact that DIEs are stored in a vector). llvm-svn: 229179	2015-02-13 23:18:24 +00:00
Richard Smith	471dcba2f5	[modules] Mark include/llvm/Support/Dwarf.def as being a textually-included header. llvm-svn: 229154	2015-02-13 21:06:45 +00:00
Richard Smith	1007da1563	Clean up some inappropriate choices of type in the bitcode reader. None of these are expected to fix any 64->32 bit real truncation issues. llvm-svn: 229153	2015-02-13 21:05:11 +00:00
Benjamin Kramer	2220034e93	Reapply r229142 with some enable_if magic to avoid memcpying between differing types. Original commit message: SmallVector: Resolve a long-standing fixme by using the existing unitialized_copy dispatch. This makes append() use memcpy for trivially copyable types. llvm-svn: 229149	2015-02-13 20:45:14 +00:00
Benjamin Kramer	b6f1935109	Revert r229142. It breaks the world for unknown reasons. llvm-svn: 229144	2015-02-13 19:45:28 +00:00
Benjamin Kramer	e9e8f9f6de	SmallVector: Resolve a long-standing fixme by using the existing unitialized_copy dispatch. This makes append() use memcpy for trivially copyable types. llvm-svn: 229142	2015-02-13 19:20:39 +00:00
Zachary Turner	56655b0bb0	llvm-pdbdump: Improve printing of functions and signatures. This correctly prints the function pointers, and also prints function signatures for symbols as opposed to just types. So actual functions in your program will now be printed with full name and signature, as opposed to just name as before. llvm-svn: 229129	2015-02-13 17:57:09 +00:00
Arnaud A. de Grandmaison	b1fde3b904	[PBQP] Conservativelly allocatable nodes can be spilled and give a better solution Although such nodes are allocatable, the cost of spilling may be less than allocating to register, so spilling the node may provide a better solution. The assert does not account for this case, so remove it for now. llvm-svn: 229103	2015-02-13 12:04:42 +00:00
Chandler Carruth	18e8c62883	[PM] Remove the old 'PassManager.h' header file at the top level of LLVM's include tree and the use of using declarations to hide the 'legacy' namespace for the old pass manager. This undoes the primary modules-hostile change I made to keep out-of-tree targets building. I sent an email inquiring about whether this would be reasonable to do at this phase and people seemed fine with it, so making it a reality. This should allow us to start bootstrapping with modules to a certain extent along with making it easier to mix and match headers in general. The updates to any code for users of LLVM are very mechanical. Switch from including "llvm/PassManager.h" to "llvm/IR/LegacyPassManager.h". Qualify the types which now produce compile errors with "legacy::". The most common ones are "PassManager", "PassManagerBase", and "FunctionPassManager". llvm-svn: 229094	2015-02-13 10:01:29 +00:00
Chandler Carruth	33dabe4f44	Re-sort #include lines using my handy dandy ./utils/sort_includes.py script. This is in preparation for changes to lots of include lines. llvm-svn: 229088	2015-02-13 09:09:03 +00:00
Zachary Turner	0d71c47d7a	Fix the windows build again. Grrr, MSVC. llvm-svn: 229081	2015-02-13 07:55:29 +00:00
Chandler Carruth	6c78cd7569	Revert a series of commits starting at r228886 which is triggering some regressions for LLDB on Linux. Rafael indicated on lldb-dev that we should just go ahead and revert these but that he wasn't at a computer. The patches backed out are as follows: r228980: Add support for having multiple sections with the name and ... r228889: Invert the section relocation map. r228888: Use the existing SymbolTableIndex intsead of doing a lookup. r228886: Create the Section -> Rel Section map when it is first needed. These patches look pretty nice to me, so hoping its not too hard to get them re-instated. =D llvm-svn: 229080	2015-02-13 07:52:39 +00:00
Zachary Turner	fb935fca62	Fix non-windows builds unhappy about a missing header. llvm-svn: 229079	2015-02-13 07:45:49 +00:00
Zachary Turner	96be932905	llvm-pdbdump: Add more comprehensive dumping of symbol types. In particular this patch adds the ability to dump complete function signature information including argument types as correctly formatted strings. A side effect of this is that almost all symbol and meta types are now formatted. llvm-svn: 229076	2015-02-13 07:40:03 +00:00
Craig Topper	eaf6d626b1	[X86] Remove int_x86_sse2_psll_dq_bs and int_x86_sse2_psrl_dq_bs intrinsics. The builtins aren't used by clang. llvm-svn: 229069	2015-02-13 06:07:24 +00:00
Craig Topper	3ab5637fda	[X86] Remove references to builtin names that have been removed from clang. Hope to remove the intrinsics themselves soon. llvm-svn: 229068	2015-02-13 06:07:14 +00:00
Duncan P. N. Exon Smith	c41b59bc4f	IR: Drop never-used defaults for DIBuilder::createTemplate*(), NFC No caller specifies anything different; these parameters are dead code and probably always have been. The new hierarchy doesn't bother with the fields at all (see r228607 and r228652). llvm-svn: 229037	2015-02-13 03:35:29 +00:00
Duncan P. N. Exon Smith	71f770946c	Bitcode: Remove confusing '?' from r229004, NFC The name is always part of the record, it just might be empty. Remove the `?` for clarity. llvm-svn: 229032	2015-02-13 02:43:38 +00:00
Duncan P. N. Exon Smith	84cf08e569	Bitcode: Add trailing comma to MetadataCodes, NFC Suggested in the review of r229004, this should simplify diffs in the future. llvm-svn: 229031	2015-02-13 02:41:36 +00:00
Duncan P. N. Exon Smith	8dc64a4707	AsmWriter/Bitcode: MDImportedEntity llvm-svn: 229025	2015-02-13 01:46:02 +00:00
Duncan P. N. Exon Smith	baf6eacc58	AsmWriter/Bitcode: MDObjCProperty llvm-svn: 229024	2015-02-13 01:43:22 +00:00
Duncan P. N. Exon Smith	e023c0f5eb	AsmWriter/Bitcode: MDExpression llvm-svn: 229023	2015-02-13 01:42:09 +00:00
Duncan P. N. Exon Smith	c9450daed2	AsmWriter/Bitcode: MDLocalVariable llvm-svn: 229022	2015-02-13 01:39:44 +00:00
Duncan P. N. Exon Smith	58b49ba795	AsmWriter/Bitcode: MDGlobalVariable llvm-svn: 229020	2015-02-13 01:35:40 +00:00
Duncan P. N. Exon Smith	d136432599	AsmWriter/Bitcode: MDTemplate{Type,Value}Parameter llvm-svn: 229019	2015-02-13 01:34:32 +00:00
Duncan P. N. Exon Smith	c96d92ad70	AsmWriter/Bitcode: MDNamespace llvm-svn: 229018	2015-02-13 01:32:09 +00:00
Duncan P. N. Exon Smith	affacdfc5b	AsmWriter/Bitcode: MDLexicalBlockFile llvm-svn: 229017	2015-02-13 01:30:42 +00:00
Duncan P. N. Exon Smith	b3ef6197cf	AsmWriter/Bitcode: MDLexicalBlock llvm-svn: 229016	2015-02-13 01:29:28 +00:00
Duncan P. N. Exon Smith	52584d6996	AsmWriter/Bitcode: MDSubprogram llvm-svn: 229014	2015-02-13 01:26:47 +00:00
Duncan P. N. Exon Smith	21bc2cacec	AsmWriter/Bitcode: MDCompileUnit llvm-svn: 229013	2015-02-13 01:25:10 +00:00
Zachary Turner	bacf14945c	Improve llvm-pdbdump output display. This patch adds a number of improvements to llvm-pdbdump. 1) Dumping of the entire global scope, and not only those symbols that live in individual compilands. 2) Prepend class name to member functions and data 3) Improved display of bitfields. 4) Support for dumping more kinds of data symbols. llvm-svn: 229012	2015-02-13 01:23:51 +00:00
Duncan P. N. Exon Smith	51dcb8de94	AsmWriter/Bitcode: MDSubroutineType llvm-svn: 229011	2015-02-13 01:22:59 +00:00
Duncan P. N. Exon Smith	c4bb6d7bbb	AsmWriter/Bitcode: MDDerivedType and MDCompositeType llvm-svn: 229009	2015-02-13 01:20:38 +00:00
Duncan P. N. Exon Smith	4428ff1087	AsmWriter/Bitcode: MDFile llvm-svn: 229007	2015-02-13 01:19:14 +00:00
Duncan P. N. Exon Smith	38e2854cc3	AsmWriter/Bitcode: MDBasicType llvm-svn: 229005	2015-02-13 01:14:58 +00:00
Duncan P. N. Exon Smith	8b689964a4	AsmWriter/Bitcode: MDEnumerator llvm-svn: 229004	2015-02-13 01:14:11 +00:00
Duncan P. N. Exon Smith	9879c4ea87	AsmWriter/Bitcode: MDSubrange llvm-svn: 229003	2015-02-13 01:10:38 +00:00
Duncan P. N. Exon Smith	94f67658e0	IR: Add MDExpression::ExprOperand Port `DIExpression::Operand` over to `MDExpression::ExprOperand`. The logic is needed directly in `MDExpression` to support printing in assembly. llvm-svn: 229002	2015-02-13 01:07:46 +00:00
Duncan P. N. Exon Smith	661eb5dea8	Support: Add dwarf::getOperationEncoding() llvm-svn: 229001	2015-02-13 01:05:00 +00:00
Duncan P. N. Exon Smith	980b9ef6c3	Support: Rewrite LocationAtom and OperationEncodingString(), NFC Use `Dwarf.def` more. llvm-svn: 229000	2015-02-13 01:04:08 +00:00
Akira Hatanaka	53f74bf662	[LinkModules] Change the way ModuleLinker merges triples. This commit makes the following changes: - Stop issuing a warning when the triples' string representations do not match exactly if the Triple objects generated from the strings compare equal. - On Apple platforms, choose the triple that has the larger minimum version number. rdar://problem/16743513 Differential Revision: http://reviews.llvm.org/D7591 llvm-svn: 228999	2015-02-13 00:40:41 +00:00
Rafael Espindola	4467ec2e41	Add support for having multiple sections with the same name and comdat. Using this in combination with -ffunction-sections allows LLVM to output a .o file with mulitple sections named .text. This saves space by avoiding long unique names of the form .text.<C++ mangled name>. llvm-svn: 228980	2015-02-12 23:29:51 +00:00
David Blaikie	52492e048a	Add missing override. llvm-svn: 228974	2015-02-12 22:58:53 +00:00
Zachary Turner	8f5e13b9c7	Attempt to fix the build again. llvm-svn: 228964	2015-02-12 21:25:58 +00:00
Zachary Turner	5a969b378c	Attempt to fix Linux builds after r228960. llvm-svn: 228962	2015-02-12 21:17:07 +00:00
Rafael Espindola	83518ea75f	Remove mostly unused setters. Most of the code was setting the TargetOptions directly. llvm-svn: 228961	2015-02-12 21:16:34 +00:00
Zachary Turner	6cde1e9388	Add concrete type overloads to PDBSymbol::findChildren(). Frequently you only want to iterate over children of a specific type (e.g. functions). Previously you would get back a generic interface that allowed iteration over the base symbol type, which you would have to dyn_cast<> each one of. With this patch, we allow the user to specify the concrete type as a template parameter, and it will return an iterator which returns instances of the concrete type directly. llvm-svn: 228960	2015-02-12 21:09:24 +00:00
Rafael Espindola	5feecddc53	On ELF, put PIC jump tables in a non executable section. Fixes PR22558. llvm-svn: 228939	2015-02-12 17:46:49 +00:00
Rafael Espindola	fb65819e24	Put each jump table in an independent section if the function is too. This allows the linker to GC both, fixing pr22557. llvm-svn: 228937	2015-02-12 17:16:46 +00:00
Benjamin Kramer	4b76aa3d46	MathExtras: Bring Count(Trailing\|Leading)Ones and CountPopulation in line with countTrailingZeros Update all callers. llvm-svn: 228930	2015-02-12 15:35:40 +00:00
Andrea Di Biagio	7ca0db442c	[TTI] Teach the cost heuristic how to query TLI to check if a zext/trunc is 'free' for the target. Now that SimplifyCFG uses TTI for the cost heuristic, we can teach BasicTTIImpl how to query TLI in order to get a more accurate cost for truncates and zero-extends. Before this patch, the basic cost heuristic in TargetTransformInfoImplCRTPBase would have conservatively returned a 'default' TCC_Basic for all zero-extends, and TCC_Free for truncates on native types. This patch improves the heuristic so that we query TLI (if available) to get more accurate answers. If TLI is available, then methods 'isZExtFree' and 'isTruncateFree' can be used to check if a zext/trunc is free for the target. Added more test cases to SimplifyCFG/X86/speculate-cttz-ctlz.ll. With this change, SimplifyCFG is now able to speculate a 'cheap' cttz/ctlz immediately followed by a free zext/trunc. Differential Revision: http://reviews.llvm.org/D7585 llvm-svn: 228923	2015-02-12 14:17:24 +00:00
Benjamin Kramer	c7a7636094	BitVector: Remove manual bit width dispatch, this is handled by templates NFC. llvm-svn: 228922	2015-02-12 14:02:58 +00:00
Benjamin Kramer	d08a40831d	MathExtras: Parametrize count(Trailing\|Leading)Zeros on the type size. Otherwise we will always select the generic version for e.g. unsigned long if uint64_t is typedef'd to 'unsigned long long'. Also remove enable_if hacks in favor of static_assert. llvm-svn: 228921	2015-02-12 13:47:29 +00:00
Adrian Prantl	9ec54ab53b	Generalize DIBuilder's createReplaceableForwardDecl() to a more flexible createReplaceableCompositeType() that allows to create non-forward-declared temporary nodes. Paired commit with CFE. llvm-svn: 228852	2015-02-11 17:45:05 +00:00
Andrea Di Biagio	70c7608263	[TTI] Improved cost heuristic for cttz/ctlz calls. This patch is a follow-up of r228826 (see code-review: D7506). Now that SimplifyCFG uses TargetTransformInfo for cost analysis, we have to fix the cost heuristic for intrinsic calls to cttz/ctlz. This patch defines method 'getIntrinsicCost' in BasicTTIImpl: now, BasicTTIImpl queries TLI to check if a call to cttz/ctlz is cheap for the target. Added test cases in Transforms/SimplifyCFG/X86 to verify that on x86, SimplifyCFG only speculates a call to cttz/ctlz if it is cheap. Differential Revision: http://reviews.llvm.org/D7554 llvm-svn: 228829	2015-02-11 14:22:18 +00:00
Arnaud A. de Grandmaison	bfad2ea31a	[PBQP] Cautiously update edge costs in the solver The NodeMetadata are maintained in an incremental way. When an edge between 2 nodes has its cost updated, in the course of graph reduction for example, the NodeMetadata need first to have the old edge cost removed, then the new edge cost added. Only once the NodeMetadata have been fully updated, it becomes safe to consider promoting the nodes to the ConservativelyAllocatable or OptimallyReducible sets. Previously, this promotion was occuring right after the removing the old cost, and this was breaking the assumption that a ConservativelyAllocatable should not be spilled. This patch also adds asserts to: - enforces the invariant that a node's reduction can not be downgraded, - only not provably allocatable or optimally reducible nodes can be spilled. llvm-svn: 228816	2015-02-11 08:25:36 +00:00
Reid Kleckner	86643b627c	Don't promote asynch EH invokes of nounwind functions to calls If the landingpad of the invoke is using a personality function that catches asynch exceptions, then it can catch a trap. Also add some landingpads to invalid LLVM IR test cases that lack them. Over-the-shoulder reviewed by David Majnemer. llvm-svn: 228782	2015-02-11 01:23:16 +00:00
Zachary Turner	473b4aac78	Rewrite llvm-pdbdump in terms of LLVMDebugInfoPDB. This makes llvm-pdbdump available on all platforms, although it will currently fail to create a dumper if there is no PDB reader implementation for the current platform. It implements dumping of compilands and children, which is less information than was previously available, but it has to be rewritten from scratch using the new set of interfaces, so the rest of the functionality will be added back in subsequent commits. llvm-svn: 228755	2015-02-10 22:43:25 +00:00
Zachary Turner	422f6e29d5	Provide DIA implementation of DebugInfoPDB. This implements DebugInfoPDB when the DIA SDK is present on the system. Specifically, this means that the following conditions are met: 1) You are building on Windows. 2) You are building with MSVC. 3) Visual Studio did not corrupt the installation of DIA due to a known issue with side-by-side installations of VS2012 and VS2013. If all of these conditions are true, you will be able to pass a value of PDB_Reader::DIA to PDB::createPdbReader(). There are no tests for this yet, as any test will be in the form of a lit test which tests the llvm-pdbdump.exe, which still needs to be rewritten in terms of this library. llvm-svn: 228747	2015-02-10 21:17:52 +00:00
Aaron Ballman	e5024a035a	Now use the __debugbreak intrinsic instead of calling RaiseException; it requires no forward declares and still calls VEH. llvm-svn: 228745	2015-02-10 21:13:04 +00:00
Aaron Ballman	39da612547	Changing the status code generated by LLVM_BUILTIN_TRAP on Windows to be something categorized as a valid error code. Fixes crashing uses (such as not --crash) with existing sys::Wait behavior. llvm-svn: 228738	2015-02-10 20:13:52 +00:00
Andrew Kaylor	fff974fc6d	Adding support for llvm.eh.begincatch and llvm.eh.endcatch intrinsics and beginning the documentation of native Windows exception handling. Differential Revision: http://reviews.llvm.org/D7398 llvm-svn: 228733	2015-02-10 19:52:43 +00:00
Duncan P. N. Exon Smith	73d123e7bc	IR: Add MDNode::replaceWithPermanent() Add new API for converting temporaries that may self-reference. Self-referencing nodes are not allowed to be uniqued, so sending them into `replaceWithUniqued()` is dangerous (and this commit adds assertions that prevent it). `replaceWithPermanent()` has similar semantics to `get()` followed by calls to `replaceOperandWith()`. In particular, if there's a self-reference, it returns a distinct node; otherwise, it returns a uniqued one. Like `replaceWithUniqued()` and `replaceWithDistinct()` (well, it calls out to them) it mutates the temporary node in place if possible, only calling `replaceAllUsesWith()` on a uniquing collision. llvm-svn: 228726	2015-02-10 19:13:46 +00:00
Paul Robinson	b0fca412c4	Explicitly initialize a flag in a default constructor. Works around a Visual C++ issue. Patch by Douglas Yung! llvm-svn: 228699	2015-02-10 15:30:02 +00:00
Aaron Ballman	a7a249093b	Re-committing r228628 with a fix for 64-bit builds. On Windows, we now use RaiseException to generate the kind of trap we require (one which calls our vectored exception handler), and fall back to using a volatile write to simulate a trap elsewhere. llvm-svn: 228691	2015-02-10 14:28:11 +00:00
Lang Hames	27d2677ad7	[Orc] Fix a bug in the LazyEmittingLayer - capture names by value (as std::strings) rather than StringRefs in JITSymbol get-address lambda. Capturing a StringRef by-value is still effectively capturing a reference, which is no good here because the referenced string may be gone by the time the lambda is being evaluated the original value may be gone. Make sure to capture a std::string instead. No test case: This bug doesn't manifest under OrcMCJITReplacement, since it keeps IR modules (from which the StringRefs are sourced) alive permanently. llvm-svn: 228676	2015-02-10 07:35:39 +00:00
Lang Hames	132f37f8e5	[Orc] Add missing casserts header to JITSymbol.h. llvm-svn: 228675	2015-02-10 07:26:19 +00:00
Zachary Turner	0d4fc2d795	Define HAVE_DIA_SDK on Windows when DIA is present. This allows all CMake projects, as well as C++ code, to detect if and when DIA SDK is available for use so that we can enable the DIA-based PDB reader implementation. Differential Revision: http://reviews.llvm.org/D7457 Reviewed By: Chandler Carruth llvm-svn: 228669	2015-02-10 05:04:25 +00:00
Duncan P. N. Exon Smith	61535117b4	IR: Remove unnecessary fields from MDTemplateParameter I noticed this fields were never used in r228607, but I neglected to propagate that into `MDTemplateParameter` until now. This really should have been done before commit in r228640; sorry for the churn. llvm-svn: 228652	2015-02-10 01:59:57 +00:00
Duncan P. N. Exon Smith	6c66615baf	IR: Add accessors to MDExpression Add some accessors to `MDExpression`. llvm-svn: 228648	2015-02-10 01:36:46 +00:00
Duncan P. N. Exon Smith	9fcf9cd379	AsmParser: Add stubs for specialized MDNodes, NFC Well, the exact error from the failed parse will change, but... llvm-svn: 228644	2015-02-10 01:08:16 +00:00
Duncan P. N. Exon Smith	1c43bf9cfb	IR: Add specialized debug info metadata nodes Add specialized debug info metadata nodes that match the `DIDescriptor` wrappers (used by `DIBuilder`) closely. Assembly and bitcode support to follow soon (it'll mostly just be obvious), but this sketches in today's schema. This is the first big commit (well, the only big one aside from the testcase changes that'll come when I move this into place) for PR22464. I've marked a bunch of obvious changes as `TODO`s in the source; I plan to make those changes promptly after this hierarchy is moved underneath `DIDescriptor`, but for now I'm aiming mostly to match the status quo. llvm-svn: 228640	2015-02-10 00:52:32 +00:00
Lang Hames	2359267cca	[Orc] Back out one of the GCC ICE workarounds from r228568. NFC. llvm-svn: 228637	2015-02-10 00:37:26 +00:00
Aaron Ballman	da25678483	Reverting r228628; it broke at least one builder due to the forward declare of RaiseException. llvm-svn: 228633	2015-02-10 00:00:54 +00:00
Adrian Prantl	f10ec50249	Debug info: Use DW_OP_bit_piece instead of DW_OP_piece in the intermediate representation. This - increases consistency by using the same granularity everywhere - allows for pieces < 1 byte - DW_OP_piece didn't actually allow storing an offset. Part of PR22495. llvm-svn: 228631	2015-02-09 23:57:15 +00:00
Duncan P. N. Exon Smith	8456fa3c41	ADT: Allow up to 18 arguments in hash_combine() I just realized that the specialized metadata node patch I'm about to commit won't compile on old compilers. Bump `hash_combine()`'s support for non-variadic templates to 18 (I tested this by reversing the logic in the #ifdef). llvm-svn: 228629	2015-02-09 23:21:05 +00:00
Aaron Ballman	230da450ab	On Windows, we now use RaiseException to generate the kind of trap we require (one which calls our vectored exception handler), and fall back to using a volatile write to simulate a trap elsewhere. llvm-svn: 228628	2015-02-09 23:11:39 +00:00
Ramkumar Ramachandra	545e586a0e	[Statepoint] Improve two asserts, fix some style (NFC) Summary: It's important that our users immediately know what gc.safepoint_poll is. Also fix the style of the declaration of CreateGCStatepoint, in preparation for another change that will wrap it. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7517 llvm-svn: 228626	2015-02-09 23:02:10 +00:00
Duncan P. N. Exon Smith	ec76782680	IR: Take uint64_t in DIBuilder::createExpression() `DIExpression` deals with `uint64_t`, so it doesn't make sense that `createExpression()` is created from `int64_t`. Switch to `uint64_t` to unify them. I've temporarily left in the `int64_t` version, which forwards to the `uint64_t` version. I'll delete it once I've updated the callers. llvm-svn: 228619	2015-02-09 22:13:27 +00:00
Duncan P. N. Exon Smith	03fb95aca6	IR: Document horrible abuse of loose DIDescriptor, NFC I'll circle back and fix this somehow; for now I just don't want to forget about it. llvm-svn: 228608	2015-02-09 21:26:34 +00:00
Duncan P. N. Exon Smith	c4219c892a	IR: Remove dead code in DITemplate* These are never referenced or filled in. llvm-svn: 228607	2015-02-09 21:23:34 +00:00
Ben Langmuir	615dcdb7b4	Reduce the LockFileManager timeout, and provide unsafeRemoveLockFile 5 minutes is an eternity, so try to strike a better balance between waiting long enough for any reasonable module build and not so long that users kill the process because they think it's hanging. Also give the client a way to delete the lock file after a timeout. llvm-svn: 228603	2015-02-09 20:34:24 +00:00
Sanjoy Das	db6aec61b3	Address post-commit review for rL228587: make it explicit that the <NW> bit of a SCEVAddRecExpr does not depend on the sign of the step and the start value of the step. llvm-svn: 228595	2015-02-09 19:39:00 +00:00
Sanjoy Das	2602c601e8	Clarify the wording on what it means for a SCEVAddRecExpr to be <NW>. llvm-svn: 228587	2015-02-09 18:44:42 +00:00
Lang Hames	a19eaab874	[Orc] Revert r228567 (GCC ICE workaround) - it doesn't seem to have helped. As far as I can tell r228568 was the right workaround, and r228567 was unnecessary. If reverting this causes problems on the bots I'll reinstate it. llvm-svn: 228585	2015-02-09 18:16:43 +00:00
Lang Hames	f1ea70a68b	[Orc] Try another workaround for the GCC 4.7.2 ICE introduced in r228557. NFC. llvm-svn: 228568	2015-02-09 07:47:32 +00:00
Lang Hames	6983fc8f43	[Orc] Tweak lambda capture lists to try to avoid an ICE on gcc-4.7.2. NFC. Apparently gcc-4.7.2 is touchy about 'this' appearing in a lambda capture list along with other captures. I've rewritten my captures to try to avoid the issue. llvm-svn: 228567	2015-02-09 07:22:56 +00:00
Lang Hames	79fa1f9f13	[Orc] Fix the MSVC bots by using LLVM_EXPLICIT rather than explicit. llvm-svn: 228564	2015-02-09 04:46:41 +00:00
Lang Hames	92f9dd24ac	[Orc] Add a JITSymbol class to the Orc APIs, refactor APIs, update clients. This patch refactors a key piece of the Orc APIs: It removes the ::getSymbolAddress and ::lookupSymbolAddressIn methods, which returned target addresses (uint64_ts), and replaces them with ::findSymbol and ::findSymbolIn respectively, which return instances of the new JITSymbol type. Unlike the old methods, calling findSymbol or findSymbolIn does not cause the symbol to be immediately materialized when found. Instead, the symbol will be materialized if/when the getAddress method is called on the returned JITSymbol. This allows us to query for the existence of symbols without actually materializing them. In the future I expect more information to be attached to the JITSymbol class, for example whether the returned symbol is a weak or strong definition. This will allow us to properly handle weak symbols and multiple definitions. llvm-svn: 228557	2015-02-09 01:20:51 +00:00
Zachary Turner	71e3bf6e80	Make PDBSymbol's IPDBSymbol reference const. llvm-svn: 228553	2015-02-08 22:53:53 +00:00
Zachary Turner	754851ad80	DebugInfoPDB: Make the symbol base case hold an IPDBSession ref. Dumping a symbol often requires access to data that isn't inside the symbol hierarchy, but which is only accessible through the top-level session. This patch is a pure interface change to give symbols a reference to the session. llvm-svn: 228542	2015-02-08 20:58:09 +00:00
Bjorn Steinbrink	a6a56743c3	Correctly combine alias.scope metadata by a union instead of intersecting Summary: The alias.scope metadata represents sets of things an instruction might alias with. When generically combining the metadata from two instructions the result must be the union of the original sets, because the new instruction might alias with anything any of the original instructions aliased with. Reviewers: hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7490 llvm-svn: 228525	2015-02-08 17:07:14 +00:00
Elena Demikhovsky	40c204cf7d	Masked Gather and Scatter Intrinsics. Gather and Scatter are new introduced intrinsics, comming after recently implemented masked load and store. This is the first patch for Gather and Scatter intrinsics. It includes only the syntax, parsing and verification. Gather and Scatter intrinsics allow to perform multiple memory accesses (read/write) in one vector instruction. The intrinsics are not target specific and will have the following syntax: Gather: declare <16 x i32> @llvm.masked.gather.v16i32(<16 x i32> <vector of ptrs>, i32 <alignment>, <16 x i1> <mask>, <16 x i32> <passthru>) declare <8 x float> @llvm.masked.gather.v8f32(<8 x float><vector of ptrs>, i32 <alignment>, <8 x i1> <mask>, <8 x float><passthru>) Scatter: declare void @llvm.masked.scatter.v8i32(<8 x i32><vector value to be stored> , <8 x i32><vector of ptrs> , i32 <alignment>, <8 x i1> <mask>) declare void @llvm.masked.scatter.v16i32(<16 x i32> <vector value to be stored> , <16 x i32> <vector of ptrs>, i32 <alignment>, <16 x i1><mask> ) Vector of ptrs - a set of source/destination addresses, to load/store the value. Mask - switches on/off vector lanes to prevent memory access for switched-off lanes vector of ptrs, value and mask should have the same vector width. These are code examples where gather / scatter should be used and will allow function vectorization ;void foo1(int * restrict A, int * restrict B, int * restrict C) { ; for (int i=0; i<SIZE; i++) { ; A[i] = B[C[i]]; ; } ;} ;void foo3(int * restrict A, int * restrict B) { ; for (int i=0; i<SIZE; i++) { ; A[B[i]] = i+5; ; } ;} Tests will come in the following patches, with CodeGen and Vectorizer. http://reviews.llvm.org/D7433 llvm-svn: 228521	2015-02-08 08:27:19 +00:00
Zachary Turner	ca172bdb48	Some cleanup for libpdb. This patch implements a few of the optional suggestions from the initial patch comitting libpdb. In particular, it implements a virtual function out of line for each of the concrete classes. A few other minor cleanups exist as well, such as using override instead of virtual, etc. llvm-svn: 228516	2015-02-08 00:29:29 +00:00
Benjamin Kramer	c705a27ee2	SCEV: Compress disposition pairs. Composing DenseMaps and SmallVectors is still somewhat suboptimal, but this at least halves the size of the vector elements. NFC. llvm-svn: 228497	2015-02-07 16:41:12 +00:00
Benjamin Kramer	28fba477c6	SmallVector: Move emplace_back to SmallVectorImpl. This resolves the strange effect that emplace_back is only available when the type contained in the vector is not trivially copyable. llvm-svn: 228496	2015-02-07 16:41:02 +00:00
Benjamin Kramer	a3a195bd27	Move DebugLocs around instead of copying. llvm-svn: 228491	2015-02-07 12:28:15 +00:00
Bruce Mitchener	efa6e79a0c	Add more DWARF 5 language constants. Differential Revision: http://reviews.llvm.org/D7430 llvm-svn: 228487	2015-02-07 06:35:30 +00:00
Zachary Turner	58559798ed	Change RHS-style decltype to LHS-style decltype<declval()>. Seems some compilers don't like the RHS-style decltype specifier. This should fix the buildbots. llvm-svn: 228484	2015-02-07 02:02:23 +00:00
Duncan P. N. Exon Smith	408c33af4a	Support: Add dwarf::getVirtuality() llvm-svn: 228474	2015-02-07 00:37:15 +00:00
Duncan P. N. Exon Smith	165b84bee4	Support: Use Dwarf.def for DW_VIRTUALITY, NFC Use definition file for `DW_VIRTUALITY_*`. Add a `DW_VIRTUALITY_max` both for ease of testing and for future use by the `LLParser`. llvm-svn: 228473	2015-02-07 00:36:23 +00:00
Duncan P. N. Exon Smith	1cb5884cd9	Support: Add dwarf::getAttributeEncoding() llvm-svn: 228470	2015-02-06 23:46:49 +00:00
Duncan P. N. Exon Smith	ee716fc78e	Support: Rewrite AttributeEncodingString(), NFC llvm-svn: 228469	2015-02-06 23:45:37 +00:00
Kevin Enderby	03d099fa2a	Add code to llvm-objdump so the -section option with -macho will dump literal sections with the Mach-O S_{4,8,16}BYTE_LITERALS section types. llvm-svn: 228465	2015-02-06 23:25:38 +00:00
Duncan P. N. Exon Smith	b2bb076de9	Support: Add dwarf::getLanguage() llvm-svn: 228458	2015-02-06 22:55:13 +00:00
Duncan P. N. Exon Smith	b69d0ef5cb	Support: Rewrite dwarf::LanguageString(), NFC llvm-svn: 228457	2015-02-06 22:53:19 +00:00
Lang Hames	1179d700be	[Orc] Add more missing headers. llvm-svn: 228454	2015-02-06 22:48:43 +00:00
Zachary Turner	743a32b50f	Resubmit "Create lib/DebugInfo/PDB" (r228428) This change resubmits the patch that broke the build, this time without unittests. The unittests will be submitted separately after the problem has been addressed: --Original Commit Message-- Create lib/DebugInfo/PDB. This patch creates a platform-independent interface to a PDB reader. There is currently no implementation of this interface, which will be provided in future patches. This defines the basic object model which any implementation must conform to. Reviewed by: David Blaikie Differential Revision: http://reviews.llvm.org/D7356 llvm-svn: 228435	2015-02-06 20:30:52 +00:00
Michael Zolotukhin	9630715912	Use estimated number of optimized insns in unroll-threshold computation. If complete-unroll could help us to optimize away N% of instructions, we might want to do this even if the final size would exceed loop-unroll threshold. However, we don't want to unroll huge loop, and we are add AbsoluteThreshold to avoid that - this threshold will never be crossed, even if we expect to optimize 99% instructions after that. llvm-svn: 228434	2015-02-06 20:20:40 +00:00
Michael Zolotukhin	bbf2ac3d22	[InstSimplify] Add SimplifyFPBinOp function. It is a variation of SimplifyBinOp, but it takes into account FastMathFlags. It is needed in inliner and loop-unroller to accurately predict the transformation's outcome (previously we dropped the flags and were too conservative in some cases). Example: float foo(float a, float b) { float r; if (a[1] b) r = /* a lot of expensive computations /; else r = 1; return r; } float boo(float a) { return foo(a, 0.0); } Without this patch, we don't inline 'foo' into 'boo'. llvm-svn: 228432	2015-02-06 20:02:51 +00:00
Zachary Turner	434f023380	Revert "Create lib/DebugInfo/PDB." This reverts commit 21028, as it is causing failures in LLVMConfig. llvm-svn: 228431	2015-02-06 20:00:18 +00:00
Zachary Turner	bfc7e60f16	Create lib/DebugInfo/PDB. This patch creates a platform-independent interface to a PDB reader. There is currently no implementation of this interface, which will be provided in future patches. This defines the basic object model which any implementation must conform to. Reviewed by: David Blaikie Differential Revision: http://reviews.llvm.org/D7356 llvm-svn: 228428	2015-02-06 19:44:09 +00:00
Lang Hames	af4f511096	[Orc] Add some missing headers. llvm-svn: 228426	2015-02-06 19:34:40 +00:00
Lang Hames	fcb5b36695	[Orc] Fix syntax error in LazyEmittingLayer::removeModuleSet. This was a trivial think-o, but it's in a method of a templated class and doesn't have any callers yet, so the compiler let it pass. I hope to add a unit test to cover this soon. llvm-svn: 228425	2015-02-06 19:34:04 +00:00
Quentin Colombet	77dfe32eb3	[LiveIntervalAnalysis] Speed up creation of live ranges for physical registers by using a segment set. The patch addresses a compile-time performance regression in the LiveIntervals analysis pass (see http://llvm.org/bugs/show_bug.cgi?id=18580). This regression is especially critical when compiling long functions. Our analysis had shown that the most of time is taken for generation of live intervals for physical registers. Insertions in the middle of the array of live ranges cause quadratic algorithmic complexity, which is apparently the main reason for the slow-down. Overview of changes: - The patch introduces an additional std::set<Segment>* member in LiveRange for storing segments in the phase of initial creation. The set is used if this member is not NULL, otherwise everything works the old way. - The set of operations on LiveRange used during initial creation (i.e. used by createDeadDefs and extendToUses) have been reimplemented to use the segment set if it is available. - After a live range is created the contents of the set are flushed to the segment vector, because the set is not as efficient as the vector for the later uses of the live range. After the flushing, the set is deleted and cannot be used again. - The set is only for live ranges computed in LiveIntervalAnalysis::computeLiveInRegUnits() and getRegUnit() but not in computeVirtRegs(), because I did not bring any performance benefits to computeVirtRegs() and for some examples even brought a slow down. Patch by Vaidas Gasiunas <vaidas.gasiunas@sap.com> Differential Revision: http://reviews.llvm.org/D6013 llvm-svn: 228421	2015-02-06 18:42:41 +00:00
Adam Nemet	2dda12d192	[LV] Move addRuntimeCheck to LoopAccessAnalysis This will allow it to be shared with the new Loop Distribution pass. getFirstInst is currently duplicated across LoopVectorize.cpp and LoopAccessAnalysis.cpp. This is a short-term work-around until we figure out a better solution. NFC. (The code moved is adjusted a bit for the name of the Loop member and that PtrRtCheck is now a reference rather than a pointer.) llvm-svn: 228418	2015-02-06 18:31:04 +00:00
Matthias Braun	696b7644dd	LiveInterval: Fix SubRange memory leak. llvm-svn: 228405	2015-02-06 17:28:47 +00:00
Benjamin Kramer	c44a1f1f54	Value: Remove superfluous typedefs and deprecated method. NFC. llvm-svn: 228400	2015-02-06 14:44:02 +00:00
Ramkumar Ramachandra	39bc517234	Introduce print-memderefs to test isDereferenceablePointer Since testing the function indirectly is tricky, introduce a direct print-memderefs pass, in the same spirit as print-memdeps, which prints dereferenceability information matched by FileCheck. Differential Revision: http://reviews.llvm.org/D7075 llvm-svn: 228369	2015-02-06 01:46:42 +00:00
Ahmed Bougacha	fccf28b772	[CodeGen] Add hook/combine to form vector extloads, enabled on X86. The combine that forms extloads used to be disabled on vector types, because "None of the supported targets knows how to perform load and sign extend on vectors in one instruction." That's not entirely true, since at least SSE4.1 X86 knows how to do those sextloads/zextloads (with PMOVS/ZX). But there are several aspects to getting this right. First, vector extloads are controlled by a profitability callback. For instance, on ARM, several instructions have folded extload forms, so it's not always beneficial to create an extload node (and trying to match extloads is a whole 'nother can of worms). The interesting optimization enables folding of s/zextloads to illegal (splittable) vector types, expanding them into smaller legal extloads. It's not ideal (it introduces some legalization-like behavior in the combine) but it's better than the obvious alternative: form illegal extloads, and later try to split them up. If you do that, you might generate extloads that can't be split up, but have a valid ext+load expansion. At vector-op legalization time, it's too late to generate this kind of code, so you end up forced to scalarize. It's better to just avoid creating egregiously illegal nodes. This optimization is enabled unconditionally on X86. Note that the splitting combine is happy with "custom" extloads. As is, this bypasses the actual custom lowering, and just unrolls the extload. But from what I've seen, this is still much better than the current custom lowering, which does some kind of unrolling at the end anyway (see for instance load_sext_4i8_to_4i64 on SSE2, and the added FIXME). Also note that the existing combine that forms extloads is now also enabled on legal vectors. This doesn't have a big effect on X86 (because sext+load is usually combined to sext_inreg+aextload). On ARM it fires on some rare occasions; that's for a separate commit. Differential Revision: http://reviews.llvm.org/D6904 llvm-svn: 228325	2015-02-05 18:31:02 +00:00
Ahmed Bougacha	2687e714ca	[CodeGen] Add isLoadExtLegalOrCustom helper to TargetLowering. llvm-svn: 228322	2015-02-05 18:15:59 +00:00
Michael Kuperstein	8ed19d08b2	Teach isDereferenceablePointer() to look through bitcast constant expressions. This fixes a LICM regression due to the new load+store pair canonicalization. Differential Revision: http://reviews.llvm.org/D7411 llvm-svn: 228284	2015-02-05 09:15:37 +00:00
Matt Arsenault	1c62631bbb	Add addrspacecast node to tablegen The node is still defined oddly so that the address spaces are not operands and not accessible from tablegen, but as-is this can now be used to write a ComplexPattern with an addrspacecast root node. llvm-svn: 228270	2015-02-05 03:35:34 +00:00
Matt Arsenault	e6e242235b	Add support for double / float to EndianStream Also add new unit tests for endian::Writer llvm-svn: 228269	2015-02-05 03:30:08 +00:00
Cameron Esfahani	a75b0eb54b	Value soft float calls as more expensive in the inliner. Summary: When evaluating floating point instructions in the inliner, ask the TTI whether it is an expensive operation. By default, it's not an expensive operation. This keeps the default behavior the same as before. The ARM TTI has been updated to return back TCC_Expensive for targets which don't have hardware floating point. Reviewers: chandlerc, echristo Reviewed By: echristo Subscribers: t.p.northover, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D6936 llvm-svn: 228263	2015-02-05 02:09:33 +00:00
Duncan P. N. Exon Smith	fe2e84e098	IR: Split out getOperandAs(), NFC llvm-svn: 228250	2015-02-05 01:07:47 +00:00
Sean Silva	9d2bea5968	[MC] Remove various unused MCAsmInfo parameters. llvm-svn: 228244	2015-02-05 00:58:51 +00:00
Duncan P. N. Exon Smith	7f33c007a0	ADT: Add int64_t interoperability to APSInt Add some API to `APSInt` to make it easier to compare with `int64_t`. - `APSInt::compareValues(APSInt, APSInt)` returns 1, -1 or 0 for greater, lesser, or equal, doing the right thing for mismatched "has-sign" and bitwidths. This is just like `isSameValue()` (and is now the implementation of it). - `APSInt::get(int64_t)` gets a signed `APSInt`. - `operator<(int64_t)`, etc., are implemented trivially via `get()` and `compareValues()`. - Also added `APSInt::getUnsigned(uint64_t)` to make it easier to test `compareValues()`. llvm-svn: 228239	2015-02-05 00:17:43 +00:00
Reid Kleckner	8127f1fbb0	Remove useless call to isOSCygMing() This used to do something when we modeled the Cygwin and MinGW environments as distinct OSs, but now it is not needed. llvm-svn: 228229	2015-02-04 23:17:19 +00:00
Matthias Braun	84aaa1dd81	MachineCSE: Clear dead-def flag on CSE. In case CSE reuses a previoulsy unused register the dead-def flag has to be cleared on the def operand, as exposed by the arm64-cse.ll test. This fixes PR22439 and the corresponding rdar://19694987 Differential Revision: http://reviews.llvm.org/D7395 llvm-svn: 228178	2015-02-04 19:35:16 +00:00
Reid Kleckner	6182b47caf	Add range adapters predecessors() and successors() for BBs Use them in two isolated transforms so we know they work and aren't dead code. llvm-svn: 228173	2015-02-04 19:14:57 +00:00
Juergen Ributzka	46ea5d06b7	Add missing include. llvm-svn: 228161	2015-02-04 18:16:53 +00:00
Alexey Samsonov	f9eb672e1c	SpecialCaseList: Add support for parsing multiple input files. Summary: This change allows users to create SpecialCaseList objects from multiple local files. This is needed to implement a proper support for -fsanitize-blacklist flag (allow users to specify multiple blacklists, in addition to default blacklist, see PR22431). DFSan can also benefit from this change, as DFSan instrumentation pass now accepts ABI-lists both from -fsanitize-blacklist= and -mllvm -dfsan-abilist flags. Go bindings are fixed accordingly. Test Plan: regression test suite Reviewers: pcc Subscribers: llvm-commits, axw, kcc Differential Revision: http://reviews.llvm.org/D7367 llvm-svn: 228155	2015-02-04 17:39:48 +00:00
Philip Reames	bea8f6fd03	Add a pass for inserting safepoints into (nearly) arbitrary IR This pass is responsible for figuring out where to place call safepoints and safepoint polls. It doesn't actually make the relocations explicit; that's the job of the RewriteStatepointsForGC pass (http://reviews.llvm.org/D6975). Note that this code is not yet finalized. Its moving in tree for incremental development, but further cleanup is needed and will happen over the next few days. It is not yet part of the standard pass order. Planned changes in the near future: - I plan on restructuring the statepoint rewrite to use the functions add to the IRBuilder a while back. - In the current pass, the function "gc.safepoint_poll" is treated specially but is not an intrinsic. I plan to make identifying the poll function a property of the GCStrategy at some point in the near future. - As follow on patches, I will be separating a collection of test cases we have out of tree and submitting them upstream. - It's not explicit in the code, but these two patches are introducing a new state for a statepoint which looks a lot like a patchpoint. There's no a transient form which doesn't yet have the relocations explicitly represented, but does prevent reordering of memory operations. Once this is in, I need to update actually make this explicit by reserving the 'unused' argument of the statepoint as a flag, updating the docs, and making the code explicitly check for such a thing. This wasn't really planned, but once I split the two passes - which was done for other reasons - the intermediate state fell out. Just reminds us once again that we need to merge statepoints and patchpoints at some point in the not that distant future. Future directions planned: - Identifying more cases where a backedge safepoint isn't required to ensure timely execution of a safepoint poll. - Tweaking the insertion process to generate easier to optimize IR. (For example, investigating making SplitBackedge) the default. - Adding opt-in flags for a GCStrategy to use this pass. Once done, add this pass to the actual pass ordering. Differential Revision: http://reviews.llvm.org/D6981 llvm-svn: 228090	2015-02-04 00:37:33 +00:00
Justin Bogner	620c405abf	InstrProf: Make CounterMappingRegions less confusing to construct Creating empty and expansion regions is awkward with the current API. Expose static methods to make this simpler. llvm-svn: 228075	2015-02-03 23:59:33 +00:00
Arnaud A. de Grandmaison	775136711f	[PBQP] Provide more information in the debug prints Based on a patch by Jonas Paulsson llvm-svn: 228068	2015-02-03 23:40:24 +00:00
Arnaud A. de Grandmaison	352dc10d81	[PBQP] Constify Graph::getEdgeNode1Id and Graph::getEdgeNode2Id llvm-svn: 228048	2015-02-03 22:02:45 +00:00
Duncan P. N. Exon Smith	55694c075d	IR: Assembly and bitcode for GenericDebugNode llvm-svn: 228041	2015-02-03 21:54:14 +00:00
Justin Bogner	84c1a035e8	InstrProf: Remove CoverageMapping::HasCodeBefore, it isn't used It's not entirely clear to me what this field was meant for, but it's always false. Remove it. llvm-svn: 228034	2015-02-03 21:35:36 +00:00
Duncan P. N. Exon Smith	cd5f9211e8	Support: Add string => unsigned mapping for DW_TAG Add `dwarf::getTag()` to translate from `StringRef` to `unsigned`. llvm-svn: 228031	2015-02-03 21:16:49 +00:00
Duncan P. N. Exon Smith	81de9dc80e	Support: Re-implement dwarf::TagString() using a .def file, NFC Also re-implements the `dwarf::Tag` enumerator. I've moved the mock tags into the enumerator since there's no other way to do this. Really they shouldn't be used at all (they're just a hack to identify `MDNode`s, but we have a class hierarchy for that now). llvm-svn: 228030	2015-02-03 21:13:16 +00:00
Colin LeMahieu	3534179cf0	[Hexagon] Converting XTYPE/SHIFT intrinsics. Cleaning out old intrinsic patterns and updating tests. llvm-svn: 228026	2015-02-03 20:40:52 +00:00
Jingyue Wu	4e99b65428	Add straight-line strength reduction to LLVM Summary: Straight-line strength reduction (SLSR) is implemented in GCC but not yet in LLVM. It has proven to effectively simplify statements derived from an unrolled loop, and can potentially benefit many other cases too. For example, LLVM unrolls #pragma unroll foo (int i = 0; i < 3; ++i) { sum += foo((b + i) * s); } into sum += foo(b * s); sum += foo((b + 1) * s); sum += foo((b + 2) * s); However, no optimizations yet reduce the internal redundancy of the three expressions: b * s (b + 1) * s (b + 2) * s With SLSR, LLVM can optimize these three expressions into: t1 = b * s t2 = t1 + s t3 = t2 + s This commit is only an initial step towards implementing a series of such optimizations. I will implement more (see TODO in the file commentary) in the near future. This optimization is enabled for the NVPTX backend for now. However, I am more than happy to push it to the standard optimization pipeline after more thorough performance tests. Test Plan: test/StraightLineStrengthReduce/slsr.ll Reviewers: eliben, HaoLiu, meheff, hfinkel, jholewinski, atrick Reviewed By: jholewinski, atrick Subscribers: karthikthecool, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7310 llvm-svn: 228016	2015-02-03 19:37:06 +00:00
Manman Ren	ecc02c6b0a	[LTO API] split lto_codegen_compile to lto_codegen_optimize and lto_codegen_compile_optimized. Also add lto_api_version. Before this commit, we can only dump the optimized bitcode after running lto_codegen_compile, but it includes some impacts of running codegen passes, one example is StackProtector pass. We will get assertion failure when running llc on the optimized bitcode, because StackProtector is effectively run twice. After splitting lto_codegen_compile, the linker can choose to dump the bitcode before running lto_codegen_compile_optimized. lto_api_version is added so ld64 can check for runtime-availability of the new API. rdar://19565500 llvm-svn: 228000	2015-02-03 18:39:15 +00:00
Adam Nemet	e6e4bf975c	[LoopVectorize] Fix rebase glitch in r227751 LoopVectorizationLegality::{getNumLoads,getNumStores} should forward to LoopAccessAnalysis now. Thanks to Takumi for noticing this! llvm-svn: 227992	2015-02-03 17:59:53 +00:00
Eric Christopher	cc62f1ae1b	Only access TLOF via the TargetMachine, not TargetLowering. llvm-svn: 227949	2015-02-03 07:22:52 +00:00
Lang Hames	624168574b	[PBQP Regalloc] Pre-spill vregs that have no legal physregs. The PBQP::RegAlloc::MatrixMetadata class assumes that matrices have at least two rows/columns (for the spill option plus at least one physreg). This patch ensures that that invariant is met by pre-spilling vregs that have no physreg options so that no node (and no corresponding edges) need be added to the PBQP graph. This fixes a bug in an out-of-tree target that was identified by Jonas Paulsson. Thanks for tracking this down Jonas! llvm-svn: 227942	2015-02-03 06:14:06 +00:00
Justin Bogner	67c8b5392c	InstrProf: Simplify RawCoverageMappingReader's API slightly This is still kind of a weird API, but dropping the (partial) update of the passed in CoverageMappingRecord makes it a little easier to understand and use. llvm-svn: 227900	2015-02-03 00:20:11 +00:00
Jingyue Wu	34a8e5e1ea	Resurrect the assertion removed by r227717 Summary: MSVC can compile "LoopID->getOperand(0) == LoopID" when LoopID is MDNode*. Test Plan: no regression Reviewers: mkuper Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7327 llvm-svn: 227853	2015-02-02 20:41:11 +00:00
Duncan P. N. Exon Smith	267f14474d	IR: Allow GenericDebugNode construction from MDString Allow `GenericDebugNode` construction directly from `MDString`, rather than requiring `StringRef`s. I've refactored the `StringRef` constructors to use these. There's no real functionality change here, except for exposing the lower-level API. The purpose of this is to simplify construction of string operands when reading bitcode. It's unnecessarily indirect to parse an `MDString` ID, lookup the `MDString` in the bitcode reader list, get the `StringRef` out of that, and then have `GenericDebugNode::getImpl()` use `MDString::get()` to acquire the original `MDString`. Instead, this allows the bitcode reader to directly pass in the `MDString`. llvm-svn: 227848	2015-02-02 20:01:03 +00:00
Duncan P. N. Exon Smith	4505bfa490	IR: Extract DEFINE_MDNODE_GET(), NFC llvm-svn: 227847	2015-02-02 19:55:21 +00:00
Duncan P. N. Exon Smith	25ffa9ed9c	IR: Separate helpers for string operands, NFC llvm-svn: 227846	2015-02-02 19:54:05 +00:00
Duncan P. N. Exon Smith	439bf9404e	IR: Split out DebugInfoMetadata.h, NFC Move debug-info-centred `Metadata` subclasses into their own header/source file. A couple of private template functions are needed from both `Metadata.cpp` and `DebugInfoMetadata.cpp`, so I've moved them to `lib/IR/MetadataImpl.h`. llvm-svn: 227835	2015-02-02 18:53:21 +00:00
David Blaikie	87c973c9d7	STLExtras: Provide less/equal functors with templated function call operators, plus a deref'ing functor template utility Similar to the C++14 void specializations of these templates, useful as a stop-gap until LLVM switches to '14. Example use-cases in tblgen because I saw some functors that looked like they could be simplified/refactored. Reviewers: dexonsmith Differential Revision: http://reviews.llvm.org/D7324 llvm-svn: 227828	2015-02-02 18:35:10 +00:00
Duncan P. N. Exon Smith	1f7b5ff9bc	Fix some file headers, NFC llvm-svn: 227826	2015-02-02 18:20:15 +00:00
Eric Christopher	2aab2ce529	Remove unnecessary forward declaration. llvm-svn: 227813	2015-02-02 17:38:40 +00:00
Lang Hames	dcc8377028	[Orc] Make the ObjectLinkingLayer take ownership of object files until finalization time. As currently implemented, RuntimeDyldELF requires the original object file to be avaible when relocations are being resolved. This patch ensures that the ObjectLinkingLayer preserves it until then. In the future RuntimeDyldELF should be rewritten to remove this requirement, at which point this patch can be reverted. Regression test cases for Orc (which include coverage of this bug) will be committed shortly. llvm-svn: 227778	2015-02-02 04:32:17 +00:00
Lang Hames	8275580ce1	[Orc] Add sensible defaults for the ObjectLinkingLayer constructor. llvm-svn: 227776	2015-02-02 01:03:10 +00:00
Benjamin Kramer	18cda2e8dc	FoldingSetVectorIterator is just a subset of pointee_iterator, remove it. llvm-svn: 227761	2015-02-01 19:26:05 +00:00
Adam Nemet	83140dfa69	Include cstddef in EquivalenceClasses.h This is to try to appease bots complaining that ptrdiff_t is undefined in LoopAccessAnalysis.cpp. llvm-svn: 227757	2015-02-01 17:21:06 +00:00
Adam Nemet	2884269478	[LoopVectorize] Move LoopAccessAnalysis to its own module Other than moving code and adding the boilerplate for the new files, the code being moved is unchanged. There are a few global functions that are shared with the rest of the LoopVectorizer. I moved these to the new module as well (emitLoopAnalysis, stripIntegerCast, replaceSymbolicStrideSCEV) along with the Report class used by emitLoopAnalysis. There is probably room for further improvement in this area. I kept DEBUG_TYPE "loop-vectorize" because it's used as the PassName with emitOptimizationRemarkAnalysis. This will obviously have to change. NFC. This is part of the patchset that splits out the memory dependence logic from LoopVectorizationLegality into a new class LoopAccessAnalysis. LoopAccessAnalysis will be used by the new Loop Distribution pass. llvm-svn: 227756	2015-02-01 16:56:15 +00:00
Adam Nemet	287f8b34a3	[LoopVectorize] Make hasVectorInstrinsicScalarOpd inline VectorUtils.h needs to be included in LoopAccessAnalysis.cpp for getIntrinsicIDForCall but hasVectorInstrinsicScalarOpd is not used by this module. NFC. This is part of the patchset that splits out the memory dependence logic from LoopVectorizationLegality into a new class LoopAccessAnalysis. LoopAccessAnalysis will be used by the new Loop Distribution pass. llvm-svn: 227753	2015-02-01 16:56:05 +00:00
Michael Kuperstein	41ae9af2e3	[X86] Convert esp-relative movs of function arguments to pushes, step 2 This moves the transformation introduced in r223757 into a separate MI pass. This allows it to cover many more cases (not only cases where there must be a reserved call frame), and perform rudimentary call folding. It still doesn't have a heuristic, so it is enabled only for optsize/minsize, with stack alignment <= 8, where it ought to be a fairly clear win. (Re-commit of r227728) Differential Revision: http://reviews.llvm.org/D6789 llvm-svn: 227752	2015-02-01 16:56:04 +00:00

... 2 3 4 5 6 ...

22128 Commits