llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Benjamin Kramer	4579422ac9	Update the docs to require at least MSVC 2013. llvm-svn: 229323	2015-02-15 19:34:17 +00:00
Arnaud A. de Grandmaison	1c6984dc94	[PBQP] Assert conservativelly allocatable nodes are spilled by choice. llvm-svn: 229302	2015-02-15 10:35:31 +00:00
Ramkumar Ramachandra	af4f23c6ae	InstCombine: propagate deref via new addDereferenceableAttr The "dereferenceable" attribute cannot be added via .addAttribute(), since it also expects a size in bytes. AttrBuilder#addAttribute or AttributeSet#addAttribute is wrapped by classes Function, InvokeInst, and CallInst. Add corresponding wrappers to AttrBuilder#addDereferenceableAttr. Having done this, propagate the dereferenceable attribute via gc.relocate, adding a test to exercise it. Note that -datalayout is required during execution over and above -instcombine, because InstCombine only optionally requires DataLayoutPass. Differential Revision: http://reviews.llvm.org/D7510 llvm-svn: 229265	2015-02-14 19:37:54 +00:00
Richard Smith	a95b03186b	[modules] Try harder to stop DebugInfo/PDB/DIA being built if not available. llvm-svn: 229243	2015-02-14 05:54:56 +00:00
Zachary Turner	ee319cd955	llvm-pdbdump: Only dump whitelisted global symbols. Dumping the global scope contains a lot of very uninteresting things and is generally polluted with a lot of random junk. Furthermore, it dumps values unsorted, making it hard to read. This patch dumps known interesting types only, and as a side effect sorts the list by symbol type. llvm-svn: 229232	2015-02-14 03:54:28 +00:00
Duncan P. N. Exon Smith	33685ffe5d	CodeGen: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) Also, add `Function::getFnStackAlignment()`, and canonicalize: getAttributes().getStackAlignment(AttributeSet::FunctionIndex) => getFnStackAlignment() llvm-svn: 229208	2015-02-14 01:44:41 +00:00
Matthias Braun	cad0e84d8e	Revert "On ELF, put PIC jump tables in a non executable section." This reverts commit r228939. The commit broke something in the output of exception handling tables on darwin x86-64. llvm-svn: 229203	2015-02-14 01:16:54 +00:00
Richard Smith	571f7ccf5d	[modules] Split off a separate module for DebugInfo/PDB/DIA so that its headers don't get included on systems where the DIA SDK is unavailable. llvm-svn: 229200	2015-02-14 00:47:20 +00:00
Reid Kleckner	56669ab852	Unify the two EH personality classification routines I wrote We only need one. llvm-svn: 229193	2015-02-14 00:21:02 +00:00
Frederic Riss	f729d83709	DWARFUnit: Add a couple of helpers to access the DIE array. To be used in dsymutil (or any other client that wants to take advantage of the fact that DIEs are stored in a vector). llvm-svn: 229179	2015-02-13 23:18:24 +00:00
Richard Smith	471dcba2f5	[modules] Mark include/llvm/Support/Dwarf.def as being a textually-included header. llvm-svn: 229154	2015-02-13 21:06:45 +00:00
Richard Smith	1007da1563	Clean up some inappropriate choices of type in the bitcode reader. None of these are expected to fix any 64->32 bit real truncation issues. llvm-svn: 229153	2015-02-13 21:05:11 +00:00
Benjamin Kramer	2220034e93	Reapply r229142 with some enable_if magic to avoid memcpying between differing types. Original commit message: SmallVector: Resolve a long-standing fixme by using the existing unitialized_copy dispatch. This makes append() use memcpy for trivially copyable types. llvm-svn: 229149	2015-02-13 20:45:14 +00:00
Benjamin Kramer	b6f1935109	Revert r229142. It breaks the world for unknown reasons. llvm-svn: 229144	2015-02-13 19:45:28 +00:00
Benjamin Kramer	e9e8f9f6de	SmallVector: Resolve a long-standing fixme by using the existing unitialized_copy dispatch. This makes append() use memcpy for trivially copyable types. llvm-svn: 229142	2015-02-13 19:20:39 +00:00
Zachary Turner	56655b0bb0	llvm-pdbdump: Improve printing of functions and signatures. This correctly prints the function pointers, and also prints function signatures for symbols as opposed to just types. So actual functions in your program will now be printed with full name and signature, as opposed to just name as before. llvm-svn: 229129	2015-02-13 17:57:09 +00:00
Arnaud A. de Grandmaison	b1fde3b904	[PBQP] Conservativelly allocatable nodes can be spilled and give a better solution Although such nodes are allocatable, the cost of spilling may be less than allocating to register, so spilling the node may provide a better solution. The assert does not account for this case, so remove it for now. llvm-svn: 229103	2015-02-13 12:04:42 +00:00
Chandler Carruth	18e8c62883	[PM] Remove the old 'PassManager.h' header file at the top level of LLVM's include tree and the use of using declarations to hide the 'legacy' namespace for the old pass manager. This undoes the primary modules-hostile change I made to keep out-of-tree targets building. I sent an email inquiring about whether this would be reasonable to do at this phase and people seemed fine with it, so making it a reality. This should allow us to start bootstrapping with modules to a certain extent along with making it easier to mix and match headers in general. The updates to any code for users of LLVM are very mechanical. Switch from including "llvm/PassManager.h" to "llvm/IR/LegacyPassManager.h". Qualify the types which now produce compile errors with "legacy::". The most common ones are "PassManager", "PassManagerBase", and "FunctionPassManager". llvm-svn: 229094	2015-02-13 10:01:29 +00:00
Chandler Carruth	33dabe4f44	Re-sort #include lines using my handy dandy ./utils/sort_includes.py script. This is in preparation for changes to lots of include lines. llvm-svn: 229088	2015-02-13 09:09:03 +00:00
Zachary Turner	0d71c47d7a	Fix the windows build again. Grrr, MSVC. llvm-svn: 229081	2015-02-13 07:55:29 +00:00
Chandler Carruth	6c78cd7569	Revert a series of commits starting at r228886 which is triggering some regressions for LLDB on Linux. Rafael indicated on lldb-dev that we should just go ahead and revert these but that he wasn't at a computer. The patches backed out are as follows: r228980: Add support for having multiple sections with the name and ... r228889: Invert the section relocation map. r228888: Use the existing SymbolTableIndex intsead of doing a lookup. r228886: Create the Section -> Rel Section map when it is first needed. These patches look pretty nice to me, so hoping its not too hard to get them re-instated. =D llvm-svn: 229080	2015-02-13 07:52:39 +00:00
Zachary Turner	fb935fca62	Fix non-windows builds unhappy about a missing header. llvm-svn: 229079	2015-02-13 07:45:49 +00:00
Zachary Turner	96be932905	llvm-pdbdump: Add more comprehensive dumping of symbol types. In particular this patch adds the ability to dump complete function signature information including argument types as correctly formatted strings. A side effect of this is that almost all symbol and meta types are now formatted. llvm-svn: 229076	2015-02-13 07:40:03 +00:00
Craig Topper	eaf6d626b1	[X86] Remove int_x86_sse2_psll_dq_bs and int_x86_sse2_psrl_dq_bs intrinsics. The builtins aren't used by clang. llvm-svn: 229069	2015-02-13 06:07:24 +00:00
Craig Topper	3ab5637fda	[X86] Remove references to builtin names that have been removed from clang. Hope to remove the intrinsics themselves soon. llvm-svn: 229068	2015-02-13 06:07:14 +00:00
Duncan P. N. Exon Smith	c41b59bc4f	IR: Drop never-used defaults for DIBuilder::createTemplate*(), NFC No caller specifies anything different; these parameters are dead code and probably always have been. The new hierarchy doesn't bother with the fields at all (see r228607 and r228652). llvm-svn: 229037	2015-02-13 03:35:29 +00:00
Duncan P. N. Exon Smith	71f770946c	Bitcode: Remove confusing '?' from r229004, NFC The name is always part of the record, it just might be empty. Remove the `?` for clarity. llvm-svn: 229032	2015-02-13 02:43:38 +00:00
Duncan P. N. Exon Smith	84cf08e569	Bitcode: Add trailing comma to MetadataCodes, NFC Suggested in the review of r229004, this should simplify diffs in the future. llvm-svn: 229031	2015-02-13 02:41:36 +00:00
Duncan P. N. Exon Smith	8dc64a4707	AsmWriter/Bitcode: MDImportedEntity llvm-svn: 229025	2015-02-13 01:46:02 +00:00
Duncan P. N. Exon Smith	baf6eacc58	AsmWriter/Bitcode: MDObjCProperty llvm-svn: 229024	2015-02-13 01:43:22 +00:00
Duncan P. N. Exon Smith	e023c0f5eb	AsmWriter/Bitcode: MDExpression llvm-svn: 229023	2015-02-13 01:42:09 +00:00
Duncan P. N. Exon Smith	c9450daed2	AsmWriter/Bitcode: MDLocalVariable llvm-svn: 229022	2015-02-13 01:39:44 +00:00
Duncan P. N. Exon Smith	58b49ba795	AsmWriter/Bitcode: MDGlobalVariable llvm-svn: 229020	2015-02-13 01:35:40 +00:00
Duncan P. N. Exon Smith	d136432599	AsmWriter/Bitcode: MDTemplate{Type,Value}Parameter llvm-svn: 229019	2015-02-13 01:34:32 +00:00
Duncan P. N. Exon Smith	c96d92ad70	AsmWriter/Bitcode: MDNamespace llvm-svn: 229018	2015-02-13 01:32:09 +00:00
Duncan P. N. Exon Smith	affacdfc5b	AsmWriter/Bitcode: MDLexicalBlockFile llvm-svn: 229017	2015-02-13 01:30:42 +00:00
Duncan P. N. Exon Smith	b3ef6197cf	AsmWriter/Bitcode: MDLexicalBlock llvm-svn: 229016	2015-02-13 01:29:28 +00:00
Duncan P. N. Exon Smith	52584d6996	AsmWriter/Bitcode: MDSubprogram llvm-svn: 229014	2015-02-13 01:26:47 +00:00
Duncan P. N. Exon Smith	21bc2cacec	AsmWriter/Bitcode: MDCompileUnit llvm-svn: 229013	2015-02-13 01:25:10 +00:00
Zachary Turner	bacf14945c	Improve llvm-pdbdump output display. This patch adds a number of improvements to llvm-pdbdump. 1) Dumping of the entire global scope, and not only those symbols that live in individual compilands. 2) Prepend class name to member functions and data 3) Improved display of bitfields. 4) Support for dumping more kinds of data symbols. llvm-svn: 229012	2015-02-13 01:23:51 +00:00
Duncan P. N. Exon Smith	51dcb8de94	AsmWriter/Bitcode: MDSubroutineType llvm-svn: 229011	2015-02-13 01:22:59 +00:00
Duncan P. N. Exon Smith	c4bb6d7bbb	AsmWriter/Bitcode: MDDerivedType and MDCompositeType llvm-svn: 229009	2015-02-13 01:20:38 +00:00
Duncan P. N. Exon Smith	4428ff1087	AsmWriter/Bitcode: MDFile llvm-svn: 229007	2015-02-13 01:19:14 +00:00
Duncan P. N. Exon Smith	38e2854cc3	AsmWriter/Bitcode: MDBasicType llvm-svn: 229005	2015-02-13 01:14:58 +00:00
Duncan P. N. Exon Smith	8b689964a4	AsmWriter/Bitcode: MDEnumerator llvm-svn: 229004	2015-02-13 01:14:11 +00:00
Duncan P. N. Exon Smith	9879c4ea87	AsmWriter/Bitcode: MDSubrange llvm-svn: 229003	2015-02-13 01:10:38 +00:00
Duncan P. N. Exon Smith	94f67658e0	IR: Add MDExpression::ExprOperand Port `DIExpression::Operand` over to `MDExpression::ExprOperand`. The logic is needed directly in `MDExpression` to support printing in assembly. llvm-svn: 229002	2015-02-13 01:07:46 +00:00
Duncan P. N. Exon Smith	661eb5dea8	Support: Add dwarf::getOperationEncoding() llvm-svn: 229001	2015-02-13 01:05:00 +00:00
Duncan P. N. Exon Smith	980b9ef6c3	Support: Rewrite LocationAtom and OperationEncodingString(), NFC Use `Dwarf.def` more. llvm-svn: 229000	2015-02-13 01:04:08 +00:00
Akira Hatanaka	53f74bf662	[LinkModules] Change the way ModuleLinker merges triples. This commit makes the following changes: - Stop issuing a warning when the triples' string representations do not match exactly if the Triple objects generated from the strings compare equal. - On Apple platforms, choose the triple that has the larger minimum version number. rdar://problem/16743513 Differential Revision: http://reviews.llvm.org/D7591 llvm-svn: 228999	2015-02-13 00:40:41 +00:00
Rafael Espindola	4467ec2e41	Add support for having multiple sections with the same name and comdat. Using this in combination with -ffunction-sections allows LLVM to output a .o file with mulitple sections named .text. This saves space by avoiding long unique names of the form .text.<C++ mangled name>. llvm-svn: 228980	2015-02-12 23:29:51 +00:00
David Blaikie	52492e048a	Add missing override. llvm-svn: 228974	2015-02-12 22:58:53 +00:00
Zachary Turner	8f5e13b9c7	Attempt to fix the build again. llvm-svn: 228964	2015-02-12 21:25:58 +00:00
Zachary Turner	5a969b378c	Attempt to fix Linux builds after r228960. llvm-svn: 228962	2015-02-12 21:17:07 +00:00
Rafael Espindola	83518ea75f	Remove mostly unused setters. Most of the code was setting the TargetOptions directly. llvm-svn: 228961	2015-02-12 21:16:34 +00:00
Zachary Turner	6cde1e9388	Add concrete type overloads to PDBSymbol::findChildren(). Frequently you only want to iterate over children of a specific type (e.g. functions). Previously you would get back a generic interface that allowed iteration over the base symbol type, which you would have to dyn_cast<> each one of. With this patch, we allow the user to specify the concrete type as a template parameter, and it will return an iterator which returns instances of the concrete type directly. llvm-svn: 228960	2015-02-12 21:09:24 +00:00
Rafael Espindola	5feecddc53	On ELF, put PIC jump tables in a non executable section. Fixes PR22558. llvm-svn: 228939	2015-02-12 17:46:49 +00:00
Rafael Espindola	fb65819e24	Put each jump table in an independent section if the function is too. This allows the linker to GC both, fixing pr22557. llvm-svn: 228937	2015-02-12 17:16:46 +00:00
Benjamin Kramer	4b76aa3d46	MathExtras: Bring Count(Trailing\|Leading)Ones and CountPopulation in line with countTrailingZeros Update all callers. llvm-svn: 228930	2015-02-12 15:35:40 +00:00
Andrea Di Biagio	7ca0db442c	[TTI] Teach the cost heuristic how to query TLI to check if a zext/trunc is 'free' for the target. Now that SimplifyCFG uses TTI for the cost heuristic, we can teach BasicTTIImpl how to query TLI in order to get a more accurate cost for truncates and zero-extends. Before this patch, the basic cost heuristic in TargetTransformInfoImplCRTPBase would have conservatively returned a 'default' TCC_Basic for all zero-extends, and TCC_Free for truncates on native types. This patch improves the heuristic so that we query TLI (if available) to get more accurate answers. If TLI is available, then methods 'isZExtFree' and 'isTruncateFree' can be used to check if a zext/trunc is free for the target. Added more test cases to SimplifyCFG/X86/speculate-cttz-ctlz.ll. With this change, SimplifyCFG is now able to speculate a 'cheap' cttz/ctlz immediately followed by a free zext/trunc. Differential Revision: http://reviews.llvm.org/D7585 llvm-svn: 228923	2015-02-12 14:17:24 +00:00
Benjamin Kramer	c7a7636094	BitVector: Remove manual bit width dispatch, this is handled by templates NFC. llvm-svn: 228922	2015-02-12 14:02:58 +00:00
Benjamin Kramer	d08a40831d	MathExtras: Parametrize count(Trailing\|Leading)Zeros on the type size. Otherwise we will always select the generic version for e.g. unsigned long if uint64_t is typedef'd to 'unsigned long long'. Also remove enable_if hacks in favor of static_assert. llvm-svn: 228921	2015-02-12 13:47:29 +00:00
Adrian Prantl	9ec54ab53b	Generalize DIBuilder's createReplaceableForwardDecl() to a more flexible createReplaceableCompositeType() that allows to create non-forward-declared temporary nodes. Paired commit with CFE. llvm-svn: 228852	2015-02-11 17:45:05 +00:00
Andrea Di Biagio	70c7608263	[TTI] Improved cost heuristic for cttz/ctlz calls. This patch is a follow-up of r228826 (see code-review: D7506). Now that SimplifyCFG uses TargetTransformInfo for cost analysis, we have to fix the cost heuristic for intrinsic calls to cttz/ctlz. This patch defines method 'getIntrinsicCost' in BasicTTIImpl: now, BasicTTIImpl queries TLI to check if a call to cttz/ctlz is cheap for the target. Added test cases in Transforms/SimplifyCFG/X86 to verify that on x86, SimplifyCFG only speculates a call to cttz/ctlz if it is cheap. Differential Revision: http://reviews.llvm.org/D7554 llvm-svn: 228829	2015-02-11 14:22:18 +00:00
Arnaud A. de Grandmaison	bfad2ea31a	[PBQP] Cautiously update edge costs in the solver The NodeMetadata are maintained in an incremental way. When an edge between 2 nodes has its cost updated, in the course of graph reduction for example, the NodeMetadata need first to have the old edge cost removed, then the new edge cost added. Only once the NodeMetadata have been fully updated, it becomes safe to consider promoting the nodes to the ConservativelyAllocatable or OptimallyReducible sets. Previously, this promotion was occuring right after the removing the old cost, and this was breaking the assumption that a ConservativelyAllocatable should not be spilled. This patch also adds asserts to: - enforces the invariant that a node's reduction can not be downgraded, - only not provably allocatable or optimally reducible nodes can be spilled. llvm-svn: 228816	2015-02-11 08:25:36 +00:00
Reid Kleckner	86643b627c	Don't promote asynch EH invokes of nounwind functions to calls If the landingpad of the invoke is using a personality function that catches asynch exceptions, then it can catch a trap. Also add some landingpads to invalid LLVM IR test cases that lack them. Over-the-shoulder reviewed by David Majnemer. llvm-svn: 228782	2015-02-11 01:23:16 +00:00
Zachary Turner	473b4aac78	Rewrite llvm-pdbdump in terms of LLVMDebugInfoPDB. This makes llvm-pdbdump available on all platforms, although it will currently fail to create a dumper if there is no PDB reader implementation for the current platform. It implements dumping of compilands and children, which is less information than was previously available, but it has to be rewritten from scratch using the new set of interfaces, so the rest of the functionality will be added back in subsequent commits. llvm-svn: 228755	2015-02-10 22:43:25 +00:00
Zachary Turner	422f6e29d5	Provide DIA implementation of DebugInfoPDB. This implements DebugInfoPDB when the DIA SDK is present on the system. Specifically, this means that the following conditions are met: 1) You are building on Windows. 2) You are building with MSVC. 3) Visual Studio did not corrupt the installation of DIA due to a known issue with side-by-side installations of VS2012 and VS2013. If all of these conditions are true, you will be able to pass a value of PDB_Reader::DIA to PDB::createPdbReader(). There are no tests for this yet, as any test will be in the form of a lit test which tests the llvm-pdbdump.exe, which still needs to be rewritten in terms of this library. llvm-svn: 228747	2015-02-10 21:17:52 +00:00
Aaron Ballman	e5024a035a	Now use the __debugbreak intrinsic instead of calling RaiseException; it requires no forward declares and still calls VEH. llvm-svn: 228745	2015-02-10 21:13:04 +00:00
Aaron Ballman	39da612547	Changing the status code generated by LLVM_BUILTIN_TRAP on Windows to be something categorized as a valid error code. Fixes crashing uses (such as not --crash) with existing sys::Wait behavior. llvm-svn: 228738	2015-02-10 20:13:52 +00:00
Andrew Kaylor	fff974fc6d	Adding support for llvm.eh.begincatch and llvm.eh.endcatch intrinsics and beginning the documentation of native Windows exception handling. Differential Revision: http://reviews.llvm.org/D7398 llvm-svn: 228733	2015-02-10 19:52:43 +00:00
Duncan P. N. Exon Smith	73d123e7bc	IR: Add MDNode::replaceWithPermanent() Add new API for converting temporaries that may self-reference. Self-referencing nodes are not allowed to be uniqued, so sending them into `replaceWithUniqued()` is dangerous (and this commit adds assertions that prevent it). `replaceWithPermanent()` has similar semantics to `get()` followed by calls to `replaceOperandWith()`. In particular, if there's a self-reference, it returns a distinct node; otherwise, it returns a uniqued one. Like `replaceWithUniqued()` and `replaceWithDistinct()` (well, it calls out to them) it mutates the temporary node in place if possible, only calling `replaceAllUsesWith()` on a uniquing collision. llvm-svn: 228726	2015-02-10 19:13:46 +00:00
Paul Robinson	b0fca412c4	Explicitly initialize a flag in a default constructor. Works around a Visual C++ issue. Patch by Douglas Yung! llvm-svn: 228699	2015-02-10 15:30:02 +00:00
Aaron Ballman	a7a249093b	Re-committing r228628 with a fix for 64-bit builds. On Windows, we now use RaiseException to generate the kind of trap we require (one which calls our vectored exception handler), and fall back to using a volatile write to simulate a trap elsewhere. llvm-svn: 228691	2015-02-10 14:28:11 +00:00
Lang Hames	27d2677ad7	[Orc] Fix a bug in the LazyEmittingLayer - capture names by value (as std::strings) rather than StringRefs in JITSymbol get-address lambda. Capturing a StringRef by-value is still effectively capturing a reference, which is no good here because the referenced string may be gone by the time the lambda is being evaluated the original value may be gone. Make sure to capture a std::string instead. No test case: This bug doesn't manifest under OrcMCJITReplacement, since it keeps IR modules (from which the StringRefs are sourced) alive permanently. llvm-svn: 228676	2015-02-10 07:35:39 +00:00
Lang Hames	132f37f8e5	[Orc] Add missing casserts header to JITSymbol.h. llvm-svn: 228675	2015-02-10 07:26:19 +00:00
Zachary Turner	0d4fc2d795	Define HAVE_DIA_SDK on Windows when DIA is present. This allows all CMake projects, as well as C++ code, to detect if and when DIA SDK is available for use so that we can enable the DIA-based PDB reader implementation. Differential Revision: http://reviews.llvm.org/D7457 Reviewed By: Chandler Carruth llvm-svn: 228669	2015-02-10 05:04:25 +00:00
Duncan P. N. Exon Smith	61535117b4	IR: Remove unnecessary fields from MDTemplateParameter I noticed this fields were never used in r228607, but I neglected to propagate that into `MDTemplateParameter` until now. This really should have been done before commit in r228640; sorry for the churn. llvm-svn: 228652	2015-02-10 01:59:57 +00:00
Duncan P. N. Exon Smith	6c66615baf	IR: Add accessors to MDExpression Add some accessors to `MDExpression`. llvm-svn: 228648	2015-02-10 01:36:46 +00:00
Duncan P. N. Exon Smith	9fcf9cd379	AsmParser: Add stubs for specialized MDNodes, NFC Well, the exact error from the failed parse will change, but... llvm-svn: 228644	2015-02-10 01:08:16 +00:00
Duncan P. N. Exon Smith	1c43bf9cfb	IR: Add specialized debug info metadata nodes Add specialized debug info metadata nodes that match the `DIDescriptor` wrappers (used by `DIBuilder`) closely. Assembly and bitcode support to follow soon (it'll mostly just be obvious), but this sketches in today's schema. This is the first big commit (well, the only big one aside from the testcase changes that'll come when I move this into place) for PR22464. I've marked a bunch of obvious changes as `TODO`s in the source; I plan to make those changes promptly after this hierarchy is moved underneath `DIDescriptor`, but for now I'm aiming mostly to match the status quo. llvm-svn: 228640	2015-02-10 00:52:32 +00:00
Lang Hames	2359267cca	[Orc] Back out one of the GCC ICE workarounds from r228568. NFC. llvm-svn: 228637	2015-02-10 00:37:26 +00:00
Aaron Ballman	da25678483	Reverting r228628; it broke at least one builder due to the forward declare of RaiseException. llvm-svn: 228633	2015-02-10 00:00:54 +00:00
Adrian Prantl	f10ec50249	Debug info: Use DW_OP_bit_piece instead of DW_OP_piece in the intermediate representation. This - increases consistency by using the same granularity everywhere - allows for pieces < 1 byte - DW_OP_piece didn't actually allow storing an offset. Part of PR22495. llvm-svn: 228631	2015-02-09 23:57:15 +00:00
Duncan P. N. Exon Smith	8456fa3c41	ADT: Allow up to 18 arguments in hash_combine() I just realized that the specialized metadata node patch I'm about to commit won't compile on old compilers. Bump `hash_combine()`'s support for non-variadic templates to 18 (I tested this by reversing the logic in the #ifdef). llvm-svn: 228629	2015-02-09 23:21:05 +00:00
Aaron Ballman	230da450ab	On Windows, we now use RaiseException to generate the kind of trap we require (one which calls our vectored exception handler), and fall back to using a volatile write to simulate a trap elsewhere. llvm-svn: 228628	2015-02-09 23:11:39 +00:00
Ramkumar Ramachandra	545e586a0e	[Statepoint] Improve two asserts, fix some style (NFC) Summary: It's important that our users immediately know what gc.safepoint_poll is. Also fix the style of the declaration of CreateGCStatepoint, in preparation for another change that will wrap it. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7517 llvm-svn: 228626	2015-02-09 23:02:10 +00:00
Duncan P. N. Exon Smith	ec76782680	IR: Take uint64_t in DIBuilder::createExpression() `DIExpression` deals with `uint64_t`, so it doesn't make sense that `createExpression()` is created from `int64_t`. Switch to `uint64_t` to unify them. I've temporarily left in the `int64_t` version, which forwards to the `uint64_t` version. I'll delete it once I've updated the callers. llvm-svn: 228619	2015-02-09 22:13:27 +00:00
Duncan P. N. Exon Smith	03fb95aca6	IR: Document horrible abuse of loose DIDescriptor, NFC I'll circle back and fix this somehow; for now I just don't want to forget about it. llvm-svn: 228608	2015-02-09 21:26:34 +00:00
Duncan P. N. Exon Smith	c4219c892a	IR: Remove dead code in DITemplate* These are never referenced or filled in. llvm-svn: 228607	2015-02-09 21:23:34 +00:00
Ben Langmuir	615dcdb7b4	Reduce the LockFileManager timeout, and provide unsafeRemoveLockFile 5 minutes is an eternity, so try to strike a better balance between waiting long enough for any reasonable module build and not so long that users kill the process because they think it's hanging. Also give the client a way to delete the lock file after a timeout. llvm-svn: 228603	2015-02-09 20:34:24 +00:00
Sanjoy Das	db6aec61b3	Address post-commit review for rL228587: make it explicit that the <NW> bit of a SCEVAddRecExpr does not depend on the sign of the step and the start value of the step. llvm-svn: 228595	2015-02-09 19:39:00 +00:00
Sanjoy Das	2602c601e8	Clarify the wording on what it means for a SCEVAddRecExpr to be <NW>. llvm-svn: 228587	2015-02-09 18:44:42 +00:00
Lang Hames	a19eaab874	[Orc] Revert r228567 (GCC ICE workaround) - it doesn't seem to have helped. As far as I can tell r228568 was the right workaround, and r228567 was unnecessary. If reverting this causes problems on the bots I'll reinstate it. llvm-svn: 228585	2015-02-09 18:16:43 +00:00
Lang Hames	f1ea70a68b	[Orc] Try another workaround for the GCC 4.7.2 ICE introduced in r228557. NFC. llvm-svn: 228568	2015-02-09 07:47:32 +00:00
Lang Hames	6983fc8f43	[Orc] Tweak lambda capture lists to try to avoid an ICE on gcc-4.7.2. NFC. Apparently gcc-4.7.2 is touchy about 'this' appearing in a lambda capture list along with other captures. I've rewritten my captures to try to avoid the issue. llvm-svn: 228567	2015-02-09 07:22:56 +00:00
Lang Hames	79fa1f9f13	[Orc] Fix the MSVC bots by using LLVM_EXPLICIT rather than explicit. llvm-svn: 228564	2015-02-09 04:46:41 +00:00
Lang Hames	92f9dd24ac	[Orc] Add a JITSymbol class to the Orc APIs, refactor APIs, update clients. This patch refactors a key piece of the Orc APIs: It removes the ::getSymbolAddress and ::lookupSymbolAddressIn methods, which returned target addresses (uint64_ts), and replaces them with ::findSymbol and ::findSymbolIn respectively, which return instances of the new JITSymbol type. Unlike the old methods, calling findSymbol or findSymbolIn does not cause the symbol to be immediately materialized when found. Instead, the symbol will be materialized if/when the getAddress method is called on the returned JITSymbol. This allows us to query for the existence of symbols without actually materializing them. In the future I expect more information to be attached to the JITSymbol class, for example whether the returned symbol is a weak or strong definition. This will allow us to properly handle weak symbols and multiple definitions. llvm-svn: 228557	2015-02-09 01:20:51 +00:00
Zachary Turner	71e3bf6e80	Make PDBSymbol's IPDBSymbol reference const. llvm-svn: 228553	2015-02-08 22:53:53 +00:00
Zachary Turner	754851ad80	DebugInfoPDB: Make the symbol base case hold an IPDBSession ref. Dumping a symbol often requires access to data that isn't inside the symbol hierarchy, but which is only accessible through the top-level session. This patch is a pure interface change to give symbols a reference to the session. llvm-svn: 228542	2015-02-08 20:58:09 +00:00
Bjorn Steinbrink	a6a56743c3	Correctly combine alias.scope metadata by a union instead of intersecting Summary: The alias.scope metadata represents sets of things an instruction might alias with. When generically combining the metadata from two instructions the result must be the union of the original sets, because the new instruction might alias with anything any of the original instructions aliased with. Reviewers: hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7490 llvm-svn: 228525	2015-02-08 17:07:14 +00:00
Elena Demikhovsky	40c204cf7d	Masked Gather and Scatter Intrinsics. Gather and Scatter are new introduced intrinsics, comming after recently implemented masked load and store. This is the first patch for Gather and Scatter intrinsics. It includes only the syntax, parsing and verification. Gather and Scatter intrinsics allow to perform multiple memory accesses (read/write) in one vector instruction. The intrinsics are not target specific and will have the following syntax: Gather: declare <16 x i32> @llvm.masked.gather.v16i32(<16 x i32> <vector of ptrs>, i32 <alignment>, <16 x i1> <mask>, <16 x i32> <passthru>) declare <8 x float> @llvm.masked.gather.v8f32(<8 x float><vector of ptrs>, i32 <alignment>, <8 x i1> <mask>, <8 x float><passthru>) Scatter: declare void @llvm.masked.scatter.v8i32(<8 x i32><vector value to be stored> , <8 x i32><vector of ptrs> , i32 <alignment>, <8 x i1> <mask>) declare void @llvm.masked.scatter.v16i32(<16 x i32> <vector value to be stored> , <16 x i32> <vector of ptrs>, i32 <alignment>, <16 x i1><mask> ) Vector of ptrs - a set of source/destination addresses, to load/store the value. Mask - switches on/off vector lanes to prevent memory access for switched-off lanes vector of ptrs, value and mask should have the same vector width. These are code examples where gather / scatter should be used and will allow function vectorization ;void foo1(int * restrict A, int * restrict B, int * restrict C) { ; for (int i=0; i<SIZE; i++) { ; A[i] = B[C[i]]; ; } ;} ;void foo3(int * restrict A, int * restrict B) { ; for (int i=0; i<SIZE; i++) { ; A[B[i]] = i+5; ; } ;} Tests will come in the following patches, with CodeGen and Vectorizer. http://reviews.llvm.org/D7433 llvm-svn: 228521	2015-02-08 08:27:19 +00:00
Zachary Turner	ca172bdb48	Some cleanup for libpdb. This patch implements a few of the optional suggestions from the initial patch comitting libpdb. In particular, it implements a virtual function out of line for each of the concrete classes. A few other minor cleanups exist as well, such as using override instead of virtual, etc. llvm-svn: 228516	2015-02-08 00:29:29 +00:00
Benjamin Kramer	c705a27ee2	SCEV: Compress disposition pairs. Composing DenseMaps and SmallVectors is still somewhat suboptimal, but this at least halves the size of the vector elements. NFC. llvm-svn: 228497	2015-02-07 16:41:12 +00:00
Benjamin Kramer	28fba477c6	SmallVector: Move emplace_back to SmallVectorImpl. This resolves the strange effect that emplace_back is only available when the type contained in the vector is not trivially copyable. llvm-svn: 228496	2015-02-07 16:41:02 +00:00
Benjamin Kramer	a3a195bd27	Move DebugLocs around instead of copying. llvm-svn: 228491	2015-02-07 12:28:15 +00:00
Bruce Mitchener	efa6e79a0c	Add more DWARF 5 language constants. Differential Revision: http://reviews.llvm.org/D7430 llvm-svn: 228487	2015-02-07 06:35:30 +00:00
Zachary Turner	58559798ed	Change RHS-style decltype to LHS-style decltype<declval()>. Seems some compilers don't like the RHS-style decltype specifier. This should fix the buildbots. llvm-svn: 228484	2015-02-07 02:02:23 +00:00
Duncan P. N. Exon Smith	408c33af4a	Support: Add dwarf::getVirtuality() llvm-svn: 228474	2015-02-07 00:37:15 +00:00
Duncan P. N. Exon Smith	165b84bee4	Support: Use Dwarf.def for DW_VIRTUALITY, NFC Use definition file for `DW_VIRTUALITY_*`. Add a `DW_VIRTUALITY_max` both for ease of testing and for future use by the `LLParser`. llvm-svn: 228473	2015-02-07 00:36:23 +00:00
Duncan P. N. Exon Smith	1cb5884cd9	Support: Add dwarf::getAttributeEncoding() llvm-svn: 228470	2015-02-06 23:46:49 +00:00
Duncan P. N. Exon Smith	ee716fc78e	Support: Rewrite AttributeEncodingString(), NFC llvm-svn: 228469	2015-02-06 23:45:37 +00:00
Kevin Enderby	03d099fa2a	Add code to llvm-objdump so the -section option with -macho will dump literal sections with the Mach-O S_{4,8,16}BYTE_LITERALS section types. llvm-svn: 228465	2015-02-06 23:25:38 +00:00
Duncan P. N. Exon Smith	b2bb076de9	Support: Add dwarf::getLanguage() llvm-svn: 228458	2015-02-06 22:55:13 +00:00
Duncan P. N. Exon Smith	b69d0ef5cb	Support: Rewrite dwarf::LanguageString(), NFC llvm-svn: 228457	2015-02-06 22:53:19 +00:00
Lang Hames	1179d700be	[Orc] Add more missing headers. llvm-svn: 228454	2015-02-06 22:48:43 +00:00
Zachary Turner	743a32b50f	Resubmit "Create lib/DebugInfo/PDB" (r228428) This change resubmits the patch that broke the build, this time without unittests. The unittests will be submitted separately after the problem has been addressed: --Original Commit Message-- Create lib/DebugInfo/PDB. This patch creates a platform-independent interface to a PDB reader. There is currently no implementation of this interface, which will be provided in future patches. This defines the basic object model which any implementation must conform to. Reviewed by: David Blaikie Differential Revision: http://reviews.llvm.org/D7356 llvm-svn: 228435	2015-02-06 20:30:52 +00:00
Michael Zolotukhin	9630715912	Use estimated number of optimized insns in unroll-threshold computation. If complete-unroll could help us to optimize away N% of instructions, we might want to do this even if the final size would exceed loop-unroll threshold. However, we don't want to unroll huge loop, and we are add AbsoluteThreshold to avoid that - this threshold will never be crossed, even if we expect to optimize 99% instructions after that. llvm-svn: 228434	2015-02-06 20:20:40 +00:00
Michael Zolotukhin	bbf2ac3d22	[InstSimplify] Add SimplifyFPBinOp function. It is a variation of SimplifyBinOp, but it takes into account FastMathFlags. It is needed in inliner and loop-unroller to accurately predict the transformation's outcome (previously we dropped the flags and were too conservative in some cases). Example: float foo(float a, float b) { float r; if (a[1] b) r = /* a lot of expensive computations /; else r = 1; return r; } float boo(float a) { return foo(a, 0.0); } Without this patch, we don't inline 'foo' into 'boo'. llvm-svn: 228432	2015-02-06 20:02:51 +00:00
Zachary Turner	434f023380	Revert "Create lib/DebugInfo/PDB." This reverts commit 21028, as it is causing failures in LLVMConfig. llvm-svn: 228431	2015-02-06 20:00:18 +00:00
Zachary Turner	bfc7e60f16	Create lib/DebugInfo/PDB. This patch creates a platform-independent interface to a PDB reader. There is currently no implementation of this interface, which will be provided in future patches. This defines the basic object model which any implementation must conform to. Reviewed by: David Blaikie Differential Revision: http://reviews.llvm.org/D7356 llvm-svn: 228428	2015-02-06 19:44:09 +00:00
Lang Hames	af4f511096	[Orc] Add some missing headers. llvm-svn: 228426	2015-02-06 19:34:40 +00:00
Lang Hames	fcb5b36695	[Orc] Fix syntax error in LazyEmittingLayer::removeModuleSet. This was a trivial think-o, but it's in a method of a templated class and doesn't have any callers yet, so the compiler let it pass. I hope to add a unit test to cover this soon. llvm-svn: 228425	2015-02-06 19:34:04 +00:00
Quentin Colombet	77dfe32eb3	[LiveIntervalAnalysis] Speed up creation of live ranges for physical registers by using a segment set. The patch addresses a compile-time performance regression in the LiveIntervals analysis pass (see http://llvm.org/bugs/show_bug.cgi?id=18580). This regression is especially critical when compiling long functions. Our analysis had shown that the most of time is taken for generation of live intervals for physical registers. Insertions in the middle of the array of live ranges cause quadratic algorithmic complexity, which is apparently the main reason for the slow-down. Overview of changes: - The patch introduces an additional std::set<Segment>* member in LiveRange for storing segments in the phase of initial creation. The set is used if this member is not NULL, otherwise everything works the old way. - The set of operations on LiveRange used during initial creation (i.e. used by createDeadDefs and extendToUses) have been reimplemented to use the segment set if it is available. - After a live range is created the contents of the set are flushed to the segment vector, because the set is not as efficient as the vector for the later uses of the live range. After the flushing, the set is deleted and cannot be used again. - The set is only for live ranges computed in LiveIntervalAnalysis::computeLiveInRegUnits() and getRegUnit() but not in computeVirtRegs(), because I did not bring any performance benefits to computeVirtRegs() and for some examples even brought a slow down. Patch by Vaidas Gasiunas <vaidas.gasiunas@sap.com> Differential Revision: http://reviews.llvm.org/D6013 llvm-svn: 228421	2015-02-06 18:42:41 +00:00
Adam Nemet	2dda12d192	[LV] Move addRuntimeCheck to LoopAccessAnalysis This will allow it to be shared with the new Loop Distribution pass. getFirstInst is currently duplicated across LoopVectorize.cpp and LoopAccessAnalysis.cpp. This is a short-term work-around until we figure out a better solution. NFC. (The code moved is adjusted a bit for the name of the Loop member and that PtrRtCheck is now a reference rather than a pointer.) llvm-svn: 228418	2015-02-06 18:31:04 +00:00
Matthias Braun	696b7644dd	LiveInterval: Fix SubRange memory leak. llvm-svn: 228405	2015-02-06 17:28:47 +00:00
Benjamin Kramer	c44a1f1f54	Value: Remove superfluous typedefs and deprecated method. NFC. llvm-svn: 228400	2015-02-06 14:44:02 +00:00
Ramkumar Ramachandra	39bc517234	Introduce print-memderefs to test isDereferenceablePointer Since testing the function indirectly is tricky, introduce a direct print-memderefs pass, in the same spirit as print-memdeps, which prints dereferenceability information matched by FileCheck. Differential Revision: http://reviews.llvm.org/D7075 llvm-svn: 228369	2015-02-06 01:46:42 +00:00
Ahmed Bougacha	fccf28b772	[CodeGen] Add hook/combine to form vector extloads, enabled on X86. The combine that forms extloads used to be disabled on vector types, because "None of the supported targets knows how to perform load and sign extend on vectors in one instruction." That's not entirely true, since at least SSE4.1 X86 knows how to do those sextloads/zextloads (with PMOVS/ZX). But there are several aspects to getting this right. First, vector extloads are controlled by a profitability callback. For instance, on ARM, several instructions have folded extload forms, so it's not always beneficial to create an extload node (and trying to match extloads is a whole 'nother can of worms). The interesting optimization enables folding of s/zextloads to illegal (splittable) vector types, expanding them into smaller legal extloads. It's not ideal (it introduces some legalization-like behavior in the combine) but it's better than the obvious alternative: form illegal extloads, and later try to split them up. If you do that, you might generate extloads that can't be split up, but have a valid ext+load expansion. At vector-op legalization time, it's too late to generate this kind of code, so you end up forced to scalarize. It's better to just avoid creating egregiously illegal nodes. This optimization is enabled unconditionally on X86. Note that the splitting combine is happy with "custom" extloads. As is, this bypasses the actual custom lowering, and just unrolls the extload. But from what I've seen, this is still much better than the current custom lowering, which does some kind of unrolling at the end anyway (see for instance load_sext_4i8_to_4i64 on SSE2, and the added FIXME). Also note that the existing combine that forms extloads is now also enabled on legal vectors. This doesn't have a big effect on X86 (because sext+load is usually combined to sext_inreg+aextload). On ARM it fires on some rare occasions; that's for a separate commit. Differential Revision: http://reviews.llvm.org/D6904 llvm-svn: 228325	2015-02-05 18:31:02 +00:00
Ahmed Bougacha	2687e714ca	[CodeGen] Add isLoadExtLegalOrCustom helper to TargetLowering. llvm-svn: 228322	2015-02-05 18:15:59 +00:00
Michael Kuperstein	8ed19d08b2	Teach isDereferenceablePointer() to look through bitcast constant expressions. This fixes a LICM regression due to the new load+store pair canonicalization. Differential Revision: http://reviews.llvm.org/D7411 llvm-svn: 228284	2015-02-05 09:15:37 +00:00
Matt Arsenault	1c62631bbb	Add addrspacecast node to tablegen The node is still defined oddly so that the address spaces are not operands and not accessible from tablegen, but as-is this can now be used to write a ComplexPattern with an addrspacecast root node. llvm-svn: 228270	2015-02-05 03:35:34 +00:00
Matt Arsenault	e6e242235b	Add support for double / float to EndianStream Also add new unit tests for endian::Writer llvm-svn: 228269	2015-02-05 03:30:08 +00:00
Cameron Esfahani	a75b0eb54b	Value soft float calls as more expensive in the inliner. Summary: When evaluating floating point instructions in the inliner, ask the TTI whether it is an expensive operation. By default, it's not an expensive operation. This keeps the default behavior the same as before. The ARM TTI has been updated to return back TCC_Expensive for targets which don't have hardware floating point. Reviewers: chandlerc, echristo Reviewed By: echristo Subscribers: t.p.northover, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D6936 llvm-svn: 228263	2015-02-05 02:09:33 +00:00
Duncan P. N. Exon Smith	fe2e84e098	IR: Split out getOperandAs(), NFC llvm-svn: 228250	2015-02-05 01:07:47 +00:00
Sean Silva	9d2bea5968	[MC] Remove various unused MCAsmInfo parameters. llvm-svn: 228244	2015-02-05 00:58:51 +00:00
Duncan P. N. Exon Smith	7f33c007a0	ADT: Add int64_t interoperability to APSInt Add some API to `APSInt` to make it easier to compare with `int64_t`. - `APSInt::compareValues(APSInt, APSInt)` returns 1, -1 or 0 for greater, lesser, or equal, doing the right thing for mismatched "has-sign" and bitwidths. This is just like `isSameValue()` (and is now the implementation of it). - `APSInt::get(int64_t)` gets a signed `APSInt`. - `operator<(int64_t)`, etc., are implemented trivially via `get()` and `compareValues()`. - Also added `APSInt::getUnsigned(uint64_t)` to make it easier to test `compareValues()`. llvm-svn: 228239	2015-02-05 00:17:43 +00:00
Reid Kleckner	8127f1fbb0	Remove useless call to isOSCygMing() This used to do something when we modeled the Cygwin and MinGW environments as distinct OSs, but now it is not needed. llvm-svn: 228229	2015-02-04 23:17:19 +00:00
Matthias Braun	84aaa1dd81	MachineCSE: Clear dead-def flag on CSE. In case CSE reuses a previoulsy unused register the dead-def flag has to be cleared on the def operand, as exposed by the arm64-cse.ll test. This fixes PR22439 and the corresponding rdar://19694987 Differential Revision: http://reviews.llvm.org/D7395 llvm-svn: 228178	2015-02-04 19:35:16 +00:00
Reid Kleckner	6182b47caf	Add range adapters predecessors() and successors() for BBs Use them in two isolated transforms so we know they work and aren't dead code. llvm-svn: 228173	2015-02-04 19:14:57 +00:00
Juergen Ributzka	46ea5d06b7	Add missing include. llvm-svn: 228161	2015-02-04 18:16:53 +00:00
Alexey Samsonov	f9eb672e1c	SpecialCaseList: Add support for parsing multiple input files. Summary: This change allows users to create SpecialCaseList objects from multiple local files. This is needed to implement a proper support for -fsanitize-blacklist flag (allow users to specify multiple blacklists, in addition to default blacklist, see PR22431). DFSan can also benefit from this change, as DFSan instrumentation pass now accepts ABI-lists both from -fsanitize-blacklist= and -mllvm -dfsan-abilist flags. Go bindings are fixed accordingly. Test Plan: regression test suite Reviewers: pcc Subscribers: llvm-commits, axw, kcc Differential Revision: http://reviews.llvm.org/D7367 llvm-svn: 228155	2015-02-04 17:39:48 +00:00
Rafael Espindola	8b442bd3a1	Fix warning: "function declaration isn’t a prototype" llvm-svn: 228139	2015-02-04 13:30:28 +00:00
Philip Reames	bea8f6fd03	Add a pass for inserting safepoints into (nearly) arbitrary IR This pass is responsible for figuring out where to place call safepoints and safepoint polls. It doesn't actually make the relocations explicit; that's the job of the RewriteStatepointsForGC pass (http://reviews.llvm.org/D6975). Note that this code is not yet finalized. Its moving in tree for incremental development, but further cleanup is needed and will happen over the next few days. It is not yet part of the standard pass order. Planned changes in the near future: - I plan on restructuring the statepoint rewrite to use the functions add to the IRBuilder a while back. - In the current pass, the function "gc.safepoint_poll" is treated specially but is not an intrinsic. I plan to make identifying the poll function a property of the GCStrategy at some point in the near future. - As follow on patches, I will be separating a collection of test cases we have out of tree and submitting them upstream. - It's not explicit in the code, but these two patches are introducing a new state for a statepoint which looks a lot like a patchpoint. There's no a transient form which doesn't yet have the relocations explicitly represented, but does prevent reordering of memory operations. Once this is in, I need to update actually make this explicit by reserving the 'unused' argument of the statepoint as a flag, updating the docs, and making the code explicitly check for such a thing. This wasn't really planned, but once I split the two passes - which was done for other reasons - the intermediate state fell out. Just reminds us once again that we need to merge statepoints and patchpoints at some point in the not that distant future. Future directions planned: - Identifying more cases where a backedge safepoint isn't required to ensure timely execution of a safepoint poll. - Tweaking the insertion process to generate easier to optimize IR. (For example, investigating making SplitBackedge) the default. - Adding opt-in flags for a GCStrategy to use this pass. Once done, add this pass to the actual pass ordering. Differential Revision: http://reviews.llvm.org/D6981 llvm-svn: 228090	2015-02-04 00:37:33 +00:00
Justin Bogner	620c405abf	InstrProf: Make CounterMappingRegions less confusing to construct Creating empty and expansion regions is awkward with the current API. Expose static methods to make this simpler. llvm-svn: 228075	2015-02-03 23:59:33 +00:00
Arnaud A. de Grandmaison	775136711f	[PBQP] Provide more information in the debug prints Based on a patch by Jonas Paulsson llvm-svn: 228068	2015-02-03 23:40:24 +00:00
Arnaud A. de Grandmaison	352dc10d81	[PBQP] Constify Graph::getEdgeNode1Id and Graph::getEdgeNode2Id llvm-svn: 228048	2015-02-03 22:02:45 +00:00
Duncan P. N. Exon Smith	55694c075d	IR: Assembly and bitcode for GenericDebugNode llvm-svn: 228041	2015-02-03 21:54:14 +00:00
Justin Bogner	84c1a035e8	InstrProf: Remove CoverageMapping::HasCodeBefore, it isn't used It's not entirely clear to me what this field was meant for, but it's always false. Remove it. llvm-svn: 228034	2015-02-03 21:35:36 +00:00
Duncan P. N. Exon Smith	cd5f9211e8	Support: Add string => unsigned mapping for DW_TAG Add `dwarf::getTag()` to translate from `StringRef` to `unsigned`. llvm-svn: 228031	2015-02-03 21:16:49 +00:00
Duncan P. N. Exon Smith	81de9dc80e	Support: Re-implement dwarf::TagString() using a .def file, NFC Also re-implements the `dwarf::Tag` enumerator. I've moved the mock tags into the enumerator since there's no other way to do this. Really they shouldn't be used at all (they're just a hack to identify `MDNode`s, but we have a class hierarchy for that now). llvm-svn: 228030	2015-02-03 21:13:16 +00:00
Colin LeMahieu	3534179cf0	[Hexagon] Converting XTYPE/SHIFT intrinsics. Cleaning out old intrinsic patterns and updating tests. llvm-svn: 228026	2015-02-03 20:40:52 +00:00
Jingyue Wu	4e99b65428	Add straight-line strength reduction to LLVM Summary: Straight-line strength reduction (SLSR) is implemented in GCC but not yet in LLVM. It has proven to effectively simplify statements derived from an unrolled loop, and can potentially benefit many other cases too. For example, LLVM unrolls #pragma unroll foo (int i = 0; i < 3; ++i) { sum += foo((b + i) * s); } into sum += foo(b * s); sum += foo((b + 1) * s); sum += foo((b + 2) * s); However, no optimizations yet reduce the internal redundancy of the three expressions: b * s (b + 1) * s (b + 2) * s With SLSR, LLVM can optimize these three expressions into: t1 = b * s t2 = t1 + s t3 = t2 + s This commit is only an initial step towards implementing a series of such optimizations. I will implement more (see TODO in the file commentary) in the near future. This optimization is enabled for the NVPTX backend for now. However, I am more than happy to push it to the standard optimization pipeline after more thorough performance tests. Test Plan: test/StraightLineStrengthReduce/slsr.ll Reviewers: eliben, HaoLiu, meheff, hfinkel, jholewinski, atrick Reviewed By: jholewinski, atrick Subscribers: karthikthecool, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7310 llvm-svn: 228016	2015-02-03 19:37:06 +00:00
Rafael Espindola	b0cffcc2a8	Fix duplicated symbol error. llvm-svn: 228012	2015-02-03 19:25:53 +00:00
Manman Ren	ecc02c6b0a	[LTO API] split lto_codegen_compile to lto_codegen_optimize and lto_codegen_compile_optimized. Also add lto_api_version. Before this commit, we can only dump the optimized bitcode after running lto_codegen_compile, but it includes some impacts of running codegen passes, one example is StackProtector pass. We will get assertion failure when running llc on the optimized bitcode, because StackProtector is effectively run twice. After splitting lto_codegen_compile, the linker can choose to dump the bitcode before running lto_codegen_compile_optimized. lto_api_version is added so ld64 can check for runtime-availability of the new API. rdar://19565500 llvm-svn: 228000	2015-02-03 18:39:15 +00:00
Adam Nemet	e6e4bf975c	[LoopVectorize] Fix rebase glitch in r227751 LoopVectorizationLegality::{getNumLoads,getNumStores} should forward to LoopAccessAnalysis now. Thanks to Takumi for noticing this! llvm-svn: 227992	2015-02-03 17:59:53 +00:00
Eric Christopher	cc62f1ae1b	Only access TLOF via the TargetMachine, not TargetLowering. llvm-svn: 227949	2015-02-03 07:22:52 +00:00
Lang Hames	624168574b	[PBQP Regalloc] Pre-spill vregs that have no legal physregs. The PBQP::RegAlloc::MatrixMetadata class assumes that matrices have at least two rows/columns (for the spill option plus at least one physreg). This patch ensures that that invariant is met by pre-spilling vregs that have no physreg options so that no node (and no corresponding edges) need be added to the PBQP graph. This fixes a bug in an out-of-tree target that was identified by Jonas Paulsson. Thanks for tracking this down Jonas! llvm-svn: 227942	2015-02-03 06:14:06 +00:00
Justin Bogner	67c8b5392c	InstrProf: Simplify RawCoverageMappingReader's API slightly This is still kind of a weird API, but dropping the (partial) update of the passed in CoverageMappingRecord makes it a little easier to understand and use. llvm-svn: 227900	2015-02-03 00:20:11 +00:00
Jingyue Wu	34a8e5e1ea	Resurrect the assertion removed by r227717 Summary: MSVC can compile "LoopID->getOperand(0) == LoopID" when LoopID is MDNode*. Test Plan: no regression Reviewers: mkuper Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7327 llvm-svn: 227853	2015-02-02 20:41:11 +00:00
Duncan P. N. Exon Smith	267f14474d	IR: Allow GenericDebugNode construction from MDString Allow `GenericDebugNode` construction directly from `MDString`, rather than requiring `StringRef`s. I've refactored the `StringRef` constructors to use these. There's no real functionality change here, except for exposing the lower-level API. The purpose of this is to simplify construction of string operands when reading bitcode. It's unnecessarily indirect to parse an `MDString` ID, lookup the `MDString` in the bitcode reader list, get the `StringRef` out of that, and then have `GenericDebugNode::getImpl()` use `MDString::get()` to acquire the original `MDString`. Instead, this allows the bitcode reader to directly pass in the `MDString`. llvm-svn: 227848	2015-02-02 20:01:03 +00:00
Duncan P. N. Exon Smith	4505bfa490	IR: Extract DEFINE_MDNODE_GET(), NFC llvm-svn: 227847	2015-02-02 19:55:21 +00:00
Duncan P. N. Exon Smith	25ffa9ed9c	IR: Separate helpers for string operands, NFC llvm-svn: 227846	2015-02-02 19:54:05 +00:00
Duncan P. N. Exon Smith	439bf9404e	IR: Split out DebugInfoMetadata.h, NFC Move debug-info-centred `Metadata` subclasses into their own header/source file. A couple of private template functions are needed from both `Metadata.cpp` and `DebugInfoMetadata.cpp`, so I've moved them to `lib/IR/MetadataImpl.h`. llvm-svn: 227835	2015-02-02 18:53:21 +00:00
David Blaikie	87c973c9d7	STLExtras: Provide less/equal functors with templated function call operators, plus a deref'ing functor template utility Similar to the C++14 void specializations of these templates, useful as a stop-gap until LLVM switches to '14. Example use-cases in tblgen because I saw some functors that looked like they could be simplified/refactored. Reviewers: dexonsmith Differential Revision: http://reviews.llvm.org/D7324 llvm-svn: 227828	2015-02-02 18:35:10 +00:00
Duncan P. N. Exon Smith	1f7b5ff9bc	Fix some file headers, NFC llvm-svn: 227826	2015-02-02 18:20:15 +00:00
Eric Christopher	2aab2ce529	Remove unnecessary forward declaration. llvm-svn: 227813	2015-02-02 17:38:40 +00:00
Lang Hames	dcc8377028	[Orc] Make the ObjectLinkingLayer take ownership of object files until finalization time. As currently implemented, RuntimeDyldELF requires the original object file to be avaible when relocations are being resolved. This patch ensures that the ObjectLinkingLayer preserves it until then. In the future RuntimeDyldELF should be rewritten to remove this requirement, at which point this patch can be reverted. Regression test cases for Orc (which include coverage of this bug) will be committed shortly. llvm-svn: 227778	2015-02-02 04:32:17 +00:00
Lang Hames	8275580ce1	[Orc] Add sensible defaults for the ObjectLinkingLayer constructor. llvm-svn: 227776	2015-02-02 01:03:10 +00:00
Benjamin Kramer	18cda2e8dc	FoldingSetVectorIterator is just a subset of pointee_iterator, remove it. llvm-svn: 227761	2015-02-01 19:26:05 +00:00
Adam Nemet	83140dfa69	Include cstddef in EquivalenceClasses.h This is to try to appease bots complaining that ptrdiff_t is undefined in LoopAccessAnalysis.cpp. llvm-svn: 227757	2015-02-01 17:21:06 +00:00
Adam Nemet	2884269478	[LoopVectorize] Move LoopAccessAnalysis to its own module Other than moving code and adding the boilerplate for the new files, the code being moved is unchanged. There are a few global functions that are shared with the rest of the LoopVectorizer. I moved these to the new module as well (emitLoopAnalysis, stripIntegerCast, replaceSymbolicStrideSCEV) along with the Report class used by emitLoopAnalysis. There is probably room for further improvement in this area. I kept DEBUG_TYPE "loop-vectorize" because it's used as the PassName with emitOptimizationRemarkAnalysis. This will obviously have to change. NFC. This is part of the patchset that splits out the memory dependence logic from LoopVectorizationLegality into a new class LoopAccessAnalysis. LoopAccessAnalysis will be used by the new Loop Distribution pass. llvm-svn: 227756	2015-02-01 16:56:15 +00:00
Adam Nemet	287f8b34a3	[LoopVectorize] Make hasVectorInstrinsicScalarOpd inline VectorUtils.h needs to be included in LoopAccessAnalysis.cpp for getIntrinsicIDForCall but hasVectorInstrinsicScalarOpd is not used by this module. NFC. This is part of the patchset that splits out the memory dependence logic from LoopVectorizationLegality into a new class LoopAccessAnalysis. LoopAccessAnalysis will be used by the new Loop Distribution pass. llvm-svn: 227753	2015-02-01 16:56:05 +00:00
Michael Kuperstein	41ae9af2e3	[X86] Convert esp-relative movs of function arguments to pushes, step 2 This moves the transformation introduced in r223757 into a separate MI pass. This allows it to cover many more cases (not only cases where there must be a reserved call frame), and perform rudimentary call folding. It still doesn't have a heuristic, so it is enabled only for optsize/minsize, with stack alignment <= 8, where it ought to be a fairly clear win. (Re-commit of r227728) Differential Revision: http://reviews.llvm.org/D6789 llvm-svn: 227752	2015-02-01 16:56:04 +00:00
Michael Kuperstein	f73ce6a4c9	Revert r227728 due to bad line endings. llvm-svn: 227746	2015-02-01 16:15:07 +00:00
Chandler Carruth	fd3086476a	[multiversion] Kill FunctionTargetTransformInfo, TTI itself is now per-function and supports the exact desired interface. llvm-svn: 227743	2015-02-01 14:37:03 +00:00
Chandler Carruth	a2cd22e25f	[multiversion] Remove the function parameter from the unrolling preferences interface on TTI now that all of TTI is per-function. llvm-svn: 227741	2015-02-01 14:31:23 +00:00
Chandler Carruth	59453ca4a8	[multiversion] Switch the TTI queries from TargetMachine to Subtarget now that we have a correct and cached subtarget specific to the function. Also, finish providing a cached per-function subtarget in the core LLVMTargetMachine -- that layer hadn't switched over yet. The only use of the TargetMachine was to re-lookup a subtarget for a particular function to work around the fact that TTI was immutable. Now that it is per-function and we haved a cached subtarget, use it. This still leaves a few interfaces with real warts on them where we were passing Function objects through the TTI interface. I'll remove these and clean their usage up in subsequent commits now that this isn't necessary. llvm-svn: 227738	2015-02-01 14:22:17 +00:00
Chandler Carruth	6ea38a46d2	[multiversion] Remove the cached TargetMachine pointer from the intermediate TTI implementation template and instead query up to the derived class for both the TargetMachine and the TargetLowering. Most of the derived types had a TLI cached already and there is no need to store a less precisely typed target machine pointer. This will in turn make it much cleaner to look up the TLI via a per-function subtarget instead of the generic subtarget, and it will pave the way toward pulling the subtarget used for unroll preferences into the same form once we are always using the function to look up the correct subtarget. llvm-svn: 227737	2015-02-01 14:01:15 +00:00
Chandler Carruth	e33a4b8bd7	[multiversion] Remove another place we were "handling" nullptr even though it was never a reasonable input. llvm-svn: 227736	2015-02-01 13:21:04 +00:00
Chandler Carruth	3ed152b528	[multiversion] Switch all of the targets over to use the TargetIRAnalysis access path directly rather than implementing getTTI. This even removes getTTI from the interface. It's more efficient for each target to just register a precise callback that creates their specific TTI. As part of this, all of the targets which are building their subtargets individually per-function now build their TTI instance with the function and thus look up the correct subtarget and cache it. NVPTX, R600, and XCore currently don't leverage this functionality, but its trivial for them to add it now. llvm-svn: 227735	2015-02-01 13:20:00 +00:00
Chandler Carruth	c67d7f29c0	[multiversion] Remove a false freedom to leave the TargetMachine pointer null. For some reason some of the original TTI code supported a null target machine. This seems to have been legacy, and I made matters worse when refactoring this code by spreading that pattern further through the various targets. The TargetMachine can't actually be null, and it doesn't make sense to support that use case. I've now consistently removed it and removed all of the code trying to cope with that situation. This is probably good, as several targets didn't cope with it being null despite the null default argument in their constructors. =] llvm-svn: 227734	2015-02-01 12:38:24 +00:00
Chandler Carruth	46a63acccc	[multiversion] Implement the old pass manager's TTI wrapper pass in terms of the new pass manager's TargetIRAnalysis. Yep, this is one of the nicer bits of the new pass manager's design. Passes can in many cases operate in a vacuum and so we can just nest things when convenient. This is particularly convenient here as I can now consolidate all of the TargetMachine logic on this analysis. The most important change here is that this pushes the function we need TTI for all the way into the TargetMachine, and re-creates the TTI object for each function rather than re-using it for each function. We're now prepared to teach the targets to produce function-specific TTI objects with specific subtargets cached, etc. One piece of feedback I'd love here is whether its worth renaming any of this stuff. None of the names really seem that awesome to me at this point, but TargetTransformInfoWrapperPass is particularly ... odd. TargetIRAnalysisWrapper might make more sense. I would want to do that rename separately anyways, but let me know what you think. llvm-svn: 227731	2015-02-01 12:26:09 +00:00
Chandler Carruth	89da465927	[multiversion] Thread a function argument through all the callers of the getTTI method used to get an actual TTI object. No functionality changed. This just threads the argument and ensures code like the inliner can correctly look up the callee's TTI rather than using a fixed one. The next change will use this to implement per-function subtarget usage by TTI. The changes after that should eliminate the need for FTTI as that will have become the default. llvm-svn: 227730	2015-02-01 12:01:35 +00:00
Michael Kuperstein	2f448f269c	[X86] Convert esp-relative movs of function arguments to pushes, step 2 This moves the transformation introduced in r223757 into a separate MI pass. This allows it to cover many more cases (not only cases where there must be a reserved call frame), and perform rudimentary call folding. It still doesn't have a heuristic, so it is enabled only for optsize/minsize, with stack alignment <= 8, where it ought to be a fairly clear win. Differential Revision: http://reviews.llvm.org/D6789 llvm-svn: 227728	2015-02-01 11:44:44 +00:00
Chandler Carruth	75361818c7	[PM] Clean up a stale comment that came from a differnt pass when I created this header. llvm-svn: 227727	2015-02-01 11:35:56 +00:00
Chandler Carruth	e1550cbb3c	[PM] Port SimplifyCFG to the new pass manager. This should be sufficient to replace the initial (minor) function pass pipeline in Clang with the new pass manager. I'll probably add an (off by default) flag to do that just to ensure we can get extra testing. llvm-svn: 227726	2015-02-01 11:34:21 +00:00
Chandler Carruth	b4f6fbea29	[PM] Port EarlyCSE to the new pass manager. I've added RUN lines both to the basic test for EarlyCSE and the target-specific test, as this serves as a nice test that the TTI layer in the new pass manager is in fact working well. llvm-svn: 227725	2015-02-01 10:51:23 +00:00
Chandler Carruth	a44e21779b	[PM] Teach the module-to-function adaptor to not run function passes over declarations. This is both quite unproductive and causes things to crash, for example domtree would just assert. I've added a declaration and a domtree run to the basic high-level tests for the new pass manager. llvm-svn: 227724	2015-02-01 10:47:25 +00:00
Chandler Carruth	7424f96c51	[PM] Switch to a ranged based for loop. NFC llvm-svn: 227723	2015-02-01 10:40:21 +00:00
Chandler Carruth	4efb41707c	[PM] Port TTI to the new pass manager, introducing a TargetIRAnalysis to produce it. This adds a function to the TargetMachine that produces this analysis via a callback for each function. This in turn faves the way to produce a different TTI per-function with the correct subtarget cached. I've also done the necessary wiring in the opt tool to thread the target machine down and make it available to the pass registry so that we can construct this analysis from a target machine when available. llvm-svn: 227721	2015-02-01 10:11:22 +00:00
Jingyue Wu	da72eac553	[NVPTX] Emit .pragma "nounroll" for loops marked with nounroll Summary: CUDA driver can unroll loops when jit-compiling PTX. To prevent CUDA driver from unrolling a loop marked with llvm.loop.unroll.disable is not unrolled by CUDA driver, we need to emit .pragma "nounroll" at the header of that loop. This patch also extracts getting unroll metadata from loop ID metadata into a shared helper function. Test Plan: test/CodeGen/NVPTX/nounroll.ll Reviewers: eliben, meheff, jholewinski Reviewed By: jholewinski Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7041 llvm-svn: 227703	2015-02-01 02:27:45 +00:00
Chandler Carruth	0cdc876795	[PM] Remove a bunch of stale TTI creation method declarations. I nuked their definitions, but forgot to clean up all the declarations which are in different files. llvm-svn: 227698	2015-02-01 00:22:15 +00:00
Chandler Carruth	ad2d6dd7d3	[PM] Switch the TargetMachine interface from accepting a pass manager base which it adds a single analysis pass to, to instead return the type erased TargetTransformInfo object constructed for that TargetMachine. This removes all of the pass variants for TTI. There is now a single TTI pass in the Analysis layer. All of the Analysis <-> Target communication is through the TTI's type erased interface itself. While the diff is large here, it is nothing more that code motion to make types available in a header file for use in a different source file within each target. I've tried to keep all the doxygen comments and file boilerplate in line with this move, but let me know if I missed anything. With this in place, the next step to making TTI work with the new pass manager is to introduce a really simple new-style analysis that produces a TTI object via a callback into this routine on the target machine. Once we have that, we'll have the building blocks necessary to accept a function argument as well. llvm-svn: 227685	2015-01-31 11:17:59 +00:00
Chandler Carruth	b2d6052871	[PM] Change the core design of the TTI analysis to use a polymorphic type erased interface and a single analysis pass rather than an extremely complex analysis group. The end result is that the TTI analysis can contain a type erased implementation that supports the polymorphic TTI interface. We can build one from a target-specific implementation or from a dummy one in the IR. I've also factored all of the code into "mix-in"-able base classes, including CRTP base classes to facilitate calling back up to the most specialized form when delegating horizontally across the surface. These aren't as clean as I would like and I'm planning to work on cleaning some of this up, but I wanted to start by putting into the right form. There are a number of reasons for this change, and this particular design. The first and foremost reason is that an analysis group is complete overkill, and the chaining delegation strategy was so opaque, confusing, and high overhead that TTI was suffering greatly for it. Several of the TTI functions had failed to be implemented in all places because of the chaining-based delegation making there be no checking of this. A few other functions were implemented with incorrect delegation. The message to me was very clear working on this -- the delegation and analysis group structure was too confusing to be useful here. The other reason of course is that this is much more natural fit for the new pass manager. This will lay the ground work for a type-erased per-function info object that can look up the correct subtarget and even cache it. Yet another benefit is that this will significantly simplify the interaction of the pass managers and the TargetMachine. See the future work below. The downside of this change is that it is very, very verbose. I'm going to work to improve that, but it is somewhat an implementation necessity in C++ to do type erasure. =/ I discussed this design really extensively with Eric and Hal prior to going down this path, and afterward showed them the result. No one was really thrilled with it, but there doesn't seem to be a substantially better alternative. Using a base class and virtual method dispatch would make the code much shorter, but as discussed in the update to the programmer's manual and elsewhere, a polymorphic interface feels like the more principled approach even if this is perhaps the least compelling example of it. ;] Ultimately, there is still a lot more to be done here, but this was the huge chunk that I couldn't really split things out of because this was the interface change to TTI. I've tried to minimize all the other parts of this. The follow up work should include at least: 1) Improving the TargetMachine interface by having it directly return a TTI object. Because we have a non-pass object with value semantics and an internal type erasure mechanism, we can narrow the interface of the TargetMachine to just do what we need: build and return a TTI object that we can then insert into the pass pipeline. 2) Make the TTI object be fully specialized for a particular function. This will include splitting off a minimal form of it which is sufficient for the inliner and the old pass manager. 3) Add a new pass manager analysis which produces TTI objects from the target machine for each function. This may actually be done as part of #2 in order to use the new analysis to implement #2. 4) Work on narrowing the API between TTI and the targets so that it is easier to understand and less verbose to type erase. 5) Work on narrowing the API between TTI and its clients so that it is easier to understand and less verbose to forward. 6) Try to improve the CRTP-based delegation. I feel like this code is just a bit messy and exacerbating the complexity of implementing the TTI in each target. Many thanks to Eric and Hal for their help here. I ended up blocked on this somewhat more abruptly than I expected, and so I appreciate getting it sorted out very quickly. Differential Revision: http://reviews.llvm.org/D7293 llvm-svn: 227669	2015-01-31 03:43:40 +00:00
Eric Christopher	cc4cd0396b	Remove the last vestiges of resetOperationActions. llvm-svn: 227648	2015-01-31 00:21:17 +00:00
Lang Hames	0450021714	[PBQP] Fix transposed worst row/column check in handleAdd/RemoveNode in the PBQP allocator. Patch by Jonas Paulsson. Thanks Jonas! llvm-svn: 227628	2015-01-30 22:28:49 +00:00
Eric Christopher	2ab75347e5	Add a similar templated cast for getSubtarget off of the MachineFunction to save typing a lot of static_casts. llvm-svn: 227621	2015-01-30 22:02:19 +00:00
Adrian Prantl	94fa62f69f	Inliner: Use replaceDbgDeclareForAlloca() instead of splicing the instruction and generalize it to optionally dereference the variable. Follow-up to r227544. llvm-svn: 227604	2015-01-30 19:37:48 +00:00
Zachary Turner	9a7f59f9ea	Move DebugInfo to DebugInfo/DWARF. In preparation for adding PDB support to LLVM, this moves the DWARF parsing code to its own subdirectory under DebugInfo, and renames LLVMDebugInfo to LLVMDebugInfoDWARF. This is purely a mechanical / build system change. Differential Revision: http://reviews.llvm.org/D7269 Reviewed by: Eric Christopher llvm-svn: 227586	2015-01-30 18:07:45 +00:00

... 2 3 4 5 6 ...

22704 Commits