llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 21:13:02 +02:00

Author	SHA1	Message	Date
David Majnemer	c0087ca297	WinCOFF: Transform IR expressions featuring __ImageBase into image relative relocations MSVC on x64 requires that we create image relative symbol references to refer to RTTI data. Seeing as how there is no way to explicitly make reference to a given relocation type in LLVM IR, pattern match expressions of the form &foo - &__ImageBase. Differential Revision: http://llvm-reviews.chandlerc.com/D2523 llvm-svn: 199312	2014-01-15 09:16:42 +00:00
Rafael Espindola	32f031d9a5	Return an error_code from materializeAllPermanently. llvm-svn: 199275	2014-01-14 23:51:27 +00:00
Rafael Espindola	487862e5fb	Use error_code in Module::materializeAll. llvm-svn: 199269	2014-01-14 23:02:01 +00:00
Nico Rieck	964a13bb4e	Decouple dllexport/dllimport from linkage Representing dllexport/dllimport as distinct linkage types prevents using these attributes on templates and inline functions. Instead of introducing further mixed linkage types to include linkonce and weak ODR, the old import/export linkage types are replaced with a new separate visibility-like specifier: define available_externally dllimport void @f() {} @Var = dllexport global i32 1, align 4 Linkage for dllexported globals and functions is now equal to their linkage without dllexport. Imported globals and functions must be either declarations with external linkage, or definitions with AvailableExternallyLinkage. llvm-svn: 199218	2014-01-14 15:22:47 +00:00
Nico Rieck	e8a579c6bc	Revert "Decouple dllexport/dllimport from linkage" Revert this for now until I fix an issue in Clang with it. This reverts commit r199204. llvm-svn: 199207	2014-01-14 12:38:32 +00:00
Nico Rieck	6203d44313	Decouple dllexport/dllimport from linkage Representing dllexport/dllimport as distinct linkage types prevents using these attributes on templates and inline functions. Instead of introducing further mixed linkage types to include linkonce and weak ODR, the old import/export linkage types are replaced with a new separate visibility-like specifier: define available_externally dllimport void @f() {} @Var = dllexport global i32 1, align 4 Linkage for dllexported globals and functions is now equal to their linkage without dllexport. Imported globals and functions must be either declarations with external linkage, or definitions with AvailableExternallyLinkage. llvm-svn: 199204	2014-01-14 11:55:03 +00:00
Chandler Carruth	220fc7eab1	[PM] Fix stale header blocker, found by Duncan Smith in code review! llvm-svn: 199185	2014-01-14 05:50:19 +00:00
Chandler Carruth	98adff6224	[PM] Split DominatorTree into a concrete analysis result object which can be used by both the new pass manager and the old. This removes it from any of the virtual mess of the pass interfaces and lets it derive cleanly from the DominatorTreeBase<> template. In turn, tons of boilerplate interface can be nuked and it turns into a very straightforward extension of the base DominatorTree interface. The old analysis pass is now a simple wrapper. The names and style of this split should match the split between CallGraph and CallGraphWrapperPass. All of the users of DominatorTree have been updated to match using many of the same tricks as with CallGraph. The goal is that the common type remains the resulting DominatorTree rather than the pass. This will make subsequent work toward the new pass manager significantly easier. Also in numerous places things became cleaner because I switched from re-running the pass (!!! mid way through some other passes run!!!) to directly recomputing the domtree. llvm-svn: 199104	2014-01-13 13:07:17 +00:00
Chandler Carruth	b973631374	[PM][cleanup] Clean up comments and use modern doxygen in this file. This is a precursor to breaking the pass that computes the DominatorTree apart from the concrete DominatorTree. llvm-svn: 199103	2014-01-13 13:06:58 +00:00
Elena Demikhovsky	e635ade802	AVX-512: Embedded Rounding Control - encoding and printing Changed intrinsics for vrcp14/vrcp28 vrsqrt14/vrsqrt28 - aligned with GCC. llvm-svn: 199102	2014-01-13 12:55:03 +00:00
Chandler Carruth	59e885531a	[PM] Pull the generic graph algorithms and data structures for dominator trees into the Support library. These are all expressed in terms of the generic GraphTraits and CFG, with no reliance on any concrete IR types. Putting them in support clarifies that and makes the fact that the static analyzer in Clang uses them much more sane. When moving the Dominators.h file into the IR library I claimed that this was the right home for it but not something I planned to work on. Oops. So why am I doing this? It happens to be one step toward breaking the requirement that IR verification can only be performed from inside of a pass context, which completely blocks the implementation of verification for the new pass manager infrastructure. Fixing it will also allow removing the concept of the "preverify" step (WTF???) and allow the verifier to cleanly flag functions which fail verification in a way that precludes even computing dominance information. Currently, that results in a fatal error even when you ask the verifier to not fatally error. It's awesome like that. The yak shaving will continue... llvm-svn: 199095	2014-01-13 10:52:56 +00:00
Chandler Carruth	6e834c5459	[cleanup] Switch comments to use '\brief' style instead of '@brief' style, and remove some unnecessary comments (the code is perfectly self-documenting here). Also clang-format the function declarations as they wrap cleanly now. llvm-svn: 199084	2014-01-13 09:31:09 +00:00
Chandler Carruth	ee051af6e2	[cleanup] Move the Dominators.h and Verifier.h headers into the IR directory. These passes are already defined in the IR library, and it doesn't make any sense to have the headers in Analysis. Long term, I think there is going to be a much better way to divide these matters. The dominators code should be fully separated into the abstract graph algorithm and have that put in Support where it becomes obvious that evn Clang's CFGBlock's can use it. Then the verifier can manually construct dominance information from the Support-driven interface while the Analysis library can provide a pass which both caches, reconstructs, and supports a nice update API. But those are very long term, and so I don't want to leave the really confusing structure until that day arrives. llvm-svn: 199082	2014-01-13 09:26:24 +00:00
Chandler Carruth	2fbea03f0f	[PM] Add module and function printing passes for the new pass manager. This implements the legacy passes in terms of the new ones. It adds basic testing using explicit runs of the passes. Next up will be wiring the basic output mechanism of opt up when the new pass manager is engaged unless bitcode writing is requested. llvm-svn: 199049	2014-01-12 12:15:39 +00:00
Chandler Carruth	dbb2d752f7	[PM] Revert an accidental commit of total BS code. This was halfway through being editted, and I forgot to delete it before committing. What's more awesome is that it compiles cleanly! llvm-svn: 199048	2014-01-12 11:41:43 +00:00
Chandler Carruth	613d3c5c3d	[PM] Simplify the interface exposed for IR printing passes. Nothing was using the ability of the pass to delete the raw_ostream it printed to, and nothing was trying to pass it a pointer to the raw_ostream. Also, the function variant had a different order of arguments from all of the others which was just really confusing. Now the interface accepts a reference, doesn't offer to delete it, and uses a consistent order. The implementation of the printing passes haven't been updated with this simplification, this is just the API switch. llvm-svn: 199044	2014-01-12 11:30:46 +00:00
Chandler Carruth	076d51813d	[PM] Rename the IR printing pass header to a more generic and correct name to match the source file which I got earlier. Update the include sites. Also modernize the comments in the header to use the more recommended doxygen style. llvm-svn: 199041	2014-01-12 11:10:32 +00:00
Chandler Carruth	c0a4ac3c7f	[PM] Un-indent this file-level namespace. It's far more common to not indent the outer-most llvm namespace in header files. llvm-svn: 199040	2014-01-12 10:56:57 +00:00
Chandler Carruth	5a5a877cc2	[PM] Add names to passes under the new pass manager, and a debug output mode that can be used to debug the execution of everything. No support for analyses here, that will come later. This already helps show parts of the opt commandline integration that isn't working. Tests of that will start using it as the bugs are fixed. llvm-svn: 199004	2014-01-11 11:52:05 +00:00
Chandler Carruth	53ce9f07be	[PM] Somehow I missed the header guards on this file. Yikes! llvm-svn: 199003	2014-01-11 10:59:00 +00:00
Eric Christopher	8d9cd2a9e2	Fix odd whitespace. llvm-svn: 198978	2014-01-11 00:23:11 +00:00
Rafael Espindola	4ef724a8d2	Use 'w' instead of 'c' to represent the win32 mangling. This change was requested to avoid confusion if we ever support non windows coff systems. llvm-svn: 198938	2014-01-10 13:42:12 +00:00
Nadav Rotem	0ee224c122	Re-remove dead code. This reverts r198854. llvm-svn: 198879	2014-01-09 19:22:07 +00:00
Nadav Rotem	4b218fdee6	Revert r198819 - "Remove dead code." llvm-svn: 198854	2014-01-09 07:50:34 +00:00
Chandler Carruth	53468087f3	Put the functionality for printing a value to a raw_ostream as an operand into the Value interface just like the core print method is. That gives a more conistent organization to the IR printing interfaces -- they are all attached to the IR objects themselves. Also, update all the users. This removes the 'Writer.h' header which contained only a single function declaration. llvm-svn: 198836	2014-01-09 02:29:41 +00:00
Rafael Espindola	13382e6515	Remove dead code. llvm-svn: 198819	2014-01-09 00:32:54 +00:00
Chandler Carruth	a41aa8b867	Remove vestigal bits of MC from the mangler. It no longer uses this, and having the include could cause weird layering problems between the IR and MC libraries. llvm-svn: 198796	2014-01-08 21:59:22 +00:00
Elena Demikhovsky	1ecccf9364	AVX-512: Added more intrinsics for pmin/pmax, pabs, blend, pmuldq. llvm-svn: 198745	2014-01-08 10:54:22 +00:00
Rafael Espindola	4dc5af8bc2	Move the llvm mangler to lib/IR. This makes it available to tools that don't link with target (like llvm-ar). llvm-svn: 198708	2014-01-07 21:19:40 +00:00
Chandler Carruth	7aa902a488	Move the LLVM IR asm writer header files into the IR directory, as they are part of the core IR library in order to support dumping and other basic functionality. Rename the 'Assembly' include directory to 'AsmParser' to match the library name and the only functionality left their -- printing has been in the core IR library for quite some time. Update all of the #includes to match. All of this started because I wanted to have the layering in good shape before I started adding support for printing LLVM IR using the new pass infrastructure, and commandline support for the new pass infrastructure. llvm-svn: 198688	2014-01-07 12:34:26 +00:00
Chandler Carruth	87f14b4eec	Re-sort all of the includes with ./utils/sort_includes.py so that subsequent changes are easier to review. About to fix some layering issues, and wanted to separate out the necessary churn. Also comment and sink the include of "Windows.h" in three .inc files to match the usage in Memory.inc. llvm-svn: 198685	2014-01-07 11:48:04 +00:00
Elena Demikhovsky	591c25725f	AVX-512: added intrinsic vcvtpd2ps (with rounding mode and without) llvm-svn: 198593	2014-01-06 08:45:54 +00:00
Elena Demikhovsky	034a667c24	AVX-512: Added more intrinsics for convert and min/max. Removed vzeroupper from AVX-512 mode - our optimization gude does not recommend to insert vzeroupper at all. llvm-svn: 198557	2014-01-05 10:46:09 +00:00
Chandler Carruth	8d4f29ba83	Fix a bug in IRBuilder that's been there for who knows how long. It failed to correctly propagate the NUW and NSW flags to the constant folder for two instructions. I've added a unittest to cover flag propagation for the rest of the instructions and constant expressions. llvm-svn: 198538	2014-01-05 03:22:33 +00:00
Reid Kleckner	4881e00ef7	Fix MSVC warning about missing return in DataLayout llvm-svn: 198465	2014-01-03 23:51:09 +00:00
Rafael Espindola	eae6386a1e	Make the llvm mangler depend only on DataLayout. Before this patch any program that wanted to know the final symbol name of a GlobalValue had to link with Target. This patch implements a compromise solution where the mangler uses DataLayout. This way, any tool that already links with Target (llc, clang) gets the exact behavior as before and new IR files can be mangled without linking with Target. With this patch the mangler is constructed with just a DataLayout and DataLayout is extended to include the information the Mangler needs. llvm-svn: 198438	2014-01-03 19:21:54 +00:00
Rafael Espindola	95d600810f	Remove the 's' DataLayout specification During the years there have been some attempts at figuring out how to align byval arguments. A look at the commit log suggests that they were * Use the ABI alignment. * When that was not sufficient for x86-64, I added the 's' specification to DataLayout. * When that was not sufficient Evan added the virtual getByValTypeAlignment. * When even that was not sufficient, we just got the FE to add the alignment to the byval. This patch is just a simple cleanup that removes my first attempt at fixing the problem. I also added an AArch64 implementation of getByValTypeAlignment to make sure this patch is a nop. I also left the 's' parsing for backward compatibility. I will send a short email to llvmdev about the change for anyone maintaining an out of tree target. llvm-svn: 198287	2014-01-01 22:29:43 +00:00
Elena Demikhovsky	7174584583	AVX-512: Added intrinsics for vcvt, vcvtt, vrndscale, vcmp Printing rounding control. Enncoding for EVEX_RC (rounding control). llvm-svn: 198277	2014-01-01 15:12:34 +00:00
Craig Topper	04690a3406	Mark some Type and EVT methods as LLVM_READONLY. llvm-svn: 198115	2013-12-28 16:17:26 +00:00
Dmitri Gribenko	f28ad7e5bf	Remove the AnyPointerSize and AnyEndianness enumerators, which were left from LLVM's early days. Today LLVM IR is always target-specific. llvm-svn: 197772	2013-12-20 03:11:07 +00:00
Reid Kleckner	f795c3e4a9	Begin adding docs and IR-level support for the inalloca attribute The inalloca attribute is designed to support passing C++ objects by value in the Microsoft C++ ABI. It behaves the same as byval, except that it always implies that the argument is in memory and that the bytes are never copied. This attribute allows the caller to take the address of an outgoing argument's memory and execute arbitrary code to store into it. This patch adds basic IR support, docs, and verification. It does not attempt to implement any lowering or fix any possibly broken transforms. When this patch lands, a complete description of this feature should appear at http://llvm.org/docs/InAlloca.html . Differential Revision: http://llvm-reviews.chandlerc.com/D2173 llvm-svn: 197645	2013-12-19 02:14:12 +00:00
Quentin Colombet	67a68c0b99	Add warning capabilities in LLVM. This reapplies r197438 and fixes the link-time circular dependency between IR and Support. The fix consists in moving the diagnostic support into IR. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197508	2013-12-17 17:47:22 +00:00
Quentin Colombet	71b4c4cbe8	Revert r197438 and r197447 until we figure out how to avoid circular dependency at link time llvm-svn: 197451	2013-12-17 01:19:59 +00:00
Quentin Colombet	6369ce9a04	Add warning capabilities in LLVM. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197438	2013-12-16 23:22:51 +00:00
NAKAMURA Takumi	4b39753783	[CMake] Introduce LLVM_INCLUDE_DIR. llvm-svn: 197392	2013-12-16 15:05:39 +00:00
Rafael Espindola	0846fac895	Pointer sizes are stored in Bytes. Fix variables names to say so. Also update for the current naming style. llvm-svn: 197283	2013-12-13 23:15:20 +00:00
Andrew Trick	e726cc0278	Grow the stackmap/patchpoint format to hold 64-bit IDs. llvm-svn: 197255	2013-12-13 18:37:10 +00:00
Chad Rosier	551789d294	[AArch64] Refactor NEON floating-point Max/Min/Maxnm/Minnm across vector AArch64 intrinsics to use f32 types, rather than their vector equivalents. llvm-svn: 197090	2013-12-11 23:21:25 +00:00
Chad Rosier	c251a82254	[AArch64] Add NEON scalar floating-point compare LLVM AArch64 intrinsics that use f32/f64 types, rather than their vector equivalents. llvm-svn: 197068	2013-12-11 21:03:46 +00:00
Chad Rosier	0b1fef12e8	[AArch64] Refactor the NEON scalar floating-point reciprocal step and floating-point reciprocal square root step LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197067	2013-12-11 21:03:43 +00:00
Chad Rosier	43daaa765b	[AArch64] Refactor the NEON scalar floating-point reciprocal estimate, floating- point reciprocal exponent, and floating-point reciprocal square root estimate LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197066	2013-12-11 21:03:40 +00:00
Chad Rosier	29ed5c4552	[AArch64] Refactor the NEON floating-point absolute difference LLVM AArch64 intrinsic to use f32/f64 types, rather than their vector equivalents. llvm-svn: 196965	2013-12-10 21:33:59 +00:00
Chad Rosier	5394f9c916	[AArch64] Refactor the NEON signed/unsigned floating-point convert to fixed-point LLVM AArch64 intrinsics to use f32/f64, rather than their vector equivalents. llvm-svn: 196964	2013-12-10 21:33:56 +00:00
Chad Rosier	b2112dc6c3	[AArch64] Overload NEON signed/unsigned floating-point convert to fixed-point and fixed-point convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196963	2013-12-10 21:33:53 +00:00
Chad Rosier	3d7979609e	[AArch64] Overload NEON signed/unsigned integer convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196962	2013-12-10 21:33:50 +00:00
Chad Rosier	7e9f19f92d	[AArch64] Refactor the Neon vector/scalar floating-point convert intrinsics so that they use float/double rather than the vector equivalents when appropriate. llvm-svn: 196930	2013-12-10 16:11:39 +00:00
Chad Rosier	0b6c7be6f7	[AArch64] Refactor the Neon vector/scalar floating-point convert implementation. Specifically, reuse the ARM intrinsics when possible. llvm-svn: 196926	2013-12-10 15:35:33 +00:00
Elena Demikhovsky	57057960b0	AVX-512: changed intrinsics for mask operations llvm-svn: 196918	2013-12-10 13:53:10 +00:00
Elena Demikhovsky	b3a0e7bbed	AVX-512: Changed intrinsics of VPCONFLICT to match GCC builtin form llvm-svn: 196914	2013-12-10 11:58:35 +00:00
Daniel Sanders	89ddadb4c5	[mips][msa] Correct sld and sldi builtins. Summary: The result register of these instructions is also the first operand. Reviewers: jacksprat, dsanders Reviewed By: dsanders Differential Revision: http://llvm-reviews.chandlerc.com/D2362 Differential Revision: http://llvm-reviews.chandlerc.com/D2363 llvm-svn: 196910	2013-12-10 11:37:00 +00:00
Kevin Qin	746aa8a55e	[AArch64 NEON] Support poly128_t and implement relevant intrinsic. llvm-svn: 196887	2013-12-10 06:48:35 +00:00
Chad Rosier	8ba851adda	[AArch64] Refactor the NEON scalar reduce pairwise intrinsics, so that they use float/double rather than the vector equivalents when appropriate. llvm-svn: 196833	2013-12-09 22:47:38 +00:00
Chad Rosier	850366132e	[AArch64] Remove q and non-q intrinsic definitions in the NEON scalar reduce pairwise implementation, using an overloaded definition instead. llvm-svn: 196831	2013-12-09 22:47:31 +00:00
Duncan P. N. Exon Smith	7cf419b14c	Fix comments for PassDebuggingString No functionality change. Changing comments to match code. llvm-svn: 196713	2013-12-08 01:28:17 +00:00
Rafael Espindola	6f395c44b1	Remove the notion of primitive types. They were out of place since the introduction of arbitrary precision integer types. This also synchronizes the documentation to Types.h, so it refers to first class types and single value types. llvm-svn: 196661	2013-12-07 19:34:20 +00:00
Matt Arsenault	e6d3e47f0e	Add getBitCastOrAddrSpaceCast llvm-svn: 196637	2013-12-07 02:58:41 +00:00
Rafael Espindola	76ea3a04cd	Remove unused value. llvm-svn: 196635	2013-12-07 02:27:52 +00:00
Kaelyn Uhrain	ee5ebb655a	Fix the segfault reported in PR 11990. The sefault occurs due to an infinite loop when the verifier tries to determine the size of a type of the form "%rt = type { %rt }" while checking an alloca of the type. llvm-svn: 196626	2013-12-07 00:13:34 +00:00
Cameron McInally	ca9f2bc25b	Update AVX512 vector blend intrinsic names. llvm-svn: 196581	2013-12-06 13:35:35 +00:00
Michael Ilseman	fb9a99d2cf	Use present fast-math flags when applicable in CreateBinOp We were previously not adding fast-math flags through CreateBinOp() when it happened to be making a floating point binary operator. This patch updates it to do so similarly to directly calling CreateF*(). llvm-svn: 196438	2013-12-05 00:32:09 +00:00
Manman Ren	380360c403	Debug Info: Move the constant for Debug Info Version from Dwarf.h to Metadata.h. Suggested by Eric. llvm-svn: 196144	2013-12-02 20:09:52 +00:00
Chad Rosier	ca062e81db	[AArch64] Add support for NEON scalar floating-point absolute difference. llvm-svn: 195803	2013-11-27 01:45:58 +00:00
Chad Rosier	1337fcc721	[AArch64] Add support for NEON scalar floating-point to integer convert instructions. llvm-svn: 195788	2013-11-26 22:17:37 +00:00
Chandler Carruth	a6255babf7	[PM] Fix a stale comment after my last refactoring spoted by Joey in review! llvm-svn: 195757	2013-11-26 12:00:58 +00:00
Chandler Carruth	cd1b8cdb36	[PM] Remove four extraneous 'typename's that Clang (in C++11 mode) is happy with but GCC complains about. I'm assuming both compilers are correct and these are optional in C++11 because I'm too tired to read the standard. ;] llvm-svn: 195748	2013-11-26 11:31:06 +00:00
Chandler Carruth	f3c4692f91	[PM] Factor the overwhelming majority of the interface boiler plate out of the two analysis managers into a CRTP base class that can be shared and re-used in building any analysis manager. This will in turn simplify adding yet another analysis manager to the system. The base class provides all of the interface sugar for the analysis manager delegating the functionality back through DerivedT methods which operate on simple pass IDs. It also provides the pass registration, storage, and lookup system which is common across the various formulations of analysis managers. llvm-svn: 195747	2013-11-26 11:24:37 +00:00
Cameron McInally	2ff051483c	Add an intrinsic for the SSE2 PAUSE instruction. llvm-svn: 195697	2013-11-26 00:20:43 +00:00
Chandler Carruth	bc6ef9dce9	[PM] Complete the cross-layer interfaces with a Module-to-Function proxy. This lets a function pass query a module analysis manager. However, the interface is const to indicate that only cached results can be safely queried. With this, I think the new pass manager is largely functionally complete for modules and analyses. Still lots to test, and need to generalize to SCCs and Loops, and need to build an adaptor layer to support the use of existing Pass objects in the new managers. llvm-svn: 195538	2013-11-23 01:25:07 +00:00
Chandler Carruth	a3ea32a7e4	[PM] Add support to the analysis managers to query explicitly for cached results. This is the last piece of infrastructure needed to effectively support querying up the analysis layers. The next step will be to introduce a proxy which provides access to those layers with appropriate use of const to direct queries to the safe interface. llvm-svn: 195525	2013-11-23 00:38:42 +00:00
Chandler Carruth	be663f7b2a	[PM] Switch the downward invalidation to be incremental where only the one function's analyses are invalidated at a time. Also switch the preservation of the proxy to fully preserve the lower (function) analyses. Combined, this gets both upward and downward analysis invalidation to a point I'm happy with: - A function pass invalidates its function analyses, and its parent's module analyses. - A module pass invalidates all of its functions' analyses including the set of which functions are in the module. - A function pass can preserve a module analysis pass. - If all function passes preserve a module analysis pass, that preservation persists. If any doesn't the module analysis is invalidated. - A module pass can opt into managing all function analysis invalidation itself or none. - The conservative default is none, and the proxy takes the maximally conservative approach that works even if the set of functions has changed. - If a module pass opts into managing function analysis invalidation it has to propagate the invalidation itself, the proxy just does nothing. The only thing really missing is a way to query for a cached analysis or nothing at all. With this, function passes can more safely request a cached module analysis pass without fear of it accidentally running part way through. llvm-svn: 195519	2013-11-22 23:38:07 +00:00
Chandler Carruth	46f0053a63	[PM] Remove a FIXME comment that was fixed by my recent refactorings: now the access to the manager is via the proxy that ensures it behaves correctly. llvm-svn: 195518	2013-11-22 23:37:54 +00:00
Chandler Carruth	12640d8d49	[PM] Remove extraneous space that I left in there. llvm-svn: 195453	2013-11-22 12:26:40 +00:00
Chandler Carruth	df1a8fd535	[PM] Teach the analysis managers to pass themselves as arguments to the run methods of the analysis passes. Also generalizes and re-uses the SFINAE for transformation passes so that users can write an analysis pass and only accept an analysis manager if that is useful to their pass. This completes the plumbing to make an analysis manager available through every pass's run method if desired so that passes no longer need to be constructed around them. llvm-svn: 195451	2013-11-22 12:11:02 +00:00
Chandler Carruth	fe39062aad	[PM] Reverse the template arguments 'PassT' and 'AnalysisManagerT' in several templates. The previous order didn't make any sense as it separated 'IRUnitT' and 'AnalysisManagerT', the types which are essentially paired and passed along together throughout the layers. llvm-svn: 195450	2013-11-22 11:55:38 +00:00
Chandler Carruth	7692110912	[PM] Remove the IRUnitT typedef requirement for analysis passes. Since the analysis managers were split into explicit function and module analysis managers, it is now completely trivial to specify this when building up the concept and model types explicitly, and it is impossible to end up with a type error at run time. We instantiate a template when registering a pass that will enforce the requirement at a type-system level, and we produce a dynamic error on all the other query paths to the analysis manager if the pass in question isn't registered. llvm-svn: 195447	2013-11-22 11:46:33 +00:00
Chandler Carruth	8370a1a333	[PM] Fix the analysis templates' usage of IRUnitT. This is supposed to be the whole type of the IR unit, and so we shouldn't pass a pointer to it but rather the value itself. In turn, we need to provide a 'Module *' as that type argument (for example). This will become more relevant with SCCs or other units which may not be passed as a pointer type, but also brings consistency with the transformation pass templates. llvm-svn: 195445	2013-11-22 11:34:43 +00:00
Chandler Carruth	dee2e63512	[PM] Simplify how the SFINAE for AnalysisResultModel is applied by factoring it out into the default template argument so clients don't have to even think about it. llvm-svn: 195402	2013-11-22 00:48:49 +00:00
Chandler Carruth	28195a6d83	[PM] Switch analysis managers to be threaded through the run methods rather than the constructors of passes. This simplifies the APIs of passes significantly and removes an error prone pattern where the same manager had to be given to every different layer. With the new API the analysis managers themselves will have to be cross connected with proxy analyses that allow a pass at one layer to query for the analysis manager of another layer. The proxy will both expose a handle to the other layer's manager and it will provide the invalidation hooks to ensure things remain consistent across layers. Finally, the outer-most analysis manager has to be passed to the run method of the outer-most pass manager. The rest of the propagation is automatic. I've used SFINAE again to allow passes to completely disregard the analysis manager if they don't need or want to care. This helps keep simple things simple for users of the new pass manager. Also, the system specifically supports passing a null pointer into the outer-most run method if your pass pipeline neither needs nor wants to deal with analyses. I find this of dubious utility as while some passes don't care about analysis, I'm not sure there are any real-world users of the pass manager itself that need to avoid even creating an analysis manager. But it is easy to support, so there we go. Finally I renamed the module proxy for the function analysis manager to the more verbose but less confusing name of FunctionAnalysisManagerModuleProxy. I hate this name, but I have no idea what else to name these things. I'm expecting in the fullness of time to potentially have the complete cross product of types at the proxy layer: {Module,SCC,Function,Loop,Region}AnalysisManager{Module,SCC,Function,Loop,Region}Proxy (except for XAnalysisManagerXProxy which doesn't make any sense) This should make it somewhat easier to do the next phases which is to build the upward proxy and get its invalidation correct, as well as to make the invalidation within the Module -> Function mapping pass be more fine grained so as to invalidate fewer fuction analyses. After all of the proxy analyses are done and the invalidation working, I'll finally be able to start working on the next two fun fronts: how to adapt an existing pass to work in both the legacy pass world and the new one, and building the SCC, Loop, and Region counterparts. Fun times! llvm-svn: 195400	2013-11-22 00:43:29 +00:00
Chandler Carruth	4733811090	[PM] Fix typo and trailing space. llvm-svn: 195340	2013-11-21 11:04:53 +00:00
Chandler Carruth	a087921555	[PM] Widen the interface for invalidate on an analysis result now that it is completely optional, and sink the logic for handling the preserved analysis set into it. This allows us to implement the delegation logic desired in the proxy module analysis for the function analysis manager where if the proxy itself is preserved we assume the set of functions hasn't changed and we do a fine grained invalidation by walking the functions in the module and running the invalidate for them all at the manager level and letting it try to invalidate any passes. This in turn makes it blindingly obvious why we should hoist the invalidate trait and have two collections of results. That allows handling invalidation for almost all analyses without indirect calls and it allows short circuiting when the preserved set is all. llvm-svn: 195338	2013-11-21 10:53:05 +00:00
Chandler Carruth	41afa5d52d	[PM] Add support for using SFINAE to reflect on an analysis's result type and detect whether or not it provides an 'invalidate' member the analysis manager should use. This lets the overwhelming common case of not caring about custom behavior when an analysis is invalidated be the the obvious default behavior with no code written by the author of an analysis. Only when they write code specifically to handle invalidation does it get used. Both cases are actually covered by tests here. The test analysis uses the default behavior, and the proxy module analysis actually has custom behavior on invalidation that is firing correctly. (In fact, this is the analysis which was the primary motivation for having custom invalidation behavior in the first place.) llvm-svn: 195332	2013-11-21 09:10:21 +00:00
Ana Pazos	5ddc31e426	Implemented Neon scalar by element intrinsics. Intrinsics implemented: vqdmull_lane, vqdmulh_lane, vqrdmulh_lane, vqdmlal_lane, vqdmlsl_lane scalar Neon intrinsics. llvm-svn: 195327	2013-11-21 07:37:04 +00:00
Chandler Carruth	dbfa25a6b6	[PM] Add a module analysis pass proxy for the function analysis manager. This proxy will fill the role of proxying invalidation events down IR unit layers so that when a module changes we correctly invalidate function analyses. Currently this is a very coarse solution -- any change blows away the entire thing -- but the next step is to make invalidation handling more nuanced so that we can propagate specific amounts of invalidation from one layer to the next. The test is extended to place a module pass between two function pass managers each of which have preserved function analyses which get correctly invalidated by the module pass that might have changed what functions are even in the module. llvm-svn: 195304	2013-11-21 02:11:31 +00:00
Chandler Carruth	5bbc7e8ce9	[PM] Add the preservation system to the new pass manager. This adds a new set-like type which represents a set of preserved analysis passes. The set is managed via the opaque PassT::ID() void*s. The expected convenience templates for interacting with specific passes are provided. It also supports a symbolic "all" state which is represented by an invalid pointer in the set. This state is nicely saturating as it comes up often. Finally, it supports intersection which is used when finding the set of preserved passes after N different transforms. The pass API is then changed to return the preserved set rather than a bool. This is much more self-documenting than the previous system. Returning "none" is a conservatively correct solution just like returning "true" from todays passes and not marking any passes as preserved. Passes can also be dynamically preserved or not throughout the run of the pass, and whatever gets returned is the binding state. Finally, preserving "all" the passes is allowed for no-op transforms that simply can't harm such things. Finally, the analysis managers are changed to instead of blindly invalidating all of the analyses, invalidate those which were not preserved. This should rig up all of the basic preservation functionality. This also correctly combines the preservation moving up from one IR-layer to the another and the preservation aggregation across N pass runs. Still to go is incrementally correct invalidation and preservation across IR layers incrementally during N pass runs. That will wait until we have a device for even exposing analyses across IR layers. While the core of this change is obvious, I'm not happy with the current testing, so will improve it to cover at least some of the invalidation that I can test easily in a subsequent commit. llvm-svn: 195241	2013-11-20 11:31:50 +00:00
Chandler Carruth	37fa148ed0	[PM] Make the function pass manager more regular. The FunctionPassManager is now itself a function pass. When run over a function, it runs all N of its passes over that function. This is the 1:N mapping in the pass dimension only. This allows it to be used in either a ModulePassManager or potentially some other manager that works on IR units which are supersets of Functions. This commit also adds the obvious adaptor to map from a module pass to a function pass, running the function pass across every function in the module. The test has been updated to use this new pattern. llvm-svn: 195192	2013-11-20 04:39:16 +00:00
Chandler Carruth	9f55f1934e	[PM] Split the analysis manager into a function-specific interface and a module-specific interface. This is the first of many steps necessary to generalize the infrastructure such that we can support both a Module-to-Function and Module-to-SCC-to-Function pass manager nestings. After a lot of attempts that never worked and didn't even make it to a committable state, it became clear that I had gotten the layering design of analyses flat out wrong. Four days later, I think I have most of the plan for how to correct this, and I'm starting to reshape the code into it. This is just a baby step I'm afraid, but starts separating the fundamentally distinct concepts of function analysis passes and module analysis passes so that in subsequent steps we can effectively layer them, and have a consistent design for the eventual SCC layer. As part of this, I've started some interface changes to make passes more regular. The module pass accepts the module in the run method, and some of the constructor parameters are gone. I'm still working out exactly where constructor parameters vs. method parameters will be used, so I expect this to fluctuate a bit. This actually makes the invalidation less "correct" at this phase, because now function passes don't invalidate module analysis passes, but that was actually somewhat of a misfeature. It will return in a better factored form which can scale to other units of IR. The documentation has gotten less verbose and helpful. llvm-svn: 195189	2013-11-20 04:01:38 +00:00
Filip Pizlo	d0169a8474	Expose the fence instruction via the C API. llvm-svn: 195173	2013-11-20 00:07:49 +00:00
Hao Liu	fcc294f3dd	Implement the newly added ACLE functions for ld1/st1 with 2/3/4 vectors. The functions are like: vst1_s8_x2 ... llvm-svn: 194990	2013-11-18 06:31:53 +00:00
Matt Arsenault	763728c818	Fix spacing, forward declare order. llvm-svn: 194985	2013-11-18 02:51:33 +00:00
Chandler Carruth	07fce80a5a	[PM] Completely remove support for explicit 'require' methods on the AnalysisManager. All this method did was assert something and we have a perfectly good way to trigger that assert from the query path. llvm-svn: 194947	2013-11-17 03:18:05 +00:00
Ana Pazos	b1568fd504	Implemented aarch64 Neon scalar vmulx_lane intrinsics Implemented aarch64 Neon scalar vfma_lane intrinsics Implemented aarch64 Neon scalar vfms_lane intrinsics Implemented legacy vmul_n_f64, vmul_lane_f64, vmul_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. Implemented legacy vfma_lane_f64, vfms_lane_f64, vfma_laneq_f64, vfms_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. llvm-svn: 194888	2013-11-15 23:32:10 +00:00
Chad Rosier	6b1d577e71	[AArch64] Fix the scalar NEON ACLE functions so that they return float/double rather than the vector equivalent. llvm-svn: 194853	2013-11-15 21:28:10 +00:00
Cameron McInally	cae8bdeb82	Add AVX512 unmasked FMA intrinsics and support. llvm-svn: 194824	2013-11-15 17:01:14 +00:00
Matt Arsenault	9921608896	Add addrspacecast instruction. Patch by Michele Scandale! llvm-svn: 194760	2013-11-15 01:34:59 +00:00
Chandler Carruth	46b00ab145	Fix the header comment of the new pass manager stuff to not claim to be the legacy stuff. =] llvm-svn: 194689	2013-11-14 10:55:14 +00:00
Kevin Qin	7409a29609	[AArch64 neon] support poly64 and relevant intrinsic functions. llvm-svn: 194659	2013-11-14 03:27:58 +00:00
Kevin Qin	47a3b639e3	Implement aarch64 neon instruction class SIMD misc. llvm-svn: 194656	2013-11-14 02:44:13 +00:00
Jiangning Liu	5a9b5605ba	Implement AArch64 NEON instruction set AdvSIMD (table). llvm-svn: 194648	2013-11-14 01:57:32 +00:00
Chad Rosier	fae5b22550	[AArch64] Add support for legacy AArch32 NEON scalar shift by immediate instructions. This patch does not include the shift right and accumulate instructions. A number of non-overloaded intrinsics have been remove in favor of their overloaded counterparts. llvm-svn: 194598	2013-11-13 20:05:37 +00:00
Chandler Carruth	e238c58a05	Add another (perhaps better) video for Sean's talk. (Thanks Marshall!) llvm-svn: 194549	2013-11-13 02:49:38 +00:00
Chandler Carruth	4b8976e254	Give folks a reference to some material on the fundamental design pattern in use here. Addresses review feedback from Sean (thanks!) and others. llvm-svn: 194541	2013-11-13 01:51:36 +00:00
Chandler Carruth	4e1d27ef68	Introduce an AnalysisManager which is like a pass manager but with a lot more smarts in it. This is where most of the interesting logic that used to live in the implicit-scheduling-hackery of the old pass manager will live. Like the previous commits, note that this is a very early prototype! I expect substantial changes before this is ready to use. The core of the design is the following: - We have an AnalysisManager which can be used across a series of passes over a module. - The code setting up a pass pipeline registers the analyses available with the manager. - Individual transform passes can check than an analysis manager provides the analyses they require in order to fail-fast. - There is no implicit registration or scheduling. - Analysis passes are different from other passes: they produce an analysis result that is cached and made available via the analysis manager. - Cached results are invalidated automatically by the pass managers. - When a transform pass requests an analysis result, either the analysis is run to produce the result or a cached result is provided. There are a few aspects of this design that I know will change in subsequent commits: - Currently there is no "preservation" system, that needs to be added. - All of the analysis management should move up to the analysis library. - The analysis management needs to support at least SCC passes. Maybe loop passes. Living in the analysis library will facilitate this. - Need support for analyses which are both module and function passes. - Need support for pro-actively running module analyses to have cached results within a function pass manager. - Need a clear design for "immutable" passes. - Need support for requesting cached results when available and not re-running the pass even if that would be necessary. - Need more thorough testing of all of this infrastructure. There are other aspects that I view as open questions I'm hoping to resolve as I iterate a bit on the infrastructure, and especially as I start writing actual passes against this. - Should we have separate management layers for function, module, and SCC analyses? I think "yes", but I'm not yet ready to switch the code. Adding SCC support will likely resolve this definitively. - How should the 'require' functionality work? Should that be the only way to request results to ensure that passes always require things? - How should preservation work? - Probably some other things I'm forgetting. =] Look forward to more patches in shorter order now that this is in place. llvm-svn: 194538	2013-11-13 01:12:08 +00:00
Weiming Zhao	fcaf85bb3d	Export intrinsics:__builtin_arm_{dmb,dsb} to frontend llvm-svn: 194505	2013-11-12 19:57:43 +00:00
Chad Rosier	8d7ebe36dd	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194406	2013-11-11 19:11:11 +00:00
Chad Rosier	4848250116	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194394	2013-11-11 18:04:07 +00:00
Chandler Carruth	62e299ec37	[PM] Start sketching out the new module and function pass manager. This is still just a skeleton. I'm trying to pull together the experimentation I've done into committable chunks, and this is the first coherent one. Others will follow in hopefully short order that move this more toward a useful initial implementation. I still expect the design to continue evolving in small ways as I work through the different requirements and features needed here though. Keep in mind, all of this is off by default. Currently, this mostly exercises the use of a polymorphic smart pointer and templates to hide the polymorphism for the pass manager from the pass implementation. The next step will be more significant, adding the first framework of analysis support. llvm-svn: 194325	2013-11-09 13:09:08 +00:00
Chandler Carruth	f2e7a23acb	Move the old pass manager infrastructure into a legacy namespace and give the files a legacy prefix in the right directory. Use forwarding headers in the old locations to paper over the name change for most clients during the transitional period. No functionality changed here! This is just clearing some space to reduce renaming churn later on with a new system. Even when the new stuff starts to go in, it is going to be hidden behind a flag and off-by-default as it is still WIP and under development. This patch is specifically designed so that very little out-of-tree code has to change. I'm going to work as hard as I can to keep that the case. Only direct forward declarations of the PassManager class are impacted by this change. llvm-svn: 194324	2013-11-09 12:26:54 +00:00
Juergen Ributzka	f27436b708	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a paticular order into a specified set of registers. Instead it is up tp the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Jiangning Liu	59b8117b0b	Implement AArch64 Neon Crypto instruction classes AES, SHA, and 3 SHA. llvm-svn: 194085	2013-11-05 17:42:05 +00:00
Cameron McInally	02e4f56c18	Add support for AVX512 masked vector blend intrinsics. llvm-svn: 194006	2013-11-04 19:14:56 +00:00
Elena Demikhovsky	9c41e95ef5	AVX-512: fixed a typo in builtin name llvm-svn: 193988	2013-11-04 11:48:23 +00:00
Elena Demikhovsky	841cd7d09e	AVX-512: added VPCONFLICT instruction and intrinsics, added EVEX_KZ to tablegen llvm-svn: 193959	2013-11-03 13:46:31 +00:00
Rafael Espindola	af18aaf051	Remove linkonce_odr_auto_hide. linkonce_odr_auto_hide was in incomplete attempt to implement a way for the linker to hide symbols that are known to be available in every TU and whose addresses are not relevant for a particular DSO. It was redundant in that it all its uses are equivalent to linkonce_odr+unnamed_addr. Unlike those, it has never been connected to clang or llvm's optimizers, so it was effectively dead. Given that nothing produces it, this patch just nukes it (other than the llvm-c enum value). llvm-svn: 193865	2013-11-01 17:09:14 +00:00
Chad Rosier	fd7dc7524c	[AArch64] Add support for NEON scalar fixed-point convert to floating-point instructions. llvm-svn: 193816	2013-10-31 22:36:59 +00:00
Andrew Trick	bb45eecd46	Add new calling convention for WebKit Java Script. llvm-svn: 193812	2013-10-31 22:12:01 +00:00
Chad Rosier	aea5ba449f	[AArch64] Add support for NEON scalar shift immediate instructions. llvm-svn: 193790	2013-10-31 19:28:44 +00:00
Manman Ren	fcc7ee8c6f	Cleanup: update comments. llvm-svn: 193773	2013-10-31 17:25:22 +00:00
Andrew Trick	c9aa545bba	Add experimental stackmap intrinsics to definition file and documenation. llvm-svn: 193767	2013-10-31 17:18:14 +00:00
Andrew Trick	a8fb62f09f	Enable variable arguments support for intrinsics. llvm-svn: 193766	2013-10-31 17:18:11 +00:00
Cameron McInally	c38779faad	Add AVX512 unmasked integer broadcast intrinsics and support. llvm-svn: 193748	2013-10-31 13:56:31 +00:00
Daniel Sanders	f38eb463ae	[mips][msa] Correct definition of bins[lr] and CHECK-DAG-ize related tests llvm-svn: 193695	2013-10-30 15:45:42 +00:00
Daniel Sanders	a305fb792d	[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693	2013-10-30 15:20:38 +00:00
Chad Rosier	02e430c891	[AArch64] Add support for NEON scalar floating-point compare instructions. llvm-svn: 193691	2013-10-30 15:19:37 +00:00
Cameron McInally	41fc626495	Refactor the AVX512 intrinsics. Cluster the intrinsics into the appropriate vector extension class within the .td file. llvm-svn: 193690	2013-10-30 15:19:10 +00:00
Daniel Sanders	b9f927b698	[mips][msa] Added support for matching bins[lr]i.[bhwd] from normal IR (i.e. not intrinsics) This required correcting the definition of the bins[lr]i intrinsics because the result is also the first operand. It also required removing the (arbitrary) check for 32-bit immediates in MipsSEDAGToDAGISel::selectVSplat(). Currently using binsli.d with 2 bits set in the mask doesn't select binsli.d because the constant is legalized into a ConstantPool. Similar things can happen with binsri.d with more than 10 bits set in the mask. The resulting code when this happens is correct but not optimal. llvm-svn: 193687	2013-10-30 14:45:14 +00:00
Rafael Espindola	ccf00b3446	Clarify that GlobalVariables definitions must have an initializer. llvm-svn: 193609	2013-10-29 13:44:11 +00:00
Elena Demikhovsky	0e6849495e	AVX-512: PMIN/PMAX intrinsics and patterns Patch by Cameron McInally <cameron.mcinally@nyu.edu> llvm-svn: 193497	2013-10-27 08:18:37 +00:00
Shuxin Yang	eb29e658a0	Revert r193251 : Use address-taken to disambiguate global variable and indirect memops. llvm-svn: 193489	2013-10-27 03:08:44 +00:00
Elena Demikhovsky	da06b9b278	AVX-512: added VCVTPH2PS, VCVTPS2PH with intrinsics llvm-svn: 193312	2013-10-24 07:16:35 +00:00
Yuchen Wu	f3f29a7db5	Fixed doxygen comment to match Module.cpp llvm-svn: 193273	2013-10-23 21:25:44 +00:00
Shuxin Yang	45a453cafe	Use address-taken to disambiguate global variable and indirect memops. Major steps include: 1). introduces a not-addr-taken bit-field in GlobalVariable 2). GlobalOpt pass sets "not-address-taken" if it proves a global varirable dosen't have its address taken. 3). AA use this info for disambiguation. llvm-svn: 193251	2013-10-23 17:28:19 +00:00
Chad Rosier	838b6065b8	[AArch64] Add the constraint to NEON scalar mla/mls instructions. llvm-svn: 193117	2013-10-21 20:11:47 +00:00
Matheus Almeida	1760b2c642	[mips][msa] Fix definition of SLD instruction. The second parameter of the SLD intrinsic is the number of columns (GPR) to slide left the source array. llvm-svn: 193076	2013-10-21 11:47:56 +00:00
Chad Rosier	9a6d485c7f	[AArch64] Add support for NEON scalar three register different instruction class. The instruction class includes the signed saturating doubling multiply-add long, signed saturating doubling multiply-subtract long, and the signed saturating doubling multiply long instructions. llvm-svn: 192908	2013-10-17 18:12:29 +00:00
Daniel Sanders	f588d8546a	[mips][msa] Added lsa instruction llvm-svn: 192895	2013-10-17 13:38:20 +00:00
Daniel Sanders	46c979a117	[mips][msa] Removed ldx.[bhwd] and stx.[bhwd]. These were present in a previous version of the MSA spec but are not present in the published version. There is no hardware that uses these instructions. llvm-svn: 192888	2013-10-17 12:16:03 +00:00
Chad Rosier	3ed3565e0f	[AArch64] Add support for NEON scalar negate instruction. llvm-svn: 192843	2013-10-16 21:04:39 +00:00
Chad Rosier	aaa3bb367a	[AArch64] Add support for NEON scalar absolute value instruction. llvm-svn: 192842	2013-10-16 21:04:34 +00:00
Chad Rosier	929e07c8ea	Update comment. llvm-svn: 192806	2013-10-16 16:30:10 +00:00
Chad Rosier	a195d145b8	[AArch64] Add support for NEON scalar signed saturating accumulated of unsigned value and unsigned saturating accumulate of signed value instructions. llvm-svn: 192800	2013-10-16 16:09:02 +00:00
Will Dietz	53ec6883be	TypeFinder: prefer iterative algorithm to keep stack usage low. Introduce subtype_reverse_iterator to maintain the numbering assigned during the recursive type walk. llvm-svn: 192770	2013-10-16 04:10:06 +00:00
Craig Topper	037594e792	Remove x86_sse42_crc32_64_8 intrinsic. It has no functional difference from x86_sse42_crc32_32_8 and was not mapped to a clang builtin. I'm not even sure why this form of the instruction is even called out explicitly in the docs. Also add AutoUpgrade support to convert it into the other intrinsic with appropriate trunc and zext. llvm-svn: 192672	2013-10-15 05:20:47 +00:00
Chad Rosier	40761dc629	[AArch64] Add support for NEON scalar integer compare instructions. llvm-svn: 192596	2013-10-14 14:37:20 +00:00
Rafael Espindola	f0991565b5	Add a GlobalAlias::isValidLinkage to reduce code duplication. Thanks to Reid Kleckner for the suggestion. llvm-svn: 192298	2013-10-09 16:07:32 +00:00
Elena Demikhovsky	f24ecf7862	AVX-512: Added VRCP28 and VRSQRT28 instructions and intrinsics. llvm-svn: 192283	2013-10-09 08:16:14 +00:00
Chad Rosier	d30c4af71b	[AArch64] Add support for NEON scalar floating-point reciprocal estimate, reciprocal exponent, and reciprocal square root estimate instructions. llvm-svn: 192242	2013-10-08 22:09:04 +00:00
Matt Arsenault	f3a608f7ff	Fix duplicated assertions. Do what some other instructions do, and add an assert method. llvm-svn: 192236	2013-10-08 21:11:12 +00:00
Chad Rosier	e281a17b84	[AArch64] Add support for NEON scalar signed/unsigned integer to floating-point convert instructions. llvm-svn: 192231	2013-10-08 20:43:30 +00:00
Benjamin Kramer	f9db6465f7	IRBuilder: Downgrade InsertPointGuard's instruction pointer to a raw pointer. Sadly this loses the checking from AssertingVH, but apparently storing the end() of a BasicBlock into an AssertingVH has bad consequences as it's not really an instruction. llvm-svn: 192209	2013-10-08 17:44:56 +00:00
Matt Arsenault	9c8541d286	Change objectsize intrinsic to accept different address spaces. Bitcasting everything to i8* won't work. Autoupgrade the old intrinsic declarations to use the new mangling. llvm-svn: 192117	2013-10-07 18:06:48 +00:00
Elena Demikhovsky	cb8eaca2e4	AVX-512: added scalar convert instructions and intrinsics. Fixed load folding in VPERM2I instruction. llvm-svn: 192063	2013-10-06 13:11:09 +00:00
Craig Topper	0a8f3fc996	Remove unneeded TBM intrinsics. The arithmetic/logical operation patterns are sufficient. llvm-svn: 192039	2013-10-05 19:22:59 +00:00
Jiangning Liu	6d9b4a0e25	Implement aarch64 neon instruction set AdvSIMD (Across). llvm-svn: 192028	2013-10-05 08:22:10 +00:00
Craig Topper	5ac188d0f2	Add patterns for selecting TBM instructions from logical operations. Patch from Yunzhong Gao. llvm-svn: 191871	2013-10-03 04:16:45 +00:00
Pete Cooper	9b04f39eb6	Add v4f16 to supported value types. This is useful for some ARM intrinsics such as VCVTN which does a <4 x float> <-> <4 x half> conversion. llvm-svn: 191870	2013-10-03 03:29:21 +00:00
Joey Gouly	12afb60cf2	[ARM] Introduce the 'sevl' instruction in ARMv8. This also removes the restriction on the immediate field of the 'hint' instruction. llvm-svn: 191744	2013-10-01 12:39:11 +00:00
Benjamin Kramer	33abdcddb3	IRBuilder: Add RAII objects to reset insertion points or fast math flags. Inspired by the object from the SLPVectorizer. This found a minor bug in the debug loc restoration in the vectorizer where the location of a following instruction was attached instead of the location from the original instruction. llvm-svn: 191673	2013-09-30 15:39:48 +00:00
Benjamin Kramer	bd7dc07ee2	IRBuilder: Move fast math flags to IRBuilderBase. They don't depend on the templated stuff. llvm-svn: 191672	2013-09-30 15:39:27 +00:00
Yunzhong Gao	e51da27a74	Adding intrinsics to the llvm backend for TBM instruction set. Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1750 llvm-svn: 191539	2013-09-27 18:38:42 +00:00
Daniel Sanders	0987676281	[mips][msa] Implemented insert.d intrinsic. This intrinsic is lowered into an equivalent INSERT_VECTOR_ELT which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191521	2013-09-27 13:36:54 +00:00
Daniel Sanders	3c43957555	[mips][msa] Implemented fill.d intrinsic. This intrinsic is lowered into an equivalent BUILD_VECTOR which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191519	2013-09-27 13:20:41 +00:00
Daniel Sanders	935673af60	[mips][msa] Implemented copy_[us].d intrinsic. This intrinsic is lowered into equivalent copy_s.w instructions during legalization. llvm-svn: 191518	2013-09-27 13:04:21 +00:00
Daniel Sanders	7c64721346	[mips][msa] Added support for matching vshf from normal IR (i.e. not intrinsics) llvm-svn: 191301	2013-09-24 14:02:15 +00:00
Daniel Sanders	0167ec55f4	[mips][msa] Added support for matching bsel and bseli from normal IR (i.e. not intrinsics) This required correcting the definition of the bsel and bseli intrinsics. llvm-svn: 191290	2013-09-24 12:04:44 +00:00
Jiangning Liu	5867567c41	Initial support for Neon scalar instructions. Patch by Ana Pazos. 1.Added support for v1ix and v1fx types. 2.Added Scalar Pairwise Reduce instructions. 3.Added initial implementation of Scalar Arithmetic instructions. llvm-svn: 191263	2013-09-24 02:47:27 +00:00
Reid Kleckner	333fd129ac	Explicitly request unsigned enum types when desired The underlying type of all plain enums in MSVC is 'int', even if the enumerator contains large 32-bit unsigned values or values greater than UINT_MAX. The only way to get a large or unsigned enum type is to request it explicitly with the C++11 strong enum types feature. However, since LLVM isn't C++11 yet, I had to add a conditional LLVM_ENUM_INT_TYPE to Compiler.h to control its usage. The motivating true positive for this change is compiling PointerIntPair with MSVC for win64. The PointerIntMask value is supposed to be pointer sized value of all ones with some low zeros. Instead, it's truncated to 32-bits! We are only saved later because it is sign extended back in the AND with int64_t, and we happen to want all ones. This silences lots of -Wmicrosoft warnings during a clang self-host targeting Windows. llvm-svn: 191241	2013-09-23 23:26:57 +00:00
Daniel Sanders	34cb8f3e4d	[mips][msa] Added support for matching insert and copy from normal IR (i.e. not intrinsics) Changes to MIPS SelectionDAG: * Added nodes VEXTRACT_[SZ]EXT_ELT to represent extract and extend in a single operation and implemented the DAG combines necessary to fold sign/zero extends into the extract. llvm-svn: 191199	2013-09-23 14:03:12 +00:00
Amara Emerson	7ad0409c56	[ARMv8] Add support for the v8 cryptography extensions. llvm-svn: 190996	2013-09-19 11:59:01 +00:00
Joey Gouly	e14ac63b96	[ARMv8] Add CRC instructions. Patch by Bradley Smith! llvm-svn: 190928	2013-09-18 09:45:55 +00:00
Ben Langmuir	c0ab36fe2e	Add llvm.x86.* intrinsics for Intel SHA Extensions Add llvm.x86.* intrinsics for all of the Intel SHA Extensions instructions, as well as tests. Also remove mayLoad and hasSideEffects, which can be inferred from the instruction patterns. llvm-svn: 190864	2013-09-17 13:44:39 +00:00
Craig Topper	3a6a903891	Make a more clear AVX-512 section header that matches similar in the file. llvm-svn: 190843	2013-09-17 03:34:09 +00:00
Matt Arsenault	513e7539be	MemCpyOptimizer: Use max legal int size instead of pointer size If there are no legal integers, assume 1 byte. This makes more sense than using the pointer size as a guess for the maximum GPR width. It is conceivable to want to use some 64-bit pointers on a target where 64-bit integers aren't legal. llvm-svn: 190817	2013-09-16 22:43:16 +00:00
Peter Collingbourne	cf3b1a2910	Implement function prefix data as an IR feature. Previous discussion: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/063909.html Differential Revision: http://llvm-reviews.chandlerc.com/D1191 llvm-svn: 190773	2013-09-16 01:08:15 +00:00
Matt Arsenault	279eb73c79	Fix comment to match what the assert actually enforces llvm-svn: 190566	2013-09-12 01:07:54 +00:00
Daniel Sanders	534d28aa11	[mips][msa] Corrected the definition of the dotp_[su].[hwd] intrinsics The elements of the operands should be half the width of the elements of the result. llvm-svn: 190505	2013-09-11 09:59:17 +00:00
Daniel Sanders	32227b7995	[mips][msa] Removed unsupported dot product instructions (dotp_[su].b) The dotp_[su].b instructions never existed in any revision of the MSA spec. llvm-svn: 190398	2013-09-10 09:51:43 +00:00
Bob Wilson	6c7d3717b3	Revert patches to add case-range support for PR1255. The work on this project was left in an unfinished and inconsistent state. Hopefully someone will eventually get a chance to implement this feature, but in the meantime, it is better to put things back the way the were. I have left support in the bitcode reader to handle the case-range bitcode format, so that we do not lose bitcode compatibility with the llvm 3.3 release. This reverts the following commits: 155464, 156374, 156377, 156613, 156704, 156757, 156804 156808, 156985, 157046, 157112, 157183, 157315, 157384, 157575, 157576, 157586, 157612, 157810, 157814, 157815, 157880, 157881, 157882, 157884, 157887, 157901, 158979, 157987, 157989, 158986, 158997, 159076, 159101, 159100, 159200, 159201, 159207, 159527, 159532, 159540, 159583, 159618, 159658, 159659, 159660, 159661, 159703, 159704, 160076, 167356, 172025, 186736 llvm-svn: 190328	2013-09-09 19:14:35 +00:00
Manman Ren	de03bcdbec	TBAA: add isTBAAVtableAccess to MDNode so clients can call the function instead of having its own implementation. The implementation of isTBAAVtableAccess is in TypeBasedAliasAnalysis.cpp since it is related to the format of TBAA metadata. The path for struct-path tbaa will be exercised by test/Instrumentation/ThreadSanitizer/read_from_global.ll, vptr_read.ll, and vptr_update.ll when struct-path tbaa is on by default. llvm-svn: 190216	2013-09-06 22:47:05 +00:00
Matt Arsenault	4c2083b14a	Use type helper functions. llvm-svn: 190113	2013-09-06 00:37:24 +00:00
Joey Gouly	071ca2ff6d	[ARMv8] Implement the new DMB/DSB operands. This removes the custom ISD Node: MEMBARRIER and replaces it with an intrinsic. llvm-svn: 190055	2013-09-05 15:35:24 +00:00
Rafael Espindola	357980289a	Revert "Add r159136 back now that pr13124 has been fixed." This reverts commit r189886. I found a corner case where this optimization is not valid: Say we have a "linkonce_odr unnamed_addr" in two translation units: * In TU 1 this optimization kicks in and makes it hidden. * In TU 2 it gets const merged with a constant that is not unnamed_addr, resulting in a non unnamed_addr constant with default visibility. * The static linker rules for combining visibility them produce a hidden symbol, which is incorrect from the point of view of the non unnamed_addr constant. The one place we can do this is when we know that the symbol is not used from another TU in the same shared object, i.e., during LTO. I will move it there. llvm-svn: 189954	2013-09-04 16:09:01 +00:00
Hao Liu	b344ca7aa3	Inplement aarch64 neon instructions in AdvSIMD(shift). About 24 shift instructions: sshr,ushr,ssra,usra,srshr,urshr,srsra,ursra,sri,shl,sli,sqshlu,sqshl,uqshl,shrn,sqrshrun,sqshrn,uqshr,sqrshrn,uqrshrn,sshll,ushll and 4 convert instructions: scvtf,ucvtf,fcvtzs,fcvtzu llvm-svn: 189925	2013-09-04 09:28:24 +00:00
Rafael Espindola	8b9c0a576e	Add r159136 back now that pr13124 has been fixed. Original message: If a constant or a function has linkonce_odr linkage and unnamed_addr, mark hidden. Being linkonce_odr guarantees that it is available in every dso that needs it. Being a constant/function with unnamed_addr guarantees that the copies don't have to be merged. llvm-svn: 189886	2013-09-03 23:34:36 +00:00
Daniel Sanders	3fe5e48ec9	[mips][msa] Added IntrNoMem and removed Commutative from sub intrinsics. This changes the SelectionDAG nodes from ISD::INTRINSIC_W_CHAIN to ISD::INTRINSIC_WO_CHAIN which enables easy lowering to equivalent SelectionDAG nodes (e.g. __builtin_msa_sub_w -> ISD::SUB) in future patches since nodes such as ISD::SUB do not have a chain. It also corrects an obvious mistake, namely that the subtract intrinsics were marked as being commutative. As per a similar change in r189106 (http://llvm.org/viewvc/llvm-project?rev=189106&view=rev) there isn’t a new testcase in this patch since the existing tests should test the intrinsics to the same standard and the best I can do for a testcase would be a fragile pass/maybe test of whether memory operations can (and do) cross the intrinsic. llvm-svn: 189784	2013-09-03 09:45:20 +00:00
Daniel Sanders	96e29ee174	[mips][msa] Added IntrNoMem to the floating-point intrinsics. This changes the SelectionDAG nodes from ISD::INTRINSIC_W_CHAIN to ISD::INTRINSIC_WO_CHAIN which enables easy lowering to equivalent SelectionDAG nodes (e.g. __builtin_msa_fadd_w -> ISD::FADD) in future patches since nodes such as ISD::FADD do not have a chain. As per a similar change in r189106 (http://llvm.org/viewvc/llvm-project?rev=189106&view=rev) there isn’t a new testcase in this patch since the existing tests should test the intrinsics to the same standard and the best I can do for a testcase would be a fragile pass/maybe test of whether memory operations can (and do) cross the intrinsic. llvm-svn: 189782	2013-09-03 09:35:20 +00:00
Tim Northover	490c4c1bda	ARM: remove unused v(add\|sub)hn and vqdml[as]l intrinsics. Clang is now generating cleaner IR, so this removes the old variants which should be completely unused. llvm-svn: 189481	2013-08-28 14:33:33 +00:00
Daniel Sanders	7d6b0c31fc	[mips][msa] Added bnz.df, bnz.v, bz.df, and bz.v These intrinsics are legalized to V(ALL\|ANY)_(NON)?ZERO nodes, are matched as SN?Z_[BHWDV]_PSEUDO pseudo's, and emitted as a branch/mov sequence to evaluate to 0 or 1. Note: The resulting code is sub-optimal since it doesnt seem to be possible to feed the result of an intrinsic directly into a brcond. At the moment it uses (SETCC (VALL_ZERO $ws), 0, SETEQ) and similar which unnecessarily evaluates the boolean twice. llvm-svn: 189478	2013-08-28 12:14:50 +00:00
Daniel Sanders	86a3b104b1	[mips][msa] Added load/store intrinsics. llvm-svn: 189476	2013-08-28 12:04:29 +00:00
Daniel Sanders	6583601738	[mips][msa] Added move.v llvm-svn: 189471	2013-08-28 10:44:47 +00:00
Daniel Sanders	21800e80c1	[mips][msa] Added cfcmsa, and ctcmsa The MSA control registers have been added as reserved registers, and are only used via ISD::Copy(To\|From)Reg. The intrinsics are lowered into these nodes. llvm-svn: 189468	2013-08-28 10:26:24 +00:00
Daniel Sanders	3740f20366	[mips][msa] Added f[cs]af, f[cs]or, f[cs]ueq, f[cs]ul[et], f[cs]une, fsun, ftrunc_[su], hadd_[su], hsub_[su], sr[al]r, sr[al]ri llvm-svn: 189467	2013-08-28 10:12:09 +00:00
Daniel Sanders	eb5b945b08	[mips][msa] Few MSA Builtins have side-effects. Added IntrNoMem to those that don't. llvm-svn: 189106	2013-08-23 12:21:25 +00:00
Andrea Di Biagio	b486212f5a	Add function attribute 'optnone'. This function attribute indicates that the function is not optimized by any optimization or code generator passes with the exception of interprocedural optimization passes. llvm-svn: 189101	2013-08-23 11:53:55 +00:00
Chandler Carruth	ab55d8d98c	Add a new helper method to Value to strip in-bounds constant offsets of pointers, but accumulate the offset into an APInt in the process of stripping it. This is a pretty handy thing to have, such as when trying to determine if two pointers are at some constant relative offset. I'll be committing a patch shortly to use it for exactly that purpose. llvm-svn: 189000	2013-08-22 11:25:11 +00:00
Chandler Carruth	7ac6f80730	Clean up the doxygen formatting of the comments on the strip* methods on Value. These methods probably don't belong here, and I'm discussing moving the lot of them to a better home, but for now I'm about to extend their functionality and wanted to tidy them up first. llvm-svn: 188997	2013-08-22 10:12:18 +00:00
Daniel Sanders	30561c36b8	[mips][msa] Removed fcge, fcgt, fsge, fsgt These instructions were present in a draft spec but were removed before publication. llvm-svn: 188782	2013-08-20 09:41:47 +00:00
Daniel Sanders	91c40d80de	[mips][msa] Added insve llvm-svn: 188777	2013-08-20 09:22:54 +00:00
Daniel Sanders	15341e9a12	[mips][msa] Added and.v, bmnz.v, bmz.v, bsel.v, nor.v, or.v, xor.v llvm-svn: 188767	2013-08-20 08:38:21 +00:00
Hal Finkel	8f395a803a	Add a llvm.copysign intrinsic This adds a llvm.copysign intrinsic; We already have Libfunc recognition for copysign (which is turned into the FCOPYSIGN SDAG node). In order to autovectorize calls to copysign in the loop vectorizer, we need a corresponding intrinsic as well. In addition to the expected changes to the language reference, the loop vectorizer, BasicTTI, and the SDAG builder (the intrinsic is transformed into an FCOPYSIGN node, just like the function call), this also adds FCOPYSIGN to a few lists in LegalizeVector{Ops,Types} so that vector copysigns can be expanded. In TargetLoweringBase::initActions, I've made the default action for FCOPYSIGN be Expand for vector types. This seems correct for all in-tree targets, and I think is the right thing to do because, previously, there was no way to generate vector-values FCOPYSIGN nodes (and most targets don't specify an action for vector-typed FCOPYSIGN). llvm-svn: 188728	2013-08-19 23:35:46 +00:00
Peter Collingbourne	5c5e108012	Introduce non-const overloads for GlobalAlias::{get,resolve}AliasedGlobal. llvm-svn: 188725	2013-08-19 23:13:33 +00:00
Elena Demikhovsky	af085f619a	AVX-512: compiler intrinsics llvm-svn: 188654	2013-08-19 06:55:01 +00:00
Juergen Ributzka	dedbd665dd	The vbroadcastsi256 intrinsic does not exactly resemble the GCC builtin. The GCC builtin expects the arguments to be passed by val, whereas the LLVM intrinsic expects a pointer instead. This is related to PR 16581 and rdar:14747994. llvm-svn: 188608	2013-08-17 16:38:37 +00:00
Jack Carter	2c2f78cead	[Mips][msa] Added the simple builtins (madd_q to xori) Includes: madd_q, maddr_q, maddv, max_[asu], maxi_[su], min_[asu], mini_[su], mod_[su], msub_q, msubr_q, msubv, mul_q, mulr_q, mulv, nloc, nlzc, nori, ori, pckev, pckod, pcnt, sat_[su], shf, sld, sldi, sll, slli, splat, splati, sr[al], sr[al]i, subs_[su], subss_u, subus_s, subv, subvi, vshf, xori Patch by Daniel Sanders llvm-svn: 188460	2013-08-15 14:22:07 +00:00
Jack Carter	8798c3bae2	[Mips][msa] Added the simple builtins (fadd to ftq) Includes: fadd, fceq, fcg[et], fclass, fcl[et], fcne, fcun, fdiv, fexdo, fexp2, fexup[lr], ffint_[su], ffql, ffqr, fill, flog2, fmadd, fmax, fmax_a, fmin, fmin_a, fmsub, fmul, frint, frcp, frsqrt, fseq, fsge, fsgt, fsle, fslt, fsne, fsqr, fsub, ftint_s, ftq Patch by Daniel Sanders llvm-svn: 188458	2013-08-15 13:45:36 +00:00
Jack Carter	80890657b3	[Mips][msa] Added the simple builtins (add_a to dpsub[su], ilvev to ldi) Includes: add_a, adds_[asu], addv, addvi, andi.b, asub_[su].[bhwd], aver?_[su]_[bhwd], bclr, bclri, bins[lr], bins[lr]i, bmnzi, bmzi, bneg, bnegi, bseli, bset, bseti, c(eq\|ne), c(eq\|ne)i, cl[et]_[su], cl[et]i_[su], copy_[su].[bhw], div_[su], dotp_[su], dpadd_[su], dpsub_[su], ilvev, ilvl, ilvod, ilvr, insv, insve, ldi Patch by Daniel Sanders llvm-svn: 188457	2013-08-15 12:24:57 +00:00
Michael Gottesman	e32ebb94bd	[stackprotector] Added intrinsic llvm.stackprotectorcheck. llvm-svn: 188191	2013-08-12 18:35:32 +00:00
Hal Finkel	bdc7aa32c1	Add ISD::FROUND for libm round() All libm floating-point rounding functions, except for round(), had their own ISD nodes. Recent PowerPC cores have an instruction for round(), and so here I'm adding ISD::FROUND so that round() can be custom lowered as well. For the most part, this is straightforward. I've added an intrinsic and a matching ISD node just like those for nearbyint() and friends. The SelectionDAG pattern I've named frnd (because ISD::FP_ROUND has already claimed fround). This will be used by the PowerPC backend in a follow-up commit. llvm-svn: 187926	2013-08-07 22:49:12 +00:00
Elena Demikhovsky	cb3f9da2e3	AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. Added intrinsics and tests. llvm-svn: 187717	2013-08-05 08:52:21 +00:00
Robert Lytton	8fc4bdfaae	remove executable permission from IntrinsicsXCore.td llvm-svn: 187584	2013-08-01 17:17:59 +00:00
Tim Northover	dbac87d1fc	AArch64: add initial NEON support Patch by Ana Pazos. - Completed implementation of instruction formats: AdvSIMD three same AdvSIMD modified immediate AdvSIMD scalar pairwise - Completed implementation of instruction classes (some of the instructions in these classes belong to yet unfinished instruction formats): Vector Arithmetic Vector Immediate Vector Pairwise Arithmetic - Initial implementation of instruction formats: AdvSIMD scalar two-reg misc AdvSIMD scalar three same - Intial implementation of instruction class: Scalar Arithmetic - Initial clang changes to support arm v8 intrinsics. Note: no clang changes for scalar intrinsics function name mangling yet. - Comprehensive test cases for added instructions To verify auto codegen, encoding, decoding, diagnosis, intrinsics. llvm-svn: 187567	2013-08-01 09:20:35 +00:00
Robert Lytton	c10bbf30c8	XCore target: add GCCBuiltin to four intrinsics The following are made available by clang in the XCore ABI __builtin_bitrev __builtin_getid __builtin_getps __builtin_setps llvm-svn: 187566	2013-08-01 08:41:32 +00:00
Matt Arsenault	f58e03599f	Revert "Remove isCastable since nothing uses it now" Apparently dragonegg uses it. llvm-svn: 187454	2013-07-30 22:02:14 +00:00
Matt Arsenault	2121a49954	Remove isCastable since nothing uses it now llvm-svn: 187448	2013-07-30 21:11:17 +00:00
Matt Arsenault	c825ddd4ca	Change behavior of calling bitcasted alias functions. It will now only convert the arguments / return value and call the underlying function if the types are able to be bitcasted. This avoids using fp<->int conversions that would occur before. llvm-svn: 187444	2013-07-30 20:45:05 +00:00
Matt Arsenault	823928b4b1	Re-add DataLayout pointer size convenience functions. These were reverted in r167222 along with the rest of the last different address space pointer size attempt. These will be used in later commits. llvm-svn: 187223	2013-07-26 17:37:20 +00:00
Rafael Espindola	32f9d6abe2	Remove the mblaze backend from llvm. Approval in here http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/064169.html llvm-svn: 187145	2013-07-25 18:55:05 +00:00
Bill Wendling	6420062d1b	Add a way to add a kind-value string pair to an attribute. llvm-svn: 187138	2013-07-25 18:34:24 +00:00
Rafael Espindola	d4a363d0de	Make these methods const correct. Thanks to Nick Lewycky for noticing it. llvm-svn: 187098	2013-07-25 02:50:08 +00:00
Bill Wendling	2bf753029b	Add helpful accessor methods to get the specified function attribute. llvm-svn: 187088	2013-07-24 23:45:00 +00:00
Matt Arsenault	355591860e	Fix missing const llvm-svn: 186857	2013-07-22 18:58:53 +00:00
Joey Gouly	cfa16b3bc1	[ARMv8] Implement the NEON instructions VRINT{N, X, A, Z, M, P}. llvm-svn: 186688	2013-07-19 16:34:16 +00:00
Joey Gouly	933fb028d7	[ARMv8] Add NEON instructions VCVT{A, N, P, M}. llvm-svn: 186574	2013-07-18 11:53:22 +00:00
Adrian Prantl	5c3c7fab07	Get rid of the Dis/EnableDebugLocations() API. I'm moving this functionality into clang instead. llvm-svn: 186549	2013-07-18 00:27:46 +00:00
Joey Gouly	bc02a480d0	[ARMv8] Add support for the NEON instructions vmaxnm/vminnm. This adds a new class for non-predicable NEON instructions and a new DecoderNamespace for v8 NEON instructions. llvm-svn: 186504	2013-07-17 13:59:38 +00:00
Manman Ren	929ebf85f2	Add getModuleFlag(StringRef Key) to query a module flag given Key. No functionality change. llvm-svn: 186470	2013-07-16 23:21:16 +00:00
Tim Northover	69d676cd12	ARM: implement ldrex, strex and clrex intrinsics Intrinsics already existed for the 64-bit variants, so these support operations of size at most 32-bits. llvm-svn: 186392	2013-07-16 09:46:55 +00:00
Craig Topper	d5e9e015c6	Remove unneeded forward declarations. llvm-svn: 186244	2013-07-13 08:28:45 +00:00
Benjamin Kramer	ebffe260a7	Mark MDNode::getOperand as readonly. We can't inline it but we can still CSE calls to it. llvm-svn: 186156	2013-07-12 12:05:13 +00:00
Charles Davis	2b2075f834	Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. Summary: This patch adds explicit calling convention types for the Win64 and System V/x86-64 ABIs. This allows code to override the default, and use the Win64 convention on a target that wants to use SysV (and vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU attributes. Reviewers: CC: llvm-svn: 186144	2013-07-12 06:02:35 +00:00
Nadav Rotem	3276e11bb5	IRBuilder: add an assertion that checks if we try to get a debug loc from ->end(); llvm-svn: 185952	2013-07-09 17:54:22 +00:00
Nadav Rotem	edc4aa7d9e	Fix a bug in IRBuilder::ClearInsertionPoint. The IR Builder needs to reset both the BB and the insert point inside the BB. llvm-svn: 185883	2013-07-08 23:27:43 +00:00
Nick Lewycky	181e3475a3	Add missing per-argument doesNotAccessMemory accessors. No functionality change since it has no callers today. llvm-svn: 185775	2013-07-07 08:29:51 +00:00
Nick Lewycky	7b093a1c2f	Extend 'readonly' and 'readnone' to work on function arguments as well as functions. Make the function attributes pass add it to known library functions and when it can deduce it. llvm-svn: 185735	2013-07-06 00:29:58 +00:00
Matt Arsenault	bccd895589	Fix extra whitespace / formatting llvm-svn: 185238	2013-06-28 23:24:05 +00:00
Justin Holewinski	0f70140107	[NVPTX] Remove i8 register class. PTX support for i8 (.b8, .u8, .s8) is rather poor and we're better off just ignoring it and letting LLVM expand all i8 ops out to i16. llvm-svn: 185174	2013-06-28 17:57:59 +00:00
Michael Gottesman	fe055b3806	Added support for the Builtin attribute. The Builtin attribute is an attribute that can be placed on function call site that signal that even though a function is declared as being a builtin, rdar://problem/13727199 llvm-svn: 185049	2013-06-27 00:25:01 +00:00
Kostya Serebryany	874b298dbc	add Function::removeFnAttr() llvm-svn: 184536	2013-06-21 07:38:09 +00:00
Chris Lattner	e529c33f15	remove some @deprecated markers: LLVM APIs aren't deprecated, they are removed when obsolete. These APIs are still used, and the constant APIs are actually really important. Removing these makes -Wdocumentation more useful. llvm-svn: 184170	2013-06-18 04:57:25 +00:00
Derek Schuff	4c429cad03	Make PrologEpilogInserter save/restore all callee saved registers in functions which call __builtin_unwind_init() __builtin_unwind_init() is an undocumented gcc intrinsic which has this effect, and is used in libgcc_eh. Goes part of the way toward fixing PR8541. llvm-svn: 183984	2013-06-14 16:15:29 +00:00
Jakub Staszak	bd36771f2d	#include <climits> instead of <limits.h> in C++ header file. llvm-svn: 183957	2013-06-13 23:49:09 +00:00
Benjamin Kramer	13dba6bf7c	Move getRealLinkageName to a common place and remove all the duplicates of it. Also simplify code a bit while there. No functionality change. llvm-svn: 183076	2013-06-01 17:51:14 +00:00
Matt Arsenault	67844568c6	Fix wrong comment. Null is not acceptable. llvm-svn: 182979	2013-05-31 01:40:30 +00:00
Jim Grosbach	b2f4f2ae6d	Tidy up. Whitespace. llvm-svn: 182689	2013-05-24 22:53:06 +00:00
Diego Novillo	d1f091f169	Add a new function attribute 'cold' to functions. Other than recognizing the attribute, the patch does little else. It changes the branch probability analyzer so that edges into blocks postdominated by a cold function are given low weight. Added analysis and code generation tests. Added documentation for the new attribute. llvm-svn: 182638	2013-05-24 12:26:52 +00:00
Daniel Malea	22adabba07	Re-implement DebugIR in a way that does not subclass AssemblyWriter: - move AsmWriter.h from public headers into lib - marked all AssemblyWriter functions as non-virtual; no need to override them - DebugIR now "plugs into" AssemblyWriter with an AssemblyAnnotationWriter helper - exposed flags to control hiding of a) debug metadata b) debug intrinsic calls C/R: Paul Redmond llvm-svn: 182617	2013-05-23 22:34:33 +00:00
Justin Holewinski	2a53cbfbe1	[NVPTX] Add @llvm.nvvm.sqrt.f() intrinsic llvm-svn: 182394	2013-05-21 16:51:30 +00:00
Benjamin Kramer	216200bec5	Enable pod-like optimizations for pred and succ iterators. llvm-svn: 182257	2013-05-20 13:12:58 +00:00
Eli Bendersky	c0e010b564	Remove dead code. This method is not being used/tested anywhere. llvm-svn: 181943	2013-05-15 22:41:28 +00:00
Hal Finkel	91bd48d046	Implement PPC counter loops as a late IR-level pass The old PPCCTRLoops pass, like the Hexagon pass version from which it was derived, could only handle some simple loops in canonical form. We cannot directly adapt the new Hexagon hardware loops pass, however, because the Hexagon pass contains a fundamental assumption that non-constant-trip-count loops will contain a guard, and this is not always true (the result being that incorrect negative counts can be generated). With this commit, we replace the pass with a late IR-level pass which makes use of SE to calculate the backedge-taken counts and safely generate the loop-count expressions (including any necessary max() parts). This IR level pass inserts custom intrinsics that are lowered into the desired decrement-and-branch instructions. The most fragile part of this new implementation is that interfering uses of the counter register must be detected on the IR level (and, on PPC, this also includes any indirect branches in addition to function calls). Also, to make all of this work, we need a variant of the mtctr instruction that is marked as having side effects. Without this, machine-code level CSE, DCE, etc. illegally transform the resulting code. Hopefully, this can be improved in the future. This new pass is smaller than the original (and much smaller than the new Hexagon hardware loops pass), and can handle many additional cases correctly. In addition, the preheader-creation code has been copied from LoopSimplify, and after we decide on where it belongs, this code will be refactored so that it can be explicitly shared (making this implementation even smaller). The new test-case files ctrloop-{le,lt,ne}.ll have been adapted from tests for the new Hexagon pass. There are a few classes of loops that this pass does not transform (noted by FIXMEs in the files), but these deficiencies can be addressed within the SE infrastructure (thus helping many other passes as well). llvm-svn: 181927	2013-05-15 21:37:41 +00:00
Daniel Malea	864bcbd2ef	Pull up AssemblyWriter interface into header to allow subclassing - made all functions virtual so that subclasses can specialize them - add printInstructionLine so that subclasses can choose whether or not to print the newline character (without having to implement printBasicBlock() - added a second constructor to AssemblyWriter that does not require a SlotTracker, as required in order to keep the SlotTracker helper class outside AsmWriter.h and buried in the implementation. llvm-svn: 181466	2013-05-08 20:38:31 +00:00
Rafael Espindola	15a39ed0e8	Fix const merging when an alias of a const is llvm.used. We used to disable constant merging not only if a constant is llvm.used, but also if an alias of a constant is llvm.used. This change fixes that. llvm-svn: 181175	2013-05-06 01:48:55 +00:00
Dmitri Gribenko	82c92dc3dd	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Akira Hatanaka	e615afe086	[mips] Remove "Commutative" from property list of non-commutative intrinsics. llvm-svn: 180988	2013-05-03 01:29:31 +00:00
Adrian Prantl	9d3bc41173	Provide an API to temporarily suppress DebugLocations from being attached to emitted instructions. Use this if you want an instruction to be counted towards the prologue or if there is no useful source location. rdar://problem/13442648 llvm-svn: 180929	2013-05-02 17:27:49 +00:00
Filip Pizlo	dd62846c56	This patch breaks up Wrap.h so that it does not have to include all of the things, and renames it to CBindingWrapping.h. I also moved CBindingWrapping.h into Support/. This new file just contains the macros for defining different wrap/unwrap methods. The calls to those macros, as well as any custom wrap/unwrap definitions (like for array of Values for example), are put into corresponding C++ headers. Doing this required some #include surgery, since some .cpp files relied on the fact that including Wrap.h implicitly caused the inclusion of a bunch of other things. This also now means that the C++ headers will include their corresponding C API headers; for example Value.h must include llvm-c/Core.h. I think this is harmless, since the C API headers contain just external function declarations and some C types, so I don't believe there should be any nasty dependency issues here. llvm-svn: 180881	2013-05-01 20:59:00 +00:00
Peng Cheng	376ff965a1	get rid of windows warning: warning C4946: reinterpret_cast used between related classes llvm-svn: 180852	2013-05-01 15:04:18 +00:00
Peng Cheng	4111249364	get rid of windows warning: warning C4800: forcing value to bool 'true' or 'false' (performance warning) llvm-svn: 180851	2013-05-01 15:00:07 +00:00
Peng Cheng	341bcbef57	replace reinterpret_cast by cast or remove reinterpret_cast to get rid of windows warning: warning C4946: reinterpret_cast used between related classes. llvm-svn: 180850	2013-05-01 14:54:01 +00:00
Rafael Espindola	0a4792bcbe	Now that the underlying issue is fixed, revert r180750 and r180722. The cause of the windows failures was fixed by r180791. Revert to the state after Sabre's original revert. Original message: revert r179735, it has no testcases, and doesn't really make sense. llvm-svn: 180844	2013-05-01 13:07:03 +00:00
Duncan Sands	bc453888ba	Correct comment: there is no numTys parameter any more now that this is using ArrayRef. llvm-svn: 180840	2013-05-01 07:54:55 +00:00
Rafael Espindola	b2fd483e24	Change getSlotIndex to return unsigned. The actual storage was already using unsigned, but the interface was using uint64_t. This is wasteful on 32 bits and looks to be the root causes of a miscompilation on Windows where a value was being sign extended to 64bits to compare with the result of getSlotIndex. Patch by Pasi Parviainen! llvm-svn: 180791	2013-04-30 16:53:38 +00:00
Reid Kleckner	3bf655a0d0	Revert "revert r179735, it has no testcases, and doesn't really make sense." This un-reverts r179735 and reverts commit r180574. This fixes assertion failures for me locally and should fix the failures on Windows reported widely on llvm-dev. We should check if the bots caught this and if so why not. llvm-svn: 180722	2013-04-29 18:23:53 +00:00
Manman Ren	c576d690b0	Struct-path aware TBAA: change the format of TBAAStructType node. We switch the order of offset and field type to make TBAAStructType node (name, parent node, offset) similar to scalar TBAA node (name, parent node). TypeIsImmutable is added to TBAAStructTag node. llvm-svn: 180654	2013-04-27 00:26:11 +00:00
Chris Lattner	49248bb367	revert r179735, it has no testcases, and doesn't really make sense. llvm-svn: 180574	2013-04-25 20:34:16 +00:00
Stephen Lin	9d99ba2071	Add CodeGen support for functions that always return arguments via a new parameter attribute 'returned', which is taken advantage of in target-independent tail call opportunity detection and in ARM call lowering (when placed on an integral first parameter). llvm-svn: 179925	2013-04-20 05:14:40 +00:00
Bill Wendling	4f17a0e079	Make the TargetIndependent flag have the right boolean value. llvm-svn: 179798	2013-04-18 21:45:04 +00:00
Bill Wendling	2bc73801cb	Cleanup patch: Semantics of parameters named Index and Idx were inconsistent between "include/llvm/IR/Attributes.h", "lib/IR/AttributeImpl.h" and "lib/IR/Attributes.cpp": sometimes these were fixed 1-based indexes of IR parameters (or AttributeSet::ReturnIndex for IR return values or AttributeSet::FunctionIndex for IR functions), other times they were the internal slot for storage in the underlying AttributeSetImpl. I renamed usage of the former to "Index" and usage of the latter to "Slot" ("Slot" was already being used consistently for the latter in a subset of cases) Patch by Stephen Lin! llvm-svn: 179791	2013-04-18 20:17:28 +00:00
Bill Wendling	480edeaf79	This patch addresses two cleanup issues: 1. Verify::VerifyParameterAttrs in "lib/IR/Verifier.cpp" and AttrBuilder::removeFunctionOnlyAttrs in "lib/IR/Attributes.cpp" (only called by Verify::VerifyFunctionAttrs) separately maintained a list of function-only attribute types. I've consolidated the logic into a new function used for both cases in "lib/IR/Verifier.cpp", so this logic is in one place (other than the AsmParser front-end) 2. Various functions in "lib/IR/Verifier.cpp" passed AttributeSet around by reference needlessly, as it's just a handle to an immutable pimpl body. Patch by Stephen Lin! llvm-svn: 179790	2013-04-18 20:15:25 +00:00
Bill Wendling	503365830b	Add an option `-enable-old-style-attr-syntax' to print out function attributes in the "old" style. It's sometimes beneficial to emit a testcase with the old style attribute syntax. Allow someone to do this. <rdar://problem/13563209> llvm-svn: 179735	2013-04-17 23:35:59 +00:00
Eli Bendersky	6bcfbc1d6e	Cleanup naming: DataLayout s/TD/DL/ llvm-svn: 179601	2013-04-16 15:41:18 +00:00
Manman Ren	744adaa6e5	TBAA: add utility to create a TBAA scalar type node llvm-svn: 179331	2013-04-11 22:51:30 +00:00
Hal Finkel	df94a6f10e	PPC Altivec load/store intrinsics can be marked IntrRead[Write]ArgMem llvm-svn: 178983	2013-04-07 15:32:40 +00:00
Manman Ren	346f0fb858	Add MDBuilder utilities for path-aware TBAA. Add utilities to create struct nodes in TBAA type DAG and to create path-aware tags. The format of struct nodes in TBAA type DAG: a unique name, a list of fields with field offsets and field types. The format of path-aware tags: a base type in TBAA type DAG, an access type and an offset relative to the base type. llvm-svn: 178564	2013-04-02 19:50:49 +00:00
Michael Liao	427149cbcf	Add support of RDSEED defined in AVX2 extension llvm-svn: 178314	2013-03-28 23:41:26 +00:00
Rafael Espindola	8a0ed6dcd6	Cleanup the simplify_type implementation. As far as simplify_type is concerned, there are 3 kinds of smart pointers: * const correct: A 'const MyPtr<int> &' produces a 'const int'. A 'MyPtr<int> &' produces a 'int '. * always const: Even a 'MyPtr<int> &' produces a 'const int'. no const: Even a 'const MyPtr<int> &' produces a 'int'. This patch then does the following: Removes the unused specializations. Since they are unused, it is hard to know which kind should be implemented. * Make sure we don't drop const. * Fix the default forwarding so that const correct pointer only need one specialization. * Simplifies the existing specializations. llvm-svn: 178147	2013-03-27 16:43:11 +00:00
Michael Liao	bd3f6b0eea	Add XTEST codegen support llvm-svn: 178083	2013-03-26 22:47:01 +00:00
Bill Wendling	52cf114e8c	Revert r177675. This is language-specific and shouldn't be in the API. llvm-svn: 177748	2013-03-22 18:46:32 +00:00
Arnaud A. de Grandmaison	7a4226244b	InstCombine: Improve the result bitvect type when folding (cmp pred (load (gep GV, i)) C) to a bit test. The original code used i32, and i64 if legal. This introduced unneeded casts when they aren't legal, or when the index variable i has another type. In order of preference: try to use i's type; use the smallest fitting legal type (using an added DataLayout method); default to i32. A testcase checks that this works when the index gep operand is i16. Patch by : Ahmed Bougacha <ahmed.bougacha@gmail.com> Reviewed by : Duncan llvm-svn: 177712	2013-03-22 08:25:01 +00:00
Bill Wendling	05a454cd6d	Add a query to tell if a landing pad has a catch-all. llvm-svn: 177675	2013-03-21 23:01:03 +00:00
Chandler Carruth	8613f86d1c	Hoist the definition of getTypeSizeInBits to be inlinable and in the header. This method is called in the hot path for many passes, SROA is what caught my interest. A common pattern is that which branch of the switch should be taken is known in the callsite and so it is a very good candidate for inlining and simplification. Moving it into the header allows the optimizer to fold a lot of boring, repeatitive code in callers of this routine. I'm seeing pretty significant speedups in parts of SROA and I suspect other passes will see similar speedups if they end up working with type sizes frequently. I've not seen any significant growth of the binaries as a consequence, but let me know if you see anything suspicious here. llvm-svn: 177632	2013-03-21 09:52:22 +00:00
Benjamin Kramer	bb22dec29c	Remove default copy ctor/assignment, makes AttributeSet trivially copyable. And enables SmallVector's pod optimizations. llvm-svn: 177281	2013-03-18 12:14:30 +00:00
Reed Kotler	6f984e6349	Add some additonal attribute helper functions. Test will be on follow up putback to clang for mips16. llvm-svn: 176968	2013-03-13 20:20:08 +00:00
Pete Cooper	7a5199df98	Add a doFinalization method to the DataLayout pass. This pass is meant to be immutable, however it holds mutable state to cache StructLayouts. This method will allow the pass manager to clear the mutable state between runs. Note that unfortunately it is still necessary to have the destructor, even though it does the same thing as doFinalization. This is because most TargetMachines embed a DataLayout on which doFinalization isn't run as its never added to the pass manager. I also didn't think it was necessary to complication things with a deInit method for which doFinalization and ~DataLayout both call as there's only one field of mutable state. If we had more fields to finalize i'd have added this. llvm-svn: 176877	2013-03-12 17:37:31 +00:00
Benjamin Kramer	2d44bd7bc4	Fix tautological compare. Not sure why this didn't trigger any test failures. llvm-svn: 176652	2013-03-07 20:56:18 +00:00
Jakub Staszak	d62f609790	Change Index type from unsigned long to unsigned. This should fix PR14980. llvm-svn: 176645	2013-03-07 20:21:27 +00:00
Jakub Staszak	9f20ea9a91	Remove trailing spaces. llvm-svn: 176643	2013-03-07 20:04:17 +00:00
Shuxin Yang	048b100cc5	Memory Dependence Analysis (not mem-dep test) take advantage of "invariant.load" metadata. The "invariant.load" metadata indicates the memory unit being accessed is immutable. A load annotated with this metadata can be moved across any store. As I am not sure if it is legal to move such loads across barrier/fence, this change dose not allow such transformation. rdar://11311484 Thank Arnold for code review. llvm-svn: 176562	2013-03-06 17:48:48 +00:00
Peter Collingbourne	8b72c382d6	Modify {Call,Invoke}Inst::addAttribute to take an AttrKind. llvm-svn: 176397	2013-03-02 01:20:18 +00:00
Michael Ilseman	6bd55f4125	Cache the result of Function::getIntrinsicID() in a DenseMap attached to the LLVMContext. This reduces the time actually spent doing string to ID conversion and shows a 10% improvement in compile time for a particularly bad case that involves ARM Neon intrinsics (these have many overloads). Patch by Jean-Luc Duprat! llvm-svn: 176365	2013-03-01 18:48:54 +00:00
Peng Cheng	7600fc5ab4	test commit to use consistent comment notation. llvm-svn: 176353	2013-03-01 16:49:35 +00:00

... 4 5 6 7 8 ...

673 Commits