llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Lang Hames	7883c4d5ae	Teach LocalStackSlotAllocation that stackmaps/patchpoints don't have range constraints on their frame offsets. llvm-svn: 195950	2013-11-29 06:35:30 +00:00
Hal Finkel	69f21285ed	Create a PPC440 SchedMachineModel Some of the older PPC processor definitions don't have associated SchedMachineModels; correct this for the PPC440. llvm-svn: 195949	2013-11-29 06:32:17 +00:00
Hal Finkel	c5a38fd3e6	Fixup PPC440 load/store operand latencies The operand latencies for loads and stores in the PPC440 itinerary were wrong (the store operands are all inputs, and the "with update" (pre-increment) instructions need a latency for the additional output). llvm-svn: 195948	2013-11-29 06:19:43 +00:00
Hal Finkel	a9d93b1740	Adjust PPC440 operand latencies The operand latencies for the PPC440 should be specified relative to dispatch, not relative to the initial fetch-and-decode stages. Because most instructions (ignoring bypass) wait in dispatch until their operands are ready, this is modeled as reading input operands "at dispatch" (0 cycles after issue), and so every input and output operand has 4 cycles subtracted from it. This could alter scheduling slightly, but I don't expect a large effect. llvm-svn: 195947	2013-11-29 05:59:00 +00:00
Hal Finkel	f086fc01ab	Don't model the fetch and decode units for the PPC440 Modeling the fetch and decode units in the PPC440 itinerary does not add anything to the hazard detection capability (and so modeling them just wastes compile time). No functionality change intended. llvm-svn: 195946	2013-11-29 05:58:38 +00:00
Lang Hames	82e8d4faa9	Remove unused variable from r195944. llvm-svn: 195945	2013-11-29 03:36:53 +00:00
Lang Hames	067c025250	Refactor a lot of patchpoint/stackmap related code to simplify and make it target independent. Most of the x86 specific stackmap/patchpoint handling was necessitated by the use of the native address-mode format for frame index operands. PEI has now been modified to treat stackmap/patchpoint similarly to DEBUG_INFO, allowing us to use a simple, platform independent register/offset pair for frame indexes on stackmap/patchpoints. Notes: - Folding is now platform independent and automatically supported. - Emiting patchpoints with direct memory references now just involves calling the TargetLoweringBase::emitPatchPoint utility method from the target's XXXTargetLowering::EmitInstrWithCustomInserter method. (See X86TargetLowering for an example). - No more ugly platform-specific operand parsers. This patch shouldn't change the generated output for X86. llvm-svn: 195944	2013-11-29 03:07:54 +00:00
Hao Liu	b9fa1067c7	AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. llvm-svn: 195941	2013-11-29 02:11:22 +00:00
Jiangning Liu	afc7f71eb3	Add missing pattern for supporting intrinsic function vbsl_f64 with argument double floating point. llvm-svn: 195938	2013-11-29 01:37:15 +00:00
Kevin Qin	b95721d200	[AArch64 NEON]Fix a assertion failure when disassemble SHLL instruction. llvm-svn: 195936	2013-11-29 01:29:16 +00:00
Stephen Canon	d8aaca93a6	Rein in overzealous InstCombine of fptrunc(OP(fpextend, fpextend)). llvm-svn: 195934	2013-11-28 21:38:05 +00:00
Rafael Espindola	3f4a857bd5	Refactor to remove a bit of duplication. No functionality change. llvm-svn: 195933	2013-11-28 20:12:44 +00:00
Benjamin Kramer	fd8fd4246f	Silence sign-compare warning and reduce nesting. No functionality change. llvm-svn: 195932	2013-11-28 19:58:56 +00:00
Rafael Espindola	7e7db10302	Remove an always true parameter. llvm-svn: 195931	2013-11-28 19:35:07 +00:00
NAKAMURA Takumi	9b851e876f	[CMake] Let add_public_tablegen_target() provide intrinsics_gen, too. I think, in principle, intrinsics_gen may be added explicitly. That said, it can be added incidentally, since each target already has dependencies to llvm-tblgen. Almost all source files depend on both CommonTaleGen and intrinsics_gen. Explicit add_dependencies() have been pruned under lib/Target. llvm-svn: 195929	2013-11-28 17:04:31 +00:00
NAKAMURA Takumi	99f544b37e	[CMake] Let add_public_tablegen_target responsible to provide dependency to CommonTableGen. add_public_tablegen_target adds *CommonTableGen to LLVM_COMMON_DEPENDS. LLVM_COMMON_DEPENDS affects add_llvm_library (and other add_target stuff) within its scope. llvm-svn: 195927	2013-11-28 17:04:04 +00:00
Rafael Espindola	02c35a15de	The global prefix is always one char. Don't use a string for it. llvm-svn: 195926	2013-11-28 17:00:49 +00:00
NAKAMURA Takumi	46b765a4a3	[CMake] Prune include_directories() in llvm/lib/Target, take #2 . I forgot to commit them. They were staging in my local repo. llvm-svn: 195924	2013-11-28 15:30:37 +00:00
Daniel Sanders	86f254d104	[mips] Revert test commit r195922. llvm-svn: 195923	2013-11-28 15:26:33 +00:00
Daniel Sanders	5b3619e21b	[mips] A test commit to test my Herald and Audit workflow Will be reverted in the next commit llvm-svn: 195922	2013-11-28 15:25:43 +00:00
NAKAMURA Takumi	5dbd3bcf3d	[CMake] Prune include_directories() in llvm/lib/Target. add_llvm_target() sets them. llvm-svn: 195921	2013-11-28 14:53:30 +00:00
NAKAMURA Takumi	128a461f6a	Add newline at eof. llvm-svn: 195920	2013-11-28 14:52:52 +00:00
Rafael Espindola	05d05c4e8a	Use the mangler consistently instead of using getGlobalPrefix directly. llvm-svn: 195911	2013-11-28 08:59:52 +00:00
Hal Finkel	e49bc01fba	Don't share functional units among the PPC itineraries Instead of sharing functional unit names between the various PPC itineraries, give each core its own unit names prefixed with the core name. This follows the convention used by other backends (such as ARM), and removes a non-obvious ordering dependency between the various PPCSchedule*.td files. No functionality change intended. llvm-svn: 195908	2013-11-28 06:05:59 +00:00
Jiangning Liu	7f44dcb9f4	Remove the variable only used by assert to avoid the build failure caused by build options [-Werror,-Wunused-variable]. llvm-svn: 195905	2013-11-28 01:34:55 +00:00
Hao Liu	2f617213ef	AArch64: Fix a bug about disassembling post-index load single element to 4 vectors llvm-svn: 195903	2013-11-28 01:07:45 +00:00
Reed Kotler	deb5d6d05e	Check in conditional branches for constant islands. Still need to finish conditional branches for very large targets. That will be the next small patch. Everything now should in principle work as good (functionality wise) as without constant islands so we decided at Mips/Imagination to make constant islands the default for Mips16 now so that it will get excercised a lot and this port is still experimentatl though hopefully soon we will change the status. Some more cleanup and code review is in order but things are converging fast. llvm-svn: 195902	2013-11-28 00:56:37 +00:00
Akira Hatanaka	a964ea01fe	[mips] Redefine TAILCALL as a pseudo instruction. No functionality change. llvm-svn: 195896	2013-11-27 23:58:32 +00:00
David Blaikie	91268863a6	DebugInfo: Do not include variables only referenced by templates in aranges. ARanges included even extern variables referenced by pointer non-type template parameters even though that variable isn't part of this compilation unit. llvm-svn: 195895	2013-11-27 23:53:52 +00:00
Akira Hatanaka	f831bd9772	Add MipsOptimizePICCall.cpp to CMakeLists.txt. llvm-svn: 195894	2013-11-27 23:47:25 +00:00
Akira Hatanaka	ff17fbeebc	[mips] Implement the following optimizations using dominance information to make PIC calls a little more efficient: 1. Remove instructions setting up $gp if it is known that a function has been called at least once. 2. Save the address of a called function in a register instead of loading it from the GOT at every call site. llvm-svn: 195892	2013-11-27 23:38:42 +00:00
Hal Finkel	40fc5609c6	Add IIC_ prefix to PPC instruction-class names This adds the IIC_ prefix to the instruction itinerary class names, giving the PPC backend a naming convention for itinerary classes that is more consistent with that used by the X86 and ARM backends. Instruction scheduling in the PPC backend needs a bunch of cleanup and improvement (especially for the ooo cores). This is just a preliminary step. No functionality change intended. llvm-svn: 195890	2013-11-27 23:26:09 +00:00
Rafael Espindola	916039a184	Don't set GlobalPrefix to the default value. llvm-svn: 195884	2013-11-27 21:57:54 +00:00
Rafael Espindola	b424a0de8f	The R600 has its own asm printer which doesn't use GlobalPrefix. Drop it. llvm-svn: 195883	2013-11-27 21:52:37 +00:00
Tom Stellard	95624c101d	R600: Expand vector FABS NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195881	2013-11-27 21:23:39 +00:00
Tom Stellard	eac3acc854	R600/SI: Implement spilling of SGPRs v5 SGPRs are spilled into VGPRs using the {READ,WRITE}LANE_B32 instructions. v2: - Fix encoding of Lane Mask - Use correct register flags, so we don't overwrite the low dword when restoring multi-dword registers. v3: - Register spilling seems to hang the GPU, so replace all shaders that need spilling with a dummy shader. v4: - Fix *LANE definitions - Change destination reg class for 32-bit SMRD instructions v5: - Remove small optimization that was crashing Serious Sam 3. https://bugs.freedesktop.org/show_bug.cgi?id=68224 https://bugs.freedesktop.org/show_bug.cgi?id=71285 NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195880	2013-11-27 21:23:35 +00:00
Tom Stellard	d386cdf4d0	R600/SI: Use SGPR_32 register class for 32-bit SMRD outputs Writing to the M0 register from an SMRD instruction hangs the GPU, so we need to use the SGPR_32 register class, which does not include M0. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195879	2013-11-27 21:23:29 +00:00
Tom Stellard	0a14ce13e1	R600: Add support for ISD::FROUND NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195878	2013-11-27 21:23:20 +00:00
Lang Hames	cf7d9a9193	Show stackmap entry encodings in stackmap debug logs. This makes it easier to cross-reference debug output with encoded stack-maps, and to create stackmap test-cases. llvm-svn: 195874	2013-11-27 20:10:16 +00:00
Rafael Espindola	9eb375af22	Remove dead code. MO_ExternalSymbol and MO_JumpTableIndex don't show up in inline asm. llvm-svn: 195861	2013-11-27 18:38:14 +00:00
Rafael Espindola	71b51215cd	Convert two if sequences to switches. llvm-svn: 195859	2013-11-27 18:26:51 +00:00
Rafael Espindola	c0477ed90c	Use a switch. llvm-svn: 195857	2013-11-27 18:18:24 +00:00
Rafael Espindola	c099bf8035	Use the same tls section name as msvc. We currently error in clang with: "error: thread-local storage is unsupported for the current target", but we can start to get the llvm level ready. When compiling template<typename T> struct foo { static __declspec(thread) int bar; }; template<typename T> __declspec(therad) int foo<T>::bar; template struct foo<int>; msvc produces SECTION HEADER #3 .tls$ name 0 physical address 0 virtual address 4 size of raw data 12F file pointer to raw data (0000012F to 00000132) 0 file pointer to relocation table 0 file pointer to line numbers 0 number of relocations 0 number of line numbers C0301040 flags Initialized Data COMDAT; sym= "public: static int foo<int>::bar" (?bar@?$foo@H@@2HA) 4 byte align Read Write gcc produces a ".data$__emutls_v.<symbol>" for the testcase with __declspec(thread) replaced with thread_local. llvm-svn: 195849	2013-11-27 15:52:11 +00:00
Rafael Espindola	8edf1bc0ac	Remove more dead code now that this is only used for inline asm. MO_ConstantPoolIndex is handled in printLeaMemReference. MO_JumpTableIndex and MO_ExternalSymbol don't show up in inline asm. llvm-svn: 195847	2013-11-27 15:13:06 +00:00
Jiangning Liu	d9270b7a51	Fix the AArch64 NEON bug exposed by checking constant integer argument range of ACLE intrinsics. llvm-svn: 195843	2013-11-27 14:02:25 +00:00
Rafael Espindola	533a55aa15	Convert more methods in static helpers. llvm-svn: 195826	2013-11-27 07:34:09 +00:00
Rafael Espindola	20a45908b6	Convert these methods into static functions. llvm-svn: 195825	2013-11-27 07:14:26 +00:00
Rafael Espindola	22b6ec4d69	Cleanup and test X86AsmPrinter::printPCRelImm. It is only used for asm printing. On X86 we put basic block addresses on register before passing them to inline asm, so the MO_MachineBasicBlock case was dead. MO_ExternalSymbol was dead since any symbol being passed to inline asm is represented as MO_GlobalAddress. The MO_GlobalAddress and MO_Register cases were not tested. llvm-svn: 195824	2013-11-27 06:53:13 +00:00
Hal Finkel	4dc7aae3a3	Fix comment in PPCA2Model llvm-svn: 195807	2013-11-27 03:12:56 +00:00
Rafael Espindola	5e593c2e45	Remove dead argument. llvm-svn: 195806	2013-11-27 02:25:20 +00:00
Chad Rosier	ca062e81db	[AArch64] Add support for NEON scalar floating-point absolute difference. llvm-svn: 195803	2013-11-27 01:45:58 +00:00
Rafael Espindola	3ceb67b21b	Use simple section names for COMDAT sections on COFF. With this patch we use simple names for COMDAT sections (like .text or .bss). This matches the MSVC behavior. When merging it is the COMDAT symbol that is used to decide if two sections should be merged, so there is no point in building a fancy name. This survived a bootstrap on mingw32. llvm-svn: 195798	2013-11-27 01:18:37 +00:00
Nadav Rotem	dc01e91cf5	PR1860 - We can't save a list of ExtractElement instructions to CSE because some of these instructions may be removed and optimized in future iterations. Instead we save a list of basic blocks that we need to CSE. llvm-svn: 195791	2013-11-26 22:24:25 +00:00
Eric Christopher	1713a7abd0	80-column fixups. llvm-svn: 195790	2013-11-26 22:23:27 +00:00
Chad Rosier	1337fcc721	[AArch64] Add support for NEON scalar floating-point to integer convert instructions. llvm-svn: 195788	2013-11-26 22:17:37 +00:00
Arnold Schwaighofer	d0c05d2c84	LoopVectorizer: Truncate i64 trip counts of i32 phis if necessary In signed arithmetic we could end up with an i64 trip count for an i32 phi. Because it is signed arithmetic we know that this is only defined if the i32 does not wrap. It is therefore safe to truncate the i64 trip count to a i32 value. Fixes PR18049. llvm-svn: 195787	2013-11-26 22:11:23 +00:00
Reed Kotler	06b47695fb	Fix a bug related to constant islands for Mips16 and mips16/32 dual mode. The determination of when we are doing constant pools was being made too early in the asm printer. llvm-svn: 195781	2013-11-26 20:38:40 +00:00
Diego Novillo	0929297da9	Refactor some code in SampleProfile.cpp I'm adding new functionality in the sample profiler. This will require more data to be kept around for each function, so I moved the structure SampleProfile that we keep for each function into a separate class. There are no functional changes in this patch. It simply provides a new home where to place all the new data that I need to propagate weights through edges. There are some other name and minor edits throughout. llvm-svn: 195780	2013-11-26 20:37:33 +00:00
Michael Liao	8c702e1a18	Fix PR18054 - Fix bug in (vsext (vzext x)) -> (vsext x) in SIGN_EXTEND_IN_REG lowering where we need to check whether x is a vector type (in-reg type) of i8, i16 or i32; otherwise, that optimization is not valid. llvm-svn: 195779	2013-11-26 20:31:31 +00:00
David Blaikie	bbf2455d59	DwarfDebug: Include type units in accelerator tables. Since type units aren't in the CUMap, use the DwarfUnits list to iterate over units for tasks such as accelerator table building. llvm-svn: 195776	2013-11-26 19:14:34 +00:00
Renato Golin	1b5ffb71ea	Fix spurious return introduced by my earlier patch to DebugInfo llvm-svn: 195775	2013-11-26 18:54:37 +00:00
Nadav Rotem	643eb4c26e	PR18060 - When we RAUW values with ExtractElement instructions in some cases we generate PHI nodes with multiple entries from the same basic block but with different values. Enabling CSE on ExtractElement instructions make sure that all of the RAUWed instructions are the same. llvm-svn: 195773	2013-11-26 17:29:19 +00:00
Renato Golin	3594306ff3	Add return to DIType::Verify Code scanner ran by Sylvestre Ledru got a no_return bug in DebugInfo.cpp. Adding the return statements that should be there. llvm-svn: 195772	2013-11-26 16:47:00 +00:00
Stepan Dyatkovskiy	83455f2b60	PR17925 bugfix. Short description. This issue is about case of treating pointers as integers. We treat pointers as different if they references different address space. At the same time, we treat pointers equal to integers (with machine address width). It was a point of false-positive. Consider next case on 32bit machine: void foo0(i32 addrespace(1)* %p) void foo1(i32 addrespace(2)* %p) void foo2(i32 %p) foo0 != foo1, while foo1 == foo2 and foo0 == foo2. As you can see it breaks transitivity. That means that result depends on order of how functions are presented in module. Next order causes merging of foo0 and foo1: foo2, foo0, foo1 First foo0 will be merged with foo2, foo0 will be erased. Second foo1 will be merged with foo2. Depending on order, things could be merged we don't expect to. The fix: Forbid to treat any pointer as integer, except for those, who belong to address space 0. llvm-svn: 195769	2013-11-26 16:11:03 +00:00
Timur Iskhodzhanov	a7f0ef25ed	Rename DwarfException methods so the new names are consistent with DwarfDebug and the style guide llvm-svn: 195763	2013-11-26 13:34:55 +00:00
Tim Northover	f0a2ff9091	Darwin-ARM: use movw/movt for static relocations llvm-svn: 195759	2013-11-26 12:45:05 +00:00
Chandler Carruth	f3c4692f91	[PM] Factor the overwhelming majority of the interface boiler plate out of the two analysis managers into a CRTP base class that can be shared and re-used in building any analysis manager. This will in turn simplify adding yet another analysis manager to the system. The base class provides all of the interface sugar for the analysis manager delegating the functionality back through DerivedT methods which operate on simple pass IDs. It also provides the pass registration, storage, and lookup system which is common across the various formulations of analysis managers. llvm-svn: 195747	2013-11-26 11:24:37 +00:00
Richard Sandiford	b3250399ac	[SystemZ] Fix incorrect use of RISBG for a zero-extended right shift We would wrongly transform the testcase into the equivalent of an AND with 1. The problem was that, when testing whether the shifted-in bits of the right shift were significant, we used the width of the final zero-extended result rather than the width of the shifted value. llvm-svn: 195731	2013-11-26 10:53:16 +00:00
Chandler Carruth	5be5f8d16c	[PM] Split the CallGraph out from the ModulePass which creates the CallGraph. This makes the CallGraph a totally generic analysis object that is the container for the graph data structure and the primary interface for querying and manipulating it. The pass logic is separated into its own class. For compatibility reasons, the pass provides wrapper methods for most of the methods on CallGraph -- they all just forward. This will allow the new pass manager infrastructure to provide its own analysis pass that constructs the same CallGraph object and makes it available. The idea is that in the new pass manager, the analysis pass's 'run' method returns a concrete analysis 'result'. Here, that result is a 'CallGraph'. The 'run' method will typically do only minimal work, deferring much of the work into the implementation of the result object in order to be lazy about computing things, but when (like DomTree) there is some up-front computation, the analysis does it prior to handing the result back to the querying pass. I know some of this is fairly ugly. I'm happy to change it around if folks can suggest a cleaner interim state, but there is going to be some amount of unavoidable ugliness during the transition period. The good thing is that this is very limited and will naturally go away when the old pass infrastructure goes away. It won't hang around to bother us later. Next up is the initial new-PM-style call graph analysis. =] llvm-svn: 195722	2013-11-26 04:19:30 +00:00
Chandler Carruth	f5b149f90b	[PM] Reformat some code with clang-format as I'm going to be editting as part of generalizing the call graph infrastructure for the new pass manager. llvm-svn: 195718	2013-11-26 03:45:26 +00:00
Kevin Qin	1370a1e1ee	Refactored the implementation of AArch64 NEON instruction ZIP, UZP and TRN. Fix a bug when mixed use of vget_high_u8() and vuzp_u8(). llvm-svn: 195716	2013-11-26 03:26:47 +00:00
Kevin Qin	95c8b28223	[AArch64]Implement 128 bit register copy with NEON. llvm-svn: 195713	2013-11-26 02:33:42 +00:00
Andrew Trick	95afafe3fa	StackMap: Implement support for DirectMemRefOp. A Direct stack map location records the address of frame index. This address is itself the value that the runtime requested. This differs from IndirectMemRefOp locations, which refer to a stack locations from which the requested values must be loaded. Direct locations can directly communicate the address if an alloca, while IndirectMemRefOp handle register spills. For example: entry: %a = alloca i64... llvm.experimental.stackmap(i32 <ID>, i32 <shadowBytes>, i64* %a) Since both the alloca and stackmap intrinsic are in the entry block, and the intrinsic takes the address of the alloca, the runtime can assume that LLVM will not substitute alloca with any intervening value. This must be verified by the runtime by checking that the stack map's location is a Direct location type. The runtime can then determine the alloca's relative location on the stack immediately after compilation, or at any time thereafter. This differs from Register and Indirect locations, because the runtime can only read the values in those locations when execution reaches the instruction address of the stack map. llvm-svn: 195712	2013-11-26 02:03:25 +00:00
Andrew Trick	95115e649e	whitespace llvm-svn: 195711	2013-11-26 02:03:20 +00:00
Chandler Carruth	dc82f92af7	Lift self-copy protection up to the header file and add self-move protection to the same layer. This is in line with Howard's advice on how best to handle self-move assignment as he explained on SO[1]. It also ensures that implementing swap with move assignment continues to work in the case of self-swap. [1]: http://stackoverflow.com/questions/9322174/move-assignment-operator-and-if-this-rhs llvm-svn: 195705	2013-11-26 00:54:44 +00:00
Chandler Carruth	9233af88ac	Fix a self-memcpy which only breaks under Valgrind's memcpy implementation. Silliness, but it'll be a trivial performance optimization. This should clear up a failure on the vg_leak bot. llvm-svn: 195704	2013-11-26 00:44:36 +00:00
Chandler Carruth	157fe29e22	[PM] Rename the 'Mod' member to the more idiomatic 'M'. No functionality changed. llvm-svn: 195701	2013-11-26 00:37:23 +00:00
David Blaikie	2284510f45	DebugInfo: Remove CompileUnit::constructTypeDIEImpl now that it's just a simple wrapper again. r195698 moved the type unit checking up into getOrCreateTypeDIE so remove the redundant check and fold the functions back together again. llvm-svn: 195700	2013-11-26 00:35:04 +00:00
David Blaikie	98277f8277	DebugInfo: Avoid emitting pubtype entries for type DIEs that just indirect to a type unit. llvm-svn: 195698	2013-11-26 00:22:37 +00:00
Cameron McInally	2ff051483c	Add an intrinsic for the SSE2 PAUSE instruction. llvm-svn: 195697	2013-11-26 00:20:43 +00:00
David Blaikie	64a5628952	DebugInfo: Pubtypes: Coelesce pubtype registration with accelerator type registration. It might be possible to eventually use one data structure, but I haven't looked at the exact criteria used for accelerator tables and pubtypes to see if there's good reason for the differences between the two or not. llvm-svn: 195696	2013-11-26 00:15:27 +00:00
Rafael Espindola	c1e50a3473	Do the string comparison in the constructor instead of once per nop. Thanks to Roman Divacky for the suggestion. llvm-svn: 195684	2013-11-25 20:50:03 +00:00
Rafael Espindola	fa5cbd5557	Don't use nopl in cpus that don't support it. Patch by Mikulas Patocka. I added the test. I checked that for cpu names that gas knows about, it also doesn't generate nopl. The modified cpus: i686 - there are i686-class CPUs that don't have nopl: Via c3, Transmeta Crusoe, Microsoft VirtualBox - see https://bbs.archlinux.org/viewtopic.php?pid=775414 k6, k6-2, k6-3, winchip-c6, winchip2 - these are 586-class CPUs via c3 c3-2 - see https://bugs.archlinux.org/task/19733 as a proof that Via c3 and c3-Nehemiah don't have nopl llvm-svn: 195679	2013-11-25 20:15:14 +00:00
David Peixotto	647697e4ae	ARM integrated assembler generates incorrect nop opcode This patch fixes a bug in the assembler that was causing bad code to be emitted. When switching modes in an assembly file (e.g. arm to thumb mode) we would always emit the opcode from the original mode. Consider this small example: $ cat align.s .code 16 foo: add r0, r0 .align 3 add r0, r0 $ llvm-mc -triple armv7-none-linux align.s -filetype=obj -o t.o $ llvm-objdump -triple thumbv7 -d t.o Disassembly of section .text: foo: 0: 00 44 add r0, r0 2: 00 f0 20 e3 blx #4195904 6: 00 00 movs r0, r0 8: 00 44 add r0, r0 This shows that we have actually emitted an arm nop (e320f000) instead of a thumb nop. Unfortunately, this encodes to a thumb branch which causes bad things to happen when compiling assembly code with align directives. The fix is to notify the ARMAsmBackend when we switch mode. The MCMachOStreamer was already doing this correctly. This patch makes the same change for the MCElfStreamer. There is still a bug in the way nops are emitted for alignment because the MCAlignment fragment does not store the correct mode. The ARMAsmBackend will emit nops for the last mode it knew about. In the example above, we still generate an arm nop if we add a `.code 32` to the end of the file. PR18019 llvm-svn: 195677	2013-11-25 19:11:13 +00:00
Bill Wendling	0fe82ef0aa	Unrevert r195599 with testcase fix. I'm not sure how it was checking for the wrong values... PR18023. llvm-svn: 195670	2013-11-25 18:05:22 +00:00
Tim Northover	86362cbc00	Fix indentation typo llvm-svn: 195660	2013-11-25 17:04:35 +00:00
Tim Northover	35cd30aae4	ARM: remove special cases for Darwin dynamic-no-pic mode. These are handled almost identically to static mode (and ELF's global address materialisation), except that a symbol may have "$non_lazy_ptr" appended. This can be handled by passing appropriate flags along with the instruction instead of using entirely separate pseudo-instructions. llvm-svn: 195655	2013-11-25 16:24:52 +00:00
Rafael Espindola	a355ffef1b	Fix .comm and .lcomm on COFF. These should not use COMDATs. GNU as uses .bss for .lcomm and section 0 for .comm. Given static int a; int b; MSVC puts both in .bss. This patch then puts both .comm and .lcomm on .bss. With this change we agree with gas on .lcomm, are much closer on .comm and clang-cl matches msvc on the above example. llvm-svn: 195654	2013-11-25 16:06:04 +00:00
Rafael Espindola	b5790f7596	Refactor to make the .bss, .data and .text sections available for other uses. No functionality change. llvm-svn: 195653	2013-11-25 16:00:32 +00:00
Benjamin Kramer	9f1338e49d	Make helper function static. llvm-svn: 195650	2013-11-25 15:40:24 +00:00
Tim Northover	9a6b1a0022	ARM: remove unused patterns. There is no sane way for an LEApcrel (= single ADR) instruction to generate a global address on any ARM target I know of. Fortunately, no-one was trying to any more, but there were vestigial patterns. llvm-svn: 195644	2013-11-25 14:40:57 +00:00
Amara Emerson	368f3c89e8	[ARM] Enable FeatureMP for Cortex-A5 by default. Patch by Oliver Stannard. llvm-svn: 195640	2013-11-25 13:17:15 +00:00
Amara Emerson	dfecbfdfc2	Revert r195599 as it broke the builds. llvm-svn: 195636	2013-11-25 11:24:18 +00:00
Daniel Sanders	054e9e0703	Fixed tryFoldToZero() for vector types that need expansion. Summary: Moved the requirement for SelectionDAG::getConstant() to return legally typed nodes slightly earlier. There were two optional DAGCombine passes that were missed out and were required to produce type-legal DAGs. Simplified a code-path in tryFoldToZero() to use SelectionDAG::getConstant(). This provides support for both promoted and expanded vector types whereas the previous code only supported promoted vector types. Fixes a "Type for zero vector elements is not legal" assertion detected by an llvm-stress generated test. Reviewers: resistor CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2251 llvm-svn: 195635	2013-11-25 11:14:43 +00:00
Tim Northover	1b70928c82	X86: enable AVX2 under Haswell native compilation Patch by Adam Strzelecki llvm-svn: 195632	2013-11-25 09:52:59 +00:00
Bill Wendling	f4bc87d59d	Don't look past volatile loads. A volatile load should block us from trying to coalesce stores. PR18023 llvm-svn: 195599	2013-11-25 05:01:21 +00:00
Hao Liu	66ab312f94	Fixed a bug about disassembling AArch64 post-index load/store single element instructions. ie. echo "0x00 0x04 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble echo "0x00 0x00 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble will be disassembled into the same instruction st1 {v0b}[0], [x0], x0. llvm-svn: 195591	2013-11-25 01:53:26 +00:00
NAKAMURA Takumi	035ff3c7bf	SparcFrameLowering.cpp: Prune 'DL' [-Wunused-variable] llvm-svn: 195590	2013-11-25 00:52:46 +00:00
Chandler Carruth	0e71e8676c	Output a bit more information in the debug printing for MBP. This was useful when analyzing parts of zlib's behavior here. llvm-svn: 195588	2013-11-25 00:43:41 +00:00
Venkatraman Govindaraju	d0d03fae95	[Sparc] Emit large negative adjustments to SP/FP with sethi+xor instead of sethi+or. This generates correct code for both sparc32 and sparc64. llvm-svn: 195576	2013-11-24 20:23:25 +00:00
Venkatraman Govindaraju	71233183a9	[Sparc]: Implement LEA pattern for sparcv9. llvm-svn: 195575	2013-11-24 20:07:35 +00:00
Venkatraman Govindaraju	73dd53211d	[SparcV9]: Do not emit .register directives for global registers that are clobbered by calls but not used in the function itself. llvm-svn: 195574	2013-11-24 18:41:49 +00:00
Venkatraman Govindaraju	0c27a5ac2c	[SparcV9] Enable custom lowering of DYNAMIC_STACKALLOC in sparc64. llvm-svn: 195573	2013-11-24 17:41:41 +00:00
Reed Kotler	6088c0e228	Make sure that for C++ emitting LwConstant32 pseudos, that it corresponds to what is needed for constant islands. The prescan method for Mips16 constant islands will eventually go away. It is only temporary and should be done earlier when the instructions are first created or from the DAG. If we keep it here we need to handle better the situation where constant islands is called multiple times since don't want to prescan more than once. llvm-svn: 195569	2013-11-24 06:18:50 +00:00
Reed Kotler	eb75f46c95	Fix a funny bug I introduced during conversion of ARM constant islands to Mips. I had to move some code and I moved a declaration forward past it's first use in the function but by nutty coincidence there was another variable of the same name and type and with completely unrelated function that was declared globally in the class so no compilation error ensued. It required some unusual conditions for it to even matter. Caused test case casts.c in test-suite to fail during compilation with a duplicate symbol error. I would have noticed it during final code review for this port. llvm-svn: 195565	2013-11-24 02:53:09 +00:00
Chandler Carruth	bc6ef9dce9	[PM] Complete the cross-layer interfaces with a Module-to-Function proxy. This lets a function pass query a module analysis manager. However, the interface is const to indicate that only cached results can be safely queried. With this, I think the new pass manager is largely functionally complete for modules and analyses. Still lots to test, and need to generalize to SCCs and Loops, and need to build an adaptor layer to support the use of existing Pass objects in the new managers. llvm-svn: 195538	2013-11-23 01:25:07 +00:00
David Blaikie	221a129b24	DwarfDebug: Move ownership of CompileUnits into DwarfUnits This avoids the need for an extra list of SkeletonCUs and associated cleanup while staging things to be cleaner for further type unit improvements. Also hopefully fixes a memory leak introduced in r195166. llvm-svn: 195536	2013-11-23 01:17:34 +00:00
Chandler Carruth	a1094eb135	Migrate metadata information from scalar to vector instructions during SLP vectorization. Based on the code in BBVectorizer. Fixes PR17741. Patch by Raul Silvera, reviewed by Hal and Nadav. Reformatted by my driving of clang-format. =] llvm-svn: 195528	2013-11-23 00:48:34 +00:00
Chandler Carruth	a3ea32a7e4	[PM] Add support to the analysis managers to query explicitly for cached results. This is the last piece of infrastructure needed to effectively support querying up the analysis layers. The next step will be to introduce a proxy which provides access to those layers with appropriate use of const to direct queries to the safe interface. llvm-svn: 195525	2013-11-23 00:38:42 +00:00
Eric Christopher	adc49d7aab	Refactor DW_AT_ranges handling to use labels for ranges rather than a non-relocatable number offset. One fixme to make the ranges as discrete data structures and have range lists explicitly represented rather than as a list of symbols. llvm-svn: 195523	2013-11-23 00:05:29 +00:00
Eric Christopher	4018f1e649	Reformat const for readability. llvm-svn: 195522	2013-11-23 00:05:06 +00:00
Chandler Carruth	be663f7b2a	[PM] Switch the downward invalidation to be incremental where only the one function's analyses are invalidated at a time. Also switch the preservation of the proxy to fully preserve the lower (function) analyses. Combined, this gets both upward and downward analysis invalidation to a point I'm happy with: - A function pass invalidates its function analyses, and its parent's module analyses. - A module pass invalidates all of its functions' analyses including the set of which functions are in the module. - A function pass can preserve a module analysis pass. - If all function passes preserve a module analysis pass, that preservation persists. If any doesn't the module analysis is invalidated. - A module pass can opt into managing all function analysis invalidation itself or none. - The conservative default is none, and the proxy takes the maximally conservative approach that works even if the set of functions has changed. - If a module pass opts into managing function analysis invalidation it has to propagate the invalidation itself, the proxy just does nothing. The only thing really missing is a way to query for a cached analysis or nothing at all. With this, function passes can more safely request a cached module analysis pass without fear of it accidentally running part way through. llvm-svn: 195519	2013-11-22 23:38:07 +00:00
Tom Stellard	5da7926d0a	R600/SI: Fixing handling of condition codes We were ignoring the ordered/onordered bits and also the signed/unsigned bits of condition codes when lowering the DAG to MachineInstrs. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195514	2013-11-22 23:07:58 +00:00
Yuchen Wu	43df88959d	llvm-cov: Split entry blocks in GCNOProfiling.cpp. gcov expects every function to contain an entry block that unconditionally branches into the next block. clang does not implement basic blocks in this manner, so gcov did not output correct branch info if the entry block branched to multiple blocks. This change splits every function's entry block into an empty block and a block with the rest of the instructions. The instrumentation code will take care of the rest. llvm-svn: 195513	2013-11-22 23:07:45 +00:00
Manman Ren	7590fee969	Debug Info: move StripDebugInfo from StripSymbols.cpp to DebugInfo.cpp. We can share the implementation between StripSymbols and dropping debug info for metadata versions that do not match. Also update the comments to match the implementation. A follow-on patch will drop the "Debug Info Version" module flag in StripDebugInfo. llvm-svn: 195505	2013-11-22 22:06:31 +00:00
Jim Grosbach	02f7297367	X86: Perform integer comparisons at i32 or larger. Utilizing the 8 and 16 bit comparison instructions, even when an input can be folded into the comparison instruction itself, is typically not worth it. There are too many partial register stalls as a result, leading to significant slowdowns. By always performing comparisons on at least 32-bit registers, performance of the calculation chain leading to the comparison improves. Continue to use the smaller comparisons when minimizing size, as that allows better folding of loads into the comparison instructions. rdar://15386341 llvm-svn: 195496	2013-11-22 19:57:47 +00:00
Matt Arsenault	35acaad8c7	StructurizeCFG: Fix verification failure with some loops. If the beginning of the loop was also the entry block of the function, branches were inserted to the entry block which isn't allowed. If this occurs, create a new dummy function entry block that branches to the start of the loop. llvm-svn: 195493	2013-11-22 19:24:39 +00:00
Matt Arsenault	9afcbf3562	StructurizeCFG: Fix inverting a branch on an argument llvm-svn: 195492	2013-11-22 19:24:37 +00:00
Paul Robinson	0ef8735f0a	Teach ISel not to optimize 'optnone' functions (revised). Improvements over r195317: - Set/restore EnableFastISel flag instead of just running FastISel within SelectAllBasicBlocks; the flag is checked in various places, and FastISel won't run properly if those places don't do the right thing. - Test looks for normal ISel versus FastISel behavior, and not something more subtle that doesn't work everywhere. Based on work by Andrea Di Biagio. llvm-svn: 195491	2013-11-22 19:11:24 +00:00
Andrew Trick	09cec15235	DEBUG shouldEvict decisions llvm-svn: 195490	2013-11-22 19:07:42 +00:00
Andrew Trick	e1bdea2b2b	Minor cleanup. EvictionCost ctor was confusing relative to the other costs floating around in the code. llvm-svn: 195489	2013-11-22 19:07:38 +00:00
Andrew Trick	0167d0293a	patchpoint: factor SD builder code for live vars. Plain stackmap also optimizes Constant values now. llvm-svn: 195488	2013-11-22 19:07:36 +00:00
Andrew Trick	e150fb7056	patchpoint: eliminate hard coded operand indices. llvm-svn: 195487	2013-11-22 19:07:33 +00:00
Rafael Espindola	524e7f35de	Add a fixed version of r195470 back. The fix is simply to use CurI instead of I when handling aliases to avoid accessing a invalid iterator. original message: Convert linkonce* to weak* instead of strong. Also refactor the logic into a helper function. This is an important improve on mingw where the linker complains about mixed weak and strong symbols. Converting to weak ensures that the symbol is not dropped, but keeps in a comdat, making the linker happy. llvm-svn: 195477	2013-11-22 17:58:12 +00:00
Michael Liao	0f7c6dee5e	Fix PR18014 - When simplifying the mask generation for BLEND, check whether that mask is also consumed by other non-BLEND insns. If true, skip that simplification. llvm-svn: 195476	2013-11-22 17:56:57 +00:00
Richard Sandiford	d5298a3795	[SystemZ] Fix TMHH and TMHL usage for z10 with -O0 I've no idea why I decided to handle TMxx differently from all the other high/low logic operations, but it was a stupid thing to do. The high registers aren't available as separate 32-bit registers on z10, so subreg_h32 can't be used on a GR64 there. I've normally been testing with z196 and with -O3 and so hadn't noticed this until now. llvm-svn: 195473	2013-11-22 17:28:28 +00:00
Rafael Espindola	749aa1e00d	Revert "Convert linkonce* to weak* instead of strong." This reverts commit r195470. Debugging failure in some bots. llvm-svn: 195472	2013-11-22 17:09:34 +00:00
Richard Sandiford	82ac8f6b68	Add a Scalarizer pass. llvm-svn: 195471	2013-11-22 16:58:05 +00:00
Rafael Espindola	2ac1404bee	Convert linkonce* to weak* instead of strong. Also refactor the logic into a helper function. This is an important improvement on mingw where the linker complains about mixed weak and strong symbols. Converting to weak ensures that the symbol is not dropped, but keeps in a comdat, making the linker happy. llvm-svn: 195470	2013-11-22 16:14:30 +00:00
Arnold Schwaighofer	3fa9376236	SLPVectorizer: Fix whitespace errors. llvm-svn: 195468	2013-11-22 15:47:17 +00:00
Rafael Espindola	7660b5bf36	Don't produce tail calls when the caller is x86_thiscallcc. The callee will not pop the stack for us. llvm-svn: 195467	2013-11-22 15:18:28 +00:00
Daniel Sanders	f576325902	Fix typo in a comment added in r195455. Credit to Matheus Almeida for spotting it. llvm-svn: 195456	2013-11-22 13:22:52 +00:00
Daniel Sanders	f10fe5a89a	[mips][msa] Fix corner case for integer constant splats with undef values. lowerBUILD_VECTOR() was treating integer constant splats as being legal regardless of whether they had undef values. This caused instruction selection failures when the undefs were legalized to zero, making the constant non-splat. Fixed this by requiring HasAnyUndef to be false for a integer constant splat to be legal. If it is true, a new node is generated with the undefs replaced with the necessary values to remain a splat. llvm-svn: 195455	2013-11-22 13:14:06 +00:00
Chandler Carruth	df1a8fd535	[PM] Teach the analysis managers to pass themselves as arguments to the run methods of the analysis passes. Also generalizes and re-uses the SFINAE for transformation passes so that users can write an analysis pass and only accept an analysis manager if that is useful to their pass. This completes the plumbing to make an analysis manager available through every pass's run method if desired so that passes no longer need to be constructed around them. llvm-svn: 195451	2013-11-22 12:11:02 +00:00
Richard Barton	98aacc3f23	Add support for Cortex-A12. Patch by Oliver Stannard! llvm-svn: 195448	2013-11-22 11:53:16 +00:00
Chandler Carruth	8370a1a333	[PM] Fix the analysis templates' usage of IRUnitT. This is supposed to be the whole type of the IR unit, and so we shouldn't pass a pointer to it but rather the value itself. In turn, we need to provide a 'Module *' as that type argument (for example). This will become more relevant with SCCs or other units which may not be passed as a pointer type, but also brings consistency with the transformation pass templates. llvm-svn: 195445	2013-11-22 11:34:43 +00:00
Daniel Sanders	d301ede02d	[mips][msa] Float vector constants cannot use ldi.[wd] directly. Bitcast from the appropriate integer vector type. Fixes an instruction selection failure detected by llvm-stress. llvm-svn: 195444	2013-11-22 11:24:50 +00:00
Kostya Serebryany	3c8539795c	Revert r195318 as it causes miscompilation (PR18029) llvm-svn: 195439	2013-11-22 10:30:39 +00:00
Hao Liu	684d7e8968	Fix a Cygwin build failure caused by enum values starting with '_', which is conflicted with some platform macros. This patch only renames variables, no functional change. llvm-svn: 195432	2013-11-22 09:24:41 +00:00
Hao Liu	4c6cc894d2	Fix the bugs about AArch64 Load/Store vector types and bitcast between i64 and vector types. e.g. "%tmp = load <2 x i64>* %ptr" can't be selected. "%tmp = bitcast i64 %in to <2 x i32>" can't be selected. llvm-svn: 195424	2013-11-22 08:47:22 +00:00
Hao Liu	b1bce975ea	Revert last change by haoliu because of buildbot failure. llvm-svn: 195423	2013-11-22 08:34:54 +00:00
Hao Liu	eb2535e203	Fix a Cygwin build failure caused by enum values starting with '_', which is conflicted with some platform macros. This solution only renames variables, no functional change. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195421	2013-11-22 08:17:16 +00:00
Jiangning Liu	a50f9e81f3	For AArch64 back-end instruction selection, lower Neon_Lowxxx with EXTRCT_SUBREG. llvm-svn: 195408	2013-11-22 02:45:13 +00:00
Yi Jiang	74286d427a	SLP Vectorizer: Extract cost will only be added once even if the scalar has multiple external uses. llvm-svn: 195406	2013-11-22 01:57:02 +00:00
Lang Hames	433095d3fe	Fix a typo where we were creating <def,kill> operands instead of <def,dead> ones. Add an assertion to make sure we catch this in the future. Fixes <rdar://problem/15464559>. llvm-svn: 195401	2013-11-22 00:46:32 +00:00
Chandler Carruth	28195a6d83	[PM] Switch analysis managers to be threaded through the run methods rather than the constructors of passes. This simplifies the APIs of passes significantly and removes an error prone pattern where the same manager had to be given to every different layer. With the new API the analysis managers themselves will have to be cross connected with proxy analyses that allow a pass at one layer to query for the analysis manager of another layer. The proxy will both expose a handle to the other layer's manager and it will provide the invalidation hooks to ensure things remain consistent across layers. Finally, the outer-most analysis manager has to be passed to the run method of the outer-most pass manager. The rest of the propagation is automatic. I've used SFINAE again to allow passes to completely disregard the analysis manager if they don't need or want to care. This helps keep simple things simple for users of the new pass manager. Also, the system specifically supports passing a null pointer into the outer-most run method if your pass pipeline neither needs nor wants to deal with analyses. I find this of dubious utility as while some passes don't care about analysis, I'm not sure there are any real-world users of the pass manager itself that need to avoid even creating an analysis manager. But it is easy to support, so there we go. Finally I renamed the module proxy for the function analysis manager to the more verbose but less confusing name of FunctionAnalysisManagerModuleProxy. I hate this name, but I have no idea what else to name these things. I'm expecting in the fullness of time to potentially have the complete cross product of types at the proxy layer: {Module,SCC,Function,Loop,Region}AnalysisManager{Module,SCC,Function,Loop,Region}Proxy (except for XAnalysisManagerXProxy which doesn't make any sense) This should make it somewhat easier to do the next phases which is to build the upward proxy and get its invalidation correct, as well as to make the invalidation within the Module -> Function mapping pass be more fine grained so as to invalidate fewer fuction analyses. After all of the proxy analyses are done and the invalidation working, I'll finally be able to start working on the next two fun fronts: how to adapt an existing pass to work in both the legacy pass world and the new one, and building the SCC, Loop, and Region counterparts. Fun times! llvm-svn: 195400	2013-11-22 00:43:29 +00:00
Tom Stellard	f02139b6c9	R600: Implement TargetInstrInfo::isLegalToSplitMBBAt() Splitting a basic block will create a new ALU clause, so we need to make sure we aren't moving uses of registers that are local to their current clause into a new one. I had a test case for this, but unfortunately unrelated schedule changes invalidated it, and I wasn't been able to come up with another one. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195399	2013-11-22 00:41:08 +00:00
Tom Stellard	c2f05239d7	SelectionDAG: Optimize expansion of vec_type = BITCAST scalar_type The legalizer can now do this type of expansion for more type combinations without loading and storing to and from the stack. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195398	2013-11-22 00:41:05 +00:00
Tom Stellard	439debedd3	Split SETCC if VSELECT requires splitting too. This patch is a rewrite of the original patch commited in r194542. Instead of relying on the type legalizer to do the splitting for us, we now peform the splitting ourselves in the DAG combiner. This is necessary for the case where the vector mask is a legal type after promotion and still wouldn't require splitting. Patch by: Juergen Ributzka NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195397	2013-11-22 00:39:23 +00:00
Eric Christopher	8b3b8ae8cc	In Dwarf 3 (and Dwarf 2) attributes whose value are offsets into a section use the form DW_FORM_data4 whilst in Dwarf 4 and later they use the form DW_FORM_sec_offset. This patch updates the places where such attributes are generated to use the appropriate form depending on the Dwarf version. The DIE entries affected have the following tags: DW_AT_stmt_list, DW_AT_ranges, DW_AT_location, DW_AT_GNU_pubnames, DW_AT_GNU_pubtypes, DW_AT_GNU_addr_base, DW_AT_GNU_ranges_base It also adds a hidden command line option "--dwarf-version=<uint>" to llc which allows the version of Dwarf to be generated to override what is specified in the metadata; this makes it possible to update existing tests to check the debugging information generated for both Dwarf 4 (the default) and Dwarf 3 using the same metadata. Patch (slightly modified) by Keith Walker! llvm-svn: 195391	2013-11-21 23:46:41 +00:00
Ekaterina Romanova	eda4e2e4a7	SHLD/SHRD are VectorPath (microcode) instructions known to have poor latency on certain architectures. While generating SHLD/SHRD instructions is acceptable when optimizing for size, optimizing for speed on these platforms should be implemented using alternative sequences of instructions composed of add, adc, shr, shl, or and lea which are directPath instructions. These alternative instructions not only have a lower latency but they also increase the decode bandwidth by allowing simultaneous decoding of a third directPath instruction. AMD's processors family K7, K8, K10, K12, K15 and K16 are known to have SHLD/SHRD instructions with very poor latency. Optimization guides for these processors recommend using an alternative sequence of instructions. For these AMD's processors, I disabled folding (or (x << c) \| (y >> (64 - c))) when we are not optimizing for size. It might be beneficial to disable this folding for some of the Intel's processors. However, since I couldn't find specific recommendations regarding using SHLD/SHRD instructions on Intel's processors, I haven't disabled this peephole for Intel. llvm-svn: 195383	2013-11-21 23:21:26 +00:00
Peter Collingbourne	89b5505b6b	Introduce two command-line flags for the instrumentation pass to control whether the labels of pointers should be ignored in load and store instructions The new command line flags are -dfsan-ignore-pointer-label-on-store and -dfsan-ignore-pointer-label-on-load. Their default value matches the current labelling scheme. Additionally, the function __dfsan_union_load is marked as readonly. Patch by Lorenzo Martignoni! Differential Revision: http://llvm-reviews.chandlerc.com/D2187 llvm-svn: 195382	2013-11-21 23:20:54 +00:00
Eric Christopher	b1461615d9	Move member variable up to where the rest of non-DWARF5 variables reside. llvm-svn: 195380	2013-11-21 22:56:11 +00:00
Daniel Sanders	0e60951a47	[mips][msa] Fix a corner case in performORCombine() when combining nodes into VSELECT. Mask == ~InvMask asserts if the width of Mask and InvMask differ. The combine isn't valid (with two exceptions, see below) if the widths differ so test for this before testing Mask == ~InvMask. In the specific cases of Mask=~0 and InvMask=0, as well as Mask=0 and InvMask=~0, the combine is still valid. However, there are more appropriate combines that could be used in these cases such as folding x & 0 to 0, or x & ~0 to x. llvm-svn: 195364	2013-11-21 16:11:31 +00:00
Artyom Skrobov	3d8780e502	[ARM] add basic Cortex-A7 support to LLVM backend llvm-svn: 195358	2013-11-21 14:03:21 +00:00
Daniel Sanders	a556d0abd7	Add support for legalizing SETNE/SETEQ by inverting the condition code and the result of the comparison. Summary: LegalizeSetCCCondCode can now legalize SETEQ and SETNE by returning the inverse condition and requesting that the caller invert the result of the condition. The caller of LegalizeSetCCCondCode must handle the inverted CC, and they do so as follows: SETCC, BR_CC: Invert the result of the SETCC with SelectionDAG::getNOT() SELECT_CC: Swap the true/false operands. This is necessary for MSA which lacks an integer SETNE instruction. Reviewers: resistor CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2229 llvm-svn: 195355	2013-11-21 13:24:49 +00:00
Evgeniy Stepanov	405470ce24	[msan] Propagate condition origin in select instruction. llvm-svn: 195349	2013-11-21 12:00:24 +00:00
Daniel Sanders	5e17920764	[mips][msa/dsp] Only do DSP combines if DSP is enabled. Fixes a crash (null pointer dereferenced) when MSA is enabled. llvm-svn: 195343	2013-11-21 11:40:14 +00:00
NAKAMURA Takumi	656eba2e1e	Whitespace. llvm-svn: 195341	2013-11-21 11:08:31 +00:00
NAKAMURA Takumi	add94e0548	Revert r195317 (and r195333), "Teach ISel not to optimize 'optnone' functions." It broke, at least, i686 target. It is reproducible with "llc -mtriple=i686-unknown". FYI, it didn't appear to add either "-O0" or "-fast-isel". llvm-svn: 195339	2013-11-21 10:55:15 +00:00
Chandler Carruth	a087921555	[PM] Widen the interface for invalidate on an analysis result now that it is completely optional, and sink the logic for handling the preserved analysis set into it. This allows us to implement the delegation logic desired in the proxy module analysis for the function analysis manager where if the proxy itself is preserved we assume the set of functions hasn't changed and we do a fine grained invalidation by walking the functions in the module and running the invalidate for them all at the manager level and letting it try to invalidate any passes. This in turn makes it blindingly obvious why we should hoist the invalidate trait and have two collections of results. That allows handling invalidation for almost all analyses without indirect calls and it allows short circuiting when the preserved set is all. llvm-svn: 195338	2013-11-21 10:53:05 +00:00
Ana Pazos	86d72bbede	Implemented Neon scalar vdup_lane intrinsics. Fixed scalar dup alias and added test case. llvm-svn: 195330	2013-11-21 08:16:15 +00:00
Ana Pazos	5ddc31e426	Implemented Neon scalar by element intrinsics. Intrinsics implemented: vqdmull_lane, vqdmulh_lane, vqrdmulh_lane, vqdmlal_lane, vqdmlsl_lane scalar Neon intrinsics. llvm-svn: 195327	2013-11-21 07:37:04 +00:00
Kostya Serebryany	1513e9969b	Don't speculate loads under ThreadSanitizer Summary: Don't speculate loads under ThreadSanitizer. This fixes https://code.google.com/p/thread-sanitizer/issues/detail?id=40 Also discussed here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-November/067929.html Reviewers: chandlerc Reviewed By: chandlerc CC: llvm-commits, dvyukov Differential Revision: http://llvm-reviews.chandlerc.com/D2227 llvm-svn: 195324	2013-11-21 07:29:28 +00:00
Bill Wendling	07a5510fa2	The basic problem is that some mainstream programs cannot deal with the way clang optimizes tail calls, as in this example: int foo(void); int bar(void) { return foo(); } where the call is transformed to: calll .L0$pb .L0$pb: popl %eax .Ltmp0: addl $_GLOBAL_OFFSET_TABLE_+(.Ltmp0-.L0$pb), %eax movl foo@GOT(%eax), %eax popl %ebp jmpl *%eax # TAILCALL However, the GOT references must all be resolved at dlopen() time, and so this approach cannot be used with lazy dynamic linking (e.g. using RTLD_LAZY), which usually populates the PLT with stubs that perform the actual resolving. This patch changes X86TargetLowering::LowerCall() to skip tail call optimization, if the called function is a global or external symbol. Patch by Dimitry Andric! PR15086 llvm-svn: 195318	2013-11-21 07:04:30 +00:00
Paul Robinson	eba6ab82dd	Teach ISel not to optimize 'optnone' functions. Based on work by Andrea Di Biagio. llvm-svn: 195317	2013-11-21 06:33:32 +00:00
Reed Kotler	caba86b795	Add, to constant islands, long jumps similar to ARM far branch. llvm-svn: 195312	2013-11-21 05:13:23 +00:00
Yuchen Wu	5480edb76f	llvm-cov: Don't assume FileChecksum was generated. For cases where emitProfileArcs() was called but emitProfileNotes() was not, set the CfgChecksum to 0. llvm-svn: 195311	2013-11-21 04:53:39 +00:00
Yuchen Wu	ef87eca111	llvm-cov: Formatting change. llvm-svn: 195310	2013-11-21 04:12:10 +00:00
Yuchen Wu	d218a85f8c	llvm-cov: Fixed some bugs related to file checksum. Added call to update CfgChecksum. Made FileChecksum a vector, separate for each source file. llvm-svn: 195309	2013-11-21 04:01:05 +00:00
Chandler Carruth	dbfa25a6b6	[PM] Add a module analysis pass proxy for the function analysis manager. This proxy will fill the role of proxying invalidation events down IR unit layers so that when a module changes we correctly invalidate function analyses. Currently this is a very coarse solution -- any change blows away the entire thing -- but the next step is to make invalidation handling more nuanced so that we can propagate specific amounts of invalidation from one layer to the next. The test is extended to place a module pass between two function pass managers each of which have preserved function analyses which get correctly invalidated by the module pass that might have changed what functions are even in the module. llvm-svn: 195304	2013-11-21 02:11:31 +00:00
Eric Christopher	14dfe16fc9	Move DebugInfoOffset member near the other data member it helps describe. llvm-svn: 195299	2013-11-21 01:29:16 +00:00
Eric Christopher	5cb8bb6bc9	Reflow some documentation and remove whitespace comments. Move DebugInfoOffset data member up with the rest of the data members. llvm-svn: 195298	2013-11-21 01:29:13 +00:00
Eric Christopher	61450632e7	Add more documenation for the lookup tables data members. llvm-svn: 195297	2013-11-21 01:16:31 +00:00
Eric Christopher	8c89e3899c	Reorder language in the CompileUnit description and add a comment. Language may only be a temporary addition. llvm-svn: 195296	2013-11-21 01:14:00 +00:00
Eric Christopher	874a7d424a	Update comment. llvm-svn: 195293	2013-11-21 01:01:30 +00:00
Eric Christopher	d2217c5bea	Constify the DIEs used for pubname and pubtype tables. Propagate through findAttribute etc. llvm-svn: 195290	2013-11-21 00:48:22 +00:00
Nick Kledzik	11ec8eba6c	revert r194655 llvm-svn: 195285	2013-11-21 00:20:10 +00:00
Hal Finkel	fb82ed6bb5	PPC popcnt[dw] do not have record forms The instruction definitions incorrectly specified that popcntd and popcntw have record forms; they do not. This mistake was causing invalid code generation. llvm-svn: 195272	2013-11-20 20:54:55 +00:00
Benjamin Kramer	40f6475264	MachineBlockPlacement: Strengthen the source order bias when picking an exit block. We now only allow breaking source order if the exit block frequency is significantly higher than the other exit block. The actual bias is currently under a flag so the best cut-off can be found; the flag defaults to the old behavior. The idea is to get some benchmark coverage over different values for the flag and pick the best one. When we require the new frequency to be at least 20% higher than the old frequency I see a 5% speedup on zlib's deflate when compressing a random file on x86_64/westmere. Hal reported a small speedup on Fhourstones on a BG/Q and no regressions in the test suite. The test case is the full long_match function from zlib's deflate. I was reluctant to add it for previous tweaks to branch probabilities because it's large and potentially fragile, but changed my mind since it's an important use case and more likely to break with all the current work going into the PGO infrastructure. Differential Revision: http://llvm-reviews.chandlerc.com/D2202 llvm-svn: 195265	2013-11-20 19:08:44 +00:00
David Blaikie	bd559db852	DwarfCompileUnit: Initialize DebugInfoOffset. While not strictly necessary (the class has an invariant that "setDebugInfoOffset" is called before "getDebugInfoOffset" - anyone client that actually gets the default zero offset is buggy/broken) this is consistent with the code as originally written and the removal of the initialization was an accident in r195166. Suggested by Manman Ren. llvm-svn: 195263	2013-11-20 18:52:39 +00:00
David Blaikie	7888a33645	CR feedback for r195166: Add comments regarding type unit mapping and type units disabling cross-CU sharing. Changes suggested by Manman Ren. llvm-svn: 195262	2013-11-20 18:40:16 +00:00
Chandler Carruth	ffaacacd23	Make the moved-from SmallPtrSet be a valid, empty, small-state object. Enhance the tests to actually require moves in C++11 mode, in addition to testing the moved-from state. Further enhance the tests to cover copy-assignment into a moved-from object and moving a large-state object. (Note that we can't really test small-state vs. large-state as that isn't an observable property of the API really.) This should finish addressing review on r195239. llvm-svn: 195261	2013-11-20 18:29:56 +00:00
Daniel Sanders	9a39d9e82f	[mips][msa] Pseudo instructions require HasMSA too. Inherit from MSAPseudo instead of MipsPseudo There's no test case for this commit. This is because it is doubtful that the incorrect behaviour can actually trigger. When MSA is not enabled, the type legalizer should have eliminated all occurrences of patterns the affected pseudo-instruction could possibly match before instruction selection occurs. llvm-svn: 195252	2013-11-20 14:32:28 +00:00
Daniel Sanders	276174cc51	[mips][msa] Remove unused instruction class MSA_I8_X_DESC_BASE llvm-svn: 195245	2013-11-20 13:01:10 +00:00
Chandler Carruth	5bbc7e8ce9	[PM] Add the preservation system to the new pass manager. This adds a new set-like type which represents a set of preserved analysis passes. The set is managed via the opaque PassT::ID() void*s. The expected convenience templates for interacting with specific passes are provided. It also supports a symbolic "all" state which is represented by an invalid pointer in the set. This state is nicely saturating as it comes up often. Finally, it supports intersection which is used when finding the set of preserved passes after N different transforms. The pass API is then changed to return the preserved set rather than a bool. This is much more self-documenting than the previous system. Returning "none" is a conservatively correct solution just like returning "true" from todays passes and not marking any passes as preserved. Passes can also be dynamically preserved or not throughout the run of the pass, and whatever gets returned is the binding state. Finally, preserving "all" the passes is allowed for no-op transforms that simply can't harm such things. Finally, the analysis managers are changed to instead of blindly invalidating all of the analyses, invalidate those which were not preserved. This should rig up all of the basic preservation functionality. This also correctly combines the preservation moving up from one IR-layer to the another and the preservation aggregation across N pass runs. Still to go is incrementally correct invalidation and preservation across IR layers incrementally during N pass runs. That will wait until we have a device for even exposing analyses across IR layers. While the core of this change is obvious, I'm not happy with the current testing, so will improve it to cover at least some of the invalidation that I can test easily in a subsequent commit. llvm-svn: 195241	2013-11-20 11:31:50 +00:00
Chandler Carruth	8070950ee8	Give SmallPtrSet move semantics when we have R-value references. Somehow, this ADT got missed which is moderately terrifying considering the efficiency of move for it. The code to implement move semantics for it is pretty horrible currently but was written to reasonably closely match the rest of the code. Unittests that cover both copying and moving (at a basic level) added. llvm-svn: 195239	2013-11-20 11:14:33 +00:00
NAKAMURA Takumi	efd1623a5d	X86ISelLowering.cpp: Mark a variable VT as LLVM_ATTRIBUTE_UNUSED. [-Wunused-variable] llvm-svn: 195238	2013-11-20 10:55:22 +00:00
NAKAMURA Takumi	d114df4bce	Whitespace. llvm-svn: 195237	2013-11-20 10:55:15 +00:00
Elena Demikhovsky	a11395e99e	Fixed compilation error. llvm-svn: 195230	2013-11-20 09:23:22 +00:00
Elena Demikhovsky	692524f3bd	AVX-512: Concat 4 128-bit vectors in one 512-bit vector. llvm-svn: 195229	2013-11-20 09:10:40 +00:00
Chandler Carruth	37fa148ed0	[PM] Make the function pass manager more regular. The FunctionPassManager is now itself a function pass. When run over a function, it runs all N of its passes over that function. This is the 1:N mapping in the pass dimension only. This allows it to be used in either a ModulePassManager or potentially some other manager that works on IR units which are supersets of Functions. This commit also adds the obvious adaptor to map from a module pass to a function pass, running the function pass across every function in the module. The test has been updated to use this new pattern. llvm-svn: 195192	2013-11-20 04:39:16 +00:00
Yuchen Wu	734fa40b2a	llvm-cov: Added file checksum to gcno and gcda files. Instead of permanently outputting "MVLL" as the file checksum, clang will create gcno and gcda checksums by hashing the destination block numbers of every arc. This allows for llvm-cov to check if the two gcov files are synchronized. Regenerated the test files so they contain the checksum. Also added negative test to ensure error when the checksums don't match. llvm-svn: 195191	2013-11-20 04:15:05 +00:00
Chandler Carruth	9f55f1934e	[PM] Split the analysis manager into a function-specific interface and a module-specific interface. This is the first of many steps necessary to generalize the infrastructure such that we can support both a Module-to-Function and Module-to-SCC-to-Function pass manager nestings. After a lot of attempts that never worked and didn't even make it to a committable state, it became clear that I had gotten the layering design of analyses flat out wrong. Four days later, I think I have most of the plan for how to correct this, and I'm starting to reshape the code into it. This is just a baby step I'm afraid, but starts separating the fundamentally distinct concepts of function analysis passes and module analysis passes so that in subsequent steps we can effectively layer them, and have a consistent design for the eventual SCC layer. As part of this, I've started some interface changes to make passes more regular. The module pass accepts the module in the run method, and some of the constructor parameters are gone. I'm still working out exactly where constructor parameters vs. method parameters will be used, so I expect this to fluctuate a bit. This actually makes the invalidation less "correct" at this phase, because now function passes don't invalidate module analysis passes, but that was actually somewhat of a misfeature. It will return in a better factored form which can scale to other units of IR. The documentation has gotten less verbose and helpful. llvm-svn: 195189	2013-11-20 04:01:38 +00:00
Hal Finkel	d1fc028d62	PPC: Optimize rldicl generation for masked shifts Masking operations (where only some number of the low bits are being kept) are selected to rldicl(x, 0, mb). If x is a logical right shift (which would become rldicl(y, 64-n, n)), we might be able to fold the two instructions together: rldicl(rldicl(x, 64-n, n), 0, mb) -> rldicl(x, 64-n, mb) for n <= mb The right shift is really a left rotate followed by a mask, and if the explicit mask is a more-restrictive sub-mask of the mask implied by the shift, only one rldicl is needed. llvm-svn: 195185	2013-11-20 01:10:15 +00:00
Eric Christopher	08442ee86e	Remove polymorphic destruction for DIE. DIEBlocks are owned elsewhere and not polymorphically deleted and they are the only thing that derive from DIE. llvm-svn: 195183	2013-11-20 00:54:31 +00:00
Eric Christopher	3d1796838e	Remove capability for polymorphic destruction from LexicalScope and LexicalScopes, we're not using it. llvm-svn: 195182	2013-11-20 00:54:28 +00:00
Eric Christopher	84ee513dcd	Grammar. llvm-svn: 195181	2013-11-20 00:54:25 +00:00
Eric Christopher	2624bc8ba4	Formatting, 80-col, trailing whitespace. llvm-svn: 195180	2013-11-20 00:54:19 +00:00
Jack Carter	06678b5af4	long line correction llvm-svn: 195179	2013-11-20 00:32:32 +00:00
Jack Carter	7a28c79335	long line correction llvm-svn: 195175	2013-11-20 00:12:44 +00:00
Filip Pizlo	d0169a8474	Expose the fence instruction via the C API. llvm-svn: 195173	2013-11-20 00:07:49 +00:00
Aditya Nandakumar	5054946d57	Fixed an extra for(typo) in the comments llvm-svn: 195171	2013-11-19 23:51:32 +00:00
Jack Carter	122959c3e9	long lines and white space correction llvm-svn: 195170	2013-11-19 23:43:22 +00:00
David Blaikie	e40c1e850f	DebugInfo: Partial implementation of DWARF type units. Emit DW_TAG_type_units into the debug_info section using compile unit headers. This is bogus/unusable by debuggers, but testable and provides more isolated review. Subsequent patches will include support for type unit headers and emission into the debug_types section, as well as comdat grouping the types based on their hash. Also the CompileUnit type will be renamed 'Unit' and relevant portions pulled out into respective CompileUnit and TypeUnit types. llvm-svn: 195166	2013-11-19 23:08:21 +00:00
David Blaikie	395dc41d93	DebugInfo: Constify accelerator table handling, and separate type accelarator insertion in preparation for a second use of this code from type units. llvm-svn: 195164	2013-11-19 22:51:04 +00:00
Arnold Schwaighofer	242935ec8c	SLPVectorizer: Fix stale for Value pointer array We are slicing an array of Value pointers and process those slices in a loop. The problem is that we might invalidate a later slice by vectorizing a former slice. Use a WeakVH to track the pointer. If the pointer is deleted or RAUW'ed we can tell. The test case will only fail when running with libgmalloc. radar://15498655 llvm-svn: 195162	2013-11-19 22:20:20 +00:00
Arnold Schwaighofer	3149313505	SLPVectorizer: Fix whitespace errors llvm-svn: 195161	2013-11-19 22:20:18 +00:00
Petar Jovanovic	ddac1ebfb9	[mips] Resolve relocation for the stubs in MCJIT when load address is known Instead of processing relocation for branch to stubs right away, emit a modified relocation and add it to queue to be resolved later when final load address is known. This resolves seven MIPS MCJIT issues that were caused by missing relocation fixups at the end. llvm-svn: 195157	2013-11-19 21:56:00 +00:00
Juergen Ributzka	8e480fdae5	[DAG] Refactor vector splitting code in SelectionDAG. No functional change intended. Reviewed by Tom llvm-svn: 195156	2013-11-19 21:20:17 +00:00
Rafael Espindola	4833910f66	Make it explicit that nulls are not allowed in names. The object files we support use null terminated strings, so there is no way to support these. This patch adds an assert to catch bad API use and an error check in the .ll parser. llvm-svn: 195155	2013-11-19 21:12:39 +00:00
Yuchen Wu	aba14a2b9e	llvm-cov: Moved printing after error checks. llvm-svn: 195153	2013-11-19 20:57:20 +00:00
Jack Carter	6943b6e5c6	reverts 195057 per request llvm-svn: 195152	2013-11-19 20:53:28 +00:00
Yuchen Wu	3f48050d3c	llvm-cov: Added constness property to methods. Added constness to methods that shouldn't modify objects. Replaced operator[] lookup in maps with find() instead. llvm-svn: 195151	2013-11-19 20:33:32 +00:00
Benjamin Kramer	f5a7aea34b	DataLayout: value initialize globals to avoid static construction. llvm-svn: 195150	2013-11-19 20:28:04 +00:00
Rafael Espindola	5d21406399	Support multiple COFF sections with the same name but different COMDAT. This is the first step to fix pr17918. It extends the .section directive a bit, inspired by what the ELF one looks like. The problem with using linkonce is that given .section foo .linkonce.... .section foo .linkonce we would already have switched sections when getting to .linkonce. The cleanest solution seems to be to add the comdat information in the .section itself. llvm-svn: 195148	2013-11-19 19:52:52 +00:00
Andrew Trick	a5f0869aed	Obvious pasto survived a couple rounds of cleanup. Caught by Aaron Ballman. llvm-svn: 195138	2013-11-19 18:29:45 +00:00
John Thompson	844100cb0c	YAML I/O - Added default trait support for std:string. Making another attempt at this, this time doing a clean build on Linux, and running the LLVM, clang, and extra tests, to try to make sure there's no problems. llvm-svn: 195134	2013-11-19 17:28:21 +00:00
Cameron McInally	9232b52359	Fix assembly operands for the SSE2 cvtsd2ss instruction. llvm-svn: 195129	2013-11-19 14:36:00 +00:00
Simon Atanasyan	226923909e	[Mips] Adjust float ABI settings in case of MIPS16 mode. Hard float for mips16 means essentially to compile as soft float but to use a runtime library for soft float that is written with native mips32 floating point instructions (those runtime routines run in mips32 hard float mode). The patch reviewed by Reed Kotler. llvm-svn: 195123	2013-11-19 12:20:17 +00:00
Eric Christopher	03cbb881d9	Formatting and 80-col. llvm-svn: 195122	2013-11-19 09:28:34 +00:00
Eric Christopher	8b203da235	Fix comment. llvm-svn: 195121	2013-11-19 09:11:26 +00:00
Eric Christopher	525b98f23c	Refactor the section emission code to remove duplicates now that we can emit various sections in any order. No functional change. llvm-svn: 195120	2013-11-19 09:04:50 +00:00
Eric Christopher	c80c663188	Reformat file. llvm-svn: 195119	2013-11-19 09:04:36 +00:00
Chandler Carruth	4d8e469cd3	Fix an issue where SROA computed different results based on the relative order of slices of the alloca which have exactly the same size and other properties. This was found by a perniciously unstable sort implementation used to flush out buggy uses of the algorithm. The fundamental idea is that findCommonType should return the best common type it can find across all of the slices in the range. There were two bugs here previously: 1) We would accept an integer type smaller than a byte-width multiple, and if there were different bit-width integer types, we would accept the first one. This caused an actual failure in the testcase updated here when the sort order changed. 2) If we found a bad combination of types or a non-load, non-store use before an integer typed load or store we would bail, but if we found the integere typed load or store, we would use it. The correct behavior is to always use an integer typed operation which covers the partition if one exists. While a clever debugging sort algorithm found problem #1 in our existing test cases, I have no useful test case ideas for #2. I spotted in by inspection when looking at this code. llvm-svn: 195118	2013-11-19 09:03:18 +00:00
Michael Ilseman	0f69f39907	Add support for software expansion of 64-bit integer division instructions. Patch by Dmitri Shtilman! llvm-svn: 195116	2013-11-19 06:54:19 +00:00
Andrew Trick	5b8040c957	Fix patchpoint comments. llvm-svn: 195103	2013-11-19 05:05:43 +00:00
Andrew Trick	9f7d826e8a	Use symbolic operands in the patchpoint folding routine and fix a spilling bug. Fixes <rdar://15487687> [JS] AnyRegCC argument ends up being spilled llvm-svn: 195094	2013-11-19 03:29:59 +00:00
Andrew Trick	15aac659a7	Add an abstraction to handle patchpoint operands. Hard-coded operand indices were scattered throughout lowering stages and layers. It was super bug prone. llvm-svn: 195093	2013-11-19 03:29:56 +00:00
Hao Liu	b26dfe0306	Implement AArch64 neon instructions class SIMD lsone and SIMD lone-post. llvm-svn: 195078	2013-11-19 02:17:05 +00:00
Eric Christopher	106d6557a8	Remove unused special member functions and reformat. llvm-svn: 195077	2013-11-19 02:01:07 +00:00
Eric Christopher	51aecac48c	Fix previous commit and fully remove variable. llvm-svn: 195076	2013-11-19 01:52:38 +00:00
Eric Christopher	0d2c47bfb4	Remove unused variable. llvm-svn: 195075	2013-11-19 01:50:29 +00:00
Jiangning Liu	42b7a215f4	Implement AArch64 SISD intrinsics for vget_high and vget_low. llvm-svn: 195074	2013-11-19 01:46:48 +00:00
Kevin Qin	7b74269765	implement MC layer of AArch64 neon instruction PMULL and PMULL2 with 128 bit integer. llvm-svn: 195072	2013-11-19 01:40:25 +00:00
Jiangning Liu	7c858f236d	Add predicate for AArch64 crypto instructions. llvm-svn: 195071	2013-11-19 01:38:31 +00:00
Jack Carter	8bb31d387d	[Mips] Support for MicroMips STO refactoring. No true functional changes. Change the "hack" name of emitMipsHackSTOCG to emitSymSTO. Remove demonstration code in AsmParser for emitMipsHackSTOCG and emitMipsHackELFFlags. The STO field is in an ELF symbol and is not an explicit directive. That said, we are missing the compliment call in AsmParser and that will need to be addressed soon. XFAIL dummy tests for emitMipsHackELFFlags and emitMipsHackELFFlags. These will built out with following patches. llvm-svn: 195067	2013-11-19 01:25:18 +00:00
Juergen Ributzka	5357a6d64b	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. The memory leaks in this version have been fixed. Thanks Alexey for pointing them out. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 195064	2013-11-19 00:57:56 +00:00
David Blaikie	f941cd8a75	DwarfDebug: Move trailing else to the same line as prior closing brace llvm-svn: 195060	2013-11-18 23:59:04 +00:00
David Blaikie	0fc401fb8f	DwarfDebug: Remove some more redundant explicit constructions. llvm-svn: 195059	2013-11-18 23:57:26 +00:00
Jack Carter	4091753064	[Mips] MipsTargetStreamer refactoring. No functionality changes. llvm-svn: 195057	2013-11-18 23:55:27 +00:00
David Blaikie	f1abd8ad9c	DebugInfo: Simplify a few more explicit constructions, underconstrained types, and make DIType(MDNode) explicit like all the other DI node ctors. llvm-svn: 195055	2013-11-18 23:33:32 +00:00
Reid Kleckner	552118c34a	Revert "COFF: Emit all MCSymbols rather than filtering out some of them" This reverts commit r190888, to fix PR17967. The original change wasn't the right way to get @feat.00 into the object file. The right fix is to make @feat.00 be a global symbol. llvm-svn: 195053	2013-11-18 23:08:12 +00:00
Adrian Prantl	afeac86924	Debug info: Let LowerDbgDeclare perfom the dbg.declare -> dbg.value lowering only for load/stores to scalar allocas. The resulting values confuse the backend and don't add anything because we can describe array-allocas with a dbg.declare intrinsic just fine. rdar://problem/15464571 llvm-svn: 195052	2013-11-18 23:04:38 +00:00
Paul Robinson	27ef03dc70	The 'optnone' attribute means don't inline anything into this function (except functions marked always_inline). Functions with 'optnone' must also have 'noinline' so they don't get inlined into any other function. Based on work by Andrea Di Biagio. llvm-svn: 195046	2013-11-18 21:44:03 +00:00
Matt Arsenault	be108f1643	R600/SI: Fix moveToVALU when the first operand is VSrc. Moving into a VSrc doesn't always work, since it could be replaced with an SGPR later. llvm-svn: 195042	2013-11-18 20:09:55 +00:00
Matt Arsenault	cdea5c8fe0	R600/SI: Fix multiple SGPR reads when using VCC. No other SGPR operands are allowed, so if VCC is used, move the other to a VGPR. llvm-svn: 195041	2013-11-18 20:09:50 +00:00
Matt Arsenault	485f69c9cf	R600/SI: Implement add i64, but do not yet enable. Test doesn't actually check the output. I need to fix add i64 being matched for the addressing calculations. llvm-svn: 195040	2013-11-18 20:09:47 +00:00
Matt Arsenault	2b3d70daa8	R600/SI: Specify SSrc operands llvm-svn: 195039	2013-11-18 20:09:43 +00:00
Matt Arsenault	1e729e94db	R600/SI: addc / adde i32 are legal llvm-svn: 195038	2013-11-18 20:09:40 +00:00

... 3 4 5 6 7 ...

65767 Commits