llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
David Blaikie	5e586ea3ed	DwarfDebug: Avoid unnecessary abbreviation lookup when emitting DIEs DIEs already contain references directly to their DIEAbbrev, use that instead of looking it up based on index. llvm-svn: 196446	2013-12-05 01:01:41 +00:00
David Blaikie	a2745869cd	DwarfDebug: Remove trivial function wrapper llvm-svn: 196445	2013-12-05 01:01:37 +00:00
Eric Christopher	b81841285f	80-column. llvm-svn: 196442	2013-12-05 00:36:21 +00:00
Eric Christopher	2d6d0fc3f2	Remove special handling for DW_AT_ranges support by constructing the values with the correct behavior. llvm-svn: 196441	2013-12-05 00:36:17 +00:00
Logan Chien	558333e1e1	[mc] Fix ELF st_other flag. ELF_Other_Weakref and ELF_Other_ThumbFunc seems to be LLVM internal ELF symbol flags. These should not be emitted to object file. This commit defines ELF_STO_Shift for the target-defined flags for st_other, and increase the value of ELF_Other_Shift to 16. llvm-svn: 196440	2013-12-05 00:34:11 +00:00
Eric Christopher	d051658b5c	Fix comment. llvm-svn: 196437	2013-12-05 00:13:15 +00:00
Cameron McInally	675f9245aa	Add AVX512 patterns for v16i32 broadcast and v2i64 zero extend load. Patch by Aleksey Bader. llvm-svn: 196435	2013-12-05 00:11:25 +00:00
Eric Christopher	4670039c5b	Fix typo. llvm-svn: 196434	2013-12-04 23:55:09 +00:00
David Blaikie	561838d222	DwarfUnit: Correct comment by generalizing over all units, not just compilation units. Code review feedback on r196394 by Paul Robinson. llvm-svn: 196433	2013-12-04 23:39:02 +00:00
Kevin Enderby	218f72b95b	Fix a bug in darwin's 32-bit X86 handling of evaluating fixups. Where it would use a scattered relocation entry but falls back to a normal relocation entry because the FixupOffset is more than 24-bits. The bug is in the X86MachObjectWriter::RecordScatteredRelocation() where it changes reference parameter FixedValue but then returns false to indicate it did not create a scattered relocation entry. The fix is simply to save the original value of the parameter FixedValue at the start of the method and restore it if we are returning false in that case. rdar://15526046 llvm-svn: 196432	2013-12-04 23:36:24 +00:00
Eric Christopher	f46aa7d453	Update comment. llvm-svn: 196431	2013-12-04 23:24:38 +00:00
Eric Christopher	a054e191ad	Update comment. llvm-svn: 196430	2013-12-04 23:24:28 +00:00
Eric Christopher	435de44e9d	Remove incorrect comment and pointless cast. llvm-svn: 196427	2013-12-04 23:05:21 +00:00
Eric Christopher	f8ead1f600	const on its own line is confusing. llvm-svn: 196426	2013-12-04 22:54:45 +00:00
David Peixotto	b6710ff7c7	Add support for parsing ARM symbol variants on ELF targets ARM symbol variants are written with parens instead of @ like this: .word __GLOBAL_I_a(target1) This commit adds support for parsing these symbol variants in expressions. We introduce a new flag to MCAsmInfo that indicates the parser should use parens to parse the symbol variant. The expression parser is modified to look for symbol variants using parens instead of @ when the corresponding MCAsmInfo flag is true. The MCAsmInfo parens flag is enabled only for ARM on ELF. By adding this flag to MCAsmInfo, we are able to get rid of redundant ARM-specific symbol variants and use the generic variants instead (e.g. VK_GOT instead of VK_ARM_GOT). We use the new UseParensForSymbolVariant attribute in MCAsmInfo to correctly print the symbol variants for arm. To achive this we need to keep a handle to the MCAsmInfo in the MCSymbolRefExpr class that we can check when printing the symbol variant. Updated Tests: Changed case of symbol variant to match the generic kind. test/CodeGen/ARM/tls-models.ll test/CodeGen/ARM/tls1.ll test/CodeGen/ARM/tls2.ll test/CodeGen/Thumb2/tls1.ll test/CodeGen/Thumb2/tls2.ll PR18080 llvm-svn: 196424	2013-12-04 22:43:20 +00:00
Eric Christopher	1d7adf97f3	Simplify check. llvm-svn: 196422	2013-12-04 22:29:02 +00:00
Eric Christopher	b46f56f8d6	Reformat slightly. llvm-svn: 196421	2013-12-04 22:26:43 +00:00
Eric Christopher	5d21b8449e	Make RangeSpanList take a symbol for the beginning of the range rather than magically making the names match. llvm-svn: 196419	2013-12-04 22:04:50 +00:00
David Blaikie	cd548dcc12	DwarfDebug: Unconditionalize trivial asm comments While we still have a few (~4) non-trivial comments with string concatenation, etc that should remain conditionalized, these trivial literal comments can be simplified. llvm-svn: 196416	2013-12-04 21:51:05 +00:00
David Blaikie	7ce191027c	DwarfDebug: Reduce code duplication for sec offset emission llvm-svn: 196414	2013-12-04 21:31:26 +00:00
Eric Christopher	522b2d4cf3	Couple of small logical cleanups to use !empty rather than other checks. No functional change. llvm-svn: 196412	2013-12-04 21:20:15 +00:00
Yuchen Wu	9559c6af5d	llvm-cov: Replace size() with empty() in bool check. llvm-svn: 196400	2013-12-04 19:18:23 +00:00
Eric Christopher	93cee79b17	Use move and stack allocation for RangeSpanLists. As a result make a few things more const as well because we're now using const references to refer to iterators. llvm-svn: 196398	2013-12-04 19:06:58 +00:00
David Blaikie	f92edbcbd0	DebugInfo: Remove unused start/end labels for the debug_abbrevs section Since we always emit only one abbrevation section (shared by all the compilation units in this module) there's no need for a separate label at the start of each one (and we weren't using the CU ID anyway, so there really was only one label). Use the section label instead and drop the wholely unused debug_abbrev_end label. llvm-svn: 196394	2013-12-04 18:12:28 +00:00
Cameron McInally	97a9fa294d	Fix assembly syntax for AVX512 vector blend instructions. llvm-svn: 196393	2013-12-04 18:05:36 +00:00
David Blaikie	1ca7b250be	DebugInfo: Avoid recreating matching labels in disparate places. Instead, reuse the same MCSymbol - this should make the code easier to follow by avoiding hard to trace dependencies between different bits of code. llvm-svn: 196392	2013-12-04 17:55:41 +00:00
Michael Liao	7de99b9277	[X86] Check YMM31/ZMM31 as well - No test case as there's no calling convention preserve YMM31/ZMM31 only llvm-svn: 196391	2013-12-04 17:44:22 +00:00
Chad Rosier	59cd9a3090	Update the UseFusedMAC definition to directly specify its dependence on having VFP4. Patch by Daniel Stewart! llvm-svn: 196390	2013-12-04 17:16:36 +00:00
Cameron McInally	9c9a78a238	Suppress '(x < y) ? a : 0 -> (x < y) & a' transform on X86 architectures with dedicated mask registers. Patch by Aleksey Bader. llvm-svn: 196386	2013-12-04 14:52:33 +00:00
Daniel Jasper	ca41e63412	Un-revert r196358: "llvm-cov: Added support for function checksums." And add the proper fix. llvm-svn: 196367	2013-12-04 08:57:17 +00:00
Daniel Jasper	a7dd8af910	Revert r196358: "llvm-cov: Added support for function checksums." This currently breaks clang/test/CodeGen/code-coverage.c. The root cause is that the newly introduced access to Funcs[j] is out of bounds. llvm-svn: 196365	2013-12-04 08:23:33 +00:00
Kevin Qin	f5b717aa75	[AArch64 Neon] Add ACLE intrinsic vceqz_f64. llvm-svn: 196362	2013-12-04 08:02:34 +00:00
Kevin Qin	f93a2e8673	[AArch64 NEON] Add missing compare intrinsics. llvm-svn: 196360	2013-12-04 07:53:28 +00:00
Yuchen Wu	b1a23c9951	llvm-cov: Added support for function checksums. The function checksums are hashed from the concatenation of the function name and line number. llvm-svn: 196358	2013-12-04 06:00:17 +00:00
Yuchen Wu	37b184215d	llvm-cov: Added checks for ident, checksum, name. Added additional checks for the Identifier, CfgChecksum and Name for each GCOVFunction. Also added function names in error messages. llvm-svn: 196356	2013-12-04 05:42:28 +00:00
Yuchen Wu	01a610d829	llvm-cov: Capitalized GCNO and GCDA for consistency. llvm-svn: 196354	2013-12-04 05:07:36 +00:00
Yuchen Wu	0b08e75d64	llvm-cov: Split GCOVFile's read into GCNO and GCDA. This splits the file-scope read() function into readGCNO() and readGCDA(). Also broke file format read into functions that first read the file type, then check the version. llvm-svn: 196353	2013-12-04 04:49:23 +00:00
Rafael Espindola	cd5c36b48b	Fix a funny typo. Thanks for j`ey and Sean Silva for noticing it. llvm-svn: 196344	2013-12-04 02:26:54 +00:00
Rafael Espindola	f394b917ee	Produce deterministic coff files. llvm-svn: 196341	2013-12-04 02:02:55 +00:00
Juergen Ributzka	f7f5626671	[Stackmap] Emit multi-byte nops for X86. llvm-svn: 196334	2013-12-04 00:39:08 +00:00
Reed Kotler	45b4f281f2	final patch for very long conditional branches for mips16 constant islands. this completes the basic port of ARM constant islands to Mips16. More testing, code review, cleanup is in order but basically everything seems to be working. A bug in gas is preventing some of the runtime testing but I hope to resolve this soon. llvm-svn: 196331	2013-12-03 23:42:51 +00:00
Eric Christopher	907181fbbb	Update comment grammar and contents. llvm-svn: 196323	2013-12-03 22:05:55 +00:00
Rafael Espindola	167cd3e1cb	Fix mingw32 thiscall + sret. Unlike msvc, when handling a thiscall + sret gcc will * Put the sret in %ecx * Put the this pointer is (%esp) This fixes, for example, calling stringstream::str. llvm-svn: 196312	2013-12-03 20:51:23 +00:00
Michael Gottesman	7f9ded48f1	Fixed various whitespace/spelling/80+ issues. llvm-svn: 196310	2013-12-03 20:21:17 +00:00
Reid Kleckner	09e088be4c	Return true on success in cl::ExpandResponseFiles This fixes a logic bug pointed out by Juraj Ivancic. No behavior change because none of the in-tree clients of cl::ExpandResponseFiles check the return value. In this case, the @prefixed arguments are left in the command line. Downstream command line processing has the opportunity to emit errors about it, so this isn't that bad. llvm-svn: 196306	2013-12-03 19:13:18 +00:00
Timur Iskhodzhanov	8deac270cf	Fix a typo in a comment llvm-svn: 196304	2013-12-03 18:57:43 +00:00
David Blaikie	ad61b972a6	Avoid buffer copies when a Twine already is a StringRef. llvm-svn: 196301	2013-12-03 18:18:28 +00:00
Yunzhong Gao	05c1966c8c	Teach the internalize pass to skip dllexported symbols because they could be referenced in a way that even the linker does not see. Differential Revision: http://llvm-reviews.chandlerc.com/D2280 llvm-svn: 196300	2013-12-03 18:05:14 +00:00
Timur Iskhodzhanov	2340a0ee1c	Reland 196270 "Generalize debug info / EH emission in AsmPrinter" Addressing the existense AMDGPUAsmPrinter and other subclasses of AsmPrinter llvm-svn: 196288	2013-12-03 15:10:23 +00:00
NAKAMURA Takumi	c0b01ff922	Revert r196270, "Generalize debug info / EH emission in AsmPrinter" It broke CodeGen/R600 tests with +Asserts. llvm-svn: 196272	2013-12-03 13:15:54 +00:00
Timur Iskhodzhanov	8ce8c7a5d7	Generalize debug info / EH emission in AsmPrinter llvm-svn: 196270	2013-12-03 12:05:18 +00:00
James Molloy	674f480ca4	Addrspacecasts are no-ops on ARM. Testcase added. llvm-svn: 196269	2013-12-03 11:23:11 +00:00
Richard Sandiford	f84d58699b	[SystemZ] Fix choice of known-zero mask in insertion optimization The backend converts 64-bit ORs into subreg moves if the upper 32 bits of one operand and the low 32 bits of the other are known to be zero. It then tries to peel away redundant ANDs from the upper 32 bits. Since AND masks are canonicalized to exclude known-zero bits, the test ORs the mask and the known-zero bits together before checking for redundancy. The problem was that it was using the wrong node when checking for known-zero bits, so could drop ANDs that were still needed. llvm-svn: 196267	2013-12-03 11:01:54 +00:00
Michael Liao	fecace10ac	Enhance the fix of PR17631 - The fix to PR17631 fixes part of the cases where 'vzeroupper' should not be issued before 'call' insn. There're other cases where helper calls will be inserted not limited to epilog. These helper calls do not follow the standard calling convention and won't clobber any YMM registers. (So far, all call conventions will clobber any or part of YMM registers.) This patch enhances the previous fix to cover more cases 'vzerosupper' should not be inserted by checking if that function call won't clobber any YMM registers and skipping it if so. llvm-svn: 196261	2013-12-03 09:17:32 +00:00
Hao Liu	547dc86218	[AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn llvm-svn: 196210	2013-12-03 06:06:55 +00:00
Hao Liu	f922fde3de	AArch64: add missing ACLE intrinsics mapping to general arithmetic operation from VFP instructions. E.g. float64x1_t vadd_f64(float64x1_t a, float64x1_t b) -> FADD Dd, Dn, Dm. llvm-svn: 196208	2013-12-03 05:58:30 +00:00
NAKAMURA Takumi	62ddd8c9e0	Whitespace. llvm-svn: 196203	2013-12-03 05:28:27 +00:00
Hao Liu	fea9943555	AArch64: Add missing scalar pair intrinsics. E.g. "float32_t vaddv_f32(float32x2_t a)" to be matched into "faddp s0, v1.2s". llvm-svn: 196198	2013-12-03 03:39:47 +00:00
Yuchen Wu	bf24669ebf	llvm-cov: Cleaned up print() function slightly. Changed while to for loop. Removed unnecessary if statement. llvm-svn: 196194	2013-12-03 01:35:31 +00:00
Jiangning Liu	24b3414579	Add some missing pattern matches for AArch64 Neon intrinsics like vuqadd_s64 and friends. llvm-svn: 196192	2013-12-03 01:33:52 +00:00
Jiangning Liu	3f5f9eefd0	Add some missing pattern matches for AArch64 Neon intrinsics like vmull_high_n_s16 and friends. llvm-svn: 196190	2013-12-03 01:29:32 +00:00
Rafael Espindola	b4f8027fa6	Don't set PrivateGlobalPrefix for NVPTX and R600. These targets have special asm printers that don't use these. llvm-svn: 196187	2013-12-03 01:03:35 +00:00
Yuchen Wu	cd7628e981	llvm-cov: Removed output to STDOUT/specified file. Instead of asking the user to specify a single file to output coverage info and defaulting to STDOUT, llvm-cov now creates files for each source file with a naming system of: <source filename> + ".llcov". This is what gcov does and although it can clutter the working directory with numerous coverage files, it will be easier to hook the llvm-cov output to tools which operate on this assumption (such as lcov). llvm-svn: 196184	2013-12-03 00:57:11 +00:00
Michael Gottesman	576426afb7	Added MachineBlockFrequencyInfo::view for displaying the block frequency propagation graph via graphviz. This is useful for debugging issues in the BlockFrequency implementation since one can easily visualize where probability mass and other errors occur in the propagation. This is the MI version of r194654. llvm-svn: 196183	2013-12-03 00:49:33 +00:00
Eric Christopher	9cdc5dc3f1	Refactor the handling of lexical block and inline scope ranges into a single function. No functional change. llvm-svn: 196181	2013-12-03 00:45:59 +00:00
Eric Christopher	05bec0f59b	Update doxygen tags. llvm-svn: 196180	2013-12-03 00:45:56 +00:00
Eric Christopher	e4a344abbf	Reorder member function declarations to match source order. llvm-svn: 196179	2013-12-03 00:45:54 +00:00
Eric Christopher	6532a47d78	Make ranges and range lists be a discrete entity that can be located and emitted per function and CU. Begins coalescing ranges as a first class entity through debug info. No functional change. llvm-svn: 196178	2013-12-03 00:45:45 +00:00
Yuchen Wu	7c75739af8	llvm-cov: Store blocks rather than counts per line. Each line stores all the blocks that execute on that line, instead of only storing the line counts previously accumulated. This provides more information for each line, and will be useful for options in enabling block and branch information. llvm-svn: 196177	2013-12-03 00:38:21 +00:00
Yuchen Wu	117936917a	llvm-cov: Added edge struct for traversal in block. Added GCOVEdge which are simple structs owned by the GCOVFunction that stores the source and destination GCOVBlocks, as well as the counts. Changed GCOVBlocks so that it stores a vector of source GCOVEdges and a vector of destination GCOVEdges, rather than just the block number. Storing the block number was only useful for knowing the number of edges and for debug info. Using a struct is useful for traversing the edges, especially back edges which may be needed later. llvm-svn: 196175	2013-12-03 00:24:44 +00:00
Yuchen Wu	32c43e892c	llvm-cov: Split up reading of GCNO and GCDA files. There are now two functions: readGCNO() and readGCDA(). llvm-svn: 196173	2013-12-03 00:15:49 +00:00
Manman Ren	d514bc9bbe	Debug Info: rename getDebugInfoVersionFromModule to getDebugMetadataVersionFromModule. Suggested by Eric. llvm-svn: 196172	2013-12-03 00:12:14 +00:00
Hal Finkel	2fbbf9299e	Remove PPCScoreboardHazardRecognizer PPCScoreboardHazardRecognizer was a subclass of ScoreboardHazardRecognizer which did only one thing: filtered out nodes in EmitInstruction for which DAG->getInstrDesc(SU) returned NULL. This used to be the case for PPC pseudo instructions. As far as I can tell, this is no longer true, and so we can use ScoreboardHazardRecognizer directly. llvm-svn: 196171	2013-12-02 23:52:46 +00:00
Rafael Espindola	5f33387c40	Refactor the setting of PrivateGlobalPrefix. No functionality change. llvm-svn: 196170	2013-12-02 23:39:26 +00:00
Rafael Espindola	d2ccd34a64	Don't set PrivateGlobalPrefix twice in the same function. llvm-svn: 196169	2013-12-02 23:26:31 +00:00
Rafael Espindola	d41a7b228c	Convert two char* that are only ever used as booleans to bool. llvm-svn: 196168	2013-12-02 23:04:51 +00:00
Kay Tiong Khoo	dd9c210766	Use local variable for repeated use rather than 'get' method. No functional change intended. llvm-svn: 196164	2013-12-02 22:23:32 +00:00
Kay Tiong Khoo	40a987d7ca	Move variables to where they are used and give them better names. No functional change intended. llvm-svn: 196163	2013-12-02 22:20:40 +00:00
Kay Tiong Khoo	14489a85ab	Rename variables to be consistent (CST -> Cst). No functional change intended. llvm-svn: 196161	2013-12-02 22:11:56 +00:00
David Blaikie	85fdef8bea	Remove unnecessary/commented-out header inclusion. Review feedback from Eric Christopher on r196140 llvm-svn: 196160	2013-12-02 22:11:08 +00:00
David Blaikie	8e02b3d92e	DebugInfo: Rename generic unit references to "TheU" instead of TheCU now that they might be type units instead of compile units. CR feedback from Eric Christopher on r196139. llvm-svn: 196159	2013-12-02 22:09:48 +00:00
Manman Ren	644e8f4a5b	Debug Info: drop debug info via upgrading path if version number does not match. Add a helper function getDebugInfoVersionFromModule to return the debug info version number for a module. "Verifier/module-flags-1.ll" checks for verification errors. It will seg fault when calling getDebugInfoVersionFromModule because of the incorrect format for module flags in the testing case. We make getModuleFlagsMetadata more robust by checking for error conditions. PR17982 llvm-svn: 196158	2013-12-02 21:29:56 +00:00
Chad Rosier	bcca7559f8	[AArch64] Implemented vcopy_lane patterns using scalar DUP instruction. Patch by Ana Pazos! llvm-svn: 196151	2013-12-02 21:05:16 +00:00
Mark Seaborn	b08ae3c322	InlineFunction.cpp: Remove a return value that is always false Remove some associated dead code. This cleanup is associated with PR17872. llvm-svn: 196147	2013-12-02 20:50:59 +00:00
David Blaikie	c06b6fc581	DebugInfo: Rename DwarfCompileUnit.* to DwarfUnit.* to match their contents. llvm-svn: 196140	2013-12-02 19:33:15 +00:00
David Blaikie	8f5d7bdcbe	DebugInfo: Refactor CompileUnit into a Unit baseclass and CompileUnit/TypeUnit derived classes. Header/cpp file rename to follow immediately - just splitting out the commits for ease of review/reading to demonstrate that the renaming changes are entirely mechanical. llvm-svn: 196139	2013-12-02 19:33:10 +00:00
David Blaikie	74530dc54e	DebugInfo: Type Units: Propagate the correct DW_AT_language into type units. llvm-svn: 196130	2013-12-02 18:44:29 +00:00
Kay Tiong Khoo	5257afa264	Conservative fix for PR17827 - don't optimize a shift + and + compare sequence where the shift is logical unless the comparison is unsigned llvm-svn: 196129	2013-12-02 18:43:59 +00:00
Vincent Lejeune	26780e84f1	R600: Workaround for cayman loop bug llvm-svn: 196121	2013-12-02 17:29:37 +00:00
Rafael Espindola	c36d63a948	Move getSymbolWithGlobalValueBase to TargetLoweringObjectFile. This allows it to be used in TargetLoweringObjectFileImpl.cpp. llvm-svn: 196117	2013-12-02 16:25:47 +00:00
Alp Toker	a522b043fd	Introduce poor man's consumeToken() in X86AsmParser This makes the code a little more idiomatic. No change in behaviour. llvm-svn: 196113	2013-12-02 16:06:06 +00:00
Rafael Espindola	cee11d6f43	Remove dead code. MO_JumpTableIndex and MO_ExternalSymbol don't show up on inline asm. Keeping parts of the old asm printer just to print inline asm to a string that we then parse back looks like a hack. llvm-svn: 196111	2013-12-02 15:36:37 +00:00
Rafael Espindola	5b33cec985	Output .eh_frames on COFF too now that the integrated as is used on mingw. llvm-svn: 196104	2013-12-02 14:59:34 +00:00
Tim Northover	c144b1204e	ARM: decide whether to use movw/movt based on "minsize" attribute. llvm-svn: 196102	2013-12-02 14:46:26 +00:00
NAKAMURA Takumi	5aa98ebb35	XCoreFrameLowering.cpp: Use [in,out] instead of [in] [out]. [-Wdocumentation] llvm-svn: 196094	2013-12-02 11:31:25 +00:00
Robert Lytton	aec919de4b	XCore target: Make handling of large frames not dependent upon an FP. eliminateFrameIndex() has been reworked to handle both small & large frames with either a FP or SP. An additional Slot is required for Scavenging spills when not using FP for large frames. Reworked the handling of Register Scavenging. Whether we are using an FP or not, whether it is a large frame or not, and whether we are using a large code model or not are now independent. llvm-svn: 196091	2013-12-02 11:05:28 +00:00
Tim Northover	46df9f449d	ARM: add pseudo-instructions for lit-pool global materialisation These are used by MachO only at the moment, and (much like the existing MOVW/MOVT set) work around the fact that the labels used in the actual instructions often contain PC-dependent components, which means that repeatedly materialising the same global can't be CSEed. With small modifications, it could be adapted to how ELF finds the address of _GLOBAL_OFFSET_TABLE_, which would give similar benefits in PIC mode there. llvm-svn: 196090	2013-12-02 10:35:41 +00:00
Benjamin Kramer	d153673433	XCore: Unbreak C++11 build. llvm-svn: 196089	2013-12-02 10:29:26 +00:00
Robert Lytton	7a58a4e90d	XCore target: fix large code model 'select' indirect address handling. llvm-svn: 196088	2013-12-02 10:18:37 +00:00
Robert Lytton	3eb24d0e61	XCore target: Add large code model When using large code model: Global objects larger than 'CodeModelLargeSize' bytes are placed in sections named with a trailing ".large" The folded global address of such objects are lowered into the const pool. During inspection it was noted that LowerConstantPool() was using a default offset of zero. A fix was made, but due to only offsets of zero being generated, testing only verifies the change is not detrimental. Correct the flags emitted for explicitly specified sections. We assume the size of the object queried by getSectionForConstant() is never greater than CodeModelLargeSize. To handle greater than CodeModelLargeSize, changes to AsmPrinter would be required. llvm-svn: 196087	2013-12-02 10:18:31 +00:00
Robert Lytton	75d72dfcd2	XCore target: Fix eliminateFrameIndex() to handle large frames Large frame offsets are loaded from the ConstantPool. Where possible, offsets are encoded using the smaller MKMSK instruction. Large frame offsets can only be used when there is a frame-pointer. llvm-svn: 196085	2013-12-02 10:18:19 +00:00
Robert Lytton	9c8a9af745	XCore target: Enable frames larger than 65535 to be lowered llvm-svn: 196084	2013-12-02 10:18:14 +00:00
Kostya Serebryany	e71e08d007	[tsan] fix instrumentation of vector vptr updates (https://code.google.com/p/thread-sanitizer/issues/detail?id=43 ) llvm-svn: 196079	2013-12-02 08:07:15 +00:00
Rafael Espindola	29768368be	Remove leftovers from a non-MC asm printer. llvm-svn: 196068	2013-12-02 05:42:16 +00:00
Rafael Espindola	b121e6ba41	Remove #if 0 declarations. llvm-svn: 196067	2013-12-02 05:24:28 +00:00
Rafael Espindola	427ca8d886	Change the default of AsmWriterClassName and isMCAsmWriter. llvm-svn: 196065	2013-12-02 04:55:42 +00:00
Rafael Espindola	2f18f751ff	Remove dead declarations. llvm-svn: 196063	2013-12-02 04:18:19 +00:00
Rafael Espindola	3192965fa3	Refactor for clarity and efficiency. The PPC GetSymbolFromOperand already prefixed stubs of MO_ExternalSymbol, so this should be a nop. llvm-svn: 196059	2013-12-02 03:26:43 +00:00
Tim Northover	bcd72d7348	ARM: fix bug in -Oz stack adjustment folding Previously, we clobbered callee-saved registers when folding an "add sp, #N" into a "pop {rD, ...}" instruction. This change checks whether a register we're going to add to the "pop" could actually be live outside the function before doing so and should fix the issue. This should fix PR18081. llvm-svn: 196046	2013-12-01 14:16:24 +00:00
Benjamin Kramer	68c312e788	Revamp error checking in the ms inline asm parser. - Actually abort when an error occurred. - Check that the frontend lookup worked when parsing length/size/type operators. Tested by a clang test. PR18096. llvm-svn: 196044	2013-12-01 11:47:42 +00:00
Bill Wendling	0f97d98496	Use accessor methods instead. llvm-svn: 196006	2013-12-01 03:40:42 +00:00
Bill Wendling	178e2b5358	Use 'unsigned char' to get this past gcc error message: error: invalid conversion from 'unsigned char' to '{anonymous}::Sequence' llvm-svn: 196004	2013-12-01 03:36:07 +00:00
Hal Finkel	725757ccc8	Add a scheduling model (with itinerary) for the PPC POWER7 This adds a scheduling model for the POWER7 (P7) core, and enables the machine-instruction scheduler when targeting the P7. Scheduling for the P7, like earlier ooo PPC cores, requires considering both dispatch group hazards, and functional unit resources and latencies. These are both modeled in a combined itinerary. Dispatch group formation is still handled by the post-RA scheduler (which still needs to be updated for the P7, but nevertheless does a pretty good job). One interesting aspect of this change is that I've also enabled to use of AA duing CodeGen for the P7 (just as it is for the embedded cores). The benchmark results seem to support this decision (see below), and while this is normally useful for in-order cores, and not for ooo cores like the P7, I think that the dispatch slot hazards are enough like in-order resources to make the AA useful. Test suite significant performance differences (where negative is a speedup, and positive is a regression) vs. the current situation: MultiSource/Benchmarks/BitBench/drop3/drop3 with AA: N/A without AA: -28.7614% +/- 19.8356% (significantly against AA) MultiSource/Benchmarks/FreeBench/neural/neural with AA: -17.7406% +/- 11.2712% without AA: N/A (significantly in favor of AA) MultiSource/Benchmarks/SciMark2-C/scimark2 with AA: -11.2079% +/- 1.80543% without AA: -11.3263% +/- 2.79651% MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt with AA: -41.8649% +/- 17.0053% without AA: -34.5256% +/- 23.7072% MultiSource/Benchmarks/mafft/pairlocalalign with AA: 25.3016% +/- 17.8614% without AA: 38.6629% +/- 14.9391% (significantly in favor of AA) MultiSource/Benchmarks/sim/sim with AA: N/A without AA: 13.4844% +/- 7.18195% (significantly in favor of AA) SingleSource/Benchmarks/BenchmarkGame/Large/fasta with AA: 15.0664% +/- 6.70216% without AA: 12.7747% +/- 8.43043% SingleSource/Benchmarks/BenchmarkGame/puzzle with AA: 82.2713% +/- 26.3567% without AA: 75.7525% +/- 41.1842% SingleSource/Benchmarks/Misc/flops-2 with AA: -37.1621% +/- 20.7964% without AA: -35.2342% +/- 20.2999% (significantly in favor of AA) These are 99.5% confidence intervals from 5 runs per configuration. Regarding the choice to turn on AA during CodeGen, of these results, four seem significantly in favor of using AA, and one seems significantly against. I'm not making this decision based on these numbers alone, but these results seem consistent with results I have from other tests, and so I think that, on balance, using AA is a win. llvm-svn: 195981	2013-11-30 20:55:12 +00:00
Hal Finkel	fa2b249f38	Split some PPC itinerary classes In preparation for adding scheduling definitions for the POWER7, split some PPC itinerary classes so that the P7's latencies and hazards can be better described. For the most part, this means differentiating indexed from non-index pre-increment loads and stores. Also, differentiate single from double-precision sqrt. No functionality change intended (except for a more-specific latency for single-precision sqrt on the A2). llvm-svn: 195980	2013-11-30 20:41:13 +00:00
Zoran Jovanovic	b3e74abf46	Fixed issue with microMIPS long branch. llvm-svn: 195975	2013-11-30 19:12:28 +00:00
Daniel Sanders	65ab9582ba	[mips][msa] MSA loads and stores have a 10-bit offset. Account for this when lowering FrameIndex. This prevents the compiler from emitting invalid ld.[bhwd]'s and st.[bhwd]'s when the stack frame is between 512 and 32,768 bytes in size. llvm-svn: 195973	2013-11-30 13:47:57 +00:00
Daniel Sanders	f397466fb3	[mips][msa] A small refactor to reduce patch noise in my next commit No functional change. An if-statement has been split into two nested if-statements. llvm-svn: 195972	2013-11-30 13:15:21 +00:00
Andrew Trick	b7e697ed41	Reverse the order of eviction checks for possible compile time savings. No functionality. llvm-svn: 195969	2013-11-29 23:49:38 +00:00
Reed Kotler	95269c69db	Part 1 of 3 patches that completes very long conditional branches in constant islands for Mips16. We introdcuce JalB16 as a synomnym for Jal16. It makes it easier to read and is also necessary because Jal16 is a call instruction but JalB16 is being used as a branch. Various parts of LLVM will not work properly even in this late stage of the backend if we use what was declared as a call instruction to function as a branch. For one, basic block labels may not get emitted in some situations. llvm-svn: 195968	2013-11-29 22:32:56 +00:00
Zoran Jovanovic	b8cffe14c6	Revert revision 195965. llvm-svn: 195967	2013-11-29 22:10:02 +00:00
Zoran Jovanovic	797919cb22	Fixed issue with microMIPS long branch. llvm-svn: 195965	2013-11-29 21:41:24 +00:00
Hal Finkel	2d7cc00415	Adjust PPC A2 input operand latencies On the PPC A2, instructions are only issued after their input operands are ready. Model this by specifying that input operands are read at dispatch (0 cycles after issue). This changes all input operand latencies from 1 to 0. Significant test-suite performance changes (these are 99.5% confidence intervals on 6 runs for both before and after): speedups: MultiSource/Benchmarks/sim/sim -1.21915% +/- 0.175063% MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -1.23946% +/- 1.05133% SingleSource/Benchmarks/Misc/flops-2 -1.24237% +/- 0.681362% MultiSource/Applications/JM/lencod/lencod -1.33992% +/- 0.757498% MultiSource/Benchmarks/TSVC/InductionVariable-flt/InductionVariable-flt -1.51802% +/- 1.21468% MultiSource/Benchmarks/TSVC/GlobalDataFlow-flt/GlobalDataFlow-flt -2.18818% +/- 1.28605% MultiSource/Benchmarks/TSVC/Packing-flt/Packing-flt -2.21977% +/- 1.19499% SingleSource/Benchmarks/BenchmarkGame/spectral-norm -2.29822% +/- 0.671871% MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl -2.40975% +/- 0.355931% SingleSource/Benchmarks/Misc/fp-convert -2.41899% +/- 1.04751% MultiSource/Benchmarks/TSVC/Searching-dbl/Searching-dbl -2.50349% +/- 0.126765% SingleSource/Benchmarks/Misc/flops-3 -3.00214% +/- 0.700795% MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt -3.56995% +/- 3.2929% MultiSource/Applications/sgefa/sgefa -4.24908% +/- 2.00413% MultiSource/Benchmarks/ASC_Sequoia/IRSmk/IRSmk -18.1294% +/- 3.96489% regressions: MultiSource/Benchmarks/TSVC/Reductions-dbl/Reductions-dbl 1.03249% +/- 0.178547% MultiSource/Applications/hexxagon/hexxagon 1.16597% +/- 0.285235% MultiSource/Benchmarks/TSVC/IndirectAddressing-flt/IndirectAddressing-flt 1.39576% +/- 1.07855% SingleSource/Benchmarks/Misc-C++/stepanov_v1p2 1.71539% +/- 0.173182% MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1 1.90013% +/- 0.866472% MultiSource/Benchmarks/TSVC/Recurrences-dbl/Recurrences-dbl 2.39854% +/- 1.05914% MultiSource/Benchmarks/TSVC/ControlFlow-dbl/ControlFlow-dbl 2.4402% +/- 0.817904% MultiSource/Benchmarks/TSVC/LoopRestructuring-dbl/LoopRestructuring-dbl 5.87997% +/- 3.3172% MultiSource/Benchmarks/Trimaran/netbench-crc/netbench-crc 9.02643% +/- 5.79591% MultiSource/Benchmarks/VersaBench/bmm/bmm 10.3517% +/- 1.227% Obviously, there are data points on both sides of this; but I think, overall, this supports making the change. llvm-svn: 195951	2013-11-29 07:04:59 +00:00
Lang Hames	7883c4d5ae	Teach LocalStackSlotAllocation that stackmaps/patchpoints don't have range constraints on their frame offsets. llvm-svn: 195950	2013-11-29 06:35:30 +00:00
Hal Finkel	69f21285ed	Create a PPC440 SchedMachineModel Some of the older PPC processor definitions don't have associated SchedMachineModels; correct this for the PPC440. llvm-svn: 195949	2013-11-29 06:32:17 +00:00
Hal Finkel	c5a38fd3e6	Fixup PPC440 load/store operand latencies The operand latencies for loads and stores in the PPC440 itinerary were wrong (the store operands are all inputs, and the "with update" (pre-increment) instructions need a latency for the additional output). llvm-svn: 195948	2013-11-29 06:19:43 +00:00
Hal Finkel	a9d93b1740	Adjust PPC440 operand latencies The operand latencies for the PPC440 should be specified relative to dispatch, not relative to the initial fetch-and-decode stages. Because most instructions (ignoring bypass) wait in dispatch until their operands are ready, this is modeled as reading input operands "at dispatch" (0 cycles after issue), and so every input and output operand has 4 cycles subtracted from it. This could alter scheduling slightly, but I don't expect a large effect. llvm-svn: 195947	2013-11-29 05:59:00 +00:00
Hal Finkel	f086fc01ab	Don't model the fetch and decode units for the PPC440 Modeling the fetch and decode units in the PPC440 itinerary does not add anything to the hazard detection capability (and so modeling them just wastes compile time). No functionality change intended. llvm-svn: 195946	2013-11-29 05:58:38 +00:00
Lang Hames	82e8d4faa9	Remove unused variable from r195944. llvm-svn: 195945	2013-11-29 03:36:53 +00:00
Lang Hames	067c025250	Refactor a lot of patchpoint/stackmap related code to simplify and make it target independent. Most of the x86 specific stackmap/patchpoint handling was necessitated by the use of the native address-mode format for frame index operands. PEI has now been modified to treat stackmap/patchpoint similarly to DEBUG_INFO, allowing us to use a simple, platform independent register/offset pair for frame indexes on stackmap/patchpoints. Notes: - Folding is now platform independent and automatically supported. - Emiting patchpoints with direct memory references now just involves calling the TargetLoweringBase::emitPatchPoint utility method from the target's XXXTargetLowering::EmitInstrWithCustomInserter method. (See X86TargetLowering for an example). - No more ugly platform-specific operand parsers. This patch shouldn't change the generated output for X86. llvm-svn: 195944	2013-11-29 03:07:54 +00:00
Hao Liu	b9fa1067c7	AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. llvm-svn: 195941	2013-11-29 02:11:22 +00:00
Jiangning Liu	afc7f71eb3	Add missing pattern for supporting intrinsic function vbsl_f64 with argument double floating point. llvm-svn: 195938	2013-11-29 01:37:15 +00:00
Kevin Qin	b95721d200	[AArch64 NEON]Fix a assertion failure when disassemble SHLL instruction. llvm-svn: 195936	2013-11-29 01:29:16 +00:00
Stephen Canon	d8aaca93a6	Rein in overzealous InstCombine of fptrunc(OP(fpextend, fpextend)). llvm-svn: 195934	2013-11-28 21:38:05 +00:00
Rafael Espindola	3f4a857bd5	Refactor to remove a bit of duplication. No functionality change. llvm-svn: 195933	2013-11-28 20:12:44 +00:00
Benjamin Kramer	fd8fd4246f	Silence sign-compare warning and reduce nesting. No functionality change. llvm-svn: 195932	2013-11-28 19:58:56 +00:00
Rafael Espindola	7e7db10302	Remove an always true parameter. llvm-svn: 195931	2013-11-28 19:35:07 +00:00
NAKAMURA Takumi	9b851e876f	[CMake] Let add_public_tablegen_target() provide intrinsics_gen, too. I think, in principle, intrinsics_gen may be added explicitly. That said, it can be added incidentally, since each target already has dependencies to llvm-tblgen. Almost all source files depend on both CommonTaleGen and intrinsics_gen. Explicit add_dependencies() have been pruned under lib/Target. llvm-svn: 195929	2013-11-28 17:04:31 +00:00
NAKAMURA Takumi	99f544b37e	[CMake] Let add_public_tablegen_target responsible to provide dependency to CommonTableGen. add_public_tablegen_target adds *CommonTableGen to LLVM_COMMON_DEPENDS. LLVM_COMMON_DEPENDS affects add_llvm_library (and other add_target stuff) within its scope. llvm-svn: 195927	2013-11-28 17:04:04 +00:00
Rafael Espindola	02c35a15de	The global prefix is always one char. Don't use a string for it. llvm-svn: 195926	2013-11-28 17:00:49 +00:00
NAKAMURA Takumi	46b765a4a3	[CMake] Prune include_directories() in llvm/lib/Target, take #2 . I forgot to commit them. They were staging in my local repo. llvm-svn: 195924	2013-11-28 15:30:37 +00:00
Daniel Sanders	86f254d104	[mips] Revert test commit r195922. llvm-svn: 195923	2013-11-28 15:26:33 +00:00
Daniel Sanders	5b3619e21b	[mips] A test commit to test my Herald and Audit workflow Will be reverted in the next commit llvm-svn: 195922	2013-11-28 15:25:43 +00:00
NAKAMURA Takumi	5dbd3bcf3d	[CMake] Prune include_directories() in llvm/lib/Target. add_llvm_target() sets them. llvm-svn: 195921	2013-11-28 14:53:30 +00:00
NAKAMURA Takumi	128a461f6a	Add newline at eof. llvm-svn: 195920	2013-11-28 14:52:52 +00:00
Rafael Espindola	05d05c4e8a	Use the mangler consistently instead of using getGlobalPrefix directly. llvm-svn: 195911	2013-11-28 08:59:52 +00:00
Hal Finkel	e49bc01fba	Don't share functional units among the PPC itineraries Instead of sharing functional unit names between the various PPC itineraries, give each core its own unit names prefixed with the core name. This follows the convention used by other backends (such as ARM), and removes a non-obvious ordering dependency between the various PPCSchedule*.td files. No functionality change intended. llvm-svn: 195908	2013-11-28 06:05:59 +00:00
Jiangning Liu	7f44dcb9f4	Remove the variable only used by assert to avoid the build failure caused by build options [-Werror,-Wunused-variable]. llvm-svn: 195905	2013-11-28 01:34:55 +00:00
Hao Liu	2f617213ef	AArch64: Fix a bug about disassembling post-index load single element to 4 vectors llvm-svn: 195903	2013-11-28 01:07:45 +00:00
Reed Kotler	deb5d6d05e	Check in conditional branches for constant islands. Still need to finish conditional branches for very large targets. That will be the next small patch. Everything now should in principle work as good (functionality wise) as without constant islands so we decided at Mips/Imagination to make constant islands the default for Mips16 now so that it will get excercised a lot and this port is still experimentatl though hopefully soon we will change the status. Some more cleanup and code review is in order but things are converging fast. llvm-svn: 195902	2013-11-28 00:56:37 +00:00
Akira Hatanaka	a964ea01fe	[mips] Redefine TAILCALL as a pseudo instruction. No functionality change. llvm-svn: 195896	2013-11-27 23:58:32 +00:00

1 2 3 4 5 ...

65789 Commits