llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 04:52:54 +02:00

Author	SHA1	Message	Date
Daniel Jasper	623166a4f9	Further fix to llvm-cov test. It turns out that in some build systems, tests are executed in a non-writable directory. Hopefully, this finally fixes the issue. llvm-svn: 196256	2013-12-03 08:21:14 +00:00
Daniel Jasper	531677d105	Fix llvm-cov test as suggested in r196228's post commit review. llvm-svn: 196255	2013-12-03 07:56:23 +00:00
Daniel Jasper	37be421ce0	Copy input files to test directory. With r196184, llvm-cov creates a new file right next to the input file. However, the Inputs-directory can't simply be assumed to be writable under all build systems. Also, this prevents a new source file from showing up in the source tree if the test aborts before the call to "rm". llvm-svn: 196228	2013-12-03 07:35:32 +00:00
Hao Liu	547dc86218	[AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn llvm-svn: 196210	2013-12-03 06:06:55 +00:00
Hao Liu	f922fde3de	AArch64: add missing ACLE intrinsics mapping to general arithmetic operation from VFP instructions. E.g. float64x1_t vadd_f64(float64x1_t a, float64x1_t b) -> FADD Dd, Dn, Dm. llvm-svn: 196208	2013-12-03 05:58:30 +00:00
NAKAMURA Takumi	f0e7a4b85f	llvm-cov.test: Resurrect part of r194694 for win32 hosts. llvm-svn: 196207	2013-12-03 05:40:25 +00:00
Hao Liu	fea9943555	AArch64: Add missing scalar pair intrinsics. E.g. "float32_t vaddv_f32(float32x2_t a)" to be matched into "faddp s0, v1.2s". llvm-svn: 196198	2013-12-03 03:39:47 +00:00
NAKAMURA Takumi	90ce128a31	llvm/test/Transforms/SampleProfile/syntax.ll: Relax an expression, not to check locale-dependent message. llvm-svn: 196195	2013-12-03 02:20:53 +00:00
Jiangning Liu	24b3414579	Add some missing pattern matches for AArch64 Neon intrinsics like vuqadd_s64 and friends. llvm-svn: 196192	2013-12-03 01:33:52 +00:00
Jiangning Liu	3f5f9eefd0	Add some missing pattern matches for AArch64 Neon intrinsics like vmull_high_n_s16 and friends. llvm-svn: 196190	2013-12-03 01:29:32 +00:00
Yuchen Wu	cd7628e981	llvm-cov: Removed output to STDOUT/specified file. Instead of asking the user to specify a single file to output coverage info and defaulting to STDOUT, llvm-cov now creates files for each source file with a naming system of: <source filename> + ".llcov". This is what gcov does and although it can clutter the working directory with numerous coverage files, it will be easier to hook the llvm-cov output to tools which operate on this assumption (such as lcov). llvm-svn: 196184	2013-12-03 00:57:11 +00:00
Manman Ren	644e8f4a5b	Debug Info: drop debug info via upgrading path if version number does not match. Add a helper function getDebugInfoVersionFromModule to return the debug info version number for a module. "Verifier/module-flags-1.ll" checks for verification errors. It will seg fault when calling getDebugInfoVersionFromModule because of the incorrect format for module flags in the testing case. We make getModuleFlagsMetadata more robust by checking for error conditions. PR17982 llvm-svn: 196158	2013-12-02 21:29:56 +00:00
Manman Ren	3b572dc7ed	Update Ocaml/vmcore.ml to emit a "Debug Info Version" module flag. llvm-svn: 196156	2013-12-02 21:25:56 +00:00
Chad Rosier	bcca7559f8	[AArch64] Implemented vcopy_lane patterns using scalar DUP instruction. Patch by Ana Pazos! llvm-svn: 196151	2013-12-02 21:05:16 +00:00
David Blaikie	74530dc54e	DebugInfo: Type Units: Propagate the correct DW_AT_language into type units. llvm-svn: 196130	2013-12-02 18:44:29 +00:00
Kay Tiong Khoo	5257afa264	Conservative fix for PR17827 - don't optimize a shift + and + compare sequence where the shift is logical unless the comparison is unsigned llvm-svn: 196129	2013-12-02 18:43:59 +00:00
Vincent Lejeune	26780e84f1	R600: Workaround for cayman loop bug llvm-svn: 196121	2013-12-02 17:29:37 +00:00
Diego Novillo	a4c6fce65c	Add tests for profile sample file parsing. The profile file parser needed some tests for its parsing actions. This adds tests for each of the error messages emitted by the parser. llvm-svn: 196106	2013-12-02 15:12:50 +00:00
Rafael Espindola	5b33cec985	Output .eh_frames on COFF too now that the integrated as is used on mingw. llvm-svn: 196104	2013-12-02 14:59:34 +00:00
Tim Northover	c144b1204e	ARM: decide whether to use movw/movt based on "minsize" attribute. llvm-svn: 196102	2013-12-02 14:46:26 +00:00
Robert Lytton	aec919de4b	XCore target: Make handling of large frames not dependent upon an FP. eliminateFrameIndex() has been reworked to handle both small & large frames with either a FP or SP. An additional Slot is required for Scavenging spills when not using FP for large frames. Reworked the handling of Register Scavenging. Whether we are using an FP or not, whether it is a large frame or not, and whether we are using a large code model or not are now independent. llvm-svn: 196091	2013-12-02 11:05:28 +00:00
Tim Northover	46df9f449d	ARM: add pseudo-instructions for lit-pool global materialisation These are used by MachO only at the moment, and (much like the existing MOVW/MOVT set) work around the fact that the labels used in the actual instructions often contain PC-dependent components, which means that repeatedly materialising the same global can't be CSEed. With small modifications, it could be adapted to how ELF finds the address of _GLOBAL_OFFSET_TABLE_, which would give similar benefits in PIC mode there. llvm-svn: 196090	2013-12-02 10:35:41 +00:00
Robert Lytton	7a58a4e90d	XCore target: fix large code model 'select' indirect address handling. llvm-svn: 196088	2013-12-02 10:18:37 +00:00
Robert Lytton	3eb24d0e61	XCore target: Add large code model When using large code model: Global objects larger than 'CodeModelLargeSize' bytes are placed in sections named with a trailing ".large" The folded global address of such objects are lowered into the const pool. During inspection it was noted that LowerConstantPool() was using a default offset of zero. A fix was made, but due to only offsets of zero being generated, testing only verifies the change is not detrimental. Correct the flags emitted for explicitly specified sections. We assume the size of the object queried by getSectionForConstant() is never greater than CodeModelLargeSize. To handle greater than CodeModelLargeSize, changes to AsmPrinter would be required. llvm-svn: 196087	2013-12-02 10:18:31 +00:00
Robert Lytton	c3b700cb09	XCore target: extend tests in preparation llvm-svn: 196086	2013-12-02 10:18:24 +00:00
Robert Lytton	75d72dfcd2	XCore target: Fix eliminateFrameIndex() to handle large frames Large frame offsets are loaded from the ConstantPool. Where possible, offsets are encoded using the smaller MKMSK instruction. Large frame offsets can only be used when there is a frame-pointer. llvm-svn: 196085	2013-12-02 10:18:19 +00:00
Robert Lytton	9c8a9af745	XCore target: Enable frames larger than 65535 to be lowered llvm-svn: 196084	2013-12-02 10:18:14 +00:00
Kostya Serebryany	e71e08d007	[tsan] fix instrumentation of vector vptr updates (https://code.google.com/p/thread-sanitizer/issues/detail?id=43 ) llvm-svn: 196079	2013-12-02 08:07:15 +00:00
Alp Toker	fcc4ea594d	Rename test with misspelt filename llvm-svn: 196064	2013-12-02 04:31:36 +00:00
Rafael Espindola	cf111ab2af	Also test the created stubs on 32 bits. llvm-svn: 196052	2013-12-01 21:24:30 +00:00
Andrew Trick	26c262f3a7	Add -mcpu to stackmap.ll llvm-svn: 196051	2013-12-01 18:17:05 +00:00
Tim Northover	bcd72d7348	ARM: fix bug in -Oz stack adjustment folding Previously, we clobbered callee-saved registers when folding an "add sp, #N" into a "pop {rD, ...}" instruction. This change checks whether a register we're going to add to the "pop" could actually be live outside the function before doing so and should fix the issue. This should fix PR18081. llvm-svn: 196046	2013-12-01 14:16:24 +00:00
Michael Kuperstein	356b61c610	Ensure bitcode encoding of linkage types stays stable. Patch by Boaz Ouriel llvm-svn: 196042	2013-12-01 10:16:35 +00:00
Hal Finkel	14673817db	Convert a PPC test from grep to FileCheck Convert this test to FileCheck, and improve it to check for the instructions it is trying to exclude instead of checking for register use (especially because grepping for r1 can be thrown off, for example, by a use of r12). llvm-svn: 195979	2013-11-30 20:04:33 +00:00
Hal Finkel	ded988ca4c	Desensitize a couple of PPC regression tests Use CHECK-DAG to make these regression tests more resilient against changes in instruction scheduling. llvm-svn: 195978	2013-11-30 19:52:28 +00:00
Hal Finkel	1cdcead814	Update the cpu specified on some PPC regression tests Some of these tests did not specify a cpu but were also sensitive to instruction scheduling and/or register assignment choices. A few others similarly-sensitive tests specified a cpu (often the POWER7), and while the P7 currently uses the default model for PPC64, this will soon change. For those tests which should not really be cpu-dependent anyway, the cpu is set to the generic 'ppc64'. llvm-svn: 195977	2013-11-30 19:39:27 +00:00
Zoran Jovanovic	335dc8689e	Test case for issue with microMIPS long branch. llvm-svn: 195976	2013-11-30 19:13:15 +00:00
Daniel Sanders	65ab9582ba	[mips][msa] MSA loads and stores have a 10-bit offset. Account for this when lowering FrameIndex. This prevents the compiler from emitting invalid ld.[bhwd]'s and st.[bhwd]'s when the stack frame is between 512 and 32,768 bytes in size. llvm-svn: 195973	2013-11-30 13:47:57 +00:00
Juergen Ributzka	7150312963	Force CPU type to unbreak unit tests on Haswell machines. llvm-svn: 195971	2013-11-30 03:07:16 +00:00
Reed Kotler	95269c69db	Part 1 of 3 patches that completes very long conditional branches in constant islands for Mips16. We introdcuce JalB16 as a synomnym for Jal16. It makes it easier to read and is also necessary because Jal16 is a call instruction but JalB16 is being used as a branch. Various parts of LLVM will not work properly even in this late stage of the backend if we use what was declared as a call instruction to function as a branch. For one, basic block labels may not get emitted in some situations. llvm-svn: 195968	2013-11-29 22:32:56 +00:00
Zoran Jovanovic	b8cffe14c6	Revert revision 195965. llvm-svn: 195967	2013-11-29 22:10:02 +00:00
Petar Jovanovic	f12c338160	mips: XFAIL llvm-cov test XFAIL llvm-cov.test for MIPS until big-endian issues are fixed for llvm-cov. The test does pass on MIPS little-endian. llvm-svn: 195966	2013-11-29 21:59:09 +00:00
Zoran Jovanovic	797919cb22	Fixed issue with microMIPS long branch. llvm-svn: 195965	2013-11-29 21:41:24 +00:00
Hao Liu	b9fa1067c7	AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. llvm-svn: 195941	2013-11-29 02:11:22 +00:00
Jiangning Liu	844201423a	Add missing test case for bsl_f64 support of AArch64 NEON. llvm-svn: 195939	2013-11-29 01:38:08 +00:00
Kevin Qin	b95721d200	[AArch64 NEON]Fix a assertion failure when disassemble SHLL instruction. llvm-svn: 195936	2013-11-29 01:29:16 +00:00
Stephen Canon	d8aaca93a6	Rein in overzealous InstCombine of fptrunc(OP(fpextend, fpextend)). llvm-svn: 195934	2013-11-28 21:38:05 +00:00
Hao Liu	2f617213ef	AArch64: Fix a bug about disassembling post-index load single element to 4 vectors llvm-svn: 195903	2013-11-28 01:07:45 +00:00
Reed Kotler	deb5d6d05e	Check in conditional branches for constant islands. Still need to finish conditional branches for very large targets. That will be the next small patch. Everything now should in principle work as good (functionality wise) as without constant islands so we decided at Mips/Imagination to make constant islands the default for Mips16 now so that it will get excercised a lot and this port is still experimentatl though hopefully soon we will change the status. Some more cleanup and code review is in order but things are converging fast. llvm-svn: 195902	2013-11-28 00:56:37 +00:00
David Blaikie	91268863a6	DebugInfo: Do not include variables only referenced by templates in aranges. ARanges included even extern variables referenced by pointer non-type template parameters even though that variable isn't part of this compilation unit. llvm-svn: 195895	2013-11-27 23:53:52 +00:00
Akira Hatanaka	ff17fbeebc	[mips] Implement the following optimizations using dominance information to make PIC calls a little more efficient: 1. Remove instructions setting up $gp if it is known that a function has been called at least once. 2. Save the address of a called function in a register instead of loading it from the GOT at every call site. llvm-svn: 195892	2013-11-27 23:38:42 +00:00
Tom Stellard	95624c101d	R600: Expand vector FABS NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195881	2013-11-27 21:23:39 +00:00
Tom Stellard	eac3acc854	R600/SI: Implement spilling of SGPRs v5 SGPRs are spilled into VGPRs using the {READ,WRITE}LANE_B32 instructions. v2: - Fix encoding of Lane Mask - Use correct register flags, so we don't overwrite the low dword when restoring multi-dword registers. v3: - Register spilling seems to hang the GPU, so replace all shaders that need spilling with a dummy shader. v4: - Fix *LANE definitions - Change destination reg class for 32-bit SMRD instructions v5: - Remove small optimization that was crashing Serious Sam 3. https://bugs.freedesktop.org/show_bug.cgi?id=68224 https://bugs.freedesktop.org/show_bug.cgi?id=71285 NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195880	2013-11-27 21:23:35 +00:00
Tom Stellard	d386cdf4d0	R600/SI: Use SGPR_32 register class for 32-bit SMRD outputs Writing to the M0 register from an SMRD instruction hangs the GPU, so we need to use the SGPR_32 register class, which does not include M0. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195879	2013-11-27 21:23:29 +00:00
Tom Stellard	0a14ce13e1	R600: Add support for ISD::FROUND NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195878	2013-11-27 21:23:20 +00:00
Rafael Espindola	c377a725ca	Use FileCheck and expand the test a bit. In particular, check the name of the symbol we are putting in the constant pool. llvm-svn: 195865	2013-11-27 19:22:14 +00:00
Rafael Espindola	c099bf8035	Use the same tls section name as msvc. We currently error in clang with: "error: thread-local storage is unsupported for the current target", but we can start to get the llvm level ready. When compiling template<typename T> struct foo { static __declspec(thread) int bar; }; template<typename T> __declspec(therad) int foo<T>::bar; template struct foo<int>; msvc produces SECTION HEADER #3 .tls$ name 0 physical address 0 virtual address 4 size of raw data 12F file pointer to raw data (0000012F to 00000132) 0 file pointer to relocation table 0 file pointer to line numbers 0 number of relocations 0 number of line numbers C0301040 flags Initialized Data COMDAT; sym= "public: static int foo<int>::bar" (?bar@?$foo@H@@2HA) 4 byte align Read Write gcc produces a ".data$__emutls_v.<symbol>" for the testcase with __declspec(thread) replaced with thread_local. llvm-svn: 195849	2013-11-27 15:52:11 +00:00
Jiangning Liu	d9270b7a51	Fix the AArch64 NEON bug exposed by checking constant integer argument range of ACLE intrinsics. llvm-svn: 195843	2013-11-27 14:02:25 +00:00
Rafael Espindola	22b6ec4d69	Cleanup and test X86AsmPrinter::printPCRelImm. It is only used for asm printing. On X86 we put basic block addresses on register before passing them to inline asm, so the MO_MachineBasicBlock case was dead. MO_ExternalSymbol was dead since any symbol being passed to inline asm is represented as MO_GlobalAddress. The MO_GlobalAddress and MO_Register cases were not tested. llvm-svn: 195824	2013-11-27 06:53:13 +00:00
Chad Rosier	ca062e81db	[AArch64] Add support for NEON scalar floating-point absolute difference. llvm-svn: 195803	2013-11-27 01:45:58 +00:00
Rafael Espindola	3ceb67b21b	Use simple section names for COMDAT sections on COFF. With this patch we use simple names for COMDAT sections (like .text or .bss). This matches the MSVC behavior. When merging it is the COMDAT symbol that is used to decide if two sections should be merged, so there is no point in building a fancy name. This survived a bootstrap on mingw32. llvm-svn: 195798	2013-11-27 01:18:37 +00:00
Nadav Rotem	dc01e91cf5	PR1860 - We can't save a list of ExtractElement instructions to CSE because some of these instructions may be removed and optimized in future iterations. Instead we save a list of basic blocks that we need to CSE. llvm-svn: 195791	2013-11-26 22:24:25 +00:00
Chad Rosier	1337fcc721	[AArch64] Add support for NEON scalar floating-point to integer convert instructions. llvm-svn: 195788	2013-11-26 22:17:37 +00:00
Arnold Schwaighofer	d0c05d2c84	LoopVectorizer: Truncate i64 trip counts of i32 phis if necessary In signed arithmetic we could end up with an i64 trip count for an i32 phi. Because it is signed arithmetic we know that this is only defined if the i32 does not wrap. It is therefore safe to truncate the i64 trip count to a i32 value. Fixes PR18049. llvm-svn: 195787	2013-11-26 22:11:23 +00:00
Reed Kotler	06b47695fb	Fix a bug related to constant islands for Mips16 and mips16/32 dual mode. The determination of when we are doing constant pools was being made too early in the asm printer. llvm-svn: 195781	2013-11-26 20:38:40 +00:00
Michael Liao	8c702e1a18	Fix PR18054 - Fix bug in (vsext (vzext x)) -> (vsext x) in SIGN_EXTEND_IN_REG lowering where we need to check whether x is a vector type (in-reg type) of i8, i16 or i32; otherwise, that optimization is not valid. llvm-svn: 195779	2013-11-26 20:31:31 +00:00
David Blaikie	bbf2455d59	DwarfDebug: Include type units in accelerator tables. Since type units aren't in the CUMap, use the DwarfUnits list to iterate over units for tasks such as accelerator table building. llvm-svn: 195776	2013-11-26 19:14:34 +00:00
Nadav Rotem	643eb4c26e	PR18060 - When we RAUW values with ExtractElement instructions in some cases we generate PHI nodes with multiple entries from the same basic block but with different values. Enabling CSE on ExtractElement instructions make sure that all of the RAUWed instructions are the same. llvm-svn: 195773	2013-11-26 17:29:19 +00:00
Stepan Dyatkovskiy	83455f2b60	PR17925 bugfix. Short description. This issue is about case of treating pointers as integers. We treat pointers as different if they references different address space. At the same time, we treat pointers equal to integers (with machine address width). It was a point of false-positive. Consider next case on 32bit machine: void foo0(i32 addrespace(1)* %p) void foo1(i32 addrespace(2)* %p) void foo2(i32 %p) foo0 != foo1, while foo1 == foo2 and foo0 == foo2. As you can see it breaks transitivity. That means that result depends on order of how functions are presented in module. Next order causes merging of foo0 and foo1: foo2, foo0, foo1 First foo0 will be merged with foo2, foo0 will be erased. Second foo1 will be merged with foo2. Depending on order, things could be merged we don't expect to. The fix: Forbid to treat any pointer as integer, except for those, who belong to address space 0. llvm-svn: 195769	2013-11-26 16:11:03 +00:00
Tim Northover	f0a2ff9091	Darwin-ARM: use movw/movt for static relocations llvm-svn: 195759	2013-11-26 12:45:05 +00:00
Richard Sandiford	b3250399ac	[SystemZ] Fix incorrect use of RISBG for a zero-extended right shift We would wrongly transform the testcase into the equivalent of an AND with 1. The problem was that, when testing whether the shifted-in bits of the right shift were significant, we used the width of the final zero-extended result rather than the width of the shifted value. llvm-svn: 195731	2013-11-26 10:53:16 +00:00
Kevin Qin	1370a1e1ee	Refactored the implementation of AArch64 NEON instruction ZIP, UZP and TRN. Fix a bug when mixed use of vget_high_u8() and vuzp_u8(). llvm-svn: 195716	2013-11-26 03:26:47 +00:00
Kevin Qin	95c8b28223	[AArch64]Implement 128 bit register copy with NEON. llvm-svn: 195713	2013-11-26 02:33:42 +00:00
Andrew Trick	95afafe3fa	StackMap: Implement support for DirectMemRefOp. A Direct stack map location records the address of frame index. This address is itself the value that the runtime requested. This differs from IndirectMemRefOp locations, which refer to a stack locations from which the requested values must be loaded. Direct locations can directly communicate the address if an alloca, while IndirectMemRefOp handle register spills. For example: entry: %a = alloca i64... llvm.experimental.stackmap(i32 <ID>, i32 <shadowBytes>, i64* %a) Since both the alloca and stackmap intrinsic are in the entry block, and the intrinsic takes the address of the alloca, the runtime can assume that LLVM will not substitute alloca with any intervening value. This must be verified by the runtime by checking that the stack map's location is a Direct location type. The runtime can then determine the alloca's relative location on the stack immediately after compilation, or at any time thereafter. This differs from Register and Indirect locations, because the runtime can only read the values in those locations when execution reaches the instruction address of the stack map. llvm-svn: 195712	2013-11-26 02:03:25 +00:00
David Blaikie	aeec78b126	DebugInfo: Update test case due to dumper improvements in r195698 The dumper was only dumping one pubtypes set and it was /always/ dumping one pubtypes set even when there were zero sets. Now that the dumper correctly dumps zero, one, or many sets, we can update this test case to test for the absolute absence of a set rather than a bogus/accidental zero-valued set. llvm-svn: 195706	2013-11-26 01:11:02 +00:00
David Blaikie	98277f8277	DebugInfo: Avoid emitting pubtype entries for type DIEs that just indirect to a type unit. llvm-svn: 195698	2013-11-26 00:22:37 +00:00
Cameron McInally	2ff051483c	Add an intrinsic for the SSE2 PAUSE instruction. llvm-svn: 195697	2013-11-26 00:20:43 +00:00
Chandler Carruth	497a42d1b9	Add the test case that I missed when committing r195528. Doh! llvm-svn: 195691	2013-11-25 22:24:27 +00:00
Rafael Espindola	ae17ac667e	Use -triple to fix the test on non-ELF hosts. llvm-svn: 195682	2013-11-25 20:46:18 +00:00
Rafael Espindola	fa5cbd5557	Don't use nopl in cpus that don't support it. Patch by Mikulas Patocka. I added the test. I checked that for cpu names that gas knows about, it also doesn't generate nopl. The modified cpus: i686 - there are i686-class CPUs that don't have nopl: Via c3, Transmeta Crusoe, Microsoft VirtualBox - see https://bbs.archlinux.org/viewtopic.php?pid=775414 k6, k6-2, k6-3, winchip-c6, winchip2 - these are 586-class CPUs via c3 c3-2 - see https://bugs.archlinux.org/task/19733 as a proof that Via c3 and c3-Nehemiah don't have nopl llvm-svn: 195679	2013-11-25 20:15:14 +00:00
David Peixotto	647697e4ae	ARM integrated assembler generates incorrect nop opcode This patch fixes a bug in the assembler that was causing bad code to be emitted. When switching modes in an assembly file (e.g. arm to thumb mode) we would always emit the opcode from the original mode. Consider this small example: $ cat align.s .code 16 foo: add r0, r0 .align 3 add r0, r0 $ llvm-mc -triple armv7-none-linux align.s -filetype=obj -o t.o $ llvm-objdump -triple thumbv7 -d t.o Disassembly of section .text: foo: 0: 00 44 add r0, r0 2: 00 f0 20 e3 blx #4195904 6: 00 00 movs r0, r0 8: 00 44 add r0, r0 This shows that we have actually emitted an arm nop (e320f000) instead of a thumb nop. Unfortunately, this encodes to a thumb branch which causes bad things to happen when compiling assembly code with align directives. The fix is to notify the ARMAsmBackend when we switch mode. The MCMachOStreamer was already doing this correctly. This patch makes the same change for the MCElfStreamer. There is still a bug in the way nops are emitted for alignment because the MCAlignment fragment does not store the correct mode. The ARMAsmBackend will emit nops for the last mode it knew about. In the example above, we still generate an arm nop if we add a `.code 32` to the end of the file. PR18019 llvm-svn: 195677	2013-11-25 19:11:13 +00:00
Bill Wendling	0fe82ef0aa	Unrevert r195599 with testcase fix. I'm not sure how it was checking for the wrong values... PR18023. llvm-svn: 195670	2013-11-25 18:05:22 +00:00
Rafael Espindola	a355ffef1b	Fix .comm and .lcomm on COFF. These should not use COMDATs. GNU as uses .bss for .lcomm and section 0 for .comm. Given static int a; int b; MSVC puts both in .bss. This patch then puts both .comm and .lcomm on .bss. With this change we agree with gas on .lcomm, are much closer on .comm and clang-cl matches msvc on the above example. llvm-svn: 195654	2013-11-25 16:06:04 +00:00
Amara Emerson	368f3c89e8	[ARM] Enable FeatureMP for Cortex-A5 by default. Patch by Oliver Stannard. llvm-svn: 195640	2013-11-25 13:17:15 +00:00
Amara Emerson	dfecbfdfc2	Revert r195599 as it broke the builds. llvm-svn: 195636	2013-11-25 11:24:18 +00:00
Daniel Sanders	054e9e0703	Fixed tryFoldToZero() for vector types that need expansion. Summary: Moved the requirement for SelectionDAG::getConstant() to return legally typed nodes slightly earlier. There were two optional DAGCombine passes that were missed out and were required to produce type-legal DAGs. Simplified a code-path in tryFoldToZero() to use SelectionDAG::getConstant(). This provides support for both promoted and expanded vector types whereas the previous code only supported promoted vector types. Fixes a "Type for zero vector elements is not legal" assertion detected by an llvm-stress generated test. Reviewers: resistor CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2251 llvm-svn: 195635	2013-11-25 11:14:43 +00:00
Bill Wendling	f4bc87d59d	Don't look past volatile loads. A volatile load should block us from trying to coalesce stores. PR18023 llvm-svn: 195599	2013-11-25 05:01:21 +00:00
Hao Liu	66ab312f94	Fixed a bug about disassembling AArch64 post-index load/store single element instructions. ie. echo "0x00 0x04 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble echo "0x00 0x00 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble will be disassembled into the same instruction st1 {v0b}[0], [x0], x0. llvm-svn: 195591	2013-11-25 01:53:26 +00:00
Venkatraman Govindaraju	d0d03fae95	[Sparc] Emit large negative adjustments to SP/FP with sethi+xor instead of sethi+or. This generates correct code for both sparc32 and sparc64. llvm-svn: 195576	2013-11-24 20:23:25 +00:00
Venkatraman Govindaraju	73dd53211d	[SparcV9]: Do not emit .register directives for global registers that are clobbered by calls but not used in the function itself. llvm-svn: 195574	2013-11-24 18:41:49 +00:00
Venkatraman Govindaraju	0c27a5ac2c	[SparcV9] Enable custom lowering of DYNAMIC_STACKALLOC in sparc64. llvm-svn: 195573	2013-11-24 17:41:41 +00:00
Reed Kotler	6088c0e228	Make sure that for C++ emitting LwConstant32 pseudos, that it corresponds to what is needed for constant islands. The prescan method for Mips16 constant islands will eventually go away. It is only temporary and should be done earlier when the instructions are first created or from the DAG. If we keep it here we need to handle better the situation where constant islands is called multiple times since don't want to prescan more than once. llvm-svn: 195569	2013-11-24 06:18:50 +00:00
Reed Kotler	6a8a859a63	Update older test cases for latest patch. llvm-svn: 195566	2013-11-24 03:37:56 +00:00
Reed Kotler	eb75f46c95	Fix a funny bug I introduced during conversion of ARM constant islands to Mips. I had to move some code and I moved a declaration forward past it's first use in the function but by nutty coincidence there was another variable of the same name and type and with completely unrelated function that was declared globally in the class so no compilation error ensued. It required some unusual conditions for it to even matter. Caused test case casts.c in test-suite to fail during compilation with a duplicate symbol error. I would have noticed it during final code review for this port. llvm-svn: 195565	2013-11-24 02:53:09 +00:00
Manman Ren	e53617a3e6	Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. Make tests more robust by removing hard-coded metadata numbers in CHECK lines. llvm-svn: 195535	2013-11-23 01:16:29 +00:00
Tom Stellard	5da7926d0a	R600/SI: Fixing handling of condition codes We were ignoring the ordered/onordered bits and also the signed/unsigned bits of condition codes when lowering the DAG to MachineInstrs. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195514	2013-11-22 23:07:58 +00:00
Manman Ren	f0d5143ea6	Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. llvm-svn: 195504	2013-11-22 21:49:45 +00:00
Jim Grosbach	02f7297367	X86: Perform integer comparisons at i32 or larger. Utilizing the 8 and 16 bit comparison instructions, even when an input can be folded into the comparison instruction itself, is typically not worth it. There are too many partial register stalls as a result, leading to significant slowdowns. By always performing comparisons on at least 32-bit registers, performance of the calculation chain leading to the comparison improves. Continue to use the smaller comparisons when minimizing size, as that allows better folding of loads into the comparison instructions. rdar://15386341 llvm-svn: 195496	2013-11-22 19:57:47 +00:00
Matt Arsenault	35acaad8c7	StructurizeCFG: Fix verification failure with some loops. If the beginning of the loop was also the entry block of the function, branches were inserted to the entry block which isn't allowed. If this occurs, create a new dummy function entry block that branches to the start of the loop. llvm-svn: 195493	2013-11-22 19:24:39 +00:00
Matt Arsenault	9afcbf3562	StructurizeCFG: Fix inverting a branch on an argument llvm-svn: 195492	2013-11-22 19:24:37 +00:00

1 2 3 4 5 ...

21891 Commits