llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Tim Northover	e13264c995	ARM: ensure fixed-point conversions have sane types We were generating intrinsics for NEON fixed-point conversions that didn't exist (e.g. float -> i16). There are two cases to consider: + iN is smaller than float. In this case we can do the conversion but need an extend or truncate as well. + iN is larger than float. In this case using the NEON conversion would be incorrect so we don't perform any combining. llvm-svn: 185158	2013-06-28 15:29:25 +00:00
Tilmann Scheller	9392d20bbe	ARM: Fix pseudo-instructions for SRS (Store Return State). The mapping between SRS pseudo-instructions and SRS native instructions was incorrect, the correct mapping is: srsfa -> srsib srsea -> srsia srsfd -> srsdb srsed -> srsda This fixes <rdar://problem/14214734>. llvm-svn: 185155	2013-06-28 15:09:46 +00:00
Rafael Espindola	2da320119c	Improve comment. llvm-svn: 185141	2013-06-28 10:55:41 +00:00
Alexey Samsonov	f24a594c21	Make a switch in createBinary fully-covered. Add forgotten macho_dsym_companion case. llvm-svn: 185139	2013-06-28 09:44:05 +00:00
Patrik Hagglund	9c5af31019	Suppress GCC "control reaches end of non-void function" warning. llvm-svn: 185136	2013-06-28 06:54:05 +00:00
Manman Ren	5bedd08922	Debug Info: clean up usage of Verify. No functionality change. It should suffice to check the type of a debug info metadata, instead of calling Verify. For cases where we know the type of a DI metadata, use assert. Also update testing cases to make them conform to the format of DI classes. llvm-svn: 185135	2013-06-28 05:43:10 +00:00
David Blaikie	a376b6ab57	Integrate Assembler: Support X86_64_DTPOFF64 relocations llvm-svn: 185131	2013-06-28 04:24:32 +00:00
Rafael Espindola	8eae838533	Improvements to unique_file and createUniqueDirectory. * Don't try to create parent directories in unique_file. It had two problems: * It violates the contract that it is atomic. If the directory creation succeeds and the file creation fails, we would return an error but the file system was modified. * When creating a temporary file clang would have to first check if the parent directory existed or not to avoid creating one when it was not supposed to. * More efficient implementations of createUniqueDirectory and the unique_file that produces only the file name. Now all 3 just call into a static function passing what they want (name, file or directory). Clang also has to be updated, so tests might fail if a bot picks up this commit and not the corresponding clang one. llvm-svn: 185126	2013-06-28 03:48:47 +00:00
Rafael Espindola	4cfcd31f11	Don't ask for a mode when we are not keeping the file. llvm-svn: 185123	2013-06-28 01:05:47 +00:00
Arnold Schwaighofer	d6aee045b3	LoopVectorize: Preserve debug location info radar://14169017 llvm-svn: 185122	2013-06-28 00:38:54 +00:00
Matt Arsenault	64654e8350	Fix using arg_end() - arg_begin() instead of arg_size() llvm-svn: 185121	2013-06-28 00:25:40 +00:00
Peter Collingbourne	205194c023	Rename DIBuilder::createNullPtrType to createUnspecifiedType and introduce a zero-argument createNullPtrType function for creating the canonical nullptr type. Differential Revision: http://llvm-reviews.chandlerc.com/D1050 llvm-svn: 185114	2013-06-27 22:50:59 +00:00
Michael Gottesman	cbe62d543c	Revert "Revert "[APFloat] Removed APFloat constructor which initialized to either zero/NaN but allowed you to arbitrarily set the category of the float."" This reverts commit r185099. Looks like both the ppc-64 and mips bots are still failing after I reverted this change. Since: 1. The mips bot always performs a clean build, 2. The ppc64-bot failed again after a clean build (I asked the ppc-64 maintainers to clean the bot which they did... Thanks Will!), I think it is safe to assume that this change was not the cause of the failures that said builders were seeing. Thus I am recommitting. llvm-svn: 185111	2013-06-27 21:58:19 +00:00
Michael Gottesman	f4d4b7d828	Revert "[APFloat] Removed APFloat constructor which initialized to either zero/NaN but allowed you to arbitrarily set the category of the float." This reverts commit r185095. This is causing a FileCheck failure on the 3dnow intrinsics on at least the mips/ppc bots but not on the x86 bots. Reverting while I figure out what is going on. llvm-svn: 185099	2013-06-27 20:40:11 +00:00
Arnold Schwaighofer	c0e3a07c99	LoopVectorize: Cache edge masks created during if-conversion Otherwise, we end up with an exponential IR blowup. Fixes PR16472. llvm-svn: 185097	2013-06-27 20:31:06 +00:00
Michael Gottesman	1b9f5c3f5a	[APFloat] Removed APFloat constructor which initialized to either zero/NaN but allowed you to arbitrarily set the category of the float. The category which an APFloat belongs to should be dependent on the actual value that the APFloat has, not be arbitrarily passed in by the user. This will prevent inconsistency bugs where the category and the actual value in APFloat differ. I also fixed up all of the references to this constructor (which were only in LLVM). llvm-svn: 185095	2013-06-27 19:50:52 +00:00
Nadav Rotem	edc63580c8	Get rid of the unused class member. llvm-svn: 185086	2013-06-27 17:54:10 +00:00
Nadav Rotem	311bda941c	CostModel: improve the cost model for load/store of non power-of-two types such as <3 x float>, which are popular in graphics. llvm-svn: 185085	2013-06-27 17:52:04 +00:00
Arnold Schwaighofer	ccd78deec7	LoopVectorize: Use vectorized loop invariant gep index anchored in loop Use vectorized instruction instead of original instruction anchored in the original loop. Fixes PR16452 and t2075.c of PR16455. llvm-svn: 185081	2013-06-27 15:11:55 +00:00
Serge Pavlov	252358c083	Use MCFillFragment for zero-initialized data. It fixes PR16338 (ICE when compiling very large two-dimensional array). Differential Revision: http://llvm-reviews.chandlerc.com/D1043 llvm-svn: 185080	2013-06-27 14:35:03 +00:00
Joey Gouly	42f1898415	Add a Subtarget feature 'v8fp' to the ARM backend. llvm-svn: 185073	2013-06-27 11:49:26 +00:00
Benjamin Kramer	c8a5213f3b	Remove unused variable. llvm-svn: 185072	2013-06-27 11:26:41 +00:00
Benjamin Kramer	a42cac6f28	Don't cast away constness. llvm-svn: 185071	2013-06-27 11:07:42 +00:00
Richard Sandiford	609a7eb0a1	[SystemZ] Allow LA and LARL to be rematerialized llvm-svn: 185069	2013-06-27 09:42:10 +00:00
Richard Sandiford	a2d164d53e	[SystemZ] Allow immediate moves to be rematerialized llvm-svn: 185068	2013-06-27 09:38:48 +00:00
Richard Sandiford	964ffa104f	[SystemZ] Add conditional store patterns Add pseudo conditional store instructions, so that we use: branch foo: store foo: instead of: load branch foo: move foo: store z196 has real 32-bit and 64-bit conditional stores, but we don't use any z196 instructions yet. llvm-svn: 185065	2013-06-27 09:27:40 +00:00
Rafael Espindola	ac62522b9b	Add a convenience createUniqueDirectory function. There are a few valid situations where we care about the structure inside a directory, but not about the directory itself. A simple example is for unit testing directory traversal. PathV1 had a function like this, add one to V2 and port existing users of the created temp file and delete it hack to using it. llvm-svn: 185059	2013-06-27 03:45:31 +00:00
Arnold Schwaighofer	18efca433e	LoopVectorize: Don't store a reversed value in the vectorized value map When we store values for reversed induction stores we must not store the reversed value in the vectorized value map. Another instruction might use this value. This fixes 3 test cases of PR16455. llvm-svn: 185051	2013-06-27 00:45:41 +00:00
Michael Gottesman	fe055b3806	Added support for the Builtin attribute. The Builtin attribute is an attribute that can be placed on function call site that signal that even though a function is declared as being a builtin, rdar://problem/13727199 llvm-svn: 185049	2013-06-27 00:25:01 +00:00
Nadav Rotem	195bbbe54b	No need to use a Set when a vector would do. llvm-svn: 185047	2013-06-27 00:14:13 +00:00
Nadav Rotem	897ca82595	SLP: When searching for vectorization opportunities scan the blocks in post-order because we grow chains upwards. llvm-svn: 185041	2013-06-26 23:44:45 +00:00
Nadav Rotem	962b32446e	SLP: Don't erase instructions during vectorization because it prevents the outer loops from iterating over the instructions. llvm-svn: 185040	2013-06-26 23:43:23 +00:00
Michael Gottesman	98d0fadcd5	In InstCombine{AddSub,MulDivRem} convert APFloat.isFiniteNonZero() && !APFloat.isDenormal => APFloat.isNormal. llvm-svn: 185037	2013-06-26 23:17:31 +00:00
Michael Gottesman	4852d1e28a	[APFloat] Convert all references to fcNormal to references to isFiniteNonZero(). Currently inside APFloat fcNormal still implies the old definition of Normal (i.e. isFiniteNonZero) instead of the proper IEEE-754R definition that the external method isNormal() uses. This patch prepares for the internal switch inside APFloat by converting all references that check if a category is fcNormal directly with an indirect call via isFiniteNonZero(). llvm-svn: 185036	2013-06-26 23:17:28 +00:00
Eric Christopher	2004fbdda9	Revert "Debug Info: clean up usage of Verify." as it's breaking bots. This reverts commit r185020 llvm-svn: 185032	2013-06-26 22:44:57 +00:00
Reid Kleckner	780d4ccf9e	Fix a crash bug in dumping options with groups Option groups don't have prefixes. Option dumping is basically dead code unless there is something wrong with the option table, so this isn't an important crasher. llvm-svn: 185031	2013-06-26 22:43:37 +00:00
Stephen Lin	d1d52203d6	Clarify and doxygen-ify comments llvm-svn: 185030	2013-06-26 22:27:50 +00:00
Chad Rosier	2b164134e8	[Mips Disassembler] Have the DecodeCCRRegisterClass function use the getReg function to lookup the proper tablegen'ed register enumeration. Previously, it was using the encoded value directly. llvm-svn: 185026	2013-06-26 22:23:32 +00:00
Stephen Lin	25c0cb5ba9	ARM: Proactively ensure that the LowerCallResult hack for 'this'-returns is not used for incompatible calling conventions. (Currently, ARM 'this'-returns are handled in the standard calling convention case by treating R0 as preserved and doing some extra magic in LowerCallResult; this may not apply to calling conventions added in the future so this patch provides and documents an interface for indicating such) llvm-svn: 185024	2013-06-26 21:42:14 +00:00
Manman Ren	868703ebb8	Debug Info: clean up usage of Verify. No functionality change. It should suffice to check the type of a debug info metadata, instead of calling Verify. llvm-svn: 185020	2013-06-26 21:26:10 +00:00
Stephen Lin	98ccb9d1ce	Minor formatting fix to ARMBaseRegisterInfo::getCalleeSavedRegs llvm-svn: 185016	2013-06-26 20:19:06 +00:00
Rafael Espindola	9edd4f6c46	Rename PathV2 to just Path now that it is the only one. llvm-svn: 185015	2013-06-26 19:33:03 +00:00
Akira Hatanaka	677f305c97	[mips] Do not emit ".option pic0" if target is mips64. llvm-svn: 185012	2013-06-26 19:08:49 +00:00
Akira Hatanaka	1f2c22ad07	[mips] Improve code generation for constant multiplication using shifts, adds and subs. llvm-svn: 185011	2013-06-26 18:48:17 +00:00
Rafael Espindola	86155d2520	Use enums instead of raw octal values. Patch by 罗勇刚(Yonggang Luo). llvm-svn: 184971	2013-06-26 17:28:04 +00:00
Nadav Rotem	860aebf69a	Erase all of the instructions that we RAUWed llvm-svn: 184969	2013-06-26 17:16:09 +00:00
Joey Gouly	f4bea6681b	Add a subtarget feature 'v8' to the ARM backend. This allows for targeting the ARMv8 AArch32 variant. llvm-svn: 184967	2013-06-26 16:58:26 +00:00
Nadav Rotem	e0a5a586b8	Do not add cse-ed instructions into the visited map because we don't want to consider them as a candidate for replacement of instructions to be visited. llvm-svn: 184966	2013-06-26 16:54:53 +00:00
Tim Northover	2bf8ffa196	ARM: fix more cases where predication may or may not be allowed Unfortunately this addresses two issues (by the time I'd disentangled the logic it wasn't worth putting it back to half-broken): + Coprocessor instructions should all be predicable in Thumb mode. + BKPT should never be predicable. llvm-svn: 184965	2013-06-26 16:52:40 +00:00
Tim Northover	817190b1e4	ARM: allow predicated barriers in Thumb mode The barrier instructions are only "always-execute" in ARM mode, they can quite happily sit inside an IT block in Thumb. llvm-svn: 184964	2013-06-26 16:52:32 +00:00
Joey Gouly	a82562fd5b	Remove the 'generic' CPU from the ARM eabi attributes printer. Make v4 the default ARM architecture attribute, to match CodeGen. llvm-svn: 184962	2013-06-26 16:39:06 +00:00
Rafael Espindola	dc28d9e2d2	PathV1 is deprecated since the 18th of Dec 2010. Remove it. llvm-svn: 184960	2013-06-26 16:24:35 +00:00
Ulrich Weigand	c2c8aeb508	[PowerPC] Accept 17-bit signed immediates for addis The assembler currently strictly verifies that immediates for s16imm operands are in range (-32768 ... 32767). This matches the behaviour of the GNU assembler, with one exception: gas allows, as a special case, operands in an extended range (-65536 .. 65535) for the addis instruction only (and its extended mnemonic lis). The main reason for this seems to be to allow using unsigned 16-bit operands for lis, e.g. like lis %r1, 0xfedc. Since this has been supported by gas for a long time, and assembler source code seen "in the wild" actually exploits this feature, this patch adds equivalent support to LLVM for compatibility reasons. llvm-svn: 184946	2013-06-26 13:49:53 +00:00
Ulrich Weigand	66a94dc7aa	[PowerPC] Support symbolic u16imm operands Currently, all instructions taking s16imm operands support symbolic operands. However, for u16imm operands, we only support actual immediate integers. This causes the assembler to reject code like ori %r5, %r5, symbol@l This patch changes the u16imm operand definition to likewise accept symbolic operands. In fact, s16imm and u16imm can share the same encoding routine, now renamed to getImm16Encoding. llvm-svn: 184944	2013-06-26 13:49:15 +00:00
Amaury de la Vieuville	8a7e3e2195	ARM: operands should be explicit when disassembled llvm-svn: 184943	2013-06-26 13:39:07 +00:00
Venkatraman Govindaraju	9841400828	[Sparc]: Add memory operands for the frame references in the storeRegToStackSlot and loadRegFromStackSlot. llvm-svn: 184935	2013-06-26 12:40:16 +00:00
Elena Demikhovsky	eec89cbdd6	Fixed a comment. llvm-svn: 184933	2013-06-26 12:15:53 +00:00
Elena Demikhovsky	ea4d3808e5	Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. llvm-svn: 184931	2013-06-26 10:55:03 +00:00
Kostya Serebryany	16c49ed479	[asan] workaround for PR16277: don't instrument AllocaInstr with alignment more than the redzone size llvm-svn: 184928	2013-06-26 09:49:52 +00:00
Kostya Serebryany	07a94c6709	[asan] add option -asan-keep-uninstrumented-functions llvm-svn: 184927	2013-06-26 09:18:17 +00:00
Rafael Espindola	5c2b78684b	Remove calls to Path in #ifdefs that don't seem to be used in any of the bots :-( llvm-svn: 184920	2013-06-26 06:10:32 +00:00
Rafael Espindola	cfa225ba98	Fix the build when __APPLE__ is defined. llvm-svn: 184917	2013-06-26 05:25:44 +00:00
Rafael Espindola	1c94419f89	Remove sys::GetMainExecutable. llvm-svn: 184916	2013-06-26 05:05:37 +00:00
Rafael Espindola	07ffa7d08b	Port GetMainExecutable over to PathV2. I will remove the V1 version as soon as I change clang in the next commit. llvm-svn: 184914	2013-06-26 05:01:35 +00:00
Rafael Espindola	2d8fd3934d	Remove PathWithStatus. llvm-svn: 184910	2013-06-26 04:15:55 +00:00
Nick Lewycky	e6e35eda7b	dbgs() << Instruction doesn't print a newline on the end any more. Update these debug statements to add a missing newline. Also canonicalize to '\n' instead of "\n"; the latter calls a function with a loop the former does not. llvm-svn: 184897	2013-06-26 00:30:18 +00:00
Adrian Prantl	cf96a5d3de	s/C++0x/C++11/ llvm-svn: 184892	2013-06-25 23:42:37 +00:00
Nadav Rotem	a8fba65221	SLPVectorizer: support slp-vectorization of PHINodes between basic blocks llvm-svn: 184888	2013-06-25 23:04:09 +00:00
Jakob Stoklund Olesen	a4ca837638	Print block frequencies in decimal form. This is easier to read than the internal fixed-point representation. If anybody knows the correct algorithm for converting fixed-point numbers to base 10, feel free to fix it. llvm-svn: 184881	2013-06-25 21:57:38 +00:00
Tom Stellard	3854f648a8	R600: Use new getNamedOperandIdx function generated by TableGen llvm-svn: 184880	2013-06-25 21:22:18 +00:00
Arnold Schwaighofer	730386bc34	X86 cost model: Vectorizing integer division is a bad idea radar://14057959 llvm-svn: 184872	2013-06-25 19:14:09 +00:00
Bob Wilson	f1bf7886b8	Fix SROA to avoid unnecessary scalar conversions for 1-element vectors. When a 1-element vector alloca is promoted, a store instruction can often be rewritten without converting the value to a scalar and using an insertelement instruction to stuff it into the new alloca. This patch just adds a check to skip that conversion when it is unnecessary. This turns out to be really important for some ARM Neon operations where <1 x i64> is used to get around the fact that i64 is not a legal type. llvm-svn: 184870	2013-06-25 19:09:50 +00:00
Manman Ren	e8f3721e22	Remove unused code. No functionality change. llvm-svn: 184866	2013-06-25 18:49:55 +00:00
Bill Wendling	7d3ed86eaa	The GCDA 402 format won't have a second checksum either. llvm-svn: 184864	2013-06-25 18:13:52 +00:00
Ulrich Weigand	3e23cfcde6	[PowerPC] Support @got modifier Add VK_... values and relocation types necessary to support the @got family of modifiers. Used by the asm parser only. llvm-svn: 184860	2013-06-25 16:49:50 +00:00
Rafael Espindola	4ff51c0bcf	Move GetEXESuffix to the one place it is used. llvm-svn: 184853	2013-06-25 14:42:30 +00:00
Rafael Espindola	c818977b7b	Remove sys::PathSeparator. llvm-svn: 184852	2013-06-25 14:32:45 +00:00
Aaron Watry	7dc8fb34e1	R600: Consolidate expansion of v2i32/v4i32 ops for EG/SI By default, we expand these operations for both EG and SI. Move the duplicated code into a common space for now. If the targets ever actually implement these operations as instructions, we can override that in the relevant target. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184848	2013-06-25 13:55:57 +00:00
Aaron Watry	1ee98e598b	R600/SI: Expand xor v2i32/v4i32 Add test cases for both vector sizes on SI and also add v2i32 test for EG. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184846	2013-06-25 13:55:52 +00:00
Aaron Watry	73046ba281	R600/SI: Expand urem of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Note: I followed the guidance of the v4i32 EG check... UREM produces really complex code, so let's just check that the instruction was lowered successfully. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184844	2013-06-25 13:55:46 +00:00
Aaron Watry	c00dd00a32	R600/SI: Expand udiv v[24]i32 for SI and v2i32 for EG Also add lit test for both cases on SI, and v2i32 for evergreen. Note: I followed the guidance of the v4i32 EG check... UDIV produces really complex code, so let's just check that the instruction was lowered successfully. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184843	2013-06-25 13:55:43 +00:00
Aaron Watry	0b4bbc3714	R600/SI: Expand ashr of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184842	2013-06-25 13:55:40 +00:00
Aaron Watry	0bf6dc888a	R600/SI: Expand srl of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184841	2013-06-25 13:55:37 +00:00
Aaron Watry	eafbde78e9	R600/SI: Expand shl of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184840	2013-06-25 13:55:32 +00:00
Aaron Watry	d9f602bd35	R600/SI: Expand or of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184839	2013-06-25 13:55:29 +00:00
Aaron Watry	688f496d43	R600/SI: Expand mul of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184838	2013-06-25 13:55:26 +00:00
Aaron Watry	35d817a307	R600/SI: Expand and of v2i32/v4i32 for SI Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184837	2013-06-25 13:55:23 +00:00
Benjamin Kramer	3b56c8dd50	BlockFrequency: Bump up the entry frequency a bit. This is a band-aid to fix the most severe regressions we're seeing from basing spill decisions on block frequencies, until we have a better solution. llvm-svn: 184835	2013-06-25 13:34:40 +00:00
Ulrich Weigand	e5affb6d66	[PowerPC] Add extended rotate/shift mnemonics This adds all missing extended rotate/shift mnemonics to the asm parser. llvm-svn: 184834	2013-06-25 13:17:41 +00:00
Ulrich Weigand	da66ee086f	[PowerPC] Add rldcr/rldic instructions This adds patterns for the rldcr and rldic instructions (the last instructions from the rotate/shift family that were missing). They are currently used only by the asm parser. llvm-svn: 184833	2013-06-25 13:17:10 +00:00
Ulrich Weigand	d51b3cd01b	[PowerPC] Add extended subtract mnemonics This adds support for the extended subtract mnemonics to the asm parser: subi subis subic subic. sub sub. subc subc. llvm-svn: 184832	2013-06-25 13:16:48 +00:00
Justin Holewinski	f085b28335	[NVPTX] Default pointer type doesn't make sense for getParamSymbol() llvm-svn: 184831	2013-06-25 12:22:21 +00:00
Nadav Rotem	8fcb707c24	Fix a typo in the code that collected the costs recursively. llvm-svn: 184827	2013-06-25 05:30:56 +00:00
Rafael Espindola	35fe018057	keep only the StringRef version of getFileOrSTDIN. llvm-svn: 184826	2013-06-25 05:28:34 +00:00
Rafael Espindola	525437f64a	Don't assume ResultPath is null terminated. llvm-svn: 184824	2013-06-25 04:23:46 +00:00
Andrew Trick	18751012bb	Revert "Temporarily enable MI-Sched on X86." This reverts commit 98a9b72e8c56dc13a2617de84503a3d78352789c. llvm-svn: 184823	2013-06-25 02:48:58 +00:00
Tom Stellard	fa154aaa39	R600/SI: Report unaligned memory accesses as legal for > 32-bit types In reality, some unaligned memory accesses are legal for 32-bit types and smaller too, but it all depends on the address space. Allowing unaligned loads/stores for > 32-bit types is mainly to prevent the legalizer from splitting one load into multiple loads of smaller types. https://bugs.freedesktop.org/show_bug.cgi?id=65873 llvm-svn: 184822	2013-06-25 02:39:35 +00:00
Tom Stellard	ddc78167d3	R600: Add support for i32 loads from the constant address space on Cayman Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 184821	2013-06-25 02:39:30 +00:00
Tom Stellard	b4ab710b43	R600/SI: Add support for v4i32 and v4f32 kernel args Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 184820	2013-06-25 02:39:25 +00:00
Tom Stellard	0300b2cb3a	R600: Fix typo in R600Schedule.td This should only make a difference in programs that use a lot of the vector ALU instructions like BFI_INT and BIT_ALIGN. There is a slight improvement in the phatk bitcoin mining kernel with this patch on Evergreen (vector size == 1): Before: 1173 Instruction Groups / 9520 dwords After: 1167 Instruction Groups / 9510 dwords Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 184819	2013-06-25 02:39:20 +00:00
NAKAMURA Takumi	c0b121df0b	PPCAsmParser.cpp: Quote "@l/@ha" in comments. [-Wdocumentation] llvm-svn: 184809	2013-06-25 01:14:20 +00:00
Rafael Espindola	df18681407	Cleanup in unique_file when we only want the name. This is really ugly, but it is no worse than what we have in clang right now and it is better to get it working first and clean/optimize it afterwards. Will be tested from clang in the next patch. llvm-svn: 184802	2013-06-25 00:49:40 +00:00
Eric Christopher	85478a853a	80-column and tab character fixes. llvm-svn: 184792	2013-06-24 23:20:02 +00:00
Eric Christopher	6abbfb6bb7	Formatting. llvm-svn: 184788	2013-06-24 21:34:55 +00:00
Adrian Prantl	3768276ade	typo. llvm-svn: 184783	2013-06-24 21:19:43 +00:00
Eric Christopher	50e46e8378	Use const references instead of pointers to references that are never modified. No functional change. llvm-svn: 184781	2013-06-24 21:07:27 +00:00
Ulrich Weigand	655ef3283d	[PowerPC] Support some miscellaneous mnemonics in the asm parser This adds support for the following extended mnemonics: xnop mr. not not. la llvm-svn: 184767	2013-06-24 18:08:03 +00:00
David Blaikie	2bf3b1e948	DebugInfo: DIBuilder changes to match DIEnumerator changes in r184694 Representing enumerators by int64 instead of uint64 for now. At some point we need to address the underlying issue of representation depending on the specific enumeration. llvm-svn: 184761	2013-06-24 17:34:33 +00:00
Benjamin Kramer	bc7599b681	PPC: Remove default case from fully covered switch. llvm-svn: 184758	2013-06-24 17:03:25 +00:00
Aaron Watry	d3d63bb8fd	R600: Fix spelling error in comment our -> or llvm-svn: 184756	2013-06-24 16:57:57 +00:00
Ulrich Weigand	719e95004a	[PowerPC] Add predicted forms of branches This adds support for the predicted forms of branches (+/-). There are three cases to consider: - Branches using a PPC::Predicate code For these, I've added new PPC::Predicate codes corresponding to the BO values for predicted branch forms, and updated insn printing to print them correctly. I've also added new aliases for the asm parser matching the new forms. - bt/bf I've added new aliases matching to gBC etc. - bd(n)z variants I've added new instruction patterns for the predicted forms. In all cases, the new patterns are used for the asm parser only. (The new infrastructure ought to be sufficient to allow use by the compiler too at some point.) llvm-svn: 184754	2013-06-24 16:52:04 +00:00
Nadav Rotem	eff545235c	Rename the variable to fix a warning. Thanks Andy Gibbs. llvm-svn: 184749	2013-06-24 15:59:47 +00:00
NAKAMURA Takumi	c94525d76e	NVPTXTargetObjectFile.h: Initialize some pointers as NULL in the constructor of NVPTXTargetObjectFile. ~NVPTXTargetObjectFile() tries to delete them. It caused crash on some hosts since r184595. llvm-svn: 184728	2013-06-24 13:19:41 +00:00
Ulrich Weigand	5349a508ea	[PowerPC] Add t/f branch mnemonics to asm parser This adds the bt/bf/bd(n)zt/bd(n)zf mnemonics as aliases for the asm parser, resolving to the generic conditional patterns. llvm-svn: 184725	2013-06-24 12:49:20 +00:00
Arnold Schwaighofer	0a98597e80	Reapply 184685 after the SetVector iteration order fix. This should hopefully have fixed the stage2/stage3 miscompare on the dragonegg testers. "LoopVectorize: Use the dependence test utility class We now no longer need alias analysis - the cases that alias analysis would handle are now handled as accesses with a large dependence distance. We can now vectorize loops with simple constant dependence distances. for (i = 8; i < 256; ++i) { a[i] = a[i+4] * a[i+8]; } for (i = 8; i < 256; ++i) { a[i] = a[i-4] * a[i-8]; } We would be able to vectorize about 200 more loops (in many cases the cost model instructs us not to) in the test suite now. Results on x86-64 are a wash. I have seen one degradation in ammp. Interestingly, the function in which we now vectorize a loop is never executed so we probably see some instruction cache effects. There is a 2% improvement in h264ref. There is one or the other TSCV loop kernel that speeds up. radar://13681598" llvm-svn: 184724	2013-06-24 12:09:15 +00:00
Arnold Schwaighofer	75b76bf92f	LoopVectorize: Use SetVector for the access set We are creating the runtime checks using this set so we need a deterministic iteration order. llvm-svn: 184723	2013-06-24 12:09:12 +00:00
Ulrich Weigand	5f83058705	[PowerPC] Support generic conditional branches in asm parser This adds instruction patterns to cover the generic forms of the conditional branch instructions. This allows the assembler to support the generic mnemonics. The compiler will still generate the various specific forms of the instruction that were already supported. llvm-svn: 184722	2013-06-24 11:55:21 +00:00
Ulrich Weigand	0dd44327b0	[PowerPC] Support absolute branches There is currently only limited support for the "absolute" variants of branch instructions. This patch adds support for the absolute variants of all branches that are currently otherwise supported. This requires adding new fixup types so that the correct variant of relocation type can be selected by the object writer. While the compiler will continue to usually choose the relative branch variants, this will allow the asm parser to fully support the absolute branches, with either immediate (numerical) or symbolic target addresses. No change in code generation intended. llvm-svn: 184721	2013-06-24 11:03:33 +00:00
Ulrich Weigand	19b0f3dd0c	[PowerPC] Support bd(n)zl and bd(n)zlrl This adds support for the bd(n)zl and bd(n)zlrl instructions. The patterns are currently used for the asm parser only. llvm-svn: 184720	2013-06-24 11:02:38 +00:00
Ulrich Weigand	5c11c4d795	[PowerPC] Support b(cond)l in the asm parser This patch adds support for the conditional variants of bl. The pattern is currently used by the asm parser only. llvm-svn: 184719	2013-06-24 11:02:19 +00:00
Ulrich Weigand	3a0dd840c3	[PowerPC] Support blrl and variants in the asm parser This patch adds support for blrl and its conditional variants. The patterns are (currently) used for the asm parser only. llvm-svn: 184718	2013-06-24 11:01:55 +00:00
Vladimir Medic	4c032a5386	This patch introduces RegisterOperand class into Mips FPU instruction definitions and adds dedicated parser methods to MipsAsmParser. It is the first in a series of patches that should fix the problems with parsing Mips FPU instructions and optimize the code in MipsAsmParser. llvm-svn: 184716	2013-06-24 10:05:34 +00:00
Michael Gottesman	a893ca16f4	[APFloat] Added support for parsing float strings which contain {inf,-inf,NaN,-NaN}. llvm-svn: 184713	2013-06-24 09:58:05 +00:00
Michael Gottesman	82b2233f5f	[APFloat] Added make{Zero,Inf} methods and implemented get{Zero,Inf} on top of them. llvm-svn: 184712	2013-06-24 09:58:02 +00:00
Michael Gottesman	9b847e431c	[APFloat] Removed an assert from significandParts() which says that one can only access the significand of FiniteNonZero/NaN floats. The method significandParts() is a helper method meant to ease access to APFloat's significand by allowing the user to not need to be aware of whether or not the APFloat is using memory allocated in the instance itself or in an external array. This assert says that one can only access the significand of FiniteNonZero/NaN floats. This makes it cumbersome and more importantly dangerous when one wishes to zero out the significand of a zero/infinity value since one will have to deal with the aforementioned quandary related to how the memory in APFloat is allocated. llvm-svn: 184711	2013-06-24 09:57:59 +00:00
Michael Gottesman	181a1cd575	[APFloat] Rename macro convolve => PackCategoriesIntoKey so that it is clear what APFloat is actually using said macro for. In the context of APFloat, seeing a macro called convolve suggests that APFloat is using said value in some sort of convolution somewhere in the source code. This is misleading. I also added a documentation comment to the macro. llvm-svn: 184710	2013-06-24 09:57:57 +00:00
Amaury de la Vieuville	550e6ef18f	ARM: check predicate bits for thumb instructions When encoded to thumb, VFP instructions and VMOV/VDUP between scalar and core registers must have their predicate bits set to 0b1110. llvm-svn: 184707	2013-06-24 09:15:01 +00:00
Amaury de la Vieuville	37b2270352	ARM: rGPR is meant to be unpredictable, not undefined llvm-svn: 184706	2013-06-24 09:14:54 +00:00
Andrew Trick	716b547d13	Temporarily enable MI-Sched on X86. Sorry for the unit test churn. I'll try to make the change permanent next time. llvm-svn: 184705	2013-06-24 09:13:20 +00:00
Amaury de la Vieuville	5a373a526e	ARM: fix thumb1 nop decoding. In thumb1, NOP is a pseudo-instruction equivalent to mov r8, r8. However, the disassembler should not use this alias. llvm-svn: 184703	2013-06-24 09:11:53 +00:00
Amaury de la Vieuville	6eecd3f2cb	ARM: fix IT decoding mask == 0 -> UNPRED llvm-svn: 184702	2013-06-24 09:11:45 +00:00
Amaury de la Vieuville	0d7ac788f2	ARM: enable decoding of pc-relative PLD/PLI llvm-svn: 184701	2013-06-24 09:11:38 +00:00
Chandler Carruth	9788884067	Add a flag to defer vectorization into a phase after the inliner and its CGSCC pass manager. This should insulate the inlining decisions from the vectorization decisions, however it may have both compile time and code size problems so it is just an experimental option right now. Adding this based on a discussion with Arnold and it seems at least worth having this flag for us to both run some experiments to see if this strategy is workable. It may solve some of the regressions seen with the loop vectorizer. llvm-svn: 184698	2013-06-24 07:21:47 +00:00
Arnold Schwaighofer	f022b11b08	Revert "LoopVectorize: Use the dependence test utility class" This reverts commit cbfa1ca993363ca5c4dbf6c913abc957c584cbac. We are seeing a stage2 and stage3 miscompare on some dragonegg bots. llvm-svn: 184690	2013-06-24 06:10:41 +00:00
Michael Gottesman	21ef64bbfd	[APFloat] Rename llvm::exponent_t => llvm::APFloat::ExponentType. exponent_t is only used internally in APFloat and no exponent_t values are exposed via the APFloat API. In light of such conditions it does not make any sense to gum up the llvm namespace with said type. Plus it makes it clearer that exponent_t is associated with APFloat. llvm-svn: 184686	2013-06-24 04:06:23 +00:00
Arnold Schwaighofer	c49cd1a668	LoopVectorize: Use the dependence test utility class. We now no longer need alias analysis - the cases that alias analysis would handle are now handled as accesses with a large dependence distance. We can now vectorize loops with simple constant dependence distances. for (i = 8; i < 256; ++i) { a[i] = a[i+4] * a[i+8]; } for (i = 8; i < 256; ++i) { a[i] = a[i-4] * a[i-8]; } We would be able to vectorize about 200 more loops (in many cases the cost model instructs us not to) in the test suite now. Results on x86-64 are a wash. I have seen one degradation in ammp. Interestingly, the function in which we now vectorize a loop is never executed so we probably see some instruction cache effects. There is a 2% improvement in h264ref. There is one or the other TSVC loop kernel that speeds up. radar://13681598 llvm-svn: 184685	2013-06-24 03:55:48 +00:00
Arnold Schwaighofer	f9828b092b	LoopVectorize: Add utility class for checking dependency among accesses This class checks dependences by subtracting two Scalar Evolution access functions allowing us to catch very simple linear dependences. The checker assumes source order in determining whether vectorization is safe. We currently don't reorder accesses. Positive true dependencies need to be a multiple of VF otherwise we impede store-load forwarding. llvm-svn: 184684	2013-06-24 03:55:45 +00:00
Arnold Schwaighofer	67714fedcd	LoopVectorize: Add utility class for building sets of dependent accesses Sets of dependent accesses are built by unioning sets based on underlying objects. This class will be used by the upcoming dependence checker. llvm-svn: 184683	2013-06-24 03:55:44 +00:00
Nadav Rotem	6c2ae14dc5	SLP Vectorizer: Add support for vectorizing parts of the tree. Until now we detected the vectorizable tree and evaluated the cost of the entire tree. With this patch we can decide to trim out branches of the tree that are not profitable to vectorize. Also, increase the max depth from 6 to 12. In the worst possible case, where all of the code is made of diamond-shaped graphs, this can bring the cost to 2**10, but diamonds are not very common. llvm-svn: 184681	2013-06-24 02:52:43 +00:00
Andrew Trick	99b0b0ab75	Fix tail merging to assign the (more) correct BasicBlock when splitting. This makes it possible to write unit tests that are less susceptible to minor code motion, particularly copy placement. block-placement.ll covers this case with -pre-RA-sched=source which will soon be default. One incorrectly named block is already fixed, but without this fix, enabling new coalescing and scheduling would cause more failures. llvm-svn: 184680	2013-06-24 01:55:01 +00:00
Nadav Rotem	5f8e32a66f	SLP Vectorizer: Fix a bug in the code that does CSE on the generated gather sequences. Make sure that we don't replace and RAUW two sequences if one does not dominate the other. llvm-svn: 184674	2013-06-23 21:57:27 +00:00
Nadav Rotem	8aa1211383	SLP Vectorizer: Erase instructions outside the vectorizeTree method. The RAII builder location guard is saving a reference to instructions, so we can't erase instructions during vectorization. llvm-svn: 184671	2013-06-23 19:38:56 +00:00
David Blaikie	2075b3d872	DebugInfo: PR14404: Avoid truncating 64 bit values into 32 bits for ULEB128/SLEB128 generation llvm-svn: 184669	2013-06-23 18:31:11 +00:00
Andrew Trick	2fca851aaf	Add MI-Sched support for x86 macro fusion. This is an awful implementation of the target hook. But we don't have abstractions yet for common machine ops, and I don't see any quick way to make it table-driven. llvm-svn: 184664	2013-06-23 09:00:28 +00:00
Nadav Rotem	03f4c0b02d	SLP Vectorizer: Implement a simple CSE optimization for the gather sequences. llvm-svn: 184660	2013-06-23 06:15:46 +00:00
Nadav Rotem	3dc5b0a65a	SLP Vectorizer: Implement multi-block slp-vectorization. Rewrote the SLP-vectorization as a whole-function vectorization pass. It is now able to vectorize chains across multiple basic blocks. It still does not vectorize PHIs, but this should be easy to do now that we scan the entire function. I removed the support for extracting values from trees. We are now able to vectorize more programs, but there are some serious regressions in many workloads (such as flops-6 and mandel-2). llvm-svn: 184647	2013-06-22 21:34:10 +00:00
David Blaikie	88d5262317	DebugInfo: Support (using GNU extensions) for template template parameters and parameter packs llvm-svn: 184643	2013-06-22 18:59:11 +00:00
Chad Rosier	d00211e479	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00
Benjamin Kramer	bd4cfec8c9	Revert "FunctionAttrs: Merge attributes once instead of doing it for every argument." It doesn't work as I intended it to. This reverts commit r184638. llvm-svn: 184641	2013-06-22 16:56:32 +00:00
Benjamin Kramer	e0fbc3ba1f	FunctionAttrs: Merge attributes once instead of doing it for every argument. It has become an expensive operation. No functionality change. llvm-svn: 184638	2013-06-22 15:51:19 +00:00

62262 Commits