llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 04:52:54 +02:00

Author	SHA1	Message	Date
Saleem Abdulrasool	75c162a52d	ARM: enable tail call optimisation on Thumb 2 Tail call optimisation was previously disabled on all targets other than iOS5.0+. This enables the tail call optimisation on all Thumb 2 capable platforms. The test adjustments are to remove the IR hint "tail" to function invocation. The tests were designed assuming that tail call optimisations would not kick in which no longer holds true. llvm-svn: 203575	2014-03-11 15:09:44 +00:00
Erik Verbruggen	11cc704d2c	Fix crash in PRE. After r203553 overflow intrinsics and their non-intrinsic (normal) instruction get hashed to the same value. This patch prevents PRE from moving an instruction into a predecessor block, and trying to add a phi node that gets two different types (the intrinsic result and the non-intrinsic result), resulting in a failing assert. llvm-svn: 203574	2014-03-11 15:07:32 +00:00
Tim Northover	d658ec1424	ARM: simplify EmitAtomicBinary64 ATOMIC_STORE operations always get here as a lowered ATOMIC_SWAP, so there's no need for any code to handle them specially. There should be no functionality change so no tests. llvm-svn: 203567	2014-03-11 13:19:55 +00:00
Benjamin Kramer	c4a4a8061a	Remove copy ctors that did the same thing as the default one. The code added nothing but potentially disabled move semantics and made types non-trivially copyable. llvm-svn: 203563	2014-03-11 11:32:49 +00:00
Tim Northover	68c567a38a	IR: add a second ordering operand to cmpxhg for failure The syntax for "cmpxchg" should now look something like: cmpxchg i32* %addr, i32 42, i32 3 acquire monotonic where the second ordering argument gives the required semantics in the case that no exchange takes place. It should be no stronger than the first ordering constraint and cannot be either "release" or "acq_rel" (since no store will have taken place). rdar://problem/15996804 llvm-svn: 203559	2014-03-11 10:48:52 +00:00
Erik Verbruggen	c2bf18261b	GVN: fix hashing of extractvalue. My last commit did not add the indexes to the hashed value for extractvalue. Adding that back in. llvm-svn: 203558	2014-03-11 10:21:30 +00:00
Erik Verbruggen	638ff95018	GVN: merge overflow intrinsics with non-overflow instructions. When an overflow intrinsic is followed by a non-overflow instruction, replace the latter with an extract. For example: %sadd = tail call { i32, i1 } @llvm.sadd.with.overflow.i32(i32 %a, i32 %b) %sadd3 = add i32 %a, %b Here the add statement will be replaced by an extract. When an overflow intrinsic follows a non-overflow instruction, a clone of the intrinsic is inserted before the normal instruction, which makes it the same as the previous case. Subsequent runs of GVN can then clean up the duplicate instructions and insert the extract. This fixes PR8817. llvm-svn: 203553	2014-03-11 09:36:48 +00:00
Saleem Abdulrasool	caaf63404a	Object: rename ARMV7 to ARMNT The official specifications state the name to be ARMNT (as per the Microsoft Portable Executable and Common Object Format Specification v8.3). llvm-svn: 203530	2014-03-11 03:08:37 +00:00
Duncan P. N. Exon Smith	f9624311ce	Cleanup whitespace llvm-svn: 203529	2014-03-11 02:44:45 +00:00
Matt Arsenault	16c4bdf77e	R600: Calculate store mask instead of using switch. llvm-svn: 203527	2014-03-11 01:38:53 +00:00
Jim Grosbach	3b6ef12947	X86: Enable ISel of 16-bit MOVBE instructions. When the MOVBE instructions are available, use them for 16-bit endian swapping as well as for 32 and 64 bit. The patterns were already present on the instructions, but weren't being matched because the operation was unconditionally marked to 'Expand.' Change that to be conditional on whether the MOVBE instructions are available. Use 'rolw' to implement the in-register version (32 and 64 bit have the dedicated 'bswap' instruction for that). Patch by Louis Gerbarg <lgg@apple.com>. rdar://15479984 llvm-svn: 203524	2014-03-11 00:44:14 +00:00
Evan Cheng	9a155c5f78	Follow up to r203488. Code clean up to eliminate a lot of copy+paste. llvm-svn: 203520	2014-03-11 00:24:20 +00:00
Matt Arsenault	ea5d59f5ac	Remove incomplete comment llvm-svn: 203518	2014-03-11 00:01:37 +00:00
Matt Arsenault	3595b7ee79	Move trivial getter into header. llvm-svn: 203517	2014-03-11 00:01:34 +00:00
Matt Arsenault	805d9618a9	Use .data() instead of &x[0] llvm-svn: 203516	2014-03-11 00:01:31 +00:00
Matt Arsenault	5ce0aee456	Fix indentation llvm-svn: 203515	2014-03-11 00:01:27 +00:00
Matt Arsenault	f90ec08530	Fix non 2-space indentation. llvm-svn: 203514	2014-03-11 00:01:25 +00:00
Duncan P. N. Exon Smith	2635636165	Module: Don't rename in getOrInsertFunction() During LTO, user-supplied definitions of C library functions often exist. -instcombine uses Module::getOrInsertFunction() to get a handle on library functions (e.g., @puts, when optimizing @printf). Previously, Module::getOrInsertFunction() would rename any matching functions with local linkage, and create a new declaration. In LTO, this is the opposite of desired behaviour, as it skips by the user-supplied version of the library function and creates a new undefined reference which the linker often cannot resolve. After some discussing with Rafael on the list, it looks like it's undesired behaviour. If a consumer actually needs this behaviour, we should add new API with a more explicit name. I added two testcases: one specifically for the -instcombine behaviour and one for the LTO flow. <rdar://problem/16165191> llvm-svn: 203513	2014-03-10 23:42:28 +00:00
Raul E. Silvera	9f07f16d94	When analyzing vectors of element type that require legalization, the legalization cost must be included to get an accurate estimation of the total cost of the scalarized vector. The inaccurate cost triggered unprofitable SLP vectorization on 32-bit X86. Summary: Include legalization overhead when computing scalarization cost Reviewers: hfinkel, nadav CC: chandlerc, rnk, llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2992 llvm-svn: 203509	2014-03-10 22:59:13 +00:00
Diego Novillo	dd37be24ca	Use discriminator information in sample profiles. Summary: When the sample profiles include discriminator information, use the discriminator values to distinguish instruction weights in different basic blocks. This modifies the BodySamples mapping to map <line, discriminator> pairs to weights. Instructions on the same line but different blocks, will use different discriminator values. This, in turn, means that the blocks may have different weights. Other changes in this patch: - Add tests for positive values of line offset, discriminator and samples. - Change data types from uint32_t to unsigned and int and do additional validation. Reviewers: chandlerc CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2857 llvm-svn: 203508	2014-03-10 22:41:28 +00:00
Justin Bogner	88d0488496	IR: Slightly more verbose error in Verifier Extend the error message generated by the Verifier when an intrinsic name does not match the expected mangling to include the expected name. Simplifies debugging. Patch by Philip Reames! llvm-svn: 203490	2014-03-10 21:22:44 +00:00
Benjamin Kramer	108d24886e	MemCpyOpt: When merging memsets also merge the trivial case of two memsets with the same destination. The testcase is from PR19092, but I think the bug described there is actually a clang issue. llvm-svn: 203489	2014-03-10 21:05:13 +00:00
Evan Cheng	b0fdca31bc	For functions with ARM target specific calling convention, when simplify-libcall optimize a call to a llvm intrinsic to something that invovles a call to a C library call, make sure it sets the right calling convention on the call. e.g. extern double pow(double, double); double t(double x) { return pow(10, x); } Compiles to something like this for AAPCS-VFP: define arm_aapcs_vfpcc double @t(double %x) #0 { entry: %0 = call double @llvm.pow.f64(double 1.000000e+01, double %x) ret double %0 } declare double @llvm.pow.f64(double, double) #1 Simplify libcall (part of instcombine) will turn the above into: define arm_aapcs_vfpcc double @t(double %x) #0 { entry: %__exp10 = call double @__exp10(double %x) #1 ret double %__exp10 } declare double @__exp10(double) The pre-instcombine code works because calls to LLVM builtins are special. Instruction selection will chose the right calling convention for the call. However, the code after instcombine is wrong. The call to __exp10 will use the C calling convention. I can think of 3 options to fix this. 1. Make "C" calling convention just work since the target should know what CC is being used. This doesn't work because each function can use different CC with the "pcs" attribute. 2. Have Clang add the right CC keyword on the calls to LLVM builtin. This will work but it doesn't match the LLVM IR specification which states these are "Standard C Library Intrinsics". 3. Fix simplify libcall so the resulting calls to the C routines will have the proper CC keyword. e.g. %__exp10 = call arm_aapcs_vfpcc double @__exp10(double %x) #1 This works and is the solution I implemented here. Both solutions #2 and #3 would work. After carefully considering the pros and cons, I decided to implement #3 for the following reasons. 1. It doesn't change the "spec" of the intrinsics. 2. It's a self-contained fix. There are a couple of potential downsides. 1. There could be other places in the optimizer that is broken in the same way that's not addressed by this. 2. There could be other calling conventions that need to be propagated by simplify-libcall that's not handled. But for now, this is the fix that I'm most comfortable with. llvm-svn: 203488	2014-03-10 20:49:45 +00:00
Sasa Stankovic	37538d4bfa	[mips] Implement NaCl sandboxing of loads, stores and SP changes: * Add masking instructions before loads and stores (in MC layer). * Add masking instructions after SP changes (in MC layer). * Forbid loads, stores and SP changes in delay slots (in MI layer). Differential Revision: http://llvm-reviews.chandlerc.com/D2904 llvm-svn: 203484	2014-03-10 20:34:23 +00:00
Eli Bendersky	82b7097fc4	Make sure NVPTX doesn't emit symbol names that aren't valid in PTX. NVPTX, like the other backends, relies on generic symbol name sanitizing done by MCSymbol. However, the ptxas assembler is more stringent and disallows some additional characters in symbol names. See PR19099 for more details. llvm-svn: 203483	2014-03-10 20:05:42 +00:00
Tim Northover	1b5043b6fb	llvm-c: expose unnamedaddr field of globals Patch by Manuel Jacob. llvm-svn: 203482	2014-03-10 19:24:35 +00:00
Reed Kotler	e1cab9f9f1	Fix regression with -O0 for mips . llvm-svn: 203469	2014-03-10 16:31:25 +00:00
Benjamin Kramer	a461fb6a91	[C++11] Modernize the IR library a bit. No functionality change. llvm-svn: 203465	2014-03-10 15:03:06 +00:00
Daniel Sanders	cb9a997e5e	[mips][fp64] Add an implicit def to MFHC1 claiming that it reads the lower 32-bits of 64-bit FPR Summary: This is a white lie to workaround a widespread bug in the -mfp64 implementation. The problem is that none of the 32-bit fpu ops mention the fact that they clobber the upper 32-bits of the 64-bit FPR. This allows MFHC1 to be scheduled on the wrong side of most 32-bit FPU ops. Fixing that requires a major overhaul of the FPU implementation which can't be done right now due to time constraints. MFHC1 is one of two affected instructions. These instructions are the only FPU instructions that don't read or write the lower 32-bits. We therefore pretend that it reads the bottom 32-bits to artificially create a dependency and prevent the scheduler changing the behaviour of the code. The other instruction is MTHC1 which will be fixed once I've have found a failing test case for it. The testcase is test-suite/SingleSource/UnitTests/Vector/simple.c when given TARGET_CFLAGS="-mips32r2 -mfp64 -mmsa". Reviewers: jacksprat, matheusalmeida Reviewed By: jacksprat Differential Revision: http://llvm-reviews.chandlerc.com/D2966 llvm-svn: 203464	2014-03-10 15:01:57 +00:00
Matheus Almeida	4fff721d3a	[mips] Assembly parser must invoke the target streamer to handle .set reorder macro. llvm-svn: 203459	2014-03-10 13:21:10 +00:00
Tim Northover	2f522988cc	AArch64: fix LowerCONCAT_VECTORS for new CodeGen. The function was making too many assumptions about its input: 1. The NEON_VDUP optimisation was far too aggressive, assuming (I think) that the input would always be BUILD_VECTOR. 2. We were treating most unknown concats as legal (by returning Op rather than SDValue()). I think only concats of pairs of vectors are actually legal. http://llvm.org/PR19094 llvm-svn: 203450	2014-03-10 09:34:07 +00:00
Craig Topper	2e98354909	[C++11] Remove 'virtual' keyword from methods marked with 'override' keyword. llvm-svn: 203444	2014-03-10 05:29:18 +00:00
Craig Topper	813f30aa7e	[C++11] Remove 'virtual' keyword from methods marked with 'override' keyword. llvm-svn: 203442	2014-03-10 03:53:12 +00:00
Chandler Carruth	81d2cd22df	[AArch64] Fix a use of uninitialized memory introduced in r203125, and caught by the MSan bootstrap build bot. This should hopefully get the bot green at long last. llvm-svn: 203441	2014-03-10 03:52:47 +00:00
Craig Topper	8abcd5aecd	De-virtualize a method since it doesn't override anything and isn't overridden itself. llvm-svn: 203440	2014-03-10 03:22:59 +00:00
Craig Topper	1735ce1ba2	[C++11] Add 'override' keyword to virtual methods that override their base class. llvm-svn: 203439	2014-03-10 03:19:03 +00:00
Chandler Carruth	8f25783c45	[TTI] There is actually no realistic way to pop TTI implementations off the stack of the analysis group because they are all immutable passes. This is made clear by Craig's recent work to use override systematically -- we weren't overriding anything for 'finalizePass' because there is no such thing. This is kind of a lame restriction on the API -- we can no longer push and pop things, we just set up the stack and run. However, I'm not invested in building some better solution on top of the existing (terrifying) immutable pass and legacy pass manager. llvm-svn: 203437	2014-03-10 02:45:14 +00:00
Chandler Carruth	c65b944bd3	[LCG] Ran clang-format over this too and it pointed out some fixes. llvm-svn: 203435	2014-03-10 02:14:14 +00:00
Chandler Carruth	6c5196f889	[PM] While I'm here, fix a few other clang-format issues. Pulls some lines under 80-columns, etc. llvm-svn: 203434	2014-03-10 02:12:14 +00:00
Craig Topper	e7c9ce2777	[C++11] Add 'override' keyword to virtual methods that override their base class. llvm-svn: 203433	2014-03-10 02:09:33 +00:00
David Majnemer	3c62f51f77	MC: Appease the buildbots This is fallout from r203429. llvm-svn: 203430	2014-03-10 01:04:18 +00:00
David Majnemer	c247617301	MC: Cleanup MCSectionMachO::ParseSectionSpecifier Split by comma once instead of multiple times. Moving this upfront makes the rest of the code considerably simpler. No functional change. llvm-svn: 203429	2014-03-10 00:55:07 +00:00
Venkatraman Govindaraju	a11c82efc1	[Sparc] Add support for decoding 'swap' instruction. llvm-svn: 203424	2014-03-09 23:32:07 +00:00
Bob Wilson	c6be80dd79	Fix inconsistent whitespace. llvm-svn: 203423	2014-03-09 23:17:28 +00:00
Craig Topper	1893daf524	[C++11] Add 'override' keyword to virtual methods that override their base class. llvm-svn: 203418	2014-03-09 18:03:14 +00:00
Benjamin Kramer	a82d000f71	StackColoring: Use range-based for loops. No functionality change. llvm-svn: 203415	2014-03-09 15:44:45 +00:00
Benjamin Kramer	5d89230ee2	MachineModuleInfo: Turn nested std::pairs into a proper struct. llvm-svn: 203414	2014-03-09 15:44:39 +00:00
Benjamin Kramer	488ab03435	SimplifyCFG: Simplify the weight scaling algorithm. No change in functionality. llvm-svn: 203413	2014-03-09 14:42:55 +00:00
Chandler Carruth	33aa4aeebe	[LCG] Simplify a bunch of the LCG code with range for loops and auto. Still more work to be done here to leverage C++11, but this clears out the glaring issues. llvm-svn: 203395	2014-03-09 12:20:34 +00:00
Chandler Carruth	5fc3eb73b0	[PM] Switch new pass manager from polymorphic_ptr to unique_ptr now that it is available. Also make the move semantics sufficiently correct to tolerate move-only passes, as the PassManagers are move-only passes. llvm-svn: 203391	2014-03-09 11:49:53 +00:00

1 2 3 4 5 ...

67633 Commits