llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
David Majnemer	f3d73f0449	[InstCombine] Don't transform (X+INT_MAX)>=(Y+INT_MAX) -> (X<=Y) This miscompile came about because we tried to use a transform which was only appropriate for xor operators when addition was present. This fixes PR26407. llvm-svn: 259375	2016-02-01 17:37:56 +00:00
Jun Bum Lim	b95afc3d46	[ValueTracking] Improve isKnownNonZero for PHI of non-zero constants It is clear that a PHI is a non-zero if all incoming values are non-zero constants. llvm-svn: 259370	2016-02-01 17:03:07 +00:00
Sanjay Patel	834c52c879	[InstCombine] simplify masked load intrinsics with all ones or zeros masks A masked load with a zero mask means there's no load. A masked load with an allOnes mask means it's a normal vector load. Differential Revision: http://reviews.llvm.org/D16691 llvm-svn: 259369	2016-02-01 17:00:10 +00:00
Geoff Berry	51dcbb04da	[PrologEpilogInserter] Add some debug output for callee-save frame object allocation Reviewers: mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16733 llvm-svn: 259367	2016-02-01 16:47:51 +00:00
Geoff Berry	eacbf522af	[AArch64] Simplify callee-save register save/restore. NFC. Summary: Simplify callee-save register save/restore code generation by remembering the size of the callee-save area when it is computed so we don't have to scan the prologue/epilogue instructions again later to reconstruct it. This is intended to simplify follow-on changes that reduce the number of registers saved/restored. Reviewers: mcrosier, jmolloy, t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16732 llvm-svn: 259365	2016-02-01 16:29:19 +00:00
Matthew Simpson	7c41d95a48	[LV] Rename RdxPHIsToFix to PHIsToFix (NFC) In the future, we will vectorize recurrences other than reductions. This patch renames a few variables and updates their associated comments to enable them to be reused for non-reduction PHI nodes. This change was requested in the review for D16197. llvm-svn: 259364	2016-02-01 16:07:01 +00:00
Asaf Badouh	7d5bdf84bb	[X86][AVX512VBMI] add encoding and intrinsics for Multishift Differential Revision: http://reviews.llvm.org/D16399 llvm-svn: 259363	2016-02-01 15:48:21 +00:00
Vasileios Kalintiris	2d209abe7a	[mips] Split large test file into 3 smaller ones. Remove the old select.ll file and use select-int.ll, select-flt.ll, select-dbl.ll for testing selects on integers, floats & doubles respectively. llvm-svn: 259361	2016-02-01 15:19:35 +00:00
Daniel Sanders	878dadf925	[mips] Range check uimm16 and fix several bugs this revealed. Summary: The bugs were: * teq and similar take 4-bit unsigned immediates on microMIPS. * teqi and similar have side-effects like teq do. * shll_s.w and shra_r.w take 5-bit unsigned immediates. * The various DSP ext* instructions take a 5-bit immediate. * repl.qh takes an 8-bit unsigned immediate. * repl.ph takes a 10-bit unsigned immediate. * rddsp/wrdsp take a 10-bit unsigned immediate. * teqi and similar take signed 16-bit immediates (10-bit for microMIPS). * Out-of-range immediate macros for or/xor take a simm32/simm64 depending on architecture. I'll fix the simm64 case properly when I reach simm32. lui is a bit more lenient than GAS and accepts signed immediates in addition to unsigned. This is because MipsMCExpr can produce signed values when constant folding and it currently lacks a way of knowing it should fold to an unsigned value. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D15446 llvm-svn: 259360	2016-02-01 15:13:31 +00:00
Amjad Aboud	73fb50fd3e	Improved macro emission in dwarf. Changed emitting offset of macinfo entry into compiler unit DIE to use "addSectionLabel" method rather than explicitly calculating size/offset of macro entry. Differential Revision: http://reviews.llvm.org/D16292 llvm-svn: 259358	2016-02-01 14:09:41 +00:00
Matthew Simpson	e1825f3030	Reapply commit r258404 with fix. The previous patch caused PR26364. The fix is to ensure that we don't enter a cycle when iterating over use-def chains. llvm-svn: 259357	2016-02-01 13:38:29 +00:00
JF Bastien	56abfb8d21	WebAssembly NFC: simplify control flow This should now be easier to read. llvm-svn: 259349	2016-02-01 10:46:16 +00:00
Ewan Crawford	9b293cca97	DWARF RenderScript vendor extension Patch adds a DWARF language vendor extension for RenderScript. We are already using this identifier in LLDB with a hard coded value, so it's preferable to use a LLVM generated enum instead. The language is intended to be added to the next version of the standard. See http://www.dwarfstd.org/ShowIssue.php?issue=150331.1 Reviewers: dexonsmith, echristo Subscribers: probinson domipheus, srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D16409 llvm-svn: 259348	2016-02-01 10:39:24 +00:00
Igor Breger	15632eed43	AVX512: fix mask handling for gather/scatter/prefetch intrinsics. Differential Revision: http://reviews.llvm.org/D16755 llvm-svn: 259346	2016-02-01 09:57:15 +00:00
Simon Pilgrim	f62b33f8d4	[X86][SSE] Find source of the inserted element of INSERTPS Minor patch to trace back through target shuffles to the source of the inserted element in a (V)INSERTPS shuffle. Differential Revision: http://reviews.llvm.org/D16652 llvm-svn: 259343	2016-02-01 08:59:30 +00:00
Igor Breger	fa62fb9857	AVX512 : Fix SETCCE lowering for KNL 32 bit. Differential Revision: http://reviews.llvm.org/D16752 llvm-svn: 259342	2016-02-01 07:56:09 +00:00
Frederic Riss	b74de3916b	[dsymutil] Skip mach-o paired relocations Noticed while working on scattered relocations. I do not think these relocs can actually happen in the debug_info section, but if they happen the code would mishandle them. Explicitly skip them and warn if we encounter one. llvm-svn: 259341	2016-02-01 04:43:14 +00:00
David Majnemer	56b2a51bb8	[X86] Cleanup the WinEHState pass Remove unnecessary includes and class state. No functional change intended. llvm-svn: 259340	2016-02-01 04:28:59 +00:00
Frederic Riss	e4eb654c9f	[dsymutil] Support scattered relocs. Although it seems like clang will never emit scattered relocations in the debug information (at least I couldn't find a way), we have to support them for the benefit of other compilers. As clang doesn't generate them, the included testcase was produced from hacked up assembly. llvm-svn: 259339	2016-02-01 03:44:22 +00:00
David Majnemer	feb8745705	Revert r258580 and r258581. Those commits created an artificial edge from a cleanup to a synthesized catchswitch in order to get the MSVC personality routine to execute cleanups which don't cleanupret and are not wrapped by a catchswitch. This worked well enough but is not a complete solution in situations where the cleanup infinite loops. However, the real deal breaker behind this approach comes about from a degenerate case where the cleanup is post-dominated by unreachable and throws an exception. This ends poorly because the catchswitch will inadvertently catch the exception. Because of this we should go back to our previous behavior of not executing certain cleanups (identical behavior with the Itanium ABI implementation in clang, GCC and ICC). N.B. I think this could be salvaged by making the catchpad rethrow the exception and properly transforming throwing calls in the cleanup into invokes. llvm-svn: 259338	2016-02-01 03:29:38 +00:00
Craig Topper	2f7fb265dd	[TableGen] Store result of getInstructionsByEnumValue in an ArrayRef instead of accidentally copying to a vector. llvm-svn: 259336	2016-02-01 01:33:42 +00:00
Frederic Riss	1d92f7befd	[MCDwarf] Fix encoding of line tables with weird custom parameters With poorly chosen custom parameters, the line table encoding logic would sometimes end up generating a special opcode bigger than 255, which is wrong. The set of default parameters that LLVM uses isn't subject to this bug. When carefully choosing the line table parameters, it's impossible to fall into the corner case that this patch fixes. The standard however doesn't require that these parameters be carefully chosen. And even if it did, we shouldn't generate broken encoding. Add a unittest for this specific encoding bug, and while at it, create some unit tests for the encoding logic using different sets of parameters. llvm-svn: 259334	2016-01-31 22:06:35 +00:00
Craig Topper	548b47cf16	Remove utostr_32 as it has no uses anymore. llvm-svn: 259331	2016-01-31 20:00:26 +00:00
Craig Topper	0df6bdba52	Replace usages of llvm::utostr_32 with just llvm::utostr. While this is less efficient, it's unclear the few places that were using the _32 version were doing so for efficiency. llvm-svn: 259330	2016-01-31 20:00:24 +00:00
Craig Topper	9c91186b26	Merge utohex_buffer into utohexstr, its only caller. Also change utohexstr to use the std::string constructor that takes a start and end pointer. This saves a call to strlen. NFC llvm-svn: 259329	2016-01-31 20:00:22 +00:00
Sanjay Patel	4c5a2daa39	add helper function for minnum/maxnum ; NFC llvm-svn: 259326	2016-01-31 16:35:23 +00:00
Sanjay Patel	433350d1d4	use range-based for loop; NFC llvm-svn: 259325	2016-01-31 16:34:48 +00:00
Sanjay Patel	5b8745669b	fix formatting; NFC llvm-svn: 259324	2016-01-31 16:34:11 +00:00
Sanjay Patel	9cba4166c0	simplify; NFC llvm-svn: 259323	2016-01-31 16:33:33 +00:00
Sanjay Patel	2921b47c54	clean up; NFC function names, comments, formatting, typos llvm-svn: 259322	2016-01-31 16:32:23 +00:00
JF Bastien	adbc41abb9	WebAssembly: more failures are gone llvm-svn: 259321	2016-01-31 08:19:40 +00:00
JF Bastien	3b85804577	WebAssembly: update expected failures r259305 fixed a few assertions around FrameIndex, and I forgot to update these failures despite having run the torture tests. llvm-svn: 259320	2016-01-31 08:05:05 +00:00
Frederic Riss	fa5f95611b	[dsymutil] Fix FileCheck command. Damn case-insensitive filesystem... llvm-svn: 259319	2016-01-31 04:39:16 +00:00
Frederic Riss	f067530e7a	[dsymutil] Fix handling of common symbols. llvm-dsymutil was misinterpreting the value of common symbols as their address when it actually contains their size. This didn't impact llvm-dsymutil's ability to link the debug information for common symbols because these are always found by name and not by address. Things could however go wrong when the size of a common object matched the object file address of another symbol. Depending on the link order of the symbols the common object might incorrectly evict this other object from the address to symbol mapping, and then link the evicted symbol with a wrong binary address. Use the new ability to have symbols without an object file address to fix this. llvm-svn: 259318	2016-01-31 04:29:34 +00:00
Frederic Riss	b7d5b18cde	[dsymutil] Allow debug map mappings with no object file address. NFC This change just changes the data structure that ties symbol names, object file address and linked binary addresses to accept mappings with no object file address. Such symbol mappings are not fed into the debug map yet, so this patch is NFC. A subsequent patch will make use of this functionality for common symbols. llvm-svn: 259317	2016-01-31 04:29:22 +00:00
Tim Shen	85566f8175	[SelectionDAG] Eliminate exponential behavior in WalkChainUsers llvm-svn: 259315	2016-01-31 03:59:34 +00:00
Craig Topper	755daee3dd	No need to use utostr/utohexstr when writing into a raw_ostream. NFC llvm-svn: 259314	2016-01-31 01:55:15 +00:00
Craig Topper	d406f34e77	Shrink character buffer size in raw_ostream::write_hex to 16 characters instead of 20 as that's the largest string a 64-bit hex value can be. llvm-svn: 259313	2016-01-31 01:12:38 +00:00
Craig Topper	10624d8809	Use std::end instead of repeating buffer sizes. llvm-svn: 259312	2016-01-31 01:12:35 +00:00
Craig Topper	ca667eb8a2	Convert int to Twine instead of using utostr since it was already being added to a Twine. NFC llvm-svn: 259308	2016-01-31 00:15:35 +00:00
Jingyue Wu	8437a2db93	[doc] improve the doc for CUDA 1. Mentioned that CUDA support works best with trunk. 2. Simplified the example by removing its dependency on the CUDA samples. 3. Explain the --cuda-gpu-arch flag. llvm-svn: 259307	2016-01-30 23:48:47 +00:00
Derek Schuff	2f77371cea	[WebAssembly] Fix uses of FrameIndex as store values Previously the code assumed all uses of FI on loads and stores were as addresses. This checks whether the use is the address or a value and handles the latter case as it does for non-memory instructions. llvm-svn: 259306	2016-01-30 21:43:08 +00:00
JF Bastien	d89bb7340c	WebAssembly: don't optimize frameindex store The previous code was incorrect (can't getReg a frameindex). We could instead optimize it to reduce tree height, but I'm not sure that's worthwhile yet because we then try to eliminate the frameindex. This patch also fixes frame index elimination for operations which may load or store: it used to assume the base was operand 2 and immediate offset operand 1. That's not true for stores, where they're 4 and 3. llvm-svn: 259305	2016-01-30 14:11:26 +00:00
JF Bastien	2125e2f3c7	WebAssembly NFC: fix build warning WebAssemblyFrameLowering.cpp:158:44: warning: enumeral and non-enumeral type in conditional expression [enabled by default] llvm-svn: 259303	2016-01-30 11:19:26 +00:00
Gerolf Hoflehner	9e6bf2b7f2	[BasicAA] NFC - revised comment for function adjustToPointerSize() llvm-svn: 259300	2016-01-30 05:58:38 +00:00
Gerolf Hoflehner	48b45da969	[BasicAA] Fix for missing must alias (D16343) llvm-svn: 259299	2016-01-30 05:52:53 +00:00
Gerolf Hoflehner	b4582b54dd	[BasicAA] Update on r259290 - added missing cast llvm-svn: 259298	2016-01-30 05:35:09 +00:00
Matt Arsenault	2699008644	AMDGPU: Fix emitting invalid workitem intrinsics for HSA The AMDGPUPromoteAlloca pass was emitting the read.local.size calls, which with HSA was incorrectly selected to reading from the offset mesa uses off of the kernarg pointer. Error on intrinsics which aren't supported by HSA, and start emitting the correct IR to read the workgroup size out of the dispatch pointer. Also initialize the pass so it can be tested with opt, and start moving towards not depending on the subtarget as an argument. Start emitting errors for the intrinsics not handled with HSA. llvm-svn: 259297	2016-01-30 05:19:45 +00:00
Matt Arsenault	bcaaea3448	AMDGPU: Stop checking intrinsics not used by HSA for dispatch-ptr Only the dispatch.ptr intrinsic is supposed to be used now to get the workgroup size, and the read.local.size intrinsics do not work correctly. llvm-svn: 259296	2016-01-30 05:10:59 +00:00
Matt Arsenault	d68c0fe0a6	InstCombine: fabs(x) * fabs(x) -> x * x llvm-svn: 259295	2016-01-30 05:02:00 +00:00
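
The last entry above (d68c0fe0a6) folds fabs(x) * fabs(x) into x * x. The fold is safe because the sign of an IEEE-754 product depends only on whether the operand signs differ, so squaring a value gives the same result with or without the fabs calls. Below is a minimal Python sketch that spot-checks this on a few doubles. It is not taken from the commit or its tests; the helper names (`bits`, `same_result`, `fold_agrees`) and the sample values are hypothetical, chosen only for illustration.

```python
import math
import struct


def bits(value: float) -> int:
    """Return the raw 64-bit pattern of a double, so +0.0 and -0.0 differ."""
    return struct.unpack("<Q", struct.pack("<d", value))[0]


def same_result(a: float, b: float) -> bool:
    """Compare two doubles bit for bit, treating any NaN as equal to any NaN."""
    if math.isnan(a) or math.isnan(b):
        return math.isnan(a) and math.isnan(b)
    return bits(a) == bits(b)


def fold_agrees(x: float) -> bool:
    """Check that fabs(x) * fabs(x) and x * x give the same double."""
    return same_result(math.fabs(x) * math.fabs(x), x * x)


# Hypothetical sample inputs: signed zeros, ordinary values, subnormals,
# values whose square overflows to infinity, infinities, and NaN.
SAMPLES = [
    0.0, -0.0, 1.5, -1.5,
    1e-320, -1e-320,
    1e200, -1e200,
    math.inf, -math.inf,
    math.nan,
]

if __name__ == "__main__":
    for x in SAMPLES:
        print(x, fold_agrees(x))
```

A spot check like this only illustrates the identity; it says nothing about how InstCombine implements or restricts the fold.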


127228 Commits