llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Kostya Serebryany	48444a7a27	[libFuzzer] better algorithm for -minimize_crash llvm-svn: 284299	2016-10-15 01:00:24 +00:00
Tom Stellard	eeebdabf5d	AMDGPU/SI: Handle s_getreg hazard in GCNHazardRecognizer Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25526 llvm-svn: 284298	2016-10-15 00:58:14 +00:00
Justin Bogner	5d3507866f	ADT: Use LLVM_NODISCARD instead of LLVM_ATTRIBUTE_UNUSED_RESULT for APInt Instead of annotating (most of) the APInt API, we can just annotate the type directly. This is less code and it will warn in more cases. llvm-svn: 284297	2016-10-15 00:22:06 +00:00
Evgeny Astigeevich	13c7ac9fcc	[NFC] Loop Versioning for LICM code clean up - Removed unused class members. - Made class internal data private. - Made class scoped data function scoped where it's possible. - Replace naked new/delete with unique_ptr. - Made resources guaranteed to be freed. Differential Revision: https://reviews.llvm.org/D25464 llvm-svn: 284290	2016-10-14 23:00:36 +00:00
Tim Northover	dc91ae935f	GlobalISel: rename legalizer components to match others. The previous names were both misleading (the MachineLegalizer actually contained the info tables) and inconsistent with the selector & translator (in having a "Machine") prefix. This should make everything sensible again. The only functional change is the name of a couple of command-line options. llvm-svn: 284287	2016-10-14 22:18:18 +00:00
Justin Bogner	c292d8927c	Support: Add LLVM_NODISCARD with C++17's [[nodiscard]] semantics This is essentially a more powerful version of our current LLVM_ATTRIBUTE_UNUSED_RESULT, in that it can also be applied to types and generate warnings whenever an object of that type is returned by value and the value is discarded. I'll replace uses of LLVM_ATTRIBUTE_UNUSED_RESULT and remove that macro in follow up commits. llvm-svn: 284286	2016-10-14 22:04:17 +00:00
Mehdi Amini	829f79de2d	hardware_physical_concurrency() should return 1 when LLVM is built with LLVM_ENABLE_THREADS=OFF llvm-svn: 284283	2016-10-14 21:32:35 +00:00
Tim Northover	21105681d4	PowerPC: specify full triple to avoid different Darwin asm syntax. llvm-svn: 284281	2016-10-14 21:25:29 +00:00
Sanjay Patel	55f9ef6ad9	[ARM] add tests for PR30660 llvm-svn: 284280	2016-10-14 20:52:43 +00:00
Sanjay Patel	a8a51aac31	[PowerPC] add tests for PR30661 llvm-svn: 284279	2016-10-14 20:51:41 +00:00
Guozhi Wei	19888a3d98	[PPC] Shorter sequence to load 64bit constant with same hi/lo words This is a patch to implement pr30640. When a 64bit constant has the same hi/lo words, we can use rldimi to copy the low word into high word of the same register. This optimization caused failure of test case bperm.ll because of not optimal heuristic in function SelectAndParts64. It chooses AND or ROTATE to extract bit groups from a register, and OR them together. This optimization lowers the cost of loading 64bit constant mask used in AND method, and causes different code sequence. But actually ROTATE method is better in this test case. The reason is in ROTATE method the final OR operation can be avoided since rldimi can insert the rotated bits into target register directly. So this patch also enhances SelectAndParts64 to prefer ROTATE method when the two methods have same cost and there are multiple bit groups need to be ORed together. Differential Revision: https://reviews.llvm.org/D25521 llvm-svn: 284276	2016-10-14 20:41:50 +00:00
Kostya Serebryany	f9c10b3ff9	[libFuzzer] remove subdir fuzzer-test-suite as it is now superseded with https://github.com/google/fuzzer-test-suite llvm-svn: 284275	2016-10-14 20:26:40 +00:00
Kostya Serebryany	c7f377f70d	[libFuzzer] add -trace_cmp=1 (guiding mutations based on the observed CMP instructions). This is a reincarnation of the previously deleted -use_traces, but using a different approach for collecting traces. Still a toy, but at least it scales well. Also fix -merge in trace-pc-guard mode llvm-svn: 284273	2016-10-14 20:20:33 +00:00
Saleem Abdulrasool	6db78758e9	vim: add `norecurse` attribute Add missing attribute to the keyword set. llvm-svn: 284270	2016-10-14 19:48:34 +00:00
Saleem Abdulrasool	3fe8f442d0	vim: add `comdat` keyword The attribute may be applied to a function. Highlight it as a keyword. llvm-svn: 284269	2016-10-14 19:48:31 +00:00
Sanjay Patel	6ea6210357	[DAG] avoid creating illegal node when transforming negated shifted sign bit Eli noted this potential bug in the post-commit thread for: https://reviews.llvm.org/rL284239 ...but I'm not sure how to trigger it, so there's no test case yet. llvm-svn: 284268	2016-10-14 19:46:31 +00:00
Tom Stellard	ac33187376	AMDGPU/SI: Use new SimplifyDemandedBits helper for multi-use operations Summary: We are using this helper for our 24-bit arithmetic combines, so we are now able to eliminate multi-use operations that mask the high-bits of 24-bit inputs (e.g. and x, 0xffffff) Reviewers: arsenm, nhaehnle Subscribers: tony-tye, arsenm, kzhuravl, wdng, nhaehnle, llvm-commits, yaxunl Differential Revision: https://reviews.llvm.org/D24672 llvm-svn: 284267	2016-10-14 19:14:29 +00:00
Tom Stellard	0eccb3e2ea	TargetLowering: Add SimplifyDemandedBits() helper to TargetLoweringOpt Summary: The main purpose of this new helper is to enable simplifying operations that have multiple uses. SimplifyDemandedBits does not handle multiple uses currently, and this new function makes it possible to optimize: and v1, v0, 0xffffff mul24 v2, v1, v1 ; Multiply ignoring high 8-bits. To: mul24 v2, v0, v0 Where before this would not be optimized, because v1 has multiple uses. Reviewers: bogner, arsenm Subscribers: nhaehnle, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D24964 llvm-svn: 284266	2016-10-14 19:14:26 +00:00
Krzysztof Parzyszek	7e79b59e6f	The real fix for post-r284255 failures llvm-svn: 284264	2016-10-14 19:06:25 +00:00
Krzysztof Parzyszek	8ac6d87b06	Workaround to eliminate check-llvm failures after r284255 llvm-svn: 284262	2016-10-14 18:36:42 +00:00
David L Kreitzer	1233356719	Add a pass to optimize patterns of vectorized interleaved memory accesses for X86. The pass optimizes as a unit the entire wide load + shuffles pattern produced by interleaved vectorization. This initial patch optimizes one pattern (64-bit elements interleaved by a factor of 4). Future patches will generalize to additional patterns. Patch by Farhana Aleen Differential revision: http://reviews.llvm.org/D24681 llvm-svn: 284260	2016-10-14 18:20:41 +00:00
Tom Stellard	19c9255423	AMDGPU/SI: Don't allow unaligned scratch access Summary: The hardware doesn't support this. Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25523 llvm-svn: 284257	2016-10-14 18:10:39 +00:00
Krzysztof Parzyszek	ad19e572fb	[RDF] Switch RegisterRef to be a pair (Register, LaneMask) Use PackedRegisterRef to store the register information in the graph nodes. This commit also removes support for virtual registers. It has never been tested or used. It will be possible to add it back if there is a need. llvm-svn: 284255	2016-10-14 17:57:55 +00:00
David L Kreitzer	d68b19752a	[safestack] Use non-thread-local unsafe stack pointer for Contiki OS Patch by Michael LeMay Differential revision: http://reviews.llvm.org/D19852 llvm-svn: 284254	2016-10-14 17:56:00 +00:00
Eric Christopher	259a7b3aba	Revert "In preparation for removing getNameWithPrefix off of TargetMachine," as it's causing sanitizer/memory issues until I can track down this set. This reverts commit r284203 llvm-svn: 284252	2016-10-14 17:28:23 +00:00
Vedant Kumar	163ed3d9fd	[Coverage] Support loading multiple binaries into a CoverageMapping Add support for loading multiple coverage readers into a single CoverageMapping instance. This should make it easier to prepare a unified coverage report for multiple binaries. Differential Revision: https://reviews.llvm.org/D25535 llvm-svn: 284251	2016-10-14 17:16:53 +00:00
Rafael Espindola	f3f7f4bc54	Move alignTo computation inside the if. This is an improvement when compiling with llvm. llvm doesn't inline the call to insert, so the align is always executed and shows up in the profile. With gcc the call to insert is inlined and the align computation moved and done only if needed. With this patch we explicitly only compute it if it is needed. In the two tests with debug info, the speedup was scylla master 3.008959365 patch 2.932080942 1.02621974786x faster firefox master 6.709823604 patch 6.592387227 1.01781393795x faster In all others the difference was in the noise. llvm-svn: 284249	2016-10-14 17:01:39 +00:00
Pierre Gousseau	1c56ea6dd5	[X86] Take advantage of the lzcnt instruction on btver2 architectures when ORing comparisons to zero. This change adds transformations such as: zext(or(setcc(eq, (cmp x, 0)), setcc(eq, (cmp y, 0)))) To: srl(or(ctlz(x), ctlz(y)), log2(bitsize(x)) This optimisation is beneficial on Jaguar architecture only, where lzcnt has a good reciprocal throughput. Other architectures such as Intel's Haswell/Broadwell or AMD's Bulldozer/PileDriver do not benefit from it. For this reason the change also adds a "HasFastLZCNT" feature which gets enabled for Jaguar. Differential Revision: https://reviews.llvm.org/D23446 llvm-svn: 284248	2016-10-14 16:41:38 +00:00
Sanjay Patel	4538ac9f97	[InstCombine] use m_APInt to allow sub with constant folds for splat vectors llvm-svn: 284247	2016-10-14 16:31:54 +00:00
Mehdi Amini	23c66e03b1	[docs] Update some obsolete information in BitCodeFormat docs. Summary: * Describe new (3.3) parameter attribute group encoding, leaving old encoding there with a note about legacy * Bring TYPE_BLOCK docs up to date * Remove docs about obsolete (pre 3.0) TYPE_SYMTAB_BLOCK, TST_CODE_ENTRY * Fix a couple of incorrect comments and remove one unused enum definition along the way This addresses https://llvm.org/bugs/show_bug.cgi?id=28941. Patch by: Ismail Badawi <ibadawi@cisco.com> Differential Revision: https://reviews.llvm.org/D25623 llvm-svn: 284246	2016-10-14 16:23:09 +00:00
Sanjay Patel	037f524f36	[InstCombine] add tests for missing vector folds llvm-svn: 284245	2016-10-14 15:55:34 +00:00
Sanjay Patel	8c7be2e26d	[InstCombine] auto-generate checks llvm-svn: 284244	2016-10-14 15:41:25 +00:00
Sanjay Patel	b1fc9d9f65	[InstCombine] remove redundant test This test was apparently checking for 2 independent folds, but we have plenty of tests for those individual folds already. We are lacking vector tests, however, because we don't have the shift folds for vectors. llvm-svn: 284243	2016-10-14 15:36:28 +00:00
Sanjay Patel	e5bc6982b9	[InstCombine] update test to use FileCheck and auto-generate checks llvm-svn: 284242	2016-10-14 15:30:31 +00:00
Sanjay Patel	8024bbbb08	[InstCombine] sub X, sext(bool Y) -> add X, zext(bool Y) Prefer add/zext because they are better supported in terms of value-tracking. Note that the backend should be prepared for this IR canonicalization (including vector types) after: https://reviews.llvm.org/rL284015 Differential Revision: https://reviews.llvm.org/D25135 llvm-svn: 284241	2016-10-14 15:24:31 +00:00
David L Kreitzer	96ecdb67e0	Define "contiki" OS specifier. Patch by Michael LeMay Differential revision: http://reviews.llvm.org/D24897 llvm-svn: 284240	2016-10-14 14:41:46 +00:00
Sanjay Patel	54524ed76a	[DAG] add folds for negated shifted sign bit The same folds exist in InstCombine already. This came up as part of: https://reviews.llvm.org/D25485 llvm-svn: 284239	2016-10-14 14:26:47 +00:00
Sanjay Patel	12d99d69b4	[x86] add tests to show missing folds for negated shifted sign bit llvm-svn: 284238	2016-10-14 14:14:40 +00:00
Nicolai Haehnle	5c5844a79f	AMDGPU: Select 64-bit {ADD,SUB}{C,E} nodes Summary: This will be used for 64-bit MULHU, which is in turn used for the 64-bit divide-by-constant optimization (see D24822). Reviewers: arsenm, tstellarAMD Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25289 llvm-svn: 284224	2016-10-14 10:30:00 +00:00
Diana Picus	e2ef35cd54	[GlobalISel] Get the AArch64 tests to work on Linux Mostly this just means changing the triple from aarch64-apple-ios to the generic aarch64--. Only one test needs more significant changes, but GlobalISel already does the right thing so it's ok to just change the checks. Differential Revision: https://reviews.llvm.org/D25532 llvm-svn: 284223	2016-10-14 10:19:40 +00:00
Nicolai Haehnle	0d87d9745d	Fix use-after-frees Extracted from D25313, as suggested by Justin Bogner. llvm-svn: 284220	2016-10-14 09:49:51 +00:00
Simon Dardis	7bd8310ae6	[mips] Fix aui/daui/dahi/dati for MIPSR6 For compatibility with binutils, define these instructions to take two registers with a 16bit unsigned immediate. Both of the registers have to be same for dahi and dati. Reviewers: dsanders, zoran.jovanovic Differential Review: https://reviews.llvm.org/D21473 llvm-svn: 284218	2016-10-14 09:31:42 +00:00
Nicolai Haehnle	1c18014661	AMDGPU: Fix use-after-frees Reviewers: arsenm, tstellarAMD Subscribers: kzhuravl, wdng, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D25312 llvm-svn: 284215	2016-10-14 09:03:04 +00:00
Michael Zuckerman	6441a0b461	[x86][ms-inline-asm] use of "jmp short" in asm is not supported Committing in the name of Ziv Izhar: After check-all and LGTM. The following patch is for compatibility with Microsoft. Microsoft ignores the keyword "short" when used after a jmp, for example: __asm { jmp short label label: } A test for that patch will be added in another patch, since it's located in clang's codegen tests. Link will be added shortly. link to test: https://reviews.llvm.org/D24958 Differential Revision: https://reviews.llvm.org/D24957 llvm-svn: 284211	2016-10-14 08:09:40 +00:00
Craig Topper	938c990f79	[DAGCombiner] Teach createBuildVecShuffle to handle cases where input vectors are less than half of the output vector size. This will be needed by a future commit to support sign/zero extending from v8i8 to v8i64 which requires a sign/zero_extend_vector_inreg to be created which requires v8i8 to be concatenated up to v64i8 and goes through this code. llvm-svn: 284204	2016-10-14 06:00:42 +00:00
Eric Christopher	bf50905153	In preparation for removing getNameWithPrefix off of TargetMachine, sink the current behavior into the callers and sink TargetMachine::getNameWithPrefix into TargetMachine::getSymbol. llvm-svn: 284203	2016-10-14 05:47:41 +00:00
Eric Christopher	403df02984	Tidy the calls to getCurrentSection().first -> getCurrentSectionOnly to help readability a bit. llvm-svn: 284202	2016-10-14 05:47:37 +00:00
Eric Christopher	4274c0be4b	Tidy up example of getting the pointer size. llvm-svn: 284201	2016-10-14 05:45:46 +00:00
Konstantin Zhuravlyov	b970547eb1	[AMDGPU] Emit 32-bit lo/hi got and pc relative variant kinds for external and global address space variables Differential Revision: https://reviews.llvm.org/D25562 llvm-svn: 284196	2016-10-14 04:37:34 +00:00
Konstantin Zhuravlyov	8c3f44a8af	[AMDGPU] Add 32-bit lo/hi got and pc relative variant kinds and emit appropriate relocations Differential Revision: https://reviews.llvm.org/D25548 llvm-svn: 284195	2016-10-14 04:21:32 +00:00
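
Several commits in this range describe bit-level rewrites: the lzcnt transformation in 1c56ea6dd5, the sub/sext to add/zext canonicalization in 8024bbbb08, and the mul24 multi-use case in 0eccb3e2ea. The sketch below is not LLVM code. It is a small Python model, assuming 32-bit operands, that checks the identities those messages state on a handful of sample values. All function names are hypothetical, and mul24 is modeled (as an assumption) as a multiply of the low 24 bits of each operand, truncated to 32 bits.

```python
MASK32 = 0xFFFFFFFF  # assumption: model everything at a 32-bit width


def ctlz32(x):
    """Count leading zeros of a 32-bit value; yields 32 when x is zero."""
    return 32 - (x & MASK32).bit_length()


def either_zero_setcc(x, y):
    # zext(or(setcc(eq, (cmp x, 0)), setcc(eq, (cmp y, 0))))
    return int((x & MASK32) == 0 or (y & MASK32) == 0)


def either_zero_lzcnt(x, y):
    # srl(or(ctlz(x), ctlz(y)), log2(bitsize(x))) with bitsize 32, so shift by 5.
    # ctlz reaches 32 only for a zero input, so bit 5 flags "operand is zero".
    return (ctlz32(x) | ctlz32(y)) >> 5


def sub_sext_bool(x, y):
    # sub X, sext(bool Y): sign-extending a true i1 gives all ones, i.e. -1.
    return (x - (-1 if y else 0)) & MASK32


def add_zext_bool(x, y):
    # add X, zext(bool Y): zero-extending a true i1 gives 1.
    return (x + (1 if y else 0)) & MASK32


def mul24(a, b):
    # Hypothetical model of a 24-bit multiply that ignores the high 8 bits.
    return ((a & 0xFFFFFF) * (b & 0xFFFFFF)) & MASK32


SAMPLES = [0, 1, 2, 0x00FFFFFF, 0x01000000, 0x12345678,
           0x7FFFFFFF, 0x80000000, 0xFFFFFFFF]


def check_all():
    """Return True if every identity holds on all sample pairs."""
    for x in SAMPLES:
        # Masking the input first does not change the mul24 result.
        masked = x & 0xFFFFFF
        if mul24(masked, masked) != mul24(x, x):
            return False
        for flag in (False, True):
            if sub_sext_bool(x, flag) != add_zext_bool(x, flag):
                return False
        for y in SAMPLES:
            if either_zero_setcc(x, y) != either_zero_lzcnt(x, y):
                return False
    return True


if __name__ == "__main__":
    print("all identities hold:", check_all())
```

A spot check on sample values is not a proof. It only illustrates why each rewrite preserves the result at this width.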


139539 Commits