llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Michael Zolotukhin	20e33533a3	[SCEV] Compute affine range in another way to avoid bitwidth extending. Summary: This approach has two major advantages over the existing one: 1. We don't need to extend bitwidth in our computations. Extending bitwidth is a big issue for compile time as we often end up working with APInts wider than 64bit, which is a slow case for APInt. 2. When we zero extend a wrapped range, we lose some information (we replace the range with [0, 1 << src bit width)). Thus, avoiding such extensions better preserves information. Correctness testing: I ran 'ninja check' with assertions that the new implementation of getRangeForAffineAR gives the same results as the old one (this functionality is not present in this patch). There were several failures - I inspected them manually and found out that they all are caused by the fact that we're returning more accurate results now (see bullet (2) above). Without such assertions 'ninja check' works just fine, as well as SPEC2006. Compile time testing: CTMark/Os: - mafft/pairlocalalign -16.98% - tramp3d-v4/tramp3d-v4 -12.72% - lencod/lencod -11.51% - Bullet/bullet -4.36% - ClamAV/clamscan -3.66% - 7zip/7zip-benchmark -3.19% - sqlite3/sqlite3 -2.95% - SPASS/SPASS -2.74% - Average -5.81% Performance testing: The changes are expected to be neutral for runtime performance. Reviewers: sanjoy, atrick, pete Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30477 llvm-svn: 297992	2017-03-16 21:07:38 +00:00
Derek Schuff	678c4acd7c	[WebAssembly] Fix some broken type encodings in wasm binary A recent change switched the in-memory wasm value types to be signed integers, but I missed a few cases where these were being written to the binary. Differential Revision: https://reviews.llvm.org/D31014 Patch by Sam Clegg llvm-svn: 297991	2017-03-16 20:49:48 +00:00
Zachary Turner	82a118f453	Silence -Wcovered-switch-default warning. llvm-svn: 297990	2017-03-16 20:45:11 +00:00
Sanjay Patel	7f15f4047b	[InstCombine] avoid breaking up bitcasted vector min/max patterns (PR32306) As the related tests show, we're not canonicalizing to this form for scalars or vectors yet, but this solves the immediate problem in: https://bugs.llvm.org/show_bug.cgi?id=32306 llvm-svn: 297989	2017-03-16 20:42:45 +00:00
Jonathan Roelofs	77b3b41bd4	Clarify a comment about mapped_iterator. NFC llvm-svn: 297988	2017-03-16 20:37:59 +00:00
Peter Collingbourne	9e27a1de1f	ArchiveWriter: Remove unused variables. NFC. llvm-svn: 297987	2017-03-16 20:35:07 +00:00
Sanjay Patel	9c0f4dd8de	[InstCombine] add tests for PR32306 and missed min/max canonicalization; NFC llvm-svn: 297986	2017-03-16 20:31:38 +00:00
Reid Kleckner	2e55d5b779	Create msbuild only when using MSVC Summary: I could be wrong but it seems to have no use for MinGW. Related diff: https://reviews.llvm.org/D29772 Reviewers: chandlerc, rnk Reviewed By: rnk Patch by: Mateusz Mikuła Subscribers: rnk, llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D29952 llvm-svn: 297985	2017-03-16 20:24:14 +00:00
Zachary Turner	707767bc86	[PDB] Add support for parsing Flags from PDB Stream. This was discovered when running `llvm-pdbdump diff` against two files, the second of which was generated by running the first one through pdb2yaml and then yaml2pdb. The second one was missing some bytes from the PDB Stream, and tracking this down showed that at the end of the PDB Stream were some additional bytes that we were ignoring. Looking back to the reference code, these seem to specify some additional flags that indicate whether the PDB supports various optional features. This patch adds support for reading, writing, and round-tripping these flags through YAML and the raw dumper, and updates the tests accordingly. llvm-svn: 297984	2017-03-16 20:19:11 +00:00
Zachary Turner	2830b2b9fe	[llvm-pdbdump] Add support for diffing the PDB Stream. In doing so I discovered that we completely ignore some bytes of the PDB Stream after we "finish" loading it. These bytes seem to specify some additional information about what kind of data is present in the PDB. A subsequent patch will add code to read in those fields and store their values. llvm-svn: 297983	2017-03-16 20:18:41 +00:00
Adrian Prantl	ac2a0e4e02	Revert commit r297971 because of issues reported by msan. llvm-svn: 297982	2017-03-16 20:11:54 +00:00
Matthias Braun	df0cc24783	TargetInstrInfo: Provide default implementation of isTailCall(). In fact this default implementation should be the only implementation; keep it virtual for now to accommodate targets that don't model flags correctly. Differential Revision: https://reviews.llvm.org/D30747 llvm-svn: 297980	2017-03-16 20:02:30 +00:00
Daniel Sanders	5e338819c0	[globalisel] Correct one more simple immediate that should be a ConstantInt. llvm-svn: 297979	2017-03-16 19:59:19 +00:00
Craig Topper	36518a85a4	[AVX-512] Add tests for kandn, kor, kxor, and kxnor intrinsics. llvm-svn: 297978	2017-03-16 19:58:06 +00:00
Zachary Turner	cff39e0d5e	[llvm-pdbdump] clang-format Diff.cpp Looks like this file did not have clang-format run on it when its initial revision was committed. llvm-svn: 297977	2017-03-16 19:52:15 +00:00
Simon Pilgrim	0ae87b3299	Remove redundant conditions (PR31753). NFCI. llvm-svn: 297976	2017-03-16 19:52:00 +00:00
Adrian Prantl	cdac62a67f	Fix unused variable warnings. llvm-svn: 297973	2017-03-16 18:33:01 +00:00
Adrian Prantl	333e7f1840	Salvage debug info from instructions about to be deleted This patch improves debug info quality in InstCombine by looking at values that are about to be deleted, checking whether there are any dbg.value intrinsics referring to them, and potentially encoding the semantics of the deleted instruction into the dbg.value's DIExpression. In the example in the testcase (which was extracted from XNU) there is a sequence of %4 = load %struct.entry, %struct.entry* %next2, align 8, !dbg !41 %5 = bitcast %struct.entry* %4 to i8, !dbg !42 %add.ptr4 = getelementptr inbounds i8, i8 %5, i64 -8, !dbg !43 %6 = bitcast i8* %add.ptr4 to %struct.entry, !dbg !44 call void @llvm.dbg.value(metadata %struct.entry %6, i64 0, metadata !20, metadata !21), !dbg 34 When these instructions are eliminated by instcombine one after another, we can still salvage the otherwise dead debug info: - Bitcasts have no effect, so have the dbg.value point to operand(0) - Loads can be expressed via a DW_OP_deref - Constant gep instructions can be replaced by DWARF expression arithmetic The API introduced by this patch is not specific to instcombine and can be useful in other places, too. rdar://problem/30725338 Differential Revision: https://reviews.llvm.org/D30919 llvm-svn: 297971	2017-03-16 18:22:52 +00:00
Peter Collingbourne	cb13bea3ce	LTO: Create temporary cache files in the cache directory instead of $TMPDIR. This fixes a race condition where another linker process can observe a partially written file if we copy it from another file system, and allows the link to be independent of the amount of free disk space in $TMPDIR. Differential Revision: https://reviews.llvm.org/D31045 llvm-svn: 297970	2017-03-16 18:20:06 +00:00
Adrian Prantl	28df726508	Attempt to fix bot failure on Windows. Looks like this expression was accidentally using 32-bit arithmetic. llvm-svn: 297969	2017-03-16 18:06:04 +00:00
Daniel Sanders	11cc60599e	[globalisel] Correct G_CONSTANT path of selectArithImmed() Earlier stages of GlobalISel always use ConstantInt in G_CONSTANT so that's what we should check for. This fixes a crash introduced in r297782. llvm-svn: 297968	2017-03-16 18:04:50 +00:00
Adrian Prantl	e4152cc6a7	Rearrange fields. NFC. llvm-svn: 297967	2017-03-16 17:42:47 +00:00
Adrian Prantl	6400d47a67	Rename methods in DwarfExpression to adhere to the LLVM coding guidelines. NFC. llvm-svn: 297966	2017-03-16 17:42:45 +00:00
Adrian Prantl	018bf36c06	PR32288: More efficient encoding for DWARF expr subregister access. Citing http://bugs.llvm.org/show_bug.cgi?id=32288 The DWARF generated by LLVM includes this location: 0x55 0x93 0x04 DW_OP_reg5 DW_OP_piece(4) When GCC's DWARF is simply 0x55 (DW_OP_reg5) without the DW_OP_piece. I believe it's reasonable to assume the DWARF consumer knows which part of a register logically holds the value (low bytes, high bytes, how many bytes, etc) for a primitive value like an integer. This patch gets rid of the redundant DW_OP_piece when a subregister is at offset 0. It also adds previously missing subregister masking when a subregister is followed by another operation. (This reapplies r297960 with two additional testcase updates). rdar://problem/31069390 https://reviews.llvm.org/D31010 llvm-svn: 297965	2017-03-16 17:14:56 +00:00
Reid Kleckner	60f04f6ae0	[cmake] Refactor warning flag logic to use Unix warnings with clang-cl Summary: clang-cl understands the GCC-style -W[no-]foo flags, and for the most part ignores MSVC -wd flags. So, let's pass the curated set of warning flags we use on Unix on Windows. We can also stop passing /W4 -wd*, which for the most part corresponds to -Wall -Wextra with a bunch of flags that we mostly ignore. I had to disable -Wnon-virtual-dtor on Windows, because it fires on every COM class ever. I filed PR32286 to fix this. So far I've only found two instances of -Wstring-conversion in the WinASan code, which I'll fix. Other than that we seem clean. Reviewers: hans Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D30992 llvm-svn: 297964	2017-03-16 17:05:16 +00:00
Reid Kleckner	e96622fef1	[IR] Inline some Function accessors I checked that all of these out-of-line methods previously compiled to simple loads and bittests, so they are pretty good candidates for inlining. In particular, arg_size() and arg_empty() are popular and are just two loads, so they seem worth inlining. llvm-svn: 297963	2017-03-16 16:57:31 +00:00
Adrian Prantl	6390e091a8	Revert "PR32288: More efficient encoding for DWARF expr subregister access." This reverts commit 2bf453116889a576956892ea9683db4fcd96e30e while investigating buildbot breakage. llvm-svn: 297962	2017-03-16 16:38:22 +00:00
Adrian Prantl	2aad07acf0	PR32288: More efficient encoding for DWARF expr subregister access. Citing http://bugs.llvm.org/show_bug.cgi?id=32288 The DWARF generated by LLVM includes this location: 0x55 0x93 0x04 DW_OP_reg5 DW_OP_piece(4) When GCC's DWARF is simply 0x55 (DW_OP_reg5) without the DW_OP_piece. I believe it's reasonable to assume the DWARF consumer knows which part of a register logically holds the value (low bytes, high bytes, how many bytes, etc) for a primitive value like an integer. This patch gets rid of the redundant DW_OP_piece when a subregister is at offset 0. It also adds previously missing subregister masking when a subregister is followed by another operation. rdar://problem/31069390 https://reviews.llvm.org/D31010 llvm-svn: 297960	2017-03-16 16:34:14 +00:00
Hiroshi Inoue	eeea09cdad	Test commit. llvm-svn: 297959	2017-03-16 16:30:06 +00:00
Stanislav Mekhanoshin	e7e6d76e45	[AMDGPU] Run always inliner early in opt We can mark functions to always inline early in the opt. Since we do not have call support this early inlining creates opportunities for inter-procedural optimizations which would not occur otherwise. Differential Revision: https://reviews.llvm.org/D31016 llvm-svn: 297958	2017-03-16 16:11:46 +00:00
Simon Pilgrim	3ee3b0b700	[X86] Add PR22338 test case llvm-svn: 297957	2017-03-16 15:10:42 +00:00
Aditya Kumar	b2d9a58bf0	Fix: Refactor SimplifyCFG:canSinkInstructions [NFC] Differential Revision: https://reviews.llvm.org/D30116 llvm-svn: 297955	2017-03-16 14:09:18 +00:00
Jonas Paulsson	2ad0eeeb69	[BasicTTIImpl] Bugfix in getIntrinsicInstrCost() Don't call getScalarizationOverhead(RetTy, true, false) if RetTy is void type. Review: Hal Finkel https://reviews.llvm.org/D31024 llvm-svn: 297954	2017-03-16 14:05:34 +00:00
Bjorn Pettersson	89cbb4f80d	[InstCombine] Liberate assert in InstCombiner::visitZExt Summary: The call to canEvaluateZExtd in InstCombiner::visitZExt may return with BitsToClear == SrcTy->getScalarSizeInBits(), but there is an assert that BitsToClear should be smaller than SrcTy->getScalarSizeInBits(). I have a test case that triggers the assert, but it only happens for my downstream target. I've not been able to trigger it for any upstream target. The assert triggered for a piece of code such as this %shr1 = lshr i16 undef, 15 ... %shr2 = lshr i16 %shr1, 1 %conv = zext i16 %shr2 to i32 Normally the lshr instructions are constant folded before we visit the zext (that is why it is so hard to reproduce). The original pattern, before instcombine, is of course a lot more complicated in my test case. The shift count in the second lshr is for example determined by the outcome of a PHI instruction. It seems like other rewrites by instcombine lead up to the pattern above. And then the zext is pulled from the worklist, and visited (hitting the assert), before we detect that the lshr instructions can be constant folded. Anyway, since the canEvaluateZExtd may return with BitsToClear equal to SrcTy->getScalarSizeInBits(), and since the rewrite that converts the expression type to avoid a zero extend works also for the case where SrcBitsKept ends up being zero, then it should be OK to liberate the assert to assert(BitsToClear <= SrcTy->getScalarSizeInBits() && "Unreasonable BitsToClear"); Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: https://reviews.llvm.org/D30993 llvm-svn: 297952	2017-03-16 13:22:01 +00:00
Eric Liu	236e5336c2	Revert "[PGO] Value profile for size of memory intrinsic calls" This commit reverts r297897 and r297909. llvm-svn: 297951	2017-03-16 13:16:35 +00:00
James Henderson	dff2d6c589	[Support] Add support for getting file system permissions on Windows and implement sys::fs::set/getPermissions to work with them This change adds support for functions to set and get file permissions, in a similar manner to the C++17 permissions() function in <filesystem>. The setter uses chmod on Unix systems and SetFileAttributes on Windows, setting the permissions as passed in. The getter simply uses the existing status() function. Prior to this change, status() would always return an unknown value for the permissions on a Windows file, making it impossible to test the new function on Windows. I have therefore added support for this as well. On Linux, prior to this change, the permissions included the file type, which should actually be accessed via a different member of the file_status class. Note that on Windows, only the *_write permission bits have any effect - if any are set, the file is writable, and if not, the file is read-only. This is in common with what MSDN describes for their behaviour of std::filesystem::permissions(), and also what boost::filesystem does. The motivation behind this change is so that we can easily test behaviour on read-only files in LLVM unit tests, but I am sure that others may find it useful in some situations. Reviewers: zturner, amccarth, aaron.ballman Differential Revision: https://reviews.llvm.org/D30736 llvm-svn: 297945	2017-03-16 11:22:09 +00:00
Chandler Carruth	cbb9fdaf79	[PM/Inliner] Fix a bug in r297374 where we would leave stale calls in the work queue and crash when trying to visit them after deleting the function containing those calls. llvm-svn: 297940	2017-03-16 10:45:42 +00:00
Chandler Carruth	5dc13f0626	[PM/Inliner] Add a test case that encapsulates the core issue addressed in r297374. I've extracted a small version of this from the C++ metaprogram Richard came up with to exercise these kinds of issues and written comments to describe both how to reproduce a fresh version of the test case and what likely failure modes are. The test case is still a bit brittle as it depends on the particular inline cost modeling and SCC visitation order, but it definitely would have caught the bug right away when developing things so it seems a really valuable test case to have. llvm-svn: 297935	2017-03-16 10:13:55 +00:00
Oren Ben Simhon	43ec62fd2b	Fixing typos. llvm-svn: 297932	2017-03-16 08:15:52 +00:00
Jonas Paulsson	cbcaf13b31	[SelectionDAG] Optimize VSELECT->SETCC of incompatible or illegal types. Don't scalarize VSELECT->SETCC when operands/results needs to be widened, or when the type of the SETCC operands are different from those of the VSELECT. (VSELECT SETCC) and (VSELECT (AND/OR/XOR (SETCC,SETCC))) are handled. The previous splitting of VSELECT->SETCC in DAGCombiner::visitVSELECT() is no longer needed and has been removed. Updated tests: test/CodeGen/ARM/vuzp.ll test/CodeGen/NVPTX/f16x2-instructions.ll test/CodeGen/X86/2011-10-19-widen_vselect.ll test/CodeGen/X86/2011-10-21-widen-cmp.ll test/CodeGen/X86/psubus.ll test/CodeGen/X86/vselect-pcmp.ll Review: Eli Friedman, Simon Pilgrim https://reviews.llvm.org/D29489 llvm-svn: 297930	2017-03-16 07:17:12 +00:00
Tobias Grosser	ced11aa6c7	[ADCE] Remove redundant code [NFC] Summary: In commit r289548 ([ADCE] Add code to remove dead branches) a redundant loop nest was accidentally introduced, which implements exactly the same functionality as has already been available right after. This redundancy has been found when inspecting the ADCE code in the context of our recent discussions on post-dominator modeling. This redundant code was also eliminated by r296535 (which sparked the discussion), but only as part of a larger semantic change of the post-dominance modeling. As this redundancy in [ADCE] is really just an oversight completely independent of the post-dominance changes under discussion, we remove this redundancy independently. Reviewers: dberlin, david2050 Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31023 llvm-svn: 297929	2017-03-16 03:59:23 +00:00
Peter Collingbourne	166c32751d	Fix pessimising moves. llvm-svn: 297928	2017-03-16 03:54:38 +00:00
Peter Collingbourne	d7d9d85053	Support: Add a cache pruning policy parser. The idea is that the policy string fully specifies the policy and is portable between clients. Differential Revision: https://reviews.llvm.org/D31020 llvm-svn: 297927	2017-03-16 03:42:00 +00:00
Colin LeMahieu	904fab62c8	[Hexagon] Fixing test from 297920 to add Hexagon triple. llvm-svn: 297926	2017-03-16 01:52:10 +00:00
Kyle Butt	18b438a329	CodeGen: BlockPlacement: Reduce TriangleChainCount to 2 This produces a 1% speedup on an important internal Google benchmark (protocol buffers), with no other regressions in google or in the llvm test-suite. Only 5 targets in the entire llvm test-suite are affected, and on those 5 targets the size increase is 0.027% llvm-svn: 297925	2017-03-16 01:32:29 +00:00
David Blaikie	7c5f386f72	Check that line table entries don't spill over between sections even if they're empty at the end of a section llvm-svn: 297922	2017-03-16 00:52:18 +00:00
David Blaikie	4db99c4517	Simplify/make more explicit (by making less explicit in some ways) some function calls llvm-svn: 297921	2017-03-16 00:43:19 +00:00
Colin LeMahieu	4beb0106e7	[Hexagon] Updating inline saturate lanes for v62 version. llvm-svn: 297920	2017-03-16 00:35:28 +00:00
Kostya Serebryany	d2773b13e1	[libFuzzer] add a dummy LLVMFuzzerMutate to afl/afl_driver.cpp (https://github.com/google/oss-fuzz/issues/460 ) llvm-svn: 297919	2017-03-15 23:54:40 +00:00
Kostya Serebryany	6644fe3b16	[libFuzzer] call __sanitizer_print_memory_profile with two params llvm-svn: 297916	2017-03-15 23:27:53 +00:00


146320 Commits