llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 13:33:37 +02:00

Author	SHA1	Message	Date
Konstantin Zhuravlyov	079d4d7f9f	AMDGPU: Do not lower fast unsafe div for safe, f32, with fp32 denormals Differential Revision: https://reviews.llvm.org/D32085 llvm-svn: 301023	2017-04-21 19:25:33 +00:00
Sanjay Patel	dc32edcf06	[InstCombine] use isSubsetOf() for efficiency C \| ~D == -1 ~(C \| ~D) == 0 ~C & D == 0 D & ~C == 0 D.isSubsetOf(C) llvm-svn: 301021	2017-04-21 19:16:52 +00:00
Akira Hatanaka	6e48df725c	[AArch64] Improve code generation for logical instructions taking immediate operands. This commit adds an AArch64 dag-combine that optimizes code generation for logical instructions taking immediate operands. The optimization uses demanded bits to change a logical instruction's immediate operand so that the immediate can be folded into the immediate field of the instruction. This recommits r300932 and r300930, which was causing dag-combine to loop forever. The problem was that optimizeLogicalImm was returning true even when there was no change to the immediate node (which happened when the immediate was all zeros or ones), which caused dag-combine to push and pop the same node to the work list over and over again without making any progress. This commit fixes the bug by returning false early in optimizeLogicalImm if the immediate is all zeros or ones. Also, it changes the code to compare the immediate with 0 or Mask rather than calling countPopulation. rdar://problem/18231627 Differential Revision: https://reviews.llvm.org/D5591 llvm-svn: 301019	2017-04-21 18:53:12 +00:00
Artur Pilipenko	e473bed70c	[InstCombine] fadd double (sitofp x), y check that the promotion is valid Doing these transformations check that the result of integer addition is representable in the FP type. (fadd double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (fadd double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This is a fix for https://bugs.llvm.org//show_bug.cgi?id=27036 Reviewed By: andrew.w.kaylor, scanon, spatel Differential Revision: https://reviews.llvm.org/D31182 llvm-svn: 301018	2017-04-21 18:45:25 +00:00
Kuba Mracek	96eb5eb014	Fixup for r301007: Restrict the -D hack to Darwin. llvm-svn: 301017	2017-04-21 18:19:56 +00:00
Zachary Turner	00c4e359e7	[BitVector] Add find_last() and find_last_unset(). Differential Revision: https://reviews.llvm.org/D32302 llvm-svn: 301014	2017-04-21 18:07:46 +00:00
Kuba Mracek	3c94cd1a2e	Revert r301010: Bot failures on Windows, NetBSD and even some old Darwin. llvm-svn: 301012	2017-04-21 18:02:22 +00:00
Kuba Mracek	6cdb32b6dd	[libFuzzer] Always build libFuzzer There are two reasons why users might want to build libfuzzer: - To fuzz LLVM itself - To get the libFuzzer.a archive file, so that they can attach it to their code This change always builds libfuzzer, and supports the second use case if the specified flag is set. The point of this patch is to have something that can potentially be shipped with the compiler, and this also ensures that the version of libFuzzer is correct to use with that compiler. Patch by George Karpenkov. Differential Revision: https://reviews.llvm.org/D32096 llvm-svn: 301010	2017-04-21 17:47:44 +00:00
Kuba Mracek	140d5ceba9	[libFuzzer] Changing thread_local to __thread in libFuzzer Old Apple compilers do not support thread_local keyword. This patch adds -Dthread_local=__thread when the compiler doesn't support thread_local. Differential Revision: https://reviews.llvm.org/D32312 llvm-svn: 301007	2017-04-21 17:39:50 +00:00
Zachary Turner	312ef84bfd	Add llvm-cvtres to LLVMBuild.txt It wasn't getting picked up as an implicit project, so it wasn't being built. llvm-svn: 301006	2017-04-21 17:37:31 +00:00
Joel Jones	80a708bab0	[AArch64] Refactor instruction selection lowering for addresses. NFCI Factor out the common code used for generating addresses into common templated functions that call overloaded versions of a new function, getTargetNode. Tested with make check-llvm with targets AArch64. Differential Revision: https://reviews.llvm.org/D32169 llvm-svn: 301005	2017-04-21 17:31:03 +00:00
Zachary Turner	032a6ec198	Add empty shell of llvm-cvtres. This marks the beginning of an effort to port remaining MSVC toolchain miscellaneous utilities to all platforms. Currently clang-cl shells out to certain additional tools such as the IDL compiler, resource compiler, and a few other tools, but as these tools are Windows-only it limits the ability of clang to target Windows on other platforms. having a full suite of these tools directly in LLVM should eliminate this constraint. The current implementation provides no actual functionality, it is just an empty skeleton executable for the purposes of making incremental changes. Differential Revision: https://reviews.llvm.org/D32095 Patch by Eric Beckmann (ecbeckmann@google.com) llvm-svn: 301004	2017-04-21 17:30:29 +00:00
Tim Northover	46d72d07a9	ARM: don't try to create an i8 -> i32 vpaddl. DAG combine was mistakenly assuming that the step-up it was looking at was always a doubling, but it can sometimes be a larger extension in which case we'd crash. llvm-svn: 301002	2017-04-21 17:21:59 +00:00
Kuba Mracek	58d3c4ad08	[libFuzzer] Check for target(popcnt) capability before usage Older compilers (e.g. LLVM 3.4) do not support the attribute target("popcnt"). In order to support those, this diff check the attribute support using the preprocessor. Patch by George Karpenkov. Differential Revision: https://reviews.llvm.org/D32311 llvm-svn: 300999	2017-04-21 16:57:37 +00:00
Craig Topper	f21b1ae94a	[ValueTracking] Use APInt::setAllBits and APInt::intersects to simplify some code. NFC llvm-svn: 300997	2017-04-21 16:43:32 +00:00
Craig Topper	e0891c7e01	[APInt] Add compare/compareSigned methods that return -1, 0, 1. Reimplement slt/ult and friends using them Currently sle and ule have to call slt/ult and eq to get the proper answer. This results in extra code for both calls and additional scans of multiword APInts. This patch replaces slt/ult with a compareSigned/compare that can return -1, 0, or 1 so we can cover all the comparison functions with a single call. While I was there I removed the activeBits calls and other checks at the start of the slow part of ult. Both of the activeBits calls potentially scan through each of the APInts separately. I can't imagine that's any better than just scanning them in parallel and doing the compares. Now we just share the code with tcCompare. These changes seem to be good for about a 7-8k reduction on the size of the opt binary on my local x86-64 build. Differential Revision: https://reviews.llvm.org/D32339 llvm-svn: 300995	2017-04-21 16:13:15 +00:00
Juergen Ributzka	0345ae55f6	Remove empty and unused header file. llvm-svn: 300994	2017-04-21 16:05:01 +00:00
Daniel Sanders	a3de070727	[globalisel][tablegen] Import SelectionDAG's rule predicates and support the equivalent in GIRule. Summary: The SelectionDAG importer now imports rules with Predicate's attached via Requires, PredicateControl, etc. These predicates are implemented as bitset's to allow multiple predicates to be tested together. However, unlike the MC layer subtarget features, each target only pays for it's own predicates (e.g. AArch64 doesn't have 192 feature bits just because X86 needs a lot). Both AArch64 and X86 derive at least one predicate from the MachineFunction or Function so they must re-initialize AvailableFeatures before each function. They also declare locals in <Target>InstructionSelector so that computeAvailableFeatures() can use the code from SelectionDAG without modification. Reviewers: rovka, qcolombet, aditya_nandakumar, t.p.northover, ab Reviewed By: rovka Subscribers: aemerson, rengolin, dberris, kristof.beyls, llvm-commits, igorb Differential Revision: https://reviews.llvm.org/D31418 llvm-svn: 300993	2017-04-21 15:59:56 +00:00
Craig Topper	c3980cd309	[SimplifyCFG] Fix the determination of PostBB in conditional store merging to handle the targets on the second branch being commuted Currently we choose PostBB as the single successor of QFB, but its possible that QTB's single successor is QFB which would make QFB the correct choice. Differential Revision: https://reviews.llvm.org/D32323 llvm-svn: 300992	2017-04-21 15:53:42 +00:00
Wei Mi	31b250d89f	[ConstHoisting] Add BFI in constanthoisting pass and select the best insertion places based on it. Existing constant hoisting pass will merge a group of constants in a small range and hoist the const materialization code to the common dominator of their uses. However, if the uses are all in cold paths, existing implementation may hoist the materialization code from cold paths to a hot place. This may hurt performance. The patch introduces BFI to the pass and selects the best insertion places based on it. The change is controlled by an option consthoist-with-block-frequency which is off by default for now. Differential Revision: https://reviews.llvm.org/D28962 llvm-svn: 300989	2017-04-21 15:50:16 +00:00
Chad Rosier	097b9bdbd2	[AArch64][Falkor] Refine modeling of store-release exclusive instructions. llvm-svn: 300987	2017-04-21 14:58:32 +00:00
Joel Jones	4169663d9d	[Mips] Document Mips Backend Relocation Principles This revision documents the combination of C++ and table-gen code that handles relocations and addresses. Thanks for Simon Dardis for the careful reviews. Differential Revision: https://reviews.llvm.org/D31628 llvm-svn: 300986	2017-04-21 14:49:27 +00:00
Chad Rosier	ca140999d4	[AArch64][Falkor] Refine resource needs of STRQ with register offset. llvm-svn: 300984	2017-04-21 14:33:13 +00:00
Matthew Simpson	ca617e5f11	[LV] Model if-converted phi node costs Phi nodes in non-header blocks are converted to select instructions after if-conversion. This patch updates the cost model to account for the selects. Differential Revision: https://reviews.llvm.org/D31906 llvm-svn: 300980	2017-04-21 14:14:54 +00:00
Daniel Sanders	0deb184c59	Revert r300964 + r300970 - [globalisel][tablegen] Import SelectionDAG's rule predicates and support the equivalent in GIRule. It's causing llvm-clang-x86_64-expensive-checks-win to fail to compile and I haven't worked out why. Reverting to make it green while I figure it out. llvm-svn: 300978	2017-04-21 14:09:20 +00:00
Sanjay Patel	58b6a4e3f2	[InstCombine] prefer xor with -1 because 'not' is easier to understand (PR32706) This matches the demanded bits behavior in the DAG and should fix: https://bugs.llvm.org/show_bug.cgi?id=32706 Differential Revision: https://reviews.llvm.org/D32255 llvm-svn: 300977	2017-04-21 14:03:54 +00:00
Chad Rosier	633c2c7f8f	[AArch64][Falkor] Refine loads/stores that require an extra LD pipe. llvm-svn: 300976	2017-04-21 13:55:41 +00:00
Chad Rosier	6aaa4fd30c	[AArch64][Falkor] Fix number of microops for WriteSTIdx missed in r300892. llvm-svn: 300975	2017-04-21 13:37:01 +00:00
Chad Rosier	6266ca1e87	[AArch64] Fix a few missed pre/post-inc in Falkor. llvm-svn: 300974	2017-04-21 13:36:57 +00:00
Diana Picus	ecb726ef18	[ARM] GlobalISel: Add support for G_TRUNC Select them as copies. We only select if both the source and the destination are on the same register bank, so this shouldn't cause any trouble. llvm-svn: 300971	2017-04-21 13:16:50 +00:00
Daniel Sanders	565f401ece	[globalisel][tablegen] Try again to fix builds on old MSVC's after r300964 This should fix llvm-clang-x86_64-expensive-checks-win I reproduced the error using the following code: namespace llvm { // Moving this out of the llvm namespace fixes the error. template<unsigned NumBits> class PredicateBitsetImpl {}; } namespace { const unsigned MAX_SUBTARGET_PREDICATES = 11; // This works on Clang but is broken on MSVC // using PredicateBitset = PredicateBitsetImpl<MAX_SUBTARGET_PREDICATES>; // Some versions emit a syntax error here ("error C2061: syntax error: identifier // 'PredicateBitsetImpl'") but others accept it and only emit the C3646 below. // // This works on Clang and MSVC using PredicateBitset = llvm::PredicateBitsetImpl<MAX_SUBTARGET_PREDICATES>; class Foo { private: PredicateBitset A; // error C3646: 'A': unknown override specifier }; } llvm-svn: 300970	2017-04-21 12:51:43 +00:00
Daniel Sanders	1be3450361	Revert: r300966 - [globalisel][tablegen] Attempt to fix builds on old MSVC's after r300964 It didn't fix the builder. llvm-svn: 300968	2017-04-21 12:08:25 +00:00
Diana Picus	f032b2d237	[ARM] GlobalISel: Make struct arguments fail elegantly The condition in isSupportedType didn't handle struct/array arguments properly. Fix the check and add a test to make sure we use the fallback path in this kind of situation. The test deals with some common cases where the call lowering should error out. There are still some issues here that need to be addressed (tail calls come to mind), but they can be addressed in other patches. llvm-svn: 300967	2017-04-21 11:53:01 +00:00
Daniel Sanders	b760107b03	[globalisel][tablegen] Attempt to fix builds on old MSVC's after r300964 This should fix llvm-clang-x86_64-expensive-checks-win llvm-svn: 300966	2017-04-21 11:29:29 +00:00
Daniel Sanders	9e0319164d	[globalisel][tablegen] Import SelectionDAG's rule predicates and support the equivalent in GIRule. Summary: The SelectionDAG importer now imports rules with Predicate's attached via Requires, PredicateControl, etc. These predicates are implemented as bitset's to allow multiple predicates to be tested together. However, unlike the MC layer subtarget features, each target only pays for it's own predicates (e.g. AArch64 doesn't have 192 feature bits just because X86 needs a lot). Both AArch64 and X86 derive at least one predicate from the MachineFunction or Function so they must re-initialize AvailableFeatures before each function. They also declare locals in <Target>InstructionSelector so that computeAvailableFeatures() can use the code from SelectionDAG without modification. Reviewers: rovka, qcolombet, aditya_nandakumar, t.p.northover, ab Reviewed By: rovka Subscribers: aemerson, rengolin, dberris, kristof.beyls, llvm-commits, igorb Differential Revision: https://reviews.llvm.org/D31418 llvm-svn: 300964	2017-04-21 10:27:20 +00:00
Clement Courbet	f483ef9b1e	typo llvm-svn: 300963	2017-04-21 09:21:05 +00:00
Clement Courbet	8e62ffe24e	add skylake llvm-svn: 300962	2017-04-21 09:21:01 +00:00
Clement Courbet	c038908215	add 32 bit tests llvm-svn: 300961	2017-04-21 09:20:58 +00:00
Clement Courbet	c1111a1ba6	use repmovsb when optimizing for minsize llvm-svn: 300960	2017-04-21 09:20:55 +00:00
Clement Courbet	cb0dca1bff	Rename FastString flag. llvm-svn: 300959	2017-04-21 09:20:50 +00:00
Clement Courbet	bd60233285	add more tests llvm-svn: 300958	2017-04-21 09:20:44 +00:00
Clement Courbet	7015beafe5	X86 memcpy: use REPMOVSB instead of REPMOVS{Q,D,W} for inline copies when the subtarget has fast strings. This has two advantages: - Speed is improved. For example, on Haswell throughput improvements increase linearly with size from 256 to 512 bytes, after which they plateau: (e.g. 1% for 260 bytes, 25% for 400 bytes, 40% for 508 bytes). - Code is much smaller (no need to handle boundaries). llvm-svn: 300957	2017-04-21 09:20:39 +00:00
George Rimar	e970c2c743	[DWARF] - Refactoring: localize handling of relocations in a single place. This is splitted from D32228, currently DWARF parsers code has few places that applied relocations values manually. These places has similar duplicated code. Patch introduces separate method that can be used to obtain relocated value. That helps to reduce code and simplifies things. Differential revision: https://reviews.llvm.org/D32284 llvm-svn: 300956	2017-04-21 09:12:18 +00:00
Clement Courbet	a18c2fc21d	Delete dead code llvm-svn: 300952	2017-04-21 07:40:59 +00:00
Artyom Skrobov	92d7e04b17	[Thumb1] The recently added tADCS and tSBCS pseudo-instructions were missing `Uses = [CPSR]` Summary: Thanks to Oliver Stannard for helping catch this. Reviewers: olista01, efriedma Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D31815 llvm-svn: 300951	2017-04-21 07:35:21 +00:00
Serguei Katkov	802138fb6a	[AsmWriter] Eliminate warning. NFC This patch eliminates the following warning lib/IR/AsmWriter.cpp:1128:57: warning: suggest parentheses around '&&' within '\|\|' [-Wparentheses] (StrVal[1] >= '0' && StrVal[1] <= '9')) && Reviewers: timshen, rnk, davide Reviewed By: davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D32337 llvm-svn: 300950	2017-04-21 06:14:38 +00:00
George Burgess IV	4cb8e1523d	[MSSA] Clean up the updater a bit. NFC - Mark an internal function static - Remove the llvm namespace (just holding on to the `using namespace llvm;` Works on My Machine(TM)) llvm-svn: 300947	2017-04-21 04:54:52 +00:00
Davide Italiano	4469a6f083	[PartialInliner] Fix crash when inlining functions with unreachable blocks. CodeExtractor looks up the dominator node corresponding to return blocks when splitting them. If one of these blocks is unreachable, there's no node in the Dom and CodeExtractor crashes because it doesn't check for domtree node validity. In theory, we could add just a check for skipping null DTNodes in `splitReturnBlock` but the fix I propose here is slightly different. To the best of my knowledge, unreachable blocks are irrelevant for the algorithm, therefore we can just skip them when building the candidate set in the constructor. Differential Revision: https://reviews.llvm.org/D32335 llvm-svn: 300946	2017-04-21 04:25:00 +00:00
Serguei Katkov	1ad7264e1c	[BPI] Add multiplication by scalar operators to BranchProbability This patch just adds two operators to BranchProbability class: (BP * scalar) and (BP *= scalar). Reviewers: junbuml, chandlerc, sanjoy, vsk Reviewed By: chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32334 llvm-svn: 300945	2017-04-21 03:14:30 +00:00
Serguei Katkov	6a3381464c	[AsmWriter/APFloat] FP constant printing: Avoid usage of locale dependent snprinf This should fix the bug https://bugs.llvm.org/show_bug.cgi?id=12906 To print the FP constant AsmWriter does the following: 1) convert FP value to String (actually using snprintf function which is locale dependent). 2) Convert String back to FP Value 3) Compare original and got FP values. If they are not equal just dump as hex. The problem happens on the 2nd step when APFloat does not expect group delimiter or fraction delimiter other than period symbol and so on, which can be produced on the first step if LLVM library is used in an environment with corresponding locale set. To fix this issue the locale independent APFloat:toString function is used. However it prints FP values slightly differently than snprintf does. Specifically it suppress trailing zeros in significant, use capital E and so on. It results in 117 test failures during make check. To avoid this I've also updated APFloat.toString a bit to pass make check at least. Reviewers: sberg, bogner, majnemer, sanjoy, timshen, rnk Reviewed By: timshen, rnk Subscribers: rnk, llvm-commits Differential Revision: https://reviews.llvm.org/D32276 llvm-svn: 300943	2017-04-21 02:52:17 +00:00
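
The chain of equivalences in the [InstCombine] isSubsetOf() entry above (llvm-svn: 301021) can be checked with a small sketch. This is not LLVM code: it uses plain 8-bit integers in place of APInt, and the helper names isSubsetOf, orNotIsAllOnes and checkFormsAgree are hypothetical, chosen only for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the subset test named in the commit:
// true when every bit set in D is also set in C, i.e. D & ~C == 0.
constexpr bool isSubsetOf(std::uint8_t D, std::uint8_t C) {
  return (D & ~C) == 0;
}

// The first form in the commit message: C | ~D == -1 (all bits set).
// The cast truncates the promoted int back to 8 bits before comparing.
constexpr bool orNotIsAllOnes(std::uint8_t C, std::uint8_t D) {
  return static_cast<std::uint8_t>(C | ~D) == 0xFF;
}

// Exhaustively confirm that both forms agree for every 8-bit pair.
inline void checkFormsAgree() {
  for (unsigned C = 0; C < 256; ++C) {
    for (unsigned D = 0; D < 256; ++D) {
      const auto c = static_cast<std::uint8_t>(C);
      const auto d = static_cast<std::uint8_t>(D);
      assert(isSubsetOf(d, c) == orNotIsAllOnes(c, d));
    }
  }
}
```

Calling checkFormsAgree() from a test or from main runs all 65536 comparisons. The subset form needs one AND and one comparison against zero, whereas the original form builds an OR and compares it against an all-ones value; for multiword values that difference is presumably the efficiency the commit title refers to.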


147841 Commits