llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 13:02:52 +02:00

Author	SHA1	Message	Date
Craig Topper	c755f719e9	[APInt] Fix bugs in isShiftedMask to match behavior of the similar function in MathExtras.h This removes a parameter from the routine that was responsible for a lot of the issue. It was a bit count that had to be set to the BitWidth of the APInt and would get passed to getLowBitsSet. This guaranteed the call to getLowBitsSet would create an all ones value. This was then compared to (V \| (V-1)). So the only shifted masks we detected had to have the MSB set. The one in tree user is a transform in InstCombine that never fires due to earlier transforms covering the case better. I've submitted a patch to remove it completely, but for now I've just adapted it to the new interface for isShiftedMask. llvm-svn: 299273	2017-03-31 22:23:42 +00:00
Konstantin Zhuravlyov	cfca8f7feb	[AMDGPU] Fix typo in test filename. NFC. llvm-svn: 299271	2017-03-31 22:14:54 +00:00
Derek Schuff	e5945bcdb2	Add virtual destructor to WasmYAML::Section or avoid memory leak Tested locally with -DLLVM_USE_SANITIZER=Address Differential Revision: https://reviews.llvm.org/D31551 Patch by Sam Clegg llvm-svn: 299270	2017-03-31 22:14:14 +00:00
Bob Haarman	b5a9d6bfb3	LTO: call getRealLinkageName on IRNames before feeding to getGUID Summary: GlobalValue has two getGUID methods: an instance method and a static method. The static method takes a string, which is expected to be what GlobalValue::getRealLinkageName() would return. In LTO.cpp, we were not doing this consistently, sometimes passing an IR name instead. This change makes it so that we call getRealLinkageName() first, making the static getGUID return value consistent with the instance method. Without this change, compiling FileCheck with ThinLTO on Windows fails with numerous undefined symbol errors. With the change, it builds successfully. Reviewers: pcc, rnk Reviewed By: pcc Subscribers: tejohnson, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D31444 llvm-svn: 299268	2017-03-31 21:56:30 +00:00
Craig Topper	9e862912a3	[InstCombine] When adding an Instruction and its Users to the worklist at the same time, make sure we put the Users in first. Then put in the instruction. This way we ensure we immediately revisit the instruction and do any additional optimizations before visiting the users. Otherwise we might visit the users, then the instruction, then users again, then instruction again. llvm-svn: 299267	2017-03-31 21:35:30 +00:00
Sanjay Patel	815f1495f5	[DAGCombiner] refactor and/or-of-setcc to get rid of duplicated code; NFCI llvm-svn: 299266	2017-03-31 21:30:50 +00:00
Reid Kleckner	999ed511ee	Fix binary static archive that got mangled by patch llvm-svn: 299265	2017-03-31 21:16:22 +00:00
Reid Kleckner	2f8f001e86	[llvm-ar] Extract objects to their basename in the CWD This is helpful when extracting objects from archives produced by MSVC's lib.exe, which users absolute paths to describe the archive members. llvm-svn: 299264	2017-03-31 21:10:53 +00:00
Craig Topper	5d174cd9c8	[InstCombine] Add test case demonstrating missed opportunities for removing add/sub when the LSBs of one input are known to be 0 and MSBs of the output aren't consumed. llvm-svn: 299263	2017-03-31 21:08:37 +00:00
Krzysztof Parzyszek	3058529228	[Hexagon] Remove unused variables Found by PVS-Studio. Fixes llvm.org/PR31676. llvm-svn: 299262	2017-03-31 21:03:59 +00:00
Krzysztof Parzyszek	ed7792e41e	[Hexagon] Fix typo in HexagonEarlyIfCConv.cpp Found by PVS-Studio. Fixes llvm.org/PR32480. llvm-svn: 299258	2017-03-31 20:36:00 +00:00
Stephen Canon	aadb07a152	Fix 80-column violation in previous commit. llvm-svn: 299257	2017-03-31 20:35:02 +00:00
Stephen Canon	dc865d22d7	Fix APFloat mod (committing for simonbyrne) The previous version was prone to intermediate rounding or overflow. Differential Revision: https://reviews.llvm.org/D29346 llvm-svn: 299256	2017-03-31 20:31:33 +00:00
Sanjay Patel	fa8ce143bf	[DAGCombiner] add fold for 'All sign bits set?' (and (setlt X, 0), (setlt Y, 0)) --> (setlt (and X, Y), 0) We have 7 similar folds, but this one got away. The fact that the x86 test with a branch didn't change is probably a separate bug. We may also be missing this and the related folds in instcombine. llvm-svn: 299252	2017-03-31 20:28:06 +00:00
Stanislav Mekhanoshin	7f7b59840c	[AMDGPU] Remove assumption that vector and scalar types do not alias Differential Revision: https://reviews.llvm.org/D31547 llvm-svn: 299250	2017-03-31 20:16:54 +00:00
Craig Topper	74bd2ab89a	[APInt] Remove shift functions from APIntOps namespace. Replace the few users with the APInt class methods. NFCI llvm-svn: 299248	2017-03-31 20:01:16 +00:00
Joerg Sonnenberger	d51f93d34b	Do not translate rint into nearbyint, but truncate it like nearbyint. A common way to implement nearbyint is by fiddling with the floating point environment and calling rint. This is used at least by the BSD libm and musl. As such, canonicalizing the latter to the former will create infinite loops for libm and generally pessimize performance, at least when the generic C versions are used. This change preserves the rint in the libcall translation and also handles the domain truncation logic, so that rint with float argument will be reduced to rintf etc. llvm-svn: 299247	2017-03-31 19:58:07 +00:00
Matt Arsenault	c3c5eef5bb	AMDGPU: Remove unnecessary ands when f16 is legal Add a new node to act as a fancy bitcast from f16 operations to i32 that implicitly zero the high 16-bits of the result. Alternatively could try making v2f16 legal and canonicalizing on build_vectors. llvm-svn: 299246	2017-03-31 19:53:03 +00:00
Jan Vesely	3a1dc32be9	AMDGPU/R600: Fix amdgpu alias analysis pass. R600 uses higher AS number to access kernel parameters Fixes: r298846 Differential Revision: https://reviews.llvm.org/D31520 llvm-svn: 299245	2017-03-31 19:26:23 +00:00
Sanjay Patel	cf58148c0b	[PowerPC] add tests for setcc+setcc+logic; NFC These are the same tests added for x86 with r299238, but PPC doesn't specify all branches as cheap, so we see different patterns in tests with branches. llvm-svn: 299244	2017-03-31 18:51:03 +00:00
Craig Topper	25a319fb68	[APInt] Rewrite getLoBits in a way that will do one less memory allocation in the multiword case. Rewrite getHiBits to use the class method version of lshr instead of the one in APIntOps. NFCI llvm-svn: 299243	2017-03-31 18:48:14 +00:00
Craig Topper	1077a9da9c	[APInt] Remove unused functions from the APIntOps namespace. The corresponding methods on the APInt object should be used instead. NFC llvm-svn: 299242	2017-03-31 18:30:01 +00:00
Sanjay Patel	502b9e4f48	[DAGCombiner] remove redundant code and add comments; NFCI llvm-svn: 299241	2017-03-31 18:18:58 +00:00
Balaram Makam	127d0e73d2	[AArch64] Add new subtarget feature to fold LSL into address mode. Summary: This feature enables folding of logical shift operations of up to 3 places into addressing mode on Kryo and Falkor that have a fastpath LSL. Reviewers: mcrosier, rengolin, t.p.northover Subscribers: junbuml, gberry, llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D31113 llvm-svn: 299240	2017-03-31 18:16:53 +00:00
Sanjay Patel	cf9e1adbff	[x86] add/consolidate tests for setcc+setcc+and/or; NFC llvm-svn: 299238	2017-03-31 17:55:07 +00:00
Adam Nemet	dc5d7f0cc6	Improve DebugInfo/strip-loop-metadata.ll test This wasn't covering for the case where you have multiple latches and hence the use of the same loop-id which needs to be mapped to the same loop-id. llvm-svn: 299237	2017-03-31 17:51:12 +00:00
Piotr Padlewski	142150de89	[MSSA] Small test fix llvm-svn: 299235	2017-03-31 17:39:07 +00:00
Craig Topper	40bbf69094	[AVX-512] Update lowering for gather/scatter prefetch intrinsics to match the immediate encodings the frontend uses based on the _MM_HINT_T0/T1 constant values in clang's headers. Our _MM_HINT_T0/T1 constant values are 3/2 which matches gcc, but not icc or Intel documentation. Interestingly gcc had this same bug on their implementation of the gather/scatter builtins at one point too. Fixes PR32411. llvm-svn: 299234	2017-03-31 17:24:29 +00:00
Rafael Espindola	4c46bce1b0	Rename variable. Requested on post commit code review. llvm-svn: 299232	2017-03-31 17:11:51 +00:00
Dehao Chen	20bf5cf253	Fix the InstCombine to reserve the VP metadata and sets correct call count. Summary: Currently the VP metadata was dropped when InstCombine converts a call to direct call. This patch converts the VP metadata to branch_weights so that its hotness is recorded. Reviewers: eraman, davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31344 llvm-svn: 299228	2017-03-31 15:59:52 +00:00
Jan Sjodin	5d3fb065f5	Refactor code to create getFallThrough method in MachineBasicBlock. Differential Revision: https://reviews.llvm.org/D27264 llvm-svn: 299227	2017-03-31 15:55:37 +00:00
Kristof Beyls	2a0353c292	Remove name space pollution from Signals.cpp llvm-svn: 299224	2017-03-31 14:58:52 +00:00
Petar Jovanovic	94d4d3db3b	[mips][msa] Prevent output operand from commuting for dpadd_[su].df ins Implementation of TargetInstrInfo::findCommutedOpIndices for MIPS target, restricting commutativity to second and third operand only for dpaadd_[su].df instructions therein. Prior to this change, there were cases where the vector that is to be added to the dot product of the other two could take a position other than the first one in the instruction, generating false output in the destination vector. Such behavior has been noticed in the two functions generating v2i64 output values so far. Other ones may exhibit such behavior as well, just not for the vector operands which are present in the test at the moment. Tests altered so that the function's first operand is a constant splat so that it can be loaded with a ldi instruction, since that is the case in which the erroneous instruction operand placement has occurred. We check that the register which is present in the ldi instruction is placed as the first operand in the corresponding dpadd instruction. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D30827 llvm-svn: 299223	2017-03-31 14:31:55 +00:00
Kristof Beyls	2f9a65d919	Remove more name space pollution from .inc files llvm-svn: 299222	2017-03-31 14:26:44 +00:00
Simon Pilgrim	ef98e5126a	[DAGCombiner] Add ComputeNumSignBits vector demanded elements support to ASHR and INSERT_VECTOR_ELT Followup to D31311 llvm-svn: 299221	2017-03-31 14:21:50 +00:00
Jonas Paulsson	de98d3678e	[SystemZ] Make sure of correct regclasses in insertSelect() Since LOCR only accepts GR32 virtual registers, its operands must be copied into this regclass in insertSelect(), when an LOCR is built. Otherwise, the case where the source operand was GRX32 will produce invalid IR. Review: Ulrich Weigand llvm-svn: 299220	2017-03-31 14:06:59 +00:00
Simon Pilgrim	026e8c9b44	[DAGCombiner] Add vector demanded elements support to ComputeNumSignBits Currently ComputeNumSignBits returns the minimum number of sign bits for all elements of vector data, when we may only be interested in one/some of the elements. This patch adds a DemandedElts argument that allows us to specify the elements we actually care about. The original ComputeNumSignBits implementation calls with a DemandedElts demanding all elements to match current behaviour. Scalar types set this to 1. I've only added support for BUILD_VECTOR and EXTRACT_VECTOR_ELT so far, all others will default to demanding all elements but can be updated in due course. Followup to D25691. Differential Revision: https://reviews.llvm.org/D31311 llvm-svn: 299219	2017-03-31 13:54:09 +00:00
Kristof Beyls	1397ecba80	Do not pollute the namespace in a header file. llvm-svn: 299218	2017-03-31 13:48:21 +00:00
Rafael Espindola	8faf0df49e	Add a %basename substitution. This will be used to avoid various call to basename in the asan tests. llvm-svn: 299216	2017-03-31 13:41:10 +00:00
Jonas Paulsson	85e07f410f	[SystemZ] Skip DAGCombining of vector node for older subtargets. Even on older subtargets that lack vector support, there may be vector values with just one element in the input program. These are converted during DAG legalization to scalar values. The pre-legalize SystemZ DAGCombiner methods should in this circumstance not touch these nodes. This patch adds a check for this in SystemZTargetLowering::combineEXTRACT_VECTOR_ELT(). Review: Ulrich Weigand llvm-svn: 299213	2017-03-31 13:22:59 +00:00
Kristof Beyls	d6adfb4c85	Make naming in Host.h in line with coding standards. Based on post-commit review comments by Chandler Carruth on https://reviews.llvm.org/D31236. Thanks! llvm-svn: 299211	2017-03-31 13:06:40 +00:00
Rafael Espindola	455a6e04ae	Use the current working directory in the glob expansion This fixes tests that do things like mkdir <dir> cd <dir> .. <cmd> *.foo llvm-svn: 299209	2017-03-31 12:46:39 +00:00
Yaron Keren	7e27a0ae9d	Update comment for r299098 per feedback from James Henderson. llvm-svn: 299207	2017-03-31 12:08:45 +00:00
Max Kazantsev	ccddc942de	[ScalarEvolution] Re-enable Predicate implication from operations The patch rL298481 was reverted due to crash on clang-with-lto-ubuntu build. The reason of the crash was type mismatch between either a or b and RHS in the following situation: LHS = sext(a +nsw b) > RHS. This is quite rare, but still possible situation. Normally we need to cast all {a, b, RHS} to their widest type. But we try to avoid creation of new SCEV that are not constants to avoid initiating recursive analysis that can take a lot of time and/or cache a bad value for iterations number. To deal with this, in this patch we reject this case and will not try to analyze it if the type of sum doesn't match with the type of RHS. In this situation we don't need to create any non-constant SCEVs. This patch also adds an assertion to the method IsProvedViaContext so that we could fail on it and not go further into range analysis etc (because in some situations these analyzes succeed even when the passed arguments have wrong types, what should not normally happen). The patch also contains a fix for a problem with too narrow scope of the analysis caused by wrong usage of predicates in recursive invocations. The regression test on the said failure: test/Analysis/ScalarEvolution/implied-via-addition.ll Reviewers: reames, apilipenko, anna, sanjoy Reviewed By: sanjoy Subscribers: mzolotukhin, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D31238 llvm-svn: 299205	2017-03-31 12:05:30 +00:00
Kristof Beyls	cbb1a575f1	Do not pollute the namespace in a header file. llvm-svn: 299203	2017-03-31 12:00:24 +00:00
Sam Kolton	e9700205c5	[AMDGPU] SDWA Peephole: improve search for immediates in SDWA patterns Previously compiler often extracted common immediates into specific register, e.g.: ``` %vreg0 = S_MOV_B32 0xff; %vreg2 = V_AND_B32_e32 %vreg0, %vreg1 %vreg4 = V_AND_B32_e32 %vreg0, %vreg3 ``` Because of this SDWA peephole failed to find SDWA convertible pattern. E.g. in previous example this could be converted into 2 SDWA src operands: ``` SDWA src: %vreg2 src_sel:BYTE_0 SDWA src: %vreg4 src_sel:BYTE_0 ``` With this change peephole check if operand is either immediate or register that is copy of immediate. llvm-svn: 299202	2017-03-31 11:42:43 +00:00
Simon Pilgrim	4864971eb2	[DAGCombiner] Add vector demanded elements support to computeKnownBitsForTargetNode Follow up to D25691, this sets up the plumbing necessary to support vector demanded elements support in known bits calculations in target nodes. Differential Revision: https://reviews.llvm.org/D31249 llvm-svn: 299201	2017-03-31 11:24:16 +00:00
Simon Pilgrim	e19a952db2	Spelling mistakes in comments. NFCI. llvm-svn: 299197	2017-03-31 10:59:37 +00:00
Simon Pilgrim	0a4e3a3959	Fix MSVC 'not all control paths return a value' warning llvm-svn: 299195	2017-03-31 10:46:47 +00:00
Simon Pilgrim	dc4c90ee77	Fix signed/unsigned warning llvm-svn: 299194	2017-03-31 10:45:35 +00:00

... 3 4 5 6 7 ...

147142 Commits