llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Clement Courbet	fa18c6d0db	Revert "[MergeICmps] Disable mergeicmps if the target does not want to handle memcmp expansion." Breaks clang-stage1-cmake-RA-incremental/llvm/test/Transforms/MergeICmps/X86/tuple-four-int8.ll This reverts commit 3038c459d67f8898ffa295d54a013b280690abfa. llvm-svn: 314972	2017-10-05 08:03:39 +00:00
Craig Topper	9c3200ed1a	[InstCombine] Fix a vector splat handling bug in transformZExtICmp. We were using an i1 type and then zero extending to a vector. Instead just create the 0/1 directly as a ConstantInt with the correct type. No need to ask ConstantExpr to zero extend for us. This bug is a bit tricky to hit because it requires us to visit a zext of an icmp that would normally be simplified to true/false, but that icmp hasn't been visited yet. In the test case this zext and icmp were created by visiting a udiv and due to worklist ordering we got to the zext first. Fixes PR34841. llvm-svn: 314971	2017-10-05 07:59:11 +00:00
Clement Courbet	18754ff013	[MergeICmps] Disable mergeicmps if the target does not want to handle memcmp expansion. Summary: This is to avoid e.g. merging two cheap icmps if the target is not going to expand to something nice later. Reviewers: dberlin, spatel Subscribers: davide, nemanjai Differential Revision: https://reviews.llvm.org/D38232 llvm-svn: 314970	2017-10-05 07:49:09 +00:00
Mikael Holmen	4c352af309	Minor refactoring regarding Cast::isNoopCast(), NFC Summary: FastISel::hasTrivialKill() was the only user of the "IntPtrTy" version of Cast::isNoopCast(). According to review comments in D37894 we could instead use the "DataLayout" version of the method, and thus get rid of the "IntPtrTy" versions of isNoopCast() completely. With the above done, the remaining isNoopCast() could then be simplified a bit more. Reviewers: arsenm Reviewed By: arsenm Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D38497 llvm-svn: 314969	2017-10-05 07:07:09 +00:00
Dean Michael Berris	7c47c0e53b	[XRay][tools] Support arg1 logging entries in the basic logging mode Summary: The arg1 logging handler changed in compiler-rt to start writing a different type for entries encountered when logging the first argument of XRay-instrumented functions. This change allows the trace loader to support reading these record types as well as prepare for when the basic (naive) mode implementation starts writing down the argument payloads. Without this change, binaries with arg1 logging support enabled start writing unreadable logs for any of the XRay tracing tools. Reviewers: pelikan Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38550 llvm-svn: 314967	2017-10-05 05:18:17 +00:00
Sean Fertile	3695295d20	Enabling new pass manager in LTO (and thinLTO) link step. Adds the option 'new-pass-manager' to the gold plugin to enable using the new pass manager during the lto/thinlto link step. Patch by Graham Yiu. Differential Revision: https://reviews.llvm.org/D38517 llvm-svn: 314963	2017-10-05 01:48:42 +00:00
Xinliang David Li	9b1601a096	Revert r314928 to investigate thinLTO bootstrap failure llvm-svn: 314961	2017-10-05 01:40:13 +00:00
Eugene Zelenko	6515aa7ae4	[X86] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 314953	2017-10-05 00:33:50 +00:00
Matt Arsenault	d79bc6a3f1	AMDGPU: Add comment about clamps llvm-svn: 314952	2017-10-05 00:13:20 +00:00
Matt Arsenault	263f0dbf65	AMDGPU: Do not fold clamp instructions when sources are different Patch by hakzsam (Samuel Pitoiset) llvm-svn: 314951	2017-10-05 00:13:17 +00:00
Craig Topper	d93296dbb4	[InstCombine] Improve support for ashr in foldICmpAndShift We can support ashr similar to lshr, if we know that none of the shifted in bits are used. In that case SimplifyDemandedBits would normally convert it to lshr. But that conversion doesn't happen if the shift has additional users. Differential Revision: https://reviews.llvm.org/D38521 llvm-svn: 314945	2017-10-04 23:06:13 +00:00
Matt Arsenault	6dee648d90	AMDGPU: Fix not accounting for instruction size in bundles These were counted as 0. Fixes branch limit exceeded errors in some large programs. llvm-svn: 314944	2017-10-04 22:59:12 +00:00
Konstantin Zhuravlyov	298e3abdb4	AMDGPU: Correctly set EI_OSABI based on the os Differential Revision: https://reviews.llvm.org/D38555 llvm-svn: 314943	2017-10-04 22:44:13 +00:00
Adrian Prantl	6d68e5d516	clang-format file. llvm-svn: 314942	2017-10-04 22:26:19 +00:00
Adrian Prantl	55b7ddd894	delete commented out code. llvm-svn: 314941	2017-10-04 22:26:19 +00:00
Sanjoy Das	6419f6461c	Do not call Loop::getName on possibly dead loops This fixes PR34832. llvm-svn: 314938	2017-10-04 22:02:27 +00:00
Xin Tong	61d1a4c598	[MachineBlockPlacement] Make sure PreferredLoopExit is cleared every time a new loop is processed Summary: Rotate on exit that actually exits the current loop. Reviewers: davidxl, danielcdh, iteratee, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38563 llvm-svn: 314937	2017-10-04 21:39:25 +00:00
Hans Wennborg	e7ea46e7a9	Fix a -Wparentheses warning. NFC. llvm-svn: 314936	2017-10-04 21:14:07 +00:00
Justin Lebar	c8a4e60a71	Convert an APInt to int64_t properly in TTI::getGEPCost(). Summary: If the pointer width is 32 bits and the calculated GEP offset is negative, we call APInt::getLimitedValue(), which does a zero-extension of the offset. That's wrong -- we should do an sext. Fixes a bug introduced in rL314362 and found by Evgeny Astigeevich. Reviewers: efriedma Subscribers: sanjoy, javed.absar, llvm-commits, eastig Differential Revision: https://reviews.llvm.org/D38557 llvm-svn: 314935	2017-10-04 20:47:33 +00:00
Marcello Maggioni	e3a4a32d00	[LoopDeletion] Move deleteDeadLoop to LoopUtils. NFC llvm-svn: 314934	2017-10-04 20:42:46 +00:00
Rafael Espindola	e01aed9846	Bring r314809 back. But now include a check for CPU_COUNT so we still build on 10 year old versions of glibc. Original message: Use sched_getaffinity instead of std::thread::hardware_concurrency. The issue with std::thread::hardware_concurrency is that it forwards to libc and some implementations (like glibc) don't take thread affinity into consideration. With this change a llvm program that can execute in only 2 cores will use 2 threads, even if the machine has 32 cores. This makes benchmarking a lot easier, but should also help if someone doesn't want to use all cores for compilation for example. llvm-svn: 314931	2017-10-04 20:27:01 +00:00
Sanjay Patel	95e7a2f063	[SimplifyCFG] put the optional assumption cache pointer in the options struct; NFCI This is a follow-up to https://reviews.llvm.org/D38138. I fixed the capitalization of some functions because we're changing those lines anyway and that helped verify that we weren't accidentally dropping any options by using default param values. llvm-svn: 314930	2017-10-04 20:26:25 +00:00
Xinliang David Li	63c93124d5	Recommit r314561 after fixing msan build failure (trial 2) Incoming val defined by terminator instruction which also requires bitcasts can not be handled. llvm-svn: 314928	2017-10-04 20:17:55 +00:00
Guozhi Wei	e2af0b6a83	[TargetTransformInfo] Check if function pointer is valid before calling isLoweredToCall Function isLoweredToCall can only accept non-null function pointer, but a function pointer can be null for indirect function call. So check it before calling isLoweredToCall from getInstructionLatency. Differential Revision: https://reviews.llvm.org/D38204 llvm-svn: 314927	2017-10-04 20:14:08 +00:00
Jun Bum Lim	a6292a6c0f	Recommit : Use the basic cost if a GEP is not used as addressing mode Recommitting r314517 with the fix for handling ConstantExpr. Original commit message: Currently, getGEPCost() returns TCC_FREE whenever a GEP is a legal addressing mode in the target. However, since it doesn't check its actual users, it will return FREE even in cases where the GEP cannot be folded away as a part of actual addressing mode. For example, if a user of the GEP is a call instruction taking the GEP as a parameter, then the GEP may not be folded in isel. llvm-svn: 314923	2017-10-04 18:33:52 +00:00
Daniel Neilson	46cdd30793	Revert D38481 due to missing cmake check for CPU_COUNT Summary: This reverts D38481. The change breaks systems with older versions of glibc. It injects a use of CPU_COUNT() from sched.h without checking to ensure that the function exists first. Reviewers: Subscribers: llvm-svn: 314922	2017-10-04 18:19:03 +00:00
Simon Pilgrim	480172cb76	[X86][AVX] Improve (i8 bitcast (v8i1 x)) handling for v8i64/v8f64 512-bit vector compare results. AVX1/AVX2 targets were missing a chance to use vmovmskps for v8f32/v8i32 results for bool vector bitcasts llvm-svn: 314921	2017-10-04 18:00:42 +00:00
Krzysztof Parzyszek	87db7408be	[Hexagon] Add a member Subtarget to HexagonInstrInfo, NFC llvm-svn: 314920	2017-10-04 18:00:15 +00:00
Hans Wennborg	05b9db2476	Revert r314886 "[X86] Improvement in CodeGen instruction selection for LEAs (re-applying post required revision changes.)" It broke the Chromium / SQLite build; see PR34830. > Summary: > 1/ Operand folding during complex pattern matching for LEAs has been > extended, such that it promotes Scale to accommodate similar operand > appearing in the DAG. > e.g. > T1 = A + B > T2 = T1 + 10 > T3 = T2 + A > For above DAG rooted at T3, X86AddressMode will now look like > Base = B , Index = A , Scale = 2 , Disp = 10 > > 2/ During OptimizeLEAPass down the pipeline factorization is now performed over LEAs > so that if there is an opportunity then complex LEAs (having 3 operands) > could be factored out. > e.g. > leal 1(%rax,%rcx,1), %rdx > leal 1(%rax,%rcx,2), %rcx > will be factored as following > leal 1(%rax,%rcx,1), %rdx > leal (%rdx,%rcx) , %edx > > 3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops, > thus avoiding creation of any complex LEAs within a loop. > > Reviewers: lsaba, RKSimon, craig.topper, qcolombet, jmolloy > > Reviewed By: lsaba > > Subscribers: jmolloy, spatel, igorb, llvm-commits > > Differential Revision: https://reviews.llvm.org/D35014 llvm-svn: 314919	2017-10-04 17:54:06 +00:00
Jake Ehrlich	dd2311d56c	[llvm-objcopy] Fix major layout bugs in llvm-objcopy Somehow a few massive errors slipped through the cracks of testing. 1. The code in Segment::finalize was left over from the old layout algorithm. In certain situations this would cause very strange issues with segment layout. For instance in the shift-segments.test case it would cause the second segment to have the same offset as the first. 2. In debugging this I discovered another issue. Namely section alignment was not being computed based on Section->Align but instead Section->Offset which is bizarre and makes no sense. I have no clue how it worked in the first place. This issue is also fixed 3. Fixing #2 exposed a bug where things were not being written past the end of the file that technically should have been. This was because in certain cases (like overlapping-segments) the end of the file wouldn't always be bumped if the offset could be chosen relative to an existing segment that already had its offset chosen. For fully nested segments this is fine but for overlapping segments this leaves the end of the file short. So I changed how the offset is bumped when looping through segments. Differential Revision: https://reviews.llvm.org/D38436 llvm-svn: 314918	2017-10-04 17:44:42 +00:00
Jakub Kuderski	5dd41a27d4	[Dominators] Take fast path when applying <=1 updates Summary: This patch teaches `DT.applyUpdates` to take the fast path when applying zero or just one update and makes it not run the internal batch updater machinery. With this patch, it should no longer make sense to have a special check in user's code that checks the update sequence size before applying them, e.g. ``` if (!MyUpdates.empty()) DT.applyUpdates(MyUpdates); ``` or ``` if (MyUpdates.size() == 1) if (...) DT.insertEdge(...) else DT.deleteEdge(...) ``` Reviewers: dberlin, brzycki, davide, grosser, sanjoy Reviewed By: dberlin, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38541 llvm-svn: 314917	2017-10-04 17:32:55 +00:00
Simon Pilgrim	82e8ba8ebe	[X86][SSE] Add support for lowering v8i16 binary shuffles to PACKSS/PACKUS Missed in D38472 llvm-svn: 314916	2017-10-04 17:31:28 +00:00
Francis Ricci	e99e1a675f	[test] Fix append_path in the empty case Summary: normpath() was being called on an empty string and appended to the environment variable in the case where the environment variable was unset. This led to ":." being appended to the path, since normpath() of an empty string is '.', presumably to represent cwd. Reviewers: zturner, sqlbyme, modocache Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38542 llvm-svn: 314915	2017-10-04 17:30:28 +00:00
Craig Topper	cd20fc982b	[X86] Redefine MOVSS/MOVSD instructions to take VR128 regclass as input instead of FR32/FR64 This patch redefines the MOVSS/MOVSD instructions to take VR128 as its second input. This allows the MOVSS/SD->BLEND commute to work without requiring a COPY to be inserted. This should fix PR33079 Overall this looks to be an improvement in the generated code. I haven't checked the EXPENSIVE_CHECKS build but I'll do that and update with results. Differential Revision: https://reviews.llvm.org/D38449 llvm-svn: 314914	2017-10-04 17:20:12 +00:00
Balaram Makam	ebc41ed3ca	"[ARM] Mark flaky test MachineBranchProb.ll unsupported again for ARM/AArch64" r314857 changed the CFG that resulted in the flaky test MachineBranchProb.ll to fail the bots again. Marking it as unsupported for ARM/AArch64 again until we find the cause. llvm-svn: 314912	2017-10-04 16:45:24 +00:00
Yonghong Song	b8478376b9	bpf: fix an insn encoding issue for neg insn Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 314911	2017-10-04 16:11:52 +00:00
Adam Nemet	04df6745f9	[OptRemark] Move YAML writing to IR Before the patch this was in Analysis. Moving it to IR and making it implicit part of LLVMContext::diagnose allows the full opt-remark facility to be used outside passes e.g. the pass manager. Jessica is planning to use this to report function size after each pass. The same could be used for time reports. Tested with BUILD_SHARED_LIBS=On. llvm-svn: 314909	2017-10-04 15:18:11 +00:00
Adam Nemet	195b4a91cd	Also update MachineORE after r314874. llvm-svn: 314908	2017-10-04 15:18:07 +00:00
Sanjay Patel	f3b50f7a80	[InstCombine] add 'exact' variants of all tests; NFC We can likely remove most of these as redundant in the near future, but I'm trying to make sure I don't introduce any regressions with D38514. llvm-svn: 314907	2017-10-04 15:17:25 +00:00
Clement Courbet	bbebad00da	[NFC] clang-format lib/Transforms/Scalar/MergeICmps.cpp llvm-svn: 314906	2017-10-04 15:13:52 +00:00
Simon Pilgrim	567ea9f488	[X86][SSE] Early out from ComputeNumSignBitsForTargetNode. NFCI. Early out from vector shift by immediates that will exceed eltsize - don't bother making an unnecessary ComputeNumSignBits recursive call. llvm-svn: 314903	2017-10-04 13:41:26 +00:00
Simon Pilgrim	5d05b399c3	[X86][SSE] Add support for lowering unary shuffles to PACKSS/PACKUS Extension to D38472 llvm-svn: 314901	2017-10-04 13:12:08 +00:00
George Rimar	9e031dd8b9	[gold-plugin] - Fix compilation after LLVM update (r314883). NFC. llvm-svn: 314899	2017-10-04 11:00:30 +00:00
Dylan McKay	680a47c4f6	[AVR] Implement LPMWRdZ pseudo-instruction's expansion. FIXME: implementation is mostly copy-pasted from LDWRdPtr, so we should refactor a bit and unify the two Patch by Gergo Erdi. llvm-svn: 314898	2017-10-04 10:37:22 +00:00
Dylan McKay	569e524df6	[AVR] Factor out mayLoad in tablegen patterns Patch by Gergo Erdi. llvm-svn: 314897	2017-10-04 10:36:07 +00:00
Dylan McKay	9862729490	[AVR] Elaborate LDWRdPtr into `ld r, X++; ld r+1, X` Patch by Gergo Erdi. llvm-svn: 314896	2017-10-04 10:33:36 +00:00
Dylan McKay	10d9b07b8a	[AVR] Insert JMP for long branches Previously, on long branches (relative jumps of >4 kB), an assertion failure was hit, as AVRInstrInfo::insertIndirectBranch was not implemented. Despite its name, it is called by the branch relaxator for all unconditional jumps. Patch by Thomas Backman. llvm-svn: 314891	2017-10-04 09:51:28 +00:00
Dylan McKay	e4b79b6c71	[AVR] Fix displacement overflow for LDDW/STDW In some cases, the code generator attempts to generate instructions such as: lddw r24, Y+63 which expands to: ldd r24, Y+63 ldd r25, Y+64 # Oops! This is actually ld r25, Y in the binary This commit limits the first offset to 62, and thus the second to 63. It also updates some asserts in AVRExpandPseudoInsts.cpp, including for INW and OUTW, which appear to be unused. Patch by Thomas Backman. llvm-svn: 314890	2017-10-04 09:51:21 +00:00
Oliver Stannard	ed10b53988	[ARM] Add diag string for movw/movt immediates in assembly This adds diagnostics for invalid immediate operands to the MOVW and MOVT instructions (ARM and Thumb). Differential revision: https://reviews.llvm.org/D31879 llvm-svn: 314888	2017-10-04 09:24:54 +00:00
Oliver Stannard	33d655fd7a	[ARM, Asm] Change grammar of immediate operand diagnostics Currently, our diagnostics for assembly operands are not consistent. Some start with (for example) "immediate operand must be ...", and some with "operand must be an immediate ...". I think the latter form is preferable for a few reasons: * It's unambiguous that it is referring to the expected type of operand, not the type the user provided. For example, the user could provide a register operand, and get a message talking about an operand as if it is already an immediate, just not in the accepted range. * It allows us to have a consistent style once we add diagnostics for operands that could take two forms, for example a label or pc-relative memory operand. Differential revision: https://reviews.llvm.org/D36689 llvm-svn: 314887	2017-10-04 09:18:07 +00:00
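
Note on e99e1a675f ([test] Fix append_path in the empty case): the message describes normpath() turning an empty string into '.', so an unset variable ended up with a stray '.' entry. The following is a minimal Python sketch of that behaviour and of one way to guard against it. The helper name `append_path` and its signature here are hypothetical and chosen for illustration only; this is not the code from the patch.

```python
import os


def append_path(env, name, *paths):
    """Hypothetical helper: append normalized paths to a path-list variable.

    `env` is a plain dict standing in for the process environment.
    """
    # Normalize only the new entries; os.path.normpath("") would yield ".".
    entries = [os.path.normpath(p) for p in paths]
    current = env.get(name, "")
    if current:
        # Keep the existing value only when it is non-empty, so an unset
        # variable does not contribute an empty (and then ".") entry.
        entries.insert(0, current)
    env[name] = os.pathsep.join(entries)
    return env[name]


if __name__ == "__main__":
    # The pitfall from the commit message: an empty string normalizes to ".".
    print(repr(os.path.normpath("")))
    # With the guard, an unset variable yields just the new path.
    print(append_path({}, "DEMO_PATH", "/opt/tool/../bin"))
```

The guard on `current` is the whole point of the sketch: without it, joining an empty existing value with the new entries is what introduces the spurious current-directory entry.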


155023 Commits