llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Craig Topper	a8988785a1	[InstCombine] Disable recursion in foldGEPICmp for vector pointer GEPs Due to missing vector support in this function, recursion can generate worse code in some cases. llvm-svn: 370221	2019-08-28 15:40:34 +00:00
Simon Pilgrim	7239307170	Fix uninitialized variable warning in cppcheck. NFCI. InstCombiner::MaxArraySizeForCombine is set outside the constructor so we need to ensure it has a default initialization value. llvm-svn: 370220	2019-08-28 15:19:49 +00:00
David Bolvansky	71dbbcd659	[NFC] Added a comment to avoid possible confusion llvm-svn: 370217	2019-08-28 15:04:48 +00:00
Ryan Taylor	9200d20ca6	[AMDGPU] Adjust number of SGPRs available in Calling Convention This reduces the number of SGPRs due to some concerns about running out of SGPRs if you make all the SGPRs that aren't reserved available for the calling convention. Change-Id: Idb4ca4dc72f5b6808cb524ff7270915a8de5b4c1 llvm-svn: 370215	2019-08-28 15:00:45 +00:00
Simon Pilgrim	81a10972e0	Remove duplicate 'BitWidth' variable. NFCI. llvm-svn: 370212	2019-08-28 14:37:44 +00:00
Johannes Doerfert	388e7de434	[Attributor] Restrict liveness and return information to functions Summary: Until we have proper call-site information we should not recompute liveness and return information for each call site. This patch directly uses the function versions and introduces TODOs at the usage sites. The required iterations to get to the fixpoint are most of the time reduced by this change and we always avoid work duplication. Reviewers: sstefan1, uenoku Subscribers: hiraditya, bollu, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66562 llvm-svn: 370208	2019-08-28 14:09:14 +00:00
Simon Pilgrim	18cce90beb	InstCombiner::visitSelectInst - rename Pred to MinMaxPred to stop shadow variable warning. NFCI. We have a lot of Predicate variables, all similarly named.... llvm-svn: 370207	2019-08-28 14:05:38 +00:00
Vlad Tsyrklevich	b2c1fe3130	Reland "[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same time." This relands this commit, I mistakenly reverted the original change thinking it was the cause of the observed MSan failures but it was not. llvm-svn: 370206	2019-08-28 14:04:09 +00:00
Hans Wennborg	14230be8c9	[SelectionDAG] Don't generate libcalls for wide shifts on Windows (PR42711) Neither libgcc or compiler-rt are usually used on Windows, so these functions can't be called. Differential revision: https://reviews.llvm.org/D66880 llvm-svn: 370204	2019-08-28 13:55:10 +00:00
Amaury Sechet	d6302afdcb	[X86] Add test for rotate combining when add X, X is used instead of shl X, 1. NFC llvm-svn: 370203	2019-08-28 13:52:32 +00:00
Vlad Tsyrklevich	08db50a420	Revert "[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same time." This reverts commit r370032, it was causing check-llvm failures on sanitizer-x86_64-linux-bootstrap-msan llvm-svn: 370198	2019-08-28 13:15:08 +00:00
Simon Pilgrim	3deef14031	[DAGCombine] Fix cppcheck shadow variable warning. NFCI. We already have an outer Ops variable. llvm-svn: 370197	2019-08-28 12:48:41 +00:00
Simon Atanasyan	e31779f54e	[mips] Use less registers to load address of TargetExternalSymbol There is no pattern matched `add hi, (MipsLo texternalsym)`. As a result, loading an address of 32-bit symbol requires two registers and one more additional instruction: ``` addiu $1, $zero, %lo(foo) lui $2, %hi(foo) addu $25, $2, $1 ``` This patch adds the missed pattern and enables generation more effective set of instructions: ``` lui $1, %hi(foo) addiu $25, $1, %lo(foo) ``` Differential Revision: https://reviews.llvm.org/D66771 llvm-svn: 370196	2019-08-28 12:35:53 +00:00
Amaury Sechet	4b092471e1	[TargetLowering] Add buildLegalVectorShuffle facility to help build legal shuffles Summary: There are at least 2 ways to express the same shuffle. Various pieces of code explicit check for both option, but other places do not when they would benefit from doing it. This patches refactor the codebase to use buildLegalVectorShuffle in order to make that behavior more consistent. Reviewers: craig.topper, efriedma, RKSimon, lebedev.ri Subscribers: javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66804 llvm-svn: 370190	2019-08-28 12:00:06 +00:00
Simon Pilgrim	4ee564a42f	[DAGCombine] Remove LoadedSlice::Cost default 'ForCodeSize' constructor arguments. NFCI. These were always being passed in and it allowed me to add the explicit tag to stop a cppcheck warning about 1 argument constructors. llvm-svn: 370189	2019-08-28 11:50:36 +00:00
Nico Weber	ffe94e8e3f	gn build: Merge r370187 llvm-svn: 370188	2019-08-28 11:42:20 +00:00
David Green	3c01cd4463	[ARM] Move MVEVPTBlockPass to a separate file. NFC This just pulls the MVEVPTBlockPass into a separate file, as opposed to being wrapped up in Thumb2ITBlockPass. Differential revision: https://reviews.llvm.org/D66579 llvm-svn: 370187	2019-08-28 11:37:31 +00:00
David Green	4ccc1cffed	[MVE] VMOVX patterns This adds fp16 VMOVX patterns, using the same patterns as rL362482 with some adjustments for MVE. It allows us to move fp16 registers without going into and out of gprs. VMOVX is able to move the top bits from a fp16 in a fp reg into the bottom bits of another register, zeroing the rest. This can be used for odd MVE register lanes. The top bits are not read by fp16 instructions, so no move is required there if we are dealing with even lanes. Differential revision: https://reviews.llvm.org/D66793 llvm-svn: 370184	2019-08-28 10:13:23 +00:00
Hans Wennborg	002a337ed8	[LLVM-C] Fix ByVal Attribute crashing With the introduction of the typed byval attribute change there was no way that the LLVM-C API could create the correct class Attribute. If a program that uses the C API creates a ByVal attribute and annotates a function with that attribute LLVM will crash when it assembles or write that module containing the function out as bitcode. This change is a minimal fix to at least allow code to work, this is because the byval change is on the 9.0 and I don't want to introduce new LLVM-C API this late in the release cycle. By Jakob Bornecrantz! Differential revision: https://reviews.llvm.org/D66144 llvm-svn: 370176	2019-08-28 09:21:56 +00:00
Ayal Zaks	26b6527bad	[LV] Fold tail by masking - handle reductions Allow vectorizing loops that have reductions when tail is folded by masking. A select is introduced in VPlan, choosing between the last value carried by the loop-exit/live-out instruction of the reduction, and the penultimate value carried by the reduction phi, according to the "i < n" mask of fold-tail. This select replaces the last value as the live-out value of the loop. Differential Revision: https://reviews.llvm.org/D66720 llvm-svn: 370173	2019-08-28 09:02:23 +00:00
Sam Parker	e4a5828aa1	[ARM][ParallelDSP] Change search for muls rL369567 reverted a couple of recent changes made to ARMParallelDSP because of a miscompilation error: PR43073. The issue stemmed from an underlying bug that was caused by adding muls into a reduction before it was proved that they could be executed in parallel with another mul. Most of the changes here are from the previously reverted commits. The additional changes have been made area: 1) The Search function now doesn't insert any muls into the Reduction object. That now happens once the search has successfully finished. 2) For any muls added into the reduction but that weren't paired, we accumulate their values as an input into the smlad. Differential Revision: https://reviews.llvm.org/D66660 llvm-svn: 370171	2019-08-28 08:51:13 +00:00
David Bolvansky	742b320cda	[NFC] Unbreak tests llvm-svn: 370170	2019-08-28 08:42:40 +00:00
David Bolvansky	418464dc6f	[NFC] Updated test llvm-svn: 370169	2019-08-28 08:40:45 +00:00
David Bolvansky	c32ac7b1f0	Annotate return values of allocation functions with dereferenceable_or_null Summary: Example define dso_local noalias i8* @_Z6maixxnv() local_unnamed_addr #0 { entry: %call = tail call noalias dereferenceable_or_null(64) i8* @malloc(i64 64) #6 ret i8* %call } Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: aaron.ballman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66651 llvm-svn: 370168	2019-08-28 08:28:20 +00:00
Yi Kong	97b06562dc	[llvm-objdump] Add the missing ARMv8 subarch detection Differential Revision: https://reviews.llvm.org/D66849 llvm-svn: 370163	2019-08-28 06:37:22 +00:00
Fangrui Song	062f09fe21	[LoopFusion] Fix another -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off build llvm-svn: 370156	2019-08-28 03:12:40 +00:00
Matt Arsenault	c4538e58af	AMDGPU/GlobalISel: Fix constraining scalar and/or/xor If the result register already had a register class assigned, the sources may not have been properly constrained. llvm-svn: 370150	2019-08-28 02:11:03 +00:00
Richard Trieu	153062896c	Revert r370105 - Update two x86 datalayouts for r370083, looks like racing commits r370083 has been reverted, which this change depends on. llvm-svn: 370147	2019-08-28 01:55:06 +00:00
Vedant Kumar	ec283a6ca4	[test] Speculative fix for r369966 on llvm-clang-x86_64-win Run the MIR pipeline in this test to completion to try and avoid a "Bad machine code" error. Build failure: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190826/688338.html llvm-svn: 370145	2019-08-28 01:39:36 +00:00
Vlad Tsyrklevich	b6303e0ac6	Revert "Change the X86 datalayout to add three address spaces for 32 bit signed," This reverts commit r370083 because it caused check-lld failures on sanitizer-x86_64-linux-fast. llvm-svn: 370142	2019-08-28 01:08:54 +00:00
Matt Arsenault	c8b31ec555	AMDGPU/GlobalISel: Implement addrspacecast for 32-bit constant addrspace llvm-svn: 370140	2019-08-28 00:58:24 +00:00
Philip Reames	3b4f4ee245	[NFC] Assert preconditions and merge all users into one codepath in Loads.cpp llvm-svn: 370128	2019-08-27 23:36:31 +00:00
Craig Topper	14aab0af29	[InstCombine] Disable some portions of foldGEPICmp for GEPs that return a vector of pointers. Fix other portions. llvm-svn: 370114	2019-08-27 21:38:56 +00:00
Luis Marques	180f0a33cd	[RISCV] Implement RISCVRegisterInfo::getPointerRegClass Fixes bug 43041 Differential Revision: https://reviews.llvm.org/D66752 llvm-svn: 370113	2019-08-27 21:37:57 +00:00
Craig Topper	56ca5f11a7	[Analysis] Improve EmitGEPOffset handling of vector GEPs with scalar indices. This patch splats the scalar index if necessary before using it in any integer casts or other arithmetic. llvm-svn: 370112	2019-08-27 21:31:44 +00:00
Reid Kleckner	8c87cc042d	Update two x86 datalayouts for r370083, looks like racing commits llvm-svn: 370105	2019-08-27 19:55:10 +00:00
Amara Emerson	ced654127e	[GlobalISel] Replace hard coded dynamic alloca handling with G_DYN_STACKALLOC. This change moves the actual stack pointer manipulation into the legalizer, available to targets via lower(). The codegen is slightly different because we're using explicit masks instead of G_PTRMASK, and using G_SUB rather than adding a negative amount via G_GEP. Differential Revision: https://reviews.llvm.org/D66678 llvm-svn: 370104	2019-08-27 19:54:27 +00:00
Philip Reames	661116b15b	[Loads/SROA] Remove blatantly incorrect code and fix a bug revealed in the process The code we had isSafeToLoadUnconditionally was blatantly wrong. This function takes a "Size" argument which is supposed to describe the span loaded from. Instead, the code use the size of the pointer passed (which may be unrelated!) and only checks that span. For any Size > LoadSize, this can and does lead to miscompiles. Worse, the generic code just a few lines above correctly handles the cases which are valid. So, let's delete said code. Removing this code revealed two issues: 1) As noted by jdoerfert the removed code incorrectly handled external globals. The test update in SROA is to stop testing incorrect behavior. 2) SROA was confusing bytes and bits, but this wasn't obvious as the Size parameter was being essentially ignored anyway. Fixed. Differential Revision: https://reviews.llvm.org/D66778 llvm-svn: 370102	2019-08-27 19:34:43 +00:00
Matt Arsenault	d565c170bf	DAG: computeNumSignBits for MUL Copied directly from the IR version. Most of the testcases I've added for this are somewhat problematic because they really end up testing the yet to be implemented version for MUL_I24/MUL_U24. llvm-svn: 370099	2019-08-27 19:05:33 +00:00
Matt Arsenault	a8f6d310ad	AMDGPU: Add baseline test for num sign bits of mul llvm-svn: 370098	2019-08-27 19:01:02 +00:00
Jason Liu	fd2aefafba	[XCOFF][AIX] Generate symbol table entries with llvm-readobj Summary: This patch implements main entry and auxiliary entries of symbol table generation for llvm-readobj on AIX. The source code of aix_xcoff_xlc_test8.o (compile with xlc) is: -bash-4.2$ cat test8.c extern int i; extern int TestforXcoff; extern int fun(int i); static int static_i; char* p="abcd"; int fun1(int j) { static_i++; j++; j=j+*p; return j; } int main() { i++; fun(i); return fun1(i); } Patch provided by DiggerLin Differential Revision: https://reviews.llvm.org/D65240 llvm-svn: 370097	2019-08-27 18:54:46 +00:00
Reid Kleckner	fa08039609	Revert Autogenerate the shebang lines for tools/opt-viewer This reverts r369486 (git commit 8d18384809957cc923752e10a86adab129e3df48) The opt-viewer tests don't pass after this change, and fixing them isn't trivial. opt-viewer.py imports optmap, which requires adjusting pythonpath, which is more work than I'm willing to do to fix forward. llvm-svn: 370095	2019-08-27 18:31:29 +00:00
Praveen Velliengiri	33c62d6ddc	[ORCv2] - New Speculate Query Implementation Summary: This patch introduces, SequenceBBQuery - new heuristic to find likely next callable functions it tries to find the blocks with calls in order of execution sequence of Blocks. It still uses BlockFrequencyAnalysis to find high frequency blocks. For a handful of hottest blocks (plan to customize), the algorithm traverse and discovered the caller blocks along the way to Entry Basic Block and Exit Basic Block. It uses Block Hint, to stop traversing the already visited blocks in both direction. It implicitly assumes that once the block is visited during discovering entry or exit nodes, revisiting them again does not add much. It also branch probability info (cached result) to traverse only hot edges (planned to customize) from hot blocks. Without BPI, the algorithm mostly return's all the blocks in the CFG with calls. It also changes the heuristic queries, so they don't maintain states. Hence it is safe to call from multiple threads. It also implements, new instrumentation to avoid jumping into JIT on every call to the function with the help _orc_speculate.decision.block and _orc_speculate.block. "Speculator Registration Mechanism is also changed" - kudos to @lhames Open to review, mostly looking to change implementation of SequeceBBQuery heuristics with good data structure choices. Reviewers: lhames, dblaikie Reviewed By: lhames Subscribers: mgorny, hiraditya, mgrang, llvm-commits, lhames Tags: #speculative_compilation_in_orc, #llvm Differential Revision: https://reviews.llvm.org/D66399 llvm-svn: 370092	2019-08-27 18:23:36 +00:00
Andrea Di Biagio	cf4aba9b59	[Tblgen][MCA] Add the ability to mark groups as LoadQueue and StoreQueue. NFCI Before this patch, users were not allowed to optionally mark processor resource groups as load/store queues. That is because tablegen class MemoryQueue was originally declared as expecting a ProcResource template argument (instead of a more generic ProcResourceKind). That was an oversight, since the original intention from D54957 was to let user mark any processor resource as either load/store queue. This patch adds the ability to use processor resource groups in MemoryQueue definitions. This is not a user visible change. Differential Revision: https://reviews.llvm.org/D66810 llvm-svn: 370091	2019-08-27 18:20:34 +00:00
Matt Arsenault	318ea61295	AMDGPU: Add amdgpu-32bit-address-high-bits to MIR serialization llvm-svn: 370089	2019-08-27 18:18:38 +00:00
Lang Hames	29ae644374	[JITLink] Fix bogus TimerGroup constructor call. llvm-svn: 370088	2019-08-27 18:05:05 +00:00
Matt Arsenault	e31d0615ec	AMDGPU: Fix crash from inconsistent register types for v3i16/v3f16 This is something of a workaround since computeRegisterProperties seems to be doing the wrong thing. llvm-svn: 370086	2019-08-27 17:51:56 +00:00
Praveen Velliengiri	54e79945ee	[ORC] NFC remove unimplemented query Summary: CFGWalk Query is unimplemented for valid reasons. But the declaration got included in commit file. Reviewers: lhames, dblaikie Reviewed By: dblaikie Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66289 llvm-svn: 370085	2019-08-27 17:51:50 +00:00
Jessica Paquette	2f041a67b9	Recommit "[GlobalISel] Import patterns containing INSERT_SUBREG" I thought `llvm::sort` was stable for some reason but it's not. Use `llvm::stable_sort` in `CodeGenTarget::getSuperRegForSubReg`. Original patch: https://reviews.llvm.org/D66498 llvm-svn: 370084	2019-08-27 17:47:06 +00:00
Amy Huang	1082859b72	Change the X86 datalayout to add three address spaces for 32 bit signed, 32 bit unsigned, and 64 bit pointers. llvm-svn: 370083	2019-08-27 17:46:53 +00:00

... 2 3 4 5 6 ...

184156 Commits