llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Matt Arsenault	7a99ca5220	GlobalISel: Fix narrowScalar for G_ANYEXT results This is nearly the same as G_ZEXT.	2020-01-15 08:58:57 -05:00
Matt Arsenault	222988e058	TableGen: Delete some copy constuctors Some register related machinery relies on uniqued, static pointers for register classes and subregisters, so try to make sure these are never copied.	2020-01-15 08:58:57 -05:00
Matt Arsenault	6f8d4d409e	TableGen/GlobalISel: Don't take reference to temporary values These return temporary Optional<> values which are immediately destroyed. I'm not sure why no sanitizers seem to have caught this, but I encountered crashes on these in a future patch.	2020-01-15 08:58:57 -05:00
Matt Arsenault	79b7d20721	TableGen/GlobalISel: Don't reconstruct CodeGenRegBank The maps for dealing with the relationships between different register classes and subregister indexes rely on unique pointers for every class/index. By constructing a second copy of CodeGenRegBank, two different pointer values existed for a given subregister depending on where you were querying. Use the existing CodeGenRegBank owned by the CodeGenTarget instead of constructing a second copy. This avoids incorrectly failing map lookups in a future change.	2020-01-15 08:58:57 -05:00
Luís Marques	90543021a2	[RISCV] Fix test for inline asm z constraint modifier Summary: Use an `i` constraint in the test, to correctly trigger the code for handling the `z` constraint modifier. Reviewers: asb, lenary, jrtc27 Reviewed By: lenary, jrtc27 Tags: #llvm Differential Revision: https://reviews.llvm.org/D72134	2020-01-15 13:50:50 +00:00
Djordje Todorovic	82df06c202	[llvm-locstats] Add the --compare option Draw a plot showing the difference in debug loc coverage on two files provided. Differential Revision: https://reviews.llvm.org/D71870	2020-01-15 14:35:29 +01:00
Nemanja Ivanovic	be1be6dd59	[PowerPC] Legalize saturating vector add/sub These intrinsics and the corresponding ISD nodes were recently added. PPC has instructions that do this for vectors. Legalize them and add patterns to emit the satuarting instructions. Differential revision: https://reviews.llvm.org/D71940	2020-01-15 07:00:38 -06:00
Hans Wennborg	54ef354524	Bump the trunk major version to 11 and clear the release notes.	2020-01-15 13:38:01 +01:00
Simon Pilgrim	cbce8903bb	Revert rG6078f2fedcac5797ac39ee5ef3fd7a35ef1202d5 - "[AArch64][GlobalISel]: Support @llvm.{return,frame}address selection." These intrinsics expand to a variable number of instructions so just like in ISelLowering.cpp we use custom code to deal with them. Committing Tim's original patch. Differential Revision: https://reviews.llvm.org/D65656 ---- Breaks EXPENSIVE_CHECKS builds.	2020-01-15 12:37:37 +00:00
Zakk Chen	56c93f0a78	[RISCV] Support ABI checking with per function target-features if users don't specific -mattr, the default target-feature come from IR attribute. Reviewers: lenary, asb Reviewed By: lenary, asb Tags: #llvm Differential Revision: https://reviews.llvm.org/D70837	2020-01-15 04:35:01 -08:00
Zakk Chen	47d581269e	Revert "[RISCV] Support ABI checking with per function target-features" This reverts commit 109e4d12edda07bdec139de36d9fdb6f73399f92.	2020-01-15 04:32:57 -08:00
Simon Pilgrim	1f062c1ac9	RegisterClassInfo::computePSetLimit - assert that we actually find a register. Fixes "pointer is null" clang static analyzer warning.	2020-01-15 12:18:12 +00:00
Simon Pilgrim	e62249c995	Fix "pointer is null" static analyzer warning. NFCI. Use cast<> instead of dyn_cast<> since the pointer is always dereferenced and cast<> will perform the null assertion for us.	2020-01-15 12:18:11 +00:00
Georgii Rymar	46e5f3f87e	[yaml2obj/obj2yaml] - Add support for SHT_RELR sections. Note: this is a reland with a trivial 2 lines fix in ELFState<ELFT>::writeSectionContent. It adds a check similar to ones we already have for other sections to fix the case revealed by bots, like http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/60744. The encoded sequence of Elf*_Relr entries in a SHT_RELR section looks like [ AAAAAAAA BBBBBBB1 BBBBBBB1 ... AAAAAAAA BBBBBB1 ... ] i.e. start with an address, followed by any number of bitmaps. The address entry encodes 1 relocation. The subsequent bitmap entries encode up to 63(31) relocations each, at subsequent offsets following the last address entry. More information is here: https://github.com/llvm-mirror/llvm/blob/master/lib/Object/ELF.cpp#L272 This patch adds a support for these sections. Differential revision: https://reviews.llvm.org/D71872	2020-01-15 15:15:24 +03:00
Benjamin Kramer	2eb89734d5	[AArch64][SVE] Fold variable into assert to silence unused variable warnings in Release builds	2020-01-15 12:50:27 +01:00
Arkady Shlykov	a04db53f1a	[NFC] Adjust test cases numbering, test commit. Summary: Test case test14 is missing, adjust the numbering to have a consecutive range. Also a test commit to verify commit access.	2020-01-15 03:44:57 -08:00
Djordje Todorovic	e09ba3bb82	[llvm-locstats] Fix the docs Add the missing picture for the documentation.	2020-01-15 12:32:01 +01:00
Georgii Rymar	fd4218737d	Revert "[yaml2obj/obj2yaml] - Add support for SHT_RELR sections." This reverts commit 46d11e30ee807accefd14e0b7f306647963a39b5. It broke bots. E.g. http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/60744	2020-01-15 14:19:00 +03:00
Russell Gallop	6c5de528ea	[Support] Replace Windows __declspec(thread) with thread_local for LLVM_THREAD_LOCAL Windows minimum host tools version is now VS2017, which supports C++11 thread_local so use this for LLVM_THREAD_LOCAL instead of declspec(thread). According to [1], thread_local is implemented with declspec(thread) so this should be NFC. [1] https://docs.microsoft.com/en-us/cpp/cpp/thread?view=vs-2017 Differential Revision: https://reviews.llvm.org/D72399	2020-01-15 11:15:25 +00:00
Cullen Rhodes	15430bed65	[AArch64][SVE] Add ptest intrinsics Summary: Implements the following intrinsics: * @llvm.aarch64.sve.ptest.any * @llvm.aarch64.sve.ptest.first * @llvm.aarch64.sve.ptest.last Reviewers: sdesmalen, efriedma, dancgr, mgudim, cameron.mcinally, rengolin Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72398	2020-01-15 11:15:01 +00:00
Djordje Todorovic	260af8e320	[llvm-locstats] Add the --draw-plot option When using the option, draw the histogram representing the debug location buckets. The resulting histogram will be saved in a png file. Differential Revision: https://reviews.llvm.org/D71869	2020-01-15 12:00:43 +01:00
Georgii Rymar	aeadfe5798	[yaml2obj/obj2yaml] - Add support for SHT_RELR sections. The encoded sequence of Elf*_Relr entries in a SHT_RELR section looks like [ AAAAAAAA BBBBBBB1 BBBBBBB1 ... AAAAAAAA BBBBBB1 ... ] i.e. start with an address, followed by any number of bitmaps. The address entry encodes 1 relocation. The subsequent bitmap entries encode up to 63(31) relocations each, at subsequent offsets following the last address entry. More information is here: https://github.com/llvm-mirror/llvm/blob/master/lib/Object/ELF.cpp#L272 This patch adds a support for these sections. Differential revision: https://reviews.llvm.org/D71872	2020-01-15 13:54:08 +03:00
Djordje Todorovic	ada36eafbb	[llvm-locstats][NFC] Support OOP concept Making these changes, the code becomes more robust and easier for adding the new features. -Introduce the LocationStats class representing the statistics -Add the pretty_print() method in the LocationStats class -Add additional '-' for the program options -Add the verify_program_inputs() function -Add the parse_locstats() function -Rename 'results' => 'opts' -Add more comments Differential Revision: https://reviews.llvm.org/D71868	2020-01-15 11:41:09 +01:00
Zakk Chen	cffbac9542	[RISCV] Support ABI checking with per function target-features if users don't specific -mattr, the default target-feature come from IR attribute.	2020-01-15 02:30:43 -08:00
Igor Kudrin	f3b2fc24d8	[DWARF] Fix DWARFDebugAranges to support 64-bit CU offsets. DWARFContext, the only user of this class, can already handle such offsets. Differential Revision: https://reviews.llvm.org/D71834	2020-01-15 17:19:08 +07:00
LLVM GN Syncbot	19664201dc	[gn build] Port 0dc6c249bff	2020-01-15 09:58:27 +00:00
cdevadas	73078830c2	[AMDGPU] Invert the handling of skip insertion. The current implementation of skip insertion (SIInsertSkip) makes it a mandatory pass required for correctness. Initially, the idea was to have an optional pass. This patch inserts the s_cbranch_execz upfront during SILowerControlFlow to skip over the sections of code when no lanes are active. Later, SIRemoveShortExecBranches removes the skips for short branches, unless there is a sideeffect and the skip branch is really necessary. This new pass will replace the handling of skip insertion in the existing SIInsertSkip Pass. Differential revision: https://reviews.llvm.org/D68092	2020-01-15 15:18:16 +05:30
Kazushi (Jam) Marukawa	2a68ddf041	[VE] Minimal codegen for empty functions Summary: This patch implements minimal VE code generation for empty function bodies (no args, no value return). Contents * empty function code generation test. * Minimal function prologue & epilogue emission * Instruction formats and instruction definitions as far as required for the empty function prologue & epilogue. * I64 register class definitions. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D72598	2020-01-15 09:55:16 +01:00
Craig Topper	6e6d49cbe7	[X86] Don't call LowerUINT_TO_FP_i32 for i32->f80 on 32-bit targets with sse2. We were performing an emulated i32->f64 in the SSE registers, then storing that value to memory and doing a extload into the X87 domain. After this patch we'll now just store the i32 to memory along with an i32 0. Then do a 64-bit FILD to f80 completely in the X87 unit. This matches what we do without SSE.	2020-01-15 00:43:07 -08:00
David Green	74af6f3fff	[ARM] Reegenerate MVE tests. NFC The mve-phireg.ll test no longer really tests what it was added for, but the original case was fairly complex. I've left the test in as a general codegen test.	2020-01-15 08:10:38 +00:00
Hideto Ueno	c32c9e8840	[Attributor] AAValueConstantRange: Value range analysis using constant range Summary: This patch introduces `AAValueConstantRange`, which answers a possible range for integer value in a specific program point. One of the motivations is propagating existing `range` metadata. (I think we need to change the situation that `range` metadata cannot be put to Argument). The state is a tuple of `ConstantRange` and it is initialized to (known, assumed) = ([-∞, +∞], empty). Currently, AAValueConstantRange is created in `getAssumedConstant` method when `AAValueSimplify` returns `nullptr`(worst state). Supported - BinaryOperator(add, sub, ...) - CmpInst(icmp eq, ...) - !range metadata `AAValueConstantRange` is not intended to extend to polyhedral range value analysis. Reviewers: jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: phosek, davezarzycki, baziotis, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71620	2020-01-15 16:34:23 +09:00
David Green	677a0376c4	[Scheduler] Adjust interface of CreateTargetMIHazardRecognizer to use ScheduleDAGMI. NFC All the callers of this function will be ScheduleDAGMI from the MachineScheduler. This allows us to use the extra info available in ScheduleDAGMI without resorting to awkward casts.	2020-01-15 07:21:44 +00:00
Justin Hibbits	1d2d48be7e	[PowerPC] Fix powerpcspe subtarget enablement in llvm backend Summary: As currently written, -target powerpcspe will enable SPE regardless of disabling the feature later on in the command line. Instead, change this to just set a default CPU to 'e500' instead of a generic CPU. As part of this, add FeatureSPE to the e500 definition. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D72673	2020-01-14 22:07:03 -06:00
Tom Stellard	fb33bc86b6	CMake: Make most target symbols hidden by default Summary: For builds with LLVM_BUILD_LLVM_DYLIB=ON and BUILD_SHARED_LIBS=OFF this change makes all symbols in the target specific libraries hidden by default. A new macro called LLVM_EXTERNAL_VISIBILITY has been added to mark symbols in these libraries public, which is mainly needed for the definitions of the LLVMInitialize* functions. This patch reduces the number of public symbols in libLLVM.so by about 25%. This should improve load times for the dynamic library and also make abi checker tools, like abidiff require less memory when analyzing libLLVM.so One side-effect of this change is that for builds with LLVM_BUILD_LLVM_DYLIB=ON and LLVM_LINK_LLVM_DYLIB=ON some unittests that access symbols that are no longer public will need to be statically linked. Before and after public symbol counts (using gcc 8.2.1, ld.bfd 2.31.1): nm before/libLLVM-9svn.so \| grep ' [A-Zuvw] ' \| wc -l 36221 nm after/libLLVM-9svn.so \| grep ' [A-Zuvw] ' \| wc -l 26278 Reviewers: chandlerc, beanz, mgorny, rnk, hans Reviewed By: rnk, hans Subscribers: merge_guards_bot, luismarques, smeenai, ldionne, lenary, s.egerton, pzheng, sameer.abuasal, MaskRay, wuzish, echristo, Jim, hiraditya, michaelplatings, chapuni, jholewinski, arsenm, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, javed.absar, sbc100, jgravelle-google, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, jrtc27, zzheng, edward-jones, mgrang, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, kristina, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D54439	2020-01-14 19:46:52 -08:00
Hubert Tong	b77ca50922	DWARFDebugLine.cpp: Restore LF line endings rG7e02406f6cf180a8c89ce64665660e7cc9dbc23e switched the file to CRLF line endings.	2020-01-14 21:23:39 -05:00
Philip Reames	f156908a7a	[BranchAlign] Add master --x86-branches-within-32B-boundaries flag This flag was originally part of D70157, but was removed as we carved away pieces of the review. Since we have the nop support checked in, and it appears mature(), I think it's time to add the master flag. For now, it will default to nop padding, but once the prefix padding support lands, we'll update the defaults. () I can now confirm that downstream testing of the changes which have landed to date - nop padding and compiler support for suppressions - is passing all of the functional testing we've thrown at it. There might still be something lurking, but we've gotten enough coverage to be confident of the basic approach. Note that the new flag can be used either when assembling an .s file, or when using the integrated assembler directly from the compiler. The later will use all of the suppression mechanism and should always generate correct code. We don't yet have assembly syntax for the suppressions, so passing this directly to the assembler w/a raw .s file may result in broken code. Use at your own risk. Also note that this isn't the wiring for the clang option. I think the most recent review for that is D72227, but I've lost track, so that might be off. Differential Revision: https://reviews.llvm.org/D72738	2020-01-14 18:17:53 -08:00
Reid Kleckner	5b796ed527	[Win64] Handle FP arguments more gracefully under -mno-sse Pass small FP values in GPRs or stack memory according the the normal convention. This is what gcc -mno-sse does on Win64. I adjusted the conditions under which we emit an error to check if the argument or return value would be passed in an XMM register when SSE is disabled. This has a side effect of no longer emitting an error for FP arguments marked 'inreg' when targetting x86 with SSE disabled. Our calling convention logic was already assigning it to FP0/FP1, and then we emitted this error. That seems unnecessary, we can ignore 'inreg' and compile it without SSE. Reviewers: jyknight, aemerson Differential Revision: https://reviews.llvm.org/D70465	2020-01-14 17:19:35 -08:00
Michael Liao	6eff62fa7c	[amdgpu] Fix typos in a test case. - There are typos introduced due to merge.	2020-01-14 20:08:39 -05:00
Craig Topper	0c7f16034d	[X86] Drop an unneeded FIXME. NFC The extload on X87 is free.	2020-01-14 17:05:46 -08:00
Craig Topper	835f91f74c	[X86] Swap the 0 and the fudge factor in the constant pool for the 32-bit mode i64->f32/f64/f80 uint_to_fp algorithm. This allows us to generate better code for selecting the fixup to load. Previously when the sign was set we had to load offset 0. And when it was clear we had to load offset 4. This required a testl, setns, zero extend, and finally a mul by 4. By switching the offsets we can just shift the sign bit into the lsb and multiply it by 4.	2020-01-14 17:05:23 -08:00
Michael Liao	78a5d52036	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU. Summary: - `dead-mi-elimination` assumes MIR in the SSA form and cannot be arranged after phi elimination or DeSSA. It's enhanced to handle the dead register definition by skipping use check on it. Once a register def is `dead`, all its uses, if any, should be `undef`. - Re-arrange the DIE in RA phase for AMDGPU by placing it directly after `detect-dead-lanes`. - Many relevant tests are refined due to different register assignment. Reviewers: rampitec, qcolombet, sunfish Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72709	2020-01-14 19:26:15 -05:00
Michael Liao	b78c93696a	[DAGCombine] Replace `getIntPtrConstant()` with `getVectorIdxTy()`. - Prefer `getVectorIdxTy()` as the index operand type for `EXTRACT_SUBVECTOR` as targets expect different types by overloading `getVectorIdxTy()`.	2020-01-14 17:03:05 -05:00
Amara Emerson	efaddf0ae6	[AArch64][GlobalISel]: Support @llvm.{return,frame}address selection. These intrinsics expand to a variable number of instructions so just like in ISelLowering.cpp we use custom code to deal with them. Committing Tim's original patch. Differential Revision: https://reviews.llvm.org/D65656	2020-01-14 13:41:21 -08:00
Craig Topper	eddb373ccc	[LegalizeTypes] Remove untested code from ExpandIntOp_UINT_TO_FP This code is untested in tree because the "APFloat::semanticsPrecision(sem) >= SrcVT.getSizeInBits() - 1" check is false for most combinations for int and fp types except maybe i32 and f64. For that you would need i32 to be an illegal type, but f64 to be legal and have custom handling for legalizing the split sint_to_fp. The precision check itself was added in 2010 to fix a double rounding issue in the algorithm that would occur if the sint_to_fp was not able to do the conversion without rounding. Differential Revision: https://reviews.llvm.org/D72728	2020-01-14 13:15:29 -08:00
Fedor Sergeev	1ce76c63a2	[GVN] fix comment/argument name to match actual implementation. NFC	2020-01-15 03:58:04 +07:00
Nikita Popov	2120a6f80c	[InstCombine] Fix worklist management when removing guard intrinsic When multiple guard intrinsics are merged into one, currently the result of eraseInstFromFunction() is returned -- however, this should only be done if the current instruction is being removed. In this case we're removing a different instruction and should instead report that the current one has been modified by returning it. For this test case, this reduces the number of instcombine iterations from 5 to 2 (the minimum possible). Differential Revision: https://reviews.llvm.org/D72558	2020-01-14 21:47:48 +01:00
Danilo Carvalho Grael	22fb556e5b	[SVE] Add patterns for MUL immediate instruction. Summary: Add the missing MUL pattern for integer immediate instructions. Reviewers: sdesmalen, huntergr, efriedma, c-rhodes, kmclaughlin Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits, amehsan Tags: #llvm Differential Revision: https://reviews.llvm.org/D72654	2020-01-14 15:26:19 -05:00
Nikita Popov	361e5d3bdf	[NewPM] Port MergeFunctions pass This ports the MergeFunctions pass to the NewPM. This was rather straightforward, as no analyses are used. Additionally MergeFunctions needs to be conditionally enabled in the PassBuilder, but I left that part out of this patch. Differential Revision: https://reviews.llvm.org/D72537	2020-01-14 20:55:41 +01:00
Nikita Popov	c8831fb193	[InstCombine] Fix infinite loop due to bitcast <-> phi transforms Fix for https://bugs.llvm.org/show_bug.cgi?id=44245. The optimizeBitCastFromPhi() and FoldPHIArgOpIntoPHI() end up fighting against each other, because optimizeBitCastFromPhi() assumes that bitcasts of loads will get folded. This doesn't happen here, because a dangling phi node prevents the one-use fold in https://github.com/llvm/llvm-project/blob/master/llvm/lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp#L620-L628 from triggering. This patch fixes the issue by explicitly performing the load combine as part of the bitcast of phi transform. Other attempts to force the load to be combined first were ultimately too unreliable. Differential Revision: https://reviews.llvm.org/D71164	2020-01-14 20:45:13 +01:00
Nikita Popov	1b8770ebf1	[InstCombine] Make combineLoadToNewType a method; NFC So it can be reused as part of other combines. In particular for D71164.	2020-01-14 20:40:03 +01:00

1 2 3 4 5 ...

190113 Commits