llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Christopher Tetreault	76b23db219	[SVE] Remove default-false VectorType::get Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D84212	2020-08-12 10:37:05 -07:00
David Green	51c55be3c2	[Scheduler] Fix typo in comments. NFC	2020-08-12 18:36:05 +01:00
Xing GUO	748a235862	[DWARFYAML] Make the address size of compilation units optional. This patch makes the 'AddrSize' field optional. If the address size is missing, yaml2obj will infer it from the object file. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85805	2020-08-12 21:47:32 +08:00
Igor Kudrin	cde37ff605	[llvm-dwarfdump] Avoid crashing if an abbreviation offset is invalid. Note that DWARFUnit::getAbbreviations() returns nullptr if the abbreviations could not be read, but callers used the returned pointer without checking. Differential Revision: https://reviews.llvm.org/D85738	2020-08-12 16:01:53 +07:00
David Sherwood	96c98d5a46	[SVE][CodeGen] Fix issues with EXTRACT_SUBVECTOR when using scalable FP vectors In this patch I have fixed two issues: 1. Our SVE tuple get/set intrinsics were using the wrong constant type for the index passed to EXTRACT_SUBVECTOR. I have fixed this by using the function SelectionDAG::getVectorIdxConstant to create the value. Also, I have updated the documentation for EXTRACT_SUBVECTOR describing what type the constant index should be and we now enforce this when creating the node. 2. The AArch64 backend was missing the appropriate patterns for extracting certain subvectors (nxv4f16 and nxv2f32) from legal SVE types. I have added them as part of this patch. The only way that I could find to test the new patterns was to use the SVE tuple get intrinsics, although I realise it looks a bit unusual. Tests added here: test/CodeGen/AArch64/sve-extract-subvector.ll Differential Revision: https://reviews.llvm.org/D85516	2020-08-12 08:35:46 +01:00
Kiran Chandramohan	f33b4004b5	[MLIR,OpenMP] Lowering of parallel operation: proc_bind clause 2/n This patch adds the translation of the proc_bind clause in a parallel operation. The values that can be specified for the proc_bind clause are specified in the OMP.td tablegen file in the llvm/Frontend/OpenMP directory. From this single source of truth enumeration for proc_bind is generated in llvm and mlir (used in specification of the parallel Operation in the OpenMP dialect). A function to return the enum value from the string representation is also generated. A new header file (DirectiveEmitter.h) containing definitions of classes directive, clause, clauseval etc is created so that it can be used in mlir as well. Reviewers: clementval, jdoerfert, DavidTruby Differential Revision: https://reviews.llvm.org/D84347	2020-08-12 08:03:13 +01:00
Petr Hosek	6abbc429da	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-11 20:22:11 -07:00
Vedant Kumar	60a6ae1b59	Revert "[Instruction] Add updateLocationAfterHoist helper" This reverts commit 4a646ca9e2caf70d6312714770f516fb83b7e3cb. This is causing some bots to fail with "!dbg attachment points at wrong subprogram for function", like: http://lab.llvm.org:8011/builders/sanitizer-windows/builds/67958/steps/stage%201%20check/logs/stdio	2020-08-11 14:54:09 -07:00
Vedant Kumar	62d3804379	[Instruction] Add updateLocationAfterHoist helper Introduce a helper on Instruction which can be used to update the debug location after hoisting. Use this in GVN and LICM, where we were mistakenly introducing new line 0 locations after hoisting (the docs recommend dropping the location in this case). For more context, see the discussion in https://reviews.llvm.org/D60913. Differential Revision: https://reviews.llvm.org/D85670	2020-08-11 14:05:20 -07:00
Nikita Popov	587bdc1d95	[InstSimplify] Respect CanUseUndef in more places Similar to what we do in IIQ, add an isUndefValue() helper that checks for undef values while respective CanUseUndef. This makes it much easier to search for places that don't respect the flag yet.	2020-08-11 21:53:33 +02:00
diggerlin	b434098ac8	[AIX][XCOFF] change the operand of branch instruction from symbol name to qualified symbol name for function declarations SUMMARY: 1. in the patch , remove setting storageclass in function .getXCOFFSection and construct function of class MCSectionXCOFF there are XCOFF::StorageMappingClass MappingClass; XCOFF::SymbolType Type; XCOFF::StorageClass StorageClass; in the MCSectionXCOFF class, these attribute only used in the XCOFFObjectWriter, (asm path do not need the StorageClass) we need get the value of StorageClass, Type,MappingClass before we invoke the getXCOFFSection every time. actually , we can get the StorageClass of the MCSectionXCOFF from it's delegated symbol. 2. we also change the oprand of branch instruction from symbol name to qualify symbol name. for example change bl .foo extern .foo to bl .foo[PR] extern .foo[PR] 3. and if there is reference indirect call a function bar. we also add extern .bar[PR] Reviewers: Jason liu, Xiangling Liao Differential Revision: https://reviews.llvm.org/D84765	2020-08-11 15:26:19 -04:00
Jessica Paquette	8e321c8c5a	[GlobalISel] Combine (logic_op (op x...), (op y...)) -> (op (logic_op x, y)) This implements ``` (logic_op (op x...), (op y...)) -> (op (logic_op x, y)) ``` when `op` is an extend, a shift, or an and. This is similar to `DAGCombiner::hoistLogicOpWithSameOpcodeHands` (with a bunch of missing cases, e.g. G_TRUNC, G_BITCAST, etc.) This is implemented so it works both pre and post-legalization. This also adds a general way to add a series of instructions in a combine. (`applyBuildInstructionSteps`). Differential Revision: https://reviews.llvm.org/D85050	2020-08-11 10:40:06 -07:00
Matt Arsenault	1c0be51aa1	GlobalISel: Fix typo	2020-08-11 13:08:56 -04:00
Lang Hames	f0d37fd69e	[ORC] Move file-descriptor based raw byte channel into a public header. This will enable re-use in other llvm tools.	2020-08-11 09:50:58 -07:00
Nikita Popov	577d874016	[InstSimplify] Forbid undef folds in expandBinOp This is the replacement for D84250 based on D84792. As we recursively fold with the same value twice, we need to disable undef folds, to prevent an undef from being folded to two different values. Reverting rG00f3579aea6e3d4a4b7464c3db47294f71cef9e4 and using the test case from https://reviews.llvm.org/D83360#2145793, it no longer performs the incorrect fold. Differential Revision: https://reviews.llvm.org/D85684	2020-08-11 18:39:24 +02:00
Jay Foad	928c1dd7ef	[GlobalISel] Add G_ABS This is equivalent to the new llvm.abs intrinsic added by D84125 with is_int_min_poison=0. Differential Revision: https://reviews.llvm.org/D85718	2020-08-11 16:34:37 +01:00
Sanjay Patel	5b7d18ac79	[InstSimplify] fold min/max with matching min/max operands I think this is the last remaining translation of an existing instcombine transform for the corresponding cmp+sel idiom. This interpretation is more general though - we can remove mismatched signed/unsigned combinations in addition to the more obvious cases. min/max(X, Y) must produce X or Y as the result, so this is just another clause in the existing transform that was already matching a min/max of min/max.	2020-08-11 11:23:15 -04:00
Valentin Clement	402fcd0af2	[flang][directives] Use TableGen information for clause classes in parse-tree This patch takes advantage of the directive information and tablegen generation to replace the clauses class parse tree and in the dump parse tree sections. Reviewed By: sscalpone Differential Revision: https://reviews.llvm.org/D85549	2020-08-11 10:44:14 -04:00
Matt Arsenault	674304dfb1	GlobalISel: Implement bitcast action for G_INSERT_VECTOR_ELT This mirrors the support for the equivalent extracts. This also creates a huge mess that would be greatly improved if we had any bit operation combines.	2020-08-11 10:39:14 -04:00
Matt Arsenault	c9ab6823ab	TableGen/GlobalISel: Hack the operand order for atomic_store ISD::ATOMIC_STORE arbitrarily has the operands in the opposite order from regular ISD::STORE, which always introduced an annoying duplication of patterns to handle both cases. Since in GlobalISel there's just the one G_STORE, we need to swap the operands to correctly emit the type check for the pointer operand. Some work started in 20aafa31569b5157e792daa8860d71dd0df8a53a to migrate SelectionDAG to use ISD::STORE for atomics, but that work seems to have stalled. Since this is the pretty much the last operation which matters which isn't supported for AMDGPU, use this compatibility hack to unblock declaring it functionally complete. Not sure what's going on with the pending_phis AArch64 test. It seems it didn't always use atomics, and I'm not sure what it was originally testing matters anymore.	2020-08-11 10:22:44 -04:00
clementval	2623773dd2	Revert "[flang][directives] Use TableGen information for clause classes in parse-tree" This reverts commit bf93edc4758b512e84b70ec6335d15b0cf386a18. Buildbot failure	2020-08-11 09:54:04 -04:00
Valentin Clement	72a67d9950	[flang][directives] Use TableGen information for clause classes in parse-tree This patch takes advantage of the directive information and tablegen generation to replace the clauses class parse tree and in the dump parse tree sections. Reviewed By: sscalpone Differential Revision: https://reviews.llvm.org/D85549	2020-08-11 09:43:11 -04:00
David Stenberg	331775f51f	[DebugInfo] Allow GNU macro extension to be read Allow the GNU .debug_macro extension to be parsed and printed by llvm-dwarfdump. In an upcoming patch support will be added for emitting that format also. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D82974	2020-08-11 13:30:52 +02:00
David Stenberg	9fb94de7cb	[DebugInfo] Introduce GNU macro extension entry encodings This is a preparatory patch for allowing the GNU .debug_macro extension, which is a precursor to the DWARF 5 format, to be emitted by LLVM for earlier DWARF versions. The entries share the same encoding and behavior as in DWARF5; there are just more entries in the DWARF 5 format. Therefore, we could have used those existing DWARF 5 entries, but I think that explicitly referring to the GNU macro variants makes the code more clear. The defines that this patch introduces can be found in GCC in the dwarf2.h header: https://gcc.gnu.org/git/?p=gcc.git;a=blob; f=include/dwarf2.h; h=0b6facfd4cf4c02320c7328114231b128ab42d5e; hb=dccbf1e2a6e544f71b4a5795f0c79015db019fc3#l425 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D82972	2020-08-11 13:30:52 +02:00
Kerry McLaughlin	76e22108d4	[CodeGen] Refactor getMemBasePlusOffset & getObjectPtrOffset to accept a TypeSize Changes the Offset arguments to both functions from int64_t to TypeSize & updates all uses of the functions to create the offset using TypeSize::Fixed() Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D85220	2020-08-11 12:17:10 +01:00
Kai Nacke	b0aab57571	[SystemZ/ZOS] Add binary format goff and operating system zos to the triple Adds the binary format goff and the operating system zos to the triple class. goff is selected as default binary format if zos is choosen as operating system. No further functionality is added. Reviewers: efriedma, tahonermann, hubert.reinterpertcast, MaskRay Reviewed By: efriedma, tahonermann, hubert.reinterpertcast Differential Revision: https://reviews.llvm.org/D82081	2020-08-11 05:26:26 -04:00
Florian Hahn	cd4df5279d	[SCEVExpander] Add helper to clean up instrs inserted while expanding. SCEVExpander already tracks which instructions have been inserted n InsertedValues/InsertedPostIncValues. This patch adds an additional vector to collect the instructions in insertion order. This can then be used to remove exactly the instructions inserted by the expander. This replaces ExpandedValuesCleaner, which in some cases might remove values not inserted by the expander (e.g. if a value was dead before insertion and is then used during expansion). Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D84327	2020-08-11 09:30:31 +01:00
Shinji Okumura	aa6d7628ea	[Attributor][NFC] Connect AAPotentialValues with AAValueSimplify This patch enables `AAValueSimplify` to use information from `AAPotentialValues` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85668	2020-08-11 15:52:02 +09:00
Haowei Wu	f44281335e	Revert "Move ELFObjHandler to TextAPI library" This reverts commit e6f8ba12e6ea97cbf4374b70e78309c2d859ca1c due to build failures.	2020-08-10 21:31:29 -07:00
Haowei Wu	1fa1012fcc	Move ELFObjHandler to TextAPI library This change moves ELFObjHandler to llvm/TextAPI library so it can be used by different llvm tools.	2020-08-10 21:23:39 -07:00
Wang, Pengfei	72838e8fb2	[X86][FPEnv] Teach X86 mask compare intrinsics to respect strict FP semantics. When we use mask compare intrinsics under strict FP option, the masked elements shouldn't raise any exception. So, we cann't replace the intrinsic with a full compare + "and" operation. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D85385	2020-08-11 10:28:41 +08:00
Yuanfang Chen	08ee0eb96e	[CodeGen] Make MMI immutable NPM pass	2020-08-10 17:52:42 -07:00
Lang Hames	ce159bf33c	[llvm-jitlink] Update llvm-jitlink to use TargetProcessControl.	2020-08-10 17:19:48 -07:00
Johannes Doerfert	d5ca7458b9	[OpenMP][NFC] Reuse OMPIRBuilder `struct ident_t` handling in Clang Replace the `ident_t` handling in Clang with the methods offered by the OMPIRBuilder. This cuts down on the clang code as well as the differences between the two, making further transitions easier. Tests have changed but there should not be a real functional change. The most interesting difference is probably that we stop generating local ident_t allocations for now and just use globals. Given that this happens only with debug info, the location part of the `ident_t` is probably bigger than the test anyway. As the location part is already a global, we can avoid the allocation, memcpy, and store in favor of a constant global that is slightly bigger. This can be revisited if there are complications. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D80735	2020-08-10 17:13:26 -05:00
jasonliu	a056348a49	[XCOFF][AIX] Use TE storage mapping class when large code model is enabled Summary: Use TE SMC instead of TC SMC in large code model mode, so that large code model TOC entries could get placed after all the small code model TOC entries, which reduces the chance of TOC overflow. Reviewed By: Xiangling_L Differential Revision: https://reviews.llvm.org/D85455	2020-08-10 19:52:10 +00:00
Craig Topper	1e510916fc	[BreakFalseDeps][X86] Move operand loop out of X86's getUndefRegClearance and put in the pass. X86 is the only user of this interface in tree. Previously the X86 pass would loop over operands looking for one undef operand for the pass to fix. But there could theoretically be multiple operands to fix. So it makes more sense for the pass to do the looping and ask the target if an operand needs to be fixed.	2020-08-10 10:32:29 -07:00
Xiangling Liao	0a11ef2eea	[AIX] Static init frontend recovery and backend support On the frontend side, this patch recovers AIX static init implementation to use the linkage type and function names Clang chooses for sinit related function. On the backend side, this patch sets correct linkage and function names on aliases created for sinit/sterm functions. Differential Revision: https://reviews.llvm.org/D84534	2020-08-10 10:10:49 -04:00
James Henderson	73d127e3f5	[DebugInfo] Don't error for zero-length arange entries Although the DWARF specification states that .debug_aranges entries can't have length zero, these can occur in the wild. There's no particular reason to enforce this part of the spec, since functionally they have no impact. The patch removes the error and introduces a new warning for premature terminator entries which does not stop parsing. This is a relanding of cb3a598c87db, adding the missing obj2yaml part that was needed. Fixes https://bugs.llvm.org/show_bug.cgi?id=46805. See also https://reviews.llvm.org/D71932 which originally introduced the error. Reviewed by: ikudrin, dblaikie, Higuoxing Differential Revision: https://reviews.llvm.org/D85313	2020-08-10 14:57:52 +01:00
Matt Arsenault	b0828481d3	PeepholeOptimizer: Use Register	2020-08-10 08:49:36 -04:00
Sanjay Patel	65e73449e2	[InstCombine] auto-generate test checks; NFC	2020-08-10 08:27:38 -04:00
Nico Weber	962b0abc0f	Revert "[DebugInfo] Don't error for zero-length arange entries" This reverts commit cb3a598c87db2db997401b82dfb3f7f80707194e. Breaks build of check-llvm dep obj2yaml everywhere.	2020-08-10 08:20:35 -04:00
James Henderson	66278e7306	[DebugInfo] Don't error for zero-length arange entries Although the DWARF specification states that .debug_aranges entries can't have length zero, these can occur in the wild. There's no particular reason to enforce this part of the spec, since functionally they have no impact. The patch removes the error and introduces a new warning for premature terminator entries which does not stop parsing. Fixes https://bugs.llvm.org/show_bug.cgi?id=46805. See also https://reviews.llvm.org/D71932 which originally introduced the error. Reviewed by: ikudrin, dblaikie Differential Revision: https://reviews.llvm.org/D85313	2020-08-10 12:48:31 +01:00
Qiu Chaofan	4605d02c43	[PowerPC] Add intrinsic to read or set FPSCR register This patch introduces two intrinsics: llvm.ppc.setflm and llvm.ppc.readflm. They read from or write to FPSCR register (floating-point status & control) which contains rounding mode and exception status. To ensure correctness of program, we need to prevent FP operations from being moved across these intrinsics (mffs/mtfsf instruction), so here I set them as scheduling boundaries. We can relax such restriction if FPSCR is modeled well in the future. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D84914	2020-08-10 18:27:45 +08:00
Petar Avramovic	5d6a53d942	AMDGPU/GlobalISel: Lower G_FREM Add custom lower for G_FREM. Differential Revision: https://reviews.llvm.org/D84324	2020-08-10 10:10:46 +02:00
Vitaly Buka	5845ad5ad5	[NFC][Attributor] Add missing override	2020-08-09 23:30:42 -07:00
Shinji Okumura	6da8fa8433	[Attributor][NFC][AAPotentialValues] Change interface of PotentialValuesState Previously `PotentialValuesState` inherited `BooleanState`. We have to add `getAssumed` to the state in order to use `clampStateAndIndicateChange` (which will be used in `AAPotentialValuesArgument`). However `BooleanState::getAssumed` is not a virtual function and we cannot override it. Therefore, I changed the state not to inherit `BooleanState` and add `getAssumed` to it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85610	2020-08-10 09:18:10 +09:00
Florian Hahn	9f66518378	[InstSimplify] Make sure CanUseUndef is initialized in all cases. This should fix a bunch of buildbot failures.	2020-08-09 19:47:16 +01:00
Florian Hahn	564f5c4ac7	[InstSimplify/NewGVN] Add option to control the use of undef. Making use of undef is not safe if the simplification result is not used to replace all uses of the result. This leads to problems in NewGVN, which does not replace all uses in the IR directly. See PR33165 for more details. This patch adds an option to SimplifyQuery to disable the use of undef. Note that I've only guarded uses if isa<UndefValue>/m_Undef where SimplifyQuery is currently available. If we agree on the general direction, I'll update the remaining uses. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84792	2020-08-09 19:16:56 +01:00
Florian Hahn	4fd7f95ec0	[SCEVExpander] Avoid re-using existing casts if it means updating users. Currently the SCEVExpander tries to re-use existing casts, even if they are not exactly at the insertion point it was asked to create the cast. To do so in some case, it creates a new cast at the insertion point and updates all users to use the new cast. This behavior is problematic, because it changes the IR outside of the instructions created during the expansion. Therefore we cannot completely undo all changes made during expansion. This re-use should be only an extra optimization, so only using the new cast in the expanded instructions should not be a correctness issue. There are many cases equivalent instructions are created during expansion. This patch also adjusts findInsertPointAfter to skip instructions inserted during expansion. This enables re-using existing casts without the renaming any uses, by picking a better insertion point. Reviewed By: efriedma, lebedev.ri Differential Revision: https://reviews.llvm.org/D84399	2020-08-09 13:25:17 +01:00
Petr Hosek	6c27d09879	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit ccbc1485b55ff4acd21bcfafbf7aec4ed0fd818d which is still failing on the Windows MLIR bots.	2020-08-08 17:08:23 -07:00
Petr Hosek	af8170b5ad	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-08 16:44:08 -07:00
Yuanfang Chen	526e29b5b9	Reland "Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager"" This relands commit 320eab2d558fde0b61437e9b9075bfd301c2c474. The test failed because it was looking for x86-linux target unconditionally. Now it gets the default target.	2020-08-07 16:40:49 -07:00
Arthur Eubanks	834a6fc438	[NewPM] Print 'Skipping pass' as pass instrumentation If OptNoneInstrumentation prints it instead, 'Skipping pass' will print for even required passes. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D85493	2020-08-07 15:02:02 -07:00
Sameer Arora	df248b9fac	[llvm-libtool-darwin] Add constant CPU_SUBTYPE_ARM64_V8 Add support for constant MachO::CPU_SUBTYPE_ARM64_V8. This constant is needed so as to match `llvm-libtool-darwin`'s behavior to that of cctools' libtool when `-arch_only` flag is passed in on command line. Reviewed by jhenderson, alexshap, smeenai Differential Revision: https://reviews.llvm.org/D85041	2020-08-07 14:09:27 -07:00
Vitaly Buka	6669d78639	Revert "[StackSafety] Skip ambiguous lifetime analysis" This reverts commit 0b2616a8045cb776ea1514c3401d0a8577de1060. Crashes with safe-stack.	2020-08-07 14:02:50 -07:00
Matt Arsenault	fc0dd4b853	GlobalISel: Handle zext(sext x) in artifact combiner This eliminates the illegal intermediate s8 value in the added test.	2020-08-07 16:37:46 -04:00
Yuanfang Chen	cee8d8ef70	Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager" This reverts commit 911565d1085d9447363fe8ad041817436c4998fe. Broke some non-Linux bots.	2020-08-07 11:59:58 -07:00
Yuanfang Chen	4240996330	[NewPM][CodeGen] Introduce machine pass and machine pass manager machine pass could define four methods: - `PreservedAnalyses run(MachineFunction &, MachineFunctionAnalysisManager &)` - `Error doInitialization(Module &, MachineFunctionAnalysisManager &)` - `Error doFinalization(Module &, MachineFunctionAnalysisManager &)` - `Error run(Module &, MachineFunctionAnalysisManager &)` machine pass manger: - MachineFunctionAnalysisManager: Basically an AnalysisManager<MachineFunction> augmented with the ability to register and query IR analyses - MachineFunctionPassManager: support only two methods, `addPass` and `run` Reviewed By: arsenm, asbirlea, aeubanks Differential Revision: https://reviews.llvm.org/D67687	2020-08-07 11:00:31 -07:00
Yuanfang Chen	377ad5f083	[NewPM] Only verify loop for nonskipped user loop pass No verification for pass mangers since it is not needed. No verification for skipped loop pass since the asserted condition is not used. Add a BeforeNonSkippedPass callback for this. The callback needs more inputs than its parameters to work so the callback is added on-the-fly. Reviewed By: aeubanks, asbirlea Differential Revision: https://reviews.llvm.org/D84977	2020-08-07 11:00:31 -07:00
Bevin Hansson	7c243aea4b	[Intrinsic] Add sshl.sat/ushl.sat, saturated shift intrinsics. Summary: This patch adds two intrinsics, llvm.sshl.sat and llvm.ushl.sat, which perform signed and unsigned saturating left shift, respectively. These are useful for implementing the Embedded-C fixed point support in Clang, originally discussed in http://lists.llvm.org/pipermail/llvm-dev/2018-August/125433.html and http://lists.llvm.org/pipermail/cfe-dev/2018-May/058019.html Reviewers: leonardchan, craig.topper, bjope, jdoerfert Subscribers: hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83216	2020-08-07 15:09:24 +02:00
Igor Kudrin	eabe546cbc	[DebugInfo] Clean up DIEUnit. NFC. This removes members of the DIEUnit class which were used only in unit tests. Note also that child classes shadowed some of these methods, namely, getDwarfVersion() was overridden in DwartfUnit and getLength() was overridden in DwarfCompileUnit. Differential Revision: https://reviews.llvm.org/D85436	2020-08-07 15:55:44 +07:00
Shinji Okumura	02e369eb69	[Attributor] AAPotentialValues Interface This is a split patch of D80991. This patch introduces AAPotentialValues and its interface only. For more detail of AAPotentialValues abstract attribute, see the original patch. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D83283	2020-08-07 17:35:12 +09:00
Christian Kühnel	e33dafd15e	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit 1adc494bce44f6004994deed61b30d4b71fe1d05. This patch broke the Windows compilation on buildbot and pre-merge testing: http://lab.llvm.org:8011/builders/mlir-windows/builds/5945 https://buildkite.com/llvm-project/llvm-master-build/builds/780	2020-08-07 09:36:49 +02:00
biplmish	f87266ec05	[PowerPC] Implement Vector Extract Low/High Order Builtins in LLVM/Clang This patch implements the function prototypes vec_extractl and vec_extracth in altivec.h to utilize the vector extract double element instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D84622	2020-08-07 01:02:29 -05:00
QingShan Zhang	b0aa16911a	[PowerPC] Support constrained fp operation for setcc The constrained fp operation fcmp was added by https://reviews.llvm.org/D69281. This patch is trying to add the support for PowerPC backend. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D81727	2020-08-07 05:16:36 +00:00
Vitaly Buka	3b944733de	[StackSafety] Skip ambiguous lifetime analysis If we can't identify alloca used in lifetime marker we need to assume to worst case scenario. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D84630	2020-08-06 19:10:33 -07:00
Vitaly Buka	567e88646c	[LTO,NFC] Skip generateParamAccessSummary when empty addGlobalValueSummary can check newly added FunctionSummary and set HasParamAccess to mark that generateParamAccessSummary is needed. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D85182	2020-08-06 19:01:19 -07:00
Arthur Eubanks	f51f94852f	[NewPM] Add callback for skipped passes Parallel to https://reviews.llvm.org/D84772. Will use this for printing when a pass is skipped. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D85478	2020-08-06 18:58:59 -07:00
Matt Arsenault	b1038416e3	GlobalISel: Implement lower for G_INSERT_VECTOR_ELT	2020-08-06 19:29:17 -04:00
Snehasish Kumar	bfbcf062be	[NFC] Rename BBSectionsPrepare -> BasicBlockSections. Rename the BBSectionsPrepare pass as suggested by the review comment in https://reviews.llvm.org/D85368. Differential Revision: https://reviews.llvm.org/D85380	2020-08-06 13:12:06 -07:00
Matt Arsenault	fc03bd4465	GlobalISel: Implement fewerElementsVector for G_EXTRACT_VECTOR_ELT Use the same basic strategy as LegalizeVectorTypes. Try to index into smaller pieces if there's a constant index, and otherwise fall back to a stack temporary.	2020-08-06 14:33:16 -04:00
Matt Arsenault	f305dea485	AMDGPU: Define raw/struct variants of buffer atomic fadd Somehow the new FP atomic buffer intrinsics ended up using the legacy style for buffer intrinsics.	2020-08-06 13:36:19 -04:00
Simon Pilgrim	0add02e052	PDBExtras.h - remove unnecessary raw_ostream forward declaration. NFCI. We already need to include raw_ostream.h, also add missing StringRef.h implicit dependency.	2020-08-06 16:31:56 +01:00
Sanjay Patel	12ef22f0e5	[PatternMatch] allow intrinsic form of min/max with existing matchers I skimmed the existing users of these matchers and don't see any problems (eg, the caller assumes the matched value was a select instruction without checking). So I think we can generalize the matching to allow the new intrinsics or the cmp+select idioms. I did not find any unit tests for the matchers, so added some basics there. The instsimplify tests are adapted from existing tests for the cmp+select pattern and cover the folds in simplifyICmpWithMinMax(). Differential Revision: https://reviews.llvm.org/D85230	2020-08-06 10:50:24 -04:00
Raphael Isemann	bc5b29dac3	Revert "PDBExtras.h - remove unnecessary raw_ostream forward declaration. NFCI." This reverts commit 87c5437afd273e909e0fed3389de7531d5452ea5. The commit includes several headers in the middle of a function, which breaks pretty much everything.	2020-08-06 15:15:43 +02:00
Simon Pilgrim	a7159bc3c5	BitstreamRemarkParser.h - remove unnecessary includes. NFCI. Remove unused includes, moving to the lib header or cpp file as necessary.	2020-08-06 13:17:53 +01:00
Simon Pilgrim	e07ee0759f	Fix include sorting order. NFC	2020-08-06 11:46:53 +01:00
Simon Pilgrim	b4317ecea4	PDBExtras.h - remove unnecessary raw_ostream forward declaration. NFCI. We already need to include raw_ostream.h, also add missing StringRef.h and cstdint implicit dependencies. Remove unnecessary includes from PDBExtras.cpp	2020-08-06 11:28:42 +01:00
David Green	77d21dcd3f	[LoopVectorizer] Inloop vector reductions Arm MVE has multiple instructions such as VMLAVA.s8, which (in this case) can take two 128bit vectors, sign extend the inputs to i32, multiplying them together and sum the result into a 32bit general purpose register. So taking 16 i8's as inputs, they can multiply and accumulate the result into a single i32 without any rounding/truncating along the way. There are also reduction instructions for plain integer add and min/max, and operations that sum into a pair of 32bit registers together treated as a 64bit integer (even though MVE does not have a plain 64bit addition instruction). So giving the vectorizer the ability to use these instructions both enables us to vectorize at higher bitwidths, and to vectorize things we previously could not. In order to do that we need a way to represent that the reduction operation, specified with a llvm.experimental.vector.reduce when vectorizing for Arm, occurs inside the loop not after it like most reductions. This patch attempts to do that, teaching the vectorizer about in-loop reductions. It does this through a vplan recipe representing the reductions that the original chain of reduction operations is replaced by. Cost modelling is currently just done through a prefersInloopReduction TTI hook (which follows in a later patch). Differential Revision: https://reviews.llvm.org/D75069	2020-08-06 10:10:50 +01:00
Roman Lebedev	3b33b0cf64	[InstCombine] (-NSW x) s> x --> x s< 0 (PR39480) Name: (-x) s> x --> x s< 0 %neg_x = sub nsw i8 0, %x ; %x must not be INT_MIN %r = icmp sgt i8 %neg_x, %x => %r = icmp slt i8 %x, 0 https://rise4fun.com/Alive/ZslD https://bugs.llvm.org/show_bug.cgi?id=39480	2020-08-06 11:50:34 +03:00
Xing GUO	47c44c8042	[DWARFYAML][debug_info] Pull out dwarf::FormParams from DWARFYAML::Unit. Unit.Format, Unit.Version and Unit.AddrSize are replaced with dwarf::FormParams in D84496 to get rid of unnecessary functions getOffsetSize() and getRefSize(). However, that change makes it difficult to make AddrSize optional (Optional<uint8_t>). This change pulls out dwarf::FormParams from DWARFYAML::Unit and use it as a helper struct in DWARFYAML::emitDebugInfo(). Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D85296	2020-08-06 16:39:00 +08:00
Craig Topper	7ca0daba97	[X86] Rename X86::getImpliedFeatures to X86::updateImpliedFeatures and pass clang's StringMap directly to it. No point in building a vector of StringRefs for clang to apply to the StringMap. Just pass the StringMap and modify it directly.	2020-08-06 00:20:46 -07:00
Petr Hosek	70737c97db	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-05 16:07:11 -07:00
Greg Clayton	4e442757fd	Add verification for DW_AT_decl_file and DW_AT_call_file. LTO builds have been creating invalid DWARF and one of the errors was a file index that was out of bounds. "llvm-dwarfdump --verify" will check all file indexes for line tables already, but there are no checks for the validity of file indexes in attributes. The verification will verify if there is a DW_AT_decl_file/DW_AT_call_file that: - there is a line table for the compile unit - the file index is valid - the encoding is appropriate Tests are added that test all of the above conditions. Differential Revision: https://reviews.llvm.org/D84817	2020-08-05 15:30:13 -07:00
Rahman Lavaee	3a9a3691ee	[Propeller]: Use a descriptive temporary symbol name for the end of the basic block. This patch changes the functionality of AsmPrinter to name the basic block end labels as LBB_END${i}_${j}, with ${i} being the identifier for the function and ${j} being the identifier for the basic block. The new naming scheme is consistent with how basic block labels are named (.LBB${i}_{j}), and how function end symbol are named (.Lfunc_end${i}) and helps to write stronger tests for the upcoming patch for BB-Info section (as proposed in https://lists.llvm.org/pipermail/llvm-dev/2020-July/143512.html). The end label is used with basicblock-labels (BB-Info section in future) and basicblock-sections to compute the size of basic blocks and basic block sections, respectively. For BB sections, the section containing the entry basic block will not have a BB end label since it already gets the function end-label. This label is cached for every basic block (CachedEndMCSymbol) like the label for the basic block (CachedMCSymbol). Differential Revision: https://reviews.llvm.org/D83885	2020-08-05 13:17:19 -07:00
Stanislav Mekhanoshin	d688e1d62e	[AMDGPU] gfx1031 target Differential Revision: https://reviews.llvm.org/D85337	2020-08-05 12:36:26 -07:00
Evgenii Stepanov	03cf51be93	[msan] Remove readnone and friends from call sites. MSan removes readnone/readonly and similar attributes from callees, because after MSan instrumentation those attributes no longer apply. This change removes the attributes from call sites, as well. Failing to do this may cause DSE of paramTLS stores before calls to readonly/readnone functions. Differential Revision: https://reviews.llvm.org/D85259	2020-08-05 10:34:45 -07:00
Jordan Rupprecht	eb9074b6d8	Revert "[LoopVectorizer] Inloop vector reductions" This reverts commit e9761688e41cb979a1fa6a79eb18145a75104933. It breaks the build: ``` ~/src/llvm-project/llvm/lib/Analysis/IVDescriptors.cpp:868:10: error: no viable conversion from returned value of type 'SmallVector<[...], 8>' to function return type 'SmallVector<[...], 4>' return ReductionOperations; ```	2020-08-05 10:24:15 -07:00
Mircea Trofin	3bd1a7f753	[TFUtils] Expose untyped accessor to evaluation result tensors These were implementation detail, but become necessary for generic data copying. Also added const variations to them, and move assignment, since we had a move ctor (and the move assignment helps in a subsequent patch). Differential Revision: https://reviews.llvm.org/D85262	2020-08-05 10:22:45 -07:00
David Green	8e671cc375	[LoopVectorizer] Inloop vector reductions Arm MVE has multiple instructions such as VMLAVA.s8, which (in this case) can take two 128bit vectors, sign extend the inputs to i32, multiplying them together and sum the result into a 32bit general purpose register. So taking 16 i8's as inputs, they can multiply and accumulate the result into a single i32 without any rounding/truncating along the way. There are also reduction instructions for plain integer add and min/max, and operations that sum into a pair of 32bit registers together treated as a 64bit integer (even though MVE does not have a plain 64bit addition instruction). So giving the vectorizer the ability to use these instructions both enables us to vectorize at higher bitwidths, and to vectorize things we previously could not. In order to do that we need a way to represent that the reduction operation, specified with a llvm.experimental.vector.reduce when vectorizing for Arm, occurs inside the loop not after it like most reductions. This patch attempts to do that, teaching the vectorizer about in-loop reductions. It does this through a vplan recipe representing the reductions that the original chain of reduction operations is replaced by. Cost modelling is currently just done through a prefersInloopReduction TTI hook (which follows in a later patch). Differential Revision: https://reviews.llvm.org/D75069	2020-08-05 18:14:05 +01:00
Georgii Rymar	f2d8b0a536	[llvm-readobj] - Make decode_relrs() don't return Expected<>. NFCI. The `decode_relrs` helper is declared as: `Expected<std::vector<Elf_Rel>> decode_relrs(Elf_Relr_Range relrs) const;` it never returns an error though and hence can be simplified to return a vector. Differential revision: https://reviews.llvm.org/D85302	2020-08-05 17:05:47 +03:00
Simon Pilgrim	7502dabc07	DWARFVerifier.h - remove unnecessary forward declarations and includes. NFCI.	2020-08-05 12:42:44 +01:00
Simon Pilgrim	e9e363a4b8	GISelWorkList.h - remove unnecessary includes. NFCI.	2020-08-05 12:00:28 +01:00
Simon Pilgrim	8d6d113d6f	CallLowering.h - remove unnecessary CCState forward declaration. NFCI. Already defined in CallingConvLower.h	2020-08-05 12:00:28 +01:00
Hans Wennborg	c132b24d91	Revert "[CMake] Simplify CMake handling for zlib" This quietly disabled use of zlib on Windows even when building with -DLLVM_ENABLE_ZLIB=FORCE_ON. > Rather than handling zlib handling manually, use find_package from CMake > to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, > HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is > set to YES, which requires the distributor to explicitly select whether > zlib is enabled or not. This simplifies the CMake handling and usage in > the rest of the tooling. > > This is a reland of abb0075 with all followup changes and fixes that > should address issues that were reported in PR44780. > > Differential Revision: https://reviews.llvm.org/D79219 This reverts commit 10b1b4a231a485f1711d576e6131f6755e008abe and follow-ups 64d99cc6abed78c00a2a7863b02ce54911a5264f and f9fec0447e12da9e8cf4b628f6d45f4941e7d182.	2020-08-05 12:31:44 +02:00
Georgii Rymar	af54a4ae5b	[llvm-readobj/elf] - Add a testing for --stackmap and refine the implementation. Currently, we only test the `--stackmap` option here: https://github.com/llvm/llvm-project/blob/master/llvm/test/Object/stackmap-dump.test it uses a precompiled MachO binary currently and I've found no tests for this option for ELF. The implementation also has issues. For example, it might assert on a wrong version of the .llvm-stackmaps section. Or it might crash on an empty or truncated section. This patch introduces a new tools/llvm-readobj/ELF test file as well as implements a few basic checks to catch simple crashes/issues It also eliminates `unwrapOrError` calls in `printStackMap()`. Differential revision: https://reviews.llvm.org/D85208	2020-08-05 13:09:04 +03:00
Yevgeny Rouban	e8fe9281f9	DomTree: Make PostDomTree indifferent to block successors swap Fixed the commit c35585e209efe69e2233bdc5ecd23bed7b735ba3. This is a fix for the bug 46098 where PostDominatorTree is unexpectedly changed by InstCombine's branch swapping transformation. This patch fixes PostDomTree builder. While looking for the furthest away node in a reverse unreachable subgraph this patch runs DFS with successors in their function order. This order is indifferent to the order of successors, so is the furthest away node. Reviewers: kuhar, nikic, lebedev.ri Differential Revision: https://reviews.llvm.org/D84763	2020-08-05 14:26:32 +07:00
Mehdi Amini	c8fb2f08a3	Revert "DomTree: Make PostDomTree immune to block successors swap" This reverts commit c35585e209efe69e2233bdc5ecd23bed7b735ba3. The MLIR is broken with this patch, reproduce by adding -DLLVM_ENABLE_PROJECTS=mlir to the cmake configuration and build `ninja tools/mlir/lib/IR/CMakeFiles/obj.MLIRIR.dir/Dominance.cpp.o`	2020-08-05 04:32:44 +00:00
Evgeniy Brevnov	08e662f69a	[BPI][NFC] Unify handling of normal and SCC based loops This is one more NFC part extracted from D79485. Normal and SCC based loops have very different representation and have to be handled separatly each time we deal with loops. D79485 is going to introduce much more extensive use of loops what will be problematic with out this change. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D84838	2020-08-05 11:19:24 +07:00
Yevgeny Rouban	dd982a9b72	DomTree: Make PostDomTree immune to block successors swap This is another fix for the bug 46098 where PostDominatorTree is unexpectedly changed by InstCombine's branch swapping transformation. This patch fixes PostDomTree builder. While looking for the furthest away node in a reverse unreachable subgraph this patch runs DFS with successors in their function order. This order is indifferent to the order of successors, so is the furthest away node. Reviewers: kuhar, nikic, lebedev.ri Differential Revision: https://reviews.llvm.org/D84763	2020-08-05 11:06:54 +07:00
Matt Arsenault	dd7ad288a4	GlobalISel: Move load/store lowering to separate functions	2020-08-04 22:03:51 -04:00
Mircea Trofin	96e978c534	[llvm] Expose type and element count-related APIs on TensorSpec Added a mechanism to check the element type, get the total element count, and the size of an element. Differential Revision: https://reviews.llvm.org/D85250	2020-08-04 17:32:16 -07:00
Krzysztof Parzyszek	0ead9ed228	[RDF] Add operator<<(raw_ostream&, RegisterAggr), NFC	2020-08-04 18:40:07 -05:00
Krzysztof Parzyszek	242b3118d9	[RDF] Use hash-based containers, cache extra information This improves performance.	2020-08-04 18:36:49 -05:00
Yonghong Song	2c21d9e520	BPF: simplify IR generation for __builtin_btf_type_id() This patch simplified IR generation for __builtin_btf_type_id(). For __builtin_btf_type_id(obj, flag), previously IR builtin looks like if (obj is a lvalue) llvm.bpf.btf.type.id(obj.ptr, 1, flag) !type else llvm.bpf.btf.type.id(obj, 0, flag) !type The purpose of the 2nd argument is to differentiate __builtin_btf_type_id(obj, flag) where obj is a lvalue vs. __builtin_btf_type_id(obj.ptr, flag) Note that obj or obj.ptr is never used by the backend and the `obj` argument is only used to derive the type. This code sequence is subject to potential llvm CSE when - obj is the same .e.g., nullptr - flag is the same - metadata type is different, e.g., typedef of struct "s" and strust "s". In the above, we don't want CSE since their metadata is different. This patch change IR builtin to llvm.bpf.btf.type.id(seq_num, flag) !type and seq_num is always increasing. This will prevent potential llvm CSE. Also report an error if the type name is empty for remote relocation since remote relocation needs non-empty type name to do relocation against vmlinux. Differential Revision: https://reviews.llvm.org/D85174	2020-08-04 16:29:42 -07:00
Krzysztof Parzyszek	a88523630e	[RDF] Cache register aliases in PhysicalRegisterInfo This improves performance of PhysicalRegisterInfo::makeRegRef.	2020-08-04 18:10:00 -05:00
Arthur Eubanks	ff7fade869	[Hexagon] Use InstSimplify instead of ConstantProp This is the last remaining use of ConstantProp, migrate it to InstSimplify in the goal of removing ConstantProp. Add -hexagon-instsimplify option to enable skipping of instsimplify in tests that can't handle the extra optimization. Differential Revision: https://reviews.llvm.org/D85047	2020-08-04 15:42:39 -07:00
Krzysztof Parzyszek	3bf7627b97	[RDF] Remove uses of RDFRegisters::normalize (deprecate) This function has been reduced to an identity function for some time.	2020-08-04 17:02:12 -05:00
Matt Arsenault	ba4d17c159	GlobalISel: Add utilty for getting function argument live ins Get the argument register and ensure there's a copy to the virtual register. AMDGPU and AArch64 have similarish code to get the livein value, and I also want to use this in multiple places. This is a bit more aggressive about setting the register class than the original function, but that's probably OK. I think we're missing a few verifier checks for function live ins. I noticed AArch64's calling convention code is not actually adding liveins to functions, only the entry block (which apparently might not matter that much?). There should probably be a verifier check that entry block live ins are also live into the function. We also might need a verifier check that the copy to the livein virtual register is in the entry block.	2020-08-04 16:55:55 -04:00
Matt Arsenault	6dabf8d720	GlobalISel: Add node mappings for frameindex/blockaddress	2020-08-04 15:13:49 -04:00
Nikita Popov	a43b952b5a	[SCCP] Propagate inequalities Teach SCCP to create notconstant lattice values from inequality assumes and nonnull metadata, and update getConstant() to make use of them. Additionally isOverdefined() needs to be changed to consider notconstant an overdefined value. Handling inequality branches is delayed until our branch on undef story in other passes has been improved. Differential Revision: https://reviews.llvm.org/D83643	2020-08-04 20:20:52 +02:00
Xing GUO	7bc6977b8b	[DWARFYAML] Fix unintialized value Is64BitAddrSize. NFC. This patch fixes the undefined behavior that reported by ubsan. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/44524/	2020-08-05 00:28:17 +08:00
Cameron McInally	0f45fb4bbc	[GlobalISel] Don't transform FSUB(-0, X) -> FNEG(X) in GlobalISel. This patch stops unconditionally transforming FSUB(-0, X) into an FNEG(X) while building the MIR. This corresponds with the SelectionDAGISel change in D84056. Differential Revision: https://reviews.llvm.org/D85139	2020-08-04 11:27:09 -05:00
Fangrui Song	f6b39f50d1	[llvm-symbolizer] Switch command line parsing from llvm::cl to OptTable for the advantage outlined by D83639 ([OptTable] Support grouped short options) Some behavior changes: * -i={0,false} is removed. Use --no-inlines instead. * --demangle={0,false} is removed. Use --no-demangle instead * -untag-addresses={0,false} is removed. Use --no-untag-addresses instead Added a higher level API OptTable::parseArgs which handles optional initial options populated from an environment variable, expands response files recursively, and parses options. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83530	2020-08-04 08:53:15 -07:00
Yonghong Song	6a0708296c	[clang][BPF] support type exist/size and enum exist/value relocations This patch added the following additional compile-once run-everywhere (CO-RE) relocations: - existence/size of typedef, struct/union or enum type - enum value and enum value existence These additional relocations will make CO-RE bpf programs more adaptive for potential kernel internal data structure changes. For existence/size relocations, the following two code patterns are supported: 1. uint32_t __builtin_preserve_type_info((<type> )0, flag); 2. <type> var; uint32_t __builtin_preserve_field_info(var, flag); flag = 0 for existence relocation and flag = 1 for size relocation. For enum value existence and enum value relocations, the following code pattern is supported: uint64_t __builtin_preserve_enum_value((<enum_type> )<enum_value>, flag); flag = 0 means existence relocation and flag = 1 for enum value. relocation. In the above <enum_type> can be an enum type or a typedef to enum type. The <enum_value> needs to be an enumerator value from the same enum type. The return type is uint64_t to permit potential 64bit enumerator values. Differential Revision: https://reviews.llvm.org/D83242	2020-08-04 08:39:53 -07:00
Sander de Smalen	d910125a2c	[AArch64][SVE] Fix CFA calculation in presence of SVE objects. The CFA is calculated as (SP/FP + offset), but when there are SVE objects on the stack the SP offset is partly scalable and should instead be expressed as the DWARF expression: SP + offset + scalable_offset * VG where VG is the Vector Granule register, containing the number of 64bits 'granules' in a scalable vector. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D84043	2020-08-04 11:47:06 +01:00
Xing GUO	60437c01c6	[YAMLTraits] Fix mapping <none> value that followed by comments. When mapping an optional value, if the value is <none> and followed by comments, there will be a parsing error. This patch helps fix this issue. e.g., When mapping the following YAML, ``` Sections: - Name: blah Type: SHT_foo Flags: [[FLAGS=<none>]] ## some comments. ``` the raw value of `ScalarNode` is "<none> " rather than "<none>". We need to remove the spaces. Differential Revision: https://reviews.llvm.org/D85180	2020-08-04 18:36:05 +08:00
Xing GUO	bbcdb3cf6d	[YAMLParser] Fix a typo: iff -> if. NFC.	2020-08-04 12:42:42 +08:00
hgreving	1fccd57ea4	[MC] Fix memory leak when allocating MCInst with bump allocator Adds the function createMCInst() to MCContext that creates a MCInst using a typed bump alloctor. MCInst contains a SmallVector<MCOperand, 8>. The SmallVector is POD only for <= 8 operands. The default untyped bump pointer allocator of MCContext does not delete the MCInst, so if the SmallVector grows, it's a leak. This fixes https://bugs.llvm.org/show_bug.cgi?id=46900.	2020-08-03 16:08:26 -07:00
Alina Sbirlea	592b072474	[MemorySSA] Restrict optimizations after a PhiTranslation. Merging alias results from different paths, when a path did phi translation is not necesarily correct. Conservatively terminate such paths. Aimed to fix PR46156. Differential Revision: https://reviews.llvm.org/D84905	2020-08-03 14:46:41 -07:00
Thomas Lively	d648be35e8	[WebAssembly] Implement prototype v128.load{32,64}_zero instructions Specified in https://github.com/WebAssembly/simd/pull/237, these instructions load the first vector lane from memory and zero the other lanes. Since these instructions are not officially part of the SIMD proposal, they are only available on an opt-in basis via LLVM intrinsics and clang builtin functions. If these instructions are merged to the proposal, this implementation will change so that the instructions will be generated from normal IR. At that point the intrinsics and builtin functions would be removed. This PR also changes the opcodes for the experimental f32x4.qfm{a,s} instructions because their opcodes conflicted with those of the v128.load{32,64}_zero instructions. The new opcodes were chosen to match those used in V8. Differential Revision: https://reviews.llvm.org/D84820	2020-08-03 13:54:00 -07:00
Jian Cai	7f1bdb7b20	[X86] support .nops directive Add support of .nops on X86. This addresses llvm.org/PR45788. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D82826	2020-08-03 11:50:56 -07:00
Hiroshi Yamauchi	0b0a5993c1	[PGO] Extend the value profile buckets for mem op sizes. Extend the memop value profile buckets to be more flexible (could accommodate a mix of individual values and ranges) and to cover more value ranges (from 11 to 22 buckets). Disabled behind a flag (to be enabled separately) and the existing code to be removed later. Differential Revision: https://reviews.llvm.org/D81682	2020-08-03 11:04:32 -07:00
Arthur Eubanks	77ef1e98d6	[NewPM][LoopVersioning] Port LoopVersioning to NPM Reviewed By: ychen, fhahn Differential Revision: https://reviews.llvm.org/D85063	2020-08-03 10:32:09 -07:00
Kevin P. Neal	0c24e52a4d	[FPEnv] IRBuilder fails to add strictfp attribute The strictfp attribute is required on all function calls in a function that is itself marked with the strictfp attribute. The IRBuilder knows this and has a method for adding the attribute to function call instructions. If a function being called has the strictfp attribute itself then the IRBuilder will refuse to add the attribute to the calling instruction despite being asked to add it. Eliminate this error. Differential Revision: https://reviews.llvm.org/D84878	2020-08-03 13:25:24 -04:00
Mircea Trofin	1cbf2902fb	[llvm] Add a parser from JSON to TensorSpec A JSON->TensorSpec utility we will use subsequently to specify additional outputs needed for certain training scenarios. Differential Revision: https://reviews.llvm.org/D84976	2020-08-03 09:49:31 -07:00
Xing GUO	8d54ee8a03	[DWARFYAML] Implement the .debug_loclists section. This patch implements the .debug_loclists section. There are only two DWARF expressions are implemented in this patch (DW_OP_consts, DW_OP_stack_value). We will implement more in the future. The YAML description of the .debug_loclists section is: ``` debug_loclists: - Format: DWARF32 ## Optional Length: 0x1234 ## Optional Version: 5 ## Optional (5 by default) AddressSize: 8 ## Optional SegmentSelectorSize: 0 ## Optional (0 by default) OffsetEntryCount: 1 ## Optional Offsets: [ 1 ] ## Optional Lists: - Entries: - Operator: DW_LLE_startx_endx Values: [ 0x1234, 0x4321 ] DescriptorsLength: 0x1234 ## Optional Descriptors: - Operator: DW_OP_consts Values: [ 0x1234 ] ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D84234	2020-08-03 23:20:15 +08:00
Shinji Okumura	4e0af248b2	[NFC][APInt][DenseMapInfo] Move DenseMapAPIntKeyInfo into DenseMap.h as DenseMapInfo<APInt> `DenseMapAPIntKeyInfo` is now located in `lib/IR/LLVMContextImpl.h`. Moved it into `include/ADT/DenseMapInfo.h` to use it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85131	2020-08-03 23:31:13 +09:00
Matt Arsenault	7371d3a454	GlobalISel: Reimplement moreElementsVectorDst Use pad with undef and unmerge with unused results. This is annoyingly similar to several other places in LegalizerHelper, but they're all slightly different.	2020-08-03 09:03:48 -04:00
Georgii Rymar	c0d697302e	[yaml2obj] - Add a support for "<none>" value for all optional fields. It implements an approach suggested in the D84398 thread. With it the following: ``` Sections: - Name: .bar Type: SHT_PROGBITS Offset: [[MACRO=<none>]] ``` works just like the `Offset` key was not specified. It is useful for tests that want to have a default value for a field and to have a way to override it at the same time. Differential revision: https://reviews.llvm.org/D84526	2020-08-03 12:27:39 +03:00
Djordje Todorovic	635b6d3d23	[NFC] [MIR] Document the reg state flags This patch adds documentation for the RegState enumeration. Differential Revision: https://reviews.llvm.org/D84634	2020-08-03 09:03:24 +02:00
Fangrui Song	afa11d8b91	[MC] Default MCAsmBackend::mayNeedRelaxation() to false	2020-08-02 22:13:59 -07:00
Florian Hahn	5eea1c2f70	Recommit "[IPConstProp] Remove and move tests to SCCP." This reverts commit 59d6e814ce0e7b40b7cc3ab136b9af2ffab9c6f8. The cause for the revert (3 clang tests running opt -ipconstprop) was fixed by removing those lines.	2020-08-02 22:23:54 +01:00
Simon Pilgrim	5a07f0ee62	[IR] Add IRBuilderBase::CreateVectorSplat(ElementCount EC) variant As discussed on D81500, this adds a more general ElementCount variant of the build helper and converts the (non-scalable) unsigned NumElts variant to use it internally.	2020-08-02 16:55:38 +01:00
Matt Arsenault	e7dc03b43a	GlobalISel: Implement bitcast action for G_EXTRACT_VECTOR_ELEMENT For AMDGPU, vectors with elements < 32 bits should be indexed in 32-bit elements and the desired bits extracted from there. For elements > 64-bits, these should be reduce to 64/32 elements to enable the normal dynamic indexing paths. In the dynamic index cases, this produces shorter code most of the time. This does immediately regress the constant index cases, but this should be fixed once we have the most basic of shift combines. The element size > 64 case is pretty much ported from the exisiting DAG implementation for extract element promote. The increasing element size case is new.	2020-08-02 10:42:07 -04:00
Simon Pilgrim	2fc1176955	[DAG] TargetLowering::expandMUL_LOHI - pass SDLoc as const& Try to be more consistent with the SDLoc param in the TargetLowering methods. This also exposes an issue where we were passing a SDNode as a SDLoc, relying on the implicit SDLoc(SDNode) constructor.	2020-08-02 15:31:36 +01:00
Simon Pilgrim	c5bd9ad18f	[DAG] TargetLowering::LowerAsmOutputForConstraint - pass SDLoc as const& Try to be more consistent with the SDLoc param in the TargetLowering methods.	2020-08-02 15:12:02 +01:00
Shinji Okumura	7c6b247bf7	Revert "[Attributor] AAPotentialValues Interface" The commit cause build failure.	2020-08-02 22:49:52 +09:00
Shinji Okumura	5dd3ff33f5	[Attributor] AAPotentialValues Interface This is a split patch of D80991. This patch introduces AAPotentialValues and its interface only. For more detail of AAPotentialValues abstract attribute, see the original patch. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D83283	2020-08-02 19:12:17 +09:00
Kazu Hirata	6e4cee6f1e	Use llvm::is_contained where appropriate (NFC) Use llvm::is_contained where appropriate (NFC) Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D85083	2020-08-01 21:51:06 -07:00
Florian Hahn	a60d0184bb	[LCSSA] Provide option for caller to clean up unused PHIs. formLCSSAForInstructions is used by SCEVExpander, which tracks all inserted instructions including LCSSA phis using asserting value handles. This means cleanup needs to happen in the caller. Extend formLCSSAForInstructions to take an optional pointer to a vector. If this argument is non-nullptr, instead of directly deleting the phis, add them to the vector, so the caller can process them. This should address various PPC buildbot failures, including http://lab.llvm.org:8011/builders/clang-ppc64be-linux-lnt/builds/40567	2020-08-01 20:43:19 +01:00
Florian Hahn	9f0a79c54a	[LCSSA] Use IRBuilder for PHI creation. Use IRBuilder instead PHINode::Create. This should not impact the generated code, but IRBuilder provides a way to register callbacks for inserted instructions, which is convenient for some users. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D85037	2020-08-01 18:44:15 +01:00
Luofan Chen	1a5b0cb812	[Attributor][NFC] Update description for the dependency graph The word "dependency graph" is a bit misleading. When there is an edge from node A to B (A -> B), it actually mean that B depends on A and when the state of A is updated, B should also be updated. So I update the comment to make the description clearer. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85065	2020-08-01 21:27:16 +08:00
Chen Zheng	a58d9fd6a7	[SCEV] don't query getSCEV() for incomplete phis querying getSCEV() for incomplete phis leads to wrong cache value in `ExprToIVMap`, because incomplete phis may be simplified to same value before get SCEV expression. Reviewed By: lebedev.ri, mkazantsev Differential Revision: https://reviews.llvm.org/D77560	2020-08-01 02:38:54 -04:00
Sidharth Baveja	14aeea5cd6	[Loop Peeling] Separate the Loop Peeling Utilities from the Loop Unrolling Utilities Summary: This patch separates the Loop Peeling Utilities from Loop Unrolling. The reason for this change is that Loop Peeling is no longer only being used by loop unrolling; Patch D82927 introduces loop peeling with fusion, such that loops can be modified to have to same trip count, making them legal to be peeled. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D83056	2020-07-31 18:31:58 +00:00
Fangrui Song	bf5334b827	[Support][CommandLine] Delete unused llvm:🆑:ParseEnvrironmentOptions The function was added in 2003. It is not used and can be emulated with ParseCommandLineOptions.	2020-07-31 10:48:09 -07:00
Aditya Nandakumar	313711b2cf	[GISel] Add combiners for G_INTTOPTR and G_PTRTOINT https://reviews.llvm.org/D84909 Patch adds two new GICombinerRules, one for G_INTTOPTR and one for G_PTRTOINT. The G_INTTOPTR elides ptr2int(int2ptr(x)) to a copy of x, if the cast is within the same address space. The G_PTRTOINT elides int2ptr(ptr2int(x)) to a copy of x. Patch additionally adds new combiner tests for the AArch64 target to test these new combiner rules. Patch by mkitzan	2020-07-31 10:13:36 -07:00
Sameer Arora	fccf62295c	[llvm-libtool-darwin] Refactor Slice and writeUniversalBinary Refactoring `Slice` class and function `createUniversalBinary` from `llvm-lipo` into MachOUniversalWriter. This refactoring is necessary so as to use the refactored code for creating universal binaries under llvm-libtool-darwin. Reviewed by alexshap, smeenai Differential Revision: https://reviews.llvm.org/D84662	2020-07-31 09:22:35 -07:00
Xing GUO	8ad7694f28	[DWARFYAML][debug_aranges] Use yaml::Hex64 rather than uint64_t as length. NFC. It's better to use yaml::Hex64 as length in the tuples of the address range table.	2020-07-31 23:31:04 +08:00
Matt Arsenault	4eb4bb060f	Support addrspacecast initializers with isNoopAddrSpaceCast Moves isNoopAddrSpaceCast to the TargetMachine. It logically belongs with the DataLayout.	2020-07-31 10:42:43 -04:00
Xing GUO	9f60593353	[DWARFYAML] Make the debug_aranges entry optional. This patch makes the 'debug_aranges' entry optional. If the entry is empty, yaml2obj will only emit the header for it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D84921	2020-07-31 20:18:53 +08:00
Xing GUO	3bfc8f644d	[DWARFYAML] Add helper function getDWARFEmitterByName(). NFC. In this patch, we add a helper function getDWARFEmitterByName(). This function returns the proper DWARF section emitting method by the name. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D84952	2020-07-31 20:07:39 +08:00
Xing GUO	65b9283070	[DWARFYAML] Add emitDebug[GNU]Pub[names/types] functions. NFC. In this patch, emitDebugPubnames(), emitDebugPubtypes(), emitDebugGNUPubnames(), emitDebugGNUPubtypes() are added. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85003	2020-07-31 20:05:30 +08:00
Vitaly Buka	1bae08d2a5	[NFC] Remove unused GetUnderlyingObject paramenter Depends on D84617. Differential Revision: https://reviews.llvm.org/D84621	2020-07-31 02:10:03 -07:00
Balazs Benics	42bc0ed1ab	[analyzer] Fix out-of-tree only clang build by not relaying on private header It turned out that the D78704 included a private LLVM header, which is excluded from the LLVM install target. I'm substituting that `#include` with the public one by moving the necessary `#define` into that. There was a discussion about this at D78704 and on the cfe-dev mailing list. I'm also placing a note to remind others of this pitfall. Reviewed By: mgorny Differential Revision: https://reviews.llvm.org/D84929	2020-07-31 10:28:14 +02:00
Fangrui Song	2085074770	[MC] Support infix operator ! Disabled for Darwin mode. Also disabled for ARM which has compatible aliases (implied 'sp' operand in 'srs*' instructions like 'srsda #31!').	2020-07-30 23:25:53 -07:00
Lang Hames	8bc77257a4	[JITLink] Use correct Addressable constructor. Calling createAddressable(false) generates an absolute symbol. We want createAddressable(0, false), which generates an external symbol.	2020-07-30 22:48:57 -07:00
Wei Mi	b565123367	Fix a crash when the sample profile uses md5 and -sample-profile-merge-inlinee is enabled. When -sample-profile-merge-inlinee is enabled, new FunctionSamples may be created during profile merge without GUIDToFuncNameMap being initialized. That will occasionally cause compiler crash. The patch fixes it. Differential Revision: https://reviews.llvm.org/D84994	2020-07-30 21:21:06 -07:00
Vitaly Buka	4ee4573a60	[NFC] GetUnderlyingObject -> getUnderlyingObject I am going to touch them in the next patch anyway	2020-07-30 21:08:24 -07:00
Arthur Eubanks	050789b0c9	[NewPM] Don't print 'Invalidating all non-preserved analyses' If an analysis is actually invalidated, there's already a log statement for that: 'Invalidating analysis: FooAnalysis'. Otherwise the statement is not very useful. Reviewed By: asbirlea, ychen Differential Revision: https://reviews.llvm.org/D84981	2020-07-30 19:40:29 -07:00
Vitaly Buka	0093612032	[ValueTracking] Remove AllocaForValue parameter findAllocaForValue uses AllocaForValue to cache resolved values. The function is used only to resolve arguments of lifetime intrinsic which usually are not fare for allocas. So result reuse is likely unnoticeable. In followup patches I'd like to replace the function with GetUnderlyingObjects. Depends on D84616. Differential Revision: https://reviews.llvm.org/D84617	2020-07-30 18:48:34 -07:00
Vitaly Buka	fe28af466f	[NFC] Move findAllocaForValue into ValueTracking.h Differential Revision: https://reviews.llvm.org/D84616	2020-07-30 18:22:59 -07:00
kuterd	7bdcceaabf	[Attributor] Add time trace support. This patch addes time trace functionality to have a better understanding of the analysis times. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D84980	2020-07-31 03:08:50 +03:00
Amara Emerson	dd34b50099	[AArch64][GlobalISel] Add legalization & selection support for G_INTRINSIC_LRINT. Differential Revision: https://reviews.llvm.org/D84552	2020-07-30 16:14:56 -07:00
Lang Hames	6ca1bd0130	[llvm-jitlink] Add -harness option to llvm-jitlink. The -harness option enables new testing use-cases for llvm-jitlink. It takes a list of objects to treat as a test harness for any regular objects passed to llvm-jitlink. If any files are passed using the -harness option then the following transformations are applied to all other files: (1) Symbols definitions that are referenced by the harness files are promoted to default scope. (This enables access to statics from test harness). (2) Symbols definitions that clash with definitions in the harness files are deleted. (This enables interposition by test harness). (3) All other definitions in regular files are demoted to local scope. (This causes untested code to be dead stripped, reducing memory cost and eliminating spurious unresolved symbol errors from untested code). These transformations allow the harness files to reference and interpose symbols in the regular object files, which can be used to support execution tests (including fuzz tests) of functions in relocatable objects produced by a build.	2020-07-30 15:26:19 -07:00
Lang Hames	eeb8251d74	[JITLink] Allow JITLinkContext::notifyResolved to return an Error. This allows clients to detect invalid transformations applied by JITLink passes (e.g. inserting or removing symbols in unexpected ways) and terminate linking with an error. This change is used to simplify the error propagation logic in ObjectLinkingLayer.	2020-07-30 15:26:18 -07:00
Nikita Popov	9bc2ea2f9d	[ConstantRange] Support abs with poison flag This just adds the ConstantRange support, including exhaustive testing. It's not wired up to the IR intrinsic flag yet.	2020-07-30 22:49:28 +02:00
Yuanfang Chen	e1803bebb8	[NewPM][PassInstrument] Add PrintPass callback to StandardInstrumentations Problem: Right now, our "Running pass" is not accurate when passes are wrapped in adaptor because adaptor is never skipped and a pass could be skipped. The other problem is that "Running pass" for a adaptor is before any "Running pass" of passes/analyses it depends on. (for example, FunctionToLoopPassAdaptor). So the order of printing is not the actual order. Solution: Doing things like PassManager::Debuglogging is very intrusive because we need to specify Debuglogging whenever adaptor is created. (Actually, right now we're not specifying Debuglogging for some sub-PassManagers. Check PassBuilder) This patch move debug logging for pass as a PassInstrument callback. We could be sure that all running passes are logged and in the correct order. This could also be used to implement hierarchy pass logging in legacy PM. We could also move logging of pass manager to this if we want. The test fixes looks messy. It includes changes: - Remove PassInstrumentationAnalysis - Remove PassAdaptor - If a PassAdaptor is for a real pass, the pass is added - Pass reorder (to the correct order), related to PassAdaptor - Add missing passes (due to Debuglogging not passed down) Reviewed By: asbirlea, aeubanks Differential Revision: https://reviews.llvm.org/D84774	2020-07-30 10:07:57 -07:00
Johannes Doerfert	49c4fd21d9	[OpenMP][IRBuilder] Support allocas in nested parallel regions We need to keep track of the alloca insertion point (which we already communicate via the callback to the user) as we place allocas as well. Reviewed By: fghanim, SouraVX Differential Revision: https://reviews.llvm.org/D82470	2020-07-30 10:19:39 -05:00
Simon Pilgrim	bfd473a1a3	Attributor.h - remove unnecessary includes. NFCI. Fix implicit cpp include dependencies.	2020-07-30 15:26:41 +01:00
Florian Hahn	1316430704	Revert "[IPConstProp] Remove and move tests to SCCP." This reverts commit e77624a3be942c7abba48942b3a8da3462070a3f. Looks like some clang tests manually invoke -ipconstprop via opt.....	2020-07-30 13:06:54 +01:00
Florian Hahn	ce3655671a	[IPConstProp] Remove and move tests to SCCP. As far as I know, ipconstprop has not been used in years and ipsccp has been used instead. This has the potential for confusion and sometimes leads people to spend time finding & reporting bugs as well as updating it to work with the latest API changes. This patch moves the tests over to SCCP. There's one functional difference I am aware of: ipconstprop propagates for each call-site individually, so for functions that are called with different constant arguments it can sometimes produce better results than ipsccp (at much higher compile-time cost).But IPSCCP can be thought to do so as well for internal functions and as mentioned earlier, the pass seems unused in practice (and there are no plans on working towards enabling it anytime). Also discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2020-July/143773.html Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D84447	2020-07-30 12:36:27 +01:00
Simon Pilgrim	529210d2c9	VectorUtils.h - reduce unnecessary includes. NFC. Replace TargetLibraryInfo.h include with forward declaration and fix implicit dependencies. Reduce SmallSet.h include to SmallVector.h include.	2020-07-30 12:27:49 +01:00
Xing GUO	75e53e4e5b	[DWARFYAML] Make the 'Length' field of the address range table optional. This patch makes the 'Length' field of the address range table optional. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D84911	2020-07-30 17:42:18 +08:00
Xing GUO	495386e223	[DWARFYAML] Make the 'AddressSize', 'SegmentSelectorSize' fields optional. This patch makes the 'AddressSize' and 'SegmentSelectorSize' fields of address range table optional. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D84907	2020-07-30 17:39:58 +08:00
Serge Pavlov	1398f4e3fc	[Support] Class to facilitate file locking This change define RAII class `FileLocker` and methods `lock` and `tryLockFor` of the class `raw_fd_stream` to facilitate using file locks. Differential Revision: https://reviews.llvm.org/D79066	2020-07-30 13:42:20 +07:00
Yuanfang Chen	f89d6ca262	NFC. Fix comments	2020-07-29 19:18:01 -07:00
Matt Arsenault	e2b102c48a	GlobalISel: Handle llvm.roundeven I still think it's highly questionable that we have two intrinsics with identical behavior and only vary by the name of the libcall used if it happens to be lowered that way, but try to reduce the feature delta between SDAG and GlobalISel for recently added intrinsics. I'm not sure which opcode should be considered the canonical one, but lower roundeven back to round.	2020-07-29 20:01:12 -04:00
Mircea Trofin	ff4bf8bfb5	[llvm][NFC] TensorSpec abstraction for ML evaluator Further abstracting the specification of a tensor, to more easily support different types and shapes of tensor, and also to perform initialization up-front, at TFModelEvaluator construction time. Differential Revision: https://reviews.llvm.org/D84685	2020-07-29 16:29:21 -07:00
Nikita Popov	fca04145c2	[ConstantRange] Add API for intrinsics (NFC) This adds a common API for compute constant ranges of intrinsics. The intention here is that a) we can reuse the same code across different passes that handle constant ranges, i.e. this can be reused in SCCP b) we only have to add knowledge about supported intrinsics to ConstantRange, not any consumers. Differential Revision: https://reviews.llvm.org/D84587	2020-07-29 22:16:27 +02:00
Victor Huang	827e26bee8	[PowerPC] Support for R_PPC64_REL24_NOTOC calls where the caller has no TOC and the callee is not DSO local This patch supports the situation where caller does not have a valid TOC and calls using the R_PPC64_REL24_NOTOC relocation and the callee is not DSO local. In this case the call cannot be made directly since the callee may or may not require a valid TOC pointer. As a result this situation require a PC-relative plt stub to set up r12. Reviewed By: sfertile, MaskRay, stefanp Differential Revision: https://reviews.llvm.org/D83669	2020-07-29 19:49:28 +00:00
Florian Hahn	bf9e3782d5	Reland "[SCEVExpander] Add option to preserve LCSSA directly." This reverts the revert commit dc2867576886247cbe351e7c63618c09ab6af808. It includes a fix for Polly, which uses SCEVExpander on IR that is not in LCSSA form. Set PreserveLCSSA = false in that case, to ensure we do not introduce LCSSA phis where there were none before.	2020-07-29 20:41:53 +01:00
Amara Emerson	d7527ffdb2	[GlobalISel] Add G_INTRINSIC_LRINT and translate from llvm.lrint Differential Revision: https://reviews.llvm.org/D84551	2020-07-29 11:51:04 -07:00
Philip Reames	742638c542	[Statepoint] Consolidate relocation type tracking [NFC] Change the way we track how a particular pointer was relocated at a statepoint in selection dag. Previously, we used an optional<location> for the spill lowering, and a block local Register for the newly introduced vreg lowering. Combine all three lowerings (norelocate, spill, and vreg) into a single helper class, and keep a single copy of the information. This is submitted separately as it really does make the code more readible on it's own, but the indirect motivation is to move vreg tracking from StatepointLowering to FunctionLoweringInfo. This is the last piece needed to support cross block relocations with vregs; that will follow in a separate (non-NFC) patch.	2020-07-29 11:45:31 -07:00
Amara Emerson	9d49eb2e5c	[AArch64][GlobalISel] Selection support for vector DUP[X]lane instructions. In future, we'd like to use the perfect-shuffle mechanism to deal with these shuffle permutations. For now, this improves performance by avoiding the super-expensive const-pool load + tbl instruction. Differential Revision: https://reviews.llvm.org/D84866	2020-07-29 11:41:37 -07:00
Florian Hahn	9db6d6a866	Revert "[SCEVExpander] Add option to preserve LCSSA directly." This reverts commit 99166fd4fb422351f131fb1265cb85d5f6c5b8da, because it breaks the polly builders. polly/test/Isl/CodeGen/invariant_load_escaping_second_scop.ll fails because a apparently unnecessary LCSSA phi node is introduced. Make the bots green again, while I take a closer look.	2020-07-29 19:19:04 +01:00
Matt Arsenault	2428f2b78e	GlobalISel: Implement lower for G_EXTRACT_VECTOR_ELT Use the basic store to stack and reload.	2020-07-29 14:16:28 -04:00
Arthur Eubanks	6d7e104f99	[DFSan][NewPM] Port DataFlowSanitizer to NewPM Reviewed By: ychen, morehouse Differential Revision: https://reviews.llvm.org/D84707	2020-07-29 10:19:15 -07:00
Roman Lebedev	4a9109b967	[SimplifyCFG][LoopRotate] SimplifyCFG: disable common instruction hoisting by default, enable late in pipeline I've been looking at missed vectorizations in one codebase. One particular thing that stands out is that some of the loops reach vectorizer in a rather mangled form, with weird PHI's, and some of the loops aren't even in a rotated form. After taking a more detailed look, that happened because the loop's headers were too big by then. It is evident that SimplifyCFG's common code hoisting transform is at fault there, because the pattern it handles is precisely the unrotated loop basic block structure. Surprizingly, `SimplifyCFGOpt::HoistThenElseCodeToIf()` is enabled by default, and is always run, unlike it's friend, common code sinking transform, `SinkCommonCodeFromPredecessors()`, which is not enabled by default and is only run once very late in the pipeline. I'm proposing to harmonize this, and disable common code hoisting until //late// in pipeline. Definition of //late// may vary, here currently i've picked the same one as for code sinking, but i suppose we could enable it as soon as right after loop rotation happens. Experimentation shows that this does indeed unsurprizingly help, more loops got rotated, although other issues remain elsewhere. Now, this undoubtedly seriously shakes phase ordering. This will undoubtedly be a mixed bag in terms of both compile- and run- time performance, codesize. Since we no longer aggressively hoist+deduplicate common code, we don't pay the price of said hoisting (which wasn't big). That may allow more loops to be rotated, so we pay that price. That, in turn, that may enable all the transforms that require canonical (rotated) loop form, including but not limited to vectorization, so we pay that too. And in general, no deduplication means more [duplicate] instructions going through the optimizations. But there's still late hoisting, some of them will be caught late. As per benchmarks i've run {F12360204}, this is mostly within the noise, there are some small improvements, some small regressions. One big regression i saw i fixed in rG8d487668d09fb0e4e54f36207f07c1480ffabbfd, but i'm sure this will expose many more pre-existing missed optimizations, as usual :S llvm-compile-time-tracker.com thoughts on this: http://llvm-compile-time-tracker.com/compare.php?from=e40315d2b4ed1e38962a8f33ff151693ed4ada63&to=c8289c0ecbf235da9fb0e3bc052e3c0d6bff5cf9&stat=instructions * this does regress compile-time by +0.5% geomean (unsurprizingly) * size impact varies; for ThinLTO it's actually an improvement The largest fallout appears to be in GVN's load partial redundancy elimination, it spends much more time in `MemoryDependenceResults::getNonLocalPointerDependency()`. Non-local `MemoryDependenceResults` is widely-known to be, uh, costly. There does not appear to be a proper solution to this issue, other than silencing the compile-time performance regression by tuning cut-off thresholds in `MemoryDependenceResults`, at the cost of potentially regressing run-time performance. D84609 attempts to move in that direction, but the path is unclear and is going to take some time. If we look at stats before/after diffs, some excerpts: * RawSpeed (the target) {F12360200} * -14 (-73.68%) loops not rotated due to the header size (yay) * -272 (-0.67%) `"Number of live out of a loop variables"` - good for vectorizer * -3937 (-64.19%) common instructions hoisted * +561 (+0.06%) x86 asm instructions * -2 basic blocks * +2418 (+0.11%) IR instructions * vanilla test-suite + RawSpeed + darktable {F12360201} * -36396 (-65.29%) common instructions hoisted * +1676 (+0.02%) x86 asm instructions * +662 (+0.06%) basic blocks * +4395 (+0.04%) IR instructions It is likely to be sub-optimal for when optimizing for code size, so one might want to change tune pipeline by enabling sinking/hoisting when optimizing for size. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D84108	2020-07-29 20:05:30 +03:00
Yuanfang Chen	4fad307f88	[NewPM][PassInstrument] Make PrintIR and TimePasses to use before-pass-run callback Reviewed By: asbirlea, aeubanks Differential Revision: https://reviews.llvm.org/D84773	2020-07-29 08:26:36 -07:00
Yuanfang Chen	53414ad22f	[NewPM][PassInstrument] Add a new kind of before-pass callback that only get called if the pass is not skipped TODO * PrintIRInstrumentation and TimePassesHandler would be using this new callback. * "Running pass" logging will also be moved to use this callback. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D84772	2020-07-29 08:26:36 -07:00
Johannes Doerfert	dd2810924a	[OpenMP] Allow traits for the OpenMP context selector `isa` It was unclear what `isa` was supposed to mean so we did not provide any traits for this context selector. With this patch we will allow any string or identifier. We use the target attribute and target info to determine if the trait matches. In other words, we will check if the provided value is a target feature that is available (at the call site). Fixes PR46338 Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D83281	2020-07-29 10:22:27 -05:00
Florian Hahn	63c884fbb4	[SCEVExpander] Add option to preserve LCSSA directly. This patch teaches SCEVExpander to directly preserve LCSSA. As it is currently, SCEV does not look through PHI nodes in loops, as it might break LCSSA form. Once SCEVExpander can preserve LCSSA form, it should be safe for SCEV to look through PHIs. To preserve LCSSA form, this patch uses formLCSSAForInstructions on operands of newly created instructions, if the definition is inside a different loop than the new instruction. The final value we return from expandCodeFor may also need LCSSA phis, depending on the insert point. As no user for it exists there yet, create a temporary instruction at the insert point, which can be passed to formLCSSAForInstructions. This temporary instruction is removed after LCSSA construction. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D71538	2020-07-29 15:07:37 +01:00
Victor Campos	9c2a0c2f38	[Driver][ARM] Disable unsupported features when nofp arch extension is used A list of target features is disabled when there is no hardware floating-point support. This is the case when one of the following options is passed to clang: - -mfloat-abi=soft - -mfpu=none This option list is missing, however, the extension "+nofp" that can be specified in -march flags, such as "-march=armv8-a+nofp". This patch also disables unsupported target features when nofp is passed to -march. Differential Revision: https://reviews.llvm.org/D82948	2020-07-29 14:13:22 +01:00
Simon Pilgrim	44836ba0cf	[TTI] Move abs/smax/smin/umax/umin cost expansion to ICA getIntrinsicInstrCost variant This will simplify target overrides, and matches what we do for most integer intrinsic costs.	2020-07-29 13:44:38 +01:00
David Green	49873f2449	[Analysis] TTI: Add CastContextHint for getCastInstrCost Currently, getCastInstrCost has limited information about the cast it's rating, often just the opcode and types. Sometimes there is a context instruction as well, but it isn't trustworthy: for instance, when the vectorizer is rating a plan, it calls getCastInstrCost with the old instructions when, in fact, it's trying to evaluate the cost of the instruction post-vectorization. Thus, the current system can get the cost of certain casts incorrect as the correct cost can vary greatly based on the context in which it's used. For example, if the vectorizer queries getCastInstrCost to evaluate the cost of a sext(load) with tail predication enabled, getCastInstrCost will think it's free most of the time, but it's not always free. On ARM MVE, a VLD2 group cannot be extended like a normal VLDR can. Similar situations can come up with how masked loads can be extended when being split. To fix that, this path adds a new parameter to getCastInstrCost to give it a hint about the context of the cast. It adds a CastContextHint enum which contains the type of the load/store being created by the vectorizer - one for each of the types it can produce. Original patch by Pierre van Houtryve Differential Revision: https://reviews.llvm.org/D79162	2020-07-29 13:32:53 +01:00
Pushpinder Singh	62153b67c4	[CMAKE] Fix 'clean' target not working cmake was still considering the empty value of ${fake_version_inc} even if it was not defined. Reviewed By: vsapsai Differential Revision: https://reviews.llvm.org/D82847	2020-07-29 07:34:24 -04:00
Simon Pilgrim	58535386bb	[TTI] Add default cost expansion for abs/smax/smin/umax/umin intrinsics	2020-07-29 12:13:06 +01:00
Xing GUO	8f65f802cb	[DWARFYAML] Replace uint_t with yaml::Hex in the 'debug_aranges' entry. Normally, we use yaml::Hex* to describe the length, offsets, address/segment size. NFC.	2020-07-29 16:43:21 +08:00
Stefanos Baziotis	9255f8a1f0	[ADT][BitVector][NFC] Merge find_first_in() / find_first_unset_in() We can implement find_first_unset_in() in the same function if every BitWord we use is first flipped. Differential Revision: https://reviews.llvm.org/D84717	2020-07-29 04:51:22 +03:00
Thomas Lively	e2b0ae5192	[WebAssembly] Remove intrinsics for SIMD widening ops Instead, pattern match extends of extract_subvectors to generate widening operations. Since extract_subvector is not a legal node, this is implemented via a custom combine that recognizes extract_subvector nodes before they are legalized. The combine produces custom ISD nodes that are later pattern matched directly, just like the intrinsic was. Also removes the clang builtins for these operations since the instructions can now be generated from portable code sequences. Differential Revision: https://reviews.llvm.org/D84556	2020-07-28 18:25:55 -07:00
Joel E. Denny	7663136ac6	[FileCheck] Extend -dump-input with substitutions Substitutions are already reported in the diagnostics appearing before the input dump in the case of failed directives, and they're reported in traces (produced by `-vv -dump-input=never`) in the case of successful directives. However, those reports are not always convenient to view while investigating the input dump, so this patch adds the substitution report to the input dump too. For example: ``` $ cat check CHECK: hello [[WHAT:[a-z]+]] CHECK: [[VERB]] [[WHAT]] $ FileCheck -vv -DVERB=goodbye check < input \|& tail -8 <<<<<< 1: hello world check:1 ^~~~~~~~~~~ 2: goodbye word check:2'0 X~~~~~~~~~~~ error: no match found check:2'1 with "VERB" equal to "goodbye" check:2'2 with "WHAT" equal to "world" >>>>>> ``` Without this patch, the location reported for a substitution for a directive match is the directive's full match range. This location is misleading as it implies the substitution itself matches that range. This patch changes the reported location to just the match range start to suggest the substitution is known at the start of the match. (As in the above example, input dumps don't mark any range for substitutions. The location info in that case simply identifies the right line for the annotation.) Reviewed By: mehdi_amini, thopre Differential Revision: https://reviews.llvm.org/D83650	2020-07-28 19:15:18 -04:00
Alina Sbirlea	fe169bb30a	[DominatorTree] Simplify ChildrenGetter. Summary: Simplify ChildrenGetter to a simple wrapper around a GraphDiff call. GraphDiff already handles nullptr in children, so the special casing in clang can also be removed. Reviewers: kuhar, dblaikie Subscribers: llvm-commits, cfe-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D84713	2020-07-28 15:44:20 -07:00
Johannes Doerfert	2e0011cdf0	[SROA][Mem2Reg] Use efficient droppable use API (after D83976) Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D84804	2020-07-28 17:41:01 -05:00
Matt Arsenault	c5f936ea9a	GlobalISel: Add typeIsNot LegalityPredicate This allows sorting the legal/custom rules first as is recommended	2020-07-28 16:49:55 -04:00
Mircea Trofin	d7c7affaca	[llvm][NFC] refactor setBlockFrequency for clarity. The refactoring encapsulates frequency calculation in MachineBlockFrequencyInfo, and renames the API to clarify its motivation. It should clarify frequencies may not be reset 'freely' by users of the analysis, as the API serves as a partial update to avoid a full analysis recomputation. Differential Revision: https://reviews.llvm.org/D84427	2020-07-28 13:04:11 -07:00
clementval	c0fb834caf	[openacc] Add missing newline at end of file	2020-07-28 15:26:27 -04:00
Bruno Ricci	eccc10ef08	[Support][NFC] Fix a Wdocumentation warning in FileSystem.h	2020-07-28 15:36:07 +01:00
Xing GUO	087a9bb761	[DWARFYAML] Add support for emitting custom range list content. This patch adds support for emitting custom range list content. We are able to handcraft a custom range list via the following syntax. ``` debug_rnglists: - Lists: - Entries: - Operator: DW_RLE_startx_endx Values: [ 0x1234, 0x1234 ] - Content: '1234567890abcdef' - Content: 'abcdef1234567890' ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D84618	2020-07-28 22:11:16 +08:00
Jinsong Ji	a3d207d6bc	Re-land "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit bf544fa1c3cb80f24d85e84559fb11193846259f. Fixed the typo in PPCInstrInfo.cpp.	2020-07-28 14:00:11 +00:00
Georgii Rymar	9db223cc66	[yaml2obj] - Add a way to override sh_type section field. This adds the `ShType` key similar to others `Sh*` keys we have. My use case is the following. Imagine we have a `SHT_SYMTAB_SHNDX` section and want to hide it from a dumper. The natural way would be to do something like: ``` - Name: .symtab_shndx Type: [[TYPE=SHT_SYMTAB_SHNDX]] Entries: [ 0, 1 ] ``` and then change the TYPE from `SHT_SYMTAB_SHNDX` to something else, for example to `SHT_PROGBITS`. But we have a problem: regular sections does not have `Entries` key, so yaml2obj will be unable to produce a section. The solution is to introduce a `ShType` key to override the final type. This is not the first time I am facing the need to change the type. I was able to invent workarounds or solved issues differently in the past, but finally came to conclusion that we just should support the `ShType`. Differential revision: https://reviews.llvm.org/D84738	2020-07-28 16:16:42 +03:00
Luofan Chen	99d029f616	[Attributor] Add override keyword to the print function of AA The print() function in the `AbstractAttribute` structure overrides the function in the `AADepGraphNode`, so we need to mark it as override. This should fix a buildbot failure introduced by 5ee07dc.	2020-07-28 19:25:20 +08:00
Evgeniy Brevnov	06a0c0fb9c	[BPI][NFC] Consolidate code to deal with SCCs under a dedicated data structure. In order to facilitate review of D79485 here is a small NFC change which restructures code around handling of SCCs in BPI. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D84514	2020-07-28 17:42:33 +07:00
Luofan Chen	bd1fad1fc2	[Attributor] Track AA dependency using dependency graph This patch added dependency graph to the attributor so that we can dump the dependencies between AAs more easily. We can also apply general graph algorithms to the graph, making it easier for us to create deep wrappers. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D78861	2020-07-28 18:02:49 +08:00
Serge Pavlov	ed6fb746cb	[Support] Add file lock/unlock functions This is recommit of f51bc4fb60fb, reverted in 8577595e03fa, because the function `flock` is not available on Solaris. In this variant `flock` was replaced with `fcntl`, which is a POSIX function. New functions `lockFile`, `tryLockFile` and `unlockFile` implement simple file locking. They lock or unlock entire file. This must be enough to support simulataneous writes to log files in parallel builds. Differential Revision: https://reviews.llvm.org/D78896	2020-07-28 16:44:23 +07:00
Wei Mi	51d4708437	Supplement instr profile with sample profile. PGO profile is usually more precise than sample profile. However, PGO profile needs to be collected from loadtest and loadtest may not be representative enough to the production workload. Sample profile collected from production can be used as a supplement -- for functions cold in loadtest but warm/hot in production, we can scale up the related function in PGO profile if the function is warm or hot in sample profile. The implementation contains changes in compiler side and llvm-profdata side. Given an instr profile and a sample profile, for a function cold in PGO profile but warm/hot in sample profile, llvm-profdata will either mark all the counters in the profile to be -1 or scale up the max count in the function to be above hot threshold, depending on the zero counter ratio in the profile. The assumption is if there are too many counters being zero in the function profile, the profile is more likely to cause harm than good, then llvm-profdata will mark all the counters to be -1 indicating the function is hot but the profile is unaccountable. In compiler side, if a function profile with all -1 counters is seen, the function entry count will be set to be above hot threshold but its internal profile will be dropped. In the long run, it may be useful to let compiler support using PGO profile and sample profile at the same time, but that requires more careful design and more substantial changes to make two profiles work seamlessly. The patch here serves as a simple intermediate solution. Differential Revision: https://reviews.llvm.org/D81981	2020-07-27 20:17:40 -07:00
Alina Sbirlea	f4701f9f4b	[CFGDiff] Refactor Succ/Pred maps. Summary: Refactor Succ/Pred maps to have a single map lookup when constructing children. The preivous desing made sense when used by GraphTraits. This more closely matches the previous approach in DomTree. Reviewers: dblaikie Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D84567	2020-07-27 18:02:52 -07:00
Alina Sbirlea	98ef7ef22b	[DomTree] Remove dead code.[NFC]	2020-07-27 18:02:52 -07:00
Alina Sbirlea	bb6015d9c6	[GraphDiff] Use class method getChildren instead of GraphTraits. Summary: Use getChildren() method in GraphDiff instead of GraphTraits. This simplifies the code and allows for refactorigns inside GraphDiff. All usecase need not have a light-weight/copyable range. Clean GraphTraits implementation. Reviewers: dblaikie Subscribers: hiraditya, llvm-commits, george.burgess.iv Tags: #llvm Differential Revision: https://reviews.llvm.org/D84562	2020-07-27 16:12:34 -07:00
Arthur Eubanks	8826e94360	[DFSan] Remove unused DataFlowSanitizer vars Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D84704	2020-07-27 14:59:07 -07:00
Jinsong Ji	89408b2ab3	Revert "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit adffce71538e219aab4eeb024819baa7687262ff. This is breaking test-suite, revert while investigation.	2020-07-27 21:07:00 +00:00
Guillaume Chatelet	041ab5526b	[NFC] Move BitcodeCommon.h from Bitstream to Bitcode	2020-07-27 20:49:17 +00:00
Roman Lebedev	610300535f	[OpenMPOpt] Most SCC's are uninteresting, don't waste time on them (up to 16x faster) Summary: This seems obvious in hindsight, but the result is surprising. I've measured compile-time of `-openmpopt` pass standalone on RawSpeed unity build, and while there is some OpenMP stuff, most is not OpenMP. But nonetheless the pass does a lot of costly preparations before ever trying to look for OpenMP stuff in SCC. Numbers (n=25): 0.094624s -> 0.005976s, an -93.68% improvement, or ~16x Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: yaxunl, hiraditya, guansong, llvm-commits, sstefan1 Tags: #llvm Differential Revision: https://reviews.llvm.org/D84689	2020-07-27 23:36:34 +03:00
Jinsong Ji	2d65e976a4	[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support Per RFC http://lists.llvm.org/pipermail/llvm-dev/2020-April/141295.html no one is making use of QPX/A2Q/BGQ/BGP CNK anymore. This patch remove the support of QPX/A2Q in llvm, BGQ/BGP in clang, CNK support in openmp/polly. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D83915	2020-07-27 19:24:39 +00:00
Jon Roelofs	f305e07c2d	[TableGen][CGS] Print better errors on overlapping InstRW Differential Revision: https://reviews.llvm.org/D83588	2020-07-27 09:41:10 -06:00
Vy Nguyen	602a8b1df5	[llvm-exegesis] Check perf_branch_entry for field cycles Summary: Follow up to breakages reported in D77422 Reviewers: ondrasej, gchatelet Tags: #llvm Differential Revision: https://reviews.llvm.org/D84076	2020-07-27 11:31:13 -04:00
Simon Pilgrim	6a70e74d4b	IRPrintingPasses.h - simplify unnecessary header with forward declarations. NFC. Remove duplicate PassManager.h include in IRPrintingPasses.cpp	2020-07-27 14:51:28 +01:00
Guillaume Chatelet	7ab8564b32	[Alignment][NFC] Update Bitcodewriter to use Align Differential Revision: https://reviews.llvm.org/D83533	2020-07-27 08:16:45 +00:00
David Sherwood	7010946aee	[SVE] Don't use LocalStackAllocation for SVE objects I have introduced a new TargetFrameLowering query function: isStackIdSafeForLocalArea that queries whether or not it is safe for objects of a given stack id to be bundled into the local area. The default behaviour is to always bundle regardless of the stack id, however for AArch64 this is overriden so that it's only safe for fixed-size stack objects. There is future work here to extend this algorithm for multiple local areas so that SVE stack objects can be bundled together and accessed from their own virtual base-pointer. Differential Revision: https://reviews.llvm.org/D83859	2020-07-27 08:22:01 +01:00
Yuanfang Chen	8d83600e34	[NewPM] NFC. remove obsolete TODO comment The deleted TODO was implemented in D82344.	2020-07-26 22:32:24 -07:00
Xing GUO	33fdabb077	[DWARFYAML] Rename getUsedSectionNames() to getNonEmptySectionNames(). This patch renames getUsedSectionNames() to getNonEmptySectionNames. NFC.	2020-07-26 21:10:38 +08:00
Lang Hames	ef607c6171	[ORC] Rename TargetProcessControl DynamicLibraryHandle and loadLibrary. The new names, DylibHandle and loadDylib, are more concise and make clear that these utilities are for loading dynamic libraries, not static ones.	2020-07-25 15:21:43 -07:00
Lang Hames	8dd38c568c	[ORC] Don't require PageSize or Triple during TargetProcessControl construction Subclasses will commonly gather that information from a remote during construction, in which case they won't have meaningful values to pass to TargetProcessControl's constructor.	2020-07-25 15:21:43 -07:00
Simon Pilgrim	b71064b07e	SimplifyLibCalls - remove unnecessary header and forward declaration. NFC. We include TargetLibraryInfo.h so don't need to forward declare it, and we don't need to include TargetLibraryInfo.h in SimplifyLibCalls.cpp as well.	2020-07-25 12:58:39 +01:00
Simon Pilgrim	a3ff65cfbb	SymbolRemappingReader.h - pass Twine by reference not value. NFCI.	2020-07-25 12:48:14 +01:00
Nathan James	8b5de07c6a	[ADT] Add a range-based version of std::move Adds a range-based version of `std::move`, the version that moves a range, not the one that creates r-value references. Reviewed By: dblaikie, gamesh411 Differential Revision: https://reviews.llvm.org/D83902	2020-07-25 10:37:34 +01:00
Matt Arsenault	16ca0e0369	GlobalISel: Define mulfix/divfix opcodes The full expansion involves the funnel shifts, which depend on another patch to expand those.	2020-07-24 20:02:20 -04:00
Alina Sbirlea	0651a23551	Reapply "[DomTree] Replace ChildrenGetter with GraphTraits over GraphDiff." This is the part of the patch that's moving the Updates to a CFGDiff object. Splitting off from the clean-up work merging the two branches when BUI is null. Differential Revision: https://reviews.llvm.org/D77341	2020-07-24 14:10:50 -07:00
Johannes Doerfert	ac3ceab3a2	[Mem2Reg] Teach promote to register about droppable instructions This is the first of two patches to address PR46753. We basically allow mem2reg to promote allocas that are used in doppable instructions, for now that means `llvm.assume`. The uses of the alloca (or a bitcast or zero offset GEP from there) are replaced by `undef` in the droppable instructions. Reviewed By: Tyker Differential Revision: https://reviews.llvm.org/D83976	2020-07-24 15:15:38 -05:00
Valentin Clement	d92fc8cf98	[openmp] Clean up OMPKinds.def remove OMP_DIRECTIVE This patch removes the OMP_DIRECTIVE definition from OMPKinds.def since they are now defined in OMP.td and OMP_DIRECTIVE is not used anymore in the code. Reviewed By: jdenny Differential Revision: https://reviews.llvm.org/D84329	2020-07-24 15:06:54 -04:00
Nicolai Hähnle	005524128f	MachineBasicBlock: add printName method Common up some existing MBB name printing logic into a single place. Note that basic block dumping now prints the same set of attributes as the MIRPrinter. Change-Id: I8f022bbd922e831bc96d63143d7472c03282530b Differential Revision: https://reviews.llvm.org/D83253	2020-07-24 18:18:09 +02:00
Simon Pilgrim	a4fb77451c	Revert rG5dd566b7c7b78bd- "PassManager.h - remove unnecessary Function.h/Module.h includes. NFCI." This reverts commit 5dd566b7c7b78bd385418c72d63c79895be9ae97. Causing some buildbot failures that I'm not seeing on MSVC builds.	2020-07-24 13:02:33 +01:00
Simon Pilgrim	32d0701fa1	PassManager.h - remove unnecessary Function.h/Module.h includes. NFCI. PassManager.h is one of the top headers in the ClangBuildAnalyzer frontend worst offenders list. This exposes a large number of implicit dependencies on various forward declarations/includes in other headers that need addressing.	2020-07-24 12:40:50 +01:00
Xing GUO	9f9286af5c	[DWARFYAML] Replace 'Format', 'Version', etc with 'FormParams'. NFC. This patch replaces 'Format', 'Version' fields, etc with 'FormParams' to simplify codes. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D84496	2020-07-24 16:54:51 +08:00
Petr Hosek	300de51f92	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-07-23 23:05:36 -07:00
Mircea Trofin	c07fcb7a55	[llvm][NFC] Don't use llvm/Config/config.h in .h files config.h is excluded from installs, llvm-config.h isn't Differential Revision: https://reviews.llvm.org/D84459	2020-07-23 22:27:38 -07:00
Matt Arsenault	929709ee06	GlobalISel: Add scalarSameSizeAs LegalizeRule Widen or narrow a type to a type with the same scalar size as another. This can be used to force G_PTR_ADD/G_PTRMASK's scalar operand to match the bitwidth of the pointer type. Use this to disallow narrower types for G_PTRMASK.	2020-07-23 21:17:31 -04:00
Matt Arsenault	4fa5e95608	GlobalISel: Drop original type pointeriness in minScalarSameAs It is not useful to report WidenScalar for a pointer value, so always report a scalar value with the target size. This allows using this to clamp the scalar operand to the pointer size in operations like G_PTR_ADD or G_PTRMASK.	2020-07-23 21:17:18 -04:00
Matt Arsenault	b410d6597f	GlobalISel: Fix maxScalarIf predicate check The std::function itself was tested, not the result of the actual query. It seems like there should be a warning for this. Inline the check to avoid this.	2020-07-23 21:17:10 -04:00
Lang Hames	85d946b864	[ORC] Enable use of TargetProcessControl::getMemMgr with ObjectLinkingLayer. This patch makes ownership of the JITLinkMemoryManager by ObjectLinkingLayer optional: the layer can still own the memory manager but no longer has to. Evevntually we want to move to a state where ObjectLinkingLayer never owns its memory manager. For now allowing optional ownership makes it easier to develop classes that can dynamically use either RTDyldObjectLinkingLayer, which owns its memory managers, or ObjectLinkingLayer (e.g. LLJIT).	2020-07-23 16:18:57 -07:00

... 3 4 5 6 7 ...

42173 Commits