llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Christopher Tetreault	2eb769f5ef	[SVE] Remove bad calls to VectorType::getNumElements() from X86 Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D85156	2020-08-03 16:34:10 -07:00
hgreving	1fccd57ea4	[MC] Fix memory leak when allocating MCInst with bump allocator Adds the function createMCInst() to MCContext that creates a MCInst using a typed bump alloctor. MCInst contains a SmallVector<MCOperand, 8>. The SmallVector is POD only for <= 8 operands. The default untyped bump pointer allocator of MCContext does not delete the MCInst, so if the SmallVector grows, it's a leak. This fixes https://bugs.llvm.org/show_bug.cgi?id=46900.	2020-08-03 16:08:26 -07:00
Christopher Tetreault	c2ae55c000	[SVE] Remove bad call to VectorType::getNumElements() from AMDGPU Differential Revision: https://reviews.llvm.org/D85151	2020-08-03 15:56:10 -07:00
Christopher Tetreault	048aa8f2c1	[SVE] Remove bad call to VectorType::getNumElements() from ARM Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D85152	2020-08-03 15:41:14 -07:00
Jordan Rupprecht	cec25d0577	[NFC][ARM] Silence unused variable in release builds	2020-08-03 15:21:44 -07:00
Christopher Tetreault	055ddf033e	[SVE] Remove bad calls to VectorType::getNumElements() from PowerPC Differential Revision: https://reviews.llvm.org/D85154	2020-08-03 15:15:20 -07:00
Alina Sbirlea	592b072474	[MemorySSA] Restrict optimizations after a PhiTranslation. Merging alias results from different paths, when a path did phi translation is not necesarily correct. Conservatively terminate such paths. Aimed to fix PR46156. Differential Revision: https://reviews.llvm.org/D84905	2020-08-03 14:46:41 -07:00
Mitch Phillips	89e3449531	[HWASan] [GlobalISel] Add +tagged-globals backend feature for GlobalISel GlobalISel is the default ISel for aarch64 at -O0. Prior to D78465, GlobalISel didn't have support for dealing with address-of-global lowerings, so it fell back to SelectionDAGISel. HWASan Globals require special handling, as they contain the pointer tag in the top 16-bits, and are thus outside the code model. We need to generate a `movk` in the instruction sequence with a G3 relocation to ensure the bits are relocated properly. This is implemented in SelectionDAGISel, this patch does the same for GlobalISel. GlobalISel and SelectionDAGISel differ in their lowering sequence, so there are differences in the final instruction sequence, explained in `tagged-globals.ll`. Both of these implementations are correct, but GlobalISel is slightly larger code size / slightly slower (by a couple of arithmetic instructions). I don't see this as a problem for now as GlobalISel is only on by default at `-O0`. Reviewed By: aemerson, arsenm Differential Revision: https://reviews.llvm.org/D82615	2020-08-03 14:28:44 -07:00
David Green	fcac3fa8b2	[ARM] Convert VPSEL to VMOV in tail predicated loops VPSEL has slightly different semantics under tail predication (it can end up selecting from Qn, Qm and Qd). We do not model that at the moment so they block tail predicated loops from being formed. This just converts them into a predicated VMOV instead (via a VORR), allowing tail predication to happen whilst still modelling the original behaviour of the input. Differential Revision: https://reviews.llvm.org/D85110	2020-08-03 22:03:14 +01:00
Thomas Lively	d648be35e8	[WebAssembly] Implement prototype v128.load{32,64}_zero instructions Specified in https://github.com/WebAssembly/simd/pull/237, these instructions load the first vector lane from memory and zero the other lanes. Since these instructions are not officially part of the SIMD proposal, they are only available on an opt-in basis via LLVM intrinsics and clang builtin functions. If these instructions are merged to the proposal, this implementation will change so that the instructions will be generated from normal IR. At that point the intrinsics and builtin functions would be removed. This PR also changes the opcodes for the experimental f32x4.qfm{a,s} instructions because their opcodes conflicted with those of the v128.load{32,64}_zero instructions. The new opcodes were chosen to match those used in V8. Differential Revision: https://reviews.llvm.org/D84820	2020-08-03 13:54:00 -07:00
Mitch Phillips	1153fb0830	Revert "[X86][SSE] Shuffle combine blends to OR(X,Y) if the relevant elements are known zero." This reverts commit 219f32f4b68679563443cdaae7b8174c9976409a. Commit contains unsigned compasions that break bots that build with -Wsign-compare.	2020-08-03 13:48:30 -07:00
David Green	f605622825	[ARM] Test for converting VPSEL to VMOVT. NFC	2020-08-03 21:46:07 +01:00
Fangrui Song	6fcac4f990	[MC] Set sh_link to 0 if the associated symbol is undefined Part of https://bugs.llvm.org/show_bug.cgi?id=41734 LTO can drop externally available definitions. Such AssociatedSymbol is not associated with a symbol. ELFWriter::writeSection() will assert. Allow a SHF_LINK_ORDER section to have sh_link=0. We need to give sh_link a syntax, a literal zero in the linked-to symbol position, e.g. `.section name,"ao",@progbits,0` Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D72899	2020-08-03 13:43:48 -07:00
Jon Roelofs	040e0990b8	Fix typo: s/epomymous/eponymous/ NFC	2020-08-03 14:09:46 -06:00
Eli Friedman	baa9fabf16	[AArch64] Add missing isel patterns for fcvtzs/u intrinsic on v1f64. Fixes test-suite compile failure caused by 8dfb5d7. While I'm in the area, add some more test coverage to related operations, to make sure we aren't missing any other patterns.	2020-08-03 13:04:59 -07:00
Lang Hames	ec5336f4a3	[llvm-jitlink] Add support for static archives and MachO universal archives. Archives can now be specified as input files the same way that object files are. Archives will always be linked after all objects (regardless of the relative order of the inputs) but before any dynamic libraries or process symbols. This patch also relaxes matching for slice triples in StaticLibraryDefinitionGenerator in order to support this feature: Vendors need not match if the source vendor is unknown.	2020-08-03 12:58:00 -07:00
Hiroshi Yamauchi	4ea5511359	[PGO] Enable the extended value profile buckets for mem op sizes. Following up D81682 and enable the new, extended value profile buckets for mem op sizes. Differential Revision: https://reviews.llvm.org/D83903	2020-08-03 12:25:11 -07:00
Sanjay Patel	e545f2a08e	[InstSimplify] fold variations of max-of-min with common operand https://alive2.llvm.org/ce/z/ZtxpZ3	2020-08-03 15:02:46 -04:00
Sanjay Patel	7591195747	[InstSimplify] add tests for min-of-max variants; NFC	2020-08-03 15:02:46 -04:00
Arthur Eubanks	e7b631c76f	Fix layering violation Transforms/Utils -> Scalar Introduced in D85063.	2020-08-03 11:53:23 -07:00
Jian Cai	7f1bdb7b20	[X86] support .nops directive Add support of .nops on X86. This addresses llvm.org/PR45788. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D82826	2020-08-03 11:50:56 -07:00
Florian Hahn	53e60d5bc6	[ArgPromotion] Replace all md uses of promoted values with undef. Currently, ArgPromotion may leave metadata uses of promoted values, which will end up in the wrong function, creating invalid IR. PR33641 fixed this for dead arguments, but it can be also be triggered arguments with users that are promoted (see the updated test case). We also have to drop uses to them after promoting them. We need to do this after dealing with the non-metadata uses, so I also moved the empty use case to the loop that deals with updating the arguments of the new function. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D85127	2020-08-03 19:31:53 +01:00
LLVM GN Syncbot	b84726b927	[gn build] Port f78f509c758	2020-08-03 18:05:15 +00:00
Hiroshi Yamauchi	0b0a5993c1	[PGO] Extend the value profile buckets for mem op sizes. Extend the memop value profile buckets to be more flexible (could accommodate a mix of individual values and ranges) and to cover more value ranges (from 11 to 22 buckets). Disabled behind a flag (to be enabled separately) and the existing code to be removed later. Differential Revision: https://reviews.llvm.org/D81682	2020-08-03 11:04:32 -07:00
Joao Moreira	417137b6f5	[X86] Make ENDBR instruction a scheduling boundary Instructions should not be scheduled across ENDBR instructions, as this would result in the ENDBR being displaced, breaking the parity needed for the Indirect Branch Tracking feature of CET. Currently, the X86IndirectBranchTracking pass is later than the instruction scheduling in the pipeline, what causes the bug to be unnoticeable and very hard (if not unfeasible) to be triggered while compiling C files with the standard LLVM setup. Yet, for correctness and to prevent issues in future changes, the compiler should prevent the such scheduling. Differential Revision: https://reviews.llvm.org/D84862	2020-08-03 10:47:23 -07:00
Simon Pilgrim	be3e81f435	[X86][SSE] Shuffle combine blends to OR(X,Y) if the relevant elements are known zero. This allows us to remove the (depth violating) code in getFauxShuffleMask where we were combining the OR(SHUFFLE,SHUFFLE) shuffle inputs as well, and not just the OR(). This is a minor step toward being able to shuffle combine from/to SELECT/BLENDV as a faux shuffle.	2020-08-03 18:32:47 +01:00
Arthur Eubanks	77ef1e98d6	[NewPM][LoopVersioning] Port LoopVersioning to NPM Reviewed By: ychen, fhahn Differential Revision: https://reviews.llvm.org/D85063	2020-08-03 10:32:09 -07:00
Kevin P. Neal	0c24e52a4d	[FPEnv] IRBuilder fails to add strictfp attribute The strictfp attribute is required on all function calls in a function that is itself marked with the strictfp attribute. The IRBuilder knows this and has a method for adding the attribute to function call instructions. If a function being called has the strictfp attribute itself then the IRBuilder will refuse to add the attribute to the calling instruction despite being asked to add it. Eliminate this error. Differential Revision: https://reviews.llvm.org/D84878	2020-08-03 13:25:24 -04:00
Craig Topper	fb9c043569	[X86] Use h-register for final XOR of __builtin_parity on 64-bit targets. This adds an isel pattern and special XOR8rr_NOREX instruction to enable the use of h-registers for __builtin_parity. This avoids a copy and a shift instruction. The NOREX instruction is in case register allocation doesn't use the matching l-register for some reason. If a R8-R15 register gets picked instead, we won't be able to encode the instruction since an h-register can't be used with a REX prefix. Fixes PR46954	2020-08-03 10:10:17 -07:00
Mircea Trofin	1cbf2902fb	[llvm] Add a parser from JSON to TensorSpec A JSON->TensorSpec utility we will use subsequently to specify additional outputs needed for certain training scenarios. Differential Revision: https://reviews.llvm.org/D84976	2020-08-03 09:49:31 -07:00
Gui Andrade	aa6ea791fa	[Utils] Add noundef attribute to vim/emacs/vscode syntax scripts Differential Revision: https://reviews.llvm.org/D84553	2020-08-03 16:45:35 +00:00
Gui Andrade	23e6a33fdb	[MSAN] Instrument freeze instruction by clearing shadow Freeze always returns a defined value. This also prevents msan from checking the input shadow, which happened because freeze wasn't explicitly visited. Differential Revision: https://reviews.llvm.org/D85040	2020-08-03 16:42:17 +00:00
Florian Hahn	6d82efa764	[SCEV] If Start>=RHS, simplify (Start smin RHS) = RHS for trip counts. In some cases, it seems like we can get rid of unnecessary s/umins by using information from the loop guards (unless I am missing something). One place where this seems to be helpful in practice is when computing loop trip counts. This patch just changes howManyGreaterThans for now. Note that this requires a loop for which we can check 'is guarded'. On SPEC2000/SPEC2006/MultiSource, there are some notable changes for some programs in the number of loops unrolled and trip counts computed. ``` Same hash: 179 (filtered out) Remaining: 58 Metric: scalar-evolution.NumTripCountsComputed Program base patch diff test-suite...langs-C/compiler/compiler.test 25.00 31.00 24.0% test-suite.../Applications/SPASS/SPASS.test 2020.00 2323.00 15.0% test-suite...langs-C/allroots/allroots.test 29.00 32.00 10.3% test-suite.../Prolangs-C/loader/loader.test 17.00 18.00 5.9% test-suite...fice-ispell/office-ispell.test 253.00 265.00 4.7% test-suite...006/450.soplex/450.soplex.test 3552.00 3692.00 3.9% test-suite...chmarks/MallocBench/gs/gs.test 453.00 470.00 3.8% test-suite...ngs-C/assembler/assembler.test 29.00 30.00 3.4% test-suite.../Benchmarks/Ptrdist/bc/bc.test 263.00 270.00 2.7% test-suite...rks/FreeBench/pifft/pifft.test 722.00 741.00 2.6% test-suite...count/automotive-bitcount.test 41.00 42.00 2.4% test-suite...0/253.perlbmk/253.perlbmk.test 1417.00 1451.00 2.4% test-suite...000/197.parser/197.parser.test 387.00 396.00 2.3% test-suite...lications/sqlite3/sqlite3.test 1168.00 1189.00 1.8% test-suite...000/255.vortex/255.vortex.test 173.00 176.00 1.7% Metric: loop-unroll.NumUnrolled Program base patch diff test-suite...langs-C/compiler/compiler.test 1.00 3.00 200.0% test-suite.../Applications/SPASS/SPASS.test 134.00 234.00 74.6% test-suite...count/automotive-bitcount.test 3.00 4.00 33.3% test-suite.../Prolangs-C/loader/loader.test 3.00 4.00 33.3% test-suite...langs-C/allroots/allroots.test 3.00 4.00 33.3% test-suite...Source/Benchmarks/sim/sim.test 10.00 12.00 20.0% test-suite...fice-ispell/office-ispell.test 21.00 25.00 19.0% test-suite.../Benchmarks/Ptrdist/bc/bc.test 32.00 38.00 18.8% test-suite...006/450.soplex/450.soplex.test 300.00 352.00 17.3% test-suite...rks/FreeBench/pifft/pifft.test 60.00 69.00 15.0% test-suite...chmarks/MallocBench/gs/gs.test 57.00 63.00 10.5% test-suite...ngs-C/assembler/assembler.test 10.00 11.00 10.0% test-suite...0/253.perlbmk/253.perlbmk.test 145.00 157.00 8.3% test-suite...000/197.parser/197.parser.test 43.00 46.00 7.0% test-suite...TimberWolfMC/timberwolfmc.test 205.00 214.00 4.4% Geomean difference 7.6% ``` Fixes https://bugs.llvm.org/show_bug.cgi?id=46939 Fixes https://bugs.llvm.org/show_bug.cgi?id=46924 on X86. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D85046	2020-08-03 17:22:42 +01:00
Cameron McInally	60134c9850	[FPEnv] Don't transform FSUB(-0,X)->FNEG(X) in SelectionDAGBuilder. This patch stops unconditionally transforming FSUB(-0,X) into an FNEG(X) while building the DAG. There is also one small change to handle the new FSUB(-0,X) similarly to FNEG(X) in the AMDGPU backend. Differential Revision: https://reviews.llvm.org/D84056	2020-08-03 10:22:25 -05:00
Xing GUO	8d54ee8a03	[DWARFYAML] Implement the .debug_loclists section. This patch implements the .debug_loclists section. There are only two DWARF expressions are implemented in this patch (DW_OP_consts, DW_OP_stack_value). We will implement more in the future. The YAML description of the .debug_loclists section is: ``` debug_loclists: - Format: DWARF32 ## Optional Length: 0x1234 ## Optional Version: 5 ## Optional (5 by default) AddressSize: 8 ## Optional SegmentSelectorSize: 0 ## Optional (0 by default) OffsetEntryCount: 1 ## Optional Offsets: [ 1 ] ## Optional Lists: - Entries: - Operator: DW_LLE_startx_endx Values: [ 0x1234, 0x4321 ] DescriptorsLength: 0x1234 ## Optional Descriptors: - Operator: DW_OP_consts Values: [ 0x1234 ] ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D84234	2020-08-03 23:20:15 +08:00
Shinji Okumura	4e0af248b2	[NFC][APInt][DenseMapInfo] Move DenseMapAPIntKeyInfo into DenseMap.h as DenseMapInfo<APInt> `DenseMapAPIntKeyInfo` is now located in `lib/IR/LLVMContextImpl.h`. Moved it into `include/ADT/DenseMapInfo.h` to use it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85131	2020-08-03 23:31:13 +09:00
Sanjay Patel	7714e21eba	[InstCombine] reduce xor-of-or's bitwise logic (PR46955); 2nd try The 1st try at this (rG2265d01f2a5b) exposed what looks like unspecified behavior in C/C++ resulting in test variations. The arguments to BinaryOperator::CreateAnd() were both IRBuilder function calls, and the order in which they execute determines the order of the new instructions in the IR. But the order of function arg evaluation is not fixed by the rules of C/C++, so depending on compiler config, the test would fail because the test expected a single fixed ordering of instructions. Original commit message: I tried to use m_Deferred() on this, but didn't find a clean way to do that. http://bugs.llvm.org/PR46955 https://alive2.llvm.org/ce/z/2h6QTq	2020-08-03 10:21:56 -04:00
Xing GUO	f5eb618063	[DWARFYAML] Offsets should be omitted when the OffsetEntryCount is 0. The offsets field should be omitted when the 'OffsetEntryCount' entry is specified to be 0. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85006	2020-08-03 22:06:08 +08:00
Matt Arsenault	c177fc936f	GlobalISel: Handle arbitrary FewerElementsVector for G_IMPLICIT_DEF	2020-08-03 09:14:08 -04:00
Matt Arsenault	7162966fa9	AMDGPU/GlobalISel: Remove old hacks for boolean selection There were various hacks used to try to avoid making s1 SGPR vs. s1 VCC ambiguous after constraining the register before we had a strategy to deal with this. This also attempted to handle undef operands, which are now illegal gMIR.	2020-08-03 09:04:14 -04:00
Matt Arsenault	7371d3a454	GlobalISel: Reimplement moreElementsVectorDst Use pad with undef and unmerge with unused results. This is annoyingly similar to several other places in LegalizerHelper, but they're all slightly different.	2020-08-03 09:03:48 -04:00
Sanjay Patel	695d11fd09	Revert "[InstCombine] reduce xor-of-or's bitwise logic (PR46955)" This reverts commit 2265d01f2a5bd153959701e22f5be2a40e1674a3. Seeing bot failures after this change like: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/42586	2020-08-03 08:58:41 -04:00
Matt Arsenault	59ad55e97d	AMDGPU/GlobalISel: Apply load bitcast to s.buffer.load intrinsic Should also apply this to the non-scalar buffer loads.	2020-08-03 08:54:29 -04:00
Simon Pilgrim	38a46e3c56	[X86][SSE] Start shuffle combining from ANY_EXTEND_VECTOR_INREG on SSE targets We already do this on AVX (+ for ZERO_EXTEND_VECTOR_INREG), but this enables it for all SSE targets - we attempted something similar back at rL357057 but hit issues with the ZERO_EXTEND_VECTOR_INREG handling (PR41249). I'm still looking at the vector-mul.ll regression - which is due to 32-bit targets performing the load as a f64, resulting in the shuffle combiner thinking it has to create a shuffle in the float domain.	2020-08-03 13:41:48 +01:00
Matt Arsenault	e7682e1add	AMDGPU/GlobalISel: Fix selecting broken copies for s32->s64 anyext These should probably not be legal in the first place, but that might also be a pain.	2020-08-03 08:36:41 -04:00
Sanjay Patel	a8fb4d6932	[InstCombine] reduce xor-of-or's bitwise logic (PR46955) I tried to use m_Deferred() on this, but didn't find a clean way to do that. http://bugs.llvm.org/PR46955 https://alive2.llvm.org/ce/z/2h6QTq	2020-08-03 08:31:43 -04:00
Sanjay Patel	ad3d7a2e06	[InstCombine] add tests for xor-of-ors; NFC	2020-08-03 08:31:43 -04:00
Nicholas Guy	a32b9bcdd6	[ARM] Fix IT block generation after Thumb2SizeReduce with -Oz Fixes a regression caused by D82439, in which IT blocks were no longer being generated when -Oz is present. This was due to the CPSR register being marked as dead, while this case was not accounted for. Differential Revision: https://reviews.llvm.org/D83667	2020-08-03 13:20:32 +01:00
Georgii Rymar	d73aa6dbae	[llvm-readobj] - Don't stop dumping when the name of a relocation section can't be read. This removes undesired `unwrapOrError` calls from printRelocations() methods. Differential revision: https://reviews.llvm.org/D84408	2020-08-03 14:49:09 +03:00
Georgii Rymar	2f6470d04a	[llvm-readobj] - Massive test cases cleanup. This patch does the following: 1) Starts using YAML macro to reduce the number of YAML documents in tests. 2) Adds `#` before 'RUN'/`CHECK` lines in a few tests where it is missing. 3) Removes unused YAML keys. 4) Starts using `ENTSIZE=<none>` to simplify tests (see D84526). 5) Removes trailing white spaces in a few places. Differential revision: https://reviews.llvm.org/D85013	2020-08-03 14:07:06 +03:00

1 2 3 4 5 ...

201309 Commits