llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Nicolai Hähnle	d4eb4d6675	DomTree: Remove getChildren() accessor Summary: Avoid exposing details about how children are stored. This will enable subsequent type-erasure changes. New methods are introduced to cover common access patterns. Change-Id: Idb5f4b1b9c84e4cc71ddb39bb52a388682f5674f Reviewers: arsenm, RKSimon, mehdi_amini, courbet Subscribers: qcolombet, sdardis, wdng, hiraditya, jrtc27, zzheng, atanasyan, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83083	2020-07-06 21:58:11 +02:00
Wouter van Oortmerssen	850662dafb	[WebAssembly] Added 64-bit memory.grow/size/copy/fill This covers both the existing memory functions as well as the new bulk memory proposal. Added new test files since changes were also required in the inputs. Also removes unused init/drop intrinsics rather than trying to make them work for 64-bit. Differential Revision: https://reviews.llvm.org/D82821	2020-07-06 12:49:50 -07:00
Wouter van Oortmerssen	2d660a98ce	[WebAssembly] 64-bit memory limits	2020-07-06 12:40:45 -07:00
Kazushi (Jam) Marukawa	475cbdd9be	[VE] Support symbol with offset in assembly Summary: Change MCExpr to support Aurora VE's modifiers. Change asmparser to use existing MCExpr parser (parseExpression) to parse an expression containing symbols with modifiers and offsets. Also add several regression tests of MC layer. Reviewers: simoll, k-ishizaka Reviewed By: simoll Subscribers: hiraditya, llvm-commits Tags: #llvm, #ve Differential Revision: https://reviews.llvm.org/D83170	2020-07-07 04:16:51 +09:00
Kazushi (Jam) Marukawa	41dc1d54a5	[VE] Change to use isa Summary: Change to use isa instead of dyn_cast to avoid a warning. Reviewers: simoll, k-ishizaka Reviewed By: simoll Subscribers: hiraditya, llvm-commits Tags: #llvm, #ve Differential Revision: https://reviews.llvm.org/D83200	2020-07-07 03:48:49 +09:00
Matt Arsenault	6ca9a98429	AMDGPU: Don't ignore carry out user when expanding add_co_pseudo This was resulting in a missing vreg def in the use select instruction. The output of the pseudo doesn't make sense, since it really shouldn't have the vreg output in the first place, and instead an implicit scc def to match the real scalar behavior. We could have easier to understand tests if we selected scalar versions of the [us]{add\|sub}.with.overflow intrinsics. This does still end up producing vector code in the end, since it gets moved later.	2020-07-06 14:28:01 -04:00
Shuhong Liu	5279dece82	[AIX] Add system-aix to lit config file Summary: This is a complementary patch to D82100 since the aix buildbot is still running the unsupported test shtest-format-argv0. Add system-aix to the sub llvm-lit config. Reviewers: daltenty, hubert.reinterpretcast Reviewed By: hubert.reinterpretcast Subscribers: delcypher, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82905	2020-07-06 12:54:12 -04:00
Luís Marques	0a0548042f	[RISCV] Fold ADDIs into load/stores with nonzero offsets We can often fold an ADDI into the offset of load/store instructions: (load (addi base, off1), off2) -> (load base, off1+off2) (store val, (addi base, off1), off2) -> (store val, base, off1+off2) This is possible when off1+off2 still fits the 12-bit immediate. We remove the previous restriction where we would never fold the ADDIs if the load/stores had nonzero offsets. We now do the fold if the resulting constant still fits a 12-bit immediate, or if off1 is a variable's address and we know based on that variable's alignment that off1+off2 won't overflow. Differential Revision: https://reviews.llvm.org/D79690	2020-07-06 17:32:57 +01:00
jasonliu	3b7308f12c	[XCOFF][AIX] Give symbol an internal name when desired symbol name contains invalid character(s) Summary: When a desired symbol name contains invalid character that the system assembler could not process, we need to emit .rename directive in assembly path in order for that desired symbol name to appear in the symbol table. Reviewed By: hubert.reinterpretcast, DiggerLin, daltenty, Xiangling_L Differential Revision: https://reviews.llvm.org/D82481	2020-07-06 15:49:15 +00:00
Oliver Stannard	0a88afaed7	[Support] Fix formatted_raw_ostream for UTF-8 * The getLine and getColumn functions need to update the position, or they will return stale data for buffered streams. This fixes a bug in the clang -analyzer-checker-option-help option, which was not wrapping the help text correctly when stdout is not a TTY. * If the stream contains multi-byte UTF-8 sequences, then the whole sequence needs to be considered to be a single character. This has the edge case that the buffer might fill up and be flushed part way through a character. * If the stream contains East Asian wide characters, these will be rendered twice as wide as other characters, so we need to increase the column count to match. This doesn't attempt to handle everything unicode can do (combining characters, right-to-left markers, ...), but hopefully covers most things likely to be common in messages and source code we might want to print. Differential revision: https://reviews.llvm.org/D76291	2020-07-06 16:18:15 +01:00
Roman Lebedev	c8b6177910	Reland "[ScalarEvolution] createSCEV(): recognize `udiv`/`urem` disguised as an `sdiv`/`srem`" This reverts commit d3e3f36ff1151f565730977ac4f663a2ccee48ae, which reverted the original commit 2c16100e6f72075564ea1f67fa5a82c269dafcd3, but with polly tests now actually passing.	2020-07-06 18:00:22 +03:00
David Green	5c3a471846	[ARM] MVE FP16 cost adjustments This adjusts the MVE fp16 cost model, similar to how we already do for integer casts. It uses the base cost of 1 per cvt for most fp extend / truncates, but adjusts it for loads and stores where we know that an extending load has been used to get the load into the correct lane, and only an MVE VCVTB is then needed. Differential Revision: https://reviews.llvm.org/D81813	2020-07-06 15:57:51 +01:00
Mikhail Goncharov	c0ff9444d8	Revert "[ScalarEvolution] createSCEV(): recognize `udiv`/`urem` disguised as an `sdiv`/`srem`" Summary: This reverts commit 2c16100e6f72075564ea1f67fa5a82c269dafcd3. ninja check-polly fails: Polly :: Isl/CodeGen/MemAccess/generate-all.ll Polly :: ScopInfo/multidim_srem.ll Reviewers: kadircet, bollu Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83230	2020-07-06 16:41:59 +02:00
Florian Hahn	2341e47245	[LV] Pass dbgs() to verifyFunction call. This is done in other places of the pass already and improves the output on verification failure.	2020-07-06 15:09:20 +01:00
Sanjay Patel	9a3f0992c0	[x86] add tests for vector select with non-splat bit-test condition; NFC Goes with D83181.	2020-07-06 09:50:47 -04:00
David Green	0c1fba645a	[ARM] Adjust default fp extend and trunc costs This adds some default costs for fp extends and truncates, generally costing them as 1 per lane. If the type is not legal then the cost will include a call to an __aeabi_ function. Some NEON code is also adjusted to make sure it applies to the expected types, now that fp16 is a more common thing. Differential Revision: https://reviews.llvm.org/D82458	2020-07-06 14:23:17 +01:00
Matt Arsenault	44220c5748	GlobalISel: Move finalizeLowering call later This matches the DAG behavior where this is called after the loop checking for calls. The AMDGPU implementation depends on knowing if there are calls in the function or not, so move this later. Another problem is finalizeLowering is actually called twice; I was seeing weird inconsistencies since the first call would produce unexpected results and the second run would correct them in some contexts. Since this requires disabling the verifier, and it's useful to serialize the MIR immediately after selection, FinalizeISel should probably not be a real pass.	2020-07-06 09:19:40 -04:00
Matt Arsenault	3bb38d40fc	AMDGPU/GlobalISel: Don't emit code for unused kernel arguments	2020-07-06 09:04:06 -04:00
Matt Arsenault	e17fdb6b7f	AMDGPU/GlobalISel: Fix hardcoded register number checks in test	2020-07-06 09:01:59 -04:00
Matt Arsenault	9a4e2cafde	AMDGPU: Fix fixed ABI SGPR arguments The default constructor wasn't setting isSet on the ArgDescriptor, so while these had the value set, they were treated as missing. This only ended up mattering in the indirect call case (and for regular calls in GlobalISel, which currently doesn't have a way to support the variable ABI).	2020-07-06 09:01:18 -04:00
Matt Arsenault	25a6449fdc	AMDGPU/GlobalISel: Add some missing return tests	2020-07-06 09:01:18 -04:00
Simon Pilgrim	7f40cfe2c5	[X86][XOP] Add XOP target vselect-pcmp tests Noticed in the D83181 that XOP can probably do a lot more than other targets due to its vector shifts and vpcmov instructions	2020-07-06 13:58:26 +01:00
Simon Pilgrim	1700c587ab	Regenerate subreg liverange tests. NFC. To simplify the diffs in a patch in development.	2020-07-06 13:58:25 +01:00
Simon Pilgrim	db4d0a9002	Regenerate neon copy tests. NFC. To simplify the diffs in a patch in development.	2020-07-06 13:58:25 +01:00
Esme-Yi	5f873faf6c	[PowerPC] Legalize SREM/UREM directly on P9. Summary: As Bugzilla-35090 reported, the rationale for using custom lowering SREM/UREM should no longer be true. At the IR level, the div-rem-pairs pass performs the transformation where the remainder is computed from the result of the division when both are required. We should now be able to lower these directly on P9. And the pass also fixed the problem that divide is in a different block than the remainder. This is a patch to remove redundant code and make SREM/UREM legal directly on P9. Reviewed By: lkail Differential Revision: https://reviews.llvm.org/D82145	2020-07-06 11:47:31 +00:00
Jay Foad	4e57aaab54	[TargetLowering] Improve expansion of FSHL/FSHR by non-zero amount Use a simpler code sequence when the shift amount is known not to be zero modulo the bit width. Nothing much uses this until D77152 changes the translation of fshl and fshr intrinsics. Differential Revision: https://reviews.llvm.org/D82540	2020-07-06 12:07:14 +01:00
Jay Foad	5fecaffff0	[TargetLowering] Improve expansion of ROTL/ROTR Using a negation instead of a subtraction from a constant can save an instruction on some targets. Nothing much uses this until D77152 changes the translation of fshl and fshr intrinsics. Differential Revision: https://reviews.llvm.org/D82539	2020-07-06 12:07:14 +01:00
Sam McCall	515e16cc38	[Support] fix user_cache_directory on mac	2020-07-06 12:54:11 +02:00
Kai Nacke	738ac04d45	[SystemZ/ZOS] Implement getMainExecutable() and is_local_impl() Adds implementation of getMainExecutable() and is_local_impl() to Support/Unix/Path.inc. Both are needed to compile LLVM for z/OS. Reviewed By: hubert.reinterpretcast, emaste Differential Revision: https://reviews.llvm.org/D82544	2020-07-06 06:48:16 -04:00
Kai Nacke	ff7a2eb652	[SystemZ/ZOS] Define Endian constants for z/OS. This is needed to build LLVM on z/OS, as there is no header file which provides these constants. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D82368	2020-07-06 06:48:16 -04:00
Roman Lebedev	b92ab9b43c	[Scalarizer] visit{Insert,Extract}ElementInst(): avoid call arg evaluation order deps Compilers may evaluate call arguments in different order, which would result in different order of IR, which would break the tests. Spotted thanks to Dmitri Gribenko!	2020-07-06 13:42:35 +03:00
David Green	5fd5e48e95	[ARM] Add extra extend and trunc costs for cast instructions This expands the existing extend costs with a few extras for larger types than legal, which will usually be split under MVE. It also adds trunc support for the same thing. These should not have a large effect on many things, but makes the costs explicit and keeps a certain balance between the truncs and extends. Differential Revision: https://reviews.llvm.org/D82457	2020-07-06 11:33:05 +01:00
Sam McCall	8c3580fff2	[Support] Add path::user_config_directory for $XDG_CONFIG_HOME etc Reviewers: hokein Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83128	2020-07-06 12:20:55 +02:00
Roman Lebedev	91d014b246	[Scalarizer] ExtractElement handling w/ variable insert index (PR46524) Summary: Similar to D82961. Reviewers: bjope, cameron.mcinally, arsenm, jdoerfert Reviewed By: jdoerfert Subscribers: arphaman, wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82970	2020-07-06 13:19:33 +03:00
Roman Lebedev	018bb71c12	[Scalarizer] InsertElement handling w/ variable insert index (PR46524) Summary: I'm interested in taking the original C++ input, for which we currently are stuck with an alloca and producing roughly the lower IR, with neither an alloca nor a vector ops: https://godbolt.org/z/cRRWaJ For that, as intermediate step, i'd like to somehow perform scalarization. As per @arsenmn suggestion, i'm trying to see if scalarizer can help me avoid writing a bicycle. I'm not sure if it's really intentional that variable insert is not handled currently. If it really is, and is supposed to stay that way (?), i guess i could guard it.. See [[ https://bugs.llvm.org/show_bug.cgi?id=46524 \| PR46524 ]]. Reviewers: bjope, cameron.mcinally, arsenm, jdoerfert Reviewed By: jdoerfert Subscribers: arphaman, uabelho, wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82961	2020-07-06 13:19:32 +03:00
Roman Lebedev	c0b1184e0b	[Scalarizer] ExtractElement handling w/ constant extract index Summary: It appears to be better IR-wise to aggressively scalarize it, rather than relying on gathering it, and leaving it as-is. Reviewers: jdoerfert, bjope, arsenm, cameron.mcinally Reviewed By: jdoerfert Subscribers: arphaman, wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83101	2020-07-06 13:19:32 +03:00
Roman Lebedev	bc4a979a8a	[Scalarizer] InsertElement handling w/ constant insert index Summary: As it can be clearly seen from the diff, this results in nicer IR. Reviewers: jdoerfert, arsenm, bjope, cameron.mcinally Reviewed By: jdoerfert Subscribers: arphaman, wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83102	2020-07-06 13:19:32 +03:00
Roman Lebedev	3c48d18cf9	[InstCombine] Lower infinite combine loop detection thresholds Summary: 1000 iterations is still kinda a lot. Would it make sense to iteratively lower it, until it becomes `2`, with some delay in between in order to let users actually potentially encounter it? Reviewers: spatel, nikic, kuhar Reviewed By: nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83160	2020-07-06 13:19:31 +03:00
David Green	01a228e646	[ARM] Use BaseT::getMemoryOpCost for getMemoryOpCost This alters getMemoryOpCost to use the Base TargetTransformInfo version that includes some additional checks for whether extending loads are legal. This will generally have the effect of making <2 x ..> and some <4 x ..> loads/stores more expensive, which in turn should help favour larger vector factors. Notably it alters the cost of a <4 x half>, which with the current codegen will be expensive if it is not extended. Differential Revision: https://reviews.llvm.org/D82456	2020-07-06 10:58:40 +01:00
Guillaume Chatelet	6e9457fc4d	Fix off by one error in Bitfields Differential Revision: https://reviews.llvm.org/D83192	2020-07-06 08:47:58 +00:00
Guillaume Chatelet	309087bb01	Fix 46594 - Alignment assertion failure in instcombine	2020-07-06 08:45:05 +00:00
Kazushi (Jam) Marukawa	3d2949ea2d	[VE] Correct stack alignment Summary: Change stack alignment from 64 bits to 128 bits to follow ABI correctly. And add a regression test for datalayout. Reviewers: simoll, k-ishizaka Reviewed By: simoll Subscribers: hiraditya, cfe-commits, llvm-commits Tags: #llvm, #ve, #clang Differential Revision: https://reviews.llvm.org/D83173	2020-07-06 17:25:29 +09:00
Nikita Popov	7815df948c	[SCCP] Add test for range metadata (NFC)	2020-07-05 21:41:04 +02:00
David Green	684e62e531	[ARM] Remove hasSideEffects from FP converts Whether an instruction is deemed to have side effects is determined by whether it has a tblgen pattern that emits a single instruction. Because of the way a lot of the vcvt instructions are specified either in dagtodag code or with patterns that emit multiple instructions, they don't get marked as not having side effects. This just marks them as not having side effects manually. It can help especially with instruction scheduling, to not create artificial barriers, but one of these tests also managed to produce fewer instructions. Differential Revision: https://reviews.llvm.org/D81639	2020-07-05 16:23:24 +01:00
Simon Pilgrim	16b569b2d7	[X86][SSE] Add PACKSS/PACKUS style patterns tests Similar to the proposed generic code generated by D61129 - there's still some shuffle combining improvements to go before that patch is ready.	2020-07-05 16:18:23 +01:00
Alexander Belyaev	ea3fabb802	[llvm] Cast to (void) the unused variable.	2020-07-05 12:33:58 +02:00
Fangrui Song	52308d71e7	Add tests for clang -fno-zero-initialized-in-bss and llc -nozero-initialized-in-bss And rename the CC1 option.	2020-07-04 23:26:57 -07:00
Georgy Komarov	564d25cbd9	[llvm-objcopy] Fix crash when removing symbol table at same time as adding a symbol This patch resolves a crash that occurs when a user wants to remove all symbols and add a brand new one using: ``` llvm-objcopy -R .symtab --add-symbol foo=1234 in.o out.o ``` Before these changes the symbol table was internally null when adding new symbols. For now we will regenerate symtab in this case. This fixes: https://bugs.llvm.org/show_bug.cgi?id=43930 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82935	2020-07-05 05:14:00 +03:00
Thomas Lively	1c8a1d0f1c	[WebAssembly] Do not assume br_table range checks will be gt_u OSS-Fuzz and the Emscripten test suite uncovered some edge cases in which the range check instruction seemed to be an (i32.const 0) or other unexpected instruction, triggering an assertion. Unfortunately the reproducers are rather complicated, so they don't make good unit tests. This commit removes the bad assertion and conservatively optimizes range checks only when the range check instruction is i32.gt_u. Differential Revision: https://reviews.llvm.org/D83169	2020-07-04 18:11:24 -07:00
Nico Weber	00222f480c	[gn build] fix link of libclang_rt.asan_osx_dynamic.dylib if command line tools are not installed	2020-07-04 20:26:39 -04:00
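
The folding condition described in the [RISCV] commit 0a0548042f listed above can be illustrated with a small standalone sketch. This is not the code from that patch: the helper names `fitsSImm12` and `tryFoldAddiOffset` are hypothetical, and the sketch only models the simple case where both offsets are known constants. The alignment-based case for variable addresses mentioned in the commit message is not modeled.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: true if V fits a signed 12-bit immediate,
// i.e. lies in the range [-2048, 2047].
constexpr bool fitsSImm12(int64_t V) { return V >= -2048 && V <= 2047; }

// Hypothetical helper modeling the fold
//   (load (addi base, off1), off2) -> (load base, off1+off2)
// for constant offsets only. Returns true and writes the combined
// offset to Combined when the sum still fits a signed 12-bit
// immediate; returns false and leaves Combined untouched otherwise.
bool tryFoldAddiOffset(int64_t Off1, int64_t Off2, int64_t &Combined) {
  // Both inputs are assumed to be valid 12-bit immediates already,
  // so their sum cannot overflow int64_t.
  if (!fitsSImm12(Off1) || !fitsSImm12(Off2))
    return false;
  const int64_t Sum = Off1 + Off2;
  if (!fitsSImm12(Sum))
    return false;
  Combined = Sum;
  return true;
}
```

Under these assumptions, offsets 2000 and 40 can be folded into a single offset of 2040, while 2000 and 100 cannot, because 2100 exceeds the 12-bit signed range and the ADDI has to stay.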

199657 Commits