llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 11:33:24 +02:00

Author	SHA1	Message	Date
Matt Arsenault	80a8f35107	AMDGPU: Fix not preserving alignment in call setups If an argument was passed on the stack, this was using the default alignment. I'm not sure there's an observable change from this. This was observable due to bugs in expansion of unaligned loads and stores, but since that is fixed I don't think this matters much. llvm-svn: 342133	2018-09-13 12:14:31 +00:00
Matt Arsenault	039a09c819	DAG: Fix expansion of unaligned FP loads and stores This was trying to scalarize a scalar FP type, resulting in an assert. Fixes unaligned f64 stack stores for AMDGPU. llvm-svn: 342132	2018-09-13 12:14:23 +00:00
Matt Arsenault	03a36b7350	AMDGPU: Fix some outdated datalayouts in tests llvm-svn: 342131	2018-09-13 11:56:28 +00:00
Simon Pilgrim	7541b558e5	Fix unused variable warning. NFCI. llvm-svn: 342128	2018-09-13 10:54:23 +00:00
Tim Northover	9530f416eb	ARM: align loops to 4 bytes on Cortex-M3 and Cortex-M4. The Technical Reference Manuals for these two CPUs state that branching to an unaligned 32-bit instruction incurs an extra pipeline reload penalty. That's bad. This also enables the optimization at -Os since it costs on average one byte per loop in return for 1 cycle per iteration, which is pretty good going. llvm-svn: 342127	2018-09-13 10:28:05 +00:00
Dean Michael Berris	37533d7125	[XRay] Bug fixes for FDR custom event and arg-logging Summary: This change has a number of fixes for FDR mode in compiler-rt along with changes to the tooling handling the traces in llvm. In the runtime, we do the following: - Advance the "last record" pointer appropriately when writing the custom event data in the log. - Add XRAY_NEVER_INSTRUMENT in the rewinding routine. - When collecting the argument of functions appropriately marked, we should not attempt to rewind them (and reset the counts of functions that can be re-wound). In the tooling, we do the following: - Remove the state logic in BlockIndexer and instead rely on the presence/absence of records to indicate blocks. - Move the verifier into a loop associated with each block. Reviewers: mboerger, eizan Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D51965 llvm-svn: 342122	2018-09-13 09:25:42 +00:00
Alexander Timofeev	31b7ea7819	[AMDGPU] Load divergence predicate refactoring Differential revision: https://reviews.llvm.org/D51931 Reviewers: rampitec llvm-svn: 342120	2018-09-13 09:06:56 +00:00
Simon Atanasyan	ae110efea3	[mips] Enable the mnemonic spell corrector This implements suggesting alternative mnemonics when an invalid one is specified. For example `addru $9, $6, 17767` leads to the following error message: error: unknown instruction, did you mean: add, addiu, addu, maddu? Differential revision: https://reviews.llvm.org/D40646 llvm-svn: 342119	2018-09-13 08:38:03 +00:00
Clement Courbet	3faa4f1c4d	[llvm-exegesis][NFC] Remove dead parameter. llvm-svn: 342118	2018-09-13 08:06:29 +00:00
Clement Courbet	1931b181a5	[llvm-exegesis][NFC] Split BenchmarkRunner class Summary: The snippet-generation part goes to the SnippetGenerator class. This will allow benchmarking arbitrary code (see PR38437). Reviewers: gchatelet Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D51979 llvm-svn: 342117	2018-09-13 07:40:53 +00:00
Alexander Timofeev	4375b6f72e	[AMDGPU] Preliminary patch for divergence driven instruction selection. Load offset inlining pattern changed. Differential revision: https://reviews.llvm.org/D51975 Reviewers: rampitec llvm-svn: 342115	2018-09-13 06:34:56 +00:00
Craig Topper	ebf7a0ef71	[X86] Type legalize v2i32 div/rem by scalarizing rather than promoting Summary: Previously we type legalized v2i32 div/rem by promoting to v2i64. But we don't support div/rem of vectors so op legalization would then scalarize it using i64 scalar ops since it doesn't know about the original promotion. 64-bit scalar divides on Intel hardware are known to be slow and in 32-bit mode they require a libcall. This patch switches type legalization to do the scalarizing itself using i32. It looks like the division by power of 2 optimization is still kicking in and leaving the code as a vector. The division by other constant optimization doesn't kick in pre type legalization since it ignores illegal types. And previously, after type legalization we scalarized the v2i64 since we don't have v2i64 MULHS/MULHU support. Another option might be to widen v2i32 to v4i32 so we could do division by constant optimizations, but we'd have to be careful to only do that for constant divisors or we risk scalarizing to 4 scalar divides. Reviewers: RKSimon, spatel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51325 llvm-svn: 342114	2018-09-13 06:13:37 +00:00
Saleem Abdulrasool	a19b59c8fe	ARM: correct the relocation type for `bl` on WoA The `IMAGE_REL_ARM_BRANCH20T` applies only to a `b.w` instruction. A thumb-2 `bl` should be relocated using a `IMAGE_REL_ARM_BRANCH24T`. Correct the relocation that we emit in such a case. Resolves PR38620! Based on the patch by Jordan Rhee! llvm-svn: 342109	2018-09-13 04:55:08 +00:00
Max Kazantsev	8cb9ec0295	[NFC] Add Requires: asserts where needed llvm-svn: 342108	2018-09-13 04:43:24 +00:00
Max Kazantsev	8a4653b5c7	[NFC] Use expensive asserts in relevant LICM tests llvm-svn: 342107	2018-09-13 04:00:39 +00:00
Thomas Lively	1eba42c218	Remove isAsCheapAsAMove from v128.const llvm-svn: 342106	2018-09-13 02:50:57 +00:00
Thomas Lively	85571dc7cf	Remove isAsCheapAsAMove from mem ops llvm-svn: 342105	2018-09-13 02:50:57 +00:00
Thomas Lively	c1b2dbb833	[WebAssembly] Add missing SIMD instruction attributes Summary: These attributes are copied from equivalent instructions in WebAssemblyInstrInfo.td. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51518 llvm-svn: 342104	2018-09-13 02:50:56 +00:00
David Blaikie	28c12e9cfb	STLExtras: Add some more algorithm wrappers llvm-svn: 342102	2018-09-13 00:02:03 +00:00
David Blaikie	a2ff546ca4	DebugInfo/PDB: Remove unused member llvm-svn: 342101	2018-09-13 00:02:02 +00:00
David Blaikie	970d7608f6	dwarfdump: Improve performance on large DWP files llvm-svn: 342099	2018-09-12 23:39:51 +00:00
Sanjay Patel	60b6a254a2	[DAGCombiner] improve formatting for select+setcc code; NFC llvm-svn: 342095	2018-09-12 23:03:50 +00:00
Adrian Prantl	0f8b4f2209	fix 80-column violation with clang-format llvm-svn: 342094	2018-09-12 22:57:28 +00:00
Zachary Turner	a4adbe577f	[PDB] Remove all clone() methods. These are dead code and encourage poor usage patterns, so I'm removing them. They weren't called anywhere anyway. llvm-svn: 342093	2018-09-12 22:57:03 +00:00
Krzysztof Parzyszek	3d8191f695	[Hexagon] Use shuffles when lowering "gather" shufflevectors Shufflevector instructions in LLVM IR that extract a subset of elements of a longer input into a shorter vector can be done using VECTOR_SHUFFLEs. This will avoid expanding them into costly extracts and inserts. llvm-svn: 342091	2018-09-12 22:14:52 +00:00
Krzysztof Parzyszek	6458f424f4	[Hexagon] Improve the selection algorithm in scalarizeShuffle Use topological ordering for newly generated nodes. llvm-svn: 342090	2018-09-12 22:10:58 +00:00
Kristina Brooks	a5be339044	[Support] sys::fs::directory_entry includes the file_type. This is available on most platforms (Linux/Mac/Win/BSD) with no extra syscalls. On other platforms (e.g. Solaris) we stat() if this information is requested. This will allow switching clang's VFS to efficiently expose (path, type) when traversing a directory. Currently it exposes an entire Status, but does so by calling fs::status() on all platforms. Almost all callers only need the path, and all callers only need (path, type). Patch by sammccall (Sam McCall) Differential Revision: https://reviews.llvm.org/D51918 llvm-svn: 342089	2018-09-12 22:08:10 +00:00
Vedant Kumar	5d6e7b38df	[llvm-cov] Delete custom JSON serialization code (NFC) Teach llvm-cov to use the new llvm JSON library, and remove some redundant/brittle JSON serialization tests. llvm-svn: 342088	2018-09-12 21:59:38 +00:00
Lang Hames	1771029439	[ORC] Merge ExecutionSessionBase with ExecutionSession by moving a couple of template methods in JITDylib out-of-line. This also splits JITDylib::define into a pair of template methods, one taking an lvalue reference and the other an rvalue reference. This simplifies the templates at the cost of a small amount of code duplication. llvm-svn: 342087	2018-09-12 21:49:02 +00:00
Lang Hames	0c2a797c97	[ORC] Add a special 'main' JITDylib that is created on ExecutionSession construction, a new convenience lookup method, and add-to layer methods. ExecutionSession now creates a special 'main' JITDylib upon construction. All subsequently created JITDylibs are added to the main JITDylib's search order by default (controlled by the AddToMainDylibSearchOrder parameter to ExecutionSession::createDylib). The main JITDylib's search order will be used in the future to properly handle cross-JITDylib weak symbols, with the first definition in this search order selected. This commit also adds a new ExecutionSession::lookup convenience method that performs a blocking lookup using the main JITDylib's search order, as this will be a very common operation for clients. Finally, new convenience overloads of IRLayer and ObjectLayer's add methods are introduced that add the given program representations to the main dylib, which is likely to be the common case. llvm-svn: 342086	2018-09-12 21:48:59 +00:00
Heejin Ahn	5565c40d7e	[WebAssembly] Make tied inline asm operands work again Summary: rL341389 broke code with tied register operands in inline assembly. For example, `asm("" : "=r"(var) : "0"(var));` The code above specifies the input operand to be in the same register as the output operand, tying the two registers. This patch makes this kind of code work again. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, eraman, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51991 llvm-svn: 342084	2018-09-12 21:34:39 +00:00
Sanjay Patel	3526e6ff08	revert r341288 - [Reassociate] swap binop operands to increase factoring potential This causes or exposes indeterminism that is visible in the output of -reassociate. llvm-svn: 342083	2018-09-12 21:29:11 +00:00
Sanjay Patel	ad7dc8298b	[InstCombine] add tests for unsigned add overflow; NFC llvm-svn: 342082	2018-09-12 21:13:37 +00:00
Michael Berg	29d9b03630	Guard FMF context by excluding some FP operators from FPMathOperator Summary: Some FPMathOperators succeed and then retrieve FMF context when they never have it; we should omit these cases to keep from removing FMF context. For instance, when we visit some FPMathOperator mapped Instructions which never have FMF flags and a Node was associated which does have FMF flags, that Node today will have all its flags cleared via the intersect operation. With this change, we exclude associating Nodes that never have FPMathOperator status under FMF. Reviewers: spatel, wristow, arsenm, hfinkel, aemerson Reviewed By: spatel Subscribers: llvm-commits, wdng Differential Revision: https://reviews.llvm.org/D51145 llvm-svn: 342081	2018-09-12 21:09:59 +00:00
Zachary Turner	5139ea485d	[PDB] Emit old fpo data to the PDB file. r342003 added support for emitting FPO data from the DEBUG_S_FRAMEDATA subsection of the .debug$S section to the PDB file. However, that is not the end of the story. FPO can end up in two different destinations in a PDB, each corresponding to a different FPO data source. The case handled by r342003 involves copying data from the DEBUG_S_FRAMEDATA subsection of the .debug$S section to the "New FPO" stream in the PDB, which is then referred to by the DBI stream. The case handled by this patch involves copying records from the .debug$F section of an object file to the "FPO" stream (or perhaps more aptly, the "Old FPO" stream) in the PDB file, which is also referred to by the DBI stream. The formats are largely similar, and the difference is mostly only visible in masm generated object files, such as some of the low-level CRT object files like memcpy. MASM doesn't appear to support writing the DEBUG_S_FRAMEDATA subsection, and instead just writes these records to the .debug$F section. Although clang-cl does not emit a .debug$F section ever, lld still needs to support it so we have good debugging for CRT functions. Differential Revision: https://reviews.llvm.org/D51958 llvm-svn: 342080	2018-09-12 21:02:01 +00:00
Krzysztof Parzyszek	7bc655bc82	[Hexagon] Use legalized type for extracted elements in scalarizeShuffle Scalarization of a shuffle will break up the source vectors into individual elements, and use them to assemble the resulting vector. An element type of a legal vector type may not necessarily be a legal scalar type, so make sure that the extracted values are extended to a legal scalar type. llvm-svn: 342079	2018-09-12 20:58:48 +00:00
Konstantin Zhuravlyov	6028eb217c	AMDGPU: Print all kernel descriptor directives (including the ones with default values) Change by Tony Tye Differential Revision: https://reviews.llvm.org/D51954 llvm-svn: 342077	2018-09-12 20:25:39 +00:00
Roman Lebedev	d6bc597cae	[NFC][InstCombine] Drop newly-added interference-tests-for-high-bit-check.ll Now that i have actually double-checked, no, there is no such interference possible... llvm-svn: 342076	2018-09-12 20:06:46 +00:00
Roman Lebedev	4699c9d461	[NFC][InstCombine] PR38708 - inefficient pattern for high-bits checking. More complicated, canonical pattern: https://rise4fun.com/Alive/uhA https://godbolt.org/z/o4RB8D Also, we need to be careful not to skip some patterns... https://bugs.llvm.org/show_bug.cgi?id=38708 llvm-svn: 342074	2018-09-12 19:44:26 +00:00
Konstantin Zhuravlyov	e4afca7347	AMDGPU: Re-apply r341982 after fixing the layering issue Move isa version determination into TargetParser. Also switch away from target features to CPU string when determining isa version. This fixes an issue when we output wrong isa version in the object code when features of a particular CPU are altered (i.e. gfx902 w/o xnack used to result in gfx900). llvm-svn: 342069	2018-09-12 18:50:47 +00:00
Roman Lebedev	2323cae6f1	[InstCombine] Inefficient pattern for high-bits checking (PR38708) Summary: It is sometimes important to check that some newly-computed value is non-negative and only `n` bits wide (where `n` is a variable.) There are many ways to check that: https://godbolt.org/z/o4RB8D The last variant seems best? (I'm sure there are some other variations i haven't thought of..) Let's handle the second variant first, since it is much simpler. https://rise4fun.com/Alive/LYjY https://bugs.llvm.org/show_bug.cgi?id=38708 Reviewers: spatel, craig.topper, RKSimon Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51985 llvm-svn: 342067	2018-09-12 18:19:43 +00:00
Julie Hockett	fb17de4213	[objcopy] make objcopy follow program header standards Submitted on behalf of Armando Montanez (amontanez@google.com). Objects with unused program headers copied by objcopy would always have nonzero values for program header offset and program header entry size. While technically valid, this atypical behavior triggers warnings in some tools. This change sets the two fields to zero when the program header is unused, better fitting the general expectations for unused program header data. Section headers behaved somewhat similarly (though only with the entry size), and are fixed in this revision as well. Differential Revision: https://reviews.llvm.org/D51961 llvm-svn: 342065	2018-09-12 17:56:31 +00:00
Thomas Lively	7409e6d86b	[WebAssembly] SIMD comparisons Summary: Match the ordering semantics of non-vector comparisons. For floating point comparisons that do not correspond to instructions, the tests check that some vector comparison instruction was emitted but do not care about the full implementation. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51765 llvm-svn: 342064	2018-09-12 17:56:00 +00:00
Diogo N. Sampaio	bccbb4da14	[ARM] Tighten f64<->f16 conversion requirements Fix missing Requires fields. Patch by Bernard Ogden (bogden) Reviewers: SjoerdMeijer, javed.absar, t.p.northover Reviewed By: t.p.northover Differential Revision: https://reviews.llvm.org/D51631 llvm-svn: 342061	2018-09-12 16:24:43 +00:00
Craig Topper	92e7d4079f	[X86] Remove isel patterns for ADCX instruction There's no advantage to this instruction unless you need to avoid touching other flag bits. Its encoding is longer, it can't fold an immediate, it doesn't write all the flags. I don't think gcc will generate this instruction either. Fixes PR38852. Differential Revision: https://reviews.llvm.org/D51754 llvm-svn: 342059	2018-09-12 15:47:34 +00:00
Florian Hahn	426ad98267	[PatternMatch] Use generic One,Two,ThreeOps_match classes (NFC). Currently we have a few duplicated matcher classes, which all do pretty much the same thing. This patch introduces generic One,Two,ThreeOps_match classes which take the opcode to match as a template argument. Reviewers: SjoerdMeijer, dneilson, spatel, arsenm Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D51044 llvm-svn: 342058	2018-09-12 14:52:38 +00:00
Wolfgang Pieb	0150bc5cfd	Reverting r342048, which caused UBSan failures in dsymutil. llvm-svn: 342056	2018-09-12 14:40:04 +00:00
Alexandros Lamprineas	67a00b872a	[GVNHoist] computeInsertionPoints() miscalculates IDF Fix for https://bugs.llvm.org/show_bug.cgi?id=38912. In GVNHoist::computeInsertionPoints() we iterate over the Value Numbers and calculate the Iterated Dominance Frontiers without clearing the IDFBlocks vector first. IDFBlocks ends up accumulating an insane number of basic blocks, which bloats the compilation time of SemaChecking.cpp with ubsan enabled. Differential Revision: https://reviews.llvm.org/D51980 llvm-svn: 342055	2018-09-12 14:28:23 +00:00
Roman Lebedev	0cada6afe9	[NFC][InstCombine] PR38708 - inefficient pattern for high-bits checking. The simplest pattern for now: https://rise4fun.com/Alive/LYjY https://godbolt.org/z/o4RB8D https://bugs.llvm.org/show_bug.cgi?id=38708 llvm-svn: 342054	2018-09-12 14:11:37 +00:00
Sander de Smalen	1e20d76347	[AArch64] Implement aarch64_vector_pcs codegen support. This patch adds codegen support for the saving/restoring V8-V23 for functions specified with the aarch64_vector_pcs calling convention attribute, as added in patch D51477. Reviewers: t.p.northover, gberry, thegameg, rengolin, javed.absar, MatzeB Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D51479 llvm-svn: 342049	2018-09-12 12:10:22 +00:00
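
The InstCombine entries for PR38708 in the log above talk about checking that a value is non-negative and only `n` bits wide, and note that there are many ways to write that check. As a rough illustration only, here are two such formulations for an unsigned 32-bit value, assuming `n < 32`. The helper names are hypothetical and are not LLVM APIs, and these are not claimed to be the exact variants from the linked godbolt page or the exact fold the patch performs.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper names for illustration; not part of LLVM.
// Both helpers assume n < 32 so that the shifts are well defined.

// Variant A: compare against a mask with a single high bit set.
constexpr bool fitsByCompare(std::uint32_t x, unsigned n) {
  return x < (std::uint32_t{1} << n);
}

// Variant B: shift the low n bits away and test whether anything is left.
constexpr bool fitsByShift(std::uint32_t x, unsigned n) {
  return (x >> n) == 0;
}

// Compile-time spot checks: 7 fits in 3 bits, 8 does not.
static_assert(fitsByCompare(7, 3) && fitsByShift(7, 3), "7 fits in 3 bits");
static_assert(!fitsByCompare(8, 3) && !fitsByShift(8, 3), "8 needs 4 bits");

// Returns true if both variants agree for every 16-bit value and every
// width below 32. This is a sanity check, not a proof for all inputs.
inline bool variantsAgree() {
  for (unsigned n = 0; n < 32; ++n)
    for (std::uint32_t x = 0; x <= 0xFFFF; ++x)
      if (fitsByCompare(x, n) != fitsByShift(x, n))
        return false;
  return true;
}

// Runtime wrapper around the sanity check above.
inline void checkExamples() { assert(variantsAgree()); }
```

Both variants give the same answer, so an optimizer is free to pick whichever form lowers to cheaper code on the target. The linked Alive proofs in the commit messages are the authoritative statement of what the patch actually transforms.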


169113 Commits