llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Matt Arsenault	852ef015d3	AMDGPU/GlobalISel: Legalize addrspacecast Use a placeholder constant for now on targets that need the load from the queue ptr. llvm-svn: 353497	2019-02-08 02:40:47 +00:00
Wouter van Oortmerssen	922e60889a	[WebAssembly] Fixed Disassembler ignoring endian swap on big endian. Summary: This fixes: https://bugs.llvm.org/show_bug.cgi?id=40620 Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57933 llvm-svn: 353496	2019-02-08 01:43:23 +00:00
Craig Topper	ca954e59d7	Fix the lowering issue of intrinsics llvm.localaddress on X86 Patch by Yuanke Luo Reviewers: craig.topper, annita.zhang, smaslov, rnk, wxiao3 Reviewed By: rnk Subscribers: efriedma, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57501 llvm-svn: 353492	2019-02-08 01:14:12 +00:00
Caroline Tice	03f7c7ba9c	lvm-dwarfdump: Stop counting out-of-line subprogram in the "inlined functions" statistic. DW_TAG_subprogram DIEs should not be counted in the inlined function statistic. This also addresses the source variables count, as that uses the inlined function count in its calculations. Differential revision: https://reviews.llvm.org/D57849 llvm-svn: 353491	2019-02-08 00:51:33 +00:00
Craig Topper	74d1af9d27	[X86] Add FPCW as a register and start using it as an implicit use on floating point instructions. Summary: FPCW contains the rounding mode control which we manipulate to implement fp to integer conversion by changing the roudning mode, storing the value to the stack, and then changing the rounding mode back. Because we didn't model FPCW and its dependency chain, other instructions could be scheduled into the middle of the sequence. This patch introduces the register and adds it as an implciit def of FLDCW and implicit use of the FP binary arithmetic instructions and store instructions. There are more instructions that need to be updated, but this is a good start. I believe this fixes at least the reduced test case from PR40529. Reviewers: RKSimon, spatel, rnk, efriedma, andrew.w.kaylor Subscribers: dim, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57735 llvm-svn: 353489	2019-02-08 00:44:39 +00:00
Eli Friedman	b87d675297	[AArch64] Fix condition for "high-vector" DUP optimizations. AArch64 NEON has a bunch of instructions with a "2" suffix that extract the top half of the source vectors, instead of the bottom half. We have some DAGCombines to try to take advantage of that. However, they assumed that any EXTRACT_VECTOR was extracting the high half of the vector in question. This issue has apparently existed since the AArch64 backend was merged. Fixes https://bugs.llvm.org/show_bug.cgi?id=40632 . Differential Revision: https://reviews.llvm.org/D57862 llvm-svn: 353486	2019-02-08 00:23:35 +00:00
Petar Jovanovic	22c328969d	[mips][micromips] Fix how values in .gcc_except_table are calculated When a landing pad is calculated in a program that is compiled for micromips with -fPIC flag, it will point to an even address. Such an error will cause a segmentation fault, as the instructions in micromips are aligned on odd addresses. This patch sets the last bit of the offset where a landing pad is, to 1, which will effectively be an odd address and point to the instruction exactly. r344591 fixed this issue for -static compilation. Patch by Aleksandar Beserminji. Differential Revision: https://reviews.llvm.org/D57677 llvm-svn: 353480	2019-02-07 22:57:33 +00:00
Sanjay Patel	5d6b63c7f7	[x86] fix formatting; NFC llvm-svn: 353477	2019-02-07 22:36:55 +00:00
Dan Gohman	f477a8d64d	[WebAssembly] Update test output after rL353474. NFC. llvm-svn: 353476	2019-02-07 22:33:50 +00:00
Dan Gohman	e6412b9d23	[WebAssembly] Fix imported function symbol names that differ from their import names in the .o format Add a flag to allow symbols to have a wasm import name which differs from the linker symbol name, allowing the linker to link code using the import_module attribute. This is the MC/Object portion of the patch. Differential Revision: https://reviews.llvm.org/D57632 llvm-svn: 353474	2019-02-07 22:03:32 +00:00
Quentin Colombet	ffd0bc8256	[InstCombine] Optimize `atomicrmw <op>, 0` into `load atomic` when possible This commit teaches InstCombine how to replace an atomicrmw operation into a simple load atomic. For a given `atomicrmw <op>`, this is possible when: 1. The ordering of that operation is compatible with a load (i.e., anything that doesn't have a release semantic). 2. <op> does not modify the value being stored Differential Revision: https://reviews.llvm.org/D57854 llvm-svn: 353471	2019-02-07 21:27:23 +00:00
Peter Collingbourne	471dc9e2cf	gn build: Make check-{clang,lld,llvm} pass on FreeBSD. Mostly achieved by assuming that anything that isn't Win or Mac is ELF, which seems reasonable enough for now. Differential Revision: https://reviews.llvm.org/D57870 llvm-svn: 353470	2019-02-07 21:24:30 +00:00
Florian Hahn	579b6a0fc9	[LV] Remove unnecessary assignment to UserIC. llvm-svn: 353469	2019-02-07 21:23:37 +00:00
Sanjay Patel	1dc7230d7d	[InstCombine] Fix crashing from (icmp (bitcast ([su]itofp X)), Y) This fixes a class of bugs introduced by D44367, which transforms various cases of icmp (bitcast ([su]itofp X)), Y to icmp X, Y. If the bitcast is between vector types with a different number of elements, the current code will produce bad IR along the lines of: icmp <N x i32> ..., <M x i32> <...>. This patch suppresses the transform if the bitcast changes the number of vector elements. Patch by: @AndrewScheidecker (Andrew Scheidecker) Differential Revision: https://reviews.llvm.org/D57871 llvm-svn: 353467	2019-02-07 21:12:01 +00:00
Adrian Prantl	10b7106901	Move SMTSolver dump() methods out-of-line. This broke modularized non-local-submodule-visibility builds because the function bodies pulled in extra dependencies. llvm-svn: 353465	2019-02-07 21:03:18 +00:00
Nikita Popov	439d6e3c64	[CodeGen] Handle vector UADDO, SADDO, USUBO, SSUBO This is part of https://bugs.llvm.org/show_bug.cgi?id=40442. Vector legalization is implemented for the add/sub overflow opcodes. UMULO/SMULO are also handled as far as legalization is concerned, but they don't support vector expansion yet (so no tests for them). The vector result widening implementation is suboptimal, because it could result in a legalization loop. Differential Revision: https://reviews.llvm.org/D57639 llvm-svn: 353464	2019-02-07 21:02:22 +00:00
Shoaib Meenai	b5d55878f8	[cmake] Pass LLVM_TEMPORARILY_ALLOW_OLD_TOOLCHAIN to NATIVE configure We should propagate this down to host builds so that e.g. people using an optimized tablegen can do the sub-configure successfully. llvm-svn: 353463	2019-02-07 20:58:04 +00:00
Sanjay Patel	c72d7a3615	[InstCombine] refactor folds for (icmp (bitcast X), Y); NFCI llvm-svn: 353462	2019-02-07 20:54:09 +00:00
Florian Hahn	b3a3f62af7	[LV] Prevent interleaving if computeMaxVF returned None. As discussed in D57382, interleaving should be avoided if computeMaxVF returns None, same as we currently do for vectorization. Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=6477 Reviewers: Ayal, dcaballe, hsaito, mkuper, rengolin Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D57837 llvm-svn: 353461	2019-02-07 20:49:10 +00:00
Matt Arsenault	c6797e2fd7	GlobalISel: Try to fix bot failures Don't rely on order of evaluation of function arguments. llvm-svn: 353460	2019-02-07 20:44:08 +00:00
Simon Pilgrim	ebb87b0b13	[DAGCombiner] (add (umax X, C), -C) --> (usubsat X, C) (PR40111) Move the (add (umax X, C), -C) --> (usubsat X, C) X86 combine into generic DAGCombiner First of a number of saturated arithmetic folds that can be moved out of X86-specific code for PR40111. Differential Revision: https://reviews.llvm.org/D57754 llvm-svn: 353457	2019-02-07 20:14:43 +00:00
Matt Arsenault	e739762e43	GlobalISel: Implement narrowScalar for shift main type This is pretty much directly ported from SelectionDAG. Doesn't include the shift by non-constant but known bits version, since there isn't a globalisel version of computeKnownBits yet. This shows a disadvantage of targets not specifically which type should be used for the shift amount. If type 0 is legalized before type 1, the operations on the shift amount type use the wider type (which are also less likely to legalize). This can be avoided by targets specifying legalization actions on type 1 earlier than for type 0. llvm-svn: 353455	2019-02-07 19:37:44 +00:00
Matt Arsenault	3f79f1900f	AMDGPU/GlobalISel: Restrict g_implicit_def legality llvm-svn: 353452	2019-02-07 19:10:15 +00:00
Matt Arsenault	adc563e457	GlobalISel: Fix artifact combiner constant legality checks for vectors Since G_CONSTANT is illegal for vectors, this needs to check what buildConstant will produce for a splat vector. llvm-svn: 353449	2019-02-07 18:58:28 +00:00
Matt Arsenault	3e905ec90e	AMDGPU/GlobalISel: Don't use g_implicit_def in a few tests llvm-svn: 353443	2019-02-07 18:33:22 +00:00
Nirav Dave	19069d354c	Revert "[DAG] Cleanup of unused node in SimplifySelectCC." Causes ASAN use-after-poison errors. llvm-svn: 353442	2019-02-07 18:31:05 +00:00
Reid Kleckner	4218167234	[InstrProf] Avoid reconstructing Triple, NFC llvm-svn: 353439	2019-02-07 18:16:22 +00:00
Matt Arsenault	a1f6f45ed6	AMDGPU/GlobalISel: Legalize fsqrt llvm-svn: 353438	2019-02-07 18:14:39 +00:00
Matt Arsenault	7b9b227552	AMDGPU/GlobalISel: Legalize some f16 operations llvm-svn: 353436	2019-02-07 18:03:11 +00:00
Teresa Johnson	a1faa7ab09	[HotColdSplit] With PGO add profile entry metadata to split cold function Summary: When compiling with profile data, ensure the split cold function gets cold function_entry_count metadata (just use 0 since it should be cold). Otherwise with function sections it will not be placed in the unlikely text section with other cold code. Reviewers: vsk Subscribers: sebpop, hiraditya, davidxl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57900 llvm-svn: 353434	2019-02-07 17:50:35 +00:00
Sanjay Patel	b2414895c7	[DAGCombiner] fold add/sub with bool operand based on target's boolean contents I noticed that we are missing this canonicalization in IR: rL352515 ...and then realized that we don't get this right in SDAG either, so this has to be fixed first regardless of what we choose to do in IR. The existing fold was limited to scalars and using the wrong predicate to guard the transform. We have a boolean contents TLI query that can be used to decide which direction to fold. This may eventually lead back to the problems/question in: https://bugs.llvm.org/show_bug.cgi?id=40486 ...but it makes no difference to that yet. Differential Revision: https://reviews.llvm.org/D57401 llvm-svn: 353433	2019-02-07 17:43:34 +00:00
Matt Arsenault	9ee013cd69	GlobalISel: Implement fewerElementsVector for shifts Introduce a new function which handles instructions with multiple type indices, but have the same number of vector elements. Also legalize v2s16 shifts when applicable. llvm-svn: 353432	2019-02-07 17:38:00 +00:00
Matt Arsenault	c48f0dc588	GlobalISel: Try to make legalize rules more useful for vectors Mostly keep the existing functions on scalars, but add versions which also operate based on the vector element size. llvm-svn: 353430	2019-02-07 17:25:51 +00:00
Nirav Dave	881a032ac6	[DAG] Cleanup of unused node in SimplifySelectCC. llvm-svn: 353428	2019-02-07 17:13:55 +00:00
Sanjay Patel	6147bbcf20	[x86] split more 256/512-bit shuffles in lowering This is intentionally a small step because it's hard to know exactly where we might introduce a conflicting transform with the code that tries to form wider shuffles. But I think this is safe - if we have a wide shuffle with 2 operands, then we should do better with an extract + narrow shuffle. Differential Revision: https://reviews.llvm.org/D57867 llvm-svn: 353427	2019-02-07 17:10:49 +00:00
Nirav Dave	79b54c4c01	[DAG] Cleanup unused node on failed SELECT Combine. llvm-svn: 353426	2019-02-07 16:57:50 +00:00
Jordan Rupprecht	ef95cdb624	[llvm-ar][libObject] Fix relative paths when nesting thin archives. Summary: When adding one thin archive to another, we currently chop off the relative path to the flattened members. For instance, when adding `foo/child.a` (which contains `x.txt`) to `parent.a`, when flattening it we should add it as `foo/x.txt` (which exists) instead of `x.txt` (which does not exist). As a note, this also undoes the `IsNew` parameter of handling relative paths in r288280. The unit test there still passes. This was reported as part of testing the kernel build with llvm-ar: https://patchwork.kernel.org/patch/10767545/ (see the second point). Reviewers: mstorsjo, pcc, ruiu, davide, david2050 Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57842 llvm-svn: 353424	2019-02-07 16:41:06 +00:00
Nirav Dave	73fee7dca5	[X86] Simplify casing. NFC. llvm-svn: 353417	2019-02-07 15:43:40 +00:00
Nirav Dave	09f8959870	[DAG] Cleanup unused nodes on failed store-to-load forward combine. llvm-svn: 353416	2019-02-07 15:38:14 +00:00
Alexandre Ganea	4710865de9	[CodeView] Fix cycles in debug info when merging Types with global hashes When type streams with forward references were merged using GHashes, cycles were introduced in the debug info. This was caused by GlobalTypeTableBuilder::insertRecordAs() not inserting the record on the second pass, thus yielding an empty ArrayRef at that record slot. Later on, upon PDB emission, TpiStreamBuilder::commit() would skip that empty record, thus offseting all indices that came after in the stream. This solution comes in two steps: 1. Fix the hash calculation, by doing a multiple-step resolution, iff there are forward references in the input stream. 2. Fix merge by resolving with multiple passes, therefore moving records with forward references at the end of the stream. This patch also adds support for llvm-readoj --codeview-ghash. Finally, fix dumpCodeViewMergedTypes() which previously could reference deleted memory. Fixes PR40221 Differential Revision: https://reviews.llvm.org/D57790 llvm-svn: 353412	2019-02-07 15:24:18 +00:00
Fangrui Song	46cc0ce60a	Fix misspelled filenames in file headers llvm-svn: 353408	2019-02-07 14:38:25 +00:00
Sam Parker	555093ddca	[LSR] Generate cross iteration indexes Modify GenerateConstantOffsetsImpl to create offsets that can be used by indexed addressing modes. If formulae can be generated which result in the constant offset being the same size as the recurrence, we can generate a pre-indexed access. This allows the pointer to be updated via the single pre-indexed access so that (hopefully) no add/subs are required to update it for the next iteration. For small cores, this can significantly improve performance DSP-like loops. Differential Revision: https://reviews.llvm.org/D55373 llvm-svn: 353403	2019-02-07 13:32:54 +00:00
Diana Picus	0e9dd4a5f9	[ARM GlobalISel] Support G_ICMP for Thumb2 Mark as legal and use the t2* equivalents of the arm mode instructions, e.g. t2CMPrr instead of plain CMPrr. llvm-svn: 353392	2019-02-07 11:05:33 +00:00
David Green	654d7ed583	[ARM] Reformat isRedundantFlagInstr for D57833. NFC llvm-svn: 353386	2019-02-07 10:51:04 +00:00
Jiong Wang	1bc276c3c4	[BPF] add code-gen support for JMP32 instructions JMP32 instructions has been added to eBPF ISA. They are 32-bit variants of existing BPF conditional jump instructions, but the comparison happens on low 32-bit sub-register only, therefore some unnecessary extensions could be saved. JMP32 instructions will only be available for -mcpu=v3. Host probe hook has been updated accordingly. JMP32 instructions will only be enabled in code-gen when -mattr=+alu32 enabled, meaning compiling the program using sub-register mode. For JMP32 encoding, it is a new instruction class, and is using the reserved eBPF class number 0x6. This patch has been tested by compiling and running kernel bpf selftests with JMP32 enabled. Acked-by: Yonghong Song <yhs@fb.com> Signed-off-by: Jiong Wang <jiong.wang@netronome.com> llvm-svn: 353384	2019-02-07 10:43:09 +00:00
Tim Northover	77f59ae9f5	AArch64: implement copy for paired GPR registers. When doing 128-bit atomics using CASP we might need to copy a GPRPair to a different register, but that was unimplemented up to now. llvm-svn: 353383	2019-02-07 10:35:34 +00:00
Craig Topper	eab000b4b2	[BranchFolding] Remove dead code for handling EHPad blocks Summary: This code tries to handle the case where IBB is an EHPad, but there's an earlier check that uses PBB->hasEHPadSuccessor(). Where PBB is a predecessor of IBB. The hasEHPadSuccessor function would have visited IBB and seen that it was an EHPad and returned false. This would prevent us from reaching this code with IBB as an EHPad. Looks like this code was originally added in rL37427 (ancient) and made dead in rL143001. Reviewers: rnk, void, efriedma Reviewed By: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D57358 llvm-svn: 353375	2019-02-07 06:21:28 +00:00
JF Bastien	406b84702a	Bump minimum toolchain version Summary: The RFC on moving past C++11 got good traction: http://lists.llvm.org/pipermail/llvm-dev/2019-January/129452.html This patch therefore bumps the toolchain versions according to our policy: llvm.org/docs/DeveloperPolicy.html#toolchain Subscribers: mgorny, jkorous, dexonsmith, llvm-commits, mehdi_amini, jyknight, rsmith, chandlerc, smeenai, hans, reames, lattner, lhames, erichkeane Differential Revision: https://reviews.llvm.org/D57264 llvm-svn: 353374	2019-02-07 05:20:00 +00:00
Mikhail R. Gadelha	dd981b7599	Move the SMT API to LLVM Moved everything SMT-related to LLVM and updated the cmake scripts. Differential Revision: https://reviews.llvm.org/D54978 llvm-svn: 353373	2019-02-07 03:19:45 +00:00
Peter Collingbourne	7a661cb3b5	gn build: Merge the test part of r353237. llvm-svn: 353369	2019-02-07 02:40:49 +00:00

1 2 3 4 5 ...

174911 Commits