llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Matthias Braun	e45ebab2b3	AArch64: Fix emergency spillslot being out of reach for large callframes Re-commit of r322200: The testcase shouldn't hit machineverifiers anymore with r322917 in place. Large callframes (calls with several hundreds or thousands or parameters) could lead to situations in which the emergency spillslot is out of range to be addressed relative to the stack pointer. This commit forces the use of a frame pointer in the presence of large callframes. This commit does several things: - Compute max callframe size at the end of instruction selection. - Add mirFileLoaded target callback. Use it to compute the max callframe size after loading a .mir file when the size wasn't specified in the file. - Let TargetFrameLowering::hasFP() return true if there exists a callframe > 255 bytes. - Always place the emergency spillslot close to FP if we have a frame pointer. - Note that `useFPForScavengingIndex()` would previously return false when a base pointer was available leading to the emergency spillslot getting allocated late (that's the whole effect of this callback). Which made no sense to me so I took this case out: Even though the emergency spillslot is technically not referenced by FP in this case we still want it allocated early. Differential Revision: https://reviews.llvm.org/D40876 llvm-svn: 322919	2018-01-19 03:16:36 +00:00
Matthias Braun	43dacf8f39	AArch64: Omit callframe setup/destroy when not necessary Do not create CALLSEQ_START/CALLSEQ_END when there is no callframe to setup and the callframe size is 0. - Fixes an invalid callframe nesting for byval arguments, which would look like this before this patch (as in `big-byval.ll`): ... ADJCALLSTACKDOWN 32768, 0, ... # Setup for extfunc ... ADJCALLSTACKDOWN 0, 0, ... # setup for memcpy ... BL &memcpy ... ADJCALLSTACKUP 0, 0, ... # destroy for memcpy ... BL &extfunc ADJCALLSTACKUP 32768, 0, ... # destroy for extfunc - Saves us two instructions in the common case of zero-sized stackframes. - Remove an unnecessary scheduling barrier (hence the small unittest changes). Differential Revision: https://reviews.llvm.org/D42006 llvm-svn: 322917	2018-01-19 02:45:38 +00:00
Sam Clegg	20a4ea66e4	[WebAssembly] Add test expectations for gcc C++ tests (gcc/testsuite/g++.dg) Differential Revision: https://reviews.llvm.org/D42226 llvm-svn: 322915	2018-01-19 01:40:52 +00:00
Lang Hames	a44f59218b	[ORC] Revert r322913 while I investigate an ASan failure. llvm-svn: 322914	2018-01-19 01:40:26 +00:00
Lang Hames	6784638299	[ORC] Redesign the JITSymbolResolver interface to support bulk queries. Bulk queries reduce IPC/RPC overhead for cross-process JITing and expose opportunities for parallel compilation. The two new query methods are lookupFlags, which finds the flags for each of a set of symbols; and lookup, which finds the address and flags for each of a set of symbols. (See doxygen comments for more details.) The existing JITSymbolResolver class is renamed LegacyJITSymbolResolver, and modified to extend the new JITSymbolResolver class using the following scheme: - lookupFlags is implemented by calling findSymbolInLogicalDylib for each of the symbols, then returning the result of calling getFlags() on each of these symbols. (Importantly: lookupFlags does NOT call getAddress on the returned symbols, so lookupFlags will never trigger materialization, and lookupFlags will never call findSymbol, so only symbols that are part of the logical dylib will return results.) - lookup is implemented by calling findSymbolInLogicalDylib for each symbol and falling back to findSymbol if findSymbolInLogicalDylib returns a null result. Assuming a symbol is found its getAddress method is called to materialize it and the result (if getAddress succeeds) is stored in the result map, or the error (if getAddress fails) is returned immediately from lookup. If any symbol is not found then lookup returns immediately with an error. This change will break any out-of-tree derivatives of JITSymbolResolver. This can be fixed by updating those classes to derive from LegacyJITSymbolResolver instead. llvm-svn: 322913	2018-01-19 01:12:40 +00:00
Craig Topper	1c2f80aef6	[X86] Add intrinsic support for the RDPID instruction This adds a new instrinsic to support the rdpid instruction. The implementation is a bit weird because the intrinsic is defined as always returning 32-bits, but the assembler support thinks the instruction produces a 64-bit register in 64-bit mode. But really it zeros the upper 32 bits. So I had to add separate patterns where 64-bit mode uses an extract_subreg. Differential Revision: https://reviews.llvm.org/D42205 llvm-svn: 322910	2018-01-18 23:52:31 +00:00
Sanjay Patel	489c2f874a	[InstSimplify] regenerate checks and add tests for commutes; NFC llvm-svn: 322907	2018-01-18 23:11:24 +00:00
Changpeng Fang	1f9a550e2b	AMDGPU/SI: Fix typos in d16 support patch the buffer intrinsics. llvm-svn: 322906	2018-01-18 22:57:57 +00:00
Reid Kleckner	bcdfe567cb	[CodeView] Add line numbers for inlined call sites We did this for inline call site line tables, but we hadn't done it for regular function line tables yet. This patch copies that logic from encodeInlineLineTable. llvm-svn: 322905	2018-01-18 22:55:43 +00:00
Reid Kleckner	6c728206f3	[CodeView] Sink complex inline functions to .cpp file, NFC I'm cleaning up this code before I attempt to fix a line table bug. llvm-svn: 322904	2018-01-18 22:55:14 +00:00
Changpeng Fang	48df1ecec5	AMDGPU/SI: Add d16 support for image intrinsics. Summary: This patch implements d16 support for image load, image store and image sample intrinsics. Reviewers: Matt, Brian. Differential Revision: https://reviews.llvm.org/D3991 llvm-svn: 322903	2018-01-18 22:08:53 +00:00
Eric Christopher	2cb6bd8f84	Typo fix SIBABRT -> SIGABRT. Based on a patch by Henry Wong! llvm-svn: 322902	2018-01-18 21:45:51 +00:00
Martin Storsjo	e7e96e7196	[test] Actually check the common parts in CodeGen/ARM/global-merge-external.ll. NFC. Previously, these parts weren't ever checked. The label patterns need to be extended to match successfully on macho. Differential Revision: https://reviews.llvm.org/D42126 llvm-svn: 322900	2018-01-18 21:21:48 +00:00
Peter Collingbourne	2a40842c98	Support: Add missing #include. This #include is necessary to provide the definitions of _fpclass and _FPCLASS_NZ when building with libc++. llvm-svn: 322885	2018-01-18 20:49:33 +00:00
Paul Robinson	56dfed1023	[DWARFv5] Number the line-table's directory array correctly. The compilation directory has always been #0, but as of DWARF v5 it is explicitly listed in the line-table section instead of implicitly being a reference to the compile_unit DIE's DW_AT_comp_dir attribute. This means the dumper should number the dumped array starting with 0 or 1 depending on the DWARF version of the line table. References in the generated DWARF are correct, it's just the dumper that was wrong. Also some assembler-coded tests were similarly confused about directory numbers. llvm-svn: 322884	2018-01-18 20:33:35 +00:00
Sylvestre Ledru	60eadcb957	we have now https support for apt.llvm.org. Updating the URL llvm-svn: 322881	2018-01-18 19:57:35 +00:00
Dimitry Andric	502c6eb17b	Follow-up to rL322875 by initializing the do_libcxxabi variable properly. llvm-svn: 322879	2018-01-18 19:30:30 +00:00
Amara Emerson	8707267469	[AArch64][GlobalISel] Add isel support for global values in the large code model. Fixes PR35958. Differential Revision: https://reviews.llvm.org/D42175 llvm-svn: 322878	2018-01-18 19:21:27 +00:00
Simon Pilgrim	bc82314734	[X86][SSE] Regenerate vector promotion tests llvm-svn: 322877	2018-01-18 19:17:26 +00:00
Ana Pazos	168aa8b81a	[RISCV] Fixed setting predicates for compressed instructions. Summary: Fixed setting predicates for compressed instructions. Some instructions were being generated with C extension enabled only, without proper checks for the other required extensions like F, D and 32 and 64-bit target checks. Affected instructions: C_FLD, C_FLW, C_LD, C_FSD, C_FSW, C_SD, C_JAL, C_ADDIW, C_SUBW, C_ADDW, C_FLDSP, C_FLWSP, C_LDSP, C_FSDSP, C_FSWSP, C_SDSP Reviewers: asb, shiva0217 Reviewed By: asb Subscribers: rbar, johnrusso, simoncook, jordy.potman.lists, sabuasal, niosHD, llvm-commits Differential Revision: https://reviews.llvm.org/D42132 llvm-svn: 322876	2018-01-18 18:54:05 +00:00
Dimitry Andric	112ca3e80a	Add a -no-libcxxabi option to the test-release.sh script. On FreeBSD, it is currently not possible to build libcxxabi and link against it, so we have been building releases with -no-libs for quite some time. However, libcxx and libunwind should build without problems, so provide an option to skip just libcxxabi. llvm-svn: 322875	2018-01-18 18:39:13 +00:00
Simon Pilgrim	50f79f0b30	[X86][AVX] Add 256/512-bit slow PMULLD tests llvm-svn: 322874	2018-01-18 18:38:32 +00:00
Zachary Turner	af5c5622dd	Speed up iteration of CodeView record streams. There's some abstraction overhead in the underlying mechanisms that were being used, and it was leading to an abundance of small but not-free copies being made. This showed up on a profile. Eliminating this and going back to a low-level byte-based implementation speeds up lld with /DEBUG between 10 and 15%. Differential Revision: https://reviews.llvm.org/D42148 llvm-svn: 322871	2018-01-18 18:35:01 +00:00
Francis Visoiu Mistrih	a5ce5aa864	[CodeGen][NFC] Rename IsVerbose to IsStandalone in Machine*::print Committed r322867 too soon. Differential Revision: https://reviews.llvm.org/D42239 llvm-svn: 322868	2018-01-18 18:05:15 +00:00
Francis Visoiu Mistrih	0281a4fd10	[CodeGen] Print RegClasses on MI in verbose mode r322086 removed the trailing information describing reg classes for each register. This patch adds printing reg classes next to every register when individual operands/instructions/basic blocks are printed. In the case of dumping MIR or printing a full function, by default don't print it. Differential Revision: https://reviews.llvm.org/D42239 llvm-svn: 322867	2018-01-18 17:59:06 +00:00
Alexey Bataev	a0e48b2161	[SLP] Fix test checks, NFC. llvm-svn: 322865	2018-01-18 17:34:27 +00:00
Benjamin Kramer	716dfd78d8	[ADT] Just give up on GCC, I can't fix this. While the memmove workaround fixed it for GCC 6.3. GCC 4.8 and GCC 7.1 are still broken. I have no clue what's going on, just blacklist GCC for now. Needless to say this code is ubsan, asan and msan-clean. llvm-svn: 322862	2018-01-18 16:23:40 +00:00
Benjamin Kramer	fbb4905b3a	[ADT] Add a workaround for GCC miscompiling the trivially copyable Optional I've seen random crashes with GCC 4.8, GCC 6.3 and GCC 7.3, triggered by my Optional change. All of them affect a different set of targets. This change fixes the instance of the problem I'm seeing on my local machine, let's hope it's good enough for the other instances too. llvm-svn: 322859	2018-01-18 15:47:59 +00:00
Sanjay Patel	2bbd903d15	[TargetLowering] add punctuation for readability; NFC llvm-svn: 322855	2018-01-18 15:25:32 +00:00
Sam McCall	f4e0cbd2a4	[MachineOutliner] Fix r322788 - don't write to working directory llvm-svn: 322850	2018-01-18 15:02:28 +00:00
Joel Jones	df1734c950	[docs] Make ReleaseProcess.rst 80 column. NFCI llvm-svn: 322849	2018-01-18 14:57:55 +00:00
Francis Visoiu Mistrih	f2d53ff2dd	[CodeGen][NFC] Refactor MachineInstr::print * Handle more cases where the MI is not attached yet * Add similar asserts like in MIRPrinter::print llvm-svn: 322848	2018-01-18 14:52:14 +00:00
Benjamin Kramer	5e5fda04d8	[HWAsan] Fix uninitialized variable. Found by msan. llvm-svn: 322847	2018-01-18 14:19:04 +00:00
Simon Pilgrim	c0424a4452	[X86] Add PR35918 test case llvm-svn: 322846	2018-01-18 13:42:02 +00:00
Klaus Kretzschmar	5779bce977	test commit llvm-svn: 322844	2018-01-18 12:58:50 +00:00
Alex Bradbury	d376a8bad4	[RISCV] Codegen support for the standard RV32M instruction set extension llvm-svn: 322843	2018-01-18 12:36:38 +00:00
Alex Bradbury	60eef4726e	[RISCV] Implement frame pointer elimination llvm-svn: 322839	2018-01-18 11:34:02 +00:00
Benjamin Kramer	88a16a89f7	[ADT] Split optional to only include copy mechanics and dtor for non-trivial types. This makes uses of Optional more transparent to the compiler (and clang-tidy) and generates slightly smaller code. This is a re-land of r317019, which had issues with GCC 4.8 back then. Those issues don't reproduce anymore, but I'll watch the buildbots closely in case anything goes wrong. llvm-svn: 322838	2018-01-18 11:26:24 +00:00
Andrew V. Tischenko	71c29815dc	A new test to demostrate the current SHLD/SHRD code generation. llvm-svn: 322828	2018-01-18 10:40:48 +00:00
Alex Bradbury	55d398d618	[RISCV][NFC] Add nounwind to functions in div.ll and mul.ll Committing this separately to minimise irrelevant changes for an upcoming patch. llvm-svn: 322825	2018-01-18 09:41:14 +00:00
Sam Parker	a4a17bbc00	[SelectionDAG] Convert assert to condtion Follow-up to r322120 which can cause assertions for AArch64 because v1f64 and v1i64 are legal types. Differential Revision: https://reviews.llvm.org/D42097 llvm-svn: 322823	2018-01-18 09:22:24 +00:00
Craig Topper	672594bd71	[X86] Use vmovdqu64/vmovdqa64 for unmasked integer vector stores for consistency with loads. Previously we used 64 for vXi64 stores and 32 for everything else. This change uses 64 for everything just like do for loads. llvm-svn: 322820	2018-01-18 07:44:09 +00:00
Craig Topper	5585e2235d	[X86] Remove isel patterns for using unmasked vmovdqa32/vmovdqu32 for integer vector loads. These patterns were just looking for a vXi64 bitcasted to vXi32, but there is no advantage to using vmovdqa32 over vmovdqa64. llvm-svn: 322819	2018-01-18 07:44:06 +00:00
Clement Courbet	b50de8f5ac	Revert "Add a value_type to ArrayRef." clang OOMs on arm. This reverts commit a272b2f2ef63f7f602c9ef4d9e10dc4eb9f00aa1. llvm-svn: 322818	2018-01-18 07:26:34 +00:00
Craig Topper	77250fd2c5	[X86] Remove windows line endings from a test file. NFC llvm-svn: 322817	2018-01-18 06:47:09 +00:00
Rafael Espindola	7c97ded197	Don't drop dso_local in LTO. LTO sets dso_local as an optimization, so don't clear it. This avoid clearing it from undefined hidden symbols, which would then fail the verifier. llvm-svn: 322814	2018-01-18 05:38:43 +00:00
Craig Topper	d4310f3853	[DAGCombiner] Add a DAG combine to turn a splat build_vector where the splat elemnt is a bitcast from a vector type into a concat_vector For example, a build_vector of i64 bitcasted from v2i32 can be turned into a concat_vectors of the v2i32 vectors with a bitcast to a vXi64 type Differential Revision: https://reviews.llvm.org/D42090 llvm-svn: 322811	2018-01-18 04:17:06 +00:00
Rafael Espindola	bfea7bc2f8	Make GlobalValues with non-default visibilility dso_local. This is similar to r322317, but for visibility. It is not as neat because we have to special case extern_weak. The idea is the same as the previous change, make the transition to explicit dso_local easier for the frontends. With this they only have to add dso_local to symbols where we need some external information to decide if it is dso_local (like it being part of an ELF executable). llvm-svn: 322806	2018-01-18 02:08:23 +00:00
Justin Bogner	e1037058df	GlobalISel: Make MachineCSE runnable in the middle of the GlobalISel Right now, it is not possible to run MachineCSE in the middle of the GlobalISel pipeline. Being able to run generic optimizations between the core passes of GlobalISel was one of the goals of the new ISel framework. This is the first attempt to do it. The problem is that MachineCSE pass assumes all register operands have a register class, which, in GlobalISel context, won't be true until after the InstructionSelect pass. The reason for this behaviour is that before replacing one virtual register with another, MachineCSE pass (and most of the other optimization machine passes) must check if the virtual registers' constraints have a (sufficiently large) intersection, and constrain the resulting register appropriately if such intersection exists. GlobalISel extends the representation of such constraints from just a register class to a triple (low-level type, register bank, register class). This commit adds MachineRegisterInfo::constrainRegAttrs method that extends MachineRegisterInfo::constrainRegClass to such a triple. The idea is that going forward we should use: - RegisterBankInfo::constrainGenericRegister within GlobalISel's InstructionSelect pass - MachineRegisterInfo::constrainRegClass within SelectionDAG ISel - MachineRegisterInfo::constrainRegAttrs everywhere else regardless the target and instruction selector it uses. Patch by Roman Tereshin. Thanks! llvm-svn: 322805	2018-01-18 02:06:56 +00:00
Derek Schuff	e8d0b3a102	[WebAssembly] Remove duplicated RTLIB names Remove the tight coupling between llvm/CodeGenRuntimeLibcalls.def and the table of supported singatures for wasm. This will allow adding new libcalls without changing wasm's signature table. Also, some cleanup: Use ManagedStatics instead of const tables to avoid memory/binary bloat. Use a StringMap instead of a linear search for name lookup. Differential Revision: https://reviews.llvm.org/D35592 llvm-svn: 322802	2018-01-18 01:15:45 +00:00

1 2 3 4 5 ...

159070 Commits