llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 22:42:46 +02:00

Author	SHA1	Message	Date
Bradley Smith	2d2de3c295	[AArch64] Allow non-standard INS/DUP encodings The ARMv8 ARMARM states that for these instructions in A64 state: "Unspecified bits in "imm5" are ignored but should be set to zero by an assembler.", (imm4 for INS). Make the disassembler accept any encoding with these ignored bits set to 1. llvm-svn: 234896	2015-04-14 15:07:26 +00:00
Duncan P. N. Exon Smith	ac19c888b2	DebugInfo: Gut DIVariable and DIGlobalVariable Gut all the non-pointer API from the variable wrappers, except an implicit conversion from `DIGlobalVariable` to `DIDescriptor`. Note that if you're updating out-of-tree code, `DIVariable` wraps `MDLocalVariable` (`MDVariable` is a common base class shared with `MDGlobalVariable`). llvm-svn: 234840	2015-04-14 02:22:36 +00:00
Krzysztof Parzyszek	3efcf81e03	Allow memory intrinsics to be tail calls llvm-svn: 234764	2015-04-13 17:16:45 +00:00
Alexander Kornienko	71412ece39	Use 'override/final' instead of 'virtual' for overridden methods The patch is generated using clang-tidy misc-use-override check. This command was used: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py \ -checks='-*,misc-use-override' -header-filter='llvm\|clang' \ -j=32 -fix -format http://reviews.llvm.org/D8925 llvm-svn: 234679	2015-04-11 02:11:45 +00:00
Ahmed Bougacha	329fda6f2a	[CodeGen] Split -enable-global-merge into ARM and AArch64 options. Currently, there's a single flag, checked by the pass itself. It can't force-enable the pass (and is on by default), because it might not even have been created, as that's the target's decision. Instead, have separate explicit flags, so that the decision is consistently made in the target. Keep the flag as a last-resort "force-disable GlobalMerge" for now, for backwards compatibility. llvm-svn: 234666	2015-04-11 00:06:36 +00:00
Quentin Colombet	5841518be6	[AArch64] Strengthen the code for the prologue insertion. The spilled registers are pristine and thus, correctly handled by the register scavenger and so on, but the liveness information is strictly speaking wrong at this point. Fix that. llvm-svn: 234664	2015-04-10 23:14:34 +00:00
Chad Rosier	a7271b168a	[AArch64] Changes some SchedAlias to WriteRes for Cortex-A57. Using SchedAliases is convenient and works well for latency and resource lookup for instructions. However, this creates an entry in AArch64WriteLatencyTable with a WriteResourceID of 0, breaking any SchedReadAdvance since the lookup will fail. http://reviews.llvm.org/D8043 Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 234594	2015-04-10 13:19:27 +00:00
Chad Rosier	ae329f3d23	[AArch64] Adjusts Cortex-A57 machine model to handle zero shift. http://reviews.llvm.org/D8043 Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 234593	2015-04-10 13:19:21 +00:00
Benjamin Kramer	f6149322d4	Reduce dyn_cast<> to isa<> or cast<> where possible. No functional change intended. llvm-svn: 234586	2015-04-10 11:24:51 +00:00
Ahmed Bougacha	9e6b267c41	[AArch64] Promote f16 operations to f32. For the most common ones (such as fadd), we already did the promotion. Do the same thing for all the others. Currently, we'll just crash/assert on all these operations, as there's no hardware or libcall support whatsoever. f16 (half) is specified as an interchange - not arithmetic - format, and is expected to be promoted to single-precision for arithmetic operations. While there, teach the legalizer about promoting some of the (mostly floating-point) operations that we never needed before. Differential Revision: http://reviews.llvm.org/D8648 See related discussion on the thread for: http://reviews.llvm.org/D8755 llvm-svn: 234550	2015-04-10 00:08:48 +00:00
Juergen Ributzka	6f558fd68f	[AArch64][FastISel] Fix integer extend optimization. The integer extend optimization tries to fold the extend into the load instruction. This requires us to identify if the extend has already been emitted or not and act accordingly on it. The check that was originally performed for this was not sufficient. Besides checking the ValueMap for a mapped register we also need to check if the virtual register has already an associated machine instruction that defines it. This fixes rdar://problem/20470788. llvm-svn: 234529	2015-04-09 20:00:46 +00:00
Rafael Espindola	adc15d13f8	clang-format bits of code to make a followup patch easy to read. llvm-svn: 234519	2015-04-09 18:32:58 +00:00
Kristof Beyls	c065d4f7d5	[AArch64] Add support for dynamic stack alignment Differential Revision: http://reviews.llvm.org/D8876 llvm-svn: 234471	2015-04-09 08:49:47 +00:00
Lang Hames	0020ebeb89	[AArch64] Remove redundant -march option. Also fix a think-o from r234462. llvm-svn: 234467	2015-04-09 05:34:57 +00:00
Lang Hames	360efe3451	[AArch64] Teach AArch64TargetLowering::getOptimalMemOpType to consider alignment restrictions when choosing a type for small-memcpy inlining in SelectionDAGBuilder. This ensures that the loads and stores output for the memcpy won't be further expanded during legalization, which would cause the total number of instructions for the memcpy to exceed (often significantly) the inlining thresholds. <rdar://problem/17829180> llvm-svn: 234462	2015-04-09 03:40:33 +00:00
Tim Northover	75657ea420	AArch64: disallow "fmov sD, #-0.0" during assembly. We weren't checking the sign of the floating point immediate before translating it to "fmov sD, wzr". Similarly for D-regs. Technically "movi vD.2s, #0x80, lsl #24" would work most of the time, but it's not a blessed alias (and I don't think it should be since people expect writing sD to zero out the high lanes, and there's no dD equivalent). So an error it is. rdar://20455398 llvm-svn: 234372	2015-04-07 22:49:47 +00:00
Matthias Braun	4c77332de7	AArch64: Don't lower ISD::SELECT to ISD::SELECT_CC Instead of lowering SELECT to SELECT_CC which is further lowered later immediately call the SELECT_CC lowering code. This is preferable because: - Avoids an unnecessary roundtrip through the legalization queues with an intermediate node. - More importantly: Lowered operations get visited last leading to SELECT_CC getting visited with legalized operands and unlegalized ones for preexisting SELECT_CC nodes. This does not hurt the current code (hence no testcase) but is required for another patch I am working on. Differential Revision: http://reviews.llvm.org/D8187 llvm-svn: 234334	2015-04-07 17:33:05 +00:00
Rafael Espindola	d83c383098	Refactor a lot of duplicated code for stub output. This also moves it earlier so that they are produced before we print an end symbol for the data section. llvm-svn: 234315	2015-04-07 13:42:44 +00:00
Duncan P. N. Exon Smith	f41651ac8a	CodeGen: Stop using DIDescriptor::is*() and auto-casting Same as r234255, but for lib/CodeGen and lib/Target. llvm-svn: 234258	2015-04-06 23:27:40 +00:00
Quentin Colombet	fcd49dac2f	[AArch64] Add a comment to make it explicit why we increased the complexity. Follow-up of r233653. llvm-svn: 233936	2015-04-02 18:54:23 +00:00
Vladimir Sukharev	520cdd942a	[AArch64] Rename v8.1a from "extension" to "architecture" v8.1a is renamed to architecture, in line with the approach in the ARM backend. The excess generic cpu is removed. Intended use: "generic" cpu with "v8.1a" subtarget feature Reviewers: jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8766 llvm-svn: 233810	2015-04-01 14:49:29 +00:00
Quentin Colombet	574df40140	[AArch64] Enable the codegenprepare optimization that promotes operation to form extended loads. Implement the related target lowering hook so that the optimization has a better estimation of the cost of an extension. rdar://problem/19267165 llvm-svn: 233753	2015-03-31 20:52:32 +00:00
Vladimir Sukharev	22589e7b79	[AArch64] Add v8.1a "Rounding Double Multiply Add/Subtract" extension Reviewers: t.p.northover, jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8502 llvm-svn: 233693	2015-03-31 13:15:48 +00:00
Quentin Colombet	280dbbd452	[AArch64] Fix poor codegen for add immediate. We used to match the register variant before the immediate when the register argument could be implicitly zero-extended. llvm-svn: 233653	2015-03-31 00:31:13 +00:00
Eric Christopher	fdc8ea88a6	Replace the MCSubtargetInfo parameter with a Triple when creating an MCInstPrinter. Update all callers and use where we wanted a Triple previously. llvm-svn: 233648	2015-03-31 00:10:04 +00:00
Juergen Ributzka	33cfd96d53	Transfer implicit operands when expanding the RET_ReallyLR pseudo instruction. When we expand the RET_ReallyLR pseudo instruction we also need to transfer the implicit operands. The return register is an implicit operand and without it the liveness calculation generates an incorrect live-out set for the patchpoint. This fixes rdar://problem/19068476. llvm-svn: 233635	2015-03-30 22:45:56 +00:00
Eric Christopher	fa46d9d5da	Remove unused MCSubtargetInfo argument from the AArch64 MCInstPrinter ctors. llvm-svn: 233608	2015-03-30 21:52:26 +00:00
Eric Christopher	f6dc0ee979	Remove unused Target argument from MCInstPrinter ctor functions. llvm-svn: 233607	2015-03-30 21:52:21 +00:00
Akira Hatanaka	bbf66d7ddc	[AArch64InstPrinter] Use the feature bits of the subtarget passed to the print method. This enables the instprinter to print a different system register name based on the feature bits of the per-function subtarget. Differential Revision: http://reviews.llvm.org/D8668 llvm-svn: 233412	2015-03-27 20:37:20 +00:00
Akira Hatanaka	6a2e278ec7	[MCInstPrinter] Enable MCInstPrinter to change its behavior based on the per-function subtarget. Currently, code-gen passes the default or generic subtarget to the constructors of MCInstPrinter subclasses (see LLVMTargetMachine::addPassesToEmitFile), which enables some targets (AArch64, ARM, and X86) to change their instprinter's behavior based on the subtarget feature bits. Since the backend can now use different subtargets for each function, instprinter has to be changed to use the per-function subtarget rather than the default subtarget. This patch takes the first step towards enabling instprinter to change its behavior based on the per-function subtarget. It adds a bit "PassSubtarget" to AsmWriter which tells table-gen to pass a reference to MCSubtargetInfo to the various print methods table-gen auto-generates. I will follow up with changes to instprinters of AArch64, ARM, and X86. llvm-svn: 233411	2015-03-27 20:36:02 +00:00
Vladimir Sukharev	e42acd8cf3	[AArch64] Don't store available subtarget features in AArch64SysReg::SysRegMapper Subtarget features must not be a part of the target machine. So, they are now not being stored in SysRegMapper, but provided each time fromString()/toString() are called Reviewers: jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8655 llvm-svn: 233386	2015-03-27 17:11:29 +00:00
Vladimir Sukharev	7d42e2f8ac	[AArch64] Rename Pairs to Mappings in AArch64NamedImmMapper Third element is to be added soon to "struct AArch64NamedImmMapper::Mapping". So its instances are renamed from ...Pairs to ...Mappings Reviewers: jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8582 llvm-svn: 233300	2015-03-26 17:57:39 +00:00
Vladimir Sukharev	a6923c8e1b	[AArch64] Move initializations of AArch64NamedImmMapper out of void AArch64Operand::print(...) class AArch64NamedImmMapper is to become dependent on SubTargetFeatures, while class AArch64Operand doesn't have access to the latter. So, AArch64NamedImmMapper constructor invocations are refactored away from methods of AArch64Operand. Reviewers: jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8579 llvm-svn: 233297	2015-03-26 17:29:53 +00:00
Vladimir Sukharev	790efe2f48	[AArch64, ARM] Add v8.1a architecture and generic cpu New architecture and cpu added, following http://community.arm.com/groups/processors/blog/2014/12/02/the-armv8-a-architecture-and-its-ongoing-development Reviewers: t.p.northover Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8505 llvm-svn: 233290	2015-03-26 17:05:54 +00:00
Peter Collingbourne	c55866b5f0	AArch64: use a different means to determine whether to byte swap relocations. This code depended on a bug in the FindAssociatedSection function that would cause it to return the wrong result for certain absolute expressions. Instead, use EvaluateAsRelocatable. llvm-svn: 233119	2015-03-24 21:47:03 +00:00
David Blaikie	eefd19904e	Refactor: Simplify boolean expressions in AArch64 target Simplify boolean expressions using `true` and `false` with `clang-tidy` Patch by Richard Thomson. Reviewed By: rengolin Differential Revision: http://reviews.llvm.org/D8525 llvm-svn: 233089	2015-03-24 16:24:01 +00:00
Michael Kuperstein	1278cdeb94	Revert "Use std::bitset for SubtargetFeatures" This reverts commit r233055. It still causes buildbot failures (gcc running out of memory on several platforms, and a self-host failure on arm), although less than the previous time. llvm-svn: 233068	2015-03-24 12:56:59 +00:00
Michael Kuperstein	c6ff005c9e	Use std::bitset for SubtargetFeatures Previously, subtarget features were a bitfield with the underlying type being uint64_t. Since several targets (X86 and ARM, in particular) have hit or were very close to hitting this bound, switching the features to use a bitset. No functional change. The first time this was committed (r229831), it caused several buildbot failures. At least some of the ARM ones were due to gcc/binutils issues, and should now be fixed. Differential Revision: http://reviews.llvm.org/D8542 llvm-svn: 233055	2015-03-24 09:17:25 +00:00
Ahmed Bougacha	dda2ff1737	[AArch64, ARM] Enable GlobalMerge with -O3 rather than -O1. The pass used to be enabled by default with CodeGenOpt::Less (-O1). This is too aggressive, considering the pass indiscriminately merges all globals together. Currently, performance doesn't always improve, and, on code that uses few globals (e.g., the odd file- or function- static), more often than not is degraded by the optimization. Lengthy discussion can be found on llvmdev (AArch64-focused; ARM has similar problems): http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-February/082800.html Also, it makes tooling and debuggers less useful when dealing with globals and data sections. GlobalMerge needs to better identify those cases that benefit, and this will be done separately. In the meantime, move the pass to run with -O3 rather than -O1, on both ARM and AArch64. llvm-svn: 233024	2015-03-23 21:17:36 +00:00
Benjamin Kramer	6a9aa608f1	Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used. llvm-svn: 232998	2015-03-23 19:32:43 +00:00
Benjamin Kramer	45a545b9c6	Purge unused includes throughout libSupport. NFC. llvm-svn: 232976	2015-03-23 18:07:13 +00:00
Chad Rosier	c67eff5c3b	[AArch64] Enable rematerialization of float 0 values. Patch by Geoff Berry<gberry@codeaurora.org>. llvm-svn: 232967	2015-03-23 17:19:34 +00:00
Benjamin Kramer	d52ec1c0ec	Move private classes into anonymous namespaces NFC. llvm-svn: 232944	2015-03-23 12:30:58 +00:00
Daniel Sanders	698ec39776	[aarch64] Distinguish the 'Q' and 'm' inline assembly memory constraints. Summary: But still handle them the same way since I don't know how they differ on this target. Clang also has code for 'Ump', 'Utf', 'Usa', and 'Ush' but calls llvm_unreachable() on this code path so they are not converted to a constraint id at the moment. No functional change intended. Reviewers: t.p.northover Subscribers: aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D8177 llvm-svn: 232941	2015-03-23 11:33:15 +00:00
Eric Christopher	e8e68a5117	Remove the bare getSubtargetImpl call from the AArch64 port. As part of this add a test that shows we can generate code for functions that specifically enable a subtarget feature. llvm-svn: 232884	2015-03-21 04:04:50 +00:00
Ahmed Bougacha	6bc0aa2395	[AArch64] Prefer UZP for concat_vector of illegal truncs. Follow-up to r232459: prefer a UZP shuffle to the intermediate truncs. llvm-svn: 232871	2015-03-21 01:08:39 +00:00
Rafael Espindola	06353319f0	Don't declare all text sections at the start of the .s The code this patch removes was there to make sure the text sections went before the dwarf sections. That is necessary because MachO uses offsets relative to the start of the file, so adding a section can change relaxations. The dwarf sections were being printed at the start just to produce symbols pointing at the start of those sections. The underlying issue was fixed in r231898. The dwarf sections are now printed when they are about to be used, which is after we printed the text sections. To make sure we don't regress, the patch makes the MachO streamer assert if CodeGen puts anything unexpected after the DWARF sections. llvm-svn: 232842	2015-03-20 20:00:01 +00:00
John Brawn	2e601255af	[ARM] Fix handling of thumb1 out-of-range frame offsets LocalStackSlotPass assumes that isFrameOffsetLegal doesn't change its answer when the base register changes. Unfortunately this isn't true in thumb1, where SP-based loads allow a larger offset than non-SP-based loads, and this causes the base register reuse code to generate instructions that are unencodable, causing an assertion failure. Solve this by adding a BaseReg parameter to isFrameOffsetLegal, which ARMBaseRegisterInfo can then make use of to give the correct answer. Differential Revision: http://reviews.llvm.org/D8419 llvm-svn: 232825	2015-03-20 17:20:07 +00:00
Rafael Espindola	dcba9c010c	Split the object streamer callback in one per file format. There are two main advantages to doing this * Targets that only need to handle one of the formats specially don't have to worry about the others. For example, x86 now only registers a constructor for the COFF streamer. * Changes to the arguments passed to one format constructor will not impact the other formats. llvm-svn: 232699	2015-03-19 01:50:16 +00:00
Rafael Espindola	a6821e116c	two or more, use a for. llvm-svn: 232688	2015-03-18 23:15:49 +00:00

963 Commits