Commit Graph

3609 Commits

Benjamin Kramer
0f0bd365d8 Replace std::copy with a back inserter with vector append where feasible
All of the cases were just appending from random access iterators to a
vector. Using insert/append can grow the vector to the perfect size
directly and moves the growing out of the loop. No intended functionality
change.

llvm-svn: 230845
2015-02-28 10:11:12 +00:00
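As a rough sketch of the pattern r230845 describes (the container contents here are hypothetical): a single ranged insert lets the vector grow to its final size once, instead of reallocating repeatedly inside the copy loop.

  #include <algorithm>
  #include <iterator>
  #include <vector>

  // Before: back_inserter pushes one element at a time, growing Dst inside the loop.
  void appendCopy(std::vector<int> &Dst, const std::vector<int> &Src) {
    std::copy(Src.begin(), Src.end(), std::back_inserter(Dst));
  }

  // After: a ranged insert can size Dst for the whole range up front.
  void appendInsert(std::vector<int> &Dst, const std::vector<int> &Src) {
    Dst.insert(Dst.end(), Src.begin(), Src.end());
  }
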
Owen Anderson
5897fb359b Switch a std::map to a DenseMap in CodeGenRegisters.
The keys of the map are unique by pointer address, so there's no need
to use the llvm::less comparator. This allows us to use DenseMap
instead, which reduces tblgen time by 20% on my stress test.

llvm-svn: 230769
2015-02-27 17:57:01 +00:00
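A minimal sketch of the substitution described in r230769, with a stand-in value type; since pointer keys hash naturally, no llvm::less-style comparator is needed once DenseMap replaces std::map.

  #include "llvm/ADT/DenseMap.h"

  struct CodeGenRegister { unsigned EnumValue; };  // stand-in for the real class

  unsigned lookupWeight(const llvm::DenseMap<const CodeGenRegister *, unsigned> &Weights,
                        const CodeGenRegister *Reg) {
    auto It = Weights.find(Reg);                  // hashed lookup, no ordering required
    return It == Weights.end() ? 0 : It->second;
  }
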
Reid Kleckner
646fb547c4 Re-instate the pragma optimize hack for MSVC, but not clang-cl
Reverts commit r230686 with define modifications.

llvm-svn: 230692
2015-02-26 21:34:11 +00:00
Reid Kleckner
54fb8b9554 Remove stale pragma hack for an unsupported MSVC version
llvm-svn: 230686
2015-02-26 21:08:27 +00:00
Eric Christopher
63ec72f57a Fix a couple of depedent->dependent typos.
llvm-svn: 230584
2015-02-26 00:00:33 +00:00
Elena Demikhovsky
0e7ac15634 AVX-512: Gather and Scatter patterns
Gather and scatter instructions additionally write to one of the source operands - the mask register.
Gather therefore has two destination values - the loaded value and the mask.
Until now we did not support a code gen pattern for gather - the instruction was generated from
the intrinsic only and the machine node was hardcoded.
When we introduce the masked_gather node, we need to select the instruction automatically,
in the standard way.
I added a flag "hasTwoExplicitDefs" that allows handling two destination operands.

(Some code in X86InstrFragmentsSIMD.td is commented out, just to split one big
patch into many small patches.)

llvm-svn: 230471
2015-02-25 09:46:31 +00:00
Tim Northover
a640d920b4 CodeGen: convert CCState interface to using ArrayRefs
Everyone except R600 was manually passing the length of a static array
at each callsite, calculated in a variety of interesting ways. Far
easier to let ArrayRef handle that.

There should be no functional change, but out of tree targets may have
to tweak their calls as with these examples.

llvm-svn: 230118
2015-02-21 02:11:17 +00:00
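A generic illustration of the interface change in r230118 (the function and array names are made up): with ArrayRef the length travels with the pointer, so call sites no longer compute it by hand.

  #include "llvm/ADT/ArrayRef.h"

  static const unsigned ArgRegs[] = {3, 4, 5, 6};  // hypothetical register numbers

  // New-style callee: one parameter instead of pointer + length.
  unsigned firstFreeReg(llvm::ArrayRef<unsigned> Regs, unsigned Used) {
    return Used < Regs.size() ? Regs[Used] : 0;
  }

  unsigned example(unsigned Used) {
    // Before: firstFreeReg(ArgRegs, sizeof(ArgRegs) / sizeof(ArgRegs[0]), Used);
    return firstFreeReg(ArgRegs, Used);  // ArrayRef binds to the whole static array
  }
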
Michael Kuperstein
b590beb3d8 Reverting r229831 due to multiple ARM/PPC/MIPS build-bot failures.
llvm-svn: 229841
2015-02-19 11:38:11 +00:00
Michael Kuperstein
96956c5022 Use std::bitset for SubtargetFeatures
Previously, subtarget features were a bitfield with uint64_t as the underlying type.
Since several targets (X86 and ARM, in particular) have hit or are very close to hitting this bound, switch the features to use a bitset.

No functional change.

Differential Revision: http://reviews.llvm.org/D7065

llvm-svn: 229831
2015-02-19 09:01:04 +00:00
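A hedged sketch of what the bitset change enables (the feature names and the 96-bit width are invented for illustration): a uint64_t mask caps out at 64 features, while std::bitset can be sized to whatever the targets need.

  #include <bitset>

  enum : unsigned { FeatureSSE2 = 0, FeatureAVX = 1, MaxSubtargetFeatures = 96 };

  using FeatureBitset = std::bitset<MaxSubtargetFeatures>;  // was: a uint64_t mask

  void enableAVX(FeatureBitset &Bits) { Bits.set(FeatureAVX); }
  bool hasAVX(const FeatureBitset &Bits) { return Bits[FeatureAVX]; }
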
Aaron Ballman
7bf5e9ba92 MSVC 2013 does not ICE on this code in the same fashion that MSVC 2012 did; NFC.
llvm-svn: 229422
2015-02-16 19:33:36 +00:00
Aaron Ballman
0b45511a2e Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition.
llvm-svn: 229340
2015-02-15 22:54:22 +00:00
Craig Topper
dc76cc8405 [X86] Add the remaining 11 possible exact ModRM formats. This makes their encodings linear which can then be used to simplify some other code.
llvm-svn: 229279
2015-02-15 04:16:44 +00:00
Craig Topper
3b9a8648f4 [X86] Add support for parsing and printing the mnemonic aliases for the XOP VPCOM instructions.
llvm-svn: 229078
2015-02-13 07:42:25 +00:00
Benjamin Kramer
4b76aa3d46 MathExtras: Bring Count(Trailing|Leading)Ones and CountPopulation in line with countTrailingZeros
Update all callers.

llvm-svn: 228930
2015-02-12 15:35:40 +00:00
Elena Demikhovsky
40c204cf7d Masked Gather and Scatter Intrinsics.
Gather and Scatter are newly introduced intrinsics, coming after the recently implemented masked load and store.
This is the first patch for Gather and Scatter intrinsics. It includes only the syntax, parsing and verification.

Gather and Scatter intrinsics allow performing multiple memory accesses (read/write) in one vector instruction.
The intrinsics are not target specific and will have the following syntax:
Gather:
declare <16 x i32> @llvm.masked.gather.v16i32(<16 x i32*> <vector of ptrs>, i32 <alignment>, <16 x i1> <mask>, <16 x i32> <passthru>)
declare <8 x float> @llvm.masked.gather.v8f32(<8 x float*><vector of ptrs>, i32 <alignment>, <8 x i1> <mask>, <8 x float><passthru>)

Scatter:
declare void @llvm.masked.scatter.v8i32(<8 x i32><vector value to be stored> , <8 x i32*><vector of ptrs> , i32 <alignment>, <8 x i1> <mask>)
declare void @llvm.masked.scatter.v16i32(<16 x i32> <vector value to be stored> , <16 x i32*> <vector of ptrs>, i32 <alignment>, <16 x i1><mask> )

Vector of ptrs - a set of source/destination addresses to load/store the value.
Mask - switches vector lanes on/off to prevent memory access for switched-off lanes.
The vector of ptrs, the value and the mask must all have the same vector width.

These are code examples where gather / scatter should be used and will allow function vectorization:
;void foo1(int * restrict A, int * restrict B, int * restrict C) {
; for (int i=0; i<SIZE; i++) {
; A[i] = B[C[i]];
; }
;}

;void foo3(int * restrict A, int * restrict B) {
; for (int i=0; i<SIZE; i++) {
; A[B[i]] = i+5;
; }
;}

Tests will come in the following patches, with CodeGen and Vectorizer.

http://reviews.llvm.org/D7433

llvm-svn: 228521
2015-02-08 08:27:19 +00:00
David Blaikie
87c973c9d7 STLExtras: Provide less/equal functors with templated function call operators, plus a deref'ing functor template utility
Similar to the C++14 void specializations of these templates, useful as
a stop-gap until LLVM switches to '14.

Example use-cases in tblgen because I saw some functors that looked like
they could be simplified/refactored.

Reviewers: dexonsmith

Differential Revision: http://reviews.llvm.org/D7324

llvm-svn: 227828
2015-02-02 18:35:10 +00:00
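A rough stand-in for the utilities r227828 describes; the names below are hypothetical, not the actual STLExtras spellings. The comparator's templated call operator accepts any comparable operand types, and the deref wrapper applies it through pointers.

  #include <set>

  // Comparator with a templated call operator (C++14 std::less<void> style).
  struct LessValue {
    template <typename A, typename B> bool operator()(const A &L, const B &R) const {
      return L < R;
    }
  };

  // Applies another comparator through a layer of pointer indirection.
  template <typename Compare> struct Deref {
    Compare Cmp;
    template <typename T> bool operator()(const T *L, const T *R) const {
      return Cmp(*L, *R);
    }
  };

  // A set of pointers ordered by the pointed-to values, not by address.
  using IntPtrSet = std::set<int *, Deref<LessValue>>;
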
Owen Anderson
2a38e36679 Replace another std::set in the core of CodeGenRegister, this time with sorted arrays.
The hot path through this region of code does lots of batch inserts into sets. By storing them as sorted arrays, we can defer the sorting to the end of the batch, which is dramatically more efficient. This reduces tblgen runtime by 25% on my worst-case target.

llvm-svn: 227682
2015-01-31 09:13:36 +00:00
Owen Anderson
e0ceaab206 Change more of the guts of CodeGenRegister's RegUnit tracking to be based on bit vectors.
This is a continuation of my prior work to move some of the inner workings of CodeGenRegister to use bit vectors when computing register units. This is highly beneficial to TableGen runtime on targets with large, dense register files. This patch represents a ~40% runtime reduction over and above my earlier improvement on a stress test of this case.

llvm-svn: 227678
2015-01-31 07:49:41 +00:00
Owen Anderson
157d3f8255 Change a very hot piece of code in TableGen's register unit computations to use bit vectors rather than arrays.
For target descriptions with very large and very dense register files, TableGen
can take an extremely long time to run.  This change makes a dent in that (~15%
in my measurements) by accelerating the single hottest operation with better data
structures.

I believe there's still a lot of room to make this even faster with more global
changes that require replacing some of the existing data structures in this area
with bit vectors, but that's a more involved change and I wanted to get this
simpler improvement in first.

llvm-svn: 227562
2015-01-30 09:05:49 +00:00
Craig Topper
fdec9f588d [X86] Teach disassembler to handle illegal immediates on AVX512 integer compare instructions.
llvm-svn: 227302
2015-01-28 10:09:56 +00:00
Eric Christopher
aacfef65cf Move DataLayout back to the TargetMachine from TargetSubtargetInfo
derived classes.

Since global data alignment, layout, and mangling is often based on the
DataLayout, move it to the TargetMachine. This ensures that global
data is going to be laid out and mangled consistently if the subtarget
changes on a per function basis. Prior to this all targets(*) have
had subtarget dependent code moved out and onto the TargetMachine.

*One target hasn't been migrated as part of this change: R600. The
R600 port has, as a subtarget feature, the size of pointers and
this affects global data layout. I've currently hacked in a FIXME
to enable progress, but the port needs to be updated to either pass
the 64-bitness to the TargetMachine, or fix the DataLayout to
avoid subtarget dependent features.

llvm-svn: 227113
2015-01-26 19:03:15 +00:00
Craig Topper
011934eb9c [X86] Replace i32i8imm on SSE/AVX instructions with i32u8imm which will make the assembler bounds check them. It will also make them print as unsigned.
llvm-svn: 227032
2015-01-25 02:21:16 +00:00
Ramkumar Ramachandra
550e92d3f7 Intrinsics: introduce llvm_any_ty aka ValueType Any
Specifically, gc.result benefits from this greatly. Instead of:

gc.result.int.*
gc.result.float.*
gc.result.ptr.*
...

We now have a gc.result.* that can specialize to literally any type.

Differential Revision: http://reviews.llvm.org/D7020

llvm-svn: 226857
2015-01-22 20:14:38 +00:00
Craig Topper
39f463653a [X86] Convert all the i8imm used by SSE and AVX instructions to u8imm.
This makes the assembler check their size and removes a hack from the disassembler to avoid sign extending the immediate.

llvm-svn: 226645
2015-01-21 08:15:54 +00:00
Alexander Kornienko
66580103e2 Replace size() method calls on containers with empty() where appropriate
This patch was generated by a clang-tidy checker that is being open sourced.
The documentation of that checker is the following:

/// The emptiness of a container should be checked using the empty method
/// instead of the size method. It is not guaranteed that size is a
/// constant-time function, and it is generally more efficient and also shows
/// clearer intent to use empty. Furthermore some containers may implement the
/// empty method but not implement the size method. Using empty whenever
/// possible makes it easier to switch to another container in the future.

Patch by Gábor Horváth!

llvm-svn: 226161
2015-01-15 11:41:30 +00:00
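The shape of the rewrite the checker performs, in generic form (the container here is just an example):

  #include <list>

  bool hasWork(const std::list<int> &Queue) {
    // Before: return Queue.size() > 0;   // size() is not guaranteed O(1) everywhere
    return !Queue.empty();                // empty() is always constant time
  }
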
Hal Finkel
0a5f201705 [TableGen] Add support for negative immediates to AsmMatcherEmitter
This adds support for creating an InstAlias with a negative immediate, i.e.:

  def NOT : InstAlias<"not $dst, $src", (XORI GR32:$dst, GR32:$src, -1)>;

by resolving this problem:

RISCVGenAsmMatcher.inc:95:11: error: expected '= constant-expression' or end of enumerator definition
  CVT_imm_-1,
  ^^^^^^^^^^

Patch by Jordy Potman, thanks!

llvm-svn: 226073
2015-01-15 01:33:00 +00:00
Reid Kleckner
002e480f22 Add the llvm.frameallocate and llvm.recoverframeallocation intrinsics
These intrinsics allow multiple functions to share a single stack
allocation from one function's call frame. The function with the
allocation may only perform one allocation, and it must be in the entry
block.

Functions accessing the allocation call llvm.recoverframeallocation with
the function whose frame they are accessing and a frame pointer from an
active call frame of that function.

These intrinsics are very difficult to inline correctly, so the
intention is that they be introduced rarely, or at least very late
during EH preparation.

Reviewers: echristo, andrew.w.kaylor

Differential Revision: http://reviews.llvm.org/D6493

llvm-svn: 225746
2015-01-13 00:48:10 +00:00
Tom Stellard
783c3c0b2c Target: Allow target specific operand types
This adds two new fields to the RegisterOperand TableGen class:

string OperandNamespace = "MCOI";
string OperandType = "OPERAND_REGISTER";

These fields can be used to specify a target specific operand type,
which will be stored in the OperandType member of the MCOperandInfo
object.

This can be useful for targets that need to store some extra information
about operands that cannot be expressed using the target independent
types.  For example, in the R600 backend, there are operands which
can take either registers or immediates and it is convenient to be able
to specify this in the TableGen definitions.

llvm-svn: 225661
2015-01-12 19:33:09 +00:00
Craig Topper
ac00edab84 [X86] Don't print 'dword ptr' or 'qword ptr' on the operand to some of the LEA variants in Intel syntax. The memory operand is inherently unsized.
llvm-svn: 225432
2015-01-08 07:41:30 +00:00
Craig Topper
adb8acdc57 [X86] Remove some unused TYPE enums from the disassembler.
llvm-svn: 225343
2015-01-07 07:47:52 +00:00
Craig Topper
4bf5d86cad [X86] Add OpSize32 to XBEGIN_4. Add XBEGIN_2 with OpSize16.
Requires new AsmParserOperand types that detect 16-bit and 32/64-bit mode so that we choose the right instruction based on default sizing without predicates. This is necessary since predicates mess up the disassembler table building.

llvm-svn: 225256
2015-01-06 08:59:30 +00:00
Craig Topper
44b2a9bdec [X86] Remove unused operand type from disassembler handling. NFC
llvm-svn: 225151
2015-01-05 08:18:52 +00:00
Craig Topper
3ff5d48f0f Fix some formatting in tablegen output.
llvm-svn: 225113
2015-01-03 08:16:29 +00:00
Craig Topper
53453556cd Replace some 'unreachable' comments with llvm_unreachable.
llvm-svn: 225112
2015-01-03 08:16:14 +00:00
Craig Topper
025fc67ed2 [X86] Disassembler support for move to/from %rax with a 32-bit memory offset when REX.W and the AdSize prefix are both present.
llvm-svn: 225099
2015-01-03 00:00:20 +00:00
Craig Topper
1b3e85d67c [X86] Make the instructions that use AdSize16/32/64 co-exist together without using mode predicates.
This is necessary to allow the disassembler to be able to handle AdSize32 instructions in 64-bit mode when address size prefix is used.

Eventually we should probably also support 'addr32' and 'addr16' in the assembler to override the address size on some of these instructions. But for now we'll just use special operand types that will look up the current mode size to select the right instruction.

llvm-svn: 225075
2015-01-02 07:02:25 +00:00
Craig Topper
f189a728be [X86] Fix disassembly of absolute moves to work correctly in 16 and 32-bit modes with all 4 combinations of OpSize and AdSize prefixes being present or not.
llvm-svn: 225036
2014-12-31 07:07:31 +00:00
Elena Demikhovsky
4a153fb55a Masked Load/Store - Changed the order of parameters in intrinsics.
No functional changes.
The documentation is coming.

llvm-svn: 224829
2014-12-25 07:49:20 +00:00
Craig Topper
5f1e825f93 [X86] Remove the single AdSize indicator and replace it with separate AdSize16/32/64 flags.
This removes a hardcoded list of instructions in the CodeEmitter. Eventually I intend to remove the predicates on the affected instructions since in any given mode two of them are valid if we supported addr32/addr16 prefixes in the assembler.

llvm-svn: 224809
2014-12-24 06:05:22 +00:00
David Blaikie
fc0fed2e9a Use iterators rather than indices to make this forwards-compatible with a change to the underlying container (to std::list)
llvm-svn: 224734
2014-12-22 21:26:38 +00:00
David Blaikie
856eebedff unique_ptrify MatchableInfo(const CodeGenInstAlias *Alias)'s parameter
llvm-svn: 224733
2014-12-22 21:26:26 +00:00
Evgeny Astigeevich
41308b1f31 On behalf of Matthew Wahab:
An instruction alias defined with InstAlias and an optional operand in the
middle of the AsmString field, "..${a} <operands>", would get the final
"}" printed in the instruction disassembly. This wouldn't happen if the optional
operand appeared as the last item in the AsmString which is how the current
backends avoided the problem.

There don't appear to be any tests for this part of Tablegen but it passes the
pre-commit tests. Manually tested the change by enabling the generic alias
printer in the ARM backend and checking the output.

Differential Revision: http://reviews.llvm.org/D6529

llvm-svn: 224348
2014-12-16 18:16:17 +00:00
Ahmed Bougacha
6c3e1c0f56 [MC] Reset the MCInst in the matcher function before adding opcode/operands.
On X86, the Intel asm parser tries to match all memory operand sizes when
none is explicitly specified.  For LEA, which doesn't really have a memory
operand (just a pointer one), this results in multiple successful matches,
one for each memory size.  There's no error because it's same opcode, so
really, it's just one match.  However, the tablegen'd matcher function
adds opcode/operands to the passed MCInst, and this results in multiple
duplicated operands.

This commit clears the MCInst in the tablegen'd matcher function.
We sometimes clear it when the match failed, so there's no expectation of
keeping the previous content anyway.

Differential Revision: http://reviews.llvm.org/D6670

llvm-svn: 224347
2014-12-16 18:05:28 +00:00
Craig Topper
7be100deb9 Use unique_ptr to remove explicit delete.
llvm-svn: 224224
2014-12-15 00:40:07 +00:00
Craig Topper
4e9e1df95a Use range-based for loops.
llvm-svn: 224187
2014-12-13 05:12:19 +00:00
Michael Ilseman
6d636ee500 Clean up static analyzer warnings.
Clang's static analyzer found several potential cases of undefined
behavior, use of un-initialized values, and potentially null pointer
dereferences in tablegen, Support, MC, and ADT. This cleans them up
with specific assertions on the assumptions of the code.

llvm-svn: 224154
2014-12-12 21:48:03 +00:00
Matt Arsenault
022dacf996 Try to include operand counts for mismatches in tablegen.
llvm-svn: 224068
2014-12-11 22:27:14 +00:00
Craig Topper
9a511af4f9 Use range-based for loops. NFC
llvm-svn: 224005
2014-12-11 07:04:54 +00:00
Craig Topper
c4b33849c6 Use unique_ptr instead of DeleteContainerSeconds.
llvm-svn: 223918
2014-12-10 06:18:57 +00:00
Matthias Braun
154c44f060 Tablegen'erate lanemasks for register units.
Now we can relate lanemasks in a virtual register to register units.

llvm-svn: 223889
2014-12-10 01:12:56 +00:00
Matthias Braun
ab5b739bb2 Add function that translates subregister lane masks to other subregs.
This works like the composeSubRegisterIndices() function but transforms
a subregister lane mask instead of a subregister index.

llvm-svn: 223874
2014-12-10 01:12:00 +00:00
Matthias Braun
b86ee4fb38 Let tablegen compute maximum lanemask for regs/regclasses.
Let tablegen compute the combination of subregister lanemasks for all
subregisters in a register/register class. This is preparation for further
work on subregister allocation.

llvm-svn: 223873
2014-12-10 01:11:56 +00:00
Craig Topper
cf39241d17 Use range-based for loops. NFC.
llvm-svn: 223762
2014-12-09 08:05:51 +00:00
Elena Demikhovsky
befed29343 Masked Load / Store Intrinsics - the CodeGen part.
I'm recommiting the codegen part of the patch.
The vectorizer part will be send to review again.

Masked Vector Load and Store Intrinsics.
Introduced new target-independent intrinsics in order to support masked vector loads and stores. The loop vectorizer optimizes loops containing conditional memory accesses by generating these intrinsics for existing targets AVX2 and AVX-512. The vectorizer asks the target about availability of masked vector loads and stores.
Added SDNodes for masked operations and lowering patterns for X86 code generator.
Examples:
<16 x i32> @llvm.masked.load.v16i32(i8* %addr, <16 x i32> %passthru, i32 4 /* align */, <16 x i1> %mask)
declare void @llvm.masked.store.v8f64(i8* %addr, <8 x double> %value, i32 4, <8 x i1> %mask)

Scalarizer for other targets (not AVX2/AVX-512) will be done in a separate patch.

http://reviews.llvm.org/D6191

llvm-svn: 223348
2014-12-04 09:40:44 +00:00
David Blaikie
09f48eb35e range-for some things
llvm-svn: 223263
2014-12-03 19:58:49 +00:00
David Blaikie
904349ae77 Simplify CodeGenRegBank::inferMatchingSuperRegClass & its caller by passing an iterator rather than index
llvm-svn: 223262
2014-12-03 19:58:48 +00:00
David Blaikie
2afe696d92 Simplify ownership of RegClasses by using list<CodeGenRegisterClass> instead of vector<CodeGenRegisterClass*>
This complicates a few algorithms due to not having random access, but
not by a huge degree I don't think (open to debate/design
discussion/etc).

llvm-svn: 223261
2014-12-03 19:58:45 +00:00
David Blaikie
afd7d9143a Range-for some stuff related to RegClasses, and comment cases where range-for isn't suitable.
llvm-svn: 223260
2014-12-03 19:58:41 +00:00
Philip Reames
d056135ae1 [Statepoints 2/4] Statepoint infrastructure for garbage collection: MI & x86-64 Backend
This is the second patch in a small series.  This patch contains the MachineInstruction and x86-64 backend pieces required to lower Statepoints.  It does not include the code to actually generate the STATEPOINT machine instruction and as a result, the entire patch is currently dead code.  I will be submitting the SelectionDAG parts within the next 24-48 hours.  Since those pieces are by far the most complicated, I wanted to minimize the size of that patch.  That patch will include the tests which exercise the functionality in this patch.  The entire series can be seen as one combined whole in http://reviews.llvm.org/D5683.

The STATEPOINT pseudo node is generated after all gc values are explicitly spilled to stack slots.  The purpose of this node is to wrap an actual call instruction while recording the spill locations of the meta arguments used for garbage collection and other purposes.  The STATEPOINT is modeled as modifying all of those locations to prevent backend optimizations from forwarding the value from before the STATEPOINT to after the STATEPOINT.  (Doing so would break relocation semantics for collectors which wish to relocate roots.)

The implementation of STATEPOINT is closely modeled on PATCHPOINT.  Eventually, much of the code in this patch will be removed.  The long term plan is to merge the functionality provided by statepoints and patchpoints.  Merging their implementations in the backend is likely to be a good starting point.

Reviewed by: atrick, ributzka

llvm-svn: 223085
2014-12-01 22:52:56 +00:00
David Blaikie
120c878c24 Remove some unnecessary vector::reserve/assign calls.
llvm-svn: 222959
2014-11-29 18:13:42 +00:00
David Blaikie
f94c9b7b2c Remove indirection of vector<T*> in favor of deque<T>
llvm-svn: 222958
2014-11-29 18:13:39 +00:00
David Blaikie
76b89029d3 Use deque<T> rather than vector<T*> since it provides the same invalidation semantics (at least when removal is not needed) without the extra indirection/ownership complexity
Order matters for this container, it seems (using a forward_list and
replacing the original push_backs with emplace_fronts caused test
failures). I didn't look too deeply into why.

(& in retrospect, I might go back & change some of the forward_lists I
introduced to deques anyway - since most don't require removal, deque is
a more memory-friendly data structure (moderate locality while not
invalidating pointers))

llvm-svn: 222950
2014-11-29 07:04:51 +00:00
David Blaikie
5fe3d4e324 Constify some things in preparation for CodeGenSubRegIndex to be stored by value in their container, removing the indirection
llvm-svn: 222949
2014-11-29 07:04:49 +00:00
Duncan P. N. Exon Smith
42a532db38 Revert "Simplify some more ownership using forward_list<T> rather than vector<unique_ptr<T>>"
This reverts commit r222935 and its follow-up r222938 ("Push unique_ptr
a bit further through some APIs and simplify some cleanup"), since it
causes bot failures (at least on Darwin):

http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental/1391/

llvm-svn: 222943
2014-11-28 23:00:22 +00:00
David Blaikie
271a5098b8 Reapply "Use std::map<K, V> rather than std::map<K, std::unique_ptr<V>>""
Just avoid using std::map::emplace since it's not implemented in
libstdc++ 4.7.

Reapplies r222937, reverted in r222939.

llvm-svn: 222940
2014-11-28 22:15:06 +00:00
David Blaikie
e1a8cc1858 Revert "Use std::map<K, V> rather than std::map<K, std::unique_ptr<V>>"
Seems libstdc++ on some buildbots is lacking std::map::emplace, which is
weird... reverting while I look into it.

This reverts commit r222937.

llvm-svn: 222939
2014-11-28 22:01:06 +00:00
David Blaikie
811a6ceb6a Push unique_ptr a bit further through some APIs and simplify some cleanup
llvm-svn: 222938
2014-11-28 21:59:58 +00:00
David Blaikie
d8632be513 Use std::map<K, V> rather than std::map<K, std::unique_ptr<V>>
Pointers and references to map elements are never invalidated (except on
removal, which isn't used here) so there's no need for the indirection
unless there's polymorphism at work.

A little const correctness had to be fixed, since the indirection
allowed some benign const violations.

llvm-svn: 222937
2014-11-28 21:37:54 +00:00
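A small sketch of the property r222937 relies on, with an invented value type: references into a std::map are not invalidated by later insertions, so the mapped values can be stored directly instead of behind unique_ptr.

  #include <map>
  #include <string>

  struct Info { int Count = 0; };  // hypothetical mapped type

  Info &getOrCreate(std::map<std::string, Info> &M, const std::string &Key) {
    Info &Slot = M[Key];   // reference to the mapped value...
    M["other"];            // ...stays valid even after further insertions
    return Slot;
  }
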
Duncan P. N. Exon Smith
73ce6dbb2b Revert "Masked Vector Load and Store Intrinsics."
This reverts commit r222632 (and follow-up r222636), which caused a host
of LNT failures on an internal bot.  I'll respond to the commit on the
list with a reproduction of one of the failures.

Conflicts:
	lib/Target/X86/X86TargetTransformInfo.cpp

llvm-svn: 222936
2014-11-28 21:29:14 +00:00
David Blaikie
79cc429e36 Simplify some more ownership using forward_list<T> rather than vector<unique_ptr<T>>
llvm-svn: 222935
2014-11-28 21:20:24 +00:00
David Blaikie
c5c9704103 Forgotten formatting from previous commit
llvm-svn: 222934
2014-11-28 21:20:22 +00:00
David Blaikie
a82ec79fa6 Simplify ownership by using forward_list<T> rather than vector<unique_ptr<T>>
Since the elements were not polymorphic, the unique_ptr was only used to
avoid pointer invalidation on container resizes - might as well skip the
indirection and use a container with suitable invalidation semantics.

llvm-svn: 222931
2014-11-28 20:35:57 +00:00
Craig Topper
7e15ad8915 Fix a few memory leaks in CodeGenRegBank.
llvm-svn: 222930
2014-11-28 20:30:39 +00:00
Craig Topper
21a312821d Use unique_ptr to simplify deletion.
llvm-svn: 222929
2014-11-28 20:30:37 +00:00
Craig Topper
acd4727821 Fix another memory leak in TableGen AsmMatcher by deleting CodeGenInstAliases.
llvm-svn: 222912
2014-11-28 05:01:21 +00:00
Craig Topper
4948539472 Use unique_ptr to fix some memory leaks in Tablegen AsmMatcherEmitter.
llvm-svn: 222909
2014-11-28 03:53:02 +00:00
Craig Topper
11a804fae2 Use range-based for loops and const-correct a few things.
llvm-svn: 222908
2014-11-28 03:53:00 +00:00
Craig Topper
886524ee76 Remove neverHasSideEffects support from TableGen CodeGenInstruction. Everyone should use hasSideEffects now.
llvm-svn: 222809
2014-11-26 04:11:14 +00:00
Craig Topper
5cbe015db2 Remove unnecessary check for Int_* and *_Int in AsmMatcherEmitter. These are all marked isCodeGenOnly these days.
llvm-svn: 222783
2014-11-25 20:11:34 +00:00
Craig Topper
936503a138 Use range-based for loops.
llvm-svn: 222782
2014-11-25 20:11:31 +00:00
Craig Topper
c1da102d02 Remove dead code.
llvm-svn: 222781
2014-11-25 20:11:29 +00:00
Craig Topper
7a0892403a Remove unused MaxSize variable.
llvm-svn: 222780
2014-11-25 20:11:27 +00:00
Craig Topper
6d0c335936 Move a vector instead of copying it.
llvm-svn: 222779
2014-11-25 20:11:25 +00:00
Craig Topper
c4009eedfd Replace a comment that says 'unreachable' with llvm_unreachable in TableGen AsmWriter output.
llvm-svn: 222650
2014-11-24 14:09:52 +00:00
Craig Topper
b0d4991215 Detect best type for some static index tables instead of just using uint32_t to reduce total data size.
llvm-svn: 222643
2014-11-24 02:08:35 +00:00
Craig Topper
3199a0602e Tablegen output formatting fixes.
llvm-svn: 222633
2014-11-23 09:40:10 +00:00
Elena Demikhovsky
36a2243ab7 Masked Vector Load and Store Intrinsics.
Introduced new target-independent intrinsics in order to support masked vector loads and stores. The loop vectorizer optimizes loops containing conditional memory accesses by generating these intrinsics for existing targets AVX2 and AVX-512. The vectorizer asks the target about availability of masked vector loads and stores.
Added SDNodes for masked operations and lowering patterns for X86 code generator.
Examples:
<16 x i32> @llvm.masked.load.v16i32(i8* %addr, <16 x i32> %passthru, i32 4 /* align */, <16 x i1> %mask)
declare void @llvm.masked.store.v8f64(i8* %addr, <8 x double> %value, i32 4, <8 x i1> %mask)

Scalarizer for other targets (not AVX2/AVX-512) will be done in a separate patch.

http://reviews.llvm.org/D6191

llvm-svn: 222632
2014-11-23 08:07:43 +00:00
Craig Topper
9b0aeb935e Reduce size of some tables in tablegen register info output.
Primarily done by using SequenceToOffsetTable to reduce the register pressure set tables and then sizing the indices into the tables appropriately. Size a few other table entries based on content as well. Reduces X86RegisterInfo.o by ~9k.

llvm-svn: 222621
2014-11-22 18:30:18 +00:00
Craig Topper
b487f2dfc6 Add extra new line and remove some trailing whitespace from tablegen RegisterInfo output file.
llvm-svn: 222508
2014-11-21 05:58:14 +00:00
Craig Topper
65bda63c2e Remove unnecessary extra spaces from tablegen register info output.
llvm-svn: 222411
2014-11-20 05:22:35 +00:00
Craig Topper
ca7045d052 Use array_lengthof instead of sizeof(array)/sizeof(element) in a tablegen output.
llvm-svn: 222410
2014-11-20 05:22:32 +00:00
David Blaikie
60e6c80905 Update SetVector to rely on the underlying set's insert to return a pair<iterator, bool>
This is to be consistent with StringSet and ultimately with the standard
library's associative container insert function.

This lead to updating SmallSet::insert to return pair<iterator, bool>,
and then to update SmallPtrSet::insert to return pair<iterator, bool>,
and then to update all the existing users of those functions...

llvm-svn: 222334
2014-11-19 07:49:26 +00:00
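The idiom this change enables, sketched with a made-up element type: insert() now reports whether the element was newly added, matching the standard associative containers.

  #include "llvm/ADT/SmallPtrSet.h"

  struct Node { int Id; };  // hypothetical element type

  bool markVisited(llvm::SmallPtrSet<Node *, 16> &Visited, Node *N) {
    return Visited.insert(N).second;  // true only on the first insertion
  }
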
David Blaikie
7499cbae4c Remove StringMap::GetOrCreateValue in favor of StringMap::insert
Having two ways to do this doesn't seem terribly helpful and
consistently using the insert version (which we already has) seems like
it'll make the code easier to understand to anyone working with standard
data structures. (I also updated many references to the Entry's
key and value to use first() and second instead of getKey{Data,Length,}
and get/setValue - for similar consistency)

Also removes the GetOrCreateValue functions so there's less surface area
to StringMap to fix/improve/change/accommodate move semantics, etc.

llvm-svn: 222319
2014-11-19 05:49:42 +00:00
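A hedged sketch of the insert-based replacement described above (the table and its contents are hypothetical): insert() either creates the entry or returns the existing one, covering what GetOrCreateValue used to do.

  #include "llvm/ADT/StringMap.h"
  #include <utility>

  unsigned internSymbol(llvm::StringMap<unsigned> &Table, llvm::StringRef Name,
                        unsigned NextId) {
    auto Result = Table.insert(std::make_pair(Name, NextId));
    // Result.second is true if Name was absent; either way Result.first points
    // at the entry, whose key and value are reached via first() and second.
    return Result.first->second;
  }
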
David Blaikie
79f6f3ce15 Make StringSet::insert return pair<iterator, bool> like other self-associative containers
StringSet is still a bit dodgy in that it exposes the raw iterator of
the StringMap parent, which exposes the weird detail that StringSet
actually has a 'value'... but anyway, this is useful for a handful of
clients that want to reference the newly inserted/persistent string data
in the StringSet/Map/Entry/thing.

llvm-svn: 222302
2014-11-19 02:56:00 +00:00
David Blaikie
6fac8481fc Revert "Improve memory ownership/management in TableGen by unique_ptrifying TreePattern's Tree member."
This reverts commit r222183.

Broke on the MSVC buildbots due to MSVC not producing default move
operations - I'd fix it immediately but just broke my build system a
bit, so backing out until I have a chance to get everything going again.

llvm-svn: 222187
2014-11-17 22:55:41 +00:00
David Blaikie
56647fa3d9 Improve memory ownership/management in TableGen by unique_ptrifying TreePattern's Tree member.
The next step is to actually use unique_ptr in TreePatternNode's
Children vector. That will be more intrusive, and may not work,
depending on exactly how these things are handled (I have a bad
suspicion things are shared more than they should be, making this more
DAG than tree - but if it's really a tree, unique_ptr should suffice)

llvm-svn: 222183
2014-11-17 22:16:55 +00:00
Craig Topper
5b6e56da60 Move register class name strings to a single array in MCRegisterInfo to reduce static table size and number of relocation entries.
Indices into the table are stored in each MCRegisterClass instead of a pointer. A new method, getRegClassName, is added to MCRegisterInfo and TargetRegisterInfo to look up the string in the table.

llvm-svn: 222118
2014-11-17 05:50:14 +00:00
David Blaikie
d0f57445b1 Turn a leaked object into a stack variable instead.
llvm-svn: 222046
2014-11-14 21:53:50 +00:00
Bill Schmidt
66f498a584 Change order of tablegen generated fast-isel instruction code to be
based on instruction complexity

The order that tablegen fast-isel instruction code is generated is
currently based on the text of the predicate (using string
less-than). This patch changes this to instead use the instruction
complexity. Because the complexities are not unique, a C++ multimap is
used instead of a map.

This fixes the problem where code with no predicate always comes out
first (the empty string always compares as less than all other
strings) thus making the code with predicates dead code. See the FMUL
code in PPCFastISel.cpp for an example. It also more closely matches
the normal codegen ordering. Some error checking in the tablegen
fast-isel code is fixed as well.

Patch by Bill Seurer.

llvm-svn: 222038
2014-11-14 21:05:45 +00:00
David Blaikie
9aaa1dd2e1 Fix nested namespace with decltype to hopefully work with MSVC
Build failed here:
http://lab.llvm.org:8011/builders/lld-x86_64-win7/builds/14629/steps/build_Lld/logs/stdio

So I'm taking a shot in the dark that MSVC (whatever version that is)
can't cope with nested name specifiers with a decltype prefix.

llvm-svn: 221931
2014-11-13 21:56:57 +00:00
David Blaikie
6d609b52ab Use unique_ptr to handle ownership of TreePatterns in CodeGenDAGPatterns::PatternFragments
We might be able to use unique_ptr to handle ownership of the
TreePatternNodes too - looking into that next.

llvm-svn: 221928
2014-11-13 21:40:02 +00:00
Matt Arsenault
f24b2619a2 Make TreePattern::error use Twine
The underlying error function already uses a Twine,
and most of the uses build up strings.

llvm-svn: 221740
2014-11-11 23:48:11 +00:00
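Illustrative only, not the actual TreePattern::error call sites: a Twine lets the message be assembled from mixed string and integer pieces inside a single expression, without building intermediate std::strings.

  #include "llvm/ADT/StringRef.h"
  #include "llvm/ADT/Twine.h"
  #include <string>

  std::string operandCountError(llvm::StringRef Op, unsigned Expected, unsigned Got) {
    // The whole Twine tree lives only for this one expression, which is the
    // intended usage pattern.
    return (llvm::Twine("'") + Op + "' expects " + llvm::Twine(Expected) +
            " operands, got " + llvm::Twine(Got)).str();
  }
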
Rafael Espindola
476a83ecd4 MCAsmParserExtension has a copy of the MCAsmParser. Use it.
Base classes were storing a second copy.

llvm-svn: 221667
2014-11-11 05:18:41 +00:00
Matt Arsenault
1838bf2925 Support REG_SEQUENCE in tablegen.
The problem is mostly that variadic output instructions
aren't handled, so it is rejected for having an inconsistent
number of operands, and then the right number of operands
isn't emitted.

llvm-svn: 221117
2014-11-02 23:46:51 +00:00
Matt Arsenault
05da066691 Fix typo
llvm-svn: 221116
2014-11-02 23:46:47 +00:00
Matt Arsenault
97e5c314c1 Fix missing C++ mode comment
llvm-svn: 221115
2014-11-02 23:46:44 +00:00
Daniel Sanders
79790d7c5e [tablegen] Add CustomCallingConv and use it to tablegen-erate the outermost parts of the Mips O32 implementation
Summary:
CustomCallingConv is simply a CallingConv that tablegen should not generate the
implementation for. It allows regular CallingConv's to delegate to these custom
functions. This is (currently) necessary for Mips and we cannot use CCCustom
without having to adapt to the different API that CCCustom uses.

This brings us a bit closer to being able to remove
MipsCC::analyzeCallOperands and MipsCC::analyzeFormalArguments in favour of
the common implementation.

No functional change to the targets.

Depends on D3341

Reviewers: vmedic

Reviewed By: vmedic

Subscribers: vmedic, llvm-commits

Differential Revision: http://reviews.llvm.org/D5965

llvm-svn: 221052
2014-11-01 17:38:22 +00:00
Robert Khasanov
2ca56ad410 [AVX512] Extended avx512_sqrt_packed (sqrt instructions) to VL subset.
Refactored through AVX512_maskable

llvm-svn: 220806
2014-10-28 18:15:20 +00:00
Robert Khasanov
c3a1b1b644 Moved IIT_V64 out of the common values section.
Thanks to Juergen Ributzka for noticing.

llvm-svn: 220224
2014-10-20 19:25:05 +00:00
Robert Khasanov
625ba0e53e [AVX512] Extended avx512_binop_rm for AVX512VL subsets.
Added avx512_binop_rm_vl multiclass for VL subset
Added encoding tests

llvm-svn: 219390
2014-10-09 08:38:48 +00:00
Adam Nemet
80cc28c134 [AVX512] Support mask register in MRMDestReg format
This is necessary for masking vextract*x4.

llvm-svn: 219359
2014-10-08 23:25:29 +00:00
Bob Wilson
84f3fb3ea5 PR21101: tablegen's FastISel emitter should filter out unused functions.
FastISel has a fixed set of virtual functions that are overridden by the
tablegen-generated code for each target. These functions are distinguished by
the kinds of operands, e.g., register + immediate = "ri". The FastISel emitter
has been blindly emitting functions with different combinations of operand
kinds, even for combinations that are completely unused by FastISel, e.g.,
"fastEmit_rrr". Change to filter out functions that will be irrelevant for
FastISel and do not bother generating the code for them. Also add explicit
"override" keywords for the virtual functions that are overridden.

llvm-svn: 218838
2014-10-01 22:44:01 +00:00
Adam Nemet
e0d1a483d8 [X86 disasm tblegen backend] Clean up numPhysicalOperands asserts
No functionality change intended.

This implements Elena's idea to put the new additionalOperand outside the
switch to cover all cases
(http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140929/237763.html).

Note the only nontrivial change is in MRMSrcMemFrm.  This requires an inclusive
interval of [2, 4] because we have a prefix-dependent *optional* immediate
operand.

llvm-svn: 218790
2014-10-01 19:28:11 +00:00
Robert Khasanov
2972b6033d [AVX512] Added intrinsics for VPCMPEQB and VPCMPEQW.
Added new operand type for intrinsics (IIT_V64)

llvm-svn: 218668
2014-09-30 11:32:22 +00:00
Craig Topper
456f534a1e Reduce code duplication a bit.
llvm-svn: 218563
2014-09-27 05:26:42 +00:00
Craig Topper
94e5f75223 Fix TableGen -gen-disassembler output for bit fields with an offset.
This fixes bit assignments like this
Inst{7-0} = Foo{9-2}

Patch by Steve King.

llvm-svn: 218560
2014-09-27 04:38:02 +00:00
Daniel Sanders
c3ccff7583 [mips] Add CCValAssign::[ASZ]ExtUpper and CCPromoteToUpperBitsInType and handle structs correctly on big-endian N32/N64 return values.
Summary:
The N32/N64 ABI's require that structs passed in registers are laid out
such that spilling the register with 'sd' places the struct at the lowest
address. For little endian this is trivial but for big-endian it requires
that structs are shifted into the upper bits of the register.

We also require that structs passed in registers have the 'inreg'
attribute for big-endian N32/N64 to work correctly. This is because the
tablegen-erated calling convention implementation only has access to the
lowered form of struct arguments (one or more integers of up to 64-bits
each) and is unable to determine the original type.

Reviewers: vmedic

Reviewed By: vmedic

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D5286

llvm-svn: 218451
2014-09-25 12:15:05 +00:00
Chandler Carruth
5b09348e8e [x86] Fix a pretty horrible bug and inconsistency in the x86 asm
parsing (and latent bug in the instruction definitions).

This is effectively a revert of r136287 which tried to address
a specific and narrow case of immediate operands failing to be accepted
by x86 instructions with a pretty heavy hammer: it introduced a new kind
of operand that behaved differently. All of that is removed with this
commit, but the test cases are both preserved and enhanced.

The core problem that r136287 and this commit are trying to handle is
that gas accepts both of the following instructions:

  insertps $192, %xmm0, %xmm1
  insertps $-64, %xmm0, %xmm1

These will encode to the same byte sequence, with the immediate
occupying an 8-bit entry. The first form was fixed by r136287 but that
broke the prior handling of the second form! =[ Ironically, we would
still emit the second form in some cases and then be unable to
re-assemble the output.

The reason why the first instruction failed to be handled is because
prior to r136287 the operands were marked 'i32i8imm' which forces them to
be sign-extendable. Clearly, that won't work for 192 in a single byte.
However, making them zero-extended or "unsigned" doesn't really address
the core issue either because it breaks negative immediates. The correct
fix is to make these operands 'i8imm' reflecting that they can be either
signed or unsigned but must be 8-bit immediates. This patch backs out
r136287 and then changes those places as well as some others to use
'i8imm' rather than one of the extended variants.

Naturally, this broke something else. The custom DAG nodes had to be
updated to have a much more accurate type constraint of an i8 node, and
a bunch of Pat immediates needed to be specified as i8 values.

The fallout didn't end there though. We also then ceased to be able to
match the instruction-specific intrinsics to the instructions so
modified. Digging, this is because they too used i32 rather than i8 in
their signature. So I've also switched those intrinsics to i8 arguments
in line with the instructions.

In order to make the intrinsic adjustments of course, I also had to add
auto upgrading for the intrinsics.

I suspect that the intrinsic argument types may have led everything down
this rabbit hole. Pretty happy with the result.

llvm-svn: 217310
2014-09-06 10:00:01 +00:00
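The arithmetic behind accepting both spellings, as a quick standalone check: truncated to the 8-bit immediate field, 192 and -64 are the same bit pattern.

  #include <cstdint>

  static_assert(static_cast<uint8_t>(192) == static_cast<uint8_t>(-64),
                "insertps $192 and $-64 encode to the same immediate byte");
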
Craig Topper
a06b70e729 Use vector constructor instead of a for loop to initialize entries.
llvm-svn: 217123
2014-09-04 04:49:03 +00:00
Juergen Ributzka
76dd2e3da7 [FastISel][tblgen] Rename tblgen generated FastISel functions. NFC.
This is the final round of renaming. This changes tblgen to emit lower-case
function names for FastEmitInst_* and FastEmit_*, and updates all its uses
in the source code.

Reviewed by Eric

llvm-svn: 217075
2014-09-03 20:56:59 +00:00
Yaron Keren
4785f27b1b Fix ambiguous call to make_unique and clang-format.
llvm-svn: 217023
2014-09-03 08:22:30 +00:00
Craig Topper
bb86d8c127 Recommit "Use unique_ptr to manager FilterChooser ownership."
Just using insert of a pair this time instead of emplace.

llvm-svn: 217018
2014-09-03 06:07:54 +00:00
Craig Topper
968b75c827 Revert "Use unique_ptr to manager FilterChooser ownership."
std::map::emplace isn't working on some of the bots.

llvm-svn: 217015
2014-09-03 05:59:23 +00:00
Craig Topper
59c4b332bd Use unique_ptr to manager FilterChooser ownership.
llvm-svn: 217014
2014-09-03 05:49:09 +00:00
Craig Topper
3092ef0c60 Implement move constructor and remove copy constructor for Filter objects in FixedLenDecoderEmitter. Also remove unused copy constructor of FilterChooser.
llvm-svn: 217013
2014-09-03 05:49:07 +00:00
Pete Cooper
b6145d2e7a Tablegen scheduling models don't reference empty itineraries as of r216919, so don't emit the unused itinerary variables
llvm-svn: 216993
2014-09-02 23:23:34 +00:00
Eric Christopher
2f6f860aaa Reinstate "Nuke the old JIT."
Approved by Jim Grosbach, Lang Hames, Rafael Espindola.

This reinstates commits r215111, 215115, 215116, 215117, 215136.

llvm-svn: 216982
2014-09-02 22:28:02 +00:00
Pete Cooper
92fc86558d Change MCSchedModel to be a struct of statically initialized data.
This removes static initializers from the backends which generate this data, and also makes this struct match the other Tablegen generated structs in behaviour

Reviewed by Andy Trick and Chandler C

llvm-svn: 216919
2014-09-02 17:43:54 +00:00
Robin Morisset
e583310c3b Fix typos in comments, NFC
Summary: Just fixing comments, no functional change.

Test Plan: N/A

Reviewers: jfb

Subscribers: mcrosier, llvm-commits

Differential Revision: http://reviews.llvm.org/D5130

llvm-svn: 216784
2014-08-29 21:53:01 +00:00
Craig Topper
43cee2f5fc Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created.
llvm-svn: 216525
2014-08-27 05:25:25 +00:00
Craig Topper
d133367f2d Fix some cases were ArrayRefs were being passed by reference. Also remove 'const' from some other ArrayRef uses since its implicitly const already.
llvm-svn: 216524
2014-08-27 05:25:00 +00:00
Robert Khasanov
4316b2ca5f [SKX] avx512_icmp_packed multiclass extension
Extended avx512_icmp_packed multiclass by masking versions.
Added avx512_icmp_packed_rmb multiclass for embedded broadcast versions.
Added corresponding _vl multiclasses.
Added encoding tests for CPCMP{EQ|GT}* instructions.
Add more fields for X86VectorVTInfo.
Added AVX512VLVectorVTInfo that includes X86VectorVTInfo for 512/256/128-bit versions

Differential Revision: http://reviews.llvm.org/D5024

llvm-svn: 216383
2014-08-25 14:49:34 +00:00
Craig Topper
c2e0ae6754 Use range based for loops to avoid needing to re-mention SmallPtrSet size.
llvm-svn: 216351
2014-08-24 23:23:06 +00:00
Dylan Noblesmith
75d9deb1b7 TableGen: unique_ptr-ify RecordKeeper
llvm-svn: 216350
2014-08-24 19:10:57 +00:00
Dylan Noblesmith
1e8ed68e13 TableGen: use auto and for-range
llvm-svn: 216348
2014-08-24 19:10:49 +00:00
Craig Topper
65775cc03d Repace SmallPtrSet with SmallPtrSetImpl in function arguments to avoid needing to mention the size.
llvm-svn: 216158
2014-08-21 05:55:13 +00:00
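The shape of the change, with an invented element type: callees accept the size-erased SmallPtrSetImpl, so only the caller that owns the set chooses the inline capacity.

  #include "llvm/ADT/SmallPtrSet.h"

  struct Record { int Id; };  // hypothetical element type

  void collect(llvm::SmallPtrSetImpl<Record *> &Seen, Record *R) {
    Seen.insert(R);  // signature no longer mentions a size
  }

  void caller(Record *R) {
    llvm::SmallPtrSet<Record *, 16> Seen;  // capacity chosen only here
    collect(Seen, R);
  }
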
Quentin Colombet
1849edbdf6 Add isInsertSubreg property.
This patch adds a new property: isInsertSubreg and the related target hooks:
TargetIntrInfo::getInsertSubregInputs and
TargetInstrInfo::getInsertSubregLikeInputs to specify that a target specific
instruction is a (kind of) INSERT_SUBREG.

The approach is similar to r215394.

<rdar://problem/12702965>

llvm-svn: 216139
2014-08-20 23:49:36 +00:00
Quentin Colombet
b02f26b10f Add isExtractSubreg property.
This patch adds a new property: isExtractSubreg and the related target hooks:
TargetIntrInfo::getExtractSubregInputs and
TargetInstrInfo::getExtractSubregLikeInputs to specify that a target specific
instruction is a (kind of) EXTRACT_SUBREG.

The approach is similar to r215394.

<rdar://problem/12702965>

llvm-svn: 216130
2014-08-20 21:51:26 +00:00
Tim Northover
9127b613b1 TableGen: allow use of uint64_t for available features mask.
ARM in particular is getting dangerously close to exceeding 32 bits worth of
possible subtarget features. When this happens, various parts of MC start to
fail inexplicably as masks get truncated to "unsigned".

Mostly just refactoring at present, and there's probably no way to test.

llvm-svn: 215887
2014-08-18 11:49:42 +00:00
Craig Topper
aa7422b5a6 Revert "Repace SmallPtrSet with SmallPtrSetImpl in function arguments to avoid needing to mention the size."
Getting a weird buildbot failure that I need to investigate.

llvm-svn: 215870
2014-08-18 00:24:38 +00:00
Craig Topper
227456e133 Repace SmallPtrSet with SmallPtrSetImpl in function arguments to avoid needing to mention the size.
llvm-svn: 215868
2014-08-17 23:47:00 +00:00
Benjamin Kramer
da144ed5a2 Canonicalize header guards into a common format.
Add header guards to files that were missing guards. Remove #endif comments
as they don't seem common in LLVM (we can easily add them back if we decide
they're useful)

Changes made by clang-tidy with minor tweaks.

llvm-svn: 215558
2014-08-13 16:26:38 +00:00
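The canonicalized guard shape, sketched for a hypothetical header lib/Foo/Bar.h (the path-derived macro name is the usual LLVM convention, not quoted from the patch); the #endif carries no trailing comment, per the cleanup above.

  #ifndef LLVM_LIB_FOO_BAR_H
  #define LLVM_LIB_FOO_BAR_H

  // declarations ...

  #endif
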
Quentin Colombet
022fe32e53 Add isRegSequence property.
This patch adds a new property: isRegSequence and the related target hooks: 
TargetIntrInfo::getRegSequenceInputs and 
TargetInstrInfo::getRegSequenceLikeInputs to specify that a target specific
instruction is a (kind of) REG_SEQUENCE.

<rdar://problem/12702965>

llvm-svn: 215394
2014-08-11 22:17:14 +00:00
Eric Christopher
378bc328f0 Temporarily Revert "Nuke the old JIT." as it's not quite ready to
be deleted. This will be reapplied as soon as possible and before
the 3.6 branch date at any rate.

Approved by Jim Grosbach, Lang Hames, Rafael Espindola.

This reverts commits r215111, 215115, 215116, 215117, 215136.

llvm-svn: 215154
2014-08-07 22:02:54 +00:00
Rafael Espindola
e9ebbe5559 Nuke the old JIT.
I am sure we will be finding bits and pieces of dead code for years to
come, but this is a good start.

Thanks to Lang Hames for making MCJIT a good replacement!

llvm-svn: 215111
2014-08-07 14:21:18 +00:00
Pete Cooper
8cac65e882 Change BitsInit to inherit from TypedInit.
This is useful in a later patch where binary literals such as 0b000 will become BitsInit values instead of IntInit values.

llvm-svn: 215085
2014-08-07 05:47:04 +00:00
Eric Christopher
4a1cdb2ba7 Remove the target machine from CCState. Previously it was only used
to get the subtarget and that's accessible from the MachineFunction
now. This helps clear the way for smaller changes where we getting
a subtarget will require passing in a MachineFunction/Function as
well.

llvm-svn: 214988
2014-08-06 18:45:26 +00:00
Eric Christopher
99307e99a2 Remove the TargetMachine forwards for TargetSubtargetInfo based
information and update all callers. No functional change.

llvm-svn: 214781
2014-08-04 21:25:23 +00:00
Robert Khasanov
35dfdfef2d [SKX] Enabling load/store instructions: encoding
Instructions: VMOVAPD, VMOVAPS, VMOVDQA8, VMOVDQA16, VMOVDQA32,VMOVDQA64, VMOVDQU8, VMOVDQU16, VMOVDQU32,VMOVDQU64, VMOVUPD, VMOVUPS,

Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com>

llvm-svn: 214719
2014-08-04 14:35:15 +00:00
Matt Arsenault
5b572cbd87 Make getNamedOperandIdx readonly
llvm-svn: 214524
2014-08-01 17:00:27 +00:00
Tom Stellard
d0909fe830 TableGen: Allow AddedComplexity values to be negative
This is useful for cases when stand-alone patterns are preferred to the
patterns included in the instruction definitions.  Instead of requiring
that stand-alone patterns set a larger AddedComplexity value, which
can be confusing to new developers, this allows us to reduce the
complexity of the included patterns to achieve the same result.

There will be test cases for this added to the R600 backend in a
future commit.

llvm-svn: 214466
2014-08-01 00:32:36 +00:00
Kevin Enderby
0615385ba4 Add support for the X86 secure guard extensions instructions in assembler (SGX).
This allows assembling the two new instructions, encls and enclu for the
SKX processor model.

Note the diffs are a bit bigger than one might think, but to fit the new
MRM_CF and MRM_D7 entries in the right places, things had to be
renumbered and shuffled down, causing a bit more diffs.

rdar://16228228

llvm-svn: 214460
2014-07-31 23:57:38 +00:00
Robert Khasanov
d86d770d47 [SKX] Enabling mask logic instructions: encoding, lowering
Instructions: KAND{BWDQ}, KANDN{BWDQ}, KOR{BWDQ}, KXOR{BWDQ}, KXNOR{BWDQ}

Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com>

llvm-svn: 214081
2014-07-28 13:46:45 +00:00
Akira Hatanaka
e9a7fadd46 [stack protector] Fix a potential security bug in stack protector where the
address of the stack guard was being spilled to the stack.

Previously the address of the stack guard would get spilled to the stack if it
was impossible to keep it in a register. This patch introduces a new target
independent node and pseudo instruction which gets expanded post-RA to a
sequence of instructions that load the stack guard value. Register allocator
can now just remat the value when it can't keep it in a register. 

<rdar://problem/12475629>

llvm-svn: 213967
2014-07-25 19:31:34 +00:00
Robert Khasanov
cfc9aa43e1 [SKX] Enabling mask instructions: encoding, lowering
KMOVB, KMOVW, KMOVD, KMOVQ, KNOTB, KNOTW, KNOTD, KNOTQ

Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com>

llvm-svn: 213757
2014-07-23 14:49:42 +00:00
Richard Smith
f8a40b80fc Revert of r213521. This change introduced a non-hermetic test (depending on a
file not in the test/ area). Backing out now so that this test isn't part of
the 3.5 branch.

Original commit message: "TableGen: Allow AddedComplexity values to be negative
[...]"

llvm-svn: 213596
2014-07-22 02:32:12 +00:00
Robert Khasanov
ae2da173af [SKX] Enabling SKX target and AVX512BW, AVX512DQ, AVX512VL features.
Enabling HasAVX512{DQ,BW,VL} predicates.
Adding VK2, VK4, VK32, VK64 masked register classes.
Adding new types (v64i8, v32i16) to VR512.
Extending calling conventions for new types (v64i8, v32i16)

Patch by Zinovy Nis <zinovy.y.nis@intel.com>
Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com>

llvm-svn: 213545
2014-07-21 14:54:21 +00:00
Tom Stellard
c386c1b7f3 TableGen: Allow AddedComplexity values to be negative
This is useful for cases when stand-alone patterns are preferred to the
patterns included in the instruction definitions.  Instead of requiring
that stand-alone patterns set a larger AddedComplexity value, which
can be confusing to new developers, this allows us to reduce the
complexity of the included patterns to achieve the same result.

llvm-svn: 213521
2014-07-21 13:28:54 +00:00
David Blaikie
939901ec68 Remove uses of the redundant ".reset(nullptr)" of unique_ptr, in favor of ".reset()"
It's also possible to just write "= nullptr", but there's some question
of whether that's as readable, so I leave it up to authors to pick which
they prefer for now. If we want to discuss standardizing on one or the
other, we can do that at some point in the future.

llvm-svn: 213438
2014-07-19 01:05:11 +00:00
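The equivalent spellings being discussed, in a tiny standalone form:

  #include <memory>

  void drop(std::unique_ptr<int> &P) {
    // P.reset(nullptr);  // the redundant form this commit removes
    P.reset();            // destroys the owned object, if any
    // P = nullptr;       // also equivalent; left to author preference
  }
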
Reid Kleckner
a0033713ef TableGen: Add 'static' to a large array to avoid a huge stack allocation
Speculative fix for a -Wframe-larger-than warning from gcc.  Clang will
implicitly promote such constant arrays to globals, so in theory it
won't hit this.

llvm-svn: 213298
2014-07-17 19:43:40 +00:00
Adam Nemet
09fcf8939c [X86] AVX512: Add disassembler support for compressed displacement
There are two parts here.  First is to modify tablegen to adjust the encoding
type ENCODING_RM with the scaling factor.

The second is to use the new encoding types to compute the correct
displacement in the decoder.

Fixes <rdar://problem/17608489>

llvm-svn: 213281
2014-07-17 17:04:56 +00:00
Justin Holewinski
a1eab159d8 [TABLEGEN] Do not crash on intrinsics with names longer than 40 characters
Differential Revision: http://reviews.llvm.org/D4537

llvm-svn: 213253
2014-07-17 11:23:29 +00:00
Sanjay Patel
2f0f025b2b Move Post RA Scheduling flag bit into SchedMachineModel
Refactoring; no functional changes intended

    Removed PostRAScheduler bits from subtargets (X86, ARM).
    Added PostRAScheduler bit to MCSchedModel class.
    This bit is set by a CPU's scheduling model (if it exists).
    Removed enablePostRAScheduler() function from TargetSubtargetInfo and subclasses.
    Fixed the existing enablePostMachineScheduler() method to use the MCSchedModel (was just returning false!).
    Added methods to TargetSubtargetInfo to allow overrides for AntiDepBreakMode, CriticalPathRCs, and OptLevel for PostRAScheduling.
    Added enablePostRAScheduler() function to PostRAScheduler class which queries the subtarget for the above values.
    Preserved existing scheduler behavior for ARM, MIPS, PPC, and X86: 
       a. ARM overrides the CPU's postRA settings by enabling postRA for any non-Thumb or Thumb2 subtarget. 
       b. MIPS overrides the CPU's postRA settings by enabling postRA for everything. 
       c. PPC overrides the CPU's postRA settings by enabling postRA for everything. 
       d. X86 is the only target that actually has postRA specified via sched model info.

Differential Revision: http://reviews.llvm.org/D4217

llvm-svn: 213101
2014-07-15 22:39:58 +00:00
Reid Kleckner
baa1dcd25b Option: Propagate flags from groups to options in each group
This should make it easy to set a flag for a whole group of clang driver
options.

llvm-svn: 212865
2014-07-12 00:18:58 +00:00
Jim Grosbach
54e0e56d3f Change an assert() to a diagnostic.
llvm-svn: 212637
2014-07-09 18:55:49 +00:00
Alp Toker
0ca11b2493 SourceMgr: make valid buffer IDs start from one
Use 0 for the invalid buffer instead of -1/~0 and switch to unsigned
representation to enable more idiomatic usage.

Also introduce a trivial SourceMgr::getMainFileID() instead of hard-coding 0/1
to identify the main file.

llvm-svn: 212398
2014-07-06 10:33:31 +00:00
Saleem Abdulrasool
ffd9a72c8d TableGen: introduce support for MSBuiltin
Add MSBuiltin which is similar in vein to GCCBuiltin.  This allows for adding
intrinsics for Microsoft compatibility to individual instructions.  This is
needed to permit the creation of ARM specific MSVC extensions.

This is not currently in use, and requires an associated change in clang to
enable use of the intrinsics defined by this new class.  This merely sets the
LLVM portion of the infrastructure in place to permit the use of this
functionality.  A separate set of changes will enable the new intrinsics.

llvm-svn: 212350
2014-07-04 18:42:25 +00:00
Alp Toker
97022b0c1f Revert "Introduce a string_ostream string builder facilty"
Temporarily back out commits r211749, r211752 and r211754.

llvm-svn: 211814
2014-06-26 22:52:05 +00:00
Alp Toker
fd9ead3b6f Introduce a string_ostream string builder facilty
string_ostream is a safe and efficient string builder that combines opaque
stack storage with a built-in ostream interface.

small_string_ostream<bytes> additionally permits an explicit stack storage size
other than the default 128 bytes to be provided. Beyond that, storage is
transferred to the heap.

This convenient class can be used in most places an
std::string+raw_string_ostream pair or SmallString<>+raw_svector_ostream pair
would previously have been used, in order to guarantee consistent access
without byte truncation.

The patch also converts much of LLVM to use the new facility. These changes
include several probable bug fixes for truncated output, a programming error
that's no longer possible with the new interface.

llvm-svn: 211749
2014-06-26 00:00:48 +00:00
Alp Toker
af677c39a3 Use SourceMgr::getMemoryBuffer() in a couple of places
Cleanup only.

llvm-svn: 211656
2014-06-25 00:41:15 +00:00
Craig Topper
701fca8e8d Replace some assert(0)'s with llvm_unreachable.
llvm-svn: 211141
2014-06-18 05:05:13 +00:00
James Molloy
ed6ddd8719 Move SetTheory from utils/TableGen into lib/TableGen so Clang can use it.
llvm-svn: 211100
2014-06-17 13:10:38 +00:00
Artyom Skrobov
e445b07705 Condition codes AL and NV are invalid in the aliases that use
inverted condition codes (CINC, CINV, CNEG, CSET, and CSETM).

Matching aliases based on "immediate classes" when disassembling was not
previously supported, so this adds MCOperandPredicate to class Operand
and implements the support for it in AsmWriterEmitter.

The parsing for those aliases was already custom, so only the missing
condition is added to AArch64AsmParser::parseCondCode.
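A hedged sketch of the kind of operand definition this enables; the operand
name and predicate body are illustrative rather than copied from the AArch64
backend, and the generated AsmWriter is assumed to hand the operand to the
predicate as 'MCOp':

   // Hypothetical operand whose alias printing is gated on the MCOperand
   // value: AL (0b1110) and NV (0b1111) are rejected, so only genuinely
   // invertible condition codes match the alias.
   def example_inv_ccode : Operand<i32> {
     let PrintMethod = "printInverseCondCode";
     let MCOperandPredicate = [{
       return MCOp.isImm() && MCOp.getImm() < 0xe;
     }];
   }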

llvm-svn: 210528
2014-06-10 13:11:35 +00:00
Artyom Skrobov
295d6d771e Refactoring in AsmWriterEmitter::EmitPrintAliasInstruction()
llvm-svn: 210527
2014-06-10 12:47:23 +00:00
David Blaikie
f670b953e7 AsmMatchers: Use unique_ptr to manage ownership of MCParsedAsmOperand
I saw at least a memory leak or two from inspection (on probably
untested error paths) and r206991, which was the original inspiration
for this change.

I ran this idea by Jim Grosbach a few weeks ago & he was OK with it.
Since it's a basically mechanical patch, that seemed sufficient: the usual
post-commit review, revert, etc., as needed.

llvm-svn: 210427
2014-06-08 16:18:35 +00:00
Aaron Ballman
1a7e332e11 Teach the table-generated emitPseudoExpansionLowering function to not emit a switch statement containing only a default statement (and no cases). Updated some of the code to use range-based for loops as well. No functional changes.
llvm-svn: 209521
2014-05-23 15:33:39 +00:00
Tim Northover
4bb52c77de ARM64: separate load/store operands to simplify assembler
This changes ARM64 to use separate operands for each component of an
address, and look for separate '[', '$Rn, ..., ']' tokens when
parsing.

This allows us to do away with quite a bit of special C++ code to
handle monolithic "addressing modes" in the MC components. The more
incremental matching of the assembler operands also allows for better
diagnostics when LLVM is presented with invalid input.

Most of the complexity here is with the register-offset instructions,
which were extremely dodgy beforehand: even when the instruction used
wM, LLVM's model had xM as an operand. We papered over this
discrepancy before, but that approach doesn't work now so I split them
into separate X and W variants.
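A rough sketch of the shape this gives the instruction definitions; the record
and operand names (EXAMPLE_LDRXui, GPR64, GPR64sp, uimm12) are placeholders
rather than the real ARM64 records:

   // Base register and offset are separate operands, so the assembler sees
   // '[', '$Rn', ',', '$offset' and ']' as individual tokens instead of one
   // monolithic addressing-mode operand.
   def EXAMPLE_LDRXui : Instruction {
     let OutOperandList = (outs GPR64:$Rt);
     let InOperandList  = (ins GPR64sp:$Rn, uimm12:$offset);
     let AsmString      = "ldr $Rt, [$Rn, $offset]";
   }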

llvm-svn: 209425
2014-05-22 11:56:09 +00:00
Daniel Sanders
b0569f0a49 [asm matcher] Fix incorrect assertion when there are exactly 32 SubtargetFeatures
Summary:
The minimal type needs to hold a value of '1ULL << 31' but
getMinimalTypeForRange() is called with a value of '1ULL << 32'.

This patch will also reduce the size of the matcher table when there are 8
or 16 SubtargetFeatures.

Also added a dump of the SubtargetFeatures to the -debug output, and corrected getMinimalTypeForRange() to consider 0xffffffffull to be a 32-bit value.

The testcase is that no existing code is broken and that LLVM still successfully
compiles after adding MIPS64r6 CodeGen support.

Reviewers: rafael

Reviewed By: rafael

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D3787

llvm-svn: 209288
2014-05-21 10:11:24 +00:00
Richard Smith
cb6bc29096 [modules] Add module maps for LLVM. These are not quite ready for prime-time
yet, but only a few more Clang patches need to land. (I have 'ninja check'
passing locally.)

llvm-svn: 209269
2014-05-21 02:46:14 +00:00
Eric Christopher
262770bdee Clean up language and grammar.
Based on a patch by jfcaron3@gmail.com!
PR19806

llvm-svn: 209216
2014-05-20 17:11:11 +00:00
Tim Northover
20001e6010 TableGen: permit non-leaf ComplexPattern uses
This allows the results of a ComplexPattern check to be distributed to separate
named Operands, instead of the current system where all results must apply (and
match perfectly) with a single Operand.

For example, if "some_addrmode" is a ComplexPattern producing two results, you
can write:

   def : Pat<(load (some_addrmode GPR64:$base, imm:$offset)),
             (INST GPR64:$base, imm:$offset)>;

This should allow neater instruction definitions in TableGen that don't put all
possible aspects of addressing into a single operand, but are still usable with
relatively simple C++ CodeGen idioms.

llvm-svn: 209206
2014-05-20 11:52:46 +00:00
Tim Northover
465abfd56e TableGen: convert InstAlias's Emit bit to an int.
When multiple aliases overlap, the correct string to print can often be
determined purely by considering the InstAlias declarations in some particular
order. This allows the user to specify that order manually when desired,
without resorting to hacking around with the default lexicographical order on
Record instantiation, which is error-prone and ugly.

I was also mistaken about "add w2, w3, w4" being the same as "add w2, w3, w4,
uxtw". That's only true if Rn is the stack pointer.

llvm-svn: 209199
2014-05-20 09:17:16 +00:00
Eric Christopher
f08cdc9333 Fix typos.
llvm-svn: 209164
2014-05-19 21:18:47 +00:00
Tim Northover
31e1362588 TableGen: fix operand counting for aliases
TableGen has a fairly dubious heuristic to decide whether an alias should be
printed: does the alias have fewer operands than the real instruction? This is
bad enough (particularly with no way to override it), but it should at least be
calculated consistently for both strings.

This patch implements that logic: first get the *correct* string for the
variant, in the same way as the Matcher, without guessing; then count the
number of whitespace chars.

There are basically 4 changes this brings about after the previous
commits; all of these appear to be good, so I have changed the tests:

+ ARM64: we print "neg X, Y" instead of "sub X, xzr, Y".
+ ARM64: we skip implicit "uxtx" and "uxtw" modifiers.
+ Sparc: we print "mov A, B" instead of "or %g0, A, B".
+ Sparc: we print "fcmpX A, B" instead of "fcmpX %fcc0, A, B"

llvm-svn: 208969
2014-05-16 09:42:04 +00:00
Tim Northover
ac5dac4c75 TableGen: use correct MIOperand when printing aliases
Previously, TableGen assumed that every aliased operand consumed precisely 1
MachineInstr slot (this was reasonable because until a couple of days ago,
nothing more complicated was eligible for printing).

This allows a couple more ARM64 aliases to print so we can remove the special
code.

On the X86 side, I've gone for explicit AT&T size specifiers as the default, so
turned off a few of the aliases that would have just started printing.

llvm-svn: 208880
2014-05-15 13:36:01 +00:00
Tim Northover
4ba95d4483 TableGen/ARM64: print aliases even if they have syntax variants.
To get at least one use of the change (and some actual tests) in with its
commit, I've enabled the AArch64 & ARM64 NEON mov aliases.

llvm-svn: 208867
2014-05-15 11:16:32 +00:00
Alp Toker
18115693f7 Fix typos
llvm-svn: 208839
2014-05-15 01:52:21 +00:00
Aaron Ballman
a4fce68290 Teach the table generator to not generate switch statements containing only a default label with no cases. This solves some warnings with MSVC.
No functional changes intended.

llvm-svn: 208694
2014-05-13 12:52:35 +00:00
Tim Northover
d58954eabf TableGen: strengthen assert
llvm-svn: 208679
2014-05-13 09:37:41 +00:00
Jay Foad
b23e7b4f97 Fix gcc -Wparentheses warning.
llvm-svn: 208675
2014-05-13 08:26:53 +00:00
Tim Northover
3c2cc7a397 TableGen: use PrintMethods to print more aliases
llvm-svn: 208607
2014-05-12 18:04:06 +00:00
Hal Finkel
c52e65b830 Move late partial-unrolling thresholds into the processor definitions
The old method used by X86TTI to determine partial-unrolling thresholds was
messy (because it worked by testing target features), and also would not
correctly identify the target CPU if certain target features were disabled.
After some discussions on IRC with Chandler et al., it was decided that the
processor scheduling models were the right containers for this information
(because it is often tied to special uop dispatch-buffer sizes).

This does represent a small functionality change:
 - For generic x86-64 (which uses the SB model and, thus, will get some
   unrolling).
 - For AMD cores (because they still currently use the SB scheduling model).
 - For Haswell (based on benchmarking by Louis Gerbarg, it was decided to bump
   the default threshold to 50; we're working on a test case for this).
Otherwise, nothing has changed for any other targets. The logic, however, has
been moved into BasicTTI, so other targets may now also opt-in to this
functionality simply by setting LoopMicroOpBufferSize in their processor
model definitions.
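For example, a processor definition might opt in along these lines; the model
name and numbers are illustrative rather than the values used for any real CPU:

   // A non-zero LoopMicroOpBufferSize (new in this change) is what makes
   // BasicTTI enable partial/runtime unrolling for CPUs bound to this model;
   // the default of 0 preserves the old behavior.
   def ExampleCoreModel : SchedMachineModel {
     let MicroOpBufferSize     = 168;
     let LoopMicroOpBufferSize = 50;
   }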

llvm-svn: 208289
2014-05-08 09:14:44 +00:00
Eric Christopher
423c1a5415 Debug.h already includes raw_ostream.h, no need to include it again.
llvm-svn: 208235
2014-05-07 18:19:04 +00:00
Eric Christopher
a4b177382a ArrayRef-ize the Feature and Processor tables for SubtargetFeatures.
This removes arguments passed everywhere and allows the use of
standard iteration over lists.
Should be no functional change.

llvm-svn: 208127
2014-05-06 20:23:04 +00:00
Tim Northover
c3dfe08427 AArch64/ARM64: implement diagnosis of unpredictable loads & stores
llvm-svn: 208091
2014-05-06 14:15:14 +00:00
Craig Topper
c6ae7d76d7 [C++11] Use 'nullptr' in tablegen output files.
llvm-svn: 207611
2014-04-30 05:53:35 +00:00
Benjamin Kramer
4f8fb8ff6c raw_ostream: Forward declare OpenFlags and include FileSystem.h only where necessary.
llvm-svn: 207593
2014-04-29 23:26:49 +00:00
Chandler Carruth
c38a2413fa [Modules] Fix potential ODR violations by sinking the DEBUG_TYPE
definition below all of the header #include lines, TableGen edition.

llvm-svn: 206846
2014-04-22 03:06:00 +00:00
David Blaikie
80008be7d5 Simplify DFAPacketizerEmitter State copy/move semantics to use compiler defaults.
llvm-svn: 206824
2014-04-21 22:58:00 +00:00
Chandler Carruth
15c7b91ac2 [Modules] Make Support/Debug.h modular. This requires it to not change
behavior based on other files defining DEBUG_TYPE, which means it cannot
define DEBUG_TYPE at all. This is actually better IMO as it forces folks
to define relevant DEBUG_TYPEs for their files. However, it requires all
files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't
already. I've updated all such files in LLVM and will do the same for
other upstream projects.

This still leaves one important change in how LLVM uses the DEBUG_TYPE
macro going forward: we need to only define the macro *after* header
files have been #include-ed. Previously, this wasn't possible because
Debug.h required the macro to be pre-defined. This commit removes that.
By defining DEBUG_TYPE after the includes two things are fixed:

- Header files that need to provide a DEBUG_TYPE for some inline code
  can do so by defining the macro before their inline code and undef-ing
  it afterward so the macro does not escape.

- We no longer have rampant ODR violations due to including headers with
  different DEBUG_TYPE definitions. This may be mostly an academic
  violation today, but with modules these types of violations are easy
  to check for and potentially very relevant.

Where necessary to support headers with DEBUG_TYPE, I have moved the
definitions below the includes in this commit. I plan to move the rest
of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big
enough.

The comments in Debug.h, which were hilariously out of date already,
have been updated to reflect the recommended practice going forward.

llvm-svn: 206822
2014-04-21 22:55:11 +00:00