llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 22:12:57 +02:00

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	ca12aec0a1	AsmPrinter: Document why DIEValueList uses a linked-list, NFC There are two main reasons why a linked-list makes sense for `DIEValueList`. 1. We want `DIE` to be on a `BumpPtrAllocator` to improve teardown efficiency. Making `DIEValueList` array-based would make that much more complicated. 2. The singly-linked list is fairly memory efficient. The histogram [1] shows that most DIEs have relatively few values, so we often pay less than the 2/3-pointer static overhead of a vector. Furthermore, we don't know ahead of time exactly how many values a `DIE` needs, so a vector-like scheme will on average over-allocate by ~50%. As it happens, that's the same memory overhead as the linked list node. [1]: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-May/085910.html The comment I added to the code is a little more succinct, but I think it's enough to give the idea. llvm-svn: 240868	2015-06-27 01:19:17 +00:00
Duncan P. N. Exon Smith	37e178924a	CodeGen: Push the ModuleSlotTracker through MachineOperands Push `ModuleSlotTracker` through `MachineOperand`s, dropping the time for `llc -print-machineinstrs` on the testcase in PR23865 from ~13 seconds to ~9 seconds. Now `SlotTracker::processFunctionMetadata()` accounts for only 8% of the runtime, which seems reasonable. llvm-svn: 240845	2015-06-26 22:06:47 +00:00
Duncan P. N. Exon Smith	913c983bf7	CodeGen: Use a single SlotTracker in MachineFunction::print() Expose enough of the IR-level `SlotTracker` so that `MachineFunction::print()` can use a single one for printing `BasicBlock`s. Next step would be to lift this through a few more APIs so that we can make other print methods faster. Fixes PR23865, changing the runtime of `llc -print-machineinstrs` from many minutes (killed after 3 minutes, but it wasn't very close) to 13 seconds for a 502185 line dump. llvm-svn: 240842	2015-06-26 22:04:20 +00:00
Pete Cooper	c7592843bf	Convert a bunch of loops to foreach. NFC. This uses the new SDNode::op_values() iterator range committed in r240805. llvm-svn: 240817	2015-06-26 19:18:49 +00:00
Pete Cooper	f9eedfbd62	Add op_values() to iterate over the SDValue operands of an SDNode. SDNode already had ops() which would iterate over the operands and return SDUse*. This version instead gets the SDValue's out of the SDUse's so that we can use foreach in more places. Reviewed by David Blaikie. llvm-svn: 240805	2015-06-26 18:17:36 +00:00
Alex Lorenz	a7e76895fa	MIR Serialization: Serialize machine basic block operands. This commit serializes machine basic block operands. The machine basic block operands use the following syntax: %bb.<id>[.<name>] This commit also modifies the YAML representation for the machine basic blocks - a new, required field 'id' is added to the MBB YAML mapping. The id is used to resolve the MBB references to the actual MBBs. And while the name of the MBB can be included in a MBB reference, this name isn't used to resolve MBB references - as it's possible that multiple MBBs will reference the same BB and thus they will have the same name. If the name is specified, the parser will verify that it is equal to the name of the MBB with the specified id. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10608 llvm-svn: 240792	2015-06-26 16:46:11 +00:00
Hao Liu	00aff8cc17	[InterleavedAccess] Add a pass InterleavedAccess to identify interleaved memory accesses and transform into target specific intrinsics. E.g. An interleaved load (Factor = 2): %wide.vec = load <8 x i32>, <8 x i32>* %ptr %v0 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <0, 2, 4, 6> %v1 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <1, 3, 5, 7> It can be transformed into a ld2 intrinsic in AArch64 backend or a vld2 intrinsic in ARM backend. E.g. An interleaved store (Factor = 3): %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr It can be transformed into a st3 intrinsic in AArch64 backend or a vst3 intrinsic in ARM backend. Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240751	2015-06-26 02:10:27 +00:00
Duncan P. N. Exon Smith	f3712066fc	AsmPrinter: More explicitly scope iterator for MSVC r240748 seems to be on the right path. Be more explicit. http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/1961/ llvm-svn: 240750	2015-06-26 00:53:44 +00:00
Duncan P. N. Exon Smith	6f5f78b174	AsmPrinter: Explicitly scope iterator for MSVC Try to placate bots by explicitly scoping a conversion constructor from `iterator` to `const_iterator`. http://lab.llvm.org:8011/builders/sanitizer-windows/builds/5931/ llvm-svn: 240748	2015-06-26 00:41:53 +00:00
Duncan P. N. Exon Smith	b8e0101e11	AsmPrinter: Use an intrusively linked list for DIE::Children Replace the `std::vector<>` for `DIE::Children` with an intrusively linked list. This is a strict memory improvement: it requires no auxiliary storage, and reduces `sizeof(DIE)` by one pointer. It also factors out the DIE-related malloc traffic. This drops llc memory usage from 735 MB down to 718 MB, or ~2.3%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240736	2015-06-25 23:52:10 +00:00
Duncan P. N. Exon Smith	249d680189	AsmPrinter: Convert DIE::Values to a linked list Change `DIE::Values` to a singly linked list, where each node is allocated on a `BumpPtrAllocator`. In order to support `push_back()`, the list is circular, and points at the tail element instead of the head. I abstracted the core list logic out to `IntrusiveBackList` so that it can be reused for `DIE::Children`, which also cares about `push_back()`. This drops llc memory usage from 799 MB down to 735 MB, about 8%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240733	2015-06-25 23:46:41 +00:00
Alex Lorenz	e3d2250295	MIR Serialization: Serialize simple MachineRegisterInfo attributes. This commit serializes the 3 scalar boolean attributes from the MachineRegisterInfo class: IsSSA, TracksRegLiveness, and TracksSubRegLiveness. These attributes are serialized as part of the machine function YAML mapping. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10618 llvm-svn: 240579	2015-06-24 19:56:10 +00:00
Duncan P. N. Exon Smith	1cdb57d681	AsmPrinter: Cleanup DIEValue::EmitValue() API, NFC Stop taking a `dwarf::Form` in `DIEValue::EmitValue()` and `DIEValue::SizeOf()`, since they're always passed `DIEValue::getForm()` anyway. This is just left over from when `DIEValue` didn't know its own form. llvm-svn: 240566	2015-06-24 18:48:11 +00:00
Alex Lorenz	bca9f0ccb7	MIR Parser: Use correct source locations for machine instruction diagnostics. This commit translates the source locations for MIParser diagnostics from the locations in the machine instruction string to the locations in the MIR file. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10574 llvm-svn: 240474	2015-06-23 22:39:23 +00:00
Sanjoy Das	ab2d156691	Revert "[FaultMaps] Move FaultMapParser to Object/" This reverts commit r240364 (git c49542e5bb186). The issue r240364 was trying to fix was fixed independently in r240362. llvm-svn: 240448	2015-06-23 20:09:03 +00:00
Benjamin Kramer	7873dd07ed	[MachineBasicBlock] Add getFirstNonDebugInstr to complement getLastNonDebugInstr Use it in CodeGen where applicable. No functionality change intended. llvm-svn: 240414	2015-06-23 14:47:29 +00:00
Benjamin Kramer	7b31b9af6e	[MachineBasicBlock] Use the const_cast(this) trick to reduce duplication NFC. llvm-svn: 240413	2015-06-23 14:47:18 +00:00
Rafael Espindola	5613e38017	Use MCSymbols for FastISel. The summary is that it moves the mangling earlier and replaces a few calls to .addExternalSymbol with addSym. I originally wanted to replace all the uses of addExternalSymbol with addSym, but noticed it was a lot of work and doesn't need to be done all at once. llvm-svn: 240395	2015-06-23 12:21:54 +00:00
Alexander Kornienko	f993659b8f	Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC) Apparently, the style needs to be agreed upon first. llvm-svn: 240390	2015-06-23 09:49:53 +00:00
Sanjoy Das	2f4fb64dc2	[FaultMaps] Move FaultMapParser to Object/ Summary: That way llvm-objdump can rely on it without adding an extra dependency on CodeGen. This change duplicates the FaultKind enum and the code that serializes it to a string. I could not figure out a way to get around this without adding a new dependency to Object Reviewers: rafael, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10619 llvm-svn: 240364	2015-06-23 01:05:26 +00:00
Sanjoy Das	0e6baf1df9	Fix a typo in a comment in FaultMaps.h. NFC. llvm-svn: 240363	2015-06-23 01:05:21 +00:00
Krzysztof Parzyszek	1ffe9ef119	Remove "const" from the MachineFunction reference in VLIWPacketizerList llvm-svn: 240317	2015-06-22 18:59:44 +00:00
Sanjoy Das	ce613c8a7a	Fix MSVC build (again!). Remove two `typename`s that I should have removed in r240307 but left in by mistake. llvm-svn: 240309	2015-06-22 18:24:50 +00:00
Sanjoy Das	bea61317e9	[FaultMaps] Add a parser for the __llvm__faultmaps section. Summary: The parser is exercised by llvm-objdump using -print-fault-maps. As is probably obvious, the code itself was "heavily inspired" by http://reviews.llvm.org/D10434. Reviewers: reames, atrick, JosephTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10491 llvm-svn: 240304	2015-06-22 18:03:02 +00:00
Rafael Espindola	49ad24ce06	Avoid a Symbol -> Name -> Symbol conversion. Before this we were producing a TargetExternalSymbol from a MCSymbol. That meant extracting the symbol name and fetching the symbol again down the pipeline. This patch adds a DAG.getMCSymbol that lets the MCSymbol pass unchanged on the DAG. Doing so removes the need for MO_NOPREFIX and fixes the root cause of pr23900, allowing r240130 to be committed again. llvm-svn: 240300	2015-06-22 17:46:53 +00:00
Alex Lorenz	71956fb32c	MIR Serialization: Serialize machine instruction names. This commit implements initial machine instruction serialization. It serializes machine instruction names. The instructions are represented using a YAML sequence of string literals and are a part of machine basic block YAML mapping. This commit introduces a class called 'MIParser' which will be used to parse the machine instructions and operands. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10481 llvm-svn: 240295	2015-06-22 17:02:30 +00:00
Alex Lorenz	98562075f0	MIR Serialization: Serialize the list of machine basic blocks with simple attributes. This commit implements the initial serialization of machine basic blocks in a machine function. Only the simple, scalar MBB attributes are serialized. The reference to LLVM IR's basic block is preserved when that basic block has a name. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10465 llvm-svn: 240145	2015-06-19 17:43:07 +00:00
Alex Lorenz	42042953be	MIR Serialization: use default member initializers to initialize yaml::MachineFunction. NFC. Default member initializers are permitted since r236244. llvm-svn: 240142	2015-06-19 17:36:02 +00:00
Alexander Kornienko	40cb19d802	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ llvm/lib/ Thanks to Eugene Kosov for the original patch! llvm-svn: 240137	2015-06-19 15:57:42 +00:00
Alex Lorenz	d4f9106453	MIR Serialization: initialize the fields without the default initializers in yaml::MachineFunction My commit r239790 which introduced serialization for simple machine function attributes didn't initialize them when parsing because I have misread the documentation for YAML IO's mapOptional method. The mapOptional method doesn't actually set the values to the values returned by the default constructor for that type when the key value pair is missing, it just doesn't modify those values, so they still contain the value that was set during initialization by the default constructor. But the fields in yaml::MachineFunction with types like unsigned and bool are not initialized by default, and thus they can still be uninitialized after mapOptional during parsing. This commit adds default initialization for those fields to prevent this. llvm-svn: 240054	2015-06-18 20:57:41 +00:00
Sanjay Patel	314cd16f5d	don't repeat function / variable names in comments; NFC llvm-svn: 240035	2015-06-18 18:28:20 +00:00
Rafael Espindola	5c0ce7ad5b	Rename and improve emitSectionOffset. Different object formats represent references from dwarf in different ways. ELF uses a relocation to the referenced point (except for .dwo) and COFF/MachO use the offset of the referenced point inside its section. This patch renames emitSectionOffset because * It doesn't produce an offset on ELF. * It changes behavior depending on how DWARF is represented, so adding dwarf to its name is probably a good thing. The patch also adds an option to force the use of offsets.That avoids funny looking code like if (!UseOffsets) Asm->emitSectionOffset.... It was correct, but read as if the ! was inverted. llvm-svn: 239866	2015-06-16 23:22:02 +00:00
Sanjay Patel	ace9860241	propagate IR-level fast-math-flags to DAG nodes, disabled by default This is an updated version of the patch that was checked in at: http://reviews.llvm.org/rL237046 but subsequently reverted because it exposed a bug in the DAG Combiner: http://reviews.llvm.org/D9893 This time, there's an enablement flag ("EnableFMFInDAG") around the code in SelectionDAGBuilder where we copy the set of FP optimization flags from IR instructions to DAG nodes. So, in theory, there should be no functional change from this patch as-is, but it will allow testing with the added functionality to proceed via "-enable-fmf-dag" passed to llc. This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 Differential Revision: http://reviews.llvm.org/D10403 llvm-svn: 239828	2015-06-16 16:25:43 +00:00
Alex Lorenz	0686df9bd6	MIR Serialization: Print and parse simple machine function attributes. This commit serializes the simple, scalar attributes from the 'MachineFunction' class. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10449 llvm-svn: 239790	2015-06-16 00:10:47 +00:00
Alex Lorenz	2e08d769c3	MIR Serialization: Connect the machine function analysis pass to the MIR parser. This commit connects the machine function analysis pass (which creates machine functions) to the MIR parser, which will initialize the machine functions with the state from the MIR file and reconstruct the machine IR. This commit introduces a new interface called 'MachineFunctionInitializer', which can be used to provide custom initialization for the machine functions. This commit also introduces a new diagnostic class called 'DiagnosticInfoMIRParser' which is used for MIR parsing errors. This commit modifies the default diagnostic handling in LLVMContext - now the the diagnostics are printed directly into llvm::errs() so that the MIR parsing errors can be printed with colours. Reviewers: Justin Bogner Differential Revision: http://reviews.llvm.org/D9928 llvm-svn: 239753	2015-06-15 20:30:22 +00:00
Sanjoy Das	a95063f8bb	Unbreak the build from r239740. Do not re-use an enum name as a field name. Some bots don't like this. llvm-svn: 239746	2015-06-15 19:29:44 +00:00
Sanjoy Das	cc16b02fd5	[CodeGen] Add a pass to fold null checks into nearby memory operations. Summary: This change adds an "ImplicitNullChecks" target dependent pass. This pass folds null checks into memory operation using the FAULTING_LOAD pseudo-op introduced in previous patches. Depends on D10197 Depends on D10199 Depends on D10200 Reviewers: reames, rnk, pgavlin, JosephTremoulet, atrick Reviewed By: atrick Subscribers: ab, JosephTremoulet, llvm-commits Differential Revision: http://reviews.llvm.org/D10201 llvm-svn: 239743	2015-06-15 18:44:27 +00:00
Sanjoy Das	b396b9e375	[CodeGen] Introduce a FAULTING_LOAD_OP pseudo-op. Summary: This instruction encodes a loading operation that may fault, and a label to branch to if the load page-faults. The locations of potentially faulting loads and their "handler" destinations are recorded in a FaultMap section, meant to be consumed by LLVM's clients. Nothing generates FAULTING_LOAD_OP instructions yet, but they will be used in a future change. The documentation (FaultMaps.rst) needs improvement and I will update this diff with a more expanded version shortly. Depends on D10196 Reviewers: rnk, reames, AndyAyers, ab, atrick, pgavlin Reviewed By: atrick, pgavlin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10197 llvm-svn: 239740	2015-06-15 18:44:08 +00:00
Reid Kleckner	493c014968	[WinEH] Create an llvm.x86.seh.exceptioninfo intrinsic This intrinsic is like framerecover plus a load. It recovers the EH registration stack allocation from the parent frame and loads the exception information field out of it, giving back a pointer to an EXCEPTION_POINTERS struct. It's designed for clang to use in SEH filter expressions instead of accessing the EXCEPTION_POINTERS parameter that is available on x64. This required a minor change to MC to allow defining a label variable to another absolute framerecover label variable. llvm-svn: 239567	2015-06-11 22:32:23 +00:00
Douglas Katzman	0f36653e69	Fix English usage in command line flag help string. llvm-svn: 239556	2015-06-11 20:03:23 +00:00
Pete Cooper	8328fce315	Remove MachineModuleInfo::UsedFunctions as it has no users. It hasn't been used since r130964. This also removes MachineModuleInfo::isUsedFunction and MachineModuleInfo::AnalyzeModule, both of which were only there to support UsedFunctions. llvm-svn: 239501	2015-06-11 01:04:56 +00:00
Akira Hatanaka	c42437a4f8	Remove DisableTailCalls from TargetOptions and the code in resetTargetOptions that was resetting it. Remove the uses of DisableTailCalls in subclasses of TargetLowering and use the value of function attribute "disable-tail-calls" instead. Also, unconditionally add pass TailCallElim to the pipeline and check the function attribute at the start of runOnFunction to disable the pass on a per-function basis. This is part of the work to remove TargetMachine::resetTargetOptions, and since DisableTailCalls was the last non-fast-math option that was being reset in that function, we should be able to remove the function entirely after the work to propagate IR-level fast-math flags to DAG nodes is completed. Out-of-tree users should remove the uses of DisableTailCalls and make changes to attach attribute "disable-tail-calls"="true" or "false" to the functions in the IR. rdar://problem/13752163 Differential Revision: http://reviews.llvm.org/D10099 llvm-svn: 239427	2015-06-09 19:07:19 +00:00
Akira Hatanaka	bcc4af8eec	Include header file <functional>. This is a follow-up to r239325. llvm-svn: 239329	2015-06-08 19:03:32 +00:00
Akira Hatanaka	76bd57e472	[ARM] Pass a callback to FunctionPass constructors to enable skipping execution on a per-function basis. Previously some of the passes were conditionally added to ARM's pass pipeline based on the target machine's subtarget. This patch makes changes to add those passes unconditionally and execute them conditonally based on the predicate functor passed to the pass constructors. This enables running different sets of passes for different functions in the module. rdar://problem/20542263 Differential Revision: http://reviews.llvm.org/D8717 llvm-svn: 239325	2015-06-08 18:50:43 +00:00
Sanjay Patel	044a50e6ce	fix typos; NFC llvm-svn: 239303	2015-06-08 15:21:38 +00:00
Hao Liu	d39edfda46	[LoopVectorize] Teach Loop Vectorizor about interleaved memory accesses. Interleaved memory accesses are grouped and vectorized into vector load/store and shufflevector. E.g. for (i = 0; i < N; i+=2) { a = A[i]; // load of even element b = A[i+1]; // load of odd element ... // operations on a, b, c, d A[i] = c; // store of even element A[i+1] = d; // store of odd element } The loads of even and odd elements are identified as an interleave load group, which will be transfered into vectorized IRs like: %wide.vec = load <8 x i32>, <8 x i32>* %ptr %vec.even = shufflevector <8 x i32> %wide.vec, <8 x i32> undef, <4 x i32> <i32 0, i32 2, i32 4, i32 6> %vec.odd = shufflevector <8 x i32> %wide.vec, <8 x i32> undef, <4 x i32> <i32 1, i32 3, i32 5, i32 7> The stores of even and odd elements are identified as an interleave store group, which will be transfered into vectorized IRs like: %interleaved.vec = shufflevector <4 x i32> %vec.even, %vec.odd, <8 x i32> <i32 0, i32 4, i32 1, i32 5, i32 2, i32 6, i32 3, i32 7> store <8 x i32> %interleaved.vec, <8 x i32>* %ptr This optimization is currently disabled by defaut. To try it by adding '-enable-interleaved-mem-accesses=true'. llvm-svn: 239291	2015-06-08 06:39:56 +00:00
Matt Arsenault	b88a683ecf	Add isLegalAddressingMode address space argument to TTI Update to match the TLI version, and remove the TLI version's default argument. llvm-svn: 239260	2015-06-07 20:12:03 +00:00
Sanjay Patel	d41c35dc2f	make reciprocal estimate code generation more flexible by adding command-line options (3rd try) The first try (r238051) to land this was reverted due to ExecutionEngine build failure; that was hopefully addressed by r238788. The second try (r238842) to land this was reverted due to BUILD_SHARED_LIBS failure; that was hopefully addressed by r238953. This patch adds a TargetRecip class for processing many recip codegen possibilities. The class is intended to handle both command-line options to llc as well as options passed in from a front-end such as clang with the -mrecip option. The x86 backend is updated to use the new functionality. Only -mcpu=btver2 with -ffast-math should see a functional change from this patch. All other x86 CPUs continue to not use reciprocal estimates by default with -ffast-math. Differential Revision: http://reviews.llvm.org/D8982 llvm-svn: 239001	2015-06-04 01:32:35 +00:00
Rafael Espindola	81fe124a03	Revert "make reciprocal estimate code generation more flexible by adding command-line options (2nd try)" This reverts commit r238842. It broke -DBUILD_SHARED_LIBS=ON build. llvm-svn: 238900	2015-06-03 05:32:44 +00:00
Benjamin Kramer	9139d0a501	Push constness through LoopInfo::isLoopHeader and clean it up a bit. NFC. llvm-svn: 238843	2015-06-02 15:28:27 +00:00
Sanjay Patel	006dd407ed	make reciprocal estimate code generation more flexible by adding command-line options (2nd try) The first try (r238051) to land this was reverted due to bot failures that were hopefully addressed by r238788. This patch adds a TargetRecip class for processing many recip codegen possibilities. The class is intended to handle both command-line options to llc as well as options passed in from a front-end such as clang with the -mrecip option. The x86 backend is updated to use the new functionality. Only -mcpu=btver2 with -ffast-math should see a functional change from this patch. All other x86 CPUs continue to not use reciprocal estimates by default with -ffast-math. Differential Revision: http://reviews.llvm.org/D8982 llvm-svn: 238842	2015-06-02 15:28:15 +00:00
Matthias Braun	6e408c79f4	LiveRangeEdit: Fix liveranges not shrinking on subrange kill. If a dead instruction we may not only have a last-use in the main live range but also in a subregister range if subregisters are tracked. We need to partially rebuild live ranges in both cases. The testcase only broke when subregister liveness was enabled. I commited it in the current form because there is currently no flag to enable/disable subregister liveness. This fixes PR23720. llvm-svn: 238785	2015-06-01 21:26:26 +00:00
Chandler Carruth	6f372f0515	[sdag] Add the helper I most want to the DAG -- building a bitcast around a value using its existing SDLoc. Start using this in just one function to save omg lines of code. llvm-svn: 238638	2015-05-30 04:14:10 +00:00
Benjamin Kramer	0e31955b32	Replace push_back(Constructor(foo)) with emplace_back(foo) for non-trivial types If the type isn't trivially moveable emplace can skip a potentially expensive move. It also saves a couple of characters. Call sites were found with the ASTMatcher + some semi-automated cleanup. memberCallExpr( argumentCountIs(1), callee(methodDecl(hasName("push_back"))), on(hasType(recordDecl(has(namedDecl(hasName("emplace_back")))))), hasArgument(0, bindTemporaryExpr( hasType(recordDecl(hasNonTrivialDestructor())), has(constructExpr()))), unless(isInTemplateInstantiation())) No functional change intended. llvm-svn: 238602	2015-05-29 19:43:39 +00:00
Matthias Braun	4eae512569	CodeGen: Use mop_iterator instead of MIOperands/ConstMIOperands MIOperands/ConstMIOperands are classes iterating over the MachineOperand of a MachineInstr, however MachineInstr::mop_iterator does the same thing. I assume these two iterators exist to have a uniform interface to iterate over the operands of a machine instruction bundle and a single machine instruction. However in practice I find it more confusing to have 2 different iterator classes, so this patch transforms (nearly all) the code to use mop_iterators. The only exception being MIOperands::anlayzePhysReg() and MIOperands::analyzeVirtReg() still needing an equivalent, I leave that as an exercise for the next patch. Differential Revision: http://reviews.llvm.org/D9932 This version is slightly modified from the proposed revision in that it introduces MachineInstr::getOperandNo to avoid the extra counting variable in the few loops that previously used MIOperands::getOperandNo. llvm-svn: 238539	2015-05-29 02:56:46 +00:00
Matthias Braun	ab15095964	MachineFrameInfo: Simplify pristine register calculation. About pristine regsiters: Pristine registers "hold a value that is useless to the current function, but that must be preserved - they are callee saved registers that have not been saved." This concept saves compile time as it frees the prologue/epilogue inserter from adding every such register to every basic blocks live-in list. However the current code in getPristineRegs is formulated in a complicated way: Inside the function prologue and epilogue all callee saves are considered pristine, while in the rest of the code only the non-saved ones are considered pristine. This requires logic to differentiate between prologue/epilogue and the rest and in the presence of shrink-wrapping this even becomes complicated/expensive. It's also unnecessary because the prologue epilogue inserters already mark callee-save registers that are saved/restores properly in the respective blocks in the prologue/epilogue (see updateLiveness() in PrologueEpilogueInserter.cpp). So only declaring non-saved/restored callee saved registers as pristine just works. Differential Revision: http://reviews.llvm.org/D10101 llvm-svn: 238524	2015-05-28 23:20:35 +00:00
Alex Lorenz	8269eba38f	MIR Serialization: print and parse machine function names. This commit introduces a serializable structure called 'llvm::yaml::MachineFunction' that stores the machine function's name. This structure will mirror the machine function's state in the future. This commit prints machine functions as YAML documents containing a YAML mapping that stores the state of a machine function. This commit also parses the YAML documents that contain the machine functions. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D9841 llvm-svn: 238519	2015-05-28 22:41:12 +00:00
Reid Kleckner	711f4eb0a1	[WinEH] Start inserting state number stores for C++ EH This moves all the state numbering code for C++ EH to WinEHPrepare so that we can call it from the X86 state numbering IR pass that runs before isel. Now we just call the same state numbering machinery and insert a bunch of stores. It also populates MachineModuleInfo with information about the current function. llvm-svn: 238514	2015-05-28 22:00:24 +00:00
Duncan P. N. Exon Smith	9ba8af76a1	AsmPrinter: Stop exposing underlying DIE children list, NFC Update `DIE` API to hide the implementation of `DIE::Children` so we can swap it out. llvm-svn: 238468	2015-05-28 19:56:34 +00:00
Duncan P. N. Exon Smith	0c791a688f	AsmPrinter: Rename begin_values() => values_begin(), NFC llvm-svn: 238456	2015-05-28 18:55:38 +00:00
Owen Anderson	e507d8cb77	Add support for the convergent flag at the MC and MachineInstr levels. llvm-svn: 238450	2015-05-28 18:33:39 +00:00
Duncan P. N. Exon Smith	6869f59e40	AsmPrinter: Suppress warnings on GCC from r238362, NFC GCC seems to have some overzealous warnings about strict aliasing. Rafael reports that this patch suppresses them on GCC 4.9, and I'm hoping this will work for GCC 4.7 as well. I'll watch [1] and iterate if necessary. [1]: http://bb.pgr.jp/builders/clang-3stage-x86_64-linux/builds/8597 llvm-svn: 238447	2015-05-28 18:09:13 +00:00
Duncan P. N. Exon Smith	78ae127736	AsmPrinter: Return added DIE from DIE::addChild() Change `DIE::addChild()` to return a reference to the just-added node, and update consumers to use it directly. An upcoming commit will abstract away (and eventually change) the underlying storage of `DIE::Children`. llvm-svn: 238372	2015-05-27 22:59:03 +00:00
Duncan P. N. Exon Smith	287e807ffb	AsmPrinter: Stop exposing underlying DIEValue list, NFC Change the `DIE` API to hide the implementation of the list of `DIEValue`s. llvm-svn: 238369	2015-05-27 22:44:06 +00:00
Duncan P. N. Exon Smith	c4e8cdcf10	AsmPrinter: Store abbreviation data directly in DIE and DIEValue Stop storing a `DIEAbbrev` in `DIE`, since the data fits neatly inside the `DIEValue` list. Besides being a cleaner data structure (avoiding the parallel arrays), this gives us more freedom to rearrange the `DIEValue` list. This fixes the temporary memory regression from 845 MB up to 879 MB, and drops it further to 829 MB for a net memory decrease of around 1.9% (incremental decrease around 5.7%). (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238364	2015-05-27 22:31:41 +00:00
Duncan P. N. Exon Smith	65e3562960	Reapply "AsmPrinter: Change DIEValue to be stored by value" This reverts commit r238350, effectively reapplying r238349 after fixing (all?) the problems, all somehow related to how I was using `AlignedArrayCharUnion<>` inside `DIEValue`: - MSVC can only handle `sizeof()` on types, not values. Change the assert. - GCC doesn't know the `is_trivially_copyable` type trait. Instead of asserting it, add destructors. - Call placement new even when constructing POD (i.e., the pointers). - Instead of copying the char buffer, copy the casted classes. I've left in a couple of `static_assert`s that I think both MSVC and GCC know how to handle. If the bots disagree with me, I'll remove them. - Check that the constructed type is either standard layout or a pointer. This protects against a programming error: we really want the "small" `DIEValue`s to be small and simple, so don't accidentally change them not to be. - Similarly, check that the size of the buffer is no bigger than a `uint64_t` or a pointer. (I thought checking against `sizeof(uint64_t)` would be good enough, but Chandler suggested that pointers might sometimes be bigger than that in the context of sanitizers.) I've also committed r238359 in the meantime, which introduces a DIEValue.def to simplify dispatching between the various types (thanks to a review comment by David Blaikie). Without that, this commit would be almost unintelligible. Here's the original commit message: -- Change `DIEValue` to be stored/passed/etc. by value, instead of reference. It's now a discriminated union, with a `Val` field storing the actual type. The classes that used to inherit from `DIEValue` no longer do. There are two categories of these: - Small values fit in a single pointer and are stored by value. - Large values require auxiliary storage, and are stored by reference. The only non-mechanical change is to tools/dsymutil/DwarfLinker.cpp. It was relying on `DIEInteger`s being passed around by reference, so I replaced that assumption with a `PatchLocation` type that stores a safe reference to where the `DIEInteger` lives instead. This commit causes a temporary regression in memory usage, since I've left merging `DIEAbbrevData` into `DIEValue` for a follow-up commit. I measured an increase from 845 MB to 879 MB, around 3.9%. The follow-up drops it lower than the starting point, and I've only recently brought the memory this low anyway, so I'm committing these changes separately to keep them incremental. (I also considered swapping the commits, but the other one first would cause a lot more code churn.) (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) -- llvm-svn: 238362	2015-05-27 22:14:58 +00:00
Duncan P. N. Exon Smith	963930d2a6	AsmPrinter: Introduce DIEValue.def, NFC Use a .def macro file to iterate through the various subclasses of `DIEValue`. llvm-svn: 238359	2015-05-27 21:15:43 +00:00
Duncan P. N. Exon Smith	ce281b5177	Revert "AsmPrinter: Change DIEValue to be stored by value" This reverts commit r238349, since it caused some errors on bots: - std::is_trivially_copyable isn't available until GCC 5.0. - It was complaining about strict aliasing with my use of ArrayCharUnion. llvm-svn: 238350	2015-05-27 19:30:27 +00:00
Duncan P. N. Exon Smith	c8fd56ec31	AsmPrinter: Change DIEValue to be stored by value Change `DIEValue` to be stored/passed/etc. by value, instead of reference. It's now a discriminated union, with a `Val` field storing the actual type. The classes that used to inherit from `DIEValue` no longer do. There are two categories of these: - Small values fit in a single pointer and are stored by value. - Large values require auxiliary storage, and are stored by reference. The only non-mechanical change is to tools/dsymutil/DwarfLinker.cpp. It was relying on `DIEInteger`s being passed around by reference, so I replaced that assumption with a `PatchLocation` type that stores a safe reference to where the `DIEInteger` lives instead. This commit causes a temporary regression in memory usage, since I've left merging `DIEAbbrevData` into `DIEValue` for a follow-up commit. I measured an increase from 845 MB to 879 MB, around 3.9%. The follow-up drops it lower than the starting point, and I've only recently brought the memory this low anyway, so I'm committing these changes separately to keep them incremental. (I also considered swapping the commits, but the other one first would cause a lot more code churn.) (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238349	2015-05-27 19:22:50 +00:00
Duncan P. N. Exon Smith	ceedfec2fb	AsmPrinter: Reorganize DIE.h, NFC An upcoming commit will switch to storing `DIEValue`s by value in a `DIE`, so move the `DIE` down below all the subclasses it can. Two subclasses inherit from `DIE`, so drop those down below `DIE`. `DIEValue` will become a discriminated union, which contains an instance of one of the classes that are currently subclasses. "Big" ones, such as those that inherit from `DIE`, will continue to be stored by pointer. This commit does as much of the code motion ahead of time as possible. llvm-svn: 238346	2015-05-27 18:49:38 +00:00
Alex Lorenz	b5ebbcd330	Resubmit r237954 (MIR Serialization: print and parse LLVM IR using MIR format). This commit a 3rd attempt at comitting the initial MIR serialization patch. The first commit (r237708) was reverted in 237730. Then the second commit (r237954) was reverted in r238007, as the MIR library under CodeGen caused a circular dependency where the CodeGen library depended on MIR and MIR library depended on CodeGen. This commit has fixed the dependencies between CodeGen and MIR by reorganizing the MIR serialization code - the code that prints out MIR has been moved to CodeGen, and the MIR library has been renamed to MIRParser. Now the CodeGen library doesn't depend on the MIRParser library, thus the circular dependency no longer exists. --Original Commit Message-- MIR Serialization: print and parse LLVM IR using MIR format. This commit is the initial commit for the MIR serialization project. It creates a new library under CodeGen called 'MIR'. This new library adds a new machine function pass that prints out the LLVM IR using the MIR format. This pass is then added as a last pass when a 'stop-after' option is used in llc. The new library adds the initial functionality for parsing of MIR files as well. This commit also extends the llc tool so that it can recognize and parse MIR input files. Reviewers: Duncan P. N. Exon Smith, Matthias Braun, Philip Reames Differential Revision: http://reviews.llvm.org/D9616 llvm-svn: 238341	2015-05-27 18:02:19 +00:00
Matthias Braun	d199caeaa9	MachineBasicBlock: Cleanup computeRegisterLiveness() - Clean documentation comment - Change the API to accept an iterator so you can actually pass MachineBasicBlock::end() now. - Add more "const". llvm-svn: 238288	2015-05-27 05:12:39 +00:00
Akira Hatanaka	6f08fc2ca6	Remove NoFramePointerElim and NoFramePointerElimOverride from TargetOptions and remove ExecutionEngine's dependence on CodeGen. NFC. This is a follow-up to r238080. Differential Revision: http://reviews.llvm.org/D9830 llvm-svn: 238244	2015-05-26 20:17:20 +00:00
Matt Arsenault	9b6718f328	Make EmitFunctionHeader virtual This is to fix problems introduced by r232481. For HSAIL, this function does essentially nothing desirable, and injects unwanted / incorrect stuff before the function. The only thing it really needs to do is call EmitFunctionEntryLabel in this case. llvm-svn: 238222	2015-05-26 17:33:15 +00:00
NAKAMURA Takumi	f5fce1248c	Prune CRLFs. llvm-svn: 238125	2015-05-25 01:43:23 +00:00
Duncan P. N. Exon Smith	7d1c798112	AsmPrinter: Make DIEValue::Ty private, NFC llvm-svn: 238123	2015-05-24 17:03:15 +00:00
Duncan P. N. Exon Smith	018e1309ce	AsmPrinter: Avoid creating symbols in DwarfStringPool Stop creating symbols we don't need in `DwarfStringPool`. The consumers only call `DwarfStringPoolEntryRef::getSymbol()` when DWARF is relocatable, so this just stops creating the unused symbols when it's not. This drops memory usage from 851 MB to 845 MB, around 0.7%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238122	2015-05-24 16:58:59 +00:00
Duncan P. N. Exon Smith	34971e156b	AsmPrinter: Avoid EmitLabelDifference() in DwarfAccelTable Mint a new function, `AsmPrinter::emitDwarfStringOffset()`, which takes a `DwarfStringPoolEntryRef`. When DWARF is relocatable across sections, this defers to `emitSectionOffset()` and emits the `MCSymbol`; otherwise, just emit the offset directly, without using any intermediate symbols. `EmitLabelDifference()` is already optimized to emit absolute label differences cheaply when possible, so there aren't any major memory savings here (853 MB down to 851 MB, or 0.2%). However, it prepares for making the `MCSymbol`s in the `DwarfStringPool` optional. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238119	2015-05-24 16:48:54 +00:00
Duncan P. N. Exon Smith	017697dbaf	AsmPrinter: Use DwarfStringPoolEntry in DwarfAccelTable, NFC This is just an API change, but it prepares to stop using `EmitLabelDifference()` when possible. llvm-svn: 238118	2015-05-24 16:44:32 +00:00
Duncan P. N. Exon Smith	1a688379a8	AsmPrinter: Make DIEString small Expose the `DwarfStringPool` entry in a header, and store a pointer to it directly in `DIEString`. Instead of choosing at creation time how to emit it, use the `dwarf::Form` to determine that at emission time. Besides avoiding the other `DIEValue`, this shaves two pointers off of `DIEString`; the data is now a single pointer. This is a nice cleanup on its own -- and drops memory usage from 861 MB down to 853 MB, around 0.9% -- but it's also preparation for passing `DIEValue`s by value. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238117	2015-05-24 16:40:47 +00:00
Duncan P. N. Exon Smith	11482e6ab1	AsmPrinter: Extract DwarfStringPoolEntry from DwarfStringPool, NFC Extract out `DwarfStringPoolEntry` and `DwarfStringPoolRef` from `DwarfStringPool` so that downstream users can start using `DwarfStringPool::getEntry()` directly. This will allow users to delay the decision between emitting a symbol or an offset until later. llvm-svn: 238116	2015-05-24 16:33:33 +00:00
Duncan P. N. Exon Smith	9252f8630f	AsmPrinter: Remove the vtable-entry from DIEValue Remove all virtual functions from `DIEValue`, dropping the vtable pointer from its layout. Instead, create "impl" functions on the subclasses, and use the `DIEValue::Type` to implement the dynamic dispatch. This is necessary -- obviously not sufficient -- for passing `DIEValue`s around by value. However, this change stands on its own: we make tons of these. I measured a drop in memory usage from 888 MB down to 860 MB, or around 3.2%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238084	2015-05-23 01:45:07 +00:00
Duncan P. N. Exon Smith	c9056c2399	CodeGen: Remove redundant DIETypeSignature::dump(), NFC We already have this in `DIEValue`; no reason to shadow it. llvm-svn: 238082	2015-05-23 01:26:26 +00:00
Akira Hatanaka	626316f0c3	Stop resetting NoFramePointerElim in TargetMachine::resetTargetOptions. This is part of the work to remove TargetMachine::resetTargetOptions. In this patch, instead of updating global variable NoFramePointerElim in resetTargetOptions, its use in DisableFramePointerElim is replaced with a call to TargetFrameLowering::noFramePointerElim. This function determines on a per-function basis if frame pointer elimination should be disabled. There is no change in functionality except that cl:opt option "disable-fp-elim" can now override function attribute "no-frame-pointer-elim". llvm-svn: 238080	2015-05-23 01:14:08 +00:00
Akira Hatanaka	c7f329a550	Simplify and rename function overrideFunctionAttributes. NFC. This is in preparation to making changes needed to stop resetting NoFramePointerElim in resetTargetOptions. llvm-svn: 238079	2015-05-23 01:12:26 +00:00
Rafael Espindola	693b117a0a	Revert "make reciprocal estimate code generation more flexible by adding command-line options" This reverts commit r238051. It broke some bots: http://lab.llvm.org:8011/builders/llvm-ppc64-linux1/builds/18190 llvm-svn: 238075	2015-05-23 00:22:44 +00:00
Sanjay Patel	d16f201696	make reciprocal estimate code generation more flexible by adding command-line options This patch adds a class for processing many recip codegen possibilities. The TargetRecip class is intended to handle both command-line options to llc as well as options passed in from a front-end such as clang with the -mrecip option. The x86 backend is updated to use the new functionality. Only -mcpu=btver2 with -ffast-math should see a functional change from this patch. All other CPUs continue to not use reciprocal estimates by default with -ffast-math. Differential Revision: http://reviews.llvm.org/D8982 llvm-svn: 238051	2015-05-22 21:10:06 +00:00
Sanjay Patel	e25813a137	remove names from comments, fix typos; NFC llvm-svn: 238027	2015-05-22 15:38:24 +00:00
Sanjay Patel	c8f21ecdf6	fix typo in comment; NFC llvm-svn: 238025	2015-05-22 15:31:47 +00:00
Puyan Lotfi	7a4c16a356	Compile time improvements to VirtRegRewriter. This change to VirtRegRewriter::addMBBLiveIns adds live-in registers for each MachineBasicBlock's LiveIns set without isLiveIn checks as they are being added because doing so is expensive. After all live-in registers are added, the LiveIn vectors are sorted and uniqued. llvm-svn: 238008	2015-05-22 08:11:26 +00:00
NAKAMURA Takumi	4a7fccdf3b	Revert r237954, "Resubmit r237708 (MIR Serialization: print and parse LLVM IR using MIR format)." It brought cyclic dependencies between LLVMCodeGen and LLVMMIR. llvm-svn: 238007	2015-05-22 07:17:07 +00:00
Duncan P. N. Exon Smith	d24ab51c79	SDAG: Give SDDbgValues their own allocator (and reset it) Previously `SDDbgValue`s used the general allocator that lives for all of `SelectionDAG`. Instead, give them their own allocator, and reset it whenever `SDDbgInfo::clear()` is called, plugging a spiritual leak. This drops `SelectionDAGBuilder::visitIntrinsicCall()` off of my heap profile (was at around 2% of `llc` for codegen of `-flto -g`). Thanks to Pete Cooper for spotting the problem and suggesting the fix. llvm-svn: 237998	2015-05-22 05:45:19 +00:00
Chad Rosier	c4d3b538dd	[MachineInstr] Add mayLoadOrStore API. NFC. llvm-svn: 237955	2015-05-21 21:00:30 +00:00
Alex Lorenz	4c932ccc6c	Resubmit r237708 (MIR Serialization: print and parse LLVM IR using MIR format). This commit is a 2nd attempt at committing the initial MIR serialization patch. The first commit (r237708) made the incremental buildbots unstable and was reverted in r237730. The original commit didn't add a terminating null character to the LLVM IR source which was passed to LLParser, and this sometimes caused the test 'llvmIR.mir' to fail with a parsing error because the LLVM IR source didn't have a null character immediately after the end and thus LLLexer encountered some garbage characters that ultimately caused the error. This commit also includes the other test fixes I committed in r237712 (llc path fix) and r237723 (remove target triple) which also got reverted in r237730. --Original Commit Message-- MIR Serialization: print and parse LLVM IR using MIR format. This commit is the initial commit for the MIR serialization project. It creates a new library under CodeGen called 'MIR'. This new library adds a new machine function pass that prints out the LLVM IR using the MIR format. This pass is then added as a last pass when a 'stop-after' option is used in llc. The new library adds the initial functionality for parsing of MIR files as well. This commit also extends the llc tool so that it can recognize and parse MIR input files. Reviewers: Duncan P. N. Exon Smith, Matthias Braun, Philip Reames Differential Revision: http://reviews.llvm.org/D9616 llvm-svn: 237954	2015-05-21 20:54:45 +00:00
Rafael Espindola	dda3f1317e	Move alignment from MCSectionData to MCSection. This starts merging MCSection and MCSectionData. There are a few issues with the current split between MCSection and MCSectionData. * It optimizes the the not as important case. We want the production of .o files to be really fast, but the split puts the information used for .o emission in a separate data structure. * The ELF/COFF/MachO hierarchy is not represented in MCSectionData, leading to some ad-hoc ways to represent the various flags. * It makes it harder to remember where each item is. The attached patch starts merging the two by moving the alignment from MCSectionData to MCSection. Most of the patch is actually just dropping 'const', since MCSectionData is mutable, but MCSection was not. llvm-svn: 237936	2015-05-21 19:20:38 +00:00
Sanjay Patel	9c27a1a2bc	don't repeat function names in comments; NFC llvm-svn: 237911	2015-05-21 16:42:22 +00:00
Andrew Kaylor	7532e8ed04	[WinEH] C++ EH state numbering fixes Differential Revision: http://reviews.llvm.org/D9787 llvm-svn: 237854	2015-05-20 23:22:24 +00:00
Igor Laevsky	277d7cfcc2	[StatepointLowering] Support of the gc.relocates for invoke statepoints. This change implements support for lowering of the gc.relocates tied to the invoke statepoint. This is acomplished by storing frame indices of the lowered values in "StatepointRelocatedValues" map inside FunctionLoweringInfo instead of storing them in per-basic block structure StatepointLowering. After this change StatepointLowering is used only during "LowerStatepoint" call and it is not necessary to store it as a field in SelectionDAGBuilder anymore. Differential Revision: http://reviews.llvm.org/D7798 llvm-svn: 237786	2015-05-20 11:37:25 +00:00
Swaroop Sridhar	8765ceb638	Add a GCStrategy for CoreCLR This change adds a new GC strategy for supporting the CoreCLR runtime. This strategy is currently identical to Statepoint-example GC, but is necessary for several upcoming changes specific to CoreCLR, such as: 1. Base-pointers not explicitly reported for interior pointers 2. Different format for stack-map encoding 3. Location of Safe-point polls: polls are only needed before loop-back edges and before tail-calls (not needed at function-entry) 4. Runtime specific handshake between calls to managed/unmanaged functions. llvm-svn: 237753	2015-05-20 01:07:23 +00:00
Akira Hatanaka	e7b801e86a	Add a comment line that was accidentally dropped in r236671. llvm-svn: 237734	2015-05-19 21:59:11 +00:00
Alex Lorenz	740bc929d2	Revert r237708 (MIR serialization) - incremental buildbots became unstable. The incremental buildbots entered a pass-fail cycle where during the fail cycle one of the tests from this commit fails for an unknown reason. I have reverted this commit and will investigate the cause of this problem. llvm-svn: 237730	2015-05-19 21:41:28 +00:00
Matthias Braun	0e49c41528	MachineInstr: Remove unused parameter. llvm-svn: 237726	2015-05-19 21:22:20 +00:00
Alex Lorenz	cb5500c145	MIR Serialization: print and parse LLVM IR using MIR format. This commit is the initial commit for the MIR serialization project. It creates a new library under CodeGen called 'MIR'. This new library adds a new machine function pass that prints out the LLVM IR using the MIR format. This pass is then added as a last pass when a 'stop-after' option is used in llc. The new library adds the initial functionality for parsing of MIR files as well. This commit also extends the llc tool so that it can recognize and parse MIR input files. Reviewers: Duncan P. N. Exon Smith, Matthias Braun, Philip Reames Differential Revision: http://reviews.llvm.org/D9616 llvm-svn: 237708	2015-05-19 18:17:39 +00:00
Matthias Braun	ed527f2f9e	SelectionDAG: Cleanup and simplify FoldConstantArithmetic This cleans up the FoldConstantArithmetic code by factoring out the case of two ConstantSDNodes into an own function. This avoids unnecessary complexity for many callers who already have ConstantSDNode arguments. This also avoids an intermeidate SmallVector datastructure and a loop over that datastructure. llvm-svn: 237651	2015-05-19 01:40:21 +00:00
Matthias Braun	4662acc8ae	MachineInstr: Change return value of getOpcode() to unsigned. This was previously returning int. However there are no negative opcode numbers and more importantly this was needlessly different from MCInstrDesc::getOpcode() (which even is the value returned here) and SDValue::getOpcode()/SDNode::getOpcode(). llvm-svn: 237611	2015-05-18 20:27:55 +00:00
Benjamin Kramer	541b0eac9b	[WinEH] Push unique_ptr through the Action interface. This was the source of many leaks in the past, this should fix them once and for all. llvm-svn: 237524	2015-05-16 15:40:03 +00:00
Pete Cooper	8d13c88def	Remove 3 includes from MCInstrDesc.h and explicitly include them where needed llvm-svn: 237481	2015-05-15 21:58:42 +00:00
James Molloy	64b6b65ee6	Add SDNodes for umin, umax, smin and smax. This adds new SDNodes for signed/unsigned min/max. These nodes are built from select/icmp pairs matched at SDAGBuilder stage. This patch adds the nodes, as well as legalization support and sets them to be "expand" for all targets. NFC for now; this will be tested when I switch AArch64 to using these new nodes. llvm-svn: 237423	2015-05-15 09:03:15 +00:00
Akira Hatanaka	d1c869cd1a	Stop resetting SanitizeAddress in TargetMachine::resetTargetOptions. NFC. Instead of doing that, create a temporary copy of MCTargetOptions and reset its SanitizeAddress field based on the function's attribute every time an InlineAsm instruction is emitted in AsmPrinter::EmitInlineAsm. This is part of the work to remove TargetMachine::resetTargetOptions (the FIXME added to TargetMachine.cpp in r236009 explains why this function has to be removed). Differential Revision: http://reviews.llvm.org/D9570 llvm-svn: 237412	2015-05-15 00:20:44 +00:00
Matthias Braun	2b48b0ebda	TargetSchedule: factor out common code; NFC llvm-svn: 237376	2015-05-14 18:01:13 +00:00
Nick Lewycky	fe62b3c55b	Revert r237046. See the testcase on the thread where r237046 was committed. llvm-svn: 237317	2015-05-13 23:41:47 +00:00
Sergey Dmitrouk	5e3a4d1a49	[DebugInfo] Debug locations for constant SD nodes Several updates for [DebugInfo] Add debug locations to constant SD nodes (r235989). Includes: * re-enabling the change (disabled recently); * missing change for FP constants; * resetting debug location of constant node if it's used more than at one place to prevent emission of wrong locations in case of coalesced constants; * a couple of additional tests. Now all look ups in CSEMap are wrapped by additional method. Comment in D9084 suggests that debug locations aren't useful for "target constants", so there might be one more change related to this API (namely, dropping debug locations for getTarget*Constant methods). Differential Revision: http://reviews.llvm.org/D9604 llvm-svn: 237237	2015-05-13 08:58:03 +00:00
Sanjoy Das	6d67db8c09	[Statepoints] Support for "patchable" statepoints. Summary: This change adds two new parameters to the statepoint intrinsic, `i64 id` and `i32 num_patch_bytes`. `id` gets propagated to the ID field in the generated StackMap section. If the `num_patch_bytes` is non-zero then the statepoint is lowered to `num_patch_bytes` bytes of nops instead of a call (the spill and reload code remains unchanged). A non-zero `num_patch_bytes` is useful in situations where a language runtime requires complete control over how a call is lowered. This change brings statepoints one step closer to patchpoints. With some additional work (that is not part of this patch) it should be possible to get rid of `TargetOpcode::STATEPOINT` altogether. PlaceSafepoints generates `statepoint` wrappers with `id` set to `0xABCDEF00` (the old default value for the ID reported in the stackmap) and `num_patch_bytes` set to `0`. This can be made more sophisticated later. Reviewers: reames, pgavlin, swaroop.sridhar, AndyAyers Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9546 llvm-svn: 237214	2015-05-12 23:52:24 +00:00
Richard Smith	1a7ba091af	Add missing #include and forward decl, found by modules build. llvm-svn: 237195	2015-05-12 21:49:03 +00:00
Pat Gavlin	f7cb0841d2	[Statepoints] Split the calling convention and statepoint flags operand to STATEPOINT into two separate operands. Differential Revision: http://reviews.llvm.org/D9623 llvm-svn: 237166	2015-05-12 19:50:19 +00:00
Sanjay Patel	29eae3af52	fix typos, grammar; NFC llvm-svn: 237139	2015-05-12 14:52:42 +00:00
Eric Christopher	2ba04d1116	Migrate existing backends that care about software floating point to use the information in the module rather than TargetOptions. We've had and clang has used the use-soft-float attribute for some time now so have the backends set a subtarget feature based on a particular function now that subtargets are created based on functions and function attributes. For the one middle end soft float check go ahead and create an overloadable TargetLowering::useSoftFloat function that just checks the TargetSubtargetInfo in all cases. Also remove the command line option that hard codes whether or not soft-float is set by using the attribute for all of the target specific test cases - for the generic just go ahead and add the attribute in the one case that showed up. llvm-svn: 237079	2015-05-12 01:26:05 +00:00
Richard Smith	7bf03500ac	Add more missing #includes, found by modules build. llvm-svn: 237062	2015-05-11 22:41:07 +00:00
Sanjay Patel	87722498a9	propagate IR-level fast-math-flags to DAG nodes; 2nd try; NFC This is a less ambitious version of: http://reviews.llvm.org/rL236546 because that was reverted in: http://reviews.llvm.org/rL236600 because it caused memory corruption that wasn't related to FMF but was actually due to making nodes with 2 operands derive from a plain SDNode rather than a BinarySDNode. This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. llvm-svn: 237046	2015-05-11 21:07:09 +00:00
Andrew Kaylor	295660694e	[WinEH] Update exception numbering to give handlers their own base state. Differential Revision: http://reviews.llvm.org/D9512 llvm-svn: 237014	2015-05-11 19:41:19 +00:00
Sanjay Patel	0b97042089	group getNode() variants by purpose and add comments; NFC llvm-svn: 237013	2015-05-11 19:34:10 +00:00
Tom Stellard	529f439abb	CodeGen: Make MachineInstr::untieRegOperand() a public function This makes it easier to update in place instructions with tied operands. Differential Revision: http://reviews.llvm.org/D9231 llvm-svn: 237005	2015-05-11 17:40:54 +00:00
Arnold Schwaighofer	d6f4926afa	ScheduleDAGInstrs: In functions with tail calls PseudoSourceValues are not non-aliasing distinct objects The code that builds the dependence graph assumes that two PseudoSourceValues don't alias. In a tail calling function two FixedStackObjects might refer to the same location. Worse 'immutable' fixed stack objects like function arguments are not immutable and will be clobbered. Change this so that a load from a FixedStackObject is not invariant in a tail calling function and don't return a PseudoSourceValue for an instruction in tail calling functions when building the dependence graph so that we handle function arguments conservatively. Fix for PR23459. rdar://20740035 llvm-svn: 236916	2015-05-08 23:52:00 +00:00
Pat Gavlin	c022b8d288	Extend the statepoint intrinsic to allow statepoints to be marked as transitions from GC-aware code to code that is not GC-aware. This changes the shape of the statepoint intrinsic from: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 unused, ...call args, i32 # deopt args, ...deopt args, ...gc args) to: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 flags, ...call args, i32 # transition args, ...transition args, i32 # deopt args, ...deopt args, ...gc args) This extension offers the backend the opportunity to insert (somewhat) arbitrary code to manage the transition from GC-aware code to code that is not GC-aware and back. In order to support the injection of transition code, this extension wraps the STATEPOINT ISD node generated by the usual lowering lowering with two additional nodes: GC_TRANSITION_START and GC_TRANSITION_END. The transition arguments that were passed passed to the intrinsic (if any) are lowered and provided as operands to these nodes and may be used by the backend during code generation. Eventually, the lowering of the GC_TRANSITION_{START,END} nodes should be informed by the GC strategy in use for the function containing the intrinsic call; for now, these nodes are instead replaced with no-ops. Differential Revision: http://reviews.llvm.org/D9501 llvm-svn: 236888	2015-05-08 18:07:42 +00:00
Sergey Dmitrouk	6e4ac5dd5c	Disable r235989 "Reapply r235977 "[DebugInfo] Add debug locations to constant SD nodes"" Will be re-enabled with missing changes for ConstantFPSDNode and fixes for wrong locations due to constant coalescing. llvm-svn: 236758	2015-05-07 18:33:50 +00:00
Akira Hatanaka	ae9350177e	Let llc and opt override "-target-cpu" and "-target-features" via command line options. This commit fixes a bug in llc and opt where "-mcpu" and "-mattr" wouldn't override function attributes "-target-cpu" and "-target-features" in the IR. Differential Revision: http://reviews.llvm.org/D9537 llvm-svn: 236677	2015-05-06 23:54:14 +00:00
Akira Hatanaka	59565430e4	Factor out a function which determines the cpu and feature strings based on command line options -mcpu and -mattr. NFC. llvm-svn: 236671	2015-05-06 23:49:24 +00:00
Wei Mi	338b822ab9	[X86] Disable loop unrolling in loop vectorization pass when VF is 1. The patch disabled unrolling in loop vectorization pass when VF==1 on x86 architecture, by setting MaxInterleaveFactor to 1. Unrolling in loop vectorization pass may introduce the cost of overflow check, memory boundary check and extra prologue/epilogue code when regular unroller will unroll the loop another time. Disable it when VF==1 remove the unnecessary cost on x86. The same can be done for other platforms after verifying interleaving/memory bound checking to be not perf critical on those platforms. Differential Revision: http://reviews.llvm.org/D9515 llvm-svn: 236613	2015-05-06 17:12:25 +00:00
Matt Arsenault	9f6dfaa664	Add ChangeTo* to MachineOperand for symbols llvm-svn: 236612	2015-05-06 17:05:54 +00:00
NAKAMURA Takumi	3fdcf7a5c9	Reformat. llvm-svn: 236601	2015-05-06 14:03:22 +00:00
NAKAMURA Takumi	2655f48b30	Revert r236546, "propagate IR-level fast-math-flags to DAG nodes (NFC)" It caused undefined behavior. llvm-svn: 236600	2015-05-06 14:03:12 +00:00
Sanjay Patel	b7125c62c7	propagate IR-level fast-math-flags to DAG nodes (NFC) This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. There are 2 structural changes here: 1. The main diff is that we're preparing to extend the optimization flags to affect more than just binary SDNodes. Eg, IR intrinsics ( https://llvm.org/bugs/show_bug.cgi?id=21290 ) or non-binop nodes that don't even exist in IR such as FMA, FNEG, etc. 2. The other change is that we're actually copying the FP fast-math-flags from the IR instructions to SDNodes. Differential Revision: http://reviews.llvm.org/D8900 llvm-svn: 236546	2015-05-05 21:40:38 +00:00
Pete Cooper	cdb160f205	Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC Note, this is a reapplication of r236515 with a fix to not assert on non-register operands, but instead only handle them until the subsequent commit. Original commit message follows. The code was basically the same here already. Just added an out parameter for a vector of seen defs so that UpdatePredRedefs can call StepForward first, then do its own post processing on the seen defs. Will be used in the next commit to also handle regmasks. llvm-svn: 236538	2015-05-05 20:14:22 +00:00
Pete Cooper	a4a30efee4	Revert "Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC" This reverts commit 963cdbccf6e5578822836fd9b2ebece0ba9a60b7 (ie r236514) This is to get the bots green while i investigate. llvm-svn: 236518	2015-05-05 18:49:08 +00:00
Pete Cooper	0705733a6e	Refactor UpdatePredRedefs and StepForward to avoid duplication. NFC The code was basically the same here already. Just added an out parameter for a vector of seen defs so that UpdatePredRedefs can call StepForward first, then do its own post processing on the seen defs. Will be used in the next commit to also handle regmasks. llvm-svn: 236514	2015-05-05 18:31:31 +00:00
Quentin Colombet	c82cc9dc57	[ShrinkWrap] Add (a simplified version) of shrink-wrapping. This patch introduces a new pass that computes the safe point to insert the prologue and epilogue of the function. The interest is to find safe points that are cheaper than the entry and exits blocks. As an example and to avoid regressions to be introduce, this patch also implements the required bits to enable the shrink-wrapping pass for AArch64. Context Currently we insert the prologue and epilogue of the method/function in the entry and exits blocks. Although this is correct, we can do a better job when those are not immediately required and insert them at less frequently executed places. The job of the shrink-wrapping pass is to identify such places. Motivating example Let us consider the following function that perform a call only in one branch of a if: define i32 @f(i32 %a, i32 %b) { %tmp = alloca i32, align 4 %tmp2 = icmp slt i32 %a, %b br i1 %tmp2, label %true, label %false true: store i32 %a, i32* %tmp, align 4 %tmp4 = call i32 @doSomething(i32 0, i32* %tmp) br label %false false: %tmp.0 = phi i32 [ %tmp4, %true ], [ %a, %0 ] ret i32 %tmp.0 } On AArch64 this code generates (removing the cfi directives to ease readabilities): _f: ; @f ; BB#0: stp x29, x30, [sp, #-16]! mov x29, sp sub sp, sp, #16 ; =16 cmp w0, w1 b.ge LBB0_2 ; BB#1: ; %true stur w0, [x29, #-4] sub x1, x29, #4 ; =4 mov w0, wzr bl _doSomething LBB0_2: ; %false mov sp, x29 ldp x29, x30, [sp], #16 ret With shrink-wrapping we could generate: _f: ; @f ; BB#0: cmp w0, w1 b.ge LBB0_2 ; BB#1: ; %true stp x29, x30, [sp, #-16]! mov x29, sp sub sp, sp, #16 ; =16 stur w0, [x29, #-4] sub x1, x29, #4 ; =4 mov w0, wzr bl _doSomething add sp, x29, #16 ; =16 ldp x29, x30, [sp], #16 LBB0_2: ; %false ret Therefore, we would pay the overhead of setting up/destroying the frame only if we actually do the call. Proposed Solution This patch introduces a new machine pass that perform the shrink-wrapping analysis (See the comments at the beginning of ShrinkWrap.cpp for more details). It then stores the safe save and restore point into the MachineFrameInfo attached to the MachineFunction. This information is then used by the PrologEpilogInserter (PEI) to place the related code at the right place. This pass runs right before the PEI. Unlike the original paper of Chow from PLDI’88, this implementation of shrink-wrapping does not use expensive data-flow analysis and does not need hack to properly avoid frequently executed point. Instead, it relies on dominance and loop properties. The pass is off by default and each target can opt-in by setting the EnableShrinkWrap boolean to true in their derived class of TargetPassConfig. This setting can also be overwritten on the command line by using -enable-shrink-wrap. Before you try out the pass for your target, make sure you properly fix your emitProlog/emitEpilog/adjustForXXX method to cope with basic blocks that are not necessarily the entry block. Design Decisions 1. ShrinkWrap is its own pass right now. It could frankly be merged into PEI but for debugging and clarity I thought it was best to have its own file. 2. Right now, we only support one save point and one restore point. At some point we can expand this to several save point and restore point, the impacted component would then be: - The pass itself: New algorithm needed. - MachineFrameInfo: Hold a list or set of Save/Restore point instead of one pointer. - PEI: Should loop over the save point and restore point. Anyhow, at least for this first iteration, I do not believe this is interesting to support the complex cases. We should revisit that when we motivating examples. Differential Revision: http://reviews.llvm.org/D9210 <rdar://problem/3201744> llvm-svn: 236507	2015-05-05 17:38:16 +00:00
Jonathan Roelofs	60595b4739	Fix doxygen comment typo. NFC llvm-svn: 236180	2015-04-29 23:33:32 +00:00
Duncan P. N. Exon Smith	09b5c9c24d	IR: Give 'DI' prefix to debug info metadata Finish off PR23080 by renaming the debug info IR constructs from `MD` to `DI`. The last of the `DIDescriptor` classes were deleted in r235356, and the last of the related typedefs removed in r235413, so this has all baked for about a week. Note: If you have out-of-tree code (like a frontend), I recommend that you get everything compiling and tests passing with the previous commit before updating to this one. It'll be easier to keep track of what code is using the `DIDescriptor` hierarchy and what you've already updated, and I think you're extremely unlikely to insert bugs. YMMV of course. Back to this commit: I did this using the rename-md-di-nodes.sh upgrade script I've attached to PR23080 (both code and testcases) and filtered through clang-format-diff.py. I edited the tests for test/Assembler/invalid-generic-debug-node-*.ll by hand since the columns were off-by-three. It should work on your out-of-tree testcases (and code, if you've followed the advice in the previous paragraph). Some of the tests are in badly named files now (e.g., test/Assembler/invalid-mdcompositetype-missing-tag.ll should be 'dicompositetype'); I'll come back and move the files in a follow-up commit. llvm-svn: 236120	2015-04-29 16:38:44 +00:00
Elena Demikhovsky	4068c4fa64	fixed comments, blanks, nullptr; NFC llvm-svn: 236086	2015-04-29 06:49:50 +00:00
Sanjay Patel	877a22a3bc	move IR-level optimization flags into their own struct This is a preliminary step to using the IR-level floating-point fast-math-flags in the SDAG (D8900). In this patch, we introduce the optimization flags as their own struct. As noted in the TODO comment, we should eventually share this data between the IR passes and the backend. We also switch the existing nsw / nuw / exact bit functionality of the BinaryWithFlagsSDNode class to use the new struct. The tradeoff is that instead of using the free but limited space of SDNode's SubclassData, we add a data member to the subclass. This means we don't have to repeat all of the get/set methods per flag, but we're potentially adding size to all nodes of this subclassi type. In practice on 64-bit systems (measured on Linux and MacOS X), there is no size difference between an SDNode and BinaryWithFlagsSDNode after this change: they're both 80 bytes. This means that we had at least one free byte to play with due to struct alignment. Differential Revision: http://reviews.llvm.org/D9325 llvm-svn: 235997	2015-04-28 16:39:12 +00:00
Sergey Dmitrouk	7bfbc12128	Reapply r235977 "[DebugInfo] Add debug locations to constant SD nodes" [DebugInfo] Add debug locations to constant SD nodes This adds debug location to constant nodes of Selection DAG and updates all places that create constants to pass debug locations (see PR13269). Can't guarantee that all locations are correct, but in a lot of cases choice is obvious, so most of them should be. At least all tests pass. Tests for these changes do not cover everything, instead just check it for SDNodes, ARM and AArch64 where it's easy to get incorrect locations on constants. This is not complete fix as FastISel contains workaround for wrong debug locations, which drops locations from instructions on processing constants, but there isn't currently a way to use debug locations from constants there as llvm::Constant doesn't cache it (yet). Although this is a bit different issue, not directly related to these changes. Differential Revision: http://reviews.llvm.org/D9084 llvm-svn: 235989	2015-04-28 14:05:47 +00:00
Daniel Jasper	39180626db	Revert "[DebugInfo] Add debug locations to constant SD nodes" This breaks a test: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/23870 llvm-svn: 235987	2015-04-28 13:38:35 +00:00
Sergey Dmitrouk	01a4dcd3bb	[DebugInfo] Add debug locations to constant SD nodes This adds debug location to constant nodes of Selection DAG and updates all places that create constants to pass debug locations (see PR13269). Can't guarantee that all locations are correct, but in a lot of cases choice is obvious, so most of them should be. At least all tests pass. Tests for these changes do not cover everything, instead just check it for SDNodes, ARM and AArch64 where it's easy to get incorrect locations on constants. This is not complete fix as FastISel contains workaround for wrong debug locations, which drops locations from instructions on processing constants, but there isn't currently a way to use debug locations from constants there as llvm::Constant doesn't cache it (yet). Although this is a bit different issue, not directly related to these changes. Differential Revision: http://reviews.llvm.org/D9084 llvm-svn: 235977	2015-04-28 11:56:37 +00:00
Elena Demikhovsky	901c20e649	Masked gather and scatter: Added code for SelectionDAG. All other patches, including tests will follow. http://reviews.llvm.org/D7665 llvm-svn: 235970	2015-04-28 07:57:37 +00:00
Reid Kleckner	3a59b4e3a9	[SEH] Implement GetExceptionCode in __except blocks This introduces an intrinsic called llvm.eh.exceptioncode. It is lowered by copying the EAX value live into whatever basic block it is called from. Obviously, this only works if you insert it late during codegen, because otherwise mid-level passes might reschedule it. llvm-svn: 235768	2015-04-24 20:25:05 +00:00
Lang Hames	5d79e39b45	[AsmPrinter] Make AsmPrinter's OutStreamer member a unique_ptr. AsmPrinter owns the OutStreamer, so an owning pointer makes sense here. Using a reference for this is crufty. llvm-svn: 235752	2015-04-24 19:11:51 +00:00
Reid Kleckner	40c6671601	Re-commit "[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works" This reverts commit r235617. r235649 should have addressed the problems. llvm-svn: 235667	2015-04-23 23:22:33 +00:00
Reid Kleckner	7a1a1e4e4e	Revert "[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works" We still have some "uses remain after removal" issues in -O0 builds. This reverts commit r235557. llvm-svn: 235617	2015-04-23 18:34:01 +00:00
Reid Kleckner	dbbb7b8ac8	[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works This removes the -sehprepare flag and makes __C_specific_handler functions always to use WinEHPrepare. This was tested by building all of chromium_builder_tests and running a few tests that use SEH, but if something breaks, we can revert this. llvm-svn: 235557	2015-04-22 22:13:09 +00:00
Pete Cooper	553bbedfdb	Change MachineOperand::OpKind from unsigned char to a bitfield. NFC. This causes OpKind and all the bitfields after it to use 32-bit load/stores instead of i24's for the existing bitfields. Bugs will be filed to track whether clang and llvm could have generated the 32-bit operations in the front-end or optimizer. Reviewed by Rafael. llvm-svn: 235528	2015-04-22 17:48:26 +00:00
Duncan P. N. Exon Smith	8911bb4284	DebugInfo: Drop rest of DIDescriptor subclasses Delete the remaining subclasses of (the already deleted) `DIDescriptor`. Part of PR23080. llvm-svn: 235404	2015-04-21 18:44:06 +00:00
Reid Kleckner	3b4014368f	Re-land r235154-r235156 under the existing -sehprepare flag Keep the old SEH fan-in lowering on by default for now, since projects rely on it. This will make it easy to test this change with a simple flag flip. llvm-svn: 235399	2015-04-21 18:23:57 +00:00
Eric Christopher	4995cc0feb	Remove the FCFI option from TargetOptions as it is currently unused. llvm-svn: 235269	2015-04-19 03:21:08 +00:00
Eric Christopher	5f337055c2	Remove CFIFuncName from TargetOptions as it is currently unused. llvm-svn: 235268	2015-04-19 03:21:04 +00:00
Eric Christopher	b2b28a6b63	Remove the CFIEnforcing flag from TargetOptions as it is unused. llvm-svn: 235267	2015-04-19 03:20:59 +00:00
Eric Christopher	853f61980f	Remove the CFIType TargetOption as it is unused. llvm-svn: 235266	2015-04-19 03:20:55 +00:00
Kit Barton	ddc4c0a63d	Add support for v1i128 type. The v1i128 type is needed for the quadword add/substract instructions introduced in POWER8. Futhermore, the PowerPC ABI specifies that parameters of type v1i128 are to be passed in a single vector register, while parameters of type i128 are passed in pairs of GPRs. Thus, it is necessary to be able to differentiate between v1i128 and i128 in LLVM. http://reviews.llvm.org/D8564 llvm-svn: 235198	2015-04-17 16:11:05 +00:00
Nico Weber	ffd0269e17	Revert r235154-r235156, they cause asserts when building win64 code (http://crbug.com/477988 ) llvm-svn: 235170	2015-04-17 09:10:43 +00:00
Reid Kleckner	52ead7bcab	[SEH] Reimplement x64 SEH using WinEHPrepare This now emits simple, unoptimized xdata tables for __C_specific_handler based on the handlers listed in @llvm.eh.actions calls produced by WinEHPrepare. This adds support for running __finally blocks when exceptions are thrown, and removes the old landingpad fan-in codepath. I ran some manual execution tests on small basic test cases with and without optimization, as well as on Chrome base_unittests, which uses a small amount of SEH. I'm sure there are bugs, and we may need to revert. llvm-svn: 235154	2015-04-17 01:01:27 +00:00
Duncan P. N. Exon Smith	380b5bd2b0	DebugInfo: Remove 'inlinedAt:' field from MDLocalVariable Remove 'inlinedAt:' from MDLocalVariable. Besides saving some memory (variables with it seem to be single largest `Metadata` contributer to memory usage right now in -g -flto builds), this stops optimization and backend passes from having to change local variables. The 'inlinedAt:' field was used by the backend in two ways: 1. To tell the backend whether and into what a variable was inlined. 2. To create a unique id for each inlined variable. Instead, rely on the 'inlinedAt:' field of the intrinsic's `!dbg` attachment, and change the DWARF backend to use a typedef called `InlinedVariable` which is `std::pair<MDLocalVariable, MDLocation>`. This `DebugLoc` is already passed reliably through the backend (as verified by r234021). This commit removes the check from r234021, but I added a new check (that will survive) in r235048, and changed the `DIBuilder` API in r235041 to require a `!dbg` attachment whose 'scope:` is in the same `MDSubprogram` as the variable's. If this breaks your out-of-tree testcases, perhaps the script I used (mdlocalvariable-drop-inlinedat.sh) will help; I'll attach it to PR22778 in a moment. llvm-svn: 235050	2015-04-15 22:29:27 +00:00
Krzysztof Parzyszek	3efcf81e03	Allow memory intrinsics to be tail calls llvm-svn: 234764	2015-04-13 17:16:45 +00:00
Benjamin Kramer	70b4ac9a5e	Remove empty non-virtual destructors or mark them =default when non-public These add no value but can make a class non-trivially copyable. NFC. llvm-svn: 234688	2015-04-11 15:32:26 +00:00
Alexander Kornienko	71412ece39	Use 'override/final' instead of 'virtual' for overridden methods The patch is generated using clang-tidy misc-use-override check. This command was used: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py \ -checks='-*,misc-use-override' -header-filter='llvm\|clang' \ -j=32 -fix -format http://reviews.llvm.org/D8925 llvm-svn: 234679	2015-04-11 02:11:45 +00:00
Jingyue Wu	a3bb69f6c6	Divergence analysis for GPU programs Summary: Some optimizations such as jump threading and loop unswitching can negatively affect performance when applied to divergent branches. The divergence analysis added in this patch conservatively estimates which branches in a GPU program can diverge. This information can then help LLVM to run certain optimizations selectively. Test Plan: test/Analysis/DivergenceAnalysis/NVPTX/diverge.ll Reviewers: resistor, hfinkel, eliben, meheff, jholewinski Subscribers: broune, bjarke.roune, madhur13490, tstellarAMD, dberlin, echristo, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D8576 llvm-svn: 234567	2015-04-10 05:03:50 +00:00
David Majnemer	7ed6f9b08e	[WinEHPrepare] Don't rely on the order of IR The IPToState table must be emitted after we have generated labels for all functions in the table. Don't rely on the order of the list of globals. Instead, utilize WinEHFuncInfo to tell us how many catch handlers we expect to outline. Once we know we've visited all the catch handlers, emit the cppxdata. llvm-svn: 234566	2015-04-10 04:56:17 +00:00
Sanjay Patel	25d45a2b18	move helper function closer to use; NFC llvm-svn: 234344	2015-04-07 19:13:06 +00:00
Rafael Espindola	d00b8c898b	Clear the stub map in getSortedStubs. This makes sure they are only output once (and frees a bit of memory). llvm-svn: 234313	2015-04-07 12:59:28 +00:00
Duncan P. N. Exon Smith	f41651ac8a	CodeGen: Stop using DIDescriptor::is*() and auto-casting Same as r234255, but for lib/CodeGen and lib/Target. llvm-svn: 234258	2015-04-06 23:27:40 +00:00
Duncan P. N. Exon Smith	ea3fb29a5a	DebugInfo: Remove DIDescriptor::Verify() Remove `DIDescriptor::Verify()` and the `Verify()`s from subclasses. They had already been gutted, and just did an `isa<>` check. In a couple of cases I've temporarily dropped the check entirely, but subsequent commits are going to disallow conversions to the `DIDescriptor`s directly from `MDNode`, so the checks will come back in another form soon enough. llvm-svn: 234201	2015-04-06 19:49:39 +00:00
Sanjay Patel	c92141d527	remove function/variable names from comments; NFC llvm-svn: 234107	2015-04-04 21:06:39 +00:00
Rafael Espindola	0601e2fab8	Implement unique sections with an unique ID. This allows the compiler/assembly programmer to switch back to a section. This in turn fixes the bootstrap failure on powerpc (tested on gcc110) without changing the ppc codegen at all. I will try to cleanup the various getELFSection overloads in a followup patch. Just using a default argument now would lead to ambiguities. llvm-svn: 234099	2015-04-04 18:02:01 +00:00
David Majnemer	f0e072b4f9	[WinEH] Fill out CatchHigh in the TryBlockMap Now all fields in the WinEH xdata have been filled out. llvm-svn: 234067	2015-04-03 23:37:34 +00:00
David Majnemer	4a823b111e	[WinEH] Fill out .xdata for catch objects This add support for catching an exception such that an exception object available to the catch handler will be initialized by the runtime. llvm-svn: 234062	2015-04-03 22:49:05 +00:00
David Majnemer	694a466675	[WinEH] Sink UnwindHelp completely out of IR We don't need to represent UnwindHelp in IR. Instead, we can use the knowledge that we are emitting the parent function to decide if we should create the UnwindHelp stack object. llvm-svn: 234061	2015-04-03 22:32:26 +00:00
Andrew Kaylor	cd70932cf9	[WinEH] Handle nested landing pads in outlined catch handlers Differential Revision: http://reviews.llvm.org/D8596 llvm-svn: 234041	2015-04-03 19:37:50 +00:00
Duncan P. N. Exon Smith	054ffcf2c3	CodeGen: Assert that inlined-at locations agree As a follow-up to r234021, assert that a debug info intrinsic variable's `MDLocalVariable::getInlinedAt()` always matches the `MDLocation::getInlinedAt()` of its `!dbg` attachment. The goal here is to get rid of `MDLocalVariable::getInlinedAt()` entirely (PR22778), but I'll let these assertions bake for a while first. If you have an out-of-tree backend that just broke, you're probably attaching the wrong `DebugLoc` to a `DBG_VALUE` instruction. The one you want is the location that was attached to the corresponding `@llvm.dbg.declare` or `@llvm.dbg.value` call that you started with. llvm-svn: 234038	2015-04-03 19:20:26 +00:00
Matthias Braun	aa841574c7	MachineRegisterInfo: Make it clear that hints are for vregs llvm-svn: 233986	2015-04-03 00:18:33 +00:00
Philip Reames	b94079ff0c	[gcroot] Remove unused items from an enum These two were never implemented for gcroot, so there's no point in keeping them around now. llvm-svn: 233892	2015-04-02 05:02:16 +00:00
Matthias Braun	599e837694	Remove declarations for nonexistent methods llvm-svn: 233841	2015-04-01 21:16:02 +00:00
David Majnemer	e7ba02b466	[WinEH] Generate .xdata for catch handlers This lets us catch exceptions in simple cases. N.B. Things that do not work include (but are not limited to): - Throwing from within a catch handler. - Catching an object with a named catch parameter. - 'CatchHigh' is fictitious, we aren't sure of its purpose. - We aren't entirely efficient with regards to the number of EH states that we generate. - IP-to-State tables are sensitive to the order of emission. llvm-svn: 233767	2015-03-31 22:35:44 +00:00
Duncan P. N. Exon Smith	f6cf053551	DebugLoc: Remove getFromDILexicalBlock() The only user of `DebugLoc::getFromDILexicalBlock()` was creating a new `MDLocation` as convenient API for passing an `MDScope`. Stop doing that, and remove the API. If in the future we actually want to create new DebugLocs, calling `MDLexicalBlock::get()` makes more sense. llvm-svn: 233643	2015-03-30 23:47:26 +00:00
Duncan P. N. Exon Smith	82ab1b5fc7	LexicalScopes: Use debug info hierarchy pervasively Pervasively use the types provided by the debug info hierarchy rather than `MDNode` in `LexicalScopes`. I noticed (again, I guess, based on comments in the implementation?) that `DILexicalBlockFile::getScope()` returns something different from `DILexicalBlockFile::getContext()`. I created a local helper for getting the same logic from `MDLexicalBlockFile` called `getScopeOfScope()`. I still don't really understand it, but I've added some FIXMEs and I'll come back to it (I suspect the way we encode these objects isn't really ideal). Note that my previous commit r233610 accidentally changed behaviour in `findLexicalScope()` -- it transitioned from a call to `DILexicalBlockFile::getScope()` to `MDLexicalBlockFile::getScope()` (sounds right, doesn't it?) -- so I've fixed that as a drive-by. No tests failed with my error, so it looks like we're missing some coverage here... when I come back to understand the logic, I'll see if I can add some. Other than the fix to `findLexicalScope()`, no functionality change. llvm-svn: 233640	2015-03-30 23:21:21 +00:00
David Majnemer	6b3129357f	[WinEH] Run cleanup handlers when an exception is thrown Generate tables in the .xdata section representing what actions to take when an exception is thrown. This currently fills in state for cleanups, catch handlers are still unfinished. llvm-svn: 233636	2015-03-30 22:58:10 +00:00
Duncan P. N. Exon Smith	10ec79b94f	LexicalScopes: Use MDLocation directly instead of DebugLoc There's no benefit to using `DebugLoc` here. Moreover, this will let a follow-up commit work with `MDScope` directly instead of `DebugLoc`. llvm-svn: 233610	2015-03-30 21:54:46 +00:00
Simon Pilgrim	2751d2eb79	[DAGCombiner] Add support for TRUNCATE + FP_EXTEND vector constant folding This patch adds supports for the vector constant folding of TRUNCATE and FP_EXTEND instructions and tidies up the SINT_TO_FP and UINT_TO_FP instructions to match. It also moves the vector constant folding for the FNEG and FABS instructions to use the DAG.getNode() functionality like the other unary instructions. Differential Revision: http://reviews.llvm.org/D8593 llvm-svn: 233224	2015-03-25 22:30:31 +00:00
Benjamin Kramer	ddeec2a32e	Internalize the StackMapLiveness pass. No need to have its own header when it's not used anywhere. NFC. llvm-svn: 233072	2015-03-24 13:20:54 +00:00
Eric Christopher	95f92de138	Change getISAEncoding to use the target triple to determine thumb-ness similar to the rest of the Module level asm printing infrastructure as debug info finalization happens after the function may be missing. llvm-svn: 232875	2015-03-21 03:13:01 +00:00
Eric Christopher	3aef531a8e	Remove dead calls and function arguments dealing with TRI in StackMaps. llvm-svn: 232847	2015-03-20 21:05:18 +00:00
Eric Christopher	0e8997ccf4	Rewrite StackMap location handling to pre-compute the dwarf register numbers before emission. This removes a dependency on being able to access TRI at the module level and is similar to the DwarfExpression handling. I've modified the debug support into print/dump routines that'll do the same dumping but is now callable anywhere and if TRI isn't available will go ahead and just print out raw register numbers. llvm-svn: 232821	2015-03-20 16:03:42 +00:00
Matthias Braun	7aa4f3c5ea	Do not track subregister liveness when it brings no benefits Some subregisters are only to indicate different access sizes, while not providing any way to actually divide the register up into multiple disjunct parts. Avoid tracking subregister liveness in these cases as it is not beneficial. Differential Revision: http://reviews.llvm.org/D8429 llvm-svn: 232695	2015-03-19 00:21:58 +00:00
NAKAMURA Takumi	37e98c8299	Split comma-separated \param(s). [-Wdocumentation] llvm-svn: 232584	2015-03-18 02:09:25 +00:00
David Majnemer	de51ea1b14	COFF: Let globals with private linkage reside in their own section COFF COMDATs (for selection kinds other than 'select any') require at least one non-section symbol in the symbol table. Satisfy this by morally enhancing the linkage from private to internal. Differential Revision: http://reviews.llvm.org/D8394 llvm-svn: 232570	2015-03-17 23:54:51 +00:00
David Majnemer	a16c93669c	Revert "COFF: Let globals with private linkage reside in their own section" This reverts commit r232539. This was committed accidently. llvm-svn: 232543	2015-03-17 20:41:11 +00:00
David Majnemer	9c4a0b633b	COFF: Let globals with private linkage reside in their own section Summary: COFF COMDATs (for selection kinds other than 'select any') require at least one non-section symbol in the symbol table. Satisfy this by morally enhancing the linkage from private to internal. Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8374 llvm-svn: 232539	2015-03-17 20:39:25 +00:00
Rafael Espindola	fe83cbe6f6	Centralize the handling of unique ids for temporary labels. Before this patch code wanting to create temporary labels for a given entity (function, cu, exception range, etc) had to keep its own counter to have stable symbol names. createTempSymbol would still add a suffix to make sure a new symbol was always returned, but it kept a single counter. Because of that, if we were to use just createTempSymbol("cu_begin"), the label could change from cu_begin42 to cu_begin43 because some other code started using temporary labels. Simplify this by just keeping one counter per prefix and removing the various specialized counters. llvm-svn: 232535	2015-03-17 20:07:06 +00:00
Michael Zolotukhin	16e5c0ceec	TTI: Honour cost model for estimating cost of vector-intrinsic and calls. Review: http://reviews.llvm.org/D8096 llvm-svn: 232528	2015-03-17 19:37:28 +00:00
Michael Zolotukhin	89ad5a256d	TTI: Add getCallInstrCost. Review: http://reviews.llvm.org/D8094 llvm-svn: 232524	2015-03-17 19:26:23 +00:00
Rafael Espindola	334d65b02e	Remove the error prone GetTempSymbol API. llvm-svn: 232487	2015-03-17 15:02:17 +00:00
Rafael Espindola	5657b7ec6c	Make EmitFunctionHeader a private helper. llvm-svn: 232481	2015-03-17 14:38:30 +00:00
Rafael Espindola	205b5558af	Move the EH symbol to the asm printer and use it for the SJLJ case too. llvm-svn: 232475	2015-03-17 13:57:48 +00:00

... 2 3 4 5 6 ...

5321 Commits