llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 05:52:53 +02:00

Author	SHA1	Message	Date
Benjamin Kramer	6083cb3002	BranchFolding: MergePotentialsElt has a total order, just call array_pod_sort. No functionality change intended. llvm-svn: 232217	2015-03-13 21:17:02 +00:00
Daniel Sanders	b2b69459a8	Recommit r232027 with PR22883 fixed: Add infrastructure for support of multiple memory constraints. The operand flag word for ISD::INLINEASM nodes now contains a 15-bit memory constraint ID when the operand kind is Kind_Mem. This constraint ID is a numeric equivalent to the constraint code string and is converted with a target specific hook in TargetLowering. This patch maps all memory constraints to InlineAsm::Constraint_m so there is no functional change at this point. It just proves that using these previously unused bits in the encoding of the flag word doesn't break anything. The next patch will make each target preserve the current mapping of everything to Constraint_m for itself while changing the target independent implementation of the hook to return Constraint_Unknown appropriately. Each target will then be adapted in separate patches to use appropriate Constraint_* values. PR22883 was caused the matching operands copying the whole of the operand flags for the matched operand. This included the constraint id which needed to be replaced with the operand number. This has been fixed with a conversion function. Following on from this, matching operands also used the operand number as the constraint id. This has been fixed by looking up the matched operand and taking it from there. llvm-svn: 232165	2015-03-13 12:45:09 +00:00
Hao Liu	ac40111e96	[MachineCopyPropagation] Fix a bug causing incorrect removal for the instruction sequences as follows %Q5_Q6<def> = COPY %Q2_Q3 %D5<def> = %D3<def> = %D3<def> = COPY %D6 // Incorrectly removed in MachineCopyPropagation Using of %D3 results in incorrect result ... Reviewed in http://reviews.llvm.org/D8242 llvm-svn: 232142	2015-03-13 05:15:23 +00:00
Eric Christopher	51fea81b26	Use the cached subtarget off of the machine function. llvm-svn: 232129	2015-03-13 00:49:50 +00:00
Eric Christopher	f03b492b22	Use the cached subtarget off of the machine function. llvm-svn: 232128	2015-03-13 00:38:19 +00:00
Sanjay Patel	13a9b5db63	[X86, AVX2] Replace inserti128 and extracti128 intrinsics with generic shuffles This should complete the job started in r231794 and continued in r232045: We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. AVX2 introduced proper integer variants of the hacked integer insert/extract C intrinsics that were created for this same functionality with AVX1. This should complete the removal of insert/extract128 intrinsics. The Clang precursor patch for this change was checked in at r232109. llvm-svn: 232120	2015-03-12 23:16:18 +00:00
Eric Christopher	be5bba737e	Remove unused headers. llvm-svn: 232102	2015-03-12 21:04:42 +00:00
Hal Finkel	dc4180d54f	Revert "r232027 - Add infrastructure for support of multiple memory constraints" This (r232027) has caused PR22883; so it seems those bits might be used by something else after all. Reverting until we can figure out what else to do. Original commit message: The operand flag word for ISD::INLINEASM nodes now contains a 15-bit memory constraint ID when the operand kind is Kind_Mem. This constraint ID is a numeric equivalent to the constraint code string and is converted with a target specific hook in TargetLowering. This patch maps all memory constraints to InlineAsm::Constraint_m so there is no functional change at this point. It just proves that using these previously unused bits in the encoding of the flag word doesn't break anything. The next patch will make each target preserve the current mapping of everything to Constraint_m for itself while changing the target independent implementation of the hook to return Constraint_Unknown appropriately. Each target will then be adapted in separate patches to use appropriate Constraint_* values. llvm-svn: 232093	2015-03-12 20:09:39 +00:00
Sanjay Patel	aa9ea9aae7	[X86, AVX] replace vextractf128 intrinsics with generic shuffles Now that we've replaced the vinsertf128 intrinsics, do the same for their extract twins. This is very much like D8086 (checked in at r231794): We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is also the LLVM sibling to the cfe D8275 patch. Differential Revision: http://reviews.llvm.org/D8276 llvm-svn: 232045	2015-03-12 15:15:19 +00:00
Daniel Sanders	4eee6f840d	Add infrastructure for support of multiple memory constraints. Summary: The operand flag word for ISD::INLINEASM nodes now contains a 15-bit memory constraint ID when the operand kind is Kind_Mem. This constraint ID is a numeric equivalent to the constraint code string and is converted with a target specific hook in TargetLowering. This patch maps all memory constraints to InlineAsm::Constraint_m so there is no functional change at this point. It just proves that using these previously unused bits in the encoding of the flag word doesn't break anything. The next patch will make each target preserve the current mapping of everything to Constraint_m for itself while changing the target independent implementation of the hook to return Constraint_Unknown appropriately. Each target will then be adapted in separate patches to use appropriate Constraint_* values. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D8171 llvm-svn: 232027	2015-03-12 11:00:48 +00:00
Jingyue Wu	764ac5a7c6	[NVPTXAsmPrinter] do not print .align on function headers Summary: PTX does not allow .align directives on function headers. Fixes PR21551. Test Plan: test/Codegen/NVPTX/function-align.ll Reviewers: eliben, jholewinski Reviewed By: eliben, jholewinski Subscribers: llvm-commits, eliben, jpienaar, jholewinski Differential Revision: http://reviews.llvm.org/D8274 llvm-svn: 232004	2015-03-12 01:50:30 +00:00
Reid Kleckner	ae151db0a9	Make llvm.eh.actions an intrinsic and add docs for it These docs don't match the way WinEHPrepare uses them yet, and verifier support isn't implemented either. The implementation will come after the documentation text is reviewed and agreed upon. llvm-svn: 232003	2015-03-12 01:45:37 +00:00
Reid Kleckner	997aab28ac	Stop calling DwarfEHPrepare from WinEHPrepare Instead, run both EH preparation passes, and have them both ignore functions with unrecognized EH personalities. Pass delegation involved some hacky code for creating an AnalysisResolver that we don't need now. llvm-svn: 231995	2015-03-12 00:36:20 +00:00
Mehdi Amini	94c8770ed5	Move the DataLayout to the generic TargetMachine, making it mandatory. Summary: I don't know why every singled backend had to redeclare its own DataLayout. There was a virtual getDataLayout() on the common base TargetMachine, the default implementation returned nullptr. It was not clear from this that we could assume at call site that a DataLayout will be available with each Target. Now getDataLayout() is no longer virtual and return a pointer to the DataLayout member of the common base TargetMachine. I plan to turn it into a reference in a future patch. The only backend that didn't have a DataLayout previsouly was the CPPBackend. It now initializes the default DataLayout. This commit is NFC for all the other backends. Test Plan: clang+llvm ninja check-all Reviewers: echristo Subscribers: jfb, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D8243 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231987	2015-03-12 00:07:24 +00:00
Reid Kleckner	49cd81f1f5	Fix some clang warnings in WinEHPrepare llvm-svn: 231985	2015-03-11 23:39:36 +00:00
Reid Kleckner	789a64e429	Handle big index in getelementptr instruction CodeGen incorrectly ignores (assert from APInt) constant index bigger than 2^64 in getelementptr instruction. This is a test and fix for that. Patch by Paweł Bylica! Reviewed By: rnk Subscribers: majnemer, rnk, mcrosier, resistor, llvm-commits Differential Revision: http://reviews.llvm.org/D8219 llvm-svn: 231984	2015-03-11 23:36:10 +00:00
Andrew Kaylor	6bed959c78	Extended support for native Windows C++ EH outlining Differential Review: http://reviews.llvm.org/D7886 llvm-svn: 231981	2015-03-11 23:22:06 +00:00
Eric Christopher	b5d16a61c0	Remove useMachineScheduler and replace it with subtarget options that control, individually, all of the disparate things it was controlling. At the same time move a FIXME in the Hexagon port to a new subtarget function that will enable a user of the machine scheduler to avoid using the source scheduler for pre-RA-scheduling. The FIXME would have this removed, but involves either testcase changes or adding -pre-RA-sched=source to a few testcases. llvm-svn: 231980	2015-03-11 22:56:10 +00:00
Eric Christopher	7e02765bdf	Have getCallPreservedMask and getThisCallPreservedMask take a MachineFunction argument so that we can grab subtarget specific features off of it. llvm-svn: 231979	2015-03-11 22:42:13 +00:00
Rafael Espindola	268252825b	Put jump tables in unique sections on COFF. If a function is going in an unique section (because of -ffunction-sections for example), putting a jump table in .rodata will keep .rodata alive and that will keep alive any other function that also has a jump table. Instead, put the jump table in a unique section that is associated with the function. llvm-svn: 231961	2015-03-11 19:58:37 +00:00
Eric Christopher	2925153548	Have getRegPressureSetLimit take a MachineFunction so that a we can inspect the subtarget and function when computing values. llvm-svn: 231951	2015-03-11 18:34:58 +00:00
Rafael Espindola	6932b2f8e9	Print section start labels when first switching to the section. This is less brittle and avoids polluting the start of the file with every debug section. llvm-svn: 231898	2015-03-11 00:51:37 +00:00
Eric Christopher	91c4e41987	Have TargetRegisterInfo::getLargestLegalSuperClass take a MachineFunction argument so that it can look up the subtarget rather than using a cached one in some Targets. llvm-svn: 231888	2015-03-10 23:46:01 +00:00
Paul Robinson	ff7c900586	Emit correct linkage-name attribute based on DWARF version. There are still 4 tests that check for DW_AT_MIPS_linkage_name, because they specify DWARF 2 or 3 in the module metadata. So, I didn't create an explicit version-based test for the attribute. Differential Revision: http://reviews.llvm.org/D8227 llvm-svn: 231880	2015-03-10 22:44:45 +00:00
Eric Christopher	db29a2f01c	Remove the use of the subtarget in MCCodeEmitter creation and update all ports accordingly. Required a couple of small rewrites in handling subtarget features during creation in PPC. llvm-svn: 231861	2015-03-10 22:03:14 +00:00
Rafael Espindola	96f9d3dd7f	Create symbols marking the start of a section earlier. This lets us pass the symbol to the constructor and avoid the mutable field. This also opens the way for outputting the symbol only when needed, instead of outputting them at the start of the file. llvm-svn: 231859	2015-03-10 22:00:25 +00:00
Quentin Colombet	b7df98faee	[CodeGenPrepare] Refine the cost model provided by the promotion helper. - Use TargetLowering to check for the actual cost of each extension. - Provide a factorized method to check for the cost of an extension: TargetLowering::isExtFree. - Provide a virtual method TargetLowering::isExtFreeImpl for targets to be able to tune the cost of non-free extensions. This refactoring offers a better granularity to model what really happens on different targets. No performance changes and very few code differences. Part of <rdar://problem/19267165> llvm-svn: 231855	2015-03-10 21:48:15 +00:00
Chad Rosier	dad3b0ec71	Don't evaluate rend() on every iteration of the loop. llvm-svn: 231837	2015-03-10 20:29:59 +00:00
Bruno Cardoso Lopes	e8b8714ddf	[AsmPrinter][TLOF] Reintroduce AArch64 test Follow up from r231505. Fix the non-determinism by using a MapVector and reintroduce the AArch64 testcase. Defer deleting the got candidates up to the end and remove them in a bulk, avoiding linear time removal of each element. Thanks to Renato Golin for trying it out on other platforms. llvm-svn: 231830	2015-03-10 20:05:23 +00:00
Adrian Prantl	aa7fb527c3	Change the datatype of DwarfExpression::Emit(Un)Signed to (u)int64_t so it matches the one used by ByteStreamer::Emit(U\|S)LEB128. llvm-svn: 231823	2015-03-10 19:23:37 +00:00
Rafael Espindola	074f6e2b72	Store an optional section start label in MCSection. This makes code that uses section relative expressions (debug info) simpler and less brittle. This is still a bit awkward as the symbol is created late and has to be stored in a mutable field. I will move the symbol creation earlier in the next patch. llvm-svn: 231802	2015-03-10 16:58:10 +00:00
Igor Laevsky	ec23b1a840	Teach lowering to correctly handle invoke statepoint and gc results tied to them. Note that we still can not lower gc.relocates for invoke statepoints. Also it extracts getCopyFromRegs helper function in SelectionDAGBuilder as we need to be able to customize type of the register exported from basic block during lowering of the gc.result. (Resubmitting this change after not being able to reproduce buildbot failure) Differential Revision: http://reviews.llvm.org/D7760 llvm-svn: 231800	2015-03-10 16:26:48 +00:00
Chad Rosier	edcfbac252	[BranchFolding] Remove MMOs during tail merge to preserve dependencies. When tail merging it may be necessary to remove MMOs from memory operations to ensures later passes (e.g., MI sched) conservatively compute dependencies. Currently, we only remove the MMO from the common tail if the MMO doesn't match with the relative instruction in the non-common tail(s). A more robust solution would be to add multiple MMOs from the duplicate MIs to the new MI. Currently ScheduleDAGInstrs.cpp ignores all MMOs on instructions with multiple MMOs, so this solution is equivalent for the time being. No test case included as this is incredibly difficult to reproduce. Patch was a collaborative effort between Ana Pazos and myself. Phabricator: http://reviews.llvm.org/D7769 llvm-svn: 231799	2015-03-10 16:22:52 +00:00
Sanjay Patel	5c62e16cdb	[X86, AVX] replace vinsertf128 intrinsics with generic shuffles We want to replace as much custom x86 shuffling via intrinsics as possible because pushing the code down the generic shuffle optimization path allows for better codegen and less complexity in LLVM. This is the sibling patch for the Clang half of this change: http://reviews.llvm.org/D8088 Differential Revision: http://reviews.llvm.org/D8086 llvm-svn: 231794	2015-03-10 16:08:36 +00:00
Rafael Espindola	8bcb85a7ac	Remove effectively dead code. Switching back and forth between sections does nothing (other than producing larger .s files). llvm-svn: 231790	2015-03-10 14:48:01 +00:00
Daniel Sanders	b0ddb75c07	The operand flag word used in ISD::INLINEASM is an i32 not a pointer. NFC. Summary: This is part of the work to support memory constraints that behave differently to 'm'. The subsequent patches will expand on the existing encoding (which is a 32-bit int) and as a result in some flag words will no longer fit into an i16. This problem only affected the MSP430 target which appears to have 16-bit pointers. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D8168 llvm-svn: 231783	2015-03-10 10:42:59 +00:00
Rafael Espindola	cd36ff5123	Move variable into assert to fix -Asserts builds. llvm-svn: 231753	2015-03-10 04:28:09 +00:00
Rafael Espindola	b021663a47	Remove incredibly confusing isBaseAddressKnownZero. When referring to a symbol in a dwarf section on ELF we should use .long foo instead of .long foo - .debug_something because ELF is unaware of the content of the sections and therefore needs relocations. This has nothing to do with optimizing a -0. llvm-svn: 231751	2015-03-10 04:11:52 +00:00
Rafael Espindola	9f3b858c2b	Use a better name for compile unit labels. They mark the start of a compile unit, so name them .Lcu_*. Using Section->getLabelBeginName() makes it looks like they mark the start of the section. While at it, switch to createTempSymbol to avoid collisions with labels created in inline assembly. Not sure if a "don't crash" test is worth it. With this getLabelBeginName is dead, delete it. llvm-svn: 231750	2015-03-10 03:58:36 +00:00
Frederic Riss	1f956a6735	DwarfAccelTable: remove unneeded bucket terminators. Last commit fixed the handling of hash collisions, but it introdcuced unneeded bucket terminators in some places. The generated table was correct, it can just be a tiny bit smaller. As the previous table was correct, the test doesn't need updating. If we really wanted to test this, I could add the section size to the dwarf dump and test for a precise value there. IMO the correctness test is sufficient. llvm-svn: 231748	2015-03-10 03:47:55 +00:00
Rafael Espindola	240f1f5b6b	Move label creation close to emission. NFC. llvm-svn: 231744	2015-03-10 03:11:11 +00:00
Mehdi Amini	f88efe5f8a	DataLayout is mandatory, update the API to reflect it with references. Summary: Now that the DataLayout is a mandatory part of the module, let's start cleaning the codebase. This patch is a first attempt at doing that. This patch is not exactly NFC as for instance some places were passing a nullptr instead of the DataLayout, possibly just because there was a default value on the DataLayout argument to many functions in the API. Even though it is not purely NFC, there is no change in the validation. I turned as many pointer to DataLayout to references, this helped figuring out all the places where a nullptr could come up. I had initially a local version of this patch broken into over 30 independant, commits but some later commit were cleaning the API and touching part of the code modified in the previous commits, so it seemed cleaner without the intermediate state. Test Plan: Reviewers: echristo Subscribers: llvm-commits From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231740	2015-03-10 02:37:25 +00:00
Frederic Riss	9c6539e46b	DwarfAccelTable: Fix handling of hash collisions. It turns out accelerator tables where totally broken if they contained entries with colliding hashes. The failure mode is pretty bad, as it not only impacted the colliding entries, but would basically make all the entries after the first hash collision pointing in the wrong place. The testcase uses the symbol names that where found to collide during a clang build. From a performance point of view, the patch adds a sort and a linear walk over each bucket contents. While it has a measurable impact on the accelerator table emission, it's not showing up significantly in clang profiles (and I'd argue that correctness is priceless :-)). llvm-svn: 231732	2015-03-10 00:46:31 +00:00
Ahmed Bougacha	c37ed67e4a	[CodeGen] Replace the reused stores' chain for extractelt expansion. This fixes a subtle issue that was introduced in r205153. When reusing a store for the extractelement expansion (to load directly from it, inserting of going through the stack), later stores to the same location might have overwritten the data we were expecting to extract from. To fix that, we need to explicitly replace the chain going out of the reused store, so that later stores also have an explicit dependency on the generated element-extracting loads, and can't clobber them. rdar://20066785 Differential Revision: http://reviews.llvm.org/D8180 llvm-svn: 231721	2015-03-09 22:51:05 +00:00
Reid Kleckner	dd109017af	Reland r229944: EH: Prune unreachable resume instructions during Dwarf EH preparation Fix the double-deletion of AnalysisResolver when delegating through to Dwarf EH preparation by creating one from scratch. Hopefully the new pass manager simplifies this. This reverts commit r229952. llvm-svn: 231719	2015-03-09 22:45:16 +00:00
Rafael Espindola	bc2dad2e08	Use a MapVector instead of an extra sort. This also has the advantage of not depending on the brittle getLabelBeginName. llvm-svn: 231714	2015-03-09 22:08:37 +00:00
Frederic Riss	b9251b7d03	DwarfAccelTable: fix obvious typo. I have a test for that issue, but I didn't include it in the commit as it's a 200KB file for a pretty minor issue. (The reason the file is so big is that it needs > 1024 variables/functions to trigger and that with debug information. The issue/fix on the other side is totally trivial. If poeple want the test commited, I can do that. It just didn't seem worth it to me. llvm-svn: 231701	2015-03-09 21:09:50 +00:00
Rafael Espindola	bfecc2407c	Don't prime the section map. This was just creating unused labels for .text when the module had no functions. llvm-svn: 231694	2015-03-09 20:09:58 +00:00
Rafael Espindola	90d175f38e	Print jump tables before exception tables. In the case where just tables are part of the function section, this produces more readable assembly by avoiding switching to the eh section and back to .text. This would also break with non unique section names, as trying to switch to a unique section actually creates a new one. llvm-svn: 231677	2015-03-09 18:29:12 +00:00
Rafael Espindola	f97fea3045	Don't repeat name in comment. NFC. llvm-svn: 231676	2015-03-09 18:11:42 +00:00
Rafael Espindola	99f144e698	Remove dummy method implementations. These are pure virtual in the base class, so the compiler checks that they are implemented. llvm-svn: 231673	2015-03-09 17:58:49 +00:00
David Blaikie	350b9cbf65	Simplify expressions involving boolean constants with clang-tidy Patch by Richard (legalize at xmission dot com). Differential Revision: http://reviews.llvm.org/D8154 llvm-svn: 231617	2015-03-09 01:57:13 +00:00
Benjamin Kramer	385eaa614d	Make static variables const if possible. Makes them go into a read-only section. Or fold them into a initializer list which has the same effect. NFC. llvm-svn: 231598	2015-03-08 16:07:39 +00:00
Simon Pilgrim	60d1ada777	[DAGCombiner] Add a shuffle mask commutation helper function. NFCI. We have an increasing number of cases where we are creating commuted shuffle masks - all implementing nearly the same code. This patch adds a static helper function - ShuffleVectorSDNode::commuteMask() and replaces a number of cases to use it. Differential Revision: http://reviews.llvm.org/D8139 llvm-svn: 231581	2015-03-07 22:33:11 +00:00
Benjamin Kramer	38504f768a	Make constant arrays that are passed to functions as const. In theory this allows the compiler to skip materializing the array on the stack. In practice clang often fails to do that, but that's a different story. NFC. llvm-svn: 231571	2015-03-07 17:41:00 +00:00
Simon Pilgrim	d85428f7b8	Use SDValue bool check to tidyup some possible combines. NFC. llvm-svn: 231569	2015-03-07 16:34:55 +00:00
Andrea Di Biagio	7cad5c0bec	[DAGCombiner] Fix wrong folding of AND dag nodes. This patch fixes the logic in the DAGCombiner that folds an AND node according to rule: (and (X (load V)), C) -> (X (load V)) An AND between a vector load 'X' and a constant build_vector 'C' can be folded into the load itself only if we can prove that the AND operation is redundant. The algorithm implemented by 'visitAND' firstly computes the splat value 'S' from C, and then checks if S has the lower 'B' bits set (where B is the size in bits of the vector element type). The algorithm takes into account also the 'undef' bits in the splat mask. Unfortunately, the algorithm only worked under the assumption that the size of S is a multiple of the vector element type. With this patch, we conservatively avoid folding the AND if the splat bits are not compatible with the vector element type. Added X86 test and-load-fold.ll Differential Revision: http://reviews.llvm.org/D8085 llvm-svn: 231563	2015-03-07 12:24:55 +00:00
Simon Pilgrim	ce7343d964	[DAGCombiner] SCALAR_TO_VECTOR(EXTRACT_VECTOR_ELT(V,C)) -> VECTOR_SHUFFLE This patch attempts to convert a SCALAR_TO_VECTOR using an operand from an EXTRACT_VECTOR_ELT into a VECTOR_SHUFFLE. This prevents many cases of spilling scalar data between the gpr + simd registers. At present the optimization only accepts cases where there is no TRUNC of the scalar type (i.e. all types must match). Differential Revision: http://reviews.llvm.org/D8132 llvm-svn: 231554	2015-03-07 05:52:42 +00:00
Matthias Braun	47028079f1	DAGCombiner: Canonicalize select(and/or,x,y) depending on target. This is based on the following equivalences: select(C0 & C1, X, Y) <=> select(C0, select(C1, X, Y), Y) select(C0 \| C1, X, Y) <=> select(C0, X, select(C1, X, Y)) Many target cannot perform and/or on the CPU flags and therefore the right side should be choosen to avoid materializign the i1 flags in an integer register. If the target can perform this operation efficiently we normalize to the left form. Differential Revision: http://reviews.llvm.org/D7622 llvm-svn: 231507	2015-03-06 19:49:10 +00:00
Matthias Braun	b345d38012	DAGCombiner: Factor out some and/or combines. This is in preparation for changing visitSELECT to normalize towards select(Cond0, select(Cond1, X, Y), Y); select(Cond0, X, select(Cond1, X, Y)) which perfom an implicit and/or of the conditions. The factored function contains all DAGCombine rules which reduce two values combined by an And/Or operation to a single value. This does not include rules involving constants as visitSELECT already handles that case. Differential Revision: http://reviews.llvm.org/D8026 llvm-svn: 231506	2015-03-06 19:49:06 +00:00
Matthias Braun	47efbcf97f	ExecutionDepsFix: Indizes -> Indices. Translate german to english. llvm-svn: 231500	2015-03-06 18:56:20 +00:00
Eric Christopher	cc13212eb1	Fix typo. llvm-svn: 231495	2015-03-06 18:20:23 +00:00
Bruno Cardoso Lopes	f5e7d40f2d	[AsmPrinter][TLOF] 32-bit MachO support for replacing GOT equivalents Add MachO 32-bit (i.e. arm and x86) support for replacing global GOT equivalent symbol accesses. Unlike 64-bit targets, there's no GOTPCREL relocation, and access through a non_lazy_symbol_pointers section is used instead. -- before _extgotequiv: .long _extfoo _delta: .long _extgotequiv-_delta -- after _delta: .long L_extfoo$non_lazy_ptr-_delta .section __IMPORT,__pointers,non_lazy_symbol_pointers L_extfoo$non_lazy_ptr: .indirect_symbol _extfoo .long 0 llvm-svn: 231475	2015-03-06 13:49:05 +00:00
Bruno Cardoso Lopes	c84d60c12f	[AsmPrinter][TLOF] ARM64 MachO support for replacing GOT equivalents Follow up r230264 and add ARM64 support for replacing global GOT equivalent symbol accesses by references to the GOT entry for the final symbol instead, example: -- before .globl _foo _foo: .long 42 .globl _gotequivalent _gotequivalent: .quad _foo .globl _delta _delta: .long _gotequivalent-_delta -- after .globl _foo _foo: .long 42 .globl _delta Ltmp3: .long _foo@GOT-Ltmp3 llvm-svn: 231474	2015-03-06 13:48:45 +00:00
Michael Zolotukhin	b7a87300a2	LegalizeTypes: Handle shift by 0 in ExpandShiftByConstant. Though such shifts are usually optimized away by combiner, we still can encounter them after a vector shift is legalized. llvm-svn: 231443	2015-03-06 01:13:01 +00:00
Benjamin Kramer	cfe741989e	SelectionDAGBuilder: Merge 3 copies of the limited precision exp2 emission code. NFC intended. llvm-svn: 231406	2015-03-05 21:13:08 +00:00
Andrew Kaylor	4692180b1d	Fix uninitialized memory references in WinEHPrepare llvm-svn: 231405	2015-03-05 21:06:42 +00:00
Benjamin Kramer	288ff9cec2	SDAG: Merge the meat of two ExpandAtomic implementations. The copies already diverged, don't let them become any worse. Reduce redundancy in code with a little macro metaprogramming. llvm-svn: 231401	2015-03-05 20:04:29 +00:00
Rafael Espindola	0acd6bd828	Use the correct func begin symbol in all places in ppc. I missed an occurrence of the old symbol in my previous patch. llvm-svn: 231398	2015-03-05 19:47:50 +00:00
Rafael Espindola	22f2e68861	Use the generic Lfunc_begin label on ppc. This removes yet another custom label to mark the start of a function. llvm-svn: 231390	2015-03-05 18:55:50 +00:00
David Majnemer	bdce8557da	X86: Optimize address mode matching for FRAME_ALLOC_RECOVER nodes We know that the absolute symbol will be less than 2GB and thus will always fit. llvm-svn: 231389	2015-03-05 18:50:12 +00:00
Reid Kleckner	d0e0d012a0	Replace llvm.frameallocate with llvm.frameescape Turns out it's pretty straightforward and simplifies the implementation. Reviewers: andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D8051 llvm-svn: 231386	2015-03-05 18:26:34 +00:00
Simon Pilgrim	2d456bc887	[DagCombiner] Allow shuffles to merge through bitcasts Currently shuffles may only be combined if they are of the same type, despite the fact that bitcasts are often introduced in between shuffle nodes (e.g. x86 shuffle type widening). This patch allows a single input shuffle to peek through bitcasts and if the input is another shuffle will merge them, shuffling using the smallest sized type, and re-applying the bitcasts at the inputs and output instead. Dropped old ShuffleToZext test - this patch removes the use of the zext and vector-zext.ll covers these anyhow. Differential Revision: http://reviews.llvm.org/D7939 llvm-svn: 231380	2015-03-05 17:14:04 +00:00
Igor Laevsky	26ad396d2f	Revert change r231366 as it broke clang-native-arm-cortex-a9 Analysis/properties.m test. llvm-svn: 231374	2015-03-05 15:41:14 +00:00
Elena Demikhovsky	a71b2e475e	AVX-512, SKX: Enabled masked_load/store operations for this target. Added lowering for ISD::CONCAT_VECTORS and ISD::INSERT_SUBVECTOR for i1 vectors, it is needed to pass all masked_memop.ll tests for SKX. llvm-svn: 231371	2015-03-05 15:11:35 +00:00
Igor Laevsky	eb9f695bc2	Teach lowering to correctly handle invoke statepoint and gc results tied to them. Note that we still can not lower gc.relocates for invoke statepoints. Also it extracts getCopyFromRegs helper function in SelectionDAGBuilder as we need to be able to customize type of the register exported from basic block during lowering of the gc.result. llvm-svn: 231366	2015-03-05 14:11:21 +00:00
Arnaud A. de Grandmaison	2068314641	[PBQP] Use a local bit-matrix to speedup searching an edge in the graph. Build time (user time) for building llvm+clang+lldb in release mode: - default allocator: 9086 seconds - with PBQP: 9126 seconds - with PBQP + local bit matrix cache: 9097 seconds llvm-svn: 231360	2015-03-05 09:12:59 +00:00
Frederic Riss	04bc89f177	Remove useless break after return. Pointed out by Paul Robinson. llvm-svn: 231353	2015-03-05 06:13:39 +00:00
Chandler Carruth	5fb5d9fc63	[MBP] Use range based for-loops throughout this code. Several had already been added and the inconsistency made choosing names and changing code more annoying. Plus, wow are they better for this code! llvm-svn: 231347	2015-03-05 03:19:05 +00:00
Chandler Carruth	57083c3412	[MBP] NFC, run clang-format over this code and tweak things to make the result reasonable. This code predated clang-format and so there was a reasonable amount of crufty formatting that had accumulated. This should ensure that neither myself nor others end up with formatting-only changes sneaking into other fixes. llvm-svn: 231341	2015-03-05 02:35:31 +00:00
Chandler Carruth	e53763f252	[MBP] This is no longer 'block-placement2'. ;] The old variants are long gone, update this code to reflect that. llvm-svn: 231340	2015-03-05 02:28:25 +00:00
Rafael Espindola	d1ab81bec9	Use the existing begin and end symbol for debug info. llvm-svn: 231338	2015-03-05 02:05:42 +00:00
Chandler Carruth	9b555d5159	[MBP] Revert r231238 which attempted to fix a nasty bug where MBP is just arbitrarily interleaving unrelated control flows once they get moved "out-of-line" (both outside of natural CFG ordering and with diamonds that cannot be fully laid out by chaining fallthrough edges). This easy solution doesn't work in practice, and it isn't just a small bug. It looks like a very different strategy will be required. I'm working on that now, and it'll again go behind some flag so that everyone can experiment and make sure it is working well for them. llvm-svn: 231332	2015-03-05 01:07:03 +00:00
Paul Robinson	0ccc20c06e	Turn off .debug_pubnames/pubtypes for PS4. Differential Revision: http://reviews.llvm.org/D8067 llvm-svn: 231322	2015-03-05 00:08:27 +00:00
Frederic Riss	de8a85d593	Teach DIEInteger to emit FORM_strp and FORM_ref_addr attributes. To be used/tested by llvm-dsymutil. (llvm-dsymutil does a 'static' link, no need for relocations for most things, so it'll just emit raw integers for most attributes) llvm-svn: 231298	2015-03-04 22:07:36 +00:00
Paul Robinson	982e6ad0ef	Support standard DWARF TLS opcode; Darwin and PS4 use it. Differential Revision: http://reviews.llvm.org/D8018 llvm-svn: 231286	2015-03-04 20:55:11 +00:00
Mehdi Amini	29ebc2d39f	Make DataLayout Non-Optional in the Module Summary: DataLayout keeps the string used for its creation. As a side effect it is no longer needed in the Module. This is "almost" NFC, the string is no longer canonicalized, you can't rely on two "equals" DataLayout having the same string returned by getStringRepresentation(). Get rid of DataLayoutPass: the DataLayout is in the Module The DataLayout is "per-module", let's enforce this by not duplicating it more than necessary. One more step toward non-optionality of the DataLayout in the module. Make DataLayout Non-Optional in the Module Module->getDataLayout() will never returns nullptr anymore. Reviewers: echristo Subscribers: resistor, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D7992 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231270	2015-03-04 18:43:29 +00:00
Wei Mi	74dfd60327	Revert the test commit. llvm-svn: 231264	2015-03-04 17:44:22 +00:00
Wei Mi	70f53737d0	Test commit. It will be reverted in the next commit. llvm-svn: 231262	2015-03-04 17:41:17 +00:00
Adrian Prantl	0e7554ffc9	Fix DwarfExpression::AddMachineRegExpression so it doesn't read past the end of an expression that ends with DW_OP_plus. Caught by the ASAN build bots. llvm-svn: 231260	2015-03-04 17:39:33 +00:00
JF Bastien	0cecbf8a42	Mutate TargetLowering::shouldExpandAtomicRMWInIR to specifically dictate how AtomicRMWInsts are expanded. Summary: In PNaCl, most atomic instructions have their own @llvm.nacl.atomic.* function, each one, with a few exceptions, represents a consistent behaviour across all NaCl-supported targets. Unfortunately, the atomic RMW operations nand, [u]min, and [u]max aren't directly represented by any such @llvm.nacl.atomic.* function. This patch refines shouldExpandAtomicRMWInIR in TargetLowering so that a future `Le32TargetLowering` class can selectively inform the caller how the target desires the atomic RMW instruction to be expanded (ie via load-linked/store-conditional for ARM/AArch64, via cmpxchg for X86/others?, or not at all for Mips) if at all. This does not represent a behavioural change and as such no tests were added. Patch by: Richard Diamond. Reviewers: jfb Reviewed By: jfb Subscribers: jfb, aemerson, t.p.northover, llvm-commits Differential Revision: http://reviews.llvm.org/D7713 llvm-svn: 231250	2015-03-04 15:47:57 +00:00
Chandler Carruth	0fe0191603	[MBP] Fix a really horrible bug in MachineBlockPlacement, but behind a flag for now. First off, thanks to Daniel Jasper for really pointing out the issue here. It's been here forever (at least, I think it was there when I first wrote this code) without getting really noticed or fixed. The key problem is what happens when two reasonably common patterns happen at the same time: we outline multiple cold regions of code, and those regions in turn have diamonds or other CFGs for which we can't just topologically lay them out. Consider some C code that looks like: if (a1()) { if (b1()) c1(); else d1(); f1(); } if (a2()) { if (b2()) c2(); else d2(); f2(); } done(); Now consider the case where a1() and a2() are unlikely to be true. In that case, we might lay out the first part of the function like: a1, a2, done; And then we will be out of successors in which to build the chain. We go to find the best block to continue the chain with, which is perfectly reasonable here, and find "b1" let's say. Laying out successors gets us to: a1, a2, done; b1, c1; At this point, we will refuse to lay out the successor to c1 (f1) because there are still un-placed predecessors of f1 and we want to try to preserve the CFG structure. So we go get the next best block, d1. ... wait for it ... Except that the next best block isn't d1. It is b2! d1 is waaay down inside these conditionals. It is much less important than b2. Except that this is exactly what we didn't want. If we keep going we get the entire set of the rest of the CFG interleaved!!! a1, a2, done; b1, c1; b2, c2; d1, f1; d2, f2; So we clearly need a better strategy here. =] My current favorite strategy is to actually try to place the block whose predecessor is closest. This very simply ensures that we unwind these kinds of CFGs the way that is natural and fitting, and should minimize the number of cache lines instructions are spread across. It also happens to be dead simple. It's like the datastructure was specifically set up for this use case or something. We only push blocks onto the work list when the last predecessor for them is placed into the chain. So the back of the worklist is the nearest next block. Unfortunately, a change like this is going to cause soooo many benchmarks to swing wildly. So for now I'm adding this under a flag so that we and others can validate that this is fixing the problems described, that it seems possible to enable, and hopefully that it fixes more of our problems long term. llvm-svn: 231238	2015-03-04 12:18:08 +00:00
Daniel Jasper	3a2dd1b264	Add a flag to experiment with outlining optional branches. In a CFG with the edges A->B->C and A->C, B is an optional branch. LLVM's default behavior is to lay the blocks out naturally, i.e. A, B, C, in order to improve code locality and fallthroughs. However, if a function contains many of those optional branches only a few of which are taken, this leads to a lot of unnecessary icache misses. Moving B out of line can work around this. Review: http://reviews.llvm.org/D7719 llvm-svn: 231230	2015-03-04 11:05:34 +00:00
Michael Kuperstein	c46a58c60e	[DAGCombine] Fix a bug in a BUILD_VECTOR combine When trying to convert a BUILD_VECTOR into a shuffle, we try to split a single source vector that is twice as wide as the destination vector. We can not do this when we also need the zero vector to create a blend. This fixes PR22774. Differential Revision: http://reviews.llvm.org/D8040 llvm-svn: 231219	2015-03-04 07:27:39 +00:00
Frederic Riss	ff6a5cb9e4	Move emitDIE and emitAbbrevs to AsmPrinter. NFC. (They are called emitDwarfDIE and emitDwarfAbbrevs in their new home) llvm-dsymutil wants to reuse that code, but it doesn't have a DwarfUnit or a DwarfDebug object to call those. It has access to an AsmPrinter though. Having emitDIE in the AsmPrinter also removes the DwarfFile dependency on DwarfDebug, and thus the patch drops that field. Differential Revision: http://reviews.llvm.org/D8024 llvm-svn: 231210	2015-03-04 02:30:17 +00:00
Frederic Riss	ddd86366cc	Constify AsmPrinter passed to DIE methods. llvm-svn: 231209	2015-03-04 02:30:08 +00:00
Mehdi Amini	62f77ca5f4	Use report_fatal_error instead of unreachable for -fast-isel-abort Suggestion by Andrea Di Biagio From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231201	2015-03-04 01:48:39 +00:00
Rafael Espindola	eb2f517737	Use the vanilla func_end symbol for .size. No need to create yet another temp symbol. llvm-svn: 231198	2015-03-04 01:35:23 +00:00
David Blaikie	ffa6b17b59	Recommit r231168: unique_ptrify LiveRange::segmentSet GCC 4.7's libstdc++ doesn't have std::map::emplace, but it does have std::unordered_map::emplace, and the use case here doesn't appear to need ordering. The container has been changed in a separate/precursor patch, and now this patch should hopefully build cleanly even with GCC 4.7. & then I realized the order of the container did matter, so extra handling of ordering was added in r231189. Original commit message: This makes LiveRange non-copyable, and LiveInterval is already non-movable (due to the explicit dtor), so now it's non-copyable and non-movable. Fix the one case where we were relying on the (deprecated in C++11) implicit copy ctor of LiveInterval (which happened to work because the ctor created an object with a null segmentSet, so double-deleting the null pointer was fine). llvm-svn: 231192	2015-03-04 01:20:33 +00:00
David Blaikie	0b4edcf797	Recommit r231175: Change LiveStackAnalysis::SS2IntervalMap from std::map to std::unordered_map The order of this container was needed at one point - so, at that point create a temporary array of pointers, sort those, then iterate them. This keeps lookup efficient (& the lesser issue, of allowing the use of emplace... ), object identity preserved, and ordered iteration in the one place that requires it. While this has no functional change, I realize it does mean allocating an extra data structure and performing a sort - so if this looks suspect to anyone regarding perf characteristics, I'm all ears. llvm-svn: 231189	2015-03-04 01:15:53 +00:00
Matthias Braun	13805aa1b9	RegisterCoalescer: Gracefully continue if subrange merging fails. There is a known bug where the register coalescer fails to merge subranges when multiple ranges end up in the "overflow" bit 32 of the lanemasks. A proper fix for this is complicated so for now this is a workaround which lets the register coalescer drop the subregister liveness information (we just loose some precision by that) and continue. llvm-svn: 231186	2015-03-04 00:43:50 +00:00
Rafael Espindola	64eb6d620c	Drop the "eh_" from eh_func_begin and eh_func_end. They will be used for more than eh tables. llvm-svn: 231185	2015-03-04 00:27:43 +00:00
David Blaikie	584f0b5274	Revert "unique_ptrify LiveRange::segmentSet" Apparently something does care about ordering of LiveIntervals... so revert all that stuff (r231175, r231176, r231177) & take some time to re-evaluate. llvm-svn: 231184	2015-03-04 00:15:02 +00:00
David Blaikie	15dab25a2e	Recommit r231168: unique_ptrify LiveRange::segmentSet GCC 4.7's libstdc++ doesn't have std::map::emplace, but it does have std::unordered_map::emplace, and the use case here doesn't appear to need ordering. The container has been changed in a separate/precursor patch, and now this patch should hopefully build cleanly even with GCC 4.7. Original commit message: This makes LiveRange non-copyable, and LiveInterval is already non-movable (due to the explicit dtor), so now it's non-copyable and non-movable. Fix the one case where we were relying on the (deprecated in C++11) implicit copy ctor of LiveInterval (which happened to work because the ctor created an object with a null segmentSet, so double-deleting the null pointer was fine). llvm-svn: 231176	2015-03-03 23:53:03 +00:00
David Blaikie	697241f146	Revert "unique_ptrify LiveRange::segmentSet" GCC 4.7 shakes fist (doesn't have std::map::emplace... ) This reverts commit r231168. llvm-svn: 231173	2015-03-03 23:44:07 +00:00
David Blaikie	b042ba9ed4	unique_ptrify LiveRange::segmentSet This makes LiveRange non-copyable, and LiveInterval is already non-movable (due to the explicit dtor), so now it's non-copyable and non-movable. Fix the one case where we were relying on the (deprecated in C++11) implicit copy ctor of LiveInterval (which happened to work because the ctor created an object with a null segmentSet, so double-deleting the null pointer was fine). llvm-svn: 231168	2015-03-03 23:30:40 +00:00
Reid Kleckner	32718ad8f8	WinEH: Remove vestigial EH object Ultimately, we'll need to leave something behind to indicate which alloca will hold the exception, but we can figure that out when it comes time to emit the __CxxFrameHandler3 catch handler table. llvm-svn: 231164	2015-03-03 23:20:30 +00:00
Eric Christopher	5d29fd1be5	Fix a problem where the TwoAddressInstructionPass which generate redundant register moves in a loop. From: int M, total; void foo() { int i; for (i = 0; i < M; i++) { total = total + i / 2; } } This is the kernel loop: .LBB0_2: # %for.body =>This Inner Loop Header: Depth=1 movl %edx, %esi movl %ecx, %edx shrl $31, %edx addl %ecx, %edx sarl %edx addl %esi, %edx incl %ecx cmpl %eax, %ecx jl .LBB0_2 -------------------------- The first mov insn "movl %edx, %esi" could be removed if we change "addl %esi, %edx" to "addl %edx, %esi". The IR before TwoAddressInstructionPass is: BB#2: derived from LLVM BB %for.body Predecessors according to CFG: BB#1 BB#2 %vreg3<def> = COPY %vreg12<kill>; GR32:%vreg3,%vreg12 %vreg2<def> = COPY %vreg11<kill>; GR32:%vreg2,%vreg11 %vreg7<def,tied1> = SHR32ri %vreg3<tied0>, 31, %EFLAGS<imp-def,dead>; GR32:%vreg7,%vreg3 %vreg8<def,tied1> = ADD32rr %vreg3<tied0>, %vreg7<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg8,%vreg3,%vreg7 %vreg9<def,tied1> = SAR32r1 %vreg8<kill,tied0>, %EFLAGS<imp-def,dead>; GR32:%vreg9,%vreg8 %vreg4<def,tied1> = ADD32rr %vreg9<kill,tied0>, %vreg2<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg4,%vreg9,%vreg2 %vreg5<def,tied1> = INC64_32r %vreg3<kill,tied0>, %EFLAGS<imp-def,dead>; GR32:%vreg5,%vreg3 CMP32rr %vreg5, %vreg0, %EFLAGS<imp-def>; GR32:%vreg5,%vreg0 %vreg11<def> = COPY %vreg4; GR32:%vreg11,%vreg4 %vreg12<def> = COPY %vreg5<kill>; GR32:%vreg12,%vreg5 JL_4 <BB#2>, %EFLAGS<imp-use,kill> Now TwoAddressInstructionPass will choose vreg9 to be tied with vreg4. However, it doesn't see that there is copy from vreg4 to vreg11 and another copy from vreg11 to vreg2 inside the loop body. To remove those copies, it is necessary to choose vreg2 to be tied with vreg4 instead of vreg9. This code pattern commonly appears when there is reduction operation in a loop. So check for a reversed copy chain and if we encounter one then we can commute the add instruction so we can avoid a copy. Patch by Wei Mi. http://reviews.llvm.org/D7806 llvm-svn: 231148	2015-03-03 22:03:03 +00:00
David Blaikie	d695cc9032	DAGCombiner::LoadedSlice: Remove explicit copy ctor in favor of the Rule of Zero This way, the copy assignment operator can be used without hitting the deprecated case in C++11. llvm-svn: 231144	2015-03-03 21:50:47 +00:00
David Blaikie	5fd9cda286	Revert "Remove the explicit SDNodeIterator::operator= in favor of the implicit default" Accidentally committed a few more of these cleanup changes than intended. Still breaking these out & tidying them up. This reverts commit r231135. llvm-svn: 231136	2015-03-03 21:18:16 +00:00
David Blaikie	f9b228449d	Remove the explicit SDNodeIterator::operator= in favor of the implicit default There doesn't seem to be any need to assert that iterator assignment is between iterators over the same node - if you want to reuse an iterator variable to iterate another node, that's perfectly acceptable. Just don't mix comparisons between iterators into disjoint sequences, as usual. llvm-svn: 231135	2015-03-03 21:17:08 +00:00
David Blaikie	a22d013b3a	Remove LatencyPriorityQueue::dump because it relies on an implicit copy ctor which is deprecated in C++11 (due to the presence of a user-declare dtor in the base class) This type could be made copyable (= default a protected copy ctor in the base class, and preferably make the derived class final to avoid risks of providing a slicing copy operation to further derived classes) but it seemed easier to avoid that complexity for a dump function that I assume (by symmetry with ResourcePriorityQueue's dump, which was actively buggy) not often used. llvm-svn: 231133	2015-03-03 21:16:56 +00:00
David Blaikie	53c7e75dbf	unique_ptrify ResourcePriorityQueue::ResourceModel llvm-svn: 231127	2015-03-03 20:49:08 +00:00
David Blaikie	370206c461	Remove ResourcePriorityQueue::dump as it relies on copying a non-copyable type which would result in a double-delete llvm-svn: 231126	2015-03-03 20:49:05 +00:00
Andrew Kaylor	b825507df8	Fixing problem with field initialization order llvm-svn: 231122	2015-03-03 20:22:09 +00:00
Adrian Prantl	fadb9a888c	Fix PR22762. When emitting a DWARF expression check whether this is the frame register before checking if there is a DWARF register number for it. Thanks to H.J. Lu for diagnosing this and providing the testcase! llvm-svn: 231121	2015-03-03 20:12:52 +00:00
Andrew Kaylor	6232795e40	Outline cleanup handlers for native Windows C++ exception handling Differential Revision: http://reviews.llvm.org/D7865 llvm-svn: 231117	2015-03-03 20:00:16 +00:00
Eric Christopher	cb31f7b7b3	Add a comment above findRepresentativeClass explaining why it's where it is so that future generations can understand. llvm-svn: 231111	2015-03-03 19:47:14 +00:00
Dario Domizioli	ef32d033a1	Fix PR22750: non-determinism causes assertion failure in DWARF generation The cause of the issue is the interaction of two factors: 1) When generating a DW_TAG_imported_declaration DIE which imports another imported declaration, the code in AsmPrinter/DwarfCompileUnit.cpp asserts that the second imported declaration must already have a DIE. 2) There is a non-determinism in the order in which imported declarations within the same scope are processed. Because of the non-determinism (2), it is possible that an imported declaration is processed before another one it depends on, breaking the assumption in (1). The source of the non-determinism is that the imported declaration DIDescriptors are sorted by scope in DwarfDebug::beginModule(); however that sort is not a stable_sort, therefore the order of the declarations within the same scope is not preserved. The attached patch changes the std::sort to a std::stable_sort and it fixes the problem. Test omitted due to it being non-deterministic and depending on the implementation of std::sort. llvm-svn: 231100	2015-03-03 18:40:53 +00:00
Daniel Jasper	5816fc16b8	During PHI elimination, split critical edges that move copies out of loops. This prevents the behavior observed in llvm.org/PR22369. I am not sure whether I am reading the code correctly, but the early exit based on isLiveOutPastPHIs() seems to make the wrong assumption that RegisterCoalescer won't be able to coalesce those copies later. This change hides the new behavior behind -no-phi-elim-live-out-early-exit as it currently breaks four tests: * Assertion in: CodeGen/Hexagon/hwloop-cleanup.ll * Worse code in: CodeGen/X86/coalescer-commute4.ll CodeGen/X86/phys_subreg_coalesce-2.ll CodeGen/X86/zlib-longest-match.ll The root cause here seems to be that the heuristic that determines the visitation order in RegisterCoalescer gets less lucky. llvm-svn: 231064	2015-03-03 10:23:11 +00:00
Andrew Kaylor	8cfaab59d9	Remap arguments and non-alloca values used by outlined C++ exception handlers. Differential Revision: http://reviews.llvm.org/D7844 llvm-svn: 231042	2015-03-03 00:41:03 +00:00
Adrian Prantl	807869b230	Revert "Revert "For the dwarf expression code get the subtarget off of the current"" This reapplies r230990 without modifications. llvm-svn: 231024	2015-03-02 22:02:36 +00:00
Adrian Prantl	ebdf2406b7	Refactor DebugLocDWARFExpression so it doesn't require access to the TargetRegisterInfo. DebugLocEntry now holds a buffer with the raw bytes of the pre-calculated DWARF expression. Ought to be NFC, but it does slightly alter the output format of the textual assembly. This reapplies 230930 without the assertion in DebugLocEntry::finalize() because not all Machine registers can be lowered into DWARF register numbers and floating point constants cannot be expressed. llvm-svn: 231023	2015-03-02 22:02:33 +00:00
Rui Ueyama	dcef8234bd	Use read{16,32,64}{le,be}() instead of *reinterpret_cast<u{little,big}{16,32,64}_t>(). llvm-svn: 231016	2015-03-02 21:19:12 +00:00
Adrian Prantl	d8089cbc89	Revert "Refactor DebugLocDWARFExpression so it doesn't require access to the" This reverts commit 230975 to investigate buildbot breakage. llvm-svn: 231004	2015-03-02 20:01:54 +00:00
Adrian Prantl	12a9943af7	Revert "For the dwarf expression code get the subtarget off of the current" This reverts commit 230990 because also reverting 230975. llvm-svn: 231003	2015-03-02 20:01:47 +00:00
Eric Christopher	79e6bafcf3	For the dwarf expression code get the subtarget off of the current MachineFunction. llvm-svn: 230990	2015-03-02 19:01:47 +00:00
Adrian Prantl	e8d05839e3	Refactor DebugLocDWARFExpression so it doesn't require access to the TargetRegisterInfo. DebugLocEntry now holds a buffer with the raw bytes of the pre-calculated DWARF expression. Ought to be NFC, but it does slightly alter the output format of the textual assembly. This reapplies 230930 with a relaxed assertion in DebugLocEntry::finalize() that allows for empty DWARF expressions for constant FP values. llvm-svn: 230975	2015-03-02 17:21:06 +00:00
Benjamin Kramer	b9cb72cbe5	Accidentaly inverted the condition again. Sorry. llvm-svn: 230973	2015-03-02 16:45:08 +00:00
Benjamin Kramer	d525f7a9a0	Avoid assertion in MSVC 2013 debug builds. llvm-svn: 230972	2015-03-02 16:42:56 +00:00
Benjamin Kramer	95a7c55022	Simplify code. NFC. llvm-svn: 230948	2015-03-02 11:57:04 +00:00
Nico Weber	e4e94b4dca	Revert r230930, it caused PR22747. llvm-svn: 230932	2015-03-02 04:37:11 +00:00
Adrian Prantl	5e06e85b02	Refactor DebugLocDWARFExpression so it doesn't require access to the TargetRegisterInfo. DebugLocEntry now holds a buffer with the raw bytes of the pre-calculated DWARF expression. Ought to be NFC, but it does slightly alter the output format of the textual assembly. llvm-svn: 230930	2015-03-02 02:38:18 +00:00
Arnaud A. de Grandmaison	d0ab75130b	[PBQP] Address post-commit style comment for r230904. NFC. Thanks David ! llvm-svn: 230908	2015-03-01 21:22:50 +00:00
Arnaud A. de Grandmaison	0ce66152a8	[PBQP] Do not add an edge between nodes with totally disjoint allowed registers Such edges are zero matrix, and they bring no additional info to the allocation problem, apart from contributing to nodes' degree. Removing those edges is expected to improve allocation time. Tune the spill cost comparison, as this gives better average performances now that the nodes' degrees has changed. llvm-svn: 230904	2015-03-01 20:39:34 +00:00
Sanjay Patel	46d370c549	avoid infinite looping when folding vector multiplies of constants (PR22698) We were missing a check for the following fold in DAGCombiner: // fold (fmul (fmul x, c1), c2) -> (fmul x, (fmul c1, c2)) If 'x' is also a constant, then we shouldn't do anything. Otherwise, we could end up swapping the operands back and forth forever. This should fix: http://llvm.org/bugs/show_bug.cgi?id=22698 Differential Revision: http://reviews.llvm.org/D7917 llvm-svn: 230884	2015-03-01 00:09:35 +00:00
Benjamin Kramer	c1b31521c7	DwarfAccelTable: We know how many hashes we have in the output, just reserve the precise number llvm-svn: 230865	2015-02-28 20:15:00 +00:00
Benjamin Kramer	9cb5a80c17	StackColoring: Move set instead of copying. NFC. llvm-svn: 230864	2015-02-28 20:14:38 +00:00
Benjamin Kramer	9d5f68f7ad	LiveRange: Replace a creative vector erase loop with std::remove_if. I didn't see this so far because it scans backwards, but that doesn't make it any less quadratic. NFC. llvm-svn: 230863	2015-02-28 20:14:27 +00:00
Mehdi Amini	22544426f3	Fixup for recent -fast-isel-abort change: code didn't match description Level 1 should abort for all instructions but call/terminators/args. Instead it was aborting only if the level was > 2 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 230861	2015-02-28 19:34:54 +00:00
Benjamin Kramer	4718069d87	Convert push_back loops into append calls. No functionality change intended. llvm-svn: 230849	2015-02-28 13:20:15 +00:00
Benjamin Kramer	411a71a68d	ArrayRefize memory operand folding. NFC. llvm-svn: 230846	2015-02-28 12:04:00 +00:00
Benjamin Kramer	0f0bd365d8	Replace std::copy with a back inserter with vector append where feasible All of the cases were just appending from random access iterators to a vector. Using insert/append can grow the vector to the perfect size directly and moves the growing out of the loop. No intended functionalty change. llvm-svn: 230845	2015-02-28 10:11:12 +00:00
Benjamin Kramer	3baef96802	MachineDominators: Move applySplitCriticalEdges into the cpp file. It's too big for inlining anyways. Also clean it up slightly. No functionality change intended. llvm-svn: 230806	2015-02-27 23:13:13 +00:00
Benjamin Kramer	a9b2056013	Reduce double set lookups. llvm-svn: 230798	2015-02-27 21:43:14 +00:00
Eric Christopher	7c8f775d46	Remove the Forward Control Flow Integrity pass and its dependencies. This work is currently being rethought along different lines and if this work is needed it can be resurrected out of svn. Remove it for now as no current work in ongoing on it and it's unused. Verified with the authors before removal. llvm-svn: 230780	2015-02-27 19:03:38 +00:00
Mehdi Amini	32875af6e3	Change the fast-isel-abort option from bool to int to enable "levels" Summary: Currently fast-isel-abort will only abort for regular instructions, and just warn for function calls, terminators, function arguments. There is already fast-isel-abort-args but nothing for calls and terminators. This change turns the fast-isel-abort options into an integer option, so that multiple levels of strictness can be defined. This will help no being surprised when the "abort" option indeed does not abort, and enables the possibility to write test that verifies that no intrinsics are forgotten by fast-isel. Reviewers: resistor, echristo Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D7941 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 230775	2015-02-27 18:32:11 +00:00
Rafael Espindola	95e3cfd5ed	Centralize handling of the eh_begin and eh_end labels. This removes a bit of duplicated code and more importantly, remembers the labels so that they don't need to be looked up by name. This in turn allows for any name to be used and avoids a crash if the name we wanted was already taken. llvm-svn: 230772	2015-02-27 18:18:39 +00:00
Sanjoy Das	8f9c30bdcb	Don't modify the DenseMap being iterated over from within the loop that is iterating over it Inserting elements into a `DenseMap` invalidated iterators pointing into the `DenseMap` instance. Differential Revision: http://reviews.llvm.org/D7924 llvm-svn: 230719	2015-02-27 02:24:16 +00:00
Eric Christopher	29697aeb53	Rewrite MachineOperand::print and MachineInstr::print to avoid uses of TM->getSubtargetImpl and propagate to all calls. This could be a debugging regression in places where we had a TargetMachine and/or MachineFunction but don't have it as part of the MachineInstr. Fixing this would require passing a MachineFunction/Function down through the print operator, but none of the existing uses in tree seem to do this. llvm-svn: 230710	2015-02-27 00:11:34 +00:00
Rafael Espindola	efb97e5733	Put jump tables in distinct sections if -ffunction-sections is used. A small regression in r230411 was that we were basing the decision on -fdata-sections. llvm-svn: 230707	2015-02-26 23:55:11 +00:00
Eric Christopher	302e78f721	Remove DebugLoc::print(LLVMContext, raw_ostream), it was just forwarding to the one that didn't take a context. llvm-svn: 230700	2015-02-26 23:32:17 +00:00
Eric Christopher	454cbc40f6	getRegForInlineAsmConstraint wants to use TargetRegisterInfo for a lookup, pass that in rather than use a naked call to getSubtargetImpl. This involved passing down and around either a TargetMachine or TargetRegisterInfo. Update all callers/definitions around the targets and SelectionDAG. llvm-svn: 230699	2015-02-26 22:38:43 +00:00
Eric Christopher	ab11a1a5d9	Add a TargetMachine argument to the AddressingModeMatcher, we'll need this shortly to get a TargetRegisterInfo from the subtarget for TargetLowering routines. llvm-svn: 230698	2015-02-26 22:38:34 +00:00
Rafael Espindola	c61aeeff41	Simplify arange output. Move SectionMap to its only user (emitDebugARanges) and reorder to save a call to sort. llvm-svn: 230693	2015-02-26 22:02:02 +00:00
Paul Robinson	b0132db1ee	When the source has a series of assignments, users reasonably want to have the debugger step through each one individually. Turn off the combine for adjacent stores at -O0 so we get this behavior. Possibly, DAGCombine shouldn't run at all at -O0, but that's for another day; see PR22346. Differential Revision: http://reviews.llvm.org/D7181 llvm-svn: 230659	2015-02-26 18:47:57 +00:00
Eric Christopher	2a41cb1089	Remove an argument-less call to getSubtargetImpl from TargetLoweringBase. This required plumbing a TargetRegisterInfo through computeRegisterProperties and into findRepresentativeClass which uses it for register class iteration. This required passing a subtarget into a few target specific initializations of TargetLowering. llvm-svn: 230583	2015-02-26 00:00:24 +00:00
Eric Christopher	667d77b874	Move TargetLoweringBase::getTypeConversion to the .cpp file from the .h file. It's used in only one place (other than recursively) and there's no need to include it everywhere. Saves almost 900k from total llvm object file size. llvm-svn: 230561	2015-02-25 22:41:30 +00:00
Andrew Kaylor	1601d3591a	Fixing a problem with insert location in WinEH outlining llvm-svn: 230535	2015-02-25 20:12:49 +00:00
Rafael Espindola	565d0485ca	Support SHF_MERGE sections in COMDATs. This patch unifies the comdat and non-comdat code paths. By doing this it add missing features to the comdat side and removes the fixed section assumptions from the non-comdat side. In ELF there is no one true section for "4 byte mergeable" constants. We are better off computing the required properties of the section and asking the context for it. llvm-svn: 230411	2015-02-25 00:52:15 +00:00
David Majnemer	074e36baf8	PrologEpilogInserter: Clean up math in calculateFrameObjectOffsets There is no need to open-code the alignment calculation, we have a handy RoundUpToAlignment function which "Does The Right Thing (TM)". llvm-svn: 230392	2015-02-24 23:08:13 +00:00
Simon Pilgrim	50fa51790b	Reapplied D7816 & rL230177 & rL230278 - with an additional fix toensure that the smallest build vector input scalar type is always used. Additional (crash) test cases already committed. llvm-svn: 230388	2015-02-24 22:08:56 +00:00
Andrew Kaylor	914fdbf483	Fixing eol-style llvm-svn: 230378	2015-02-24 20:49:35 +00:00
Eric Christopher	db420cc1dd	Revert: Author: Simon Pilgrim <llvm-dev@redking.me.uk> Date: Mon Feb 23 23:04:28 2015 +0000 Fix based on post-commit comment on D7816 & rL230177 - BUILD_VECTOR operand truncation was using the the BV's output scalar type instead of the input type. and Author: Simon Pilgrim <llvm-dev@redking.me.uk> Date: Sun Feb 22 18:17:28 2015 +0000 [DagCombiner] Generalized BuildVector Vector Concatenation The CONCAT_VECTORS combiner pass can transform the concat of two BUILD_VECTOR nodes into a single BUILD_VECTOR node. This patch generalises this to support any number of BUILD_VECTOR nodes, and also permits UNDEF nodes to be included as well. This was noticed as AVX vec128 -> vec256 canonicalization sometimes creates a CONCAT_VECTOR with a real vec128 lower and an vec128 UNDEF upper. Differential Revision: http://reviews.llvm.org/D7816 as the root cause of PR22678 which is causing an assertion inside the DAG combiner. I'll follow up to the main thread as well. llvm-svn: 230358	2015-02-24 19:11:00 +00:00
Eric Christopher	7ee6f3d6c2	Rename UpdateRegAllocHint to match style guidelines. llvm-svn: 230357	2015-02-24 19:10:57 +00:00
Matthias Braun	2c6ac76701	DAGCombiner: Move variable definitions closer to use; NFC llvm-svn: 230354	2015-02-24 18:52:01 +00:00
Matthias Braun	69fb836050	DAGCombiner: Move variable declaration closer to definiion; NFC llvm-svn: 230353	2015-02-24 18:51:59 +00:00
Tim Northover	afcf85da25	ARM: treat [N x i32] and [N x i64] as AAPCS composite types The logic is almost there already, with our special homogeneous aggregate handling. Tweaking it like this allows front-ends to emit AAPCS compliant code without ever having to count registers or add discarded padding arguments. Only arrays of i32 and i64 are needed to model AAPCS rules, but I decided to apply the logic to all integer arrays for more consistency. llvm-svn: 230348	2015-02-24 17:22:34 +00:00
Hal Finkel	5b5de90baa	[SDAG] Handle LowerOperation returning its input consistently For almost all node types, if the target requested custom lowering, and LowerOperation returned its input, we'd treat the original node as legal. This did not work, however, for many loads and stores, because they follow slightly different code paths, and we did not account for the possibility of LowerOperation returning its input at those call sites. I think that we now handle this consistently everywhere. At the call sites in LegalizeDAG, we used to assert in this case, so there's no functional change for any existing code there. For the call sites in LegalizeVectorOps, this really only affects whether or not we set Changed = true, but I think makes the semantics clearer. No test case here, but it will be covered by an upcoming PowerPC commit adding QPX support. llvm-svn: 230332	2015-02-24 12:59:47 +00:00
Simon Pilgrim	ec80e98714	Fix based on post-commit comment on D7816 & rL230177 - BUILD_VECTOR operand truncation was using the the BV's output scalar type instead of the input type. llvm-svn: 230278	2015-02-23 23:04:28 +00:00
Andrea Di Biagio	fc5bcff188	[X86] Teach how to custom lower double-to-half conversions under fast-math. This patch teaches the backend how to expand a double-half conversion into a double-float conversion immediately followed by a float-half conversion. We do this only under fast-math, and if float-half conversions are legal for the target. Added test CodeGen/X86/fastmath-float-half-conversion.ll Differential Revision: http://reviews.llvm.org/D7832 llvm-svn: 230276	2015-02-23 22:59:02 +00:00
Bruno Cardoso Lopes	d2335534c5	[AsmPrinter] Access pointers to globals via pcrel GOT entries Front-ends could use global unnamed_addr to hold pointers to other symbols, like @gotequivalent below: @foo = global i32 42 @gotequivalent = private unnamed_addr constant i32* @foo @delta = global i32 trunc (i64 sub (i64 ptrtoint (i32** @gotequivalent to i64), i64 ptrtoint (i32* @delta to i64)) to i32) The global @delta holds a data "PC"-relative offset to @gotequivalent, an unnamed pointer to @foo. The darwin/x86-64 assembly output for this follows: .globl _foo _foo: .long 42 .globl _gotequivalent _gotequivalent: .quad _foo .globl _delta _delta: .long _gotequivalent-_delta Since unnamed_addr indicates that the address is not significant, only the content, we can optimize the case above by replacing pc-relative accesses to "GOT equivalent" globals, by a PC relative access to the GOT entry of the final symbol instead. Therefore, "delta" can contain a pc relative relocation to foo's GOT entry and we avoid the emission of "gotequivalent", yielding the assembly code below: .globl _foo _foo: .long 42 .globl _delta _delta: .long _foo@GOTPCREL+4 There are a couple of advantages of doing this: (1) Front-ends that need to emit a great deal of data to store pointers to external symbols could save space by not emitting such "got equivalent" globals and (2) IR constructs combined with this opt opens a way to represent GOT pcrel relocations by using the LLVM IR, which is something we previously had no way to express. Differential Revision: http://reviews.llvm.org/D6922 rdar://problem/18534217 llvm-svn: 230264	2015-02-23 21:26:18 +00:00
Andrew Kaylor	e73260c648	Removing unused private field. llvm-svn: 230259	2015-02-23 21:03:30 +00:00
Andrew Kaylor	8eb14460aa	Second attempt to fix WinEHCatchDirector build failures. llvm-svn: 230257	2015-02-23 20:44:34 +00:00
Andrew Kaylor	695cbcb2a1	Attempting to fix WinEHCatchDirector destructor related build failures. llvm-svn: 230252	2015-02-23 20:19:15 +00:00
Andrew Kaylor	4e424eb088	Remap frame variables for native Windows exception handling. Differential Revision: http://reviews.llvm.org/D7770 llvm-svn: 230249	2015-02-23 20:01:56 +00:00
Eric Christopher	673f20f883	Rewrite the global merge pass to be subprogram agnostic for now. It was previously using the subtarget to get values for the global offset without actually checking each function as it was generating code. Go ahead and solidify the current behavior and make the existing FIXMEs more prominent. As a note the ARM backend previously had a thumb1 and non-thumb1 set of defaults. Only the former was tested so I've changed the behavior to only use that for now. llvm-svn: 230245	2015-02-23 19:28:45 +00:00
Simon Pilgrim	fb2ad0f430	[DagCombiner] Generalized BuildVector Vector Concatenation The CONCAT_VECTORS combiner pass can transform the concat of two BUILD_VECTOR nodes into a single BUILD_VECTOR node. This patch generalises this to support any number of BUILD_VECTOR nodes, and also permits UNDEF nodes to be included as well. This was noticed as AVX vec128 -> vec256 canonicalization sometimes creates a CONCAT_VECTOR with a real vec128 lower and an vec128 UNDEF upper. Differential Revision: http://reviews.llvm.org/D7816 llvm-svn: 230177	2015-02-22 18:17:28 +00:00
Hal Finkel	9822c946fa	[DAGCombine] Don't assume integer-type legailty in reduceBuildVecConvertToConvertBuildVec DAGCombine will rewrite an BUILD_VECTOR where all non-undef inputs some from [US]INT_TO_FP, as a BUILD_VECTOR of integers with the conversion applied as a vector operation. We check operation legality of the conversion, but fail to check legality of the integer vector type itself. Because targets don't normally override operation legality defaults for illegal types, we need to check this also. This came up in the context of the QPX vector entensions for PowerPC (which can have legal floating-point vector types without corresponding legal integer vector types). No in-tree test case for this yes, but one can be added once the QPX support has been committed. llvm-svn: 230176	2015-02-22 16:10:22 +00:00
Hal Finkel	e88c3cdcc4	[SDAG] Use correct alignments on expanded vector trunc-store/ext-loads When expanding a truncating store or extending load using vector extracts or inserts and scalar stores and loads, we were giving each of these scalar stores or loads the same alignment as the original vector operation. While this will often be right (most vector operations, especially those produced by autovectorization, have the alignment of the underlying scalar type), the vector operation could certainly have a larger alignment. No test case (yet); noticed by inspection. llvm-svn: 230175	2015-02-22 15:58:04 +00:00
Benjamin Kramer	b345aa0024	MachineInstr: Use range-based for loops. NFC. llvm-svn: 230142	2015-02-21 17:08:08 +00:00
Benjamin Kramer	d7449ff688	Calling memmove on a MachineOperand is totally safe. While it's not POD due to the user-defined constructor, it's still a trivially copyable type. No functional change. llvm-svn: 230141	2015-02-21 16:22:48 +00:00
Eric Christopher	0b46346911	Unconditionally create a new MCInstrInfo in the asm printer for asm parsing since it's not subtarget dependent and we can't depend upon the one hanging off the MachineFunction's subtarget still being around. llvm-svn: 230135	2015-02-21 09:09:15 +00:00
David Majnemer	cc843a9a3a	X86: Call __main using the SelectionDAG Synthesizing a call directly using the MI layer would confuse the frame lowering code. This is problematic as frame lowering is highly sensitive the particularities of calls, etc. llvm-svn: 230129	2015-02-21 05:49:45 +00:00
Matthias Braun	be9efdd856	LiveRangeCalc: Don't start liveranges of PHI instruction at the block begin. Summary: Letting them begin at the PHI instruction slightly simplifies the code but more importantly avoids breaking the assumption that live ranges starting at the block begin are also live at the end of the predecessor blocks. The MachineVerifier checks that but was apparently never run in the few instances where liveranges are calculated for machine-SSA functions. Reviewers: qcolombet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7779 llvm-svn: 230093	2015-02-20 23:43:14 +00:00
Rafael Espindola	2661d3d69a	Use short names for jumptable sections. Also refactor code to remove some duplication. llvm-svn: 230087	2015-02-20 23:28:28 +00:00
Eric Christopher	5e8a7d821d	Used the cached subtarget off of the MachineFunction. llvm-svn: 230078	2015-02-20 22:36:11 +00:00
Matt Arsenault	79c7d059c9	Add generic fmad DAG node. This allows sharing of FMA forming combines to work with instructions that have the same semantics as a separate multiply and add. This is expand by default, and only formed post legalization so it shouldn't have much impact on targets that do not want it. llvm-svn: 230070	2015-02-20 22:10:33 +00:00
Eric Christopher	ef1a74c434	Grab the DataLayout off of the TargetMachine since that's where it's stored. llvm-svn: 230059	2015-02-20 20:56:39 +00:00
Eric Christopher	804b453f80	Get the function specific subtarget. llvm-svn: 230038	2015-02-20 18:44:17 +00:00
Eric Christopher	a480b0a899	Get the cached subtarget off the MachineFunction rather than inquiring for a new one from the TargetMachine. llvm-svn: 230037	2015-02-20 18:44:15 +00:00
Igor Laevsky	d3544cb9c6	Generalize statepoint lowering to use ImmutableStatepoint. Move statepoint lowering into a separate function 'LowerStatepoint' which uses ImmutableStatepoint instead of a CallInst. Also related utility functions are changed to receive ImmutableCallSite. Differential Revision: http://reviews.llvm.org/D7756 llvm-svn: 230017	2015-02-20 15:28:35 +00:00
Nick Lewycky	234d31cba6	Fix build with gcc. This has a -Wsequence-point error on 'MII', which is a good point. llvm-svn: 229979	2015-02-20 07:17:40 +00:00
Eric Christopher	092557bc6f	Remove more uses of TargetMachine::getSubtargetImpl from the AsmPrinter. getSubtargetInfo now asserts that the MachineFunction exists. Debug printing of register naming now uses the register info from MCAsmInfo as that's unchanging. llvm-svn: 229978	2015-02-20 07:16:19 +00:00
Eric Christopher	c1722581e0	AsmPrinter::doFinalization is at the module level and so doesn't have access to a target specific subtarget info. Grab the module level MCSubtargetInfo for the JumpInstrTable output stubs. llvm-svn: 229974	2015-02-20 06:59:48 +00:00
Eric Christopher	5c2882940e	Remove the MCInstrInfo cached variable as it was only used in a single place and replace calls to getSubtargetImpl with calls to get the subtarget from the MachineFunction where valid. llvm-svn: 229971	2015-02-20 06:35:21 +00:00
Chandler Carruth	4c2aad4a26	Revert r229944: EH: Prune unreachable resume instructions during Dwarf EH preparation This doesn't pass 'ninja check-llvm' for me. Lots of tests, including the ones updated, fail with crashes and other explosions. llvm-svn: 229952	2015-02-20 02:15:36 +00:00
Reid Kleckner	13eabcdf32	EH: Prune unreachable resume instructions during Dwarf EH preparation Today a simple function that only catches exceptions and doesn't run destructor cleanups ends up containing a dead call to _Unwind_Resume (PR20300). We can't remove these dead resume instructions during normal optimization because inlining might introduce additional landingpads that do have cleanups to run. Instead we can do this during EH preparation, which is guaranteed to run after inlining. Fixes PR20300. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D7744 llvm-svn: 229944	2015-02-20 01:00:19 +00:00
Eric Christopher	dd67a28429	This needs to be a const variable so the two sides of the ternary operator agree on type. llvm-svn: 229938	2015-02-20 00:03:45 +00:00
Eric Christopher	bc999b55a9	Only use the initialized MCInstrInfo if it's been initialized already during SetupMachineFunction. This is also the single use of MII and it'll be changing to TargetInstrInfo (which is MachineFunction based) in the next commit here. llvm-svn: 229931	2015-02-19 23:52:35 +00:00
Eric Christopher	0880ea126d	Migrate away a use of the subtarget (and TargetMachine) from AsmPrinterDwarf since the information is on the MCRegisterInfo via the MCContext and MMI that we already have on the AsmPrinter. llvm-svn: 229928	2015-02-19 23:29:42 +00:00
Ahmed Bougacha	5920886c7a	[CodeGen] Use ArrayRef instead of std::vector&. NFC. The former lets us use SmallVectors. Do so in ARM and AArch64. llvm-svn: 229925	2015-02-19 23:13:10 +00:00
Eric Christopher	ffd5ade469	MCTargetOptions reside on the TargetMachine that we always have via TargetOptions. llvm-svn: 229917	2015-02-19 21:29:51 +00:00
Eric Christopher	3a3ea0cf22	Remove a call to TargetMachine::getSubtarget from the inline asm support in the asm printer. If we can get a subtarget from the machine function then we should do so, otherwise we can go ahead and create a default one since we're at the module level. llvm-svn: 229916	2015-02-19 21:24:23 +00:00
Eric Christopher	ad47e0cf41	Remove unused argument from emitInlineAsmStart. llvm-svn: 229907	2015-02-19 19:52:25 +00:00
Eric Christopher	edf6e8ae9f	Update and remove a few calls to TargetMachine::getSubtargetImpl out of the asm printer. llvm-svn: 229883	2015-02-19 18:46:23 +00:00
Benjamin Kramer	c0850fa665	Demote vectors to arrays. No functionality change. llvm-svn: 229861	2015-02-19 15:26:17 +00:00
Chandler Carruth	105d2fa5e8	[x86,sdag] Two interrelated changes to the x86 and sdag code. First, don't combine bit masking into vector shuffles (even ones the target can handle) once operation legalization has taken place. Custom legalization of vector shuffles may exist for these patterns (making the predicate return true) but that custom legalization may in some cases produce the exact bit math this matches. We only really want to handle this prior to operation legalization. However, the x86 backend, in a fit of awesome, relied on this. What it would do is mark VSELECTs as expand, which would turn them into arithmetic, which this would then match back into vector shuffles, which we would then lower properly. Amazing. Instead, the second change is to teach the x86 backend to directly form vector shuffles from VSELECT nodes with constant conditions, and to mark all of the vector types we support lowering blends as shuffles as custom VSELECT lowering. We still mark the forms which actually support variable blends as legal so that the custom lowering is bypassed, and the legal lowering can even be used by the vector shuffle legalization (yes, i know, this is confusing. but that's how the patterns are written). This makes the VSELECT lowering much more sensible, and in fact should fix a bunch of bugs with it. However, as you'll see in the test cases, right now what it does is point out the hilarious deficiency of the new vector shuffle lowering when it comes to blends. Fortunately, my very next patch fixes that. I can't submit it yet, because that patch, somewhat obviously, forms the exact and/or pattern that the DAG combine is matching here! Without this patch, teaching the vector shuffle lowering to produce the right code infloops in the DAG combiner. With this patch alone, we produce terrible code but at least lower through the right paths. With both patches, all the regressions here should be fixed, and a bunch of the improvements (like using 2 shufps with no memory loads instead of 2 andps with memory loads and an orps) will stay. Win! There is one other change worth noting here. We had hilariously wrong vectorization cost estimates for vselect because we fell through to the code path that assumed all "expand" vector operations are scalarized. However, the "expand" lowering of VSELECT is vector bit math, most definitely not scalarized. So now we go back to the correct if horribly naive cost of "1" for "not scalarized". If anyone wants to add actual modeling of shuffle costs, that would be cool, but this seems an improvement on its own. Note the removal of 16 and 32 "costs" for doing a blend. Even in SSE2 we can blend in fewer than 16 instructions. ;] Of course, we don't right now because of OMG bad code, but I'm going to fix that. Next patch. I promise. llvm-svn: 229835	2015-02-19 10:36:19 +00:00
Reid Kleckner	53bc7b1858	Add an IR-to-IR test for dwarf EH preparation using opt This tests the simple resume instruction elimination logic that we have before making some changes to it. llvm-svn: 229768	2015-02-18 23:17:41 +00:00
Andrew Kaylor	de93ac8895	Style and formatting fixes for r229715 llvm-svn: 229758	2015-02-18 22:52:18 +00:00
Reid Kleckner	c8eb2119c4	dos2unix the WinEH file and tests llvm-svn: 229735	2015-02-18 19:52:46 +00:00
David Blaikie	3963f3adef	Remove unused member variables (-Wunused-private-field) llvm-svn: 229722	2015-02-18 18:52:49 +00:00
Andrew Kaylor	eca1819627	Adding implementation to outline C++ catch handlers for native Windows 64 exception handling. Differential Revision: http://reviews.llvm.org/D7363 llvm-svn: 229715	2015-02-18 18:31:51 +00:00
Michael Kuperstein	8617e481c7	Fixes two issue in SimplifyDemandedBits of sext_in_reg: 1) We should not try to simplify if the sext has multiple uses 2) There is no need to simplify is the source value is already sign-extended. Patch by Gil Rapaport <gil.rapaport@intel.com> Differential Revision: http://reviews.llvm.org/D6949 llvm-svn: 229659	2015-02-18 09:43:40 +00:00
Daniel Jasper	1a2370862a	NFC: Use range-based for loops and more consistent naming. No functional changes intended. (I plan on doing some modifications to this function and would like to have as few unrelated changes as possible in the patch) llvm-svn: 229649	2015-02-18 08:19:16 +00:00
Daniel Jasper	3f53d83cd0	Remove experimental options to control machine block placement. This reverts r226034. Benchmarking with those flags has not revealed anything interesting. llvm-svn: 229648	2015-02-18 08:18:07 +00:00
Matthias Braun	85e8274991	LiveRangeCalc: Rename some parameters from kill to use, NFC. Those parameters did not necessarily describe kill points but just uses. llvm-svn: 229601	2015-02-18 01:50:52 +00:00
Rafael Espindola	5bb7947928	Twines should be passed by const ref. llvm-svn: 229590	2015-02-17 23:44:22 +00:00
Rafael Espindola	be6855ec2a	Add r228939 back with a fix. The problem in the original patch was not switching back to .text after printing an eh table. Original message: On ELF, put PIC jump tables in a non executable section. Fixes PR22558. llvm-svn: 229586	2015-02-17 23:34:51 +00:00
Duncan P. N. Exon Smith	e7c97c9135	AsmPrinter: Take range in DwarfExpression::AddExpression(), NFC Previously `DwarfExpression::AddExpression()` relied on default-constructing the end iterators for `DIExpression` -- once the operands are represented explicitly via `MDExpression` (instead of via the strange `StringRef` navigator in `DIHeaderIterator`) this won't work. Explicitly take an iterator for the end of the range. llvm-svn: 229572	2015-02-17 22:30:56 +00:00
Rafael Espindola	1dfd40aabb	Add r228980 back. Add support for having multiple sections with the same name and comdat. Using this in combination with -ffunction-sections allows LLVM to output a .o file with mulitple sections named .text. This saves space by avoiding long unique names of the form .text.<C++ mangled name>. llvm-svn: 229541	2015-02-17 20:48:01 +00:00
Eric Christopher	5962c00022	Make the ARM AsmPrinter independent of global subtarget initialization. Initialize the subtarget once per function and migrate Emit{Start\|End}OfAsmFile to either use attributes on the TargetMachine or get information from the subtarget we'd use for assembling. One bit (getISAEncoding) touched the general AsmPrinter and the debug output. Handle this one by passing the function for the subprogram down and updating all callers and users. The top-level-ness of the ARM attribute output for assembly is, by nature, contrary to how we'd want to do this for an LTO situation where we have multiple cpu architectures so this solution is good enough for now. llvm-svn: 229528	2015-02-17 20:02:32 +00:00
Eric Christopher	309b378b7d	80-column fixups. llvm-svn: 229527	2015-02-17 20:02:28 +00:00
Sanjay Patel	0117b06b3f	Canonicalize splats as build_vectors (PR22283) This is a follow-on patch to: http://reviews.llvm.org/D7093 That patch canonicalized constant splats as build_vectors, and this patch removes the constant check so we can canonicalize all splats as build_vectors. This fixes the 2nd test case in PR22283: http://llvm.org/bugs/show_bug.cgi?id=22283 The unfortunate code duplication between SelectionDAG and DAGCombiner is discussed in the earlier patch review. At least this patch is just removing code... This improves an existing x86 AVX test and changes codegen in an ARM test. Differential Revision: http://reviews.llvm.org/D7389 llvm-svn: 229511	2015-02-17 16:54:32 +00:00
Benjamin Kramer	21bee91af5	Prefer SmallVector::append/insert over push_back loops. Same functionality, but hoists the vector growth out of the loop. llvm-svn: 229500	2015-02-17 15:29:18 +00:00
Duncan P. N. Exon Smith	cf766d8c3d	AsmPrinter: Use DIExpression default constructor, NFC llvm-svn: 229464	2015-02-17 02:42:45 +00:00
Duncan P. N. Exon Smith	b89af4c667	AsmPrinter: Stop creating DebugLocs While looking at a heap profile of a clang LTO bootstrap with -g, I noticed that 2.2% of memory in an `llvm-lto` of clang is from calling `DebugLoc::get()` in `collectVariableInfo()` (accounting for ~40% of memory used for `MDLocation`s). I suspect this was introduced by r226736, whose goal was to prevent uniquing of `DebugLoc`s (goal achieved, if so). There's no reason we need a `DebugLoc` here at all -- it was just being used for (in)convenient API -- so the fix is to pass the scope and inlined-at directly to `LexicalScopes::findInlinedScope()`. llvm-svn: 229459	2015-02-17 00:02:27 +00:00
Matthias Braun	9fab4aa0a3	RegisterCoalescer: Don't rematerialize subregister definitions. We cannot simply rematerialize instructions which only defining a subregister, as the final value also depends on the previous instructions. This fixes test/CodeGen/R600/subreg-coalescer-bug.ll with subreg liveness enabled. llvm-svn: 229444	2015-02-16 22:05:17 +00:00
Matthias Braun	98c8c4e394	RegisterCoalescer: Do not look for regclass of IMPLICIT_DEF. IMPLICIT_DEF is a generic instruction and has no (fixed) output register class defined. The rematerialization code of the register coalescer should not scan the instruction description for a register class. This fixes a problem showing up in test/CodeGen/R600/subreg-coalescer-crash.ll with subregister liveness enabled. llvm-svn: 229443	2015-02-16 22:05:12 +00:00
Mehdi Amini	d44ae390cb	SelectionDAG: fold (fp_to_u/sint (s/uint_to_fp)) here too Update SPARC tests to match. From: Fiona Glaser <fglaser@apple.com> llvm-svn: 229438	2015-02-16 21:47:58 +00:00
Matthias Braun	56362c4e5b	RegisterCoalescer: Improve previous fix for wrong def after. The previous fix in r225503 was needlessly complicated. The problem goes away as well if the arguments to MergeValueNumberInto are supplied in the correct order. This was previously missed because the existing code already had the wrong order but an additional later Merge was hiding the bug for the main liverange VNI. llvm-svn: 229424	2015-02-16 19:34:27 +00:00
Aaron Ballman	523472eab5	MSVC 2013 supports std::forward_as_tuple, while MSVC 2012 did not; so we can move to using the improved API. llvm-svn: 229414	2015-02-16 18:21:19 +00:00
Andrew Trick	e7964c82c7	AArch64: Safely handle the incoming sret call argument. This adds a safe interface to the machine independent InputArg struct for accessing the index of the original (IR-level) argument. When a non-native return type is lowered, we generate the hidden machine-level sret argument on-the-fly. Before this fix, we were representing this argument as OrigArgIndex == 0, which is an outright lie. In particular this crashed in the AArch64 backend where we actually try to access the type of the original argument. Now we use a sentinel value for machine arguments that have no original argument index. AArch64, ARM, Mips, and PPC now check for this case before accessing the original argument. Fixes <rdar://19792160> Null pointer assertion in AArch64TargetLowering llvm-svn: 229413	2015-02-16 18:10:47 +00:00
Michael Kuperstein	70d8ac0a8e	Fix quoting of #pragma comment for MS compat, LLVM part. For #pragma comment(linker, ...) MSVC expects the comment string to be quoted, but for #pragma comment(lib, ...) the compiler itself quotes the library name. Since this distinction disappears by the time the directive reaches the backend, move quoting for the "lib" version to the frontend. Differential Revision: http://reviews.llvm.org/D7652 llvm-svn: 229375	2015-02-16 11:57:17 +00:00
Aaron Ballman	0b45511a2e	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229340	2015-02-15 22:54:22 +00:00
Chandler Carruth	0b684c6980	[SDAG] Teach the SelectionDAG to canonicalize vector shuffles of splats directly into blends of the splats. These patterns show up even very late in the vector shuffle lowering where we don't have any chance for DAG combining to kick in, and blending is a tremendously simpler operation to model. By coercing the shuffle into a blend we can much more easily match and lower shuffles of splats. Immediately with this change there are significantly more blends being matched in the x86 vector shuffle lowering. llvm-svn: 229308	2015-02-15 12:18:12 +00:00
Chandler Carruth	6153ae3921	[x86] Fix PR22377, a regression with the new vector shuffle legality test. This was just a matter of the DAG combine for vector shuffles being too aggressive. This is a bit of a grey area, but I think generally if we can re-use intermediate shuffles, we should. Certainly, given the test cases I have available, this seems like the right call. llvm-svn: 229285	2015-02-15 07:01:10 +00:00
Duncan P. N. Exon Smith	33685ffe5d	CodeGen: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) Also, add `Function::getFnStackAlignment()`, and canonicalize: getAttributes().getStackAlignment(AttributeSet::FunctionIndex) => getFnStackAlignment() llvm-svn: 229208	2015-02-14 01:44:41 +00:00
Matthias Braun	cad0e84d8e	Revert "On ELF, put PIC jump tables in a non executable section." This reverts commit r228939. The commit broke something in the output of exception handling tables on darwin x86-64. llvm-svn: 229203	2015-02-14 01:16:54 +00:00
Reid Kleckner	56669ab852	Unify the two EH personality classification routines I wrote We only need one. llvm-svn: 229193	2015-02-14 00:21:02 +00:00
Andrea Di Biagio	7791371c4e	[CodeGenPrepare] Removed duplicate logic. SimplifyCFG already knows how to speculate calls to cttz/ctlz. SimplifyCFG now knows how to speculate calls to intrinsic cttz/ctlz that are 'cheap' for the target. Therefore, some of the logic in CodeGenPrepare that was originally added at revision 224899 can now be removed. This patch is basically a no functional change. It removes the duplicated logic in CodeGenPrepare and converts all the existing target specific tests for cttz/ctlz into SimplifyCFG tests. Differential Revision: http://reviews.llvm.org/D7608 llvm-svn: 229105	2015-02-13 14:15:48 +00:00
Arnaud A. de Grandmaison	b1fde3b904	[PBQP] Conservativelly allocatable nodes can be spilled and give a better solution Although such nodes are allocatable, the cost of spilling may be less than allocating to register, so spilling the node may provide a better solution. The assert does not account for this case, so remove it for now. llvm-svn: 229103	2015-02-13 12:04:42 +00:00
Chandler Carruth	18e8c62883	[PM] Remove the old 'PassManager.h' header file at the top level of LLVM's include tree and the use of using declarations to hide the 'legacy' namespace for the old pass manager. This undoes the primary modules-hostile change I made to keep out-of-tree targets building. I sent an email inquiring about whether this would be reasonable to do at this phase and people seemed fine with it, so making it a reality. This should allow us to start bootstrapping with modules to a certain extent along with making it easier to mix and match headers in general. The updates to any code for users of LLVM are very mechanical. Switch from including "llvm/PassManager.h" to "llvm/IR/LegacyPassManager.h". Qualify the types which now produce compile errors with "legacy::". The most common ones are "PassManager", "PassManagerBase", and "FunctionPassManager". llvm-svn: 229094	2015-02-13 10:01:29 +00:00
Chandler Carruth	33dabe4f44	Re-sort #include lines using my handy dandy ./utils/sort_includes.py script. This is in preparation for changes to lots of include lines. llvm-svn: 229088	2015-02-13 09:09:03 +00:00
Chandler Carruth	6c78cd7569	Revert a series of commits starting at r228886 which is triggering some regressions for LLDB on Linux. Rafael indicated on lldb-dev that we should just go ahead and revert these but that he wasn't at a computer. The patches backed out are as follows: r228980: Add support for having multiple sections with the name and ... r228889: Invert the section relocation map. r228888: Use the existing SymbolTableIndex intsead of doing a lookup. r228886: Create the Section -> Rel Section map when it is first needed. These patches look pretty nice to me, so hoping its not too hard to get them re-instated. =D llvm-svn: 229080	2015-02-13 07:52:39 +00:00
Rafael Espindola	4467ec2e41	Add support for having multiple sections with the same name and comdat. Using this in combination with -ffunction-sections allows LLVM to output a .o file with mulitple sections named .text. This saves space by avoiding long unique names of the form .text.<C++ mangled name>. llvm-svn: 228980	2015-02-12 23:29:51 +00:00
Hal Finkel	76da9b7997	[SDAG] Don't try to use FP_EXTEND/FP_ROUND for int<->fp promotions The PowerPC backend has long promoted some floating-point vector operations (such as select) to integer vector operations. Unfortunately, this behavior was broken by r216555. When using FP_EXTEND/FP_ROUND for promotions, we must check that both the old and new types are floating-point types. Otherwise, we must use BITCAST as we did prior to r216555 for everything. llvm-svn: 228969	2015-02-12 22:43:52 +00:00
Rafael Espindola	5feecddc53	On ELF, put PIC jump tables in a non executable section. Fixes PR22558. llvm-svn: 228939	2015-02-12 17:46:49 +00:00
Rafael Espindola	fb65819e24	Put each jump table in an independent section if the function is too. This allows the linker to GC both, fixing pr22557. llvm-svn: 228937	2015-02-12 17:16:46 +00:00
Benjamin Kramer	4b76aa3d46	MathExtras: Bring Count(Trailing\|Leading)Ones and CountPopulation in line with countTrailingZeros Update all callers. llvm-svn: 228930	2015-02-12 15:35:40 +00:00

... 3 4 5 6 7 ...

18268 Commits