llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Matthias Braun	57ec25e7ce	LiveInterval: Introduce createMainRangeFromSubranges(). This function constructs the main liverange by merging all subranges if subregister liveness tracking is available. This should be slightly faster to compute instead of performing the liveness calculation again for the main range. More importantly it avoids cases where the main liverange would cover positions where no subrange was live. These cases happened for partial definitions where the actual defined part was dead and only the undefined parts used later. The register coalescing requires that every part covered by the main live range has at least one subrange live. I also expect this function to become useful later for places where the subranges are modified in a way that it is hard to correctly fix the main liverange in the machine scheduler, we can simply reconstruct it from subranges then. llvm-svn: 224806	2014-12-24 02:11:51 +00:00
Matthias Braun	97612b7c58	RegisterCoalescer: With subrange liveness there may be no RedefVNI for unused lanes. llvm-svn: 224805	2014-12-24 02:11:48 +00:00
Matthias Braun	9adcb401e3	LiveRangeEdit: Check for completely empty subranges after removing ValNos. Completely empty subranges are not allowed and must be removed when subreg liveness is enabled. llvm-svn: 224804	2014-12-24 02:11:46 +00:00
Matthias Braun	2e61b0a55a	LiveIntervalAnalysis: Fix performance bug that I introduced in r224663. Without a reference the code did not remember when moving the iterators of the subranges/registerunit ranges forward and instead would scan from the beginning again at the next position. llvm-svn: 224803	2014-12-24 02:11:43 +00:00
Peter Zotov	e82997fcba	[OCaml] PR21901: Update tests. This finishes the fix partially applied by r224782. llvm-svn: 224802	2014-12-24 01:58:45 +00:00
Peter Zotov	52abf62de2	[OCaml] Expose Llvm_executionengine.get_{global_value,function}_address. Patch by Ramkumar Ramachandra <artagnon@gmail.com>. Also remove Llvm_executionengine.get_pointer_to_global, as it is actually deprecated and didn't appear in a stable release. llvm-svn: 224801	2014-12-24 01:52:51 +00:00
Chandler Carruth	aa0e0370a0	[SROA] Update the documentation and names for accessing the slices within a partition of an alloca in SROA. This reflects the fact that the organization of the slices isn't really ideal for analysis, but is the naive way in which the slices are available while we're processing them in the core partitioning algorithm. It is possible we could improve matters, and I've left a FIXME with one of my ideas for how to do this, but it is a lot of work, the benefit is somewhat minor, and it isn't clear that it would be strictly better. =/ Not really satisfying, but I'm out of really good ideas. This also improves one place where the debug logging failed to mark some split partitions. Now we log in one place, slightly later, and with accurate information about whether the slice is split by the partition being rewritten. llvm-svn: 224800	2014-12-24 01:48:09 +00:00
Adrian Prantl	fcd65fd312	Debug Info: In symmetry to DW_TAG_pointer_type, do not emit the byte size of a DW_TAG_ptr_to_member_type. This restores the behavior from before r224780-r224781. llvm-svn: 224799	2014-12-24 01:17:51 +00:00
Chandler Carruth	268dbb6208	[SROA] Refactor the integer and vector promotion testing logic to operate in terms of the new Partition class, and generally have a more clear set of arguments. No functionality changed. The most notable improvements here are consistently using the terminology of 'partition' for a collection of slices that will be rewritten together and 'slice' for a region of an alloca that is used by a particular instruction. This also makes it more clear that the split things are actually slices as well, just ones that will be split by the proposed partition. This doesn't yet address the confusing aspects of the partition's interface where slices that will be split by the partition and start prior to the partition are accessed via Partition::splitSlices() while the core range of slices exposed by a Partition includes both unsplit slices and slices which will be split by the end, but started within the offset range of the partition. This is particularly hard to address because the algorithm which computes partitions quite literally doesn't know which slices these will end up being until too late. I'm looking at whether I can fix that or not, but I'm not optimistic. I'll update the comments and/or names to further explain this either way. I've also added one FIXME in this patch relating to this confusion so that I don't forget about it. llvm-svn: 224798	2014-12-24 01:05:14 +00:00
Colin LeMahieu	6ce3a834b9	[Hexagon] Removing old classes. llvm-svn: 224795	2014-12-24 00:43:00 +00:00
Kevin Enderby	80b4a3c2ca	Another attempt to fix the LLVM Windows build bot lld-x86_64-win7, one last place to fix I think. llvm-svn: 224794	2014-12-24 00:16:51 +00:00
Kevin Enderby	7152971b3b	Attempt to fix the LLVM Windows build bot lld-x86_64-win7. llvm-svn: 224793	2014-12-23 23:43:59 +00:00
Kevin Enderby	3c109420ec	Add printing the LC_THREAD load commands with llvm-objdump’s -private-headers. llvm-svn: 224792	2014-12-23 22:56:39 +00:00
Kostya Serebryany	7c74b14d55	[asan] change the coverage collection scheme so that we can easily emit coverage for the entire process as a single bit set, and if coverage_bitset=1 actually emit that bitset llvm-svn: 224789	2014-12-23 22:32:17 +00:00
Hal Finkel	e37bd38154	[PowerPC] Ensure that the TOC reload directly follows bctrl on PPC64 On non-Darwin PPC64, the TOC reload needs to come directly after the bctrl instruction (for indirect calls) because the 'bctrl/ld 2, 40(1)' instruction sequence is interpreted by the unwinding code in libgcc. To make sure these occur as a pair, as with other pairings interpreted by the linker, fuse the two instructions into one instruction (for code generation only). In the future, we might wish to do this by emitting CFI directives instead, but this solution is simpler, and mirrors what GCC does. Additional discussion on this point is contained in the PR. Fixes PR22015. llvm-svn: 224788	2014-12-23 22:29:40 +00:00
Colin LeMahieu	3a9d8a20be	[Hexagon] Adding doubleword load. llvm-svn: 224787	2014-12-23 20:44:59 +00:00
Colin LeMahieu	c8d82f0149	[Hexagon] Reapplying 224775 load words. llvm-svn: 224786	2014-12-23 20:02:16 +00:00
Jozef Kolek	a7fba787ce	[mips][microMIPS] Implement CACHE, PREF, SSNOP, EHB and PAUSE instructions Differential Revision: http://reviews.llvm.org/D5204 llvm-svn: 224785	2014-12-23 19:55:34 +00:00
Colin LeMahieu	240787f100	Reverting 224775 until mayLoad flag is addressed. llvm-svn: 224783	2014-12-23 19:22:59 +00:00
Rafael Espindola	e3ea3709d0	Finish removing DestroySource. Fixes pr21901. llvm-svn: 224782	2014-12-23 19:16:45 +00:00
Adrian Prantl	7f2c79ccd0	DIBuilder: Similar to createPointerType, make createMemberPointerType take a size and alignment. Several assertions in DwarfDebug rely on all variable types to report back a size, or to be derived from a type with a size. Tested in CFE. llvm-svn: 224780	2014-12-23 19:11:47 +00:00
Mehdi Amini	1f065b2623	Always assert in DAGCombine and not only when -debug is enabled Right now in DAG Combine check the validity of the returned type only when -debug is given on the command line. However usually the test cases in the validation does not use -debug. An Assert build should always check this. llvm-svn: 224779	2014-12-23 18:59:02 +00:00
Rafael Espindola	b47e702061	Pass LSAN_OPTIONS down so that it is possible to add suppressions. llvm-svn: 224777	2014-12-23 18:39:02 +00:00
Rafael Espindola	f9dc0ab110	Fix a leak found by asan. llvm-svn: 224776	2014-12-23 18:18:37 +00:00
Colin LeMahieu	9d1882c36f	[Hexagon] Adding word loads. llvm-svn: 224775	2014-12-23 18:06:56 +00:00
Colin LeMahieu	263816de1a	[Hexagon] Adding signed halfword loads. llvm-svn: 224774	2014-12-23 17:25:57 +00:00
Rafael Espindola	136f9d1837	Fix a leak found by asan. llvm-svn: 224773	2014-12-23 17:20:23 +00:00
Colin LeMahieu	df751494b1	[Hexagon] Adding unsigned halfword load. llvm-svn: 224772	2014-12-23 16:42:57 +00:00
Jozef Kolek	814723a8ed	[mips][microMIPS] Implement LWSP and SWSP instructions Differential Revision: http://reviews.llvm.org/D6416 llvm-svn: 224771	2014-12-23 16:16:33 +00:00
Peter Zotov	41942af9a1	[OCaml] PR22014: OCaml bindings didn't link to libLLVM-*.so with -Wl,--as-needed Patch by Evangelos Foutras <evangelos@foutrelis.com>. llvm-svn: 224766	2014-12-23 13:09:59 +00:00
Michael Kuperstein	180649bb38	[ValueTracking] Move GlobalAlias handling to be after the max depth check in computeKnownBits() GlobalAlias handling used to be after GlobalValue handling, which meant it was, in practice, dead code. r220165 moved GlobalAlias handling to be before GlobalValue handling, but also moved it to be before the max depth check, causing an assert due to a recursion depth limit violation. This moves GlobalAlias handling forward to where it's safe, and changes the GlobalValue handling to only look at GlobalObjects. Differential Revision: http://reviews.llvm.org/D6758 llvm-svn: 224765	2014-12-23 11:33:41 +00:00
Elena Demikhovsky	bb8ca1f551	AVX-512: Added FMA instructions, intrinsics and tests for KNL and SKX targets by Asaf Badouh http://reviews.llvm.org/D6456 llvm-svn: 224764	2014-12-23 10:30:39 +00:00
Hal Finkel	f0195675c6	[PowerPC] Don't mark the return-address slot as immutable It is tempting to mark the fixed stack slot used to store the return address as immutable when lowering @llvm.returnaddress(i32 0). Unfortunately, within the function, it is not completely immutable: it is written during the function prologue. When using post-RA instruction scheduling, the prologue instructions are available for scheduling, and we're not free to interchange the order of a particular store in the prologue with loads from that stack location. Fixes PR21976. llvm-svn: 224761	2014-12-23 09:45:06 +00:00
Elena Demikhovsky	cfbcf5995c	AVX-512: BLENDM - fixed encoding of the broadcast version Added more intrinsics and encoding tests. llvm-svn: 224760	2014-12-23 09:36:28 +00:00
Michael Kuperstein	81a40a5cbe	[DagCombine] Improve DAGCombiner BUILD_VECTOR when it has two sources of elements This partially fixes PR21943. For AVX, we go from: vmovq (%rsi), %xmm0 vmovq (%rdi), %xmm1 vpermilps $-27, %xmm1, %xmm2 ## xmm2 = xmm1[1,1,2,3] vinsertps $16, %xmm2, %xmm1, %xmm1 ## xmm1 = xmm1[0],xmm2[0],xmm1[2,3] vinsertps $32, %xmm0, %xmm1, %xmm1 ## xmm1 = xmm1[0,1],xmm0[0],xmm1[3] vpermilps $-27, %xmm0, %xmm0 ## xmm0 = xmm0[1,1,2,3] vinsertps $48, %xmm0, %xmm1, %xmm0 ## xmm0 = xmm1[0,1,2],xmm0[0] To the expected: vmovq (%rdi), %xmm0 vmovhpd (%rsi), %xmm0, %xmm0 retq Fixing this for AVX2 is still open. Differential Revision: http://reviews.llvm.org/D6749 llvm-svn: 224759	2014-12-23 08:59:45 +00:00
Hal Finkel	867d174942	[PowerPC] Don't attempt a 64-bit pow2 division on PPC32 In r224033, in moving the signed power-of-2 division expansion into BuildSDIVPow2, I accidentally made it possible to attempt the lowering for a 64-bit division on PPC32. This later asserts. Fixes PR21928. llvm-svn: 224758	2014-12-23 08:38:50 +00:00
Michael Liao	e7760832e1	[SimplifyCFG] Revise common code sinking - Fix the case where more than 1 common instructions derived from the same operand cannot be sunk. When a pair of value has more than 1 derived values in both branches, only 1 derived value could be sunk. - Replace BB1 -> (BB2, PN) map with joint value map, i.e. map of (BB1, BB2) -> PN, which is more accurate to track common ops. llvm-svn: 224757	2014-12-23 08:26:55 +00:00
Michael Kuperstein	9dee8a54d7	Remove a bad cast in CloneModule() A cast that was introduced in r209007 was accidentally left in after the changes made to GlobalAlias rules in r210062. This crashes if the aliasee is a now-legal ConstantExpr. llvm-svn: 224756	2014-12-23 08:23:45 +00:00
Ahmed Bougacha	ac752e2bc3	[ARM] Don't break alignment when combining base updates into load/stores. r223862/r224203 tried to also combine base-updating load/stores. There was a mistake there: the alignment was added as is as an operand to the ARMISD::VLD/VST node. However, the VLD/VST selection logic doesn't care about less-than-standard alignment attributes. For example, no matter the alignment of a v2i64 load (say 1), SelectVLD picks VLD1q64 (because of the memory type). But VLD1q64 ("vld1.64 {dXX, dYY}") is 8-aligned, per ARMARMv7a 3.2.1. For the 1-aligned load, what we really want is VLD1q8. This commit introduces bitcasts if necessary, and changes the vld/vst type to one whose standard alignment matches the original load/store alignment. Differential Revision: http://reviews.llvm.org/D6759 llvm-svn: 224754	2014-12-23 06:07:31 +00:00
Alexey Samsonov	fc98b977fe	Fix UBSan bootstrap: replace shift of negative value with multiplication. llvm-svn: 224752	2014-12-23 04:15:53 +00:00
Alexey Samsonov	5dfc5859cf	Fix UBSan bootstrap: don't bind reference to nullptr. llvm-svn: 224751	2014-12-23 04:15:47 +00:00
Chandler Carruth	5fb4297ebd	Revert r224739: Debug info: Teach SROA how to update debug info for fragmented variables. This caused codegen to start crashing when we built somewhat large programs with debug info and optimizations. 'check-msan' hit in, and I suspect a bootstrap would as well. I mailed a test case to the review thread. llvm-svn: 224750	2014-12-23 02:58:14 +00:00
Jim Grosbach	19c4fa899d	X86: Don't over-align combined loads. When combining consecutive loads+inserts into a single vector load, we should keep the alignment of the base load. Doing otherwise can, and does, lead to using overly aligned instructions. In the included test case, for example, using a 32-byte vmovaps on a 16-byte aligned value. Oops. rdar://19190968 llvm-svn: 224746	2014-12-23 00:35:23 +00:00
Reid Kleckner	04fe8002a0	Make musttail more robust for vector types on x86 Previously I tried to plug musttail into the existing vararg lowering code. That turned out to be a mistake, because non-vararg calls use significantly different register lowering, even on x86. For example, AVX vectors are usually passed in registers to normal functions and memory to vararg functions. Now musttail uses a completely separate lowering. Hopefully this can be used as the basis for non-x86 perfect forwarding. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D6156 llvm-svn: 224745	2014-12-22 23:58:37 +00:00
David Blaikie	8bf70da31e	Remove dynamic allocation/indirection from GCOVBlocks owned by GCOVFunction Since these are all created in the DenseMap before they are referenced, there's no problem with pointer validity by the time it's required. This removes another use of DeleteContainerSeconds/manual memory management which I'm cleaning up from time to time. llvm-svn: 224744	2014-12-22 23:12:42 +00:00
Adrian Prantl	566772c5d4	Thumb1 frame lowering: Mark CFI instructions with the FrameSetup flag. Followup to r224294: ARM/AArch64: Attach the FrameSetup MIFlag to CFI instructions. Debug info marks the first instruction without the FrameSetup flag as being the end of the function prologue. Any CFI instructions in the middle of the function prologue would cause debug info to end the prologue too early and worse, attach the line number of the CFI instruction, which incidentally is often 0. llvm-svn: 224743	2014-12-22 23:09:14 +00:00
Chandler Carruth	d6a74b72f1	[SROA] Lift the logic for traversing the alloca slices one partition at a time into a partition iterator and a Partition class. There is a lot of knock-on simplification that this enables, largely stemming from having a Partition object to refer to in lots of helpers. I've only done a minimal amount of that because enough stuff is changing as-is in this commit. This shouldn't change any observable behavior. I've worked hard to preserve the exact traversal semantics which were originally present even though some of them make no sense. I'll be changing some of this in subsequent commits now that the logic is carefully factored into a reusable place. The primary motivation for this change is to break the rewriting into phases in order to support more intelligent rewriting. For example, I'm planning to change how split loads and stores are rewritten to remove the significant overuse of integer bit packing in the resulting code and allow more effective secondary splitting of aggregates. For any of this to work, they have to share the exact traversal logic. llvm-svn: 224742	2014-12-22 22:46:00 +00:00
Bruno Cardoso Lopes	c8d20ce475	[LCSSA] Handle PHI insertion in disjoint loops Take two disjoint Loops L1 and L2. LoopSimplify fails to simplify some loops (e.g. when indirect branches are involved). In such situations, it can happen that an exit for L1 is the header of L2. Thus, when we create PHIs in one of such exits we are also inserting PHIs in L2 header. This could break LCSSA form for L2 because these inserted PHIs can also have uses in L2 exits, which are never handled in the current implementation. Provide a fix for this corner case and test that we don't assert/crash on that. Differential Revision: http://reviews.llvm.org/D6624 rdar://problem/19166231 llvm-svn: 224740	2014-12-22 22:35:46 +00:00
Adrian Prantl	85354b18d3	Debug info: Teach SROA how to update debug info for fragmented variables. This allows us to generate debug info for extremely advanced code such as typedef struct { long int a; int b;} S; int foo(S s) { return s.b; } which at -O1 on x86_64 is codegen'd into define i32 @foo(i64 %s.coerce0, i32 %s.coerce1) #0 { ret i32 %s.coerce1, !dbg !24 } with this patch we emit the following debug info for this TAG_formal_parameter [3] AT_location( 0x00000000 0x0000000000000000 - 0x0000000000000006: rdi, piece 0x00000008, rsi, piece 0x00000004 0x0000000000000006 - 0x0000000000000008: rdi, piece 0x00000008, rax, piece 0x00000004 ) AT_name( "s" ) AT_decl_file( "/Volumes/Data/llvm/_build.ninja.release/test.c" ) Thanks to chandlerc, dblaikie, and echristo for their feedback on all previous iterations of this patch! llvm-svn: 224739	2014-12-22 22:26:00 +00:00
Reid Kleckner	eecdfef0b6	Fix Windows unwind info for functions in sections other than .text Previously we assumed the section name had the form .text$foo, which is what we used to do for inline functions. If the dollar wasn't present, we'd put unwind data in the .pdata and .xdata sections for the main .text section, which is incorrect. Fixes PR22001. llvm-svn: 224738	2014-12-22 22:10:08 +00:00
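
Commit 57ec25e7ce in the list above describes building a main live range by merging all subranges, so that the main range never covers a position where no subrange is live. The following is only a rough sketch of that merging idea, not LLVM's implementation: the function name `merge_subranges` is hypothetical, slot indexes are modeled as plain integers, segments are half-open `(start, end)` pairs, and value numbers are ignored entirely.

```python
from typing import List, Tuple

# Hypothetical model: a segment is a half-open interval [start, end)
# over integer slot indexes. Real LLVM segments also carry value numbers.
Segment = Tuple[int, int]


def merge_subranges(subranges: List[List[Segment]]) -> List[Segment]:
    """Return the union of all subrange segments as sorted, disjoint segments."""
    # Flatten every subrange into one list and sort by start position.
    segments = sorted(seg for sub in subranges for seg in sub)
    merged: List[Segment] = []
    for start, end in segments:
        if merged and start <= merged[-1][1]:
            # Overlaps or touches the previous segment: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            # A gap where no subrange is live: start a new segment.
            merged.append((start, end))
    return merged


if __name__ == "__main__":
    lanes = [[(0, 4), (10, 12)], [(2, 6)], []]
    print(merge_subranges(lanes))
```

Because the result is built only from positions covered by some subrange, the property the commit message asks for (every part of the main range has at least one live subrange) holds by construction in this simplified model.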

111103 Commits