llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 08:23:21 +01:00

Author	SHA1	Message	Date
Chandler Carruth	f44e3c7745	Simplify the AND-rooted mask+shift checking code to match that of the SRL-rooted code. llvm-svn: 147941	2012-01-11 09:35:04 +00:00
Chandler Carruth	5a0ea8a5fd	Unify the interface of the three mask+shift transform helpers, and factor the differences that were hiding in one of them into its other caller, the SRL handling code. No change in behavior. llvm-svn: 147940	2012-01-11 09:35:02 +00:00
Chandler Carruth	7d8eac052d	Clarify and make explicit some of the requirements for transforming mask+shift pairs at the beginning of the ISD::AND case block, and then hoist the final pattern into a helper function, simplifying and reflowing it appropriately. This should have no observable behavior change, but several simplifications fell out of this such as directly computing the new mask constant, etc. llvm-svn: 147939	2012-01-11 09:35:00 +00:00
Jakob Stoklund Olesen	9baf09c11a	Fix undefined code and reenable test case. I don't think the compact encoding code is right, but at least it has defined behavior now. llvm-svn: 147938	2012-01-11 09:08:04 +00:00
Chandler Carruth	068b0ca15a	Hoist the logic to transform shift+mask combinations into sub-register extracts and scaled addressing modes into its own helper function. No functionality changed here, just hoisting and layout fixes falling out of that hoisting. llvm-svn: 147937	2012-01-11 08:48:20 +00:00
Chandler Carruth	b3371fa250	Teach the X86 instruction selection to do some heroic transforms to detect a pattern which can be implemented with a small 'shl' embedded in the addressing mode scale. This happens in real code as follows: unsigned x = my_accelerator_table[input >> 11]; Here we have some lookup table that we look into using the high bits of 'input'. Each entity in the table is 4-bytes, which means this implicitly gets turned into (once lowered out of a GEP): *(unsigned*)((char*)my_accelerator_table + ((input >> 11) << 2)); The shift right followed by a shift left is canonicalized to a smaller shift right and masking off the low bits. That hides the shift right which x86 has an addressing mode designed to support. We now detect masks of this form, and produce the longer shift right followed by the proper addressing mode. In addition to saving a (rather large) instruction, this also reduces stalls in Intel chips on benchmarks I've measured. In order for all of this to work, one part of the DAG needs to be canonicalized still further than it currently is. This involves removing pointless 'trunc' nodes between a zextload and a zext. Without that, we end up generating spurious masks and hiding the pattern. llvm-svn: 147936	2012-01-11 08:41:08 +00:00
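The canonicalization described in the commit above can be checked with a small standalone sketch. It only illustrates the arithmetic identity between the shift pair and the shift+mask form for 32-bit unsigned values; the function names are hypothetical and are not taken from the LLVM sources.

```cpp
#include <cassert>
#include <cstdint>

// Original form: scale the table index (input >> 11) by 4 bytes per entry.
constexpr std::uint32_t offset_shift_pair(std::uint32_t input) {
    return (input >> 11) << 2;
}

// Canonicalized form: a smaller right shift, then mask off the low two bits.
constexpr std::uint32_t offset_shift_mask(std::uint32_t input) {
    return (input >> 9) & ~std::uint32_t{3};
}

// Hypothetical helper: true when both forms give the same byte offset.
constexpr bool forms_agree(std::uint32_t input) {
    return offset_shift_pair(input) == offset_shift_mask(input);
}

// Spot-check a few inputs, including the extremes.
inline void check_shift_mask_identity() {
    assert(forms_agree(0u));
    assert(forms_agree(0x12345678u));
    assert(forms_agree(0xFFFFFFFFu));
}
```

Because the two forms agree, an instruction selector that sees the masked form can, in principle, recover the longer shift right and fold the remaining scale-by-4 into the addressing mode, which is the transform the commit message describes.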
Stepan Dyatkovskiy	7ba274153a	Improved compile time: 1. Size heuristics changed. Now we calculate the number of unswitching branches only once per loop. 2. Some checks were moved from UnswitchIfProfitable to processCurrentLoop, since they do not change during a processCurrentLoop iteration. This allows some loops to be skipped at an early stage. Extended statistics: - Added total number of instructions analyzed. llvm-svn: 147935	2012-01-11 08:40:51 +00:00
NAKAMURA Takumi	71c80e7fe7	llvm/test/CodeGen/X86/zext-fold.ll: Relax an expression in stack offset. llvm-svn: 147928	2012-01-11 07:34:22 +00:00
NAKAMURA Takumi	83fb05f0c7	llvm/test/CodeGen/X86/sub-with-overflow.ll: Add explicit -mtriple=i686-linux. llvm-svn: 147927	2012-01-11 07:34:14 +00:00
Andrew Trick	1454836d05	Clarified the SCEV getSmallConstantTripCount interface with in-your-face comments. This interface is misleading and dangerous, but it is actually what we need for unrolling. llvm-svn: 147926	2012-01-11 06:52:55 +00:00
Rafael Espindola	c326212da6	Add big endian mips support. Based on a patch by Jack Carter. llvm-svn: 147924	2012-01-11 04:04:14 +00:00
Rafael Espindola	fff89417f5	Add the skeleton of an asm parser for mips. llvm-svn: 147923	2012-01-11 03:56:41 +00:00
Andrew Trick	393dc735f6	ARM Ld/St Optimizer fix. Allow LDRD to be formed from pairs with different LDR encodings. This was the original intention of the pass. Somewhere along the way, the LDR opcodes were refined which broke the optimization. We really don't care what the original opcodes are as long as they both map to the same LDRD and the immediate still fits. Fixes rdar://10435045 ARMLoadStoreOptimization cannot handle mixed LDRi8/LDRi12 llvm-svn: 147922	2012-01-11 03:56:08 +00:00
Jakob Stoklund Olesen	77f594c37b	Disable test that seems to expose an unrelated Linux issue. llvm-svn: 147921	2012-01-11 03:42:27 +00:00
Jakob Stoklund Olesen	37e4396a06	Detect when a value is undefined on an edge to a landing pad. Consider this code: int h() { int x; try { x = f(); g(); } catch (...) { return x+1; } return x; } The variable x is undefined on the first edge to the landing pad, but it has the f() return value on the second edge to the landing pad. SplitAnalysis::getLastSplitPoint() would assume that the return value from f() was live into the landing pad when f() throws, which is of course impossible. Detect these cases, and treat them as if the landing pad wasn't there. This allows spill code to be inserted after the function call to f(). <rdar://problem/10664933> llvm-svn: 147912	2012-01-11 02:07:05 +00:00
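The example from the commit above can be made runnable as a minimal sketch. The bodies of f() and g() are assumptions added here purely for illustration (f() returns a value, g() always throws), so only the second edge to the landing pad is exercised and x is always defined when it is read.

```cpp
#include <cassert>
#include <stdexcept>

// Hypothetical stand-ins for the f() and g() named in the commit message.
int f() { return 41; }
void g() { throw std::runtime_error("g failed"); }

int h() {
    int x;            // deliberately uninitialized, as in the commit message
    try {
        x = f();      // first edge: if f() threw, x would have no value yet
        g();          // second edge: when g() throws, x holds f()'s result
    } catch (...) {
        return x + 1; // reached here via g(), so x is 41
    }
    return x;
}

// With the stand-ins above, the handler runs after g() throws.
inline void check_landing_pad_example() {
    assert(h() == 42);
}
```

If f() itself threw, the handler would read an indeterminate x, which is why the value returned by f() cannot be live into the landing pad on that edge.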
Jakob Stoklund Olesen	63258fcd99	Exclusively use SplitAnalysis::getLastSplitPoint(). Delete the alternative implementation in LiveIntervalAnalysis. These functions computed the same thing, but SplitAnalysis caches the result. llvm-svn: 147911	2012-01-11 02:07:00 +00:00
Evan Cheng	f7f94542fc	Avoid CSE of instructions which define physical registers across MBBs unless the physical registers are not allocatable. llvm-svn: 147902	2012-01-11 00:38:11 +00:00
Bill Wendling	2a03f15116	If the global variable is removed by the linker, then don't constant merge it with other symbols. An object in the __cfstring section is supposed to be filled with CFString objects, which have a pointer to ___CFConstantStringClassReference followed by a pointer to a __cstring. If we allow the object in the __cstring section to be merged with another global, then it could end up in any section. Because the linker is going to remove these symbols in the final executable, we shouldn't bother to merge them. <rdar://problem/10564621> llvm-svn: 147899	2012-01-11 00:13:08 +00:00
Eric Christopher	cc62a64250	Don't avoid recursing for pointer types, just reference types. Expand on the comment. Fixes constvars.exp on the gdb test builder. llvm-svn: 147897	2012-01-11 00:01:29 +00:00
Chad Rosier	830f1909d8	Add test case for r147881. llvm-svn: 147891	2012-01-10 23:09:53 +00:00
Lang Hames	11ff139f8f	Fixed order of operands in comment to match code. llvm-svn: 147890	2012-01-10 22:53:20 +00:00
Joerg Sonnenberger	fccad68f65	Default stack alignment for 32bit x86 should be 4 Bytes, not 8 Bytes. Add a test that checks the stack alignment of a simple function for Darwin, Linux and NetBSD for 32bit and 64bit mode. llvm-svn: 147888	2012-01-10 22:43:53 +00:00
Jakob Stoklund Olesen	7f7f8a2e77	Consider unknown alignment caused by OptimizeThumb2Instructions(). This function runs after all constant islands have been placed, and may shrink some instructions to their 2-byte forms. This can actually cause some constant pool entries to move out of range because of growing alignment padding. Treat instructions that may be shrunk the same as inline asm - they erode the known alignment bits. Also reinstate an old assertion in verify(). It is correct now that basic block offsets include alignments. Add a single large test case that will hopefully exercise many parts of the constant island pass. <rdar://problem/10670199> llvm-svn: 147885	2012-01-10 22:32:14 +00:00
Evan Cheng	6466bb6919	80 col violation. llvm-svn: 147884	2012-01-10 22:27:32 +00:00
Chad Rosier	7bab07a5f1	Add missing VEX predicates to VMOVSDto64rr/VMOVSDto64mr. This fixes a few failing test cases on our internal AVX nightly tester. rdar://10663637 llvm-svn: 147881	2012-01-10 22:14:06 +00:00
Devang Patel	90a5a47ef8	Let asm parser query asm syntax dialect. llvm-svn: 147880	2012-01-10 21:49:42 +00:00
Kevin Enderby	21c229f1fb	This is the matching change for the data structure name changes for the functional change in r147860 to use DW_TAG_label's instead of TAG_subprogram's. This only changes names and updates comments. No functional change. llvm-svn: 147877	2012-01-10 21:12:34 +00:00
Jim Grosbach	59537e1ce3	ARM updating VST2 pseudo-lowering fixed vs. register update. rdar://10663487 llvm-svn: 147876	2012-01-10 21:11:12 +00:00
Benjamin Kramer	94637ab6c6	Fix some leftover control reaches end of non-void function warnings. llvm-svn: 147874	2012-01-10 20:47:20 +00:00
Chandler Carruth	ecd9169f3a	Teach the triple library about the androideabi environment. Patch by Evgeniy Stepanov. llvm-svn: 147871	2012-01-10 19:46:00 +00:00
Richard Smith	03de404e6f	Move default case for covered enum outside of switch. llvm-svn: 147870	2012-01-10 19:43:09 +00:00
Bill Wendling	4a4be29bb5	For i386, don't use the generic code. As the comment around 7746 says, it's better to use the x87 extended precision here than SSE. And the generic code doesn't know how to do that. It also regains the speed lost for the uint64_to_float.c testcase. <rdar://problem/10669858> llvm-svn: 147869	2012-01-10 19:41:30 +00:00
Richard Smith	9cf5d2b6f5	Fix a -Wreturn-type warning in g++. llvm-svn: 147867	2012-01-10 19:10:22 +00:00
Chandler Carruth	844b5fc832	Cleanup these asserts to follow common LLVM style and coding conventions. Also, clarify the grouping of one of the asserts to silence -Wparentheses. llvm-svn: 147863	2012-01-10 18:18:52 +00:00
Chandler Carruth	2a6b59a693	Add 'llvm_unreachable' to pacify GCC's understanding of the constraints of several newly un-defaulted switches. This also helps optimizers (including LLVM's) recognize that every case is covered, and we should assume as much. llvm-svn: 147861	2012-01-10 18:08:01 +00:00
Kevin Enderby	75f4b470f9	Various crash reporting tools have a problem with the dwarf generated for assembly source when it generates the TAG_subprogram dwarf debug info for the labels that have nothing between them as in this bit of assembly source: % cat ZeroLength.s _func1: _func2: nop One solution would be to not emit the subsequent labels with the same address and use the next label with a different address or the end of the section for the AT_high_pc value of the TAG_subprogram. Turns out in llvm-mc it is not possible in all cases to determine if two symbols have the same value at the point we put out the TAG_subprogram dwarf debug info. So we will have llvm-mc instead of putting out TAG_subprogram's put out DW_TAG_label's. And the DW_TAG_label does not have an AT_high_pc value which avoids the problem. This commit is only the functional change to make the diffs clear as to what is really being changed. The next commit will be to clean up the names of such things like MCGenDwarfSubprogramEntry to something like MCGenDwarfLabelEntry. rdar://10666925 llvm-svn: 147860	2012-01-10 17:52:29 +00:00
Devang Patel	9a08a580a7	Add definition for intel asm variant. Right now, this just adds additional entries in match table. The parser does not use them yet. llvm-svn: 147859	2012-01-10 17:51:54 +00:00
Devang Patel	c1e4ca5839	Record asm variant id in MatchEntry and check it while matching instruction. llvm-svn: 147858	2012-01-10 17:50:43 +00:00
David Blaikie	8d47bb30e3	Remove unnecessary default cases in switches that cover all enum values. llvm-svn: 147855	2012-01-10 16:47:17 +00:00
Nadav Rotem	969b8a6903	Fix a bug in the legalization of shuffle vectors. When we emulate shuffles using BUILD_VECTORS we may be using a BV of different type. Make sure to cast it back. llvm-svn: 147851	2012-01-10 14:28:46 +00:00
Benjamin Kramer	1f1a76c7af	Add definitions for AMD's bobcat (aka btver1) llvm-svn: 147846	2012-01-10 11:50:02 +00:00
Craig Topper	01eba20904	Fix a crash in AVX2 when trying to broadcast a double into a 128-bit vector. There is no vbroadcastsd xmm, but we do need to support 64-bit integers broadcasted into xmm. Also factor the AVX check into the isVectorBroadcast function. This makes more sense since the AVX2 check was already inside. llvm-svn: 147844	2012-01-10 08:23:59 +00:00
Craig Topper	9beee30168	Remove hasXMM/hasXMMInt functions. Move callers to hasSSE1/hasSSE2. This is the final piece to remove the AVX hack that disabled SSE. llvm-svn: 147843	2012-01-10 06:54:16 +00:00
Craig Topper	5f6f96da91	Remove hasSSEorAVX functions and change all callers to use just hasSSE. AVX is now an SSE level and no longer disables SSE checks. llvm-svn: 147842	2012-01-10 06:37:29 +00:00
Craig Topper	c9756440ea	Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget. llvm-svn: 147841	2012-01-10 06:30:56 +00:00
Evan Cheng	7855c5d08f	Allow machine-cse to look across MBB boundary when cse'ing instructions that define physical registers. It's currently very restrictive, only catching cases where the CE is in an immediate (and only) predecessor. But it catches a surprisingly large number of cases. rdar://10660865 llvm-svn: 147827	2012-01-10 02:02:58 +00:00
Andrew Trick	db66631fb3	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Jakob Stoklund Olesen	f9de299af4	Accurately model hardware alignment rounding. On Thumb, the displacement computation hardware uses the address of the current instruction rounded down to a multiple of 4. Include this rounding in the UserOffset we compute for each instruction. When inline asm is present, the instruction alignment may not be known. Constrain the maximum displacement instead in that case. This makes it possible for CreateNewWater() and OffsetIsInRange() to agree about the valid displacements. When they disagree, infinite looping happens. As always, test cases for this stuff are insane. <rdar://problem/10660175> llvm-svn: 147825	2012-01-10 01:34:59 +00:00
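The rounding described in the commit above can be sketched as follows. This is a simplified model rather than the code in ARMConstantIslandPass: the helper names, the forward-only displacement, and the 1020-byte limit used in the checks are assumptions chosen for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Round an address down to a multiple of 4 by clearing the low two bits.
constexpr std::uint32_t align_down_4(std::uint32_t addr) {
    return addr & ~std::uint32_t{3};
}

// Hypothetical range check: the displacement is measured from the rounded
// address of the using instruction, and only forward targets are accepted.
constexpr bool in_range(std::uint32_t user_addr, std::uint32_t target,
                        std::uint32_t max_disp) {
    const std::uint32_t base = align_down_4(user_addr);
    return target >= base && target - base <= max_disp;
}

// A user at 0x1002 is treated as if it sat at 0x1000.
inline void check_alignment_rounding() {
    assert(align_down_4(0x1002u) == 0x1000u);
    assert(in_range(0x1002u, 0x1000u + 1020u, 1020u));
    // 1020 bytes past the unrounded address is 1022 past the rounded base.
    assert(!in_range(0x1002u, 0x1002u + 1020u, 1020u));
}
```

The last check shows why the rounding matters: a range test that ignored it would accept a target that this model puts 2 bytes out of reach, and that kind of disagreement between two range computations is what the commit message blames for the infinite looping.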
Rafael Espindola	6b018e0b1d	Remove the logging streamer. llvm-svn: 147820	2012-01-10 00:40:39 +00:00
Jakob Stoklund Olesen	eb6540fdbf	Catch runaway ARMConstantIslandPass even in -Asserts builds. The pass is prone to looping, and it is better to crash than loop forever, even in a -Asserts build. <rdar://problem/10660175> llvm-svn: 147806	2012-01-09 22:16:24 +00:00

79255 Commits