llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	185c3797be	Break up getProfitableChainIncrement(). The required checks are moved to ChainInstruction() itself and the policy decisions are moved to IVChain::isProfitableInc(). Also cache the ExprBase in IVChain to avoid frequent recomputations. No functional change intended. llvm-svn: 155676	2012-04-26 23:33:11 +00:00
Jakob Stoklund Olesen	613b89fecd	Turn IVChain into a struct. No functional change intended. llvm-svn: 155675	2012-04-26 23:33:09 +00:00
Chad Rosier	f3d4646377	Add instcombine patterns for the following transformations: (x & y) \| (x ^ y) -> x \| y (x & y) + (x ^ y) -> x \| y Patch by Manman Ren. rdar://10770603 llvm-svn: 155674	2012-04-26 23:29:14 +00:00
Evan Cheng	4be3b28946	DumpSegment64Command() wasn't returning correct result. Caught by static analyzer. rdar://11329354 llvm-svn: 155669	2012-04-26 22:07:28 +00:00
Andrew Trick	1aa00c0baa	Fix the SD scheduler to avoid gluing the same node twice. DAGCombine strangeness may result in multiple loads from the same offset. They both may try to glue themselves to another load. We could insist that the redundant loads glue themselves to each other, but the beter fix is to bail out from bad gluing at the time we detect it. Fixes rdar://11314175: BuildSchedUnits assert. llvm-svn: 155668	2012-04-26 21:48:25 +00:00
Ted Kremenek	ec38ddb19c	Defensively guard against calling malloc() with a size of zero. llvm-svn: 155661	2012-04-26 20:54:27 +00:00
Jim Grosbach	bf9adf2ab5	ARM: Thumb ldr(literal) base address alignment is 32-bits. The base address for the PC-relative load is Align(PC,4), so it's the address of the word containing the 16-bit instruction, not the address of the instruction itself. Ugh. rdar://11314619 llvm-svn: 155659	2012-04-26 20:48:12 +00:00
Joerg Sonnenberger	33d3ecdf42	Add note about returns_twice magic removal from LLVM itself. llvm-svn: 155657	2012-04-26 20:10:07 +00:00
Preston Gurd	fb1760744d	Trivial change to set UseLeaForSP flag in addition to toggling the FeatureLeaForSP feature bit when llvm auto detects Intel Atom. Patch by Andy Zhang llvm-svn: 155655	2012-04-26 19:52:27 +00:00
Michael J. Spencer	fbbb8b9593	[CMake] Restructure how Clang, Polly and other external projects get included. While making lld build under the tools directory I decided to refactor how this works. There is now a macro, add_llvm_external_project, which takes the name of the expected subdirectory. This sets up two CMake options. * LLVM_EXTERNAL_${NAME}_SOURCE_DIR This is the path to the source. It defaults to ${CMAKE_CURRENT_SOURCE_DIR}/${name}. * LLVM_EXTERNAL_${NAME}_BUILD Enable and disable building the tool as part of LLVM. I chose LLVM_EXTERNAL_${NAME} as a prefix so they all show up together in the GUI. llvm-svn: 155654	2012-04-26 19:43:35 +00:00
Michael J. Spencer	0dd0f2c8f0	[Support/YAML] Properly fix unitialized variable warning by inserting a 'REPLACEMENT CHARACTER' (U+FFFD) when getAsInteger fails. llvm-svn: 155653	2012-04-26 19:27:11 +00:00
Stepan Dyatkovskiy	4c3c675cca	Fixed SmallMap test. The order of items is undefined in DenseMap. So being checking the increment for big mode, we can only check that all items are in map. llvm-svn: 155651	2012-04-26 18:45:24 +00:00
Tim Northover	876c151146	Use VLD1 in NEON extenting-load patterns instead of VLDR. On some cores it's a bad idea for performance to mix VFP and NEON instructions and since these patterns are NEON anyway, the NEON load should be used. llvm-svn: 155630	2012-04-26 08:46:29 +00:00
Tim Northover	b83dc53c3a	Test commit. llvm-svn: 155626	2012-04-26 08:24:07 +00:00
Craig Topper	f883096ff7	Enable detection of AVX and AVX2 support through CPUID. Add AVX/AVX2 to corei7-avx, core-avx-i, and core-avx2 cpu names. llvm-svn: 155618	2012-04-26 06:40:15 +00:00
Chandler Carruth	587c136c31	Teach the reassociate pass to fold chains of multiplies with repeated elements to minimize the number of multiplies required to compute the final result. This uses a heuristic to attempt to form near-optimal binary exponentiation-style multiply chains. While there are some cases it misses, it seems to at least a decent job on a very diverse range of inputs. Initial benchmarks show no interesting regressions, and an 8% improvement on SPASS. Let me know if any other interesting results (in either direction) crop up! Credit to Richard Smith for the core algorithm, and helping code the patch itself. llvm-svn: 155616	2012-04-26 05:30:30 +00:00
Evan Cheng	505263cb36	Specify cpu to unbreak tests. llvm-svn: 155604	2012-04-26 01:38:10 +00:00
Evan Cheng	4d570a3f0e	If triple is armv7 / thumbv7 and a CPU is specified, do not automatically assume the feature set of v7a. This comes about if the user specifies something like -arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as uxtab in this case. rdar://11318438 llvm-svn: 155601	2012-04-26 01:13:36 +00:00
Bill Wendling	d9d9230b83	Don't forget to reset 'first operand' flag when we're setting the MDNodeOperand value. llvm-svn: 155599	2012-04-26 00:38:42 +00:00
Jakob Stoklund Olesen	b35c6a6369	Try to fix llvm-arm-linux builder with -mcpu. llvm-svn: 155589	2012-04-25 21:22:33 +00:00
Preston Gurd	237e411bb4	Trivial change to make the test use -mcpu=generic so as to avoid a failure if run on an Intel Atom with post RA instruction scheduling. llvm-svn: 155587	2012-04-25 21:04:54 +00:00
Benjamin Kramer	36acdd4832	Reapply the SmallMap patch with a fix. Comparing ~0UL with an unsigned will always return false when long is 64 bits long. llvm-svn: 155568	2012-04-25 18:01:58 +00:00
Jakob Stoklund Olesen	e2913e1ad5	Print IV chain numbers while collecting them. llvm-svn: 155567	2012-04-25 18:01:32 +00:00
Jakob Stoklund Olesen	24c99d2966	Remove more dead code. llvm-svn: 155566	2012-04-25 18:01:30 +00:00
Richard Barton	e9a972bbe3	Unify internal representation of ARM instructions with a register right-shifted by #32 . These are stored as shifts by #0 in the MCInst and correctly marshalled when transforming from or to assembly representation. llvm-svn: 155565	2012-04-25 18:00:18 +00:00
Eric Christopher	d38d9bb28b	Revert "First implementation of:" This reverts commit 76271a3366731d4c372fdebcd8d3437e6e09a61b. as it's breaking the bots. llvm-svn: 155562	2012-04-25 17:51:00 +00:00
Stepan Dyatkovskiy	a580659b6c	First implementation of: - FlatArrayMap. Very simple map container that uses flat array inside. - MultiImplMap. Map container interface, that has two modes, one for small amount of elements and one for big amount. - SmallMap. SmallMap is DenseMap compatible MultiImplMap. It uses FlatArrayMap for small mode, and DenseMap for big mode. Also added unittests for new classes and update for ProgrammersManual. For more details about new classes see ProgrammersManual and comments in sourcecode. llvm-svn: 155557	2012-04-25 17:09:38 +00:00
Jakob Stoklund Olesen	d8587c363b	Simplify LiveIntervals::getApproximateInstructionCount(). This function is only used for a heuristic during -join-physregs. It doesn't need floating point. llvm-svn: 155554	2012-04-25 16:32:23 +00:00
Jakob Stoklund Olesen	dbd5f4eb96	Remove a dead function. llvm-svn: 155553	2012-04-25 16:32:20 +00:00
Jakob Stoklund Olesen	7f1be74a4a	Remove the -disable-cross-class-join option. Cross-class joins have been normal and fully supported for a while now. With TableGen generating the getMatchingSuperRegClass() hook, they are unlikely to cause problems again. llvm-svn: 155552	2012-04-25 16:17:50 +00:00
Jakob Stoklund Olesen	b8d98c5060	Cross-class joining is winning. Remove the heuristic for disabling cross-class joins. The greedy register allocator can handle the narrow register classes, and when it splits a live range, it can pick a larger register class. Benchmarks were unaffected by this change. <rdar://problem/11302212> llvm-svn: 155551	2012-04-25 16:17:47 +00:00
Craig Topper	5828c654b9	Add ifdef around getSubtargetFeatureName in tablegen output file so that only targets that want the function get it. This prevents other targets from getting an unused function warning. llvm-svn: 155538	2012-04-25 06:56:34 +00:00
Craig Topper	1a016fd95d	Use vector_shuffles instead of target specific unpack nodes for AVX ZERO_EXTEND/ANY_EXTEND combine. These will be converted to target specific nodes during lowering. This is more consistent with other code. llvm-svn: 155537	2012-04-25 06:39:39 +00:00
Chris Lattner	f492d85ae9	openbsd doesn't support soname, patch by Brad Smith! llvm-svn: 155536	2012-04-25 06:37:20 +00:00
Chandler Carruth	8821a79947	Actually delete now-empty file. llvm-svn: 155532	2012-04-25 02:30:00 +00:00
Lang Hames	7f69fbca29	Reverting r155468. Chris and Chandler have convinced me that it's dangerous and in poor taste. Talking through some alternate solutions with Chandler. llvm-svn: 155530	2012-04-25 02:16:54 +00:00
Akira Hatanaka	b3ecf903f1	Do not use $gp as a dedicated global register if the target ABI is not O32. llvm-svn: 155522	2012-04-25 01:24:52 +00:00
Andrew Trick	f76f2597a3	typo in declaration from earlier today llvm-svn: 155519	2012-04-25 01:11:22 +00:00
Dan Gohman	64171a0b3b	Simplify the known retain count tracking; use a boolean state instead of a precise count. Also, move RRInfo's Partial field into PtrState, now that it won't increase the size. llvm-svn: 155513	2012-04-25 00:50:46 +00:00
Dan Gohman	3a24d34041	Build custom predecessor and successor lists for each basic block. These lists exclude invoke unwind edges and loop backedges which are being ignored. This makes it easier to ignore them consistently. llvm-svn: 155500	2012-04-24 22:53:18 +00:00
Jim Grosbach	7ac2ac85a8	ARM: improved assembler diagnostics for missing CPU features. When an instruction match is found, but the subtarget features it requires are not available (missing floating point unit, or thumb vs arm mode, for example), issue a diagnostic that identifies what the feature mismatch is. rdar://11257547 llvm-svn: 155499	2012-04-24 22:40:08 +00:00
Andrew Trick	47f01c373e	Fix a naughty header include that breaks "installed" builds. llvm-svn: 155486	2012-04-24 20:36:19 +00:00
Nadav Rotem	3c817bb807	ConstantFoldSelectInstruction swapped the operands of the select. Fix 12592. Patch by Matt Pharr. llvm-svn: 155480	2012-04-24 20:18:49 +00:00
Nadav Rotem	62a0eeb276	Fix the testcase. We do expect two vblendw on XMMs. llvm-svn: 155477	2012-04-24 19:57:38 +00:00
Nadav Rotem	d35ec2e4ad	Add a testcase for 155440 llvm-svn: 155475	2012-04-24 19:45:28 +00:00
Evan Cheng	7f9bf43bcf	MachineBasicBlock::SplitCriticalEdge() should follow LLVM IR variant and refuse to break edge to EH landing pad. rdar://11300144 llvm-svn: 155470	2012-04-24 19:06:55 +00:00
Lang Hames	08eb5f2340	Add support for llvm.arm.neon.vmull* intrinsics to InstCombine. This fixes <rdar://problem/11291436>. llvm-svn: 155468	2012-04-24 18:58:36 +00:00
Chandler Carruth	eb9c5df516	Fix a crash on valid (if UB) bitcode that is produced for some global constants in C++11 mode. I have no idea why it required such particular circumstances to get here, the code seems clearly to rely upon unchecked assumptions. Specifically, when we decide to form an index into a struct type, we may have gone through (at least one) zero-length array indexing round, which would have left the offset un-adjusted, and thus not necessarily valid for use when indexing the struct type. This is just an canonicalization step, so the correct thing is to refuse to canonicalize nonsensical GEPs of this form. Implemented, and test case added. Fixes PR12642. Pair debugged and coded with Richard Smith. =] I credit him with most of the debugging, and preventing me from writing the wrong code. llvm-svn: 155466	2012-04-24 18:42:47 +00:00
Jim Grosbach	69d9654f7c	ARM: Nuke remnant bogus code. r154362 was supposed to delete this bit, but obviously didn't. rdar://11305594 llvm-svn: 155465	2012-04-24 18:39:47 +00:00
Stepan Dyatkovskiy	37fad66b80	Related to PR1255. Let's begin. I'll commit classes that corresponds to our latest PR1255 discussion posts in llvm-commits. Strategy. 0. Implement new classes. Classes doesn't affect anything. They still work with ConstantInt base values at this stage. 1. Fictitious replacement of current ConstantInt case values with ConstantRangesSet. Case ranges set will still hold single value, and ConstantInt getCaseValue() will return it. But additionally implement new method in SwitchInst that allows to work with case ranges. Currenly I think it should be some wrapper that returns either single value or ConstantRangesSet object. 2. Step-by-step replacement of old "ConstantInt getCaseValue()" with new alternative. Modify algorithms for all passes that works with SwitchInst. But don't modify LLParser and BitcodeReader/Writer. Still hold single value in each ConstantRangesSet object. On this stage some parts of LLVM will use old-style methods, and some ones new-style. 3. After all getCaseValue() usages will removed and whole LLVM and its clients will work in new style - modify LLParser, Reader and Writer. Remove getCaseValue(). 4. Replace ConstantInt-based case ranges set items with APInt ones. Currently we are on Zero Stage: New classes. ConstantRangesSet. I selected ConstantArrays as case ranges set "holder" object (it is a temporary decision, I'll explain why below). The array items are may be ConstantVectors with single item, and ConstantVectors with two items (that means single number and range respectively). The ConstantInt will used as basic value representation. It will replaced with APInt then. Of course ConstantArray and ConstantVector will go away after ConstantInt => APInt replacement. New class mandatory features: - bool isSatisfies(ConstantInt V) method (need better name?). Returns true if the given value satisfies this case. - Case's ranges and values enumeration. In some passes we need to analize each case (SwitchLowering for example). Factory + unified clusterify. I also propose to implement the factory that allows to build case object with user friendly way. I called it CRSBuilder by now. Currenly I implemented the factory that allows add,remove pairs of range+successor. It also allows add existing ConstantRangesSet decompiling it to separated ranges. Factory can emit either clusters set (single case range + successor) or the set of "ConstantRangesSet + Successor" pairs. So you can use it either as builder for new cases set for SwitchInst, or for clusterification of existing cases set. Just call Factory.optimize() and it emits optimized and sorted clusters collection for you! I tested clusterification on SelectionDAGBuilder - it works fine. Don't worry it was not included in this patch. Just new classes. Factory is a template. There are two params: SuccessorClass and IsReadonly. So you can specify what successor you need (BB or MBB). And you can also restrict your factory to use values in read-only mode (SelectionDAGBuilder need IsReadonly=true). Read-only factory couldn't build the cases ranges. llvm-svn: 155464	2012-04-24 18:31:10 +00:00

1 2 3 4 5 ...

81921 Commits