llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	10c3144381	Fix a quadratic algorithm in MachineBranchProbabilityInfo. The getSumForBlock function was quadratic in the number of successors because getSuccWeight would perform a linear search for an already known iterator. llvm-svn: 161460	2012-08-08 00:20:37 +00:00
Dan Gohman	df7f8afaf2	Avoid recomputing the unique exit blocks and their insert points when doing multiple scalar promotions on a single loop. This also has the effect of preserving the order of stores sunk out of loops, which is aesthetically pleasing, and it happens to fix the testcase in PR13542, though it doesn't fix the underlying problem. llvm-svn: 161459	2012-08-08 00:00:26 +00:00
Jakob Stoklund Olesen	08414c1860	Skip tied operand pairs that already have the same register. llvm-svn: 161454	2012-08-07 22:47:06 +00:00
Jakob Stoklund Olesen	3c221664e3	Add SelectionDAG::getTargetIndex. This adds support for TargetIndex operands during isel. The meaning of these (index, offset, flags) operands is entirely defined by the target. llvm-svn: 161453	2012-08-07 22:37:05 +00:00
Bob Wilson	51c50d44b7	Fix a serious typo in InstCombine's optimization of comparisons. An unsigned value converted to floating-point will always be greater than a negative constant. Unfortunately InstCombine reversed the check so that unsigned values were being optimized to always be greater than all positive floating-point constants. <rdar://problem/12029145> llvm-svn: 161452	2012-08-07 22:35:16 +00:00
Evan Cheng	96c6741fad	X86 cmp lowering is looking past truncate on the condition node. It should only do so when the high bits are known zero. This caused a subtle miscompilation. rdar://12027825 llvm-svn: 161451	2012-08-07 22:21:00 +00:00
Bill Wendling	06a580b5da	For non-Darwin platforms, we want to generate stack protectors only for character arrays. This is in line with what GCC does. <rdar://problem/10529227> llvm-svn: 161446	2012-08-07 20:59:05 +00:00
Jakob Stoklund Olesen	301af79343	Add a new kind of MachineOperand: MO_TargetIndex. A target index operand looks a lot like a constant pool reference, but it is completely target-defined. It contains the 8-bit TargetFlags, a 32-bit index, and a 64-bit offset. It is preserved by all code generator passes. TargetIndex operands can be used to carry target-specific information in cases where immediate operands won't suffice. llvm-svn: 161441	2012-08-07 18:56:39 +00:00
Andrew Kaylor	adda20daff	Enable lazy compilation in MCJIT llvm-svn: 161438	2012-08-07 18:33:00 +00:00
Jakob Stoklund Olesen	8836660866	Fix a couple of typos. llvm-svn: 161437	2012-08-07 18:32:57 +00:00
Jakob Stoklund Olesen	438bc30c3d	Add trace accessor methods, implement primitive if-conversion heuristic. Compare the critical paths of the two traces through an if-conversion candidate. If the difference is larger than the branch prediction penalty, reject the if-conversion. It would never pay. llvm-svn: 161433	2012-08-07 18:02:19 +00:00
Jim Grosbach	a22a0e50a1	Tidy up a bit. llvm-svn: 161430	2012-08-07 17:53:14 +00:00
Rafael Espindola	ccfbbaa11f	The dominance computation already has logic for computing if an edge dominates a use or a BB, but it is inline in the handling of the invoke instruction. This patch refactors it so that it can be used in other cases. For example, in define i32 @f(i32 %x) { bb0: %cmp = icmp eq i32 %x, 0 br i1 %cmp, label %bb2, label %bb1 bb1: br label %bb2 bb2: %cond = phi i32 [ %x, %bb0 ], [ 0, %bb1 ] %foo = add i32 %cond, %x ret i32 %foo } GVN should be able to replace %x with 0 in any use that is dominated by the true edge out of bb0. In the above example the only such use is the one in the phi. llvm-svn: 161429	2012-08-07 17:30:46 +00:00
Hal Finkel	aa174abb14	Add a comment about mftb vs. mfspr on PPC. Thanks to Alex Rosenberg for the suggestion. llvm-svn: 161428	2012-08-07 17:04:20 +00:00
Alexey Samsonov	fa8f91a368	Fix the representation of debug line table in DebugInfo LLVM library, and "instruction address -> file/line" lookup. Instead of plain collection of rows, debug line table for compilation unit is now treated as the number of row ranges, describing sequences (series of contiguous machine instructions). The sequences are not always listed in the order of increasing address, so previously used std::lower_bound() sometimes produced wrong results. Now the instruction address lookup consists of two stages: finding the correct sequence, and searching for address in range of rows for this sequence. llvm-svn: 161414	2012-08-07 11:46:57 +00:00
Benjamin Kramer	b8389165be	PR13095: Give an inline cost bonus to functions using byval arguments. We give a bonus for every argument because the argument setup is not needed anymore when the function is inlined. With this patch we interpret byval arguments as a compact representation of many arguments. The byval argument setup is implemented in the backend as an inline memcpy, so to model the cost as accurately as possible we take the number of pointer-sized elements in the byval argument and give a bonus of 2 instructions for every one of those. The bonus is capped at 8 elements, which is the number of stores at which the x86 backend switches from an expanded inline memcpy to a real memcpy. It would be better to use the real memcpy threshold from the backend, but it's not available via TargetData. This change brings the performance of c-ray in line with gcc 4.7. The included test case tries to reproduce the c-ray problem to catch regressions for this benchmark early, its performance is dominated by the inline decision of a specific call. This only has a small impact on most code, more on x86 and arm than on x86_64 due to the way the ABI works. When building LLVM for x86 it gives a small inline cost boost to virtually any function using StringRef or STL allocators, but only a 0.01% increase in overall binary size. The size of gcc compiled by clang actually shrunk by a couple bytes with this patch applied, but not significantly. llvm-svn: 161413	2012-08-07 11:13:19 +00:00
Chandler Carruth	ca6b087618	Fix PR13412, a nasty miscompile due to the interleaved instsimplify+inline strategy. The crux of the problem is that instsimplify was reasonably relying on an invariant that is true within any single function, but is no longer true mid-inline the way we use it. This invariant is that an argument pointer != a local (alloca) pointer. The fix is really lightweight though, and allows instsimplify to be resilient to these situations: when checking the relationships to function arguments, ensure that the arguments come from the same function. If they come from different functions, then none of these assumptions hold. All credit to Benjamin Kramer for coming up with this clever solution to the problem. llvm-svn: 161410	2012-08-07 10:59:59 +00:00
Chandler Carruth	49d4e3f282	Add a much more conservative strategy for aligning branch targets. Previously, MBP essentially aligned every branch target it could. This bloats code quite a bit, especially non-looping code which has no real reason to prefer aligned branch targets so heavily. As Andy said in review, it's still a bit odd to do this without a real cost model, but this at least has much more plausible heuristics. Fixes PR13265. llvm-svn: 161409	2012-08-07 09:45:24 +00:00
Manman Ren	5d43c19d9e	MachineCSE: Update the heuristics for isProfitableToCSE. If the result of a common subexpression is used at all uses of the candidate expression, CSE should not increase the live range of the common subexpression. rdar://11393714 and rdar://11819721 llvm-svn: 161396	2012-08-07 06:16:46 +00:00
Bill Wendling	69f9777937	Revert r161371. Removing the 'const' before Type is a "good thing". --- Reverse-merging r161371 into '.': U include/llvm/Target/TargetData.h U lib/Target/TargetData.cpp llvm-svn: 161394	2012-08-07 05:51:59 +00:00
Jack Carter	32420dd092	The define for 64 bit sign extension neglected to initialize fields of the class that it used. The result was nonsense code. Before: 0000000000000000 <foo>: 0: 00441100 0x441100 4: 03e00008 jr ra 8: 00000000 nop After: 0000000000000000 <foo>: 0: 00041000 sll v0,a0,0x0 4: 03e00008 jr ra 8: 00000000 nop llvm-svn: 161377	2012-08-07 00:35:22 +00:00
Bill Wendling	dc532577fd	Constify the Type parameter to some methods (which are const anyway). llvm-svn: 161371	2012-08-07 00:26:35 +00:00
Andrew Trick	35b938c991	Allow x86 subtargets to use the GenericModel defined in X86Schedule.td. This allows codegen passes to query properties like InstrItins->SchedModel->IssueWidth. It also ensures that computeOperandLatency returns the X86 defaults for loads and "high latency ops". This should have no significant impact on existing schedulers because X86 defaults happen to be the same as global defaults. llvm-svn: 161370	2012-08-07 00:25:30 +00:00
Jack Carter	d768975885	Mips relocation R_MIPS_64 relocates a 64 bit double word. I hit this in a very large program (spirit.cpp), but have not figured out how to make a small make check test for it. llvm-svn: 161366	2012-08-07 00:01:14 +00:00
Jack Carter	3f30c3effe	The Mips64InstrInfo.td definitions DynAlloc64 LEA_ADDiu64 were using a class defined for 32 bit instructions and thus the instruction was for addiu instead of daddiu. This was corrected by adding the instruction opcode as a field in the base class to be filled in by the defs. llvm-svn: 161359	2012-08-06 23:29:06 +00:00
Bill Wendling	efdb610522	Reduce indentation by early exiting. llvm-svn: 161356	2012-08-06 22:52:45 +00:00
Jakob Stoklund Olesen	8d107f3948	Fix typo. llvm-svn: 161354	2012-08-06 22:34:51 +00:00
Bill Wendling	dd05d04260	Add a way to grab the target options from the LTO command line. When the command line target options were removed from the LLVM libraries, LTO lost its ability to specify things like `-disable-fp-elim'. Add this back by adding the command line variables to the `lto' project. <rdar://problem/12038729> llvm-svn: 161353	2012-08-06 21:34:54 +00:00
Jack Carter	fdb00bef02	Mips relocations R_MIPS_HIGHER and R_MIPS_HIGHEST. These 2 relocations gain access to the highest and the second highest 16 bits of a 64 bit object. R_MIPS_HIGHER %higher(A+S) The %higher(x) function is [ (((long long) x + 0x80008000LL) >> 32) & 0xffff ]. R_MIPS_HIGHEST %highest(A+S) The %highest(x) function is [ (((long long) x + 0x800080008000LL) >> 48) & 0xffff ]. llvm-svn: 161348	2012-08-06 21:26:03 +00:00
Hal Finkel	15265edebe	MFTB on PPC64 should really be encoded using MFSPR. The MFTB instruction itself is being phased out, and its functionality is provided by MFSPR. According to the ISA docs, using MFSPR works on all known chips except for the 601 (which did not have a timebase register anyway) and the POWER3. Thanks to Adhemerval Zanella for pointing this out! llvm-svn: 161346	2012-08-06 21:21:44 +00:00
Eric Christopher	f5132794cd	Add support for the OpenBSD for Bitrig. Patch by David Hill. llvm-svn: 161344	2012-08-06 20:52:18 +00:00
Simon Atanasyan	03f8384c4e	Fix MIPS DSP Rev1 intrinsics memory properties. The patch reviewed by Akira Hatanaka. llvm-svn: 161332	2012-08-06 19:47:47 +00:00
Jakob Stoklund Olesen	2bb405a329	Put up warning signs around MO::getNextOperandForReg(). llvm-svn: 161329	2012-08-06 18:48:43 +00:00
Roman Divacky	59eec94f55	Remove empty overrides of processFunctionBeforeFrameFinalized(). llvm-svn: 161328	2012-08-06 18:14:18 +00:00
Craig Topper	dc1b95e7de	Implement proper handling for pcmpistri/pcmpestri intrinsics. Requires custom handling in DAGISelToDAG due to limitations in TableGen's implicit def handling. Fixes PR11305. llvm-svn: 161318	2012-08-06 06:22:36 +00:00
Craig Topper	7a2d94e7d0	Update test to check for r161305 llvm-svn: 161307	2012-08-05 09:06:28 +00:00
Craig Topper	e26b30c830	Remove custom inserter for MWAIT. It doesn't do anything that couldn't be represented in a pattern. llvm-svn: 161306	2012-08-05 00:36:57 +00:00
Craig Topper	c716a3f554	Use a COPY node instead of an explicit MOVA opcode in the custom inserter for pcmpestrm/pcmpistrm. Allows the register allocator to handle it better and prevent wasted identity moves. llvm-svn: 161305	2012-08-05 00:17:48 +00:00
Hal Finkel	aadd19de06	Add readcyclecounter lowering on PPC64. On PPC64, this can be done with a simple TableGen pattern. To enable this, I've added the (otherwise missing) readcyclecounter SDNode definition to TargetSelectionDAG.td. llvm-svn: 161302	2012-08-04 14:10:46 +00:00
Anton Korobeynikov	b0d4fe0a5e	Skip impdef regs during eabi save/restore list emission to workaround PR11902 llvm-svn: 161301	2012-08-04 13:25:58 +00:00
Anton Korobeynikov	dca34647bc	Recognize vst1.64 / vld1.64 with 3 and 4 regs as load from / store to stack stuff (this corresponds to spilling/reloading regs in DTriple / DQuad reg classes). No testcase, found by inspection. llvm-svn: 161300	2012-08-04 13:22:14 +00:00
Anton Korobeynikov	6dd5c91aae	Add stack spill / reload instructions for DTriple and DQuad register classes, which were missed for no reason. This fixes PR13377 llvm-svn: 161299	2012-08-04 13:16:12 +00:00
Bill Wendling	5e0afec8b6	Remove extraneous ';'. llvm-svn: 161298	2012-08-04 10:31:40 +00:00
Benjamin Kramer	65e524a766	Update cmake build. llvm-svn: 161297	2012-08-04 09:52:39 +00:00
Benjamin Kramer	c1a7151fa1	Postpone the deletion of the old name in StructType::setName to allow using a slice of the old name. Fixes PR13522. Add a rudimentary unit test to exercise the behavior. llvm-svn: 161296	2012-08-04 09:47:02 +00:00
NAKAMURA Takumi	d7acba5c24	[CMake] add_lit_target: Remove comments about add_dependencies. It is not a bug in cmake that add_custom_target(DEPENDS) would not accept targets but file-level dependencies. llvm-svn: 161295	2012-08-04 02:14:44 +00:00
NAKAMURA Takumi	eb88bde151	llc: Try to suppress failures since r161262 . FIXME: Fix several tests on i686-win32 due to lacking of many libraries. llvm-svn: 161292	2012-08-04 01:29:37 +00:00
Jakob Stoklund Olesen	d5b3babd6f	Delete a dead variable. TwoAddressInstructionPass doesn't remat any more. llvm-svn: 161285	2012-08-04 00:04:03 +00:00
Jakob Stoklund Olesen	69611c470c	TwoAddressInstructionPass refactoring: Extract another method. llvm-svn: 161284	2012-08-03 23:57:58 +00:00
Bob Wilson	9f6e25017a	Refactor and check "onlyReadsMemory" before optimizing builtins. This patch is mostly just refactoring a bunch of copy-and-pasted code, but it also adds a check that the call instructions are readnone or readonly. That check was already present for sin, cos, sqrt, log2, and exp2 calls, but it was missing for the rest of the builtins being handled in this code. llvm-svn: 161282	2012-08-03 23:29:17 +00:00
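
The %higher(x) and %highest(x) formulas quoted in commit fdb00bef02 (R_MIPS_HIGHER and R_MIPS_HIGHEST) can be checked with a short sketch. This is not the LLVM implementation: the function names `higher` and `highest` and the `MASK64` constant are hypothetical, and wrapping the sum to 64 bits is an assumption made here to mimic `long long` arithmetic in Python.

```python
# Illustrative sketch of the formulas from the commit message, not LLVM code.
# Assumption: the addition wraps modulo 2**64, as 64-bit arithmetic would.
MASK64 = (1 << 64) - 1


def higher(x: int) -> int:
    """%higher(x) = (((long long) x + 0x80008000LL) >> 32) & 0xffff"""
    return (((x + 0x80008000) & MASK64) >> 32) & 0xFFFF


def highest(x: int) -> int:
    """%highest(x) = (((long long) x + 0x800080008000LL) >> 48) & 0xffff"""
    return (((x + 0x800080008000) & MASK64) >> 48) & 0xFFFF


if __name__ == "__main__":
    value = 0x0001000200030004
    print(hex(higher(value)), hex(highest(value)))
```

The added constants round each 16-bit piece upward when bit 15 of the piece below it is set, so a value such as 0x80000000 yields a %higher part of 1 rather than 0.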
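
Commit fa8f91a368 describes a two-stage "instruction address -> file/line" lookup: first find the sequence that covers the address, then search the rows of that sequence. A minimal sketch of that idea follows. The `Row` and `Sequence` types, their fields, and the `lookup` function are hypothetical and do not mirror the DebugInfo library's classes; the linear scan in stage one is a simplification chosen here.

```python
# Illustrative two-stage lookup; all names are assumptions, not LLVM's API.
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Row:
    address: int
    file: str
    line: int


@dataclass
class Sequence:
    low: int          # first address covered by the sequence
    high: int         # one past the last covered address
    rows: List[Row]   # rows sorted by address within the sequence


def lookup(sequences: List[Sequence], address: int) -> Optional[Row]:
    # Stage 1: find the sequence containing the address. Sequences may be
    # listed in any address order, so no global ordering is assumed.
    for seq in sequences:
        if seq.low <= address < seq.high:
            # Stage 2: binary search among this sequence's rows for the
            # last row whose address is <= the requested address.
            addrs = [row.address for row in seq.rows]
            index = bisect_right(addrs, address) - 1
            return seq.rows[index] if index >= 0 else None
    return None


# Sequences deliberately listed out of address order.
SEQUENCES = [
    Sequence(0x2000, 0x2010, [Row(0x2000, "b.c", 1), Row(0x2008, "b.c", 2)]),
    Sequence(0x1000, 0x1010, [Row(0x1000, "a.c", 10), Row(0x1004, "a.c", 11)]),
]
```

Restricting the binary search to one sequence is what keeps it valid: rows are sorted only within a sequence, which is why a single `std::lower_bound()` over all rows could return the wrong row.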


84015 Commits