llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 14:32:51 +01:00

Author	SHA1	Message	Date
Meador Inge	b6984384bf	instcombine: Migrate strpbrk optimizations This patch migrates the strpbrk optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 167105	2012-10-31 04:29:58 +00:00
Michael Liao	299b55458f	Clean up redundant SP register maintained in X86 TLI llvm-svn: 167104	2012-10-31 04:14:09 +00:00
Meador Inge	5f906a50d3	instcombine: Migrate strlen optimizations This patch migrates the strlen optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 167103	2012-10-31 03:33:06 +00:00
Meador Inge	4d309f330c	instcombine: Migrate strncpy optimizations This patch migrates the strncpy optimizations from the simplify-libcalls pass into the instcombine library call simplifier. llvm-svn: 167102	2012-10-31 03:33:00 +00:00
Nadav Rotem	9ab0e93cc1	LoopVectorize: Do not vectorize loops with tiny constant trip counts. llvm-svn: 167101	2012-10-31 03:31:07 +00:00
Bill Schmidt	f4c899f8e7	This patch addresses an ABI compatibility issue with empty aggregate parameters. Examples of these are: struct { } a; union { } b[256]; int a[0]; An empty aggregate has an address, although dereferencing that address is pointless. When passed as a parameter, an empty aggregate does not consume a protocol register, nor does it consume a doubleword in the parameter save area. Passing an empty aggregate by reference passes an address just as for any other aggregate. Returning an empty aggregate uses GPR3 as a hidden address of the return value location, just as for any other aggregate. The patch modifies PPCTargetLowering::LowerFormalArguments_64SVR4 and PPCTargetLowering::LowerCall_64SVR4 to properly skip empty aggregate parameters passed by value. The handling of return values and by-reference parameters was already correct. Built on powerpc64-unknown-linux-gnu and tested with no new regressions. A test case is included to test proper handling of empty aggregate parameters on both sides of the function call protocol. llvm-svn: 167090	2012-10-31 01:15:05 +00:00
Akira Hatanaka	7297f1c0d1	Change signature of function RAFast::spillAll to avoid conversion between type MachineInstr* and MachineBasicBlock::iterator. llvm-svn: 167088	2012-10-31 00:56:01 +00:00
Rafael Espindola	7aaf7247d3	xlc supports __attribute__((aligned(x))), use it. Patch by Kai. llvm-svn: 167087	2012-10-31 00:54:26 +00:00
Akira Hatanaka	a2384b1c6d	Check that iterator I is not the end iterator. llvm-svn: 167086	2012-10-31 00:50:52 +00:00
Rafael Espindola	79781084a5	Add extra declarations of hash_value needed to build llvm with xlc 12.1. Patch by Kai! llvm-svn: 167085	2012-10-31 00:46:18 +00:00
Nadav Rotem	240ead98fd	Add support for loops that don't start with Zero. This is important for loops in the LAPACK test-suite. These loops start at 1 because they are auto-converted from fortran. llvm-svn: 167084	2012-10-31 00:45:26 +00:00
Meador Inge	261da7dfde	instcombine: Migrate stpcpy optimizations This patch migrates the stpcpy optimizations from the simplify-libcalls pass into the instcombine library call simplifier. Note that the __stpcpy_chk simplifications were migrated in a previous commit. llvm-svn: 167083	2012-10-31 00:20:56 +00:00
Meador Inge	1a88082441	instcombine: Split out the __stpcpy_chk simplifications from StrCpyChkOpt r166198 migrated the strcpy optimization to instcombine. The strcpy simplifier that was migrated from Transforms/Scalar/SimplifyLibCalls.cpp was also doing some __strcpy_chk simplifications. Those fortified simplifications were migrated as well, but introduced a bug in the __stpcpy_chk simplifier in the process. This happened because the __strcpy_chk and __stpcpy_chk simplifiers were both mapped to StrCpyChkOpt which was updated with simplifications that worked for __strcpy_chk, but not __stpcpy_chk. This patch fixes the problem by adding proper test coverage and creating a new simplifier for __stpcpy_chk (instead of sharing one with __strcpy_chk). llvm-svn: 167082	2012-10-31 00:20:51 +00:00
Manman Ren	f26bd7d8f9	X86 SSE: update rsqrtss and rcpss to use two source operands and the first source operand is tied to the destination operand. This is to accurately model the corresponding instructions where the upper bits are unmodified. rdar://12558838 PR14221 llvm-svn: 167064	2012-10-30 23:53:59 +00:00
Eli Friedman	c968a32c9b	Fix regression in old-style JIT. llvm-svn: 167057	2012-10-30 22:21:55 +00:00
Manman Ren	584c3daf8d	X86 MMX: optimize transfer from mmx to i32 We used to generate a store (movq) + a load. Now we use movd. rdar://9946746 llvm-svn: 167056	2012-10-30 22:15:38 +00:00
Nadav Rotem	e06ea2d50f	Add documentation. llvm-svn: 167055	2012-10-30 22:06:26 +00:00
Eric Christopher	1dcc1b0cdc	Reformat and 80-column this. It's not strictly conforming yet, but it's better. llvm-svn: 167053	2012-10-30 21:36:43 +00:00
Chandler Carruth	d3b4a83c9f	Fix PR14212: For some strange reason I treated vectors differently from integers in that the code to handle split alloca-wide integer loads or stores doesn't come first. It should, for the same reasons as with integers, and the PR attests to that. Also had to fix a busted assert in that this test case also covers. llvm-svn: 167051	2012-10-30 20:52:40 +00:00
Chad Rosier	24643b6410	[inline asm] Get the mayLoad/mayStore directly from the MIOp_ExtraInfo operand. llvm-svn: 167050	2012-10-30 20:39:19 +00:00
Hal Finkel	6cfb988397	BBVectorize: Cache fixed-order pairs instead of recomputing pointer info. Instead of recomputing relative pointer information just prior to fusing, cache this information (which also needs to be computed during the candidate-pair selection process). This cuts down on the total number of SE queries made, and also is a necessary intermediate step on the road toward including shuffle costs in the pair selection procedure. No functionality change is intended. llvm-svn: 167049	2012-10-30 20:17:37 +00:00
Akira Hatanaka	dbea525cfc	[mips] Allow tail-call optimization for vararg functions and functions which use the caller's stack. llvm-svn: 167048	2012-10-30 20:16:31 +00:00
Chad Rosier	3f4e0f8e8e	Add a comment for r167040. llvm-svn: 167046	2012-10-30 20:01:12 +00:00
Benjamin Kramer	78cdbf2f16	LoopIdiom: Fix a serious missed optimization: we only turned top-level loops into memmove. Thanks to Preston Briggs for catching this! llvm-svn: 167045	2012-10-30 19:49:39 +00:00
Hal Finkel	1c116e9ec0	BBVectorize: Fix a small bug introduced in r167042. We need to make sure that we take the correct load/store alignment when the inputs are flipped. llvm-svn: 167044	2012-10-30 19:47:37 +00:00
Akira Hatanaka	a5bf1f87ef	Add code for saving formal argument information to MipsFunctionInfo. This information will be used by IsEligibleForTailCallOptimization to determine whether a call can be tail-call optimized. llvm-svn: 167043	2012-10-30 19:37:25 +00:00
Hal Finkel	2d3b9c41d5	BBVectorize: Simplify how input swapping is handled. Stop propagating the FlipMemInputs variable into the routines that create the replacement instructions. Instead, just flip the arguments of those routines. This allows for some associated cleanup (not all of which is done here). No functionality change is intended. llvm-svn: 167042	2012-10-30 19:35:29 +00:00
Akira Hatanaka	cf35a025b3	Add definition of function MipsTargetLowering::passArgOnStack which emits nodes for passing a function call argument on a stack. llvm-svn: 167041	2012-10-30 19:23:25 +00:00
Chad Rosier	528b5cd1a6	[inline asm] Implement mayLoad and mayStore for inline assembly. In general, the MachineInstr MayLoad/MayLoad flags are based on the tablegen implementation. For inline assembly, however, we need to compute these based on the constraints. Revert r166929 as this is no longer needed, but leave the test case in place. rdar://12033048 and PR13504 llvm-svn: 167040	2012-10-30 19:11:54 +00:00
Akira Hatanaka	aa8ad65cad	Do not do tail-call optimization if target is mips16. llvm-svn: 167039	2012-10-30 19:07:58 +00:00
Hal Finkel	a27a64ab3e	BBVectorize: Don't make calls to SE when the result is unused. SE was being called during the instruction-fusion process (when the result is unreliable, and thus ignored). No functionality change is intended. llvm-svn: 167037	2012-10-30 18:55:49 +00:00
Nadav Rotem	42c710c5c7	80-col llvm-svn: 167036	2012-10-30 18:37:43 +00:00
Nadav Rotem	69e6bca813	LoopVectorize: Add support for write-only loops when the write destination is a single pointer. Speedup SciMark by 1% llvm-svn: 167035	2012-10-30 18:36:45 +00:00
Adhemerval Zanella	74fd05ff3f	PowerPC: Expand FSRQT for vector types This patch expands FSQRT for floating point vector types when altivec is used. llvm-svn: 167034	2012-10-30 18:29:42 +00:00
Nadav Rotem	4fc2912062	LoopVectorize: Fix a bug in the initialization of reduction variables. AND needs to start at all-one while XOR, and OR need to start at zero. llvm-svn: 167032	2012-10-30 18:12:36 +00:00
Ulrich Weigand	418fafa0b8	Set %defaultjit to use MCJIT for PowerPC targets. Update Transforms/LICM/2003-12-11-SinkingToPHI.ll test to use %defaultjit as well. llvm-svn: 167031	2012-10-30 18:07:58 +00:00
Bill Wendling	18a846fca4	Fix grammar. llvm-svn: 167029	2012-10-30 17:51:02 +00:00
Michael Liao	6aead01244	Enable ELF machine type to be specified explicitly in X86 backend llvm-svn: 167027	2012-10-30 17:33:39 +00:00
Quentin Colombet	dde058d386	Change ForceSizeOpt attribute into MinSize attribute llvm-svn: 167020	2012-10-30 16:32:52 +00:00
Duncan Sands	bce56286fb	Fix isEliminableCastPair to work correctly in the presence of pointers with different sizes. llvm-svn: 167018	2012-10-30 16:03:32 +00:00
Hans Wennborg	885eff267a	switch_to_lookup_table.ll: Remove some unnecessary lines, comments, function attributes, etc. llvm-svn: 167016	2012-10-30 15:11:52 +00:00
Adhemerval Zanella	ac3ba40bc2	PowerPC: More support for Altivec compare operations This patch adds more support for vector type comparisons using altivec. It adds correct support for v16i8, v8i16, v4i32, and v4f32 vector types for comparison operators ==, !=, >, >=, <, and <=. llvm-svn: 167015	2012-10-30 13:50:19 +00:00
Duncan Sands	db410bd2b6	Add a helper for telling whether a type is a pointer or vector of pointer type. Simplify the implementation of the corresponding integer and float functions and move them inline while there. llvm-svn: 167014	2012-10-30 13:38:54 +00:00
Ulrich Weigand	2df331332d	Enable some additional constant folding for PPCDoubleDouble. This fixes Clang :: CodeGen/complex-builtints.c on PowerPC. llvm-svn: 167013	2012-10-30 12:33:18 +00:00
Hans Wennborg	40eb1b4055	Use TargetTransformInfo to control switch-to-lookup table transformation When the switch-to-lookup tables transform landed in SimplifyCFG, it was pointed out that this could be inappropriate for some targets. Since there was no way at the time for the pass to know anything about the target, an awkward reverse-transform was added in CodeGenPrepare that turned lookup tables back into switches for some targets. This patch uses the new TargetTransformInfo to determine if a switch should be transformed, and removes CodeGenPrepare::ConvertLoadToSwitch. llvm-svn: 167011	2012-10-30 11:23:25 +00:00
Hal Finkel	1e4b354323	Remove an invalid assert in TargetTransformImpl getCastInstrCost had an assert prohibiting scalar to vector casts. Such casts, however, are allowed. This should make the vectorizer buildbot happier. llvm-svn: 166998	2012-10-30 02:41:57 +00:00
Sid Manning	4db9e00747	* Add e_flags enum for Hexagon * Add Hexagon specific section indexes for small data - Reviewed by Michael Spencer llvm-svn: 166997	2012-10-30 02:26:15 +00:00
Jim Grosbach	6585037b8c	ARM: Better disassembly for pc-relative LDR. When the operand is a plain immediate rather than a label, print it as [pc, #imm] like we do for the Thumb2 wide encoding variant. rdar://12154503 llvm-svn: 166991	2012-10-30 01:04:51 +00:00
Reed Kotler	de0ea1027e	Change mips16 delay slot jumps to non delay slot forms by default. We will make them delay slot forms if there is something that can be placed in the delay slot during a separate pass. Mips16 extended instructions cannot be placed in delay slots. llvm-svn: 166990	2012-10-30 00:54:49 +00:00
Nadav Rotem	2ada2db2a2	LoopVectorizer: change debug prints: Print the module identifier when deciding to vectorize. When deciding not to vectorize do not print the called function name because it can be null. llvm-svn: 166989	2012-10-30 00:40:39 +00:00

1 2 3 4 5 ...

86168 Commits