llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 05:52:53 +02:00

Author	SHA1	Message	Date
Nadav Rotem	42c710c5c7	80-col llvm-svn: 167036	2012-10-30 18:37:43 +00:00
Nadav Rotem	69e6bca813	LoopVectorize: Add support for write-only loops when the write destination is a single pointer. Speedup SciMark by 1% llvm-svn: 167035	2012-10-30 18:36:45 +00:00
Adhemerval Zanella	74fd05ff3f	PowerPC: Expand FSQRT for vector types This patch expands FSQRT for floating point vector types when altivec is used. llvm-svn: 167034	2012-10-30 18:29:42 +00:00
Nadav Rotem	4fc2912062	LoopVectorize: Fix a bug in the initialization of reduction variables. AND needs to start at all-ones, while XOR and OR need to start at zero. llvm-svn: 167032	2012-10-30 18:12:36 +00:00
Ulrich Weigand	418fafa0b8	Set %defaultjit to use MCJIT for PowerPC targets. Update Transforms/LICM/2003-12-11-SinkingToPHI.ll test to use %defaultjit as well. llvm-svn: 167031	2012-10-30 18:07:58 +00:00
Bill Wendling	18a846fca4	Fix grammar. llvm-svn: 167029	2012-10-30 17:51:02 +00:00
Michael Liao	6aead01244	Enable ELF machine type to be specified explicitly in X86 backend llvm-svn: 167027	2012-10-30 17:33:39 +00:00
Quentin Colombet	dde058d386	Change ForceSizeOpt attribute into MinSize attribute llvm-svn: 167020	2012-10-30 16:32:52 +00:00
Duncan Sands	bce56286fb	Fix isEliminableCastPair to work correctly in the presence of pointers with different sizes. llvm-svn: 167018	2012-10-30 16:03:32 +00:00
Hans Wennborg	885eff267a	switch_to_lookup_table.ll: Remove some unnecessary lines, comments, function attributes, etc. llvm-svn: 167016	2012-10-30 15:11:52 +00:00
Adhemerval Zanella	ac3ba40bc2	PowerPC: More support for Altivec compare operations This patch adds more support for vector type comparisons using altivec. It adds correct support for v16i8, v8i16, v4i32, and v4f32 vector types for comparison operators ==, !=, >, >=, <, and <=. llvm-svn: 167015	2012-10-30 13:50:19 +00:00
Duncan Sands	db410bd2b6	Add a helper for telling whether a type is a pointer or vector of pointer type. Simplify the implementation of the corresponding integer and float functions and move them inline while there. llvm-svn: 167014	2012-10-30 13:38:54 +00:00
Ulrich Weigand	2df331332d	Enable some additional constant folding for PPCDoubleDouble. This fixes Clang :: CodeGen/complex-builtints.c on PowerPC. llvm-svn: 167013	2012-10-30 12:33:18 +00:00
Hans Wennborg	40eb1b4055	Use TargetTransformInfo to control switch-to-lookup table transformation When the switch-to-lookup tables transform landed in SimplifyCFG, it was pointed out that this could be inappropriate for some targets. Since there was no way at the time for the pass to know anything about the target, an awkward reverse-transform was added in CodeGenPrepare that turned lookup tables back into switches for some targets. This patch uses the new TargetTransformInfo to determine if a switch should be transformed, and removes CodeGenPrepare::ConvertLoadToSwitch. llvm-svn: 167011	2012-10-30 11:23:25 +00:00
Hal Finkel	1e4b354323	Remove an invalid assert in TargetTransformImpl getCastInstrCost had an assert prohibiting scalar to vector casts. Such casts, however, are allowed. This should make the vectorizer buildbot happier. llvm-svn: 166998	2012-10-30 02:41:57 +00:00
Sid Manning	4db9e00747	* Add e_flags enum for Hexagon * Add Hexagon specific section indexes for small data - Reviewed by Michael Spencer llvm-svn: 166997	2012-10-30 02:26:15 +00:00
Jim Grosbach	6585037b8c	ARM: Better disassembly for pc-relative LDR. When the operand is a plain immediate rather than a label, print it as [pc, #imm] like we do for the Thumb2 wide encoding variant. rdar://12154503 llvm-svn: 166991	2012-10-30 01:04:51 +00:00
Reed Kotler	de0ea1027e	Change mips16 delay slot jumps to non delay slot forms by default. We will make them delay slot forms if there is something that can be placed in the delay slot during a separate pass. Mips16 extended instructions cannot be placed in delay slots. llvm-svn: 166990	2012-10-30 00:54:49 +00:00
Nadav Rotem	2ada2db2a2	LoopVectorizer: change debug prints: Print the module identifier when deciding to vectorize. When deciding not to vectorize do not print the called function name because it can be null. llvm-svn: 166989	2012-10-30 00:40:39 +00:00
Jakub Staszak	f1cddf738b	Re-commit r166971. I reverted it too quickly, when buildbots didn't have a chance to test it with chapni's fix (-mattr=+avx). llvm-svn: 166985	2012-10-30 00:01:57 +00:00
Kevin Enderby	ecb9e2620c	Fix ARM's b.w instruction for thumb 2 and the encoding T4. The branch target is 24 bits not 20 and the decoding needed to correctly handle converting the J1 and J2 bits to their I1 and I2 values to reconstruct the displacement. llvm-svn: 166982	2012-10-29 23:27:20 +00:00
Jakub Staszak	ce95e4429f	Revert r166971. It causes buildbot failure. To be investigated. llvm-svn: 166979	2012-10-29 23:13:50 +00:00
NAKAMURA Takumi	d544c8f4ca	llvm/test/CodeGen/X86/vec_shuffle-30.ll: Try to unbreak builds - assuming +avx. llvm-svn: 166974	2012-10-29 22:45:18 +00:00
Jakub Staszak	51f21a007f	Remove unused variable. llvm-svn: 166973	2012-10-29 22:04:32 +00:00
Jakub Staszak	6067106145	Simplify code. No functionality change. llvm-svn: 166972	2012-10-29 22:02:26 +00:00
Jakub Staszak	ded6f21890	Allow to fold vector load if there is more than one bitcast, so in the case: %0 = load <8 x i16>* %dest %1 = shufflevector <8 x i16> %0, <8 x i16> %in, <8 x i32> < i32 0, i32 1, i32 2, i32 3, i32 13, i32 undef, i32 14, i32 14> store <8 x i16> %1, <8 x i16>* %dest We get: vmovlpd (%eax), %xmm0, %xmm0 instead of: vmovaps (%eax), %xmm1 vmovsd %xmm1, %xmm0, %xmm0 No extra test-case is added. I just fixed the existing one (also it uses FileCheck now). llvm-svn: 166971	2012-10-29 21:56:35 +00:00
Nadav Rotem	0c9445eb5c	LoopVectorize: Update and preserve the dominator tree info. llvm-svn: 166970	2012-10-29 21:52:38 +00:00
Jakub Staszak	82909cc59f	Typo. llvm-svn: 166969	2012-10-29 21:49:46 +00:00
Bill Schmidt	77a8fd274b	This patch solves a problem with passing varargs parameters under the PPC64 ELF ABI. A varargs parameter consisting of a single-precision floating-point value, or of a single-element aggregate containing a single-precision floating-point value, must be passed in the low-order (rightmost) four bytes of the doubleword stack slot reserved for that parameter. If there are GPR protocol registers remaining, the parameter must also be mirrored in the low-order four bytes of the reserved GPR. Prior to this patch, such parameters were being passed in the high-order four bytes of the stack slot and the mirrored GPR. The patch adds a new test case to verify the correct code generation. llvm-svn: 166968	2012-10-29 21:18:16 +00:00
Simon Atanasyan	dc69984e0f	Add mips64-* and mips64el-* triples to configure scripts as valid triples denoting the Mips target. llvm-svn: 166961	2012-10-29 19:49:45 +00:00
Reed Kotler	3859f5469e	Implement patterns for extloadi8 and extloadi16 llvm-svn: 166960	2012-10-29 19:39:04 +00:00
Ulrich Weigand	445bd73056	In various places throughout the code generator, there were special checks to avoid performing compile-time arithmetic on PPCDoubleDouble. Now that APFloat supports arithmetic on PPCDoubleDouble, those checks are no longer needed, and we can treat the type like any other. llvm-svn: 166958	2012-10-29 18:35:49 +00:00
Ulrich Weigand	22232e05bb	APFloat cleanup: Remove now unused "arithmeticOK" logic. llvm-svn: 166954	2012-10-29 18:18:44 +00:00
Chad Rosier	3b32dec25d	Remove redundant test case from r166949, per Eli's suggestion. llvm-svn: 166953	2012-10-29 18:18:26 +00:00
Ulrich Weigand	ac97e73457	APFloat cleanup: Remove now unused fields "sign2" and "exponent2". llvm-svn: 166952	2012-10-29 18:17:42 +00:00
Ulrich Weigand	c504e37126	Implement arithmetic on APFloat with PPCDoubleDouble semantics by treating it as if it were an IEEE floating-point type with 106-bit mantissa. This makes compile-time arithmetic on "long double" for PowerPC in clang (in particular parsing of floating point constants) work, and fixes all "long double" related failures in the test suite. llvm-svn: 166951	2012-10-29 18:09:01 +00:00
Chad Rosier	651ecf255c	[ms-inline asm] Add support for the [] operator. Essentially, [expr1][expr2] is equivalent to [expr1 + expr2]. See test cases for more examples. rdar://12470392 llvm-svn: 166949	2012-10-29 18:01:54 +00:00
Nadav Rotem	f8e4e2b652	Rename the BB-vectorize flag to match the dragonegg name llvm-svn: 166948	2012-10-29 18:01:14 +00:00
Michael Liao	d26c27ad35	Fix PR14204 - Add missing pattern on X86ISD::VZEXT from VR256 to VR256 when AVX2 is enabled. llvm-svn: 166947	2012-10-29 17:57:12 +00:00
Joerg Sonnenberger	4609a4241f	Fix typo llvm-svn: 166945	2012-10-29 17:56:15 +00:00
Jakob Stoklund Olesen	05cec5db28	Completely disallow partial copies in adjustCopiesBackFrom(). Partial copies can show up even when CoalescerPair.isPartial() returns false. For example: %vreg24:dsub_0<def> = COPY %vreg31:dsub_0; QPR:%vreg24,%vreg31 Such a partial-partial copy is not good enough for the transformation adjustCopiesBackFrom() needs to do. llvm-svn: 166944	2012-10-29 17:51:52 +00:00
Ulrich Weigand	2daab9e4b4	Allow i32/i64 for 'f' constraint on PowerPC. This fixes PR12757. llvm-svn: 166943	2012-10-29 17:49:34 +00:00
Duncan Sands	e6f6a2ecdc	Remove a wrapper around getIntPtrType added to GVN by Hal in commit 166624 (the wrapper returns a vector of integers when passed a vector of pointers) by having getIntPtrType itself return a vector of integers in this case. Outside of this wrapper, I didn't find anywhere in the codebase that was relying on the old behaviour for vectors of pointers, so give this a whirl through the buildbots. llvm-svn: 166939	2012-10-29 17:31:46 +00:00
Bob Wilson	373d870759	Remove code to saturate profile counts. We may need to change the way profile counter values are stored, but saturation is the wrong thing to do. Just remove it for now. Patch by Alastair Murray! llvm-svn: 166938	2012-10-29 17:27:39 +00:00
Nadav Rotem	24b8d6c6f1	Change the PassManagerBuilder (used by -O3) loop vectorizer flag from -vectorize to -vectorize-loops because we don't want to share the same flag as the bb-vectorizer. llvm-svn: 166937	2012-10-29 16:36:25 +00:00
Hans Wennborg	8fb81f8122	Minor style fixes for TargetTransformInfo and TargetTransformImpl llvm-svn: 166936	2012-10-29 16:26:52 +00:00
Reed Kotler	56b8c348f2	Expand all atomic ops for mips16. llvm-svn: 166935	2012-10-29 16:16:54 +00:00
NAKAMURA Takumi	3857058610	llvm/Config/config.h.cmake: Good bye, Kevin! We won't honor authors in comments. llvm-svn: 166934	2012-10-29 16:07:28 +00:00
NAKAMURA Takumi	7d3154afaa	PPCSubtarget.h: Add explicit braces. llvm-svn: 166932	2012-10-29 15:51:42 +00:00
NAKAMURA Takumi	faa1c78982	PPCSubtarget.h: Whitespace. llvm-svn: 166931	2012-10-29 15:51:35 +00:00
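
Commit 4fc2912062 above notes that an AND reduction must start at all-ones while XOR and OR start at zero. A minimal Python sketch of why, not LLVM's code: the helper names, the 32-bit width, and the round-robin lane layout are assumptions made for illustration. Each lane of the simulated vector accumulator starts at the operation's identity, so lanes that receive no data do not disturb the final result.

```python
from functools import reduce
import operator

MASK32 = 0xFFFFFFFF  # assumed element width: 32-bit integers

# (operation, identity value) per reduction kind, as stated in the commit message
OPS = {
    "and": (operator.and_, MASK32),  # all-ones
    "or": (operator.or_, 0),
    "xor": (operator.xor, 0),
}


def scalar_reduce(values, kind):
    """Reference result: fold the values one at a time."""
    op, identity = OPS[kind]
    return reduce(op, (v & MASK32 for v in values), identity)


def vector_reduce(values, kind, vf=4, start=None):
    """Simulate a vectorized reduction with `vf` lanes (hypothetical helper).

    Every lane starts at `start` (default: the identity), elements are
    folded into lanes round-robin, then the lanes are combined.
    """
    op, identity = OPS[kind]
    lanes = [identity if start is None else start] * vf
    for i, v in enumerate(values):
        lanes[i % vf] = op(lanes[i % vf], v & MASK32)
    return reduce(op, lanes)


data = [0xFF, 0x0F, 0x3C]
for kind in OPS:
    assert vector_reduce(data, kind) == scalar_reduce(data, kind)

# Starting an AND reduction at zero loses the data entirely.
wrong_and = vector_reduce(data, "and", start=0)
```

With `start=0` the AND reduction collapses to zero regardless of the input, which is the kind of bug the commit describes.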

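Commit ecb9e2620c above describes the Thumb-2 b.w (encoding T4) fix: the J1 and J2 bits must be converted to I1 and I2 before the 24-bit displacement field is reconstructed. A minimal Python sketch of that decoding, based on the T4 bit layout in the ARM architecture manual rather than on LLVM's disassembler; the function name is hypothetical.

```python
def decode_thumb2_b_t4(hw1: int, hw2: int) -> int:
    """Return the signed byte offset of a Thumb-2 B.W (encoding T4).

    hw1 = 11110 S imm10, hw2 = 10 J1 1 J2 imm11 (first and second halfword).
    """
    s = (hw1 >> 10) & 1
    imm10 = hw1 & 0x3FF
    j1 = (hw2 >> 13) & 1
    j2 = (hw2 >> 11) & 1
    imm11 = hw2 & 0x7FF

    # I1 = NOT(J1 XOR S), I2 = NOT(J2 XOR S)
    i1 = (j1 ^ s) ^ 1
    i2 = (j2 ^ s) ^ 1

    # S:I1:I2:imm10:imm11 is the 24-bit field; the appended zero bit
    # makes it a 25-bit, halfword-aligned byte offset.
    imm25 = (s << 24) | (i1 << 23) | (i2 << 22) | (imm10 << 12) | (imm11 << 1)

    # Sign-extend from 25 bits.
    if s:
        imm25 -= 1 << 25
    return imm25


forward = decode_thumb2_b_t4(0xF000, 0xB800)   # offset 0
to_self = decode_thumb2_b_t4(0xF7FF, 0xBFFE)   # offset -4
far = decode_thumb2_b_t4(0xF000, 0x9800)       # only I1 set
```

The offset is relative to the PC value the instruction sees (its own address plus 4), so an offset of -4 branches back to the instruction itself.
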

86137 Commits