llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Chris Lattner	3f92c791b4	initialize ivar llvm-svn: 30780	2006-10-06 22:52:08 +00:00
Chris Lattner	d5f5a433b2	If a target uses a GOT, put it in the jt data section, not the text section. This will fix alpha when Andrew implements AlphaTargetMachine::getTargetLowering(). llvm-svn: 30779	2006-10-06 22:50:56 +00:00
Chris Lattner	2ca01febcf	Alpha uses a got llvm-svn: 30778	2006-10-06 22:46:51 +00:00
Chris Lattner	b5b96302f2	jump tables handle pic llvm-svn: 30776	2006-10-06 22:32:29 +00:00
Chris Lattner	ad60994822	print labels even if a MBB doesn't have a corresponding LLVM BB, just don't print the LLVM BB label. llvm-svn: 30775	2006-10-06 21:28:17 +00:00
Rafael Espindola	a96c205e12	add optional input flag to FMRRD llvm-svn: 30774	2006-10-06 20:33:26 +00:00
Rafael Espindola	54301ca490	add support for calling functions that return double llvm-svn: 30771	2006-10-06 19:10:05 +00:00
Evan Cheng	6d15f83d46	80 col violation. llvm-svn: 30770	2006-10-06 18:57:51 +00:00
Chris Lattner	399106d8f8	ugly codegen llvm-svn: 30769	2006-10-06 17:39:34 +00:00
Chris Lattner	0d39b3a4cf	Fix a miscompilation of: long long foo(long long X) { return (long long)(signed char)(int)X; } Instead of: _foo: extsb r2, r4 srawi r3, r4, 31 mr r4, r2 blr we now produce: _foo: extsb r4, r4 srawi r3, r4, 31 blr This fixes a miscompilation in ConstantFolding.cpp. llvm-svn: 30768	2006-10-06 17:34:12 +00:00
Rafael Espindola	d870b158b3	fix some bugs affecting functions with no arguments llvm-svn: 30767	2006-10-06 17:26:30 +00:00
Rafael Espindola	f35563ff66	fix the stack alignment llvm-svn: 30766	2006-10-06 14:29:47 +00:00
Rafael Espindola	f679bdf121	add support for calling functions that have double arguments llvm-svn: 30765	2006-10-06 12:50:22 +00:00
Evan Cheng	9ce3d493f0	Still need to support -mcpu=<> or cross compilation will fail. Doh. llvm-svn: 30764	2006-10-06 09:17:41 +00:00
Evan Cheng	6fc0ae2136	Do away with CPU feature list. Just use CPUID to detect MMX, SSE, SSE2, SSE3, and 64-bit support. llvm-svn: 30763	2006-10-06 08:21:07 +00:00
Evan Cheng	35a3337e1d	It appears the inline asm in GetCpuIDAndInfo() may clobbers some registers if it isn't inlined (at < -O3). Force it to be inlined. llvm-svn: 30762	2006-10-06 07:50:56 +00:00
Chris Lattner	5fc3bb074c	MachineBasicBlock::splice was incorrectly updating parent pointers on instructions. llvm-svn: 30760	2006-10-06 01:12:44 +00:00
Evan Cheng	275825195a	Make use of getStore(). llvm-svn: 30759	2006-10-05 23:01:46 +00:00
Evan Cheng	c9e079d0c1	Add getStore() helper function to create ISD::STORE nodes. llvm-svn: 30758	2006-10-05 22:57:11 +00:00
Chris Lattner	eca9897bd5	Don't crash if an MBB doesn't have an LLVM BB llvm-svn: 30757	2006-10-05 21:40:14 +00:00
Rafael Espindola	2e4743b6d1	use a const ref for passing the vector to ArgumentLayout llvm-svn: 30756	2006-10-05 17:46:48 +00:00
Rafael Espindola	f0e4950ef4	implement a ArgumentLayout class to factor code common to LowerFORMAL_ARGUMENTS and LowerCALL implement FMDRR add support for f64 function arguments llvm-svn: 30754	2006-10-05 16:48:49 +00:00
Jim Laskey	3f9f064fd1	Alias analysis code clean ups. llvm-svn: 30753	2006-10-05 15:07:25 +00:00
Chris Lattner	513ba43053	add a new SimplifyDemandedVectorElts method, which works similarly to SimplifyDemandedBits. The idea is that some operations can be simplified if not all of the computed elements are needed. Some targets (like x86) have a large number of intrinsics that operate on a single element, but pass other elts through unmodified. If those other elements are not needed, the intrinsics can be simplified to scalar operations, and insertelement ops can be removed. This turns (f.e.): ushort %Convert_sse(float %f) { %tmp = insertelement <4 x float> undef, float %f, uint 0 ; <<4 x float>> [#uses=1] %tmp10 = insertelement <4 x float> %tmp, float 0.000000e+00, uint 1 ; <<4 x float>> [#uses=1] %tmp11 = insertelement <4 x float> %tmp10, float 0.000000e+00, uint 2 ; <<4 x float>> [#uses=1] %tmp12 = insertelement <4 x float> %tmp11, float 0.000000e+00, uint 3 ; <<4 x float>> [#uses=1] %tmp28 = tail call <4 x float> %llvm.x86.sse.sub.ss( <4 x float> %tmp12, <4 x float> < float 1.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > ) ; <<4 x float>> [#uses=1] %tmp37 = tail call <4 x float> %llvm.x86.sse.mul.ss( <4 x float> %tmp28, <4 x float> < float 5.000000e-01, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > ) ; <<4 x float>> [#uses=1] %tmp48 = tail call <4 x float> %llvm.x86.sse.min.ss( <4 x float> %tmp37, <4 x float> < float 6.553500e+04, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > ) ; <<4 x float>> [#uses=1] %tmp59 = tail call <4 x float> %llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> zeroinitializer ) ; <<4 x float>> [#uses=1] %tmp = tail call int %llvm.x86.sse.cvttss2si( <4 x float> %tmp59 ) ; <int> [#uses=1] %tmp69 = cast int %tmp to ushort ; <ushort> [#uses=1] ret ushort %tmp69 } into: ushort %Convert_sse(float %f) { entry: %tmp28 = sub float %f, 1.000000e+00 ; <float> [#uses=1] %tmp37 = mul float %tmp28, 5.000000e-01 ; <float> [#uses=1] %tmp375 = insertelement <4 x float> undef, float %tmp37, uint 0 ; <<4 x float>> [#uses=1] %tmp48 = tail call <4 x float> %llvm.x86.sse.min.ss( <4 x float> %tmp375, <4 x float> < float 6.553500e+04, float undef, float undef, float undef > ) ; <<4 x float>> [#uses=1] %tmp59 = tail call <4 x float> %llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> < float 0.000000e+00, float undef, float undef, float undef > ) ; <<4 x float>> [#uses=1] %tmp = tail call int %llvm.x86.sse.cvttss2si( <4 x float> %tmp59 ) ; <int> [#uses=1] %tmp69 = cast int %tmp to ushort ; <ushort> [#uses=1] ret ushort %tmp69 } which improves codegen from: _Convert_sse: movss LCPI1_0, %xmm0 movss 4(%esp), %xmm1 subss %xmm0, %xmm1 movss LCPI1_1, %xmm0 mulss %xmm0, %xmm1 movss LCPI1_2, %xmm0 minss %xmm0, %xmm1 xorps %xmm0, %xmm0 maxss %xmm0, %xmm1 cvttss2si %xmm1, %eax andl $65535, %eax ret to: _Convert_sse: movss 4(%esp), %xmm0 subss LCPI1_0, %xmm0 mulss LCPI1_1, %xmm0 movss LCPI1_2, %xmm1 minss %xmm1, %xmm0 xorps %xmm1, %xmm1 maxss %xmm1, %xmm0 cvttss2si %xmm0, %eax andl $65535, %eax ret This is just a first step, it can be extended in many ways. Testcase here: Transforms/InstCombine/vec_demanded_elts.ll llvm-svn: 30752	2006-10-05 06:55:50 +00:00
Chris Lattner	14ad447136	Add insertelement/extractelement helper ctors. llvm-svn: 30750	2006-10-05 06:24:58 +00:00
Chris Lattner	7f98896c02	Lower some min/max idioms to minss/maxss when unsafe fp math is enabled. llvm-svn: 30748	2006-10-05 04:11:26 +00:00
Chris Lattner	08da1a510d	Don't bother setting JumpTableTextSection, it is about to disappear llvm-svn: 30745	2006-10-05 03:13:59 +00:00
Chris Lattner	4f41b86e7f	Emit pic jumptables to the same section that the function is emitted to, allowing label differences to work. This fixes CodeGen/X86/pic_jumptable.ll llvm-svn: 30744	2006-10-05 03:13:28 +00:00
Chris Lattner	068190eb91	Pass the MachineFunction into EmitJumpTableInfo. llvm-svn: 30742	2006-10-05 03:01:21 +00:00
Chris Lattner	75e572ab20	implement and use getSectionForFunction llvm-svn: 30741	2006-10-05 02:51:36 +00:00
Chris Lattner	cc21d20348	Use getSectionForFunction. llvm-svn: 30740	2006-10-05 02:49:23 +00:00
Chris Lattner	d62ecab2e3	Use getSectionForFunction llvm-svn: 30739	2006-10-05 02:48:40 +00:00
Chris Lattner	a293b73042	use getSectionForFunction to decide which section to emit code into llvm-svn: 30738	2006-10-05 02:47:13 +00:00
Chris Lattner	758352e9b1	Implement getSectionForFunction, use it when printing function body. llvm-svn: 30737	2006-10-05 02:43:52 +00:00
Chris Lattner	b92a46c4f6	move getSectionForFunction to AsmPrinter llvm-svn: 30736	2006-10-05 02:42:47 +00:00
Chris Lattner	ca844c6695	Move getSectionForFunction to AsmPrinter, change it to return a string. llvm-svn: 30735	2006-10-05 02:42:20 +00:00
Chris Lattner	0ca8a69c28	implement DarwinTargetAsmInfo::getSectionForFunction, use it when outputting function bodies llvm-svn: 30733	2006-10-05 00:35:50 +00:00
Chris Lattner	6ba6d0e937	Give TargetAsmInfo a virtual dtor, add a new getSectionForFunction method. llvm-svn: 30732	2006-10-05 00:35:16 +00:00
Chris Lattner	2e10b0c095	emit jump table before debug info llvm-svn: 30731	2006-10-05 00:26:05 +00:00
Chris Lattner	dd6343bd8d	Always emit the jump table after the function so it's part of the same 'atom' as the function body. llvm-svn: 30730	2006-10-05 00:24:46 +00:00
Chris Lattner	f943ae3b8a	getFilename/getDirectory shouldn't abort if the global has no init. This can happen on bugpoint reduced testcases f.e.. llvm-svn: 30729	2006-10-04 23:06:26 +00:00
Evan Cheng	5974db9813	Fix some typos that can cause a flag value to have more than one use. llvm-svn: 30727	2006-10-04 22:23:53 +00:00
Chris Lattner	ba99215347	Fix a static dtor issue llvm-svn: 30726	2006-10-04 22:13:11 +00:00
Chris Lattner	d55928d5f6	Fix more static dtor issues llvm-svn: 30725	2006-10-04 21:52:35 +00:00
Chris Lattner	de1ba6dc70	Fix some more static dtor issues. llvm-svn: 30724	2006-10-04 21:49:37 +00:00
Evan Cheng	a77dd83caf	Added option -disable-x86-shuffle-opti to disable X86 specific vector shuffle optimizations. llvm-svn: 30723	2006-10-04 18:33:38 +00:00
Evan Cheng	54a8d5a6e0	Formating. llvm-svn: 30722	2006-10-04 18:33:00 +00:00
Jim Laskey	dd74085b55	More extensive alias analysis. llvm-svn: 30721	2006-10-04 16:53:27 +00:00
Jim Laskey	ef4d9453b9	More long term solution llvm-svn: 30720	2006-10-04 10:40:15 +00:00
Chris Lattner	4ec6c298d8	Pattern match min/max nodes when we have sse. This implements CodeGen/X86/scalar_sse_minmax.ll llvm-svn: 30719	2006-10-04 06:57:07 +00:00

1 2 3 4 5 ...

15280 Commits