llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Dale Johannesen	f12104ce4b	Handle 'X' constraint in asm's better. llvm-svn: 46485	2008-01-29 02:21:21 +00:00
Scott Michel	dc780aeb57	Overhaul Cell SPU's addressing mode internals so that there are now only two addressing mode nodes, SPUaform and SPUindirect (vice the three previous ones, SPUaform, SPUdform and SPUxform). This improves code somewhat because we now avoid using reg+reg addressing when it can be avoided. It also simplifies the address selection logic, which was the main point for doing this. Also, for various global variables that would be loaded using SPU's A-form addressing, prefer D-form offs[reg] addressing, keeping the base in a register if the variable is used more than once. llvm-svn: 46483	2008-01-29 02:16:57 +00:00
Bill Wendling	5b6f587a80	If the function has no machine instructions, then emit a "nop" so that the function label isn't associated with something it shouldn't be. llvm-svn: 46449	2008-01-28 09:15:03 +00:00
Chris Lattner	39c52e030b	add a note llvm-svn: 46413	2008-01-27 07:31:41 +00:00
Chris Lattner	f4bc2c5718	Use fldz and fld1 for long double constants instead of a constant pool load. llvm-svn: 46411	2008-01-27 06:19:31 +00:00
Chris Lattner	00183edf55	Add some notes. llvm-svn: 46405	2008-01-26 20:12:07 +00:00
Chris Lattner	6124c0eb2d	Remove some code for inferring alignment info from the x86 backend now that the dag combiner does it. llvm-svn: 46404	2008-01-26 20:07:42 +00:00
Bill Wendling	7b83688c73	If there's no instructions being emitted on X86 for a function, emit a nop. Emit the nop directly for PPC. llvm-svn: 46398	2008-01-26 09:03:52 +00:00
Bill Wendling	1c92468074	If there are no machine instructions emitted for a function, then insert a "nop" instruction so that we don't have the function's label associated with something that it's not supposed to be associated with. llvm-svn: 46394	2008-01-26 06:51:24 +00:00
Chris Lattner	00ead854ef	JITEmitter.cpp was trying to sync the icache for function stubs, but was actually passing a completely incorrect size to sys_icache_invalidate. Instead of having the JITEmitter do this (which doesn't have the correct size), just make the target sync its own stubs. llvm-svn: 46354	2008-01-25 16:41:09 +00:00
Chris Lattner	726a4e45e5	optimize fxor like for llvm-svn: 46345	2008-01-25 06:14:17 +00:00
Chris Lattner	79076fdf2a	Add target-specific dag combines for FAND(x,0) and FOR(x,0). This allows us to compile: double test(double X) { return copysign(0.0, X); } into: _test: andpd LCPI1_0(%rip), %xmm0 ret instead of: _test: pxor %xmm1, %xmm1 andpd LCPI1_0(%rip), %xmm1 movapd %xmm0, %xmm2 andpd LCPI1_1(%rip), %xmm2 movapd %xmm1, %xmm0 orpd %xmm2, %xmm0 ret llvm-svn: 46344	2008-01-25 05:46:26 +00:00
Anton Korobeynikov	37309ed741	Provide correct DWARF register numbering for debug information emission on x86-32/Darwin. This should fix a bunch of issues. llvm-svn: 46337	2008-01-25 00:34:13 +00:00
Chris Lattner	16a8f126d3	Significantly simplify and improve handling of FP function results on x86-32. This case returns the value in ST(0) and then has to convert it to an SSE register. This causes significant codegen ugliness in some cases. For example in the trivial fp-stack-direct-ret.ll testcase we used to generate: _bar: subl $28, %esp call L_foo$stub fstpl 16(%esp) movsd 16(%esp), %xmm0 movsd %xmm0, 8(%esp) fldl 8(%esp) addl $28, %esp ret because we move the result of foo() into an XMM register, then have to move it back for the return of bar. Instead of hacking ever-more special cases into the call result lowering code we take a much simpler approach: on x86-32, fp return is modeled as always returning into an f80 register which is then truncated to f32 or f64 as needed. Similarly for a result, we model it as an extension to f80 + return. This exposes the truncate and extensions to the dag combiner, allowing target independent code to hack on them, eliminating them in this case. This gives us this code for the example above: _bar: subl $12, %esp call L_foo$stub addl $12, %esp ret The nasty aspect of this is that these conversions are not legal, but we want the second pass of dag combiner (post-legalize) to be able to hack on them. To handle this, we lie to legalize and say they are legal, then custom expand them on entry to the isel pass (PreprocessForFPConvert). This is gross, but less gross than the code it is replacing :) This also allows us to generate better code in several other cases. For example on fp-stack-ret-conv.ll, we now generate: _test: subl $12, %esp call L_foo$stub fstps 8(%esp) movl 16(%esp), %eax cvtss2sd 8(%esp), %xmm0 movsd %xmm0, (%eax) addl $12, %esp ret where before we produced (incidentally, the old bad code is identical to what gcc produces): _test: subl $12, %esp call L_foo$stub fstpl (%esp) cvtsd2ss (%esp), %xmm0 cvtss2sd %xmm0, %xmm0 movl 16(%esp), %eax movsd %xmm0, (%eax) addl $12, %esp ret Note that we generate slightly worse code on pr1505b.ll due to a scheduling deficiency that is unrelated to this patch. llvm-svn: 46307	2008-01-24 08:07:48 +00:00
Evan Cheng	91089e6d66	Let each target decide byval alignment. For X86, it's 4-byte unless the aggregate contains SSE vector(s). For x86-64, it's max of 8 or alignment of the type. llvm-svn: 46286	2008-01-23 23:17:41 +00:00
Duncan Sands	aff4eef6df	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Dale Johannesen	a54401ee30	Honor explicit section information on Darwin. llvm-svn: 46267	2008-01-23 00:58:14 +00:00
Evan Cheng	d436c2e724	SSE varargs arguments are passed in memory. llvm-svn: 46262	2008-01-22 23:26:53 +00:00
Chris Lattner	c964ccb2c4	Trivial patch to fix two warnings, please pull into llvm 2.2 llvm-svn: 46243	2008-01-22 04:47:47 +00:00
Anton Korobeynikov	ad17c6bbe6	Honour ByVal parameter attribute for name decoration llvm-svn: 46200	2008-01-20 14:00:07 +00:00
Anton Korobeynikov	539129f881	Remove Darwin'ism llvm-svn: 46199	2008-01-20 13:59:37 +00:00
Anton Korobeynikov	5ac8a5b72c	Enable PIC codegen on x86-64/linux llvm-svn: 46198	2008-01-20 13:58:16 +00:00
Duncan Sands	5e1cbc1ad7	Need to handle any 'nest' parameter before integer parameters, since otherwise it won't be passed in the right register. With this change trampolines work on x86-64 (thanks to Luke Guest for providing access to an x86-64 box). llvm-svn: 46192	2008-01-19 16:42:10 +00:00
Dale Johannesen	7807e86260	Implement flt_rounds for PowerPC. llvm-svn: 46174	2008-01-18 19:55:37 +00:00
Chris Lattner	b3be660985	get symbolic information for ppc ldbl nodes. llvm-svn: 46165	2008-01-18 18:51:16 +00:00
Chris Lattner	febc7ea9bf	Fix a latent bug exposed by my truncstore patch. We compiled stfiwx-2.ll to: _test: fctiwz f0, f1 stfiwx f0, 0, r4 blr instead of: _test: fctiwz f0, f1 stfd f0, -8(r1) nop nop lwz r2, -4(r1) stb r2, 0(r4) blr The former is not correct (stores 4 bytes, not 1). llvm-svn: 46161	2008-01-18 16:54:56 +00:00
Chris Lattner	eb7d65d073	make a method public llvm-svn: 46159	2008-01-18 06:52:41 +00:00
Dale Johannesen	0e1328e880	Revert the part of 45849 that treated weak globals as weak globals rather than commons. While not wrong, this change tickled a latent bug in Darwin's strip, so revert it for now as a workaround. llvm-svn: 46147	2008-01-17 23:36:04 +00:00
Dale Johannesen	08c757e707	Revert the part of 45848 that treated weak globals as weak globals rather than commons. While not wrong, this change tickled a latent bug in Darwin's strip, so revert it for now as a workaround. llvm-svn: 46144	2008-01-17 23:04:07 +00:00
Scott Michel	506e61bad1	Forward progress: crtbegin.c now compiles successfully! Fixed CellSPU's A-form (local store) address mode, so that all globals, externals, constant pool and jump table symbols are now wrapped within a SPUISD::AFormAddr pseudo-instruction. This now identifies all local store memory addresses, although it requires a bit of legerdemain during instruction selection to properly select loads to and stores from local store, properly generating "LQA" instructions. Also added mul_ops.ll test harness for exercising integer multiplication. llvm-svn: 46142	2008-01-17 20:38:41 +00:00
Chris Lattner	41717f6989	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The latter allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	d033200a8f	* Introduce a new SelectionDAG::getIntPtrConstant method and switch various codegen pieces and the X86 backend over to using it. * Add some comments to SelectionDAGNodes.h * Introduce a second argument to FP_ROUND, which indicates whether the FP_ROUND changes the value of its input. If not it is safe to xform things like fp_extend(fp_round(x)) -> x. llvm-svn: 46125	2008-01-17 07:00:52 +00:00
Duncan Sands	78e448d8b4	Trampoline support for x86-64. This looks like it should work, but I have no machine to test it on. Committed because it will at least cause no harm, and maybe someone can test it for me! llvm-svn: 46098	2008-01-16 22:55:25 +00:00
Chris Lattner	2825f8fff0	make it more clear that this predicate only applies to scalar FP types. llvm-svn: 46058	2008-01-16 06:24:21 +00:00
Chris Lattner	fbb75278b2	introduce a isTypeInSSEReg predicate, which allows us to simplify some code. No functionality change. llvm-svn: 46055	2008-01-16 06:19:45 +00:00
Chris Lattner	41e1fd13b2	My previous commit had an incomplete message, it should have been: make the 'fp return in ST(0)' optimization smart enough to look through token factor nodes. This allows us to compile testcases like CodeGen/X86/fp-stack-retcopy.ll into: _carg: subl $12, %esp call L_foo$stub fstpl (%esp) fldl (%esp) addl $12, %esp ret instead of: _carg: subl $28, %esp call L_foo$stub fstpl 16(%esp) movsd 16(%esp), %xmm0 movsd %xmm0, 8(%esp) fldl 8(%esp) addl $28, %esp ret Still not optimal, but much better and this is a trivial patch. Fixing the rest requires invasive surgery that is not llvm 2.2 material. llvm-svn: 46054	2008-01-16 05:56:59 +00:00
Chris Lattner	1819f31640	make the 'fp return in ST(0)' optimization smart enough to look through token factor llvm-svn: 46053	2008-01-16 05:53:06 +00:00
Chris Lattner	d9f29a1e3e	various whitespace cleanups, no functionality change. llvm-svn: 46052	2008-01-16 05:52:18 +00:00
Dale Johannesen	1268083b64	Missed file from previous checkin. llvm-svn: 46030	2008-01-15 23:25:27 +00:00
Dale Johannesen	319c7bb405	Fix and enable EH for x86-64 Darwin. Adds ShortenEHDataFor64Bits as a not-very-accurate abstraction to cover all the changes in DwarfWriter. Some cosmetic changes to Darwin assembly code for gcc testsuite compatibility. llvm-svn: 46029	2008-01-15 23:24:56 +00:00
Chris Lattner	dac9c1bcfe	If someone wants to implement ppc TRAP, they can go for it :) llvm-svn: 46019	2008-01-15 22:15:02 +00:00
Chris Lattner	6846e346a8	rename SDTRet -> SDTNone. Move definition of 'trap' sdnode up from x86 instrinfo to targetselectiondag.td. llvm-svn: 46017	2008-01-15 22:02:54 +00:00
Owen Anderson	946d55e773	Remove DefInst from LiveVariables::VarInfo. Use the facilities on MachineRegisterInfo instead. llvm-svn: 46016	2008-01-15 22:02:46 +00:00
Chris Lattner	0072ce1ff4	no need to expand ISD::TRAP to X86ISD::TRAP, just match ISD::TRAP. llvm-svn: 46015	2008-01-15 21:58:22 +00:00
Anton Korobeynikov	44893ee93d	Fix JIT encoding of trap/ud2 instruction llvm-svn: 46012	2008-01-15 21:40:02 +00:00
Anton Korobeynikov	08ea121968	For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed as well as PPC codegen llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Evan Cheng	d7403ac663	Rename CCIfStruct to CCIfByVal and CCStructAssign to CCPassByVal. Remove unused parameters of CCStructAssign and add size and alignment requirement info. llvm-svn: 45997	2008-01-15 03:34:58 +00:00
Evan Cheng	4eb7ae04c5	Both x86-32 and x86-64 handle byval parameter attributes. llvm-svn: 45996	2008-01-15 03:15:41 +00:00
Chris Lattner	554971a6d6	Improve the FP stackifier to decide all on its own whether an instruction kills a register or not. This is cheap and easy to do now that instructions record this on their flags, and this eliminates the second pass of LiveVariables from the x86 backend. This speeds up a release llc by ~2.5%. llvm-svn: 45955	2008-01-14 06:41:29 +00:00
Duncan Sands	0c362b8606	Whitespace tweak. llvm-svn: 45940	2008-01-13 21:20:29 +00:00
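
Note on commit d033200a8f: its message says that fp_extend(fp_round(x)) may be folded to x only when the FP_ROUND does not change the value of its input. The following is a minimal sketch of why that condition matters, using plain C++ float/double conversions as a stand-in for the f64 -> f32 -> f64 case. The helper names roundThenExtend and roundIsExact are hypothetical and are not part of LLVM.

```cpp
#include <cassert>

// Hypothetical helper (not an LLVM API): models fp_extend(fp_round(x))
// for an f64 value that is rounded to f32 and then extended back to f64.
double roundThenExtend(double x) {
  // volatile forces the narrowed value to be stored as a real 32-bit
  // float, so excess precision cannot hide the rounding step.
  volatile float narrowed = static_cast<float>(x);  // fp_round: f64 -> f32
  return static_cast<double>(narrowed);             // fp_extend: f32 -> f64
}

// True when rounding x to f32 loses no information. Only in this case
// does replacing fp_extend(fp_round(x)) with x preserve the value.
bool roundIsExact(double x) {
  return roundThenExtend(x) == x;
}
```

Under these assumptions, roundIsExact(0.5) holds because 0.5 is exactly representable as a float, while roundIsExact(0.1) does not, so folding the round/extend pair away for 0.1 would change the result.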

7875 Commits