llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 14:32:51 +01:00

Author	SHA1	Message	Date
Chris Lattner	e16166b78d	implement __builtin_return_addr(0) on ppc. llvm-svn: 44700	2007-12-08 06:59:59 +00:00
Chris Lattner	e59a7ee26a	Implement ExpandOperationResult for ppc i64 fp->int, which fixes CodeGen/Generic/fp_to_int.ll among others. It's unclear why this just started failing... llvm-svn: 44407	2007-11-28 18:44:47 +00:00
Bill Wendling	cc75435ebf	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Dale Johannesen	94241a8d3a	Disable a couple more things for ppcf128. llvm-svn: 43267	2007-10-23 23:20:14 +00:00
Evan Cheng	85eb733eff	Use ptr type in the immediate field of a BxA instruction so we don't end up selecting 32-bit call instruction for ppc64. llvm-svn: 43228	2007-10-22 19:46:19 +00:00
Chris Lattner	4354f2db6a	comment fixes llvm-svn: 43168	2007-10-19 04:08:28 +00:00
Dale Johannesen	b23b0bfa8f	More ppcf128 issues (maybe the last)? llvm-svn: 43160	2007-10-19 00:59:18 +00:00
Chris Lattner	c641c8c6ec	Change LowerFP_TO_SINT to create the specific code it needs instead of unconditionally creating an i64 bitcast. With the future legalizer design, operation legalization can't introduce new nodes with illegal types. This fixes the rest of olden on ppc32. llvm-svn: 43005	2007-10-15 20:14:52 +00:00
Dale Johannesen	6c89945eb8	Fix type mismatch error in PPC Altivec (only causes a problem when asserts are on). From vecLib. llvm-svn: 42959	2007-10-14 01:58:32 +00:00
Dan Gohman	171fb68ae0	Mark vector pow, ctpop, cttz, and ctlz as Expand on PowerPC. llvm-svn: 42904	2007-10-12 14:08:57 +00:00
Dan Gohman	edc841fb53	Set ISD::FPOW to Expand. llvm-svn: 42881	2007-10-11 23:21:31 +00:00
Dale Johannesen	76458ddf1e	Next PPC long double bits: ppcf128->i32 conversion. Surprisingly complicated. Adds getTargetNode for 2 outputs, no inputs (missing). llvm-svn: 42822	2007-10-10 01:01:31 +00:00
Dan Gohman	ae3b47b06f	When we start enabling SMUL_LOHI/UMUL_LOHI or SDIVREM/UDIVREM in target-independent lowering, don't use them on PowerPC. llvm-svn: 42755	2007-10-08 17:28:24 +00:00
Dale Johannesen	9b7ac95116	Next powerpc long double bits. Comparisons work, although not well, and shortening FP converts. llvm-svn: 42672	2007-10-06 01:24:11 +00:00
Dale Johannesen	c7b51b678d	First round of ppc long double. call/return and basic arithmetic works. Rename RTLIB long double functions to distinguish different flavors of long double; the lib functions have different names, alas. llvm-svn: 42644	2007-10-05 20:04:43 +00:00
Duncan Sands	c358890f73	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Owen Anderson	4b71e55287	Add lengthof and endof templates that hide a lot of sizeof computations. Patch by Sterling Stein! llvm-svn: 41758	2007-09-07 04:06:50 +00:00
Dale Johannesen	81d6ecb886	Enhance APFloat to retain bits of NaNs (fixes oggenc). Use APFloat interfaces for more references, mostly of ConstantFPSDNode. llvm-svn: 41632	2007-08-31 04:03:46 +00:00
Bill Wendling	c65cf7849d	Use i64 on a PPC64 machine llvm-svn: 41590	2007-08-30 00:59:19 +00:00
Chris Lattner	1e089aac3a	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Evan Cheng	ff50061170	Vector fneg must be expanded into fsub -0.0, X. llvm-svn: 40586	2007-07-30 07:51:22 +00:00
Duncan Sands	e8bb2c6d32	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Lauro Ramos Venancio	18fc770fd0	Assert when TLS is not implemented. llvm-svn: 39737	2007-07-11 17:19:51 +00:00
Dan Gohman	81cfdc2f19	Change getCopyToParts and getCopyFromParts to always use target-endian register ordering, for both physical and virtual registers. Update the PPC target lowering for calls to expect registers for the call result to already be in target order. llvm-svn: 38471	2007-07-09 20:59:04 +00:00
Dan Gohman	a62327ea40	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Chris Lattner	81e8a18e7c	describe an argument, hide it. llvm-svn: 37650	2007-06-19 05:46:06 +00:00
Chris Lattner	e13fac05d7	If a function is vararg, never pass inreg arguments in registers. Thanks to Anton for half of this patch. llvm-svn: 37641	2007-06-19 00:13:10 +00:00
Dan Gohman	2fd7d26df8	Rename MVT::getVectorBaseType to MVT::getVectorElementType. llvm-svn: 37579	2007-06-14 22:58:02 +00:00
Dan Gohman	875f6bde73	Apply this patch: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20070514/049845.html llvm-svn: 37240	2007-05-18 23:21:46 +00:00
Chris Lattner	4861b958f1	fix some subtle inline asm selection issues llvm-svn: 37067	2007-05-15 01:31:05 +00:00
Chris Lattner	b4ef9c8be3	Fix a bug in PPCTargetLowering::isLegalAddressingMode, scales other than 0/1/2 are always unsupported. llvm-svn: 35835	2007-04-09 22:10:05 +00:00
Nicolas Geoffray	681a87d9e8	Starting implementation of the ELF32 ABI specification of varargs handling. LowerVASTART emits the right code if the subtarget is ELF32, the other intrinsics (VAARG, VACOPY and VAEND) are not yet implemented. llvm-svn: 35625	2007-04-03 13:59:52 +00:00
Nicolas Geoffray	5897c064a6	The PPC64 ELF ABI is "intended to use the same structure layout and calling convention rules as the 64-bit PowerOpen ABI" (Reference http://www.linux-foundation.org/spec/ELF/ppc64/). Change all ELF tests to ELF32. llvm-svn: 35624	2007-04-03 12:35:28 +00:00
Nicolas Geoffray	b7c0895529	The ELF ABI specifies F1-F8 registers as argument registers for double, not F1-F10. This affects only ELF, not MachO. llvm-svn: 35622	2007-04-03 10:27:07 +00:00
Chris Lattner	c0405a348d	implement the new addressing mode description hook. llvm-svn: 35521	2007-03-30 23:15:24 +00:00
Lauro Ramos Venancio	99fca527d3	"The C standards do say that "char" may either be a "signed char" or "unsigned char" and it is up to the compilers implementation or the platform which is followed." http://www.arm.linux.org.uk/docs/faqs/signedchar.php llvm-svn: 35382	2007-03-27 16:33:08 +00:00
Chris Lattner	b19069959d	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Nicolas Geoffray	9c77df75ea	Stack and register alignment of call arguments in the ELF ABI llvm-svn: 35083	2007-03-13 15:02:46 +00:00
Evan Cheng	06d83c8fce	More flexible TargetLowering LSR hooks for testing whether an immediate is a legal target address immediate or scale. llvm-svn: 35074	2007-03-12 23:29:01 +00:00
Chris Lattner	26a5492049	Switch PPC return lower to use an autogenerated CC description. llvm-svn: 34940	2007-03-06 00:59:59 +00:00
Nicolas Geoffray	4b5b81198a	Implemented the frameaddress intrinsic for PPC. llvm-svn: 34787	2007-03-01 13:11:38 +00:00
Nicolas Geoffray	a562e5c1c5	Differentiate between the MachO and the ELF ABI the CALL instruction. llvm-svn: 34667	2007-02-27 13:01:19 +00:00
Chris Lattner	d4cd3a31e6	always lower to RETFLAG, never leave it as just ret. llvm-svn: 34639	2007-02-26 19:44:02 +00:00
Chris Lattner	796625a49d	no really, this is the right patch llvm-svn: 34605	2007-02-25 20:01:40 +00:00
Chris Lattner	49fc72110a	always promote float varargs to double. llvm-svn: 34604	2007-02-25 19:59:18 +00:00
Chris Lattner	041fb5bc67	implement support for the linux/ppc function call ABI. Patch by Nicolas Geoffray! llvm-svn: 34574	2007-02-25 05:34:32 +00:00
Jim Laskey	b57ee1fc37	Simplify lowering and selection of exception ops. llvm-svn: 34488	2007-02-22 14:56:36 +00:00
Jim Laskey	6a937ad320	Support to provide exception and selector registers. llvm-svn: 34482	2007-02-21 22:54:50 +00:00
Chris Lattner	e3eae5e265	Fix ixaddrs as well, allowing ppc64 to compile to: _test2: li r2, 0 lis r3, 1 std r2, 9024(r3) blr instead of: _test2: lis r2, 1 li r3, 0 ori r2, r2, 9024 std r3, 0(r2) blr This implements CodeGen/PowerPC/LargeAbsoluteAddr.ll:test2 llvm-svn: 34373	2007-02-17 06:57:26 +00:00
Chris Lattner	50411d5be7	Compile test/CodeGen/PowerPC/LargeAbsoluteAddr.ll to: _test: lis r2, 743 li r3, 0 stw r3, 32751(r2) blr instead of: _test: li r2, 0 stw r2, 32751(48693248) blr Implement support for ppc64 as well, allowing it to produce better code. llvm-svn: 34371	2007-02-17 06:44:03 +00:00
Nate Begeman	dc46021355	Finish off bug 680, allowing targets to custom lower frame and return address nodes. llvm-svn: 33636	2007-01-29 22:58:52 +00:00
Anton Korobeynikov	611d5e2eda	Propagate changes from my local tree. This patch includes: 1. New parameter attribute called 'inreg'. It has meaning "place this parameter in registers, if possible". This is some generalization of gcc's regparm(n) attribute. It's currently used only in X86-32 backend. 2. Completely rewritten CC handling/lowering code inside X86 backend. Merged stdcall + c CCs and fastcall + fast CC. 3. Dropped CSRET CC. We cannot add struct return variant for each target-specific CC (e.g. stdcall + csretcc and so on). 4. Instead of CSRET CC introduced 'sret' parameter attribute. Setting it on first attribute has meaning 'This is hidden pointer to structure return. Handle it gently'. 5. Fixed small bug in llvm-extract + add new feature to FunctionExtraction pass, which relinks all internal-linkaged callees from deleted function to external linkage. This will allow further linking everything together. NOTEs: 1. Documentation will be updated soon. 2. llvm-upgrade should be improved to translate csret => sret. Before this, there will be some unexpected test fails. llvm-svn: 33597	2007-01-28 13:31:35 +00:00
Jim Laskey	23ed7d2625	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Evan Cheng	5ba049eabf	setSetCCIsExpensive is gone. llvm-svn: 32941	2007-01-05 23:42:53 +00:00
Jim Laskey	721e7d2129	Provide support for FP_TO_UINT. llvm-svn: 32599	2006-12-15 14:32:57 +00:00
Chris Lattner	fb4898fdd8	Another step forward in PPC64 JIT support: we now no-longer need stubs emitted for external globals in PPC64-JIT-PIC mode (which is good because we didn't handle them before!). This also fixes a bug handling the picbase delta, which we would get wrong in some cases. llvm-svn: 32451	2006-12-11 23:22:45 +00:00
Jim Laskey	e2a261ad36	Missing opcode. llvm-svn: 32439	2006-12-11 18:45:56 +00:00
Anton Korobeynikov	e76b69846d	Cleaned setjmp/longjmp lowering interfaces. Now we're producing right code (both asm & cbe) for Mingw32 target. Removed autoconf checks for underscored versions of setjmp/longjmp. llvm-svn: 32415	2006-12-10 23:12:42 +00:00
Chris Lattner	f5fd4be9dd	Fix i64 uint_to_fp on ppc64 llvm-svn: 32297	2006-12-07 01:24:16 +00:00
Jim Laskey	1dcda902cd	Restoration of the stack pointer after a deallocation of an alloca was not updating the SP link. llvm-svn: 32202	2006-12-04 22:04:42 +00:00
Jim Laskey	8f43fbf759	1. In ppc64 mode we need only use one GPR. 2. Float values need to be promoted to double when they are vararg. llvm-svn: 32074	2006-12-01 16:30:47 +00:00
Chris Lattner	860908f98c	Fix the CodeGen/PowerPC/vec_constants.ll regression. llvm-svn: 32057	2006-12-01 01:45:39 +00:00
Chris Lattner	3219b522c8	Fix bug codegen'ing FP constant vectors with integer splats. Make sure the created intrinsics have the right integer types. This fixes PowerPC/2006-11-29-AltivecFPSplat.ll llvm-svn: 32024	2006-11-29 19:58:49 +00:00
Jim Laskey	00bcb51943	Offset for load of 32-bit arg in 64-bit world was incorrect. llvm-svn: 32019	2006-11-29 13:37:09 +00:00
Jim Laskey	7b0a74da3c	Remove debug code. llvm-svn: 31970	2006-11-28 18:27:02 +00:00
Jim Laskey	a5c5ceb212	32-bit int space was not accounted for properly in lowerCall. llvm-svn: 31966	2006-11-28 14:53:52 +00:00
Evan Cheng	98fa7ab4d7	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Chris Lattner	eb9b1840b3	on ppc64, float arguments take 8-byte stack slots not 4-byte stack slots. Also, valist should create a pointer RC reg class value, not a GPRC value. llvm-svn: 31840	2006-11-18 01:57:19 +00:00
Chris Lattner	0d88b19f2f	convert PPC::BCC to use the 'pred' operand instead of separate predicate value and CR reg #. This requires swapping the order of these everywhere that touches BCC and requires us to write custom matching logic for PPCcondbranch :( llvm-svn: 31835	2006-11-17 22:37:34 +00:00
Chris Lattner	73329ae80d	rename PPC::COND_BRANCH to PPC::BCC llvm-svn: 31834	2006-11-17 22:14:47 +00:00
Chris Lattner	1527483a15	start using PPC predicates more consistently. llvm-svn: 31833	2006-11-17 22:10:59 +00:00
Jim Laskey	8aac7dc0ee	This is a general clean up of the PowerPC ABI. Address several problems and bugs including making sure that the TOS links back to the previous frame, that the maximum call frame size is not included twice when using frame pointers, no longer growing the frame on calls, double storing of SP and a cleaner/faster dynamic alloca. llvm-svn: 31792	2006-11-16 22:43:37 +00:00
Chris Lattner	9bc55a6c38	fix ldu/stu jit encoding. Switch 64-bit preinc load instrs to use memri addrmodes. llvm-svn: 31757	2006-11-15 19:55:13 +00:00
Chris Lattner	e3a81b796c	lower "X = seteq Y, Z" to '(shr (ctlz (xor Y, Z)), 5)' instead of '(shr (ctlz (sub Y, Z)), 5)'. The use of xor better exposes the operation to bit-twiddling logic in the dag combiner. For example, this: typedef struct { unsigned prefix : 4; unsigned code : 4; unsigned unsigned_p : 4; } tree_common; int foo(tree_common a, tree_common b) { return a->code == b->code; } Now compiles to: _foo: lwz r2, 0(r4) lwz r3, 0(r3) xor r2, r3, r2 rlwinm r2, r2, 28, 28, 31 cntlzw r2, r2 srwi r3, r2, 5 blr instead of: _foo: lbz r2, 3(r4) lbz r3, 3(r3) srwi r2, r2, 4 srwi r3, r3, 4 subf r2, r2, r3 cntlzw r2, r2 srwi r3, r2, 5 blr saving a cycle. llvm-svn: 31725	2006-11-14 05:28:08 +00:00
Chris Lattner	fdffc51d57	minor tweaks, reject vector preinc. llvm-svn: 31717	2006-11-14 01:38:31 +00:00
Chris Lattner	9c5d395b0e	ppc64 doesn't have lwau, don't attempt to form it. llvm-svn: 31656	2006-11-11 00:08:42 +00:00
Chris Lattner	1aaa5f904c	implement preinc support for r+i loads on ppc64 llvm-svn: 31654	2006-11-10 23:58:45 +00:00
Chris Lattner	1604b6a873	add an initial cut at preinc loads for ppc32. This is broken for ppc64 (because the 64-bit reg target versions aren't implemented yet), doesn't support r+r addr modes, and doesn't handle stores, but it works otherwise. :) This is disabled unless -enable-ppc-preinc is passed to llc for now. llvm-svn: 31621	2006-11-10 02:08:47 +00:00
Evan Cheng	d7df1a37bc	PPC supports i32 / i64 pre-inc load / store. llvm-svn: 31599	2006-11-09 19:11:50 +00:00
Evan Cheng	89ee587963	Rename ISD::MemOpAddrMode to ISD::MemIndexedMode llvm-svn: 31595	2006-11-09 17:55:04 +00:00
Chris Lattner	bd39c99fd1	Refactor all the addressing mode selection stuff into the isel lowering class, where it can be used for preinc formation. llvm-svn: 31536	2006-11-08 02:15:41 +00:00
Reid Spencer	4bafa71dc1	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Chris Lattner	38c04a6bd6	Implement the getRegForInlineAsmConstraint method for PPC. With recent sdisel changes, this eliminates a ton of copies around common inline asms. For example: int test2(int Y, int X) { asm("foo %0, %1" : "=r"(X): "r"(X)); return X; } now compiles to: _test2: foo r3, r4 blr instead of: _test2: mr r2, r4 foo r2, r2 mr r3, r2 blr GCC produces: _test2: foo r4, r4 mr r3,r4 blr llvm-svn: 31367	2006-11-02 01:44:04 +00:00
Chris Lattner	d9afd310a6	Change the prototype for TargetLowering::isOperandValidForConstraint llvm-svn: 31318	2006-10-31 19:40:43 +00:00
Evan Cheng	5766dd6455	All targets expand BR_JT for now. llvm-svn: 31294	2006-10-30 08:02:39 +00:00
Chris Lattner	0f686ec438	set the ppc64 stack pointer right, dynamic alloca now works for ppc64 llvm-svn: 31028	2006-10-18 01:20:43 +00:00
Chris Lattner	6c403f7102	Expand alloca for ppc64 llvm-svn: 31027	2006-10-18 01:18:48 +00:00
Evan Cheng	fe5bb5dbe6	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Evan Cheng	d22f3dd3ed	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Evan Cheng	275825195a	Make use of getStore(). llvm-svn: 30759	2006-10-05 23:01:46 +00:00
Evan Cheng	494e8e6971	Combine ISD::EXTLOAD, ISD::SEXTLOAD, ISD::ZEXTLOAD into ISD::LOADX. Add an extra operand to LOADX to specify the exact value extension type. llvm-svn: 30714	2006-10-04 00:56:09 +00:00
Chris Lattner	3057944738	Legalize is no longer limited to cleverness with just constant shift amounts. Allow it to be clever when possible and fall back to the gross code when needed. This allows us to compile: long long foo1(long long X, int C) { return X << (C|32); } long long foo2(long long X, int C) { return X << (C&~32); } to: _foo1: rlwinm r2, r5, 0, 27, 31 slw r3, r4, r2 li r4, 0 blr .globl _foo2 .align 4 _foo2: rlwinm r2, r5, 0, 27, 25 subfic r5, r2, 32 slw r3, r3, r2 srw r5, r4, r5 or r3, r3, r5 slw r4, r4, r2 blr instead of: _foo1: ori r2, r5, 32 subfic r5, r2, 32 addi r6, r2, -32 srw r5, r4, r5 slw r3, r3, r2 slw r6, r4, r6 or r3, r3, r5 slw r4, r4, r2 or r3, r3, r6 blr .globl _foo2 .align 4 _foo2: rlwinm r2, r5, 0, 27, 25 subfic r5, r2, 32 addi r6, r2, -32 srw r5, r4, r5 slw r3, r3, r2 slw r6, r4, r6 or r3, r3, r5 slw r4, r4, r2 or r3, r3, r6 blr llvm-svn: 30507	2006-09-20 03:47:40 +00:00
Chris Lattner	92c8924309	Fold the PPCISD shifts when presented with 0 inputs. This occurs for code like: long long test(long long X, int Y) { return 1ULL << Y; } long long test2(long long X, int Y) { return -1LL << Y; } which we used to compile to: _test: li r2, 1 subfic r3, r5, 32 li r4, 0 addi r6, r5, -32 srw r3, r2, r3 slw r4, r4, r5 slw r6, r2, r6 or r3, r4, r3 slw r4, r2, r5 or r3, r3, r6 blr _test2: li r2, -1 subfic r3, r5, 32 addi r6, r5, -32 srw r3, r2, r3 slw r4, r2, r5 slw r2, r2, r6 or r3, r4, r3 or r3, r3, r2 blr Now we produce: _test: li r2, 1 addi r3, r5, -32 subfic r4, r5, 32 slw r3, r2, r3 srw r4, r2, r4 or r3, r4, r3 slw r4, r2, r5 blr _test2: li r2, -1 subfic r3, r5, 32 addi r6, r5, -32 srw r3, r2, r3 slw r4, r2, r5 slw r2, r2, r6 or r3, r4, r3 or r3, r3, r2 blr llvm-svn: 30479	2006-09-19 05:22:59 +00:00
Evan Cheng	dd52a60189	Reflects MachineConstantPoolEntry changes. llvm-svn: 30279	2006-09-12 21:04:05 +00:00
Reid Spencer	2567610703	For PR387: Close out this long standing bug by removing the remaining overloaded virtual functions in LLVM. The -Woverloaded-virtual option is now turned on. llvm-svn: 29934	2006-08-28 01:02:49 +00:00
Chris Lattner	c482a5d057	Fix a bug in a recent refactoring that broke a bunch of stuff. llvm-svn: 29649	2006-08-12 07:20:05 +00:00
Chris Lattner	8ca6e82bce	Eliminate use of getNode that takes a vector. llvm-svn: 29614	2006-08-11 17:38:39 +00:00
Chris Lattner	2f9c4426fc	Convert vectors to fixed sized arrays and smallvectors. Eliminate use of getNode that takes a vector. llvm-svn: 29609	2006-08-11 17:18:05 +00:00
Chris Lattner	7e905fba17	Fix miscompilation of float vector returns. Compile code to this: _func: vsldoi v2, v3, v2, 12 vsldoi v2, v2, v2, 4 blr instead of: _func: vsldoi v2, v3, v2, 12 vsldoi v2, v2, v2, 4 *** vor f1, v2, v2 blr llvm-svn: 29607	2006-08-11 16:47:32 +00:00
Chris Lattner	51e1b75fba	Fix some ppc64 issues with vector code. llvm-svn: 29384	2006-07-28 16:45:47 +00:00
Chris Lattner	b4165c39d7	Rename RelocModel::PIC to PIC_, to avoid conflicts with -DPIC. llvm-svn: 29307	2006-07-26 21:12:04 +00:00
Chris Lattner	abaaddc214	Implement Regression/CodeGen/PowerPC/bswap-load-store.ll by folding bswaps into i16/i32 load/stores. llvm-svn: 29089	2006-07-10 20:56:58 +00:00
Chris Lattner	2c3f67f6a7	Implement 64-bit select, bswap, etc. llvm-svn: 28935	2006-06-27 20:14:52 +00:00
Chris Lattner	8569f4042d	PPC doesn't have bit converts to/from i64 llvm-svn: 28932	2006-06-27 18:40:08 +00:00
Chris Lattner	26f2bd4d4b	Implement 64-bit undef, sub, shl/shr, srem/urem llvm-svn: 28929	2006-06-27 18:18:41 +00:00
Chris Lattner	b4a636f966	Use i32 for shift amounts instead of i64. This gets bisort working. llvm-svn: 28927	2006-06-27 17:34:57 +00:00
Chris Lattner	494f476ca7	Implement a bunch of 64-bit cleanliness work. With this, treeadd builds (but doesn't work right). llvm-svn: 28921	2006-06-27 00:04:13 +00:00
Chris Lattner	cbd4d14b24	Improve PPC64 calling convention support llvm-svn: 28919	2006-06-26 22:48:35 +00:00
Chris Lattner	5fa6e47534	Correct returns of 64-bit values, though they seemed to work before... llvm-svn: 28892	2006-06-21 00:34:03 +00:00
Chris Lattner	81845946ff	fix some assumptions that pointers can only be 32-bits. With this, we can now compile: static unsigned long X; void test1() { X = 0; } into: _test1: lis r2, ha16(_X) li r3, 0 stw r3, lo16(_X)(r2) blr Totally amazing :) llvm-svn: 28839	2006-06-16 21:01:35 +00:00
Chris Lattner	fa884ac11b	Rename some subtarget features. A CPU now can have 64-bit instructions, and in 32-bit mode we can choose to optionally use 64-bit registers. llvm-svn: 28824	2006-06-16 17:34:12 +00:00
Evan Cheng	32feafd76c	Type of extract_element index operand should be iPTR. llvm-svn: 28797	2006-06-15 08:18:06 +00:00
Chris Lattner	b231c3d11c	Fix a problem exposed by the local allocator. CALL instructions are not marked as using incoming argument registers, so the local allocator would clobber them between their set and use. To fix this, we give the call instructions a variable number of uses in the CALL MachineInstr itself, so live variables understands the live ranges of these register arguments. llvm-svn: 28744	2006-06-10 01:14:28 +00:00
Chris Lattner	31b150e334	Always reserve space for 8 spilled GPRs. GCC apparently assumes that this space will be available, even if the callee isn't varargs. llvm-svn: 28571	2006-05-30 21:21:04 +00:00
Evan Cheng	de0f25081a	Change RET node to include signedness information of the return values. i.e. RET chain, value1, sign1, value2, sign2, ... llvm-svn: 28510	2006-05-26 23:10:12 +00:00
Evan Cheng	4a74dd0c51	CALL node change (arg / sign pairs instead of just arguments). llvm-svn: 28462	2006-05-25 00:57:32 +00:00
Chris Lattner	f604017e47	Patches to make the LLVM sources more -pedantic clean. Patch provided by Anton Korobeynikov! This is a step towards closing PR786. llvm-svn: 28447	2006-05-24 17:04:05 +00:00
Chris Lattner	bc3be2ff8a	Fix CodeGen/Generic/vector.ll:test_div with altivec. llvm-svn: 28445	2006-05-24 00:15:25 +00:00
Chris Lattner	56862bbd53	Handle SETO* like we handle SET*, restoring behavior after Evan's setcc change. This fixes PowerPC/fnegsel.ll. llvm-svn: 28443	2006-05-24 00:06:44 +00:00
Chris Lattner	2208c3214c	Make PPC call lowering more aggressive, making the isel matching code simple enough to be autogenerated. llvm-svn: 28354	2006-05-17 19:00:46 +00:00
Chris Lattner	03c70b7f27	Switch PPC over to a call-selection model where the lowering code creates the copyto/fromregs instead of making the PPCISD::CALL selection code create them. This vastly simplifies the selection code, and moves the ABI handling parts into one place. llvm-svn: 28346	2006-05-17 06:01:33 +00:00
Chris Lattner	348883611c	3 changes, 2 of which are cleanup one of which changes codegen: 1. Rearrange code a bit so that the special case doesn't require indenting lots of code. 2. Add comments describing PPC calling convention. 3. Only round up to 56-bytes of stack space for an outgoing call if the callee is varargs. This saves a bit of stack space. llvm-svn: 28342	2006-05-17 00:15:40 +00:00
Chris Lattner	a36579803f	implement passing/returning vector regs to calls, at least non-varargs calls. llvm-svn: 28341	2006-05-16 23:54:25 +00:00
Chris Lattner	b5271a0f4c	Instead of implementing LowerCallTo directly, let the default impl produce an ISD::CALL node, then custom lower that. This means that we only have to handle LEGAL call operands/results, not every possible type. This allows us to simplify the call code, shrinking it by about 1/3. llvm-svn: 28339	2006-05-16 22:56:08 +00:00
Chris Lattner	40d1eaad0a	Simplify the argument counting logic by only incrementing the index. llvm-svn: 28335	2006-05-16 18:58:15 +00:00
Chris Lattner	0ae068ed8f	Simplify the dead argument handling code. llvm-svn: 28334	2006-05-16 18:54:32 +00:00
Chris Lattner	fbbe542235	Vector args passed in registers don't reserve stack space. llvm-svn: 28333	2006-05-16 18:51:52 +00:00
Chris Lattner	0a12e343e2	Switch the PPC backend over to using FORMAL_ARGUMENTS for formal argument handling. This makes the lower argument code significantly simpler (we only need to handle legal argument types). Incidentally, this also implements support for vector argument registers, so long as they are not on the stack. llvm-svn: 28331	2006-05-16 18:18:50 +00:00
Chris Lattner	199f3f6af8	Fit in 80 cols llvm-svn: 28311	2006-05-16 04:20:24 +00:00
Chris Lattner	adcb0582d8	Remove dead var, fix bad override. llvm-svn: 28264	2006-05-12 21:09:57 +00:00
Chris Lattner	e3de67fae2	Fix CodeGen/Generic/2006-04-28-Sign-extend-bool.ll llvm-svn: 28017	2006-04-28 21:56:10 +00:00
Nate Begeman	7ed816f900	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Chris Lattner	47a41ae889	Fix a crash on: void foo2(vector float A, vector float B) { vector float C = (vector float)vec_cmpeq(A, B); if (!vec_any_eq(A, B)) B = (vector float){0,0,0,0}; A = C; } llvm-svn: 27808	2006-04-18 18:28:22 +00:00
Chris Lattner	2bd91746e1	pretty print node name llvm-svn: 27806	2006-04-18 18:05:58 +00:00
Chris Lattner	44ea12c5f8	Implement an important entry from README_ALTIVEC: If an altivec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which requires a compare to get it back to CR's. Instead, just branch on CR6 directly. :) For example, for: void foo2(vector float A, vector float B) { if (!vec_any_eq(A, B)) *B = (vector float){0,0,0,0}; } We now generate: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 bne cr6, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr instead of: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 cmpwi cr0, r3, 0 beq cr0, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr This implements CodeGen/PowerPC/vec_br_cmp.ll. llvm-svn: 27804	2006-04-18 17:59:36 +00:00
Chris Lattner	e90fdf3b98	Use vmladduhm to do v8i16 multiplies which is faster and simpler than doing even/odd halves. Thanks to Nate telling me what's what. llvm-svn: 27793	2006-04-18 04:28:57 +00:00
Chris Lattner	5951b60cb4	Implement v16i8 multiply with this code: vmuloub v5, v3, v2 vmuleub v2, v3, v2 vperm v2, v2, v5, v4 This implements CodeGen/PowerPC/vec_mul.ll. With this, v16i8 multiplies are 6.79x faster than before. Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with GCC. Remove the 'integer multiplies' todo from the README file. llvm-svn: 27792	2006-04-18 03:57:35 +00:00
Chris Lattner	4d84b56e64	Lower v8i16 multiply into this code: li r5, lo16(LCPI1_0) lis r6, ha16(LCPI1_0) lvx v4, r6, r5 vmulouh v5, v3, v2 vmuleuh v2, v3, v2 vperm v2, v2, v5, v4 where v4 is: LCPI1_0: ; <16 x ubyte> .byte 2 .byte 3 .byte 18 .byte 19 .byte 6 .byte 7 .byte 22 .byte 23 .byte 10 .byte 11 .byte 26 .byte 27 .byte 14 .byte 15 .byte 30 .byte 31 This is 5.07x faster on the G5 (measured) than lowering to scalar code + loads/stores. llvm-svn: 27789	2006-04-18 03:43:48 +00:00
Chris Lattner	613d7fda64	Custom lower v4i32 multiplies into a cute sequence, instead of having legalize scalarize the sequence into 4 mullw's and a bunch of load/store traffic. This speeds up v4i32 multiplies 4.1x (measured) on a G5. This implements PowerPC/vec_mul.ll llvm-svn: 27788	2006-04-18 03:24:30 +00:00
Chris Lattner	f2347c31b4	Make sure to check splats of every constant we can, handle splat(31) by being a bit more clever, add support for odd splats from -31 to -17. llvm-svn: 27764	2006-04-17 18:09:22 +00:00
Chris Lattner	cc4222d95b	Teach the ppc backend to use rol and vsldoi to generate splatted constants. This implements vec_constants.ll:test_vsldoi and test_rol llvm-svn: 27760	2006-04-17 17:55:10 +00:00
Chris Lattner	2d8d6c9feb	Make some code more general, adding support for constant formation of several new patterns. llvm-svn: 27754	2006-04-17 06:58:41 +00:00
Chris Lattner	9dd4ebffca	Learn how to make odd splatted constants in range [17,29]. This implements PowerPC/vec_constants.ll:test_29. llvm-svn: 27752	2006-04-17 06:07:44 +00:00
Chris Lattner	72a67a5b1f	Pull some code out into a helper function. Efficiently codegen even splats in the range [-32,30]. This allows us to codegen <30,30,30,30> as: vspltisw v0, 15 vadduwm v2, v0, v0 instead of as a cp load. llvm-svn: 27750	2006-04-17 06:00:21 +00:00
Chris Lattner	5367a73dec	Implement a TODO: for any shuffle that can be viewed as a v4[if]32 shuffle, if it can be implemented in 3 or fewer discrete altivec instructions, codegen it as such. This implements Regression/CodeGen/PowerPC/vec_perf_shuffle.ll llvm-svn: 27748	2006-04-17 05:28:54 +00:00
Chris Lattner	d86516991a	Implement a TODO: have the legalizer canonicalize a bunch of operations to one type (v4i32) so that we don't have to write patterns for each type, and so that more CSE opportunities are exposed. llvm-svn: 27731	2006-04-16 01:37:57 +00:00
Chris Lattner	f4126f0db7	Make the BUILD_VECTOR lowering code much more aggressive w.r.t constant vectors. Remove some done items from the todo list. llvm-svn: 27729	2006-04-16 01:01:29 +00:00
Chris Lattner	44245f11c3	Fix a crash when faced with a shuffle vector that has an undef in its mask. llvm-svn: 27726	2006-04-15 23:48:05 +00:00
Chris Lattner	5c9d357d7c	Allow undef in a shuffle mask llvm-svn: 27714	2006-04-14 23:19:08 +00:00
Chris Lattner	cf80e569f6	Move the rest of the PPCTargetLowering::LowerOperation cases out into separate functions, for simplicity and code clarity. llvm-svn: 27693	2006-04-14 06:01:58 +00:00
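
Note on commit e3a81b796c (llvm-svn: 31725): the message describes lowering "X = seteq Y, Z" to (shr (ctlz (xor Y, Z)), 5) rather than (shr (ctlz (sub Y, Z)), 5). The sketch below only illustrates why both forms compute equality for 32-bit values; it is not code from the LLVM tree. The helper names ctlz32, seteq_via_xor and seteq_via_sub are hypothetical, and ctlz32 is assumed to return 32 for a zero input, as the cntlzw instruction in the commit's assembly does.

```c
#include <stdint.h>

/* Count leading zero bits of a 32-bit value.
   Assumption: returns 32 for an input of 0, like PowerPC cntlzw. */
static unsigned ctlz32(uint32_t v) {
    unsigned n = 0;
    if (v == 0)
        return 32;
    while ((v & 0x80000000u) == 0) {
        v <<= 1;
        n++;
    }
    return n;
}

/* y ^ z is 0 exactly when y == z, so ctlz32 yields 32 (binary 100000)
   only for equal inputs; shifting right by 5 leaves 1 or 0. */
static unsigned seteq_via_xor(uint32_t y, uint32_t z) {
    return ctlz32(y ^ z) >> 5;
}

/* The older form: y - z (mod 2^32) is also 0 exactly when y == z. */
static unsigned seteq_via_sub(uint32_t y, uint32_t z) {
    return ctlz32(y - z) >> 5;
}
```

Both helpers return 1 for equal inputs and 0 otherwise. Per the commit message, the xor form is preferred because it better exposes the operation to the bit-twiddling logic in the dag combiner.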


399 Commits