mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-27 22:12:47 +01:00
Commit Graph

438 Commits

Author SHA1 Message Date
Chris Lattner
51793c5b92 split out an encoder for memri operands, allowing a relocation to be plopped
into the immediate field.  This allows us to encode stuff like this:

        lbz r3, lo16(__ZL4init)(r4)     ; globalopt.cpp:5
                                        ; encoding: [0x88,0x64,A,A]
                                        ;   fixup A - offset: 0, value: lo16(__ZL4init), kind: fixup_ppc_lo16

        stw r3, lo16(__ZL1s)(r5)        ; globalopt.cpp:6
                                        ; encoding: [0x90,0x65,A,A]
                                        ;   fixup A - offset: 0, value: lo16(__ZL1s), kind: fixup_ppc_lo16

With this, we should have a completely functional MCCodeEmitter for PPC, wewt.

llvm-svn: 119134
2010-11-15 08:22:03 +00:00
Chris Lattner
96f8078924 add support for encoding the lo14 forms used for a few PPC64 addressing
modes.  For example, we now get:

	ld r3, lo16(_G)(r3)             ; encoding: [0xe8,0x63,A,0bAAAAAA00]
                                        ;   fixup A - offset: 0, value: lo16(_G), kind: fixup_ppc_lo14

llvm-svn: 119133
2010-11-15 08:02:41 +00:00
Chris Lattner
ac5f6ff408 implement the start of support for lo16 and ha16, allowing us to get stuff like:
lis r4, ha16(__ZL4init)         ; encoding: [0x3c,0x80,A,A]
                                        ;   fixup A - offset: 0, value: ha16(__ZL4init), kind: fixup_ppc_ha16

llvm-svn: 119127
2010-11-15 06:33:39 +00:00
Chris Lattner
8b6744a612 change direct branches to encode with the same encoding method
as direct calls.  Change conditional branches to encode with
their own method, simplifying the JIT encoder and making room
for adding an mc fixup.

llvm-svn: 119125
2010-11-15 06:09:35 +00:00
Chris Lattner
4795a876f6 eliminate a now-unneeded operand printer.
llvm-svn: 119124
2010-11-15 06:01:10 +00:00
Chris Lattner
37bebc344a split call operands out to their own encoding class, simplifying
code in the JIT.  Use this to form the first fixup for the PPC backend,
giving us stuff like this:

	bl L_foo$stub ; encoding: [0b010010AA,A,A,0bAAAAAA01]
                                        ;   fixup A - offset: 0, value: L_foo$stub, kind: fixup_ppc_br24

llvm-svn: 119123
2010-11-15 05:57:53 +00:00
Chris Lattner
6eb2a0b277 add proper encoding for MTCRF instead of using a hack.
llvm-svn: 119121
2010-11-15 05:19:25 +00:00
Chris Lattner
6f23a467b0 add basic encoding support for immediates and registers, allowing us
to encode all of these instructions correctly (for example):

        mflr r0                         ; encoding: [0x7c,0x08,0x02,0xa6]
        stw r0, 8(r1)                   ; encoding: [0x90,0x01,0x00,0x08]
        stwu r1, -64(r1)                ; encoding: [0x94,0x21,0xff,0xc0]

llvm-svn: 119118
2010-11-15 04:51:55 +00:00
Chris Lattner
8fa38e3222 remove asmstrings (which can never be printed) from pseudo
instructions, allowing us to eliminate some dead operand 
printing methods from the instprinter.

llvm-svn: 119113
2010-11-15 03:48:58 +00:00
Chris Lattner
3c8d9ea286 lower PPC::MFCRpseud when transforming to MC, avoiding calling
the aborting printSpecial() method.  This gets us to 8 failures.

llvm-svn: 119084
2010-11-14 22:03:15 +00:00
Jakob Stoklund Olesen
9521e574f8 Emit COPY instead of FMR/FMSD instructions for floating point conversion on
PowerPC.

llvm-svn: 108555
2010-07-16 21:03:52 +00:00
Dale Johannesen
78714b5dc9 The PPC MFCR instruction implicitly uses all 8 of the CR
registers.  Currently it is not so marked, which leads to
VCMPEQ instructions that feed into it getting deleted.
If it is so marked, local RA complains about this sequence:
 vreg = MCRF  CR0
 MFCR  <kill of whatever preg got assigned to vreg>
All current uses of this instruction are only interested in
one of the 8 CR registers, so redefine MFCR to be a normal
unary instruction with a CR input (which is emitted only as
a comment).  That avoids all problems.  7739628.

llvm-svn: 104238
2010-05-20 17:48:26 +00:00
Dan Gohman
dc05cdd475 Set isTerminator on TRAP instructions.
llvm-svn: 103778
2010-05-14 16:46:02 +00:00
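As an illustration of the flag this commit sets, here is a minimal TableGen sketch; the def name TRAP_SKETCH and its empty operand lists are invented for the example and are not the actual PPC definition.

// isTerminator tells the code generator that control does not fall
// through past this instruction within its basic block.
let isTerminator = 1 in
def TRAP_SKETCH : Instruction {
  let OutOperandList = (outs);
  let InOperandList  = (ins);
  let AsmString      = "trap";
}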
Dan Gohman
c0438974b2 Don't use isBarrier for the PowerPC sync instruction. isBarrier is for
control barriers, not memory ordering barriers.

llvm-svn: 103777
2010-05-14 16:42:16 +00:00
Chris Lattner
896b393fab set SDNPVariadic on nodes throughout the rest of the targets that
need them.

llvm-svn: 98937
2010-03-19 05:33:51 +00:00
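A hedged sketch of what setting this property looks like; the node name mycall, the MYISD namespace, and the type profile are assumptions, not code from any target.

// SDNPVariadic marks a node that may carry a variable number of extra
// operands (for example the register arguments of a call), so the
// matcher does not assume a fixed operand count.
def SDT_MyCall : SDTypeProfile<0, -1, [SDTCisPtrTy<0>]>;
def mycall     : SDNode<"MYISD::CALL", SDT_MyCall,
                        [SDNPHasChain, SDNPVariadic]>;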
Jakob Stoklund Olesen
7221654c33 Merge PPC instructions FMRS and FMRD into a single FMR instruction.
This is possible because F8RC is a subclass of F4RC. We keep FMRSD around so
fextend has a pattern.

Also allow folding of memory operands on FMRSD.

llvm-svn: 97275
2010-02-26 21:53:24 +00:00
Chris Lattner
03b5f3e853 remove a bunch of dead named arguments in input patterns;
though some look dubious, afaict these are all ok.

llvm-svn: 96899
2010-02-23 06:54:29 +00:00
Chris Lattner
ce7be2638a Eliminate some uses of immAllOnes, just use -1, it does
the same thing and is more efficient for the matcher.

llvm-svn: 96712
2010-02-21 03:12:16 +00:00
Jakob Stoklund Olesen
8e4cdf70e1 Don't specify CR sub-registers as implicit defs of BL instructions.
It is enough to give the super registers CR0, CR1, ..., and specifying the
sub-registers as well causes confusion in the liveness computations.

llvm-svn: 92778
2010-01-05 21:38:37 +00:00
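A hedged TableGen sketch of the shape this leaves the call definition in; BL_SKETCH, its operand list, and the exact clobber set are illustrative assumptions.

// Only the CR super-registers are listed as clobbers; their sub-registers
// (CR0LT, CR0GT, ...) are implied, and listing them as well confuses the
// liveness computation.
let isCall = 1, Defs = [LR, CR0, CR1, CR5, CR6, CR7] in
def BL_SKETCH : Instruction {
  let OutOperandList = (outs);
  let InOperandList  = (ins calltarget:$func);
  let AsmString      = "bl $func";
}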
Tilmann Scheller
29361c46ac Add support for calls through function pointers in the 64-bit PowerPC SVR4 ABI.
Patch contributed by Ken Werner of IBM!

llvm-svn: 91680
2009-12-18 13:00:15 +00:00
Dan Gohman
b5ec39e2dc Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used.
Note that "hasDotLocAndDotFile"-style debug info was already broken;
people wanting this functionality should implement it in the
AsmPrinter/DwarfWriter code.

llvm-svn: 89711
2009-11-23 23:20:51 +00:00
Bob Wilson
25738f9e79 Add PowerPC codegen for indirect branches.
llvm-svn: 86050
2009-11-04 21:31:18 +00:00
Dan Gohman
3393a4c997 Rename usesCustomDAGSchedInserter to usesCustomInserter, and update a
bunch of associated comments, because it doesn't have anything to do
with DAGs or scheduling. This is another step in decoupling MachineInstr
emitting from scheduling.

llvm-svn: 85517
2009-10-29 18:10:34 +00:00
Dan Gohman
0ac693a89e Improve MachineMemOperand handling.
- Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions.
   This eliminates MachineInstr's std::list member and allows the data to be
   created by isel and live for the remainder of codegen, avoiding a lot of
   copying and unnecessary translation. This also shrinks MemSDNode.
 - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated
   fields for MachineMemOperands.
 - Change MemSDNode to have a MachineMemOperand member instead of its own
   fields with the same information. This introduces some redundancy, but
   it's more consistent with what MachineInstr will eventually want.
 - Ignore alignment when searching for redundant loads for CSE, but remember
   the greatest alignment.

Target-specific code which previously used MemOperandSDNodes with generic
SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range
so that the SelectionDAG framework knows that MachineMemOperand information
is available.

llvm-svn: 82794
2009-09-25 20:36:54 +00:00
Dale Johannesen
7d68f8de7f Model the carry bit on ppc32. Without this we could
move a SUBFC (etc.) below the SUBFE (etc.) that consumed
the carry bit.  Add missing ADDIC8, noticed along the way.

llvm-svn: 82266
2009-09-18 20:15:22 +00:00
Tilmann Scheller
03f517b799 Add support for the PowerPC 64-bit SVR4 ABI.
The Link Register is volatile when using the 32-bit SVR4 ABI.
Make it possible to use the 64-bit SVR4 ABI.
Add non-volatile registers for the 64-bit SVR4 ABI.
Make sure r2 is a reserved register when using the 64-bit SVR4 ABI.
Update PPCFrameInfo for the 64-bit SVR4 ABI.
Add FIXME for 64-bit Darwin PPC.
Insert NOP instruction after direct function calls.
Emit official procedure descriptors.
Create TOC entries for GlobalAddress references.
Spill 64-bit non-volatile registers to the correct slots.
Only custom lower VAARG when using the 32-bit SVR4 ABI.
Use simple VASTART lowering for the 64-bit SVR4 ABI.

llvm-svn: 79091
2009-08-15 11:54:46 +00:00
Owen Anderson
48f2f0ae72 Split EVT into MVT and EVT, the former representing _just_ a primitive type, while
the latter is capable of representing either a primitive or an extended type.

llvm-svn: 78713
2009-08-11 20:47:22 +00:00
Owen Anderson
b4bce99769 Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type.
llvm-svn: 78610
2009-08-10 22:56:29 +00:00
Dan Gohman
5d566d918b Major calling convention code refactoring.
Instead of awkwardly encoding calling-convention information with ISD::CALL,
ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering
provides three virtual functions for targets to override:
LowerFormalArguments, LowerCall, and LowerRet, which replace the custom
lowering done on the special nodes. They provide the same information, but
in a more immediately usable format.

This also reworks much of the target-independent tail call logic. The
decision of whether or not to perform a tail call is now cleanly split
between target-independent portions, and the target dependent portion
in IsEligibleForTailCallOptimization.

This also synchronizes all in-tree targets, to help enable future
refactoring and feature work.

llvm-svn: 78142
2009-08-05 01:29:28 +00:00
Tilmann Scheller
8166687389 Refactor ABI code in the PowerPC backend.
Make CalculateParameterAndLinkageAreaSize() Darwin-specific.
Remove SVR4 specific code from LowerCALL_Darwin() and LowerFORMAL_ARGUMENTS_Darwin().
Rename MachoABI to DarwinABI for consistency.
Rename ELF ABI to SVR4 ABI for consistency.
Factor out common call return lowering between the Darwin and SVR4 ABI.
Factor out common call lowering between the Darwin and SVR4 ABI.

llvm-svn: 74766
2009-07-03 06:47:08 +00:00
Tilmann Scheller
37389b484a Implement the SVR4 ABI for PowerPC.
Implement LowerFORMAL_ARGUMENTS_SVR4().
Implement LowerCALL_SVR4().
Add support for split arguments.
Implement by value parameter passing for aggregates.
Add support for variable argument lists.
No longer create the spill area for argument registers of variable argument functions at a fixed offset.
Make sure callee saved registers are spilled to the correct stack offsets.
Change allocation order of non-volatile floating-point registers.
Add VRSAVE to the list of callee-saved registers, add CallConvLowering for vararg calls.
Add support for variable argument calls with Vector arguments.
Add support for VR and VRSAVE save area, improve allocation order for non-volatile vector registers.
Stop creating illegal i8 values in LowerVASTART().
Add memory access width hints.
Make sure to reserve space on the stack for the frame pointer.
When using the SVR4 ABI, reserve r13 for the Small Data Area pointer.
Assure that the frame pointer is spilled to the correct location on the stack.
Some FP registers were not marked as volatile.
Make sure the i64 words from a long double are passed either both in registers or both on the stack.
Only put integer arguments in registers which are not marked with the inreg flag.

llvm-svn: 74765
2009-07-03 06:45:56 +00:00
Dan Gohman
5dad0993a9 Rename isSimpleLoad to canFoldAsLoad, to better reflect its meaning.
llvm-svn: 60487
2008-12-03 18:15:48 +00:00
Dan Gohman
6333d48459 Add a sanity-check to tablegen to catch the case where isSimpleLoad
is set but mayLoad is not set. Fix all the problems this turned up.

Change code to not use isSimpleLoad instead of mayLoad unless it
really wants isSimpleLoad.

llvm-svn: 60459
2008-12-03 02:30:17 +00:00
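The invariant the new tablegen check enforces, as a hedged sketch; LWZ_SKETCH and its operands are assumptions for illustration.

// mayLoad says the instruction reads memory; isSimpleLoad (renamed
// canFoldAsLoad in the commit above) additionally promises it is safe
// to fold into a user.  isSimpleLoad without mayLoad is now rejected.
let mayLoad = 1, isSimpleLoad = 1 in
def LWZ_SKETCH : Instruction {
  let OutOperandList = (outs GPRC:$rD);
  let InOperandList  = (ins memri:$src);
  let AsmString      = "lwz $rD, $src";
}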
Dale Johannesen
ff738e8897 Add a RM pseudoreg for the rounding mode, which
allows ppcf128->int conversion to work with
DeadInstructionElimination.  This is now turned
off but RM is harmless.  It does not do a complete
job of modeling the rounding mode.

Revert marking MFCR as using all 7 CR subregisters;
while correct, this caused the problem in PR 2964,
plus the local RA crash noted in the comments.
This was needed to make DeadInstructionElimination,
but as we are not running that, it is backed out
for now.  Eventually it should go back in and the
other problems fixed where they're broken.

llvm-svn: 58391
2008-10-29 18:26:45 +00:00
Dale Johannesen
20f93b45e7 Mark MFCR as reading all condition code registers.
Prevents some more overzealous deletions (mostly
in AltiVec code).

llvm-svn: 58121
2008-10-24 22:08:01 +00:00
Dale Johannesen
b79ddda5bf Mark defs and uses of CTR and LR correctly.
Prevents DeadMachineInstructionElim from thinking
things like MTCTR are dead (fixes massive
testsuite breakage at -O0).

llvm-svn: 58043
2008-10-23 20:41:28 +00:00
Duncan Sands
1349af7df4 Fix warnings about mb/me being potentially used
uninitialized in these functions with gcc-4.3.

llvm-svn: 57635
2008-10-16 13:02:33 +00:00
Chris Lattner
7910d59d44 Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as
parameters instead of raw Constants.  This prevents the constants from
being selected by the isel pass, fixing PR2735.

llvm-svn: 57385
2008-10-11 22:08:30 +00:00
Dan Gohman
89660301e3 Rename ConstantSDNode::getValue to getZExtValue, for consistency
with ConstantInt. This led to fixing a bug in TargetLowering.cpp
using getValue instead of getAPIntValue.

llvm-svn: 56159
2008-09-12 16:56:44 +00:00
Dale Johannesen
e9a1266213 Implement partial-word binary atomics on ppc.
llvm-svn: 55478
2008-08-28 17:53:09 +00:00
Dale Johannesen
f201a3aaf3 Implement 32 & 64 bit versions of PPC atomic
binary primitives.

llvm-svn: 55343
2008-08-25 22:34:37 +00:00
Dale Johannesen
fbb408de74 Remove PPC-specific lowering for atomics; the
generic stuff works fine.

Mark rewritten cmp-and-swap as not using CR1.

llvm-svn: 55336
2008-08-25 21:09:52 +00:00
Dale Johannesen
95a40e3045 Implement __sync_synchronize on ppc32. Patch by Gary Benson.
llvm-svn: 55186
2008-08-22 17:20:54 +00:00
Dale Johannesen
1ac64c3718 Rewrite ppc code generated for __sync_{bool|val}_compare_and_swap
so that lwarx and stwcx are always executed the same number of times.
This is important for performance, I'm told.

llvm-svn: 55163
2008-08-22 03:49:10 +00:00
Nate Begeman
9be47adde4 Implement ISD::TRAP support on PPC
llvm-svn: 54644
2008-08-11 17:36:31 +00:00
Evan Cheng
c69b53dff9 Implement llvm.atomic.cmp.swap.i32 on PPC. Patch by Gary Benson!
llvm-svn: 53505
2008-07-12 02:23:19 +00:00
Anton Korobeynikov
eb63554d81 Provide correct encoding for PPC LWARX instructions.
Patch by Gary Benson!

llvm-svn: 52828
2008-06-27 16:10:20 +00:00
Arnold Schwaighofer
f58a35e2ec Tail call optimization improvements:
Move platform independent code (lowering of possibly overwritten
arguments, check for tail call optimization eligibility) from
target X86ISelectionLowering.cpp to TargetLowering.h and
SelectionDAGISel.cpp.

Initial PowerPC tail call implementation:

Support ppc32 implemented and tested (passes my tests and
test-suite llvm-test).  
Support ppc64 implemented and half tested (passes my tests).
On ppc tail call optimization is performed if 
  caller and callee are fastcc
  call is a tail call (in tail call position, call followed by ret)
  no variable argument lists or byval arguments
  option -tailcallopt is enabled
Supported:
 * non pic tail calls on linux/darwin
 * module-local tail calls on linux(PIC/GOT)/darwin(PIC)
 * inter-module tail calls on darwin(PIC)
If constraints are not met a normal call will be emitted.

A test checking the argument lowering behaviour on x86-64 was added.

llvm-svn: 50477
2008-04-30 09:16:33 +00:00
Evan Cheng
f583b3feb6 64-bit atomic operations.
llvm-svn: 49949
2008-04-19 02:30:38 +00:00
Evan Cheng
09e77f6b83 PPC32 atomic operations.
llvm-svn: 49947
2008-04-19 01:30:48 +00:00
Evan Cheng
11d2c09adc Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF.
llvm-svn: 48380
2008-03-15 00:03:38 +00:00
Nicolas Geoffray
e3381f0f1f Add description of individual bits in CR. This fix PR1765.
llvm-svn: 48143
2008-03-10 14:12:10 +00:00
Chris Lattner
08ec4919ea Add support for ppc64 shifts with 7-bit (oversized) shift amount (e.g. PPCshl).
llvm-svn: 48027
2008-03-07 20:18:24 +00:00
Chris Lattner
2f13ccc181 Replace SDT_PPCShiftOp in favor of SDTIntBinOps. This allows it to work
with 32 or 64-bit operands/results.

llvm-svn: 48026
2008-03-07 20:13:51 +00:00
Bill Wendling
8d64999daf This is the initial check-in for adding register scavenging to PPC. (Currently,
PPC-64 doesn't work.) This also lowers the spilling of the CR registers so that
it uses a register other than the default R0 register (the scavenger scrounges
for one). A significant part of this patch fixes how kill information is
handled.

llvm-svn: 47863
2008-03-03 22:19:16 +00:00
Bill Wendling
2cae66e28b Final de-tabification.
llvm-svn: 47663
2008-02-27 06:33:05 +00:00
Nate Begeman
1867c6c264 Make register scavenging happy by not using a reg (CR0) that isn't defined
llvm-svn: 47045
2008-02-13 02:58:33 +00:00
Chris Lattner
6846e346a8 rename SDTRet -> SDTNone.
Move definition of 'trap' sdnode up from x86 instrinfo to targetselectiondag.td.

llvm-svn: 46017
2008-01-15 22:02:54 +00:00
Chris Lattner
6ad01a9965 remove explicit sets of 'neverHasSideEffects' that can now be
inferred from the instr patterns.

llvm-svn: 45824
2008-01-10 05:45:39 +00:00
Chris Lattner
9b4f2b2316 get def use info more correct.
llvm-svn: 45821
2008-01-10 05:12:37 +00:00
Chris Lattner
14310afe42 rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate.
llvm-svn: 45667
2008-01-06 23:38:27 +00:00
Chris Lattner
5489888580 rename isStore -> mayStore to more accurately reflect what it captures.
llvm-svn: 45656
2008-01-06 08:36:04 +00:00
Chris Lattner
8b4b75c771 Change the 'isStore' inferrer to look for 'SDNPMayStore'
instead of "ISD::STORE".  This allows us to mark target-specific dag
nodes as storing (such as ppc byteswap stores).  This allows us to remove
more explicit isStore flags from the .td files.

Finally, add a warning for when a .td file contains an explicit 
isStore and tblgen is able to infer it.

llvm-svn: 45654
2008-01-06 06:44:58 +00:00
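A hedged sketch of marking a target-specific store node so the inference kicks in; the mystbrx node, its MYISD opcode, and the type profile are assumptions.

// With SDNPMayStore on the node, tblgen can infer mayStore for any
// instruction whose pattern uses it, so no explicit isStore flag is
// needed in the instruction definition.
def SDT_MyStoreSwap : SDTypeProfile<0, 2, [SDTCisInt<0>, SDTCisPtrTy<1>]>;
def mystbrx : SDNode<"MYISD::STBRX", SDT_MyStoreSwap,
                     [SDNPHasChain, SDNPMayStore]>;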
Chris Lattner
9f8735181f remove some isStore flags that are now inferred automatically.
llvm-svn: 45652
2008-01-06 05:53:26 +00:00
Chris Lattner
ad9a6ccb83 Remove attribution from file headers, per discussion on llvmdev.
llvm-svn: 45418
2007-12-29 20:36:04 +00:00
Bill Wendling
ce9eae6687 Mark the "isRemat" instruction as never having side effects.
llvm-svn: 45190
2007-12-19 06:07:48 +00:00
Evan Cheng
64a1febf9a Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled.
llvm-svn: 44960
2007-12-12 23:12:09 +00:00
Bill Wendling
c08dedb060 Initial commit of the machine code LICM pass. It successfully hoists this:
_foo:
        li r2, 0
LBB1_1: ; bb
        li r5, 0
        stw r5, 0(r3)
        addi r2, r2, 1
        addi r3, r3, 4
        cmplw cr0, r2, r4
        bne cr0, LBB1_1 ; bb
LBB1_2: ; return
        blr 

to:

_foo:
        li r2, 0
        li r5, 0
LBB1_1: ; bb
        stw r5, 0(r3)
        addi r2, r2, 1
        addi r3, r3, 4
        cmplw cr0, r2, r4
        bne cr0, LBB1_1 ; bb
LBB1_2: ; return
        blr

ZOMG!! :-)

Moar to come...

llvm-svn: 44687
2007-12-07 21:42:31 +00:00
Bill Wendling
934fcd87e7 Unify the CALLSEQ_{START,END} stuff.
llvm-svn: 44045
2007-11-13 09:19:02 +00:00
Bill Wendling
cc75435ebf Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack
adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in
the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If
not, then there is the potential for the stack to be changed while the stack's
being used by another instruction (like a call).

This can only result in tears...

llvm-svn: 44037
2007-11-13 00:44:25 +00:00
Owen Anderson
aba398a5ce Add a flag for indirect branch instructions.
Target maintainers: please check that the instructions for your target are correctly marked.

llvm-svn: 44012
2007-11-12 07:39:39 +00:00
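A hedged sketch of how a target might mark a register-indirect branch with the new flag; BCTR_SKETCH is illustrative, not PPC's actual definition.

// isIndirectBranch distinguishes branches whose target comes from a
// register, which matters for passes that try to follow control flow.
let isBranch = 1, isIndirectBranch = 1, isTerminator = 1, isBarrier = 1 in
def BCTR_SKETCH : Instruction {
  let OutOperandList = (outs);
  let InOperandList  = (ins);
  let AsmString      = "bctr";
  let Uses           = [CTR];
}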
Evan Cheng
0590c75f18 Temporary solution: added a different set of BCTRL_Macho / BCTRL_ELF with right callee-saved defs set for ppc64.
llvm-svn: 43248
2007-10-23 06:42:42 +00:00
Dale Johannesen
76458ddf1e Next PPC long double bits: ppcf128->i32 conversion.
Surprisingly complicated.
Adds getTargetNode for 2 outputs, no inputs (missing).

llvm-svn: 42822
2007-10-10 01:01:31 +00:00
Evan Cheng
b43255bc68 Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead.
llvm-svn: 41863
2007-09-11 19:55:27 +00:00
Evan Cheng
b050c17b31 Some out operands were incorrectly specified as input operands.
llvm-svn: 40697
2007-08-01 23:07:38 +00:00
Evan Cheng
53cb03b583 No more noResults.
llvm-svn: 40132
2007-07-21 00:34:19 +00:00
Evan Cheng
f8d66a1eec Oops. These stores actually produce results.
llvm-svn: 40074
2007-07-20 00:20:46 +00:00
Evan Cheng
8312ed6f77 Change instruction description to split OperandList into OutOperandList and
InOperandList. This gives one piece of important information: # of results
produced by an instruction.
An example of the change:
def ADD32rr  : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2),
                 "add{l} {$src2, $dst|$dst, $src2}",
                 [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>;
=>
def ADD32rr  : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2),
                 "add{l} {$src2, $dst|$dst, $src2}",
                 [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>;

llvm-svn: 40033
2007-07-19 01:14:50 +00:00
Evan Cheng
3b1b3eba6a Do away with ImmutablePredicateOperand.
llvm-svn: 37961
2007-07-06 23:22:46 +00:00
Evan Cheng
9b7432c311 PPC conditional branch predicate does not change after isel.
llvm-svn: 37893
2007-07-05 07:09:50 +00:00
Evan Cheng
4dd52e052f PredicateOperand can be used as a normal operand for isel.
llvm-svn: 36947
2007-05-08 21:06:08 +00:00
Nicolas Geoffray
b7c0895529 The ELF ABI specifies F1-F8 registers as argument registers for double, not
F1-F10. This affects only ELF, not MachO.

llvm-svn: 35622
2007-04-03 10:27:07 +00:00
Nicolas Geoffray
a562e5c1c5 Differentiate between the MachO and the ELF ABI the CALL instruction.
llvm-svn: 34667
2007-02-27 13:01:19 +00:00
Chris Lattner
d4cd3a31e6 always lower to RETFLAG, never leave it as just ret.
llvm-svn: 34639
2007-02-26 19:44:02 +00:00
Chris Lattner
b5ce97a83a one important bugfix: PPC32 didn't have both elf and macho support for
external symbols and global addresses.  Add the missing ones.

one important workaround: PPCISD::CALL is matched by both PPCcall_ELF
and PPCcall_Macho, disable the _ELF patterns for now.

llvm-svn: 34601
2007-02-25 19:20:53 +00:00
Chris Lattner
041fb5bc67 implement support for the linux/ppc function call ABI. Patch by
Nicolas Geoffray!

llvm-svn: 34574
2007-02-25 05:34:32 +00:00
Jim Laskey
23ed7d2625 Make LABEL a builtin opcode.
llvm-svn: 33537
2007-01-26 14:34:52 +00:00
Chris Lattner
f50d87eb50 Rewrite the branch selector to be correct in the face of large functions.
The algorithm it used before wasn't 100% correct, we now use an iterative
expansion model.  This fixes assembler errors when compiling 403.gcc with
tail merging enabled.

Change the way the branch selector works overall: Now, the isel generates
PPC::BCC instructions (as it used to) directly, and these BCC instructions
are emitted to the output or jitted directly if branches don't need
expansion.  Only if branches need expansion are instructions rewritten
and created.  This should make branch select faster, and eliminates the
Bxx instructions from the .td file.

llvm-svn: 31837
2006-11-18 00:32:03 +00:00
Chris Lattner
a5439b7913 add encoding for BCC, after finally wrestling strange ppc/tblgen endianness
issues to the ground.

llvm-svn: 31836
2006-11-17 23:53:28 +00:00
Chris Lattner
0d88b19f2f convert PPC::BCC to use the 'pred' operand instead of separate predicate
value and CR reg #.  This requires swapping the order of these everywhere
that touches BCC and requires us to write custom matching logic for
PPCcondbranch :(

llvm-svn: 31835
2006-11-17 22:37:34 +00:00
Chris Lattner
73329ae80d rename PPC::COND_BRANCH to PPC::BCC
llvm-svn: 31834
2006-11-17 22:14:47 +00:00
Chris Lattner
1527483a15 start using PPC predicates more consistently.
llvm-svn: 31833
2006-11-17 22:10:59 +00:00
Jim Laskey
8aac7dc0ee This is a general cleanup of the PowerPC ABI. It addresses several problems and
bugs, including making sure that the TOS links back to the previous frame,
not counting the maximum call frame size twice when using frame pointers,
no longer growing the frame on calls, eliminating the double store of SP,
and a cleaner/faster dynamic alloca.

llvm-svn: 31792
2006-11-16 22:43:37 +00:00
Chris Lattner
283e7306c1 fix broken encoding
llvm-svn: 31778
2006-11-16 01:01:28 +00:00
Chris Lattner
4edb6f09fe add patterns for ppc32 preinc stores. ppc64 next.
llvm-svn: 31775
2006-11-16 00:41:37 +00:00
Chris Lattner
c4b9cff1f9 switch these back to the 'bad old way'
llvm-svn: 31774
2006-11-16 00:33:34 +00:00
Chris Lattner
bd95b9d4ae Stop using isTwoAddress, switching to operand constraints instead.
Tell the codegen emitter that specific operands are not to be encoded, fixing
JIT regressions w.r.t. pre-inc loads and stores (e.g. lwzu, which we generate
even when general preinc loads are not enabled).

llvm-svn: 31770
2006-11-15 23:24:18 +00:00
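A hedged sketch of the constraint-based form of a pre-increment load; LWZU_SKETCH, its operand names, and the immediate class are assumptions rather than the real PPC definition.

// The updated base address is an explicit second result, and the
// constraint string ties it to the incoming base register, replacing
// the old isTwoAddress flag.
def LWZU_SKETCH : Instruction {
  let OutOperandList = (outs GPRC:$rD, GPRC:$ea_result);
  let InOperandList  = (ins s16imm:$disp, GPRC:$rA);
  let AsmString      = "lwzu $rD, $disp($rA)";
  let Constraints    = "$rA = $ea_result";
}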
Chris Lattner
9bc55a6c38 fix ldu/stu jit encoding. Swith 64-bit preinc load instrs to use memri
addrmodes.

llvm-svn: 31757
2006-11-15 19:55:13 +00:00
Chris Lattner
6d5a509e34 Switch loads over to use memri as the operand instead of a reg/imm operand
pair for cleanliness.  Add instructions for PPC32 preinc-stores with commented
out patterns.  More improvement is needed to enable the patterns, but we're
getting close.

llvm-svn: 31749
2006-11-15 02:43:19 +00:00
Chris Lattner
55c68f61a7 group load and store instructions together. No functionality change.
llvm-svn: 31736
2006-11-14 19:19:53 +00:00
Chris Lattner
dc48b6a77c Rework PPC64 calls. Now we have a LR8/CTR8 register which the PPC64 calls
clobber.  This allows LR8 to be save/restored correctly as a 64-bit quantity,
instead of handling it as a 32-bit quantity.  This unbreaks ppc64 codegen when
the code is actually located above the 4G boundary.

llvm-svn: 31734
2006-11-14 18:44:47 +00:00
Chris Lattner
3d48461071 Mark operands as symbol lo instead of imm32 so that they print lo(x) around
globals.

llvm-svn: 31672
2006-11-11 04:51:36 +00:00
Chris Lattner
5e975945a5 dform 8/9 are identical to dform 1
llvm-svn: 31637
2006-11-10 17:51:02 +00:00
Chris Lattner
1604b6a873 add an initial cut at preinc loads for ppc32. This is broken for ppc64
(because the 64-bit reg target versions aren't implemented yet), doesn't
support r+r addr modes, and doesn't handle stores, but it works otherwise. :)

This is disabled unless -enable-ppc-preinc is passed to llc for now.

llvm-svn: 31621
2006-11-10 02:08:47 +00:00
Chris Lattner
35fb10e1a4 correct the (currently unused) pattern for lwzu.
llvm-svn: 31535
2006-11-08 02:13:12 +00:00
Chris Lattner
a193c9c977 encode BLR predicate info for the JIT
llvm-svn: 31450
2006-11-04 05:42:48 +00:00
Chris Lattner
a7687f805c Go through all kinds of trouble to mark 'blr' as having a predicate operand
that takes a register and condition code.  Print these pieces of BLR the
right way, even though it is currently set to 'always'.

Next up: get the JIT encoding right, then enhance branch folding to produce
predicated blr for simple examples.

llvm-svn: 31449
2006-11-04 05:27:39 +00:00
Chris Lattner
db04ba8502 Describe PPC predicates, which are a pair of CR# and condition.
llvm-svn: 31438
2006-11-03 23:53:25 +00:00
Chris Lattner
5f953f0927 remove dead vars
llvm-svn: 31433
2006-11-03 23:46:45 +00:00
Chris Lattner
e38c95d2a3 Add intrinsics for the rest of the DCB* instructions.
llvm-svn: 31148
2006-10-24 01:08:42 +00:00
Evan Cheng
fe5bb5dbe6 Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode.
llvm-svn: 30945
2006-10-13 21:14:26 +00:00
Chris Lattner
14f18d4896 set isBarrier correctly
llvm-svn: 30936
2006-10-13 19:10:34 +00:00
Chris Lattner
e4e8893807 mark adjcallstack up/down as clobbering and using the SP
llvm-svn: 30908
2006-10-12 17:56:34 +00:00
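A hedged sketch of what this marking looks like for one of the call-sequence pseudos; the def name and operand type are assumptions.

// The call-sequence marker both adjusts and depends on the stack
// pointer, so R1 appears explicitly in Defs and Uses.
let Defs = [R1], Uses = [R1] in
def ADJCALLSTACKDOWN_SK : Instruction {
  let OutOperandList = (outs);
  let InOperandList  = (ins u16imm:$amt);
  let AsmString      = "";      // pseudo marker, never printed
}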
Evan Cheng
ca66f49574 Add properties to ComplexPattern.
llvm-svn: 30891
2006-10-11 21:03:53 +00:00
Evan Cheng
d22f3dd3ed Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes.
llvm-svn: 30844
2006-10-09 20:57:25 +00:00
Chris Lattner
26213b40aa Use abstract private/comment directives, to increase portability to ppc/linux
llvm-svn: 30621
2006-09-27 02:55:21 +00:00
Nate Begeman
7bcce1a7f6 Fold AND and ROTL more often
llvm-svn: 30577
2006-09-22 05:01:56 +00:00
Evan Cheng
34a49551f5 CALLSEQ_* produces chain even if that's not needed.
llvm-svn: 29603
2006-08-11 09:03:33 +00:00
Chris Lattner
0f4e4b1bcb bswapped load/store instructions are only available in indexed addressing form.
As such, use xoaddr (indexed only), not xaddr for address selection.

This fixes CodeGen/PowerPC/2006-07-19-stwbrx-crash.ll, a crash compiling lencod.

llvm-svn: 29208
2006-07-19 17:15:36 +00:00
Chris Lattner
5985b77fae Make the implicit def instructions look like other instrs.
llvm-svn: 29174
2006-07-18 16:33:26 +00:00
Chris Lattner
abaaddc214 Implement Regression/CodeGen/PowerPC/bswap-load-store.ll by folding bswaps
into i16/i32 load/stores.

llvm-svn: 29089
2006-07-10 20:56:58 +00:00
Chris Lattner
da08df5d8a Add 64-bit MTCTR so that indirect calls work.
llvm-svn: 28931
2006-06-27 18:36:44 +00:00
Chris Lattner
26f2bd4d4b Implement 64-bit undef, sub, shl/shr, srem/urem
llvm-svn: 28929
2006-06-27 18:18:41 +00:00
Chris Lattner
494f476ca7 Implement a bunch of 64-bit cleanliness work. With this, treeadd builds (but
doesn't work right).

llvm-svn: 28921
2006-06-27 00:04:13 +00:00
Chris Lattner
5d0654b832 Remove two more definitions
llvm-svn: 28918
2006-06-26 22:47:37 +00:00
Chris Lattner
209c2db6b9 remove two unused instructions.
llvm-svn: 28917
2006-06-26 22:44:13 +00:00
Chris Lattner
10d22c274e Make these predicates correct in 64-bit mode too.
llvm-svn: 28890
2006-06-20 23:21:20 +00:00
Chris Lattner
75e6449a0f Rename OR4 -> OR. Move some PPC64-specific stuff to the 64-bit file
llvm-svn: 28889
2006-06-20 23:18:58 +00:00
Chris Lattner
2e1d3158f1 remove unused flag
llvm-svn: 28888
2006-06-20 23:15:07 +00:00
Chris Lattner
19df1fcd72 remove some unused patterns
llvm-svn: 28886
2006-06-20 23:11:36 +00:00
Chris Lattner
eede1e2c00 Add some 64-bit logical ops.
Split imm16Shifted into a sext/zext form for 64-bit support.
Add some patterns for immediate formation.  For example, we now compile this:

static unsigned long long Y;
void test3() {
  Y = 0xF0F00F00;
}

into:

_test3:
        li r2, 3840
        lis r3, ha16(_Y)
        xoris r2, r2, 61680
        std r2, lo16(_Y)(r3)
        blr

GCC produces:

_test3:
        li r0,0
        lis r2,ha16(_Y)
        ori r0,r0,61680
        sldi r0,r0,16
        ori r0,r0,3840
        std r0,lo16(_Y)(r2)
        blr

llvm-svn: 28883
2006-06-20 22:34:10 +00:00
Chris Lattner
4ff5f3d852 64-bit bugfix: 0xFFFF0000 cannot be formed with a single lis.
llvm-svn: 28880
2006-06-20 21:39:30 +00:00
Chris Lattner
3ae4156dd7 Remove some now-unneeded casts from instruction patterns. With the casts
removed, tblgen produces identical output to with them in.

llvm-svn: 28867
2006-06-20 00:39:56 +00:00
Chris Lattner
163da7cdcb In 64-bit mode, addr mode operands use G8RC instead of GPRC.
llvm-svn: 28840
2006-06-16 21:29:03 +00:00
Chris Lattner
81845946ff fix some assumptions that pointers can only be 32-bits. With this, we can
now compile:

static unsigned long X;
void test1() {
  X = 0;
}

into:

_test1:
        lis r2, ha16(_X)
        li r3, 0
        stw r3, lo16(_X)(r2)
        blr

Totally amazing :)

llvm-svn: 28839
2006-06-16 21:01:35 +00:00
Chris Lattner
cb294464e7 Split 64-bit instructions out into a separate .td file
llvm-svn: 28838
2006-06-16 20:22:01 +00:00
Chris Lattner
b231c3d11c Fix a problem exposed by the local allocator. CALL instructions are not marked
as using incoming argument registers, so the local allocator would clobber them
between their set and use.  To fix this, we give the call instructions a variable
number of uses in the CALL MachineInstr itself, so live variables understands
the live ranges of these register arguments.

llvm-svn: 28744
2006-06-10 01:14:28 +00:00
Chris Lattner
bfbee64ecf Add PowerPC intrinsics to support dcbz[l]
llvm-svn: 28696
2006-06-06 21:29:23 +00:00
Chris Lattner
2208c3214c Make PPC call lowering more aggressive, making the isel matching code simple
enough to be autogenerated.

llvm-svn: 28354
2006-05-17 19:00:46 +00:00
Chris Lattner
03c70b7f27 Switch PPC over to a call-selection model where the lowering code creates
the copyto/fromregs instead of making the PPCISD::CALL selection code create
them.  This vastly simplifies the selection code, and moves the ABI handling
parts into one place.

llvm-svn: 28346
2006-05-17 06:01:33 +00:00
Nate Begeman
7ed816f900 JumpTable support! What this represents is working asm and jit support for
x86 and ppc for 100% dense switch statements when relocations are non-PIC.
This support will be extended and enhanced in the coming days to support
PIC, and less dense forms of jump tables.

llvm-svn: 27947
2006-04-22 18:53:45 +00:00
Chris Lattner
f58f727be6 These are correctly encoded by the JIT. I checked :)
llvm-svn: 27810
2006-04-18 19:03:38 +00:00
Chris Lattner
44ea12c5f8 Implement an important entry from README_ALTIVEC:
If an altivec predicate compare is used immediately by a branch, don't
use a (serializing) MFCR instruction to read the CR6 register, which requires
a compare to get it back to CR's.  Instead, just branch on CR6 directly. :)

For example, for:
void foo2(vector float *A, vector float *B) {
  if (!vec_any_eq(*A, *B))
    *B = (vector float){0,0,0,0};
}

We now generate:

_foo2:
        mfspr r2, 256
        oris r5, r2, 12288
        mtspr 256, r5
        lvx v2, 0, r4
        lvx v3, 0, r3
        vcmpeqfp. v2, v3, v2
        bne cr6, LBB1_2 ; UnifiedReturnBlock
LBB1_1: ; cond_true
        vxor v2, v2, v2
        stvx v2, 0, r4
        mtspr 256, r2
        blr
LBB1_2: ; UnifiedReturnBlock
        mtspr 256, r2
        blr

instead of:

_foo2:
        mfspr r2, 256
        oris r5, r2, 12288
        mtspr 256, r5
        lvx v2, 0, r4
        lvx v3, 0, r3
        vcmpeqfp. v2, v3, v2
        mfcr r3, 2
        rlwinm r3, r3, 27, 31, 31
        cmpwi cr0, r3, 0
        beq cr0, LBB1_2 ; UnifiedReturnBlock
LBB1_1: ; cond_true
        vxor v2, v2, v2
        stvx v2, 0, r4
        mtspr 256, r2
        blr
LBB1_2: ; UnifiedReturnBlock
        mtspr 256, r2
        blr

This implements CodeGen/PowerPC/vec_br_cmp.ll.

llvm-svn: 27804
2006-04-18 17:59:36 +00:00
Chris Lattner
2ffa288a23 Add VRRC select support
llvm-svn: 27543
2006-04-08 22:45:08 +00:00
Chris Lattner
e330741a6c Lower vector compares to VCMP nodes, just like we lower vector comparison
predicates to VCMPo nodes.

llvm-svn: 27285
2006-03-31 05:13:27 +00:00
Chris Lattner
ac98e20cc9 Use normal lvx for scalar_to_vector instead of lve*x. They do the exact
same thing and we have a dag node for the former.

llvm-svn: 27205
2006-03-28 01:43:22 +00:00
Chris Lattner
65a455b060 Codegen vector predicate compares.
llvm-svn: 27151
2006-03-26 10:06:40 +00:00
Chris Lattner
cb5f9269a9 Move all Altivec stuff out into a new PPCInstrAltivec.td file.
Add a bunch of patterns for different datatypes, e.g. bit_convert, undef and
zero vector support.

llvm-svn: 27117
2006-03-25 07:51:43 +00:00
Chris Lattner
57064915a6 Add some basic patterns for other datatypes
llvm-svn: 27116
2006-03-25 07:39:07 +00:00
Chris Lattner
2fa3a6c436 Add support for __builtin_altivec_vnmsubfp /vmaddfp
llvm-svn: 27112
2006-03-25 07:05:55 +00:00
Chris Lattner
0899b16b2d Codegen things like:
<int -1, int -1, int -1, int -1>
and
 <int 65537, int 65537, int 65537, int 65537>

Using things like:
  vspltisb v0, -1
and:
  vspltish v0, 1

instead of using constant pool loads.

This implements CodeGen/PowerPC/vec_splat.ll:splat_imm_i{32|16}.

llvm-svn: 27106
2006-03-25 06:12:06 +00:00
Chris Lattner
21abff3712 Fix a bad JIT encoding of VPERM. Why is VPERM D,A,B,C but vfmadd is D,A,C,B ??
llvm-svn: 27069
2006-03-24 18:24:43 +00:00
Chris Lattner
ba4966c16c add support for using vxor to build zero vectors. This implements
Regression/CodeGen/PowerPC/vec_zero.ll

llvm-svn: 27059
2006-03-24 07:48:08 +00:00
Chris Lattner
ace2d0d227 Gabor points out that we can't spell. :)
llvm-svn: 27049
2006-03-24 07:12:19 +00:00
Chris Lattner
974982c89c Add PPC vector bit-convert support
llvm-svn: 26995
2006-03-23 19:54:27 +00:00
Chris Lattner
f84f3bf95b When possible, custom lower 32-bit SINT_TO_FP to this:
_foo2:
        extsw r2, r3
        std r2, -8(r1)
        lfd f0, -8(r1)
        fcfid f0, f0
        frsp f1, f0
        blr

instead of this:

_foo2:
        lis r2, ha16(LCPI2_0)
        lis r4, 17200
        xoris r3, r3, 32768
        stw r3, -4(r1)
        stw r4, -8(r1)
        lfs f0, lo16(LCPI2_0)(r2)
        lfd f1, -8(r1)
        fsub f0, f1, f0
        frsp f1, f0
        blr

This speeds up Misc/pi from 2.44s->2.09s with LLC and from 3.01->2.18s
with llcbeta (16.7% and 38.1% respectively).

llvm-svn: 26943
2006-03-22 05:30:33 +00:00
Chris Lattner
2e606dc60f Fix the JIT encoding of the VAForm_1 instructions, including vmaddfp
llvm-svn: 26935
2006-03-22 01:44:36 +00:00
Chris Lattner
acb2506622 When codegen'ing vector MUL using VFMADD, *add* the 0, don't *mul* the 0.
llvm-svn: 26913
2006-03-21 00:51:38 +00:00
Chris Lattner
978628896b Fix a couple of bugs in permute/splat generate, thanks to Nate for actually
figuring these out! :)

llvm-svn: 26904
2006-03-20 18:26:51 +00:00
Chris Lattner
fb0e160aa5 Fix the pattern for VADDUWM, add i32 splat
llvm-svn: 26901
2006-03-20 17:51:58 +00:00
Evan Cheng
57da1afbc8 Use tblgen'd VECTOR_SHUFFLE selection code.
llvm-svn: 26900
2006-03-20 08:14:16 +00:00
Chris Lattner
dc3605efdb Add support for generating vspltw, instead of a vperm instruction with a
constant pool load.  This generates significantly nicer code for splats.

When tblgen gets bugfixed, we can remove the custom selection code.

llvm-svn: 26898
2006-03-20 06:51:10 +00:00
Chris Lattner
1cdeda1c5a Check in some intermediate code that adds a skeleton for matching vsplt*
instructions

llvm-svn: 26894
2006-03-20 06:15:45 +00:00
Chris Lattner
c230af9810 fix typo
llvm-svn: 26889
2006-03-20 05:05:55 +00:00
Chris Lattner
bea056ecf2 add vsplat instructions, fix sched description for vperm
llvm-svn: 26888
2006-03-20 04:47:33 +00:00
Chris Lattner
0e56cf0d94 Custom lower arbitrary VECTOR_SHUFFLE's to VPERM.
TODO: leave specific ones as VECTOR_SHUFFLE's and turn them into specialized
operations like vsplt*

llvm-svn: 26887
2006-03-20 01:53:53 +00:00
Chris Lattner
6f502da274 add the vperm instruction
llvm-svn: 26883
2006-03-20 01:00:56 +00:00
Chris Lattner
789570bafb Custom lower SCALAR_TO_VECTOR into lve*x.
llvm-svn: 26868
2006-03-19 06:55:52 +00:00
Chris Lattner
89bc332152 add support for vector undef
llvm-svn: 26863
2006-03-19 06:10:09 +00:00
Chris Lattner
a9b4a2ab99 minor fixes
llvm-svn: 26857
2006-03-19 05:43:01 +00:00
Chris Lattner
b46a4c28ad we don't use lmw/stmw. When we want them they are easy enough to add
llvm-svn: 26853
2006-03-19 04:33:37 +00:00
Nate Begeman
793c8136ae Fix subfic to match subc by default instead of sub so that it is correctly
cost-modeled as producing a flag.  This fixes the test I just added for neg

llvm-svn: 26835
2006-03-17 22:41:37 +00:00
Nate Begeman
42736d46b2 Remove BRTWOWAY*
Make the PPC backend not dependent on BRTWOWAY_CC and make the branch
selector smarter about the code it generates, fixing a case in the
readme.

llvm-svn: 26814
2006-03-17 01:40:33 +00:00
Chris Lattner
f2008cb73b Strangely, calls clobber call-clobbered vector regs. Whodathoughtit?
llvm-svn: 26808
2006-03-16 22:35:59 +00:00
Chris Lattner
bf153651b1 Add support for copying registers. still needed: spilling and reloading them
llvm-svn: 26800
2006-03-16 20:03:58 +00:00
Nate Begeman
e371cb595a Update scheduling info for vrsave instruction
llvm-svn: 26776
2006-03-15 05:25:05 +00:00
Chris Lattner
d0505331d2 For functions that use vector registers, save VRSAVE, mark used
registers, and update it on entry to each function, then restore it on exit.

This compiles:

void func(vfloat *a, vfloat *b, vfloat *c) {
        *a = *b * *c + *c;
}

to this:

_func:
        mfspr r2, 256
        oris r6, r2, 49152
        mtspr 256, r6
        lvx v0, 0, r5
        lvx v1, 0, r4
        vmaddfp v0, v1, v0, v0
        stvx v0, 0, r3
        mtspr 256, r2
        blr

GCC produces this (which has additional stack accesses):

_func:
        mfspr r0,256
        stw r0,-4(r1)
        oris r0,r0,0xc000
        mtspr 256,r0
        lvx v0,0,r5
        lvx v1,0,r4
        lwz r12,-4(r1)
        vmaddfp v0,v0,v1,v0
        stvx v0,0,r3
        mtspr 256,r12
        blr

llvm-svn: 26733
2006-03-13 21:52:10 +00:00
Chris Lattner
ba10d4e4ab Mark instructions that are cracked by the PPC970 decoder as such.
llvm-svn: 26720
2006-03-13 05:15:10 +00:00
Chris Lattner
a278639f29 Several big changes:
1. Use flags on the instructions in the .td file to indicate the PPC970 unit
   type instead of a table in the .cpp file.  Much cleaner.
2. Change the hazard recognizer to build d-groups according to the actual
   algorithm used, not my flawed understanding of it.
3. Model "must be in the first slot" and "must be the only instr in a group"
   accurately.

llvm-svn: 26719
2006-03-12 09:13:49 +00:00
Chris Lattner
af44ead7f3 implement TII::insertNoop
llvm-svn: 26562
2006-03-05 23:49:55 +00:00
Chris Lattner
137c02aa60 Compile this:
void foo(float a, int *b) { *b = a; }

to this:

_foo:
        fctiwz f0, f1
        stfiwx f0, 0, r4
        blr

instead of this:

_foo:
        fctiwz f0, f1
        stfd f0, -8(r1)
        lwz r2, -4(r1)
        stw r2, 0(r4)
        blr

This implements CodeGen/PowerPC/stfiwx.ll, and also incidentally does the
right thing for GCC bugzilla 26505.

llvm-svn: 26447
2006-03-01 05:50:56 +00:00
Nate Begeman
9c0ab71f4a kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC
and SUBE nodes that actually expose what's going on and allow for
significant simplifications in the targets.

llvm-svn: 26255
2006-02-17 05:43:56 +00:00
Nate Begeman
7261b82dd7 Add missing patterns for andi. and andis., fixing
test/Regression/CodeGen/PowerPC/and-imm.ll

llvm-svn: 26136
2006-02-12 09:09:52 +00:00
Chris Lattner
20d4194a0d PHI and INLINEASM are now built-in instructions provided by Target.td
llvm-svn: 25674
2006-01-27 01:46:15 +00:00
Chris Lattner
29e1825fd3 ahem :)
llvm-svn: 25239
2006-01-12 02:05:36 +00:00
Nate Begeman
cff96008ac Add bswap, rotl, and rotr nodes
Add dag combiner code to recognize rotl, rotr
Add ppc code to match rotl

Targets should add rotl/rotr patterns if they have them

llvm-svn: 25222
2006-01-11 21:21:00 +00:00
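A hedged sketch of the kind of pattern a target adds so the new rotl node is matched; the rlwnm mapping shown is the natural PPC choice, with operand classes assumed.

// Rotate-left by a register amount is a rlwnm with the full 0..31 mask.
def : Pat<(rotl GPRC:$in, GPRC:$sh),
          (RLWNM GPRC:$in, GPRC:$sh, 0, 31)>;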
Nate Begeman
7ed9b8b287 Remove a comment that no longer applies.
llvm-svn: 25167
2006-01-10 00:15:59 +00:00
Chris Lattner
95443534bf add ret void support back
llvm-svn: 25164
2006-01-09 23:20:37 +00:00
Evan Cheng
e720cfd690 New DAG node properties SNDPInFlag, SNDPOutFlag, and SNDPOptInFlag to replace
hasInFlag, hasOutFlag.

llvm-svn: 25155
2006-01-09 18:28:21 +00:00
Jim Laskey
5eddaee9f3 Added initial support for DEBUG_LABEL allowing debug specific labels to be
inserted in the code.

llvm-svn: 25104
2006-01-05 01:25:28 +00:00
Jim Laskey
897ad8ddb7 Add unique id to debug location for debug label use (work in progress.)
llvm-svn: 25096
2006-01-04 15:04:11 +00:00
Nate Begeman
ec7c28a28c Add support for generating v4i32 altivec code
llvm-svn: 25046
2005-12-30 00:12:56 +00:00
Evan Cheng
231b11ba87 Added field noResults to Instruction.
Currently tblgen cannot tell which operands in the operand list are results so
it assumes the first one is a result. This is bad. Ideally we would fix this
by separating results from inputs, e.g. (res R32:$dst),
(ops R32:$src1, R32:$src2). But that's a more disruptive change. Adding
'let noResults = 1' is the workaround to tell tblgen that the instruction does
not produce a result. It works for now since tblgen does not support
instructions which produce multiple results.

llvm-svn: 25017
2005-12-26 09:11:45 +00:00
Evan Cheng
d87688fe72 * Removed the use of FLAG. Now use hasFlagIn and hasFlagOut instead.
* Added a pseudo instruction (for each target) that represent "return void".
  This is a workaround for lack of optional flag operand (return void is not
  lowered so it does not have a flag operand.)

llvm-svn: 24997
2005-12-23 22:14:32 +00:00
Evan Cheng
05ad906ccf Flip the meaning of FPContractions to reflect Requires<[]> change.
llvm-svn: 24884
2005-12-20 20:08:53 +00:00
Nate Begeman
a114534620 Pattern-match return. Includes gross hack!
llvm-svn: 24874
2005-12-20 00:26:01 +00:00
Nate Begeman
9c7dce88b5 Convert load/store over to being pattern matched
llvm-svn: 24871
2005-12-19 23:25:09 +00:00
Jim Laskey
37957b1ad3 Added source file/line correspondence for dwarf (PowerPC only at this point.)
llvm-svn: 24748
2005-12-16 22:45:29 +00:00
Nate Begeman
69da94a1b9 Add a second vector type to the VRRC register class, and fix some patterns
so that tablegen can infer all types.

llvm-svn: 24746
2005-12-16 09:19:13 +00:00
Nate Begeman
fe7a3f28e3 Use the new predicate support that Evan Cheng added to remove some code
from the DAGToDAG cpp file.  This adds pattern support for vector and
scalar fma, which passes test/Regression/CodeGen/PowerPC/fma.ll, and
does the right thing in the presence of -disable-excess-fp-precision.

Allows us to match:
void %foo(<4 x float> * %a) {
entry:
  %tmp1 = load <4 x float> * %a;
  %tmp2 = mul <4 x float> %tmp1, %tmp1
  %tmp3 = add <4 x float> %tmp2, %tmp1
  store <4 x float> %tmp3, <4 x float> *%a
  ret void
}

As:

_foo:
        li r2, 0
        lvx v0, r2, r3
        vmaddfp v0, v0, v0, v0
        stvx v0, r2, r3
        blr

Or, with llc -disable-excess-fp-precision,

_foo:
        li r2, 0
        lvx v0, r2, r3
        vxor v1, v1, v1
        vmaddfp v1, v0, v0, v1
        vaddfp v0, v1, v0
        stvx v0, r2, r3
        blr

llvm-svn: 24719
2005-12-14 22:54:33 +00:00
Evan Cheng
fbc29bb3dd Added predicate !NoExcessFPPrecision to FMADD, FMADDS, FMSUB, and FMSUBS.
llvm-svn: 24716
2005-12-14 22:07:12 +00:00
Nate Begeman
09855eafd1 Add support for fmul node of type v4f32.
void %foo(<4 x float> * %a) {
entry:
  %tmp1 = load <4 x float> * %a;
  %tmp2 = mul <4 x float> %tmp1, %tmp1
  store <4 x float> %tmp2, <4 x float> *%a
  ret void
}

Is selected to:

_foo:
        li r2, 0
        lvx v0, r2, r3
        vxor v1, v1, v1
        vmaddfp v0, v0, v0, v1
        stvx v0, r2, r3
        blr

llvm-svn: 24701
2005-12-14 00:34:09 +00:00
Nate Begeman
1700fe3f71 Prepare support for AltiVec multiply, divide, and sqrt.
llvm-svn: 24700
2005-12-13 22:55:22 +00:00
Chris Lattner
6d4db7c732 Remove type casts that are no longer needed
llvm-svn: 24661
2005-12-11 07:45:47 +00:00
Nate Begeman
a0e26b25f4 Add support for TargetConstantPool nodes to the dag isel emitter, and use
them in the PPC backend, to simplify some logic out of Select and
SelectAddr.

llvm-svn: 24657
2005-12-10 02:36:00 +00:00
Nate Begeman
5c6a84b5fc Add support patterns to many load and store instructions which will
hopefully use patterns in the near future.

llvm-svn: 24651
2005-12-09 23:54:18 +00:00
Chris Lattner
68a0fed879 Use new PPC-specific nodes to represent shifts which require the 6-bit
amount handling that PPC provides.  These are generated by the lowering code
and prevents the dag combiner from assuming (rightfully) that the shifts
don't only look at 5 bits.  This fixes a miscompilation of crafty with
the new front-end.

llvm-svn: 24615
2005-12-06 02:10:38 +00:00
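A hedged sketch of such a target-specific shift node; the MYISD names and the type profile are assumptions for illustration.

// A custom node keeps the DAG combiner from assuming the shift amount
// is truncated to 5 bits, which it may legitimately do for ISD::SHL.
def SDT_MyShiftOp : SDTypeProfile<1, 2,
  [SDTCisSameAs<0, 1>, SDTCisInt<0>, SDTCisInt<2>]>;
def myshl : SDNode<"MYISD::SHL", SDT_MyShiftOp>;
def mysrl : SDNode<"MYISD::SRL", SDT_MyShiftOp>;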
Chris Lattner
b2d4850394 Add some explicit type casts so that tblgen knows the type of the shift
amount, which is not necessarily the same as the type being shifted.

llvm-svn: 24594
2005-12-05 02:34:05 +00:00
Chris Lattner
f38170bbd2 Autogen matching code for ADJCALLSTACK[UP|DOWN], thanks to Evan's tblgen
improvements.

llvm-svn: 24591
2005-12-04 19:01:59 +00:00
Chris Lattner
a8af34937b Finish moving uncond br over to .td file, remove from .cpp file.
llvm-svn: 24590
2005-12-04 18:48:01 +00:00
Chris Lattner
b62b05bde6 Define BR in the .td file now that Evan made tblgen smarter.
llvm-svn: 24589
2005-12-04 18:42:54 +00:00
Nate Begeman
ebafe9c6d8 Represent the encoding of the SPR instructions as they actually are, so
that we can use the correct SPR numbers in the InstrInfo.td file.  This is
necessary to support VRsave.

llvm-svn: 24521
2005-11-29 22:42:50 +00:00
Nate Begeman
16a1c53abc Add the remainder of the AltiVec 4 x float instructions. Further
enhancements will be necessary to teach the code generator that since
there is no fmul, it will have to do vmaddfp, adding +0.0.

llvm-svn: 24516
2005-11-29 08:04:45 +00:00
Nate Begeman
84cac055ad Small tweaks noticed while on the plane.
llvm-svn: 24492
2005-11-26 22:39:34 +00:00
Nate Begeman
687456dd7a Some first bits of AltiVec stuff: Instruction Formats, Encodings, and
Registers.  Apologies to Jim if the scheduling info so far isn't accurate.

There's a few more things like VRsave support that need to be finished up
in my local tree before I can commit code that Does The Right Thing for
turning 4 x float into the various altivec packed float instructions.

llvm-svn: 24489
2005-11-23 05:29:52 +00:00
Chris Lattner
02522dc4e6 disentangle call operands from branch operands a bit
llvm-svn: 24400
2005-11-17 19:16:08 +00:00
Chris Lattner
92a1367bed Generate LA and ADDIS when possible.
llvm-svn: 24395
2005-11-17 17:52:01 +00:00
Chris Lattner
8d04987a39 Add an initial hack at legalizing GlobalAddress into the appropriate nodes
on Darwin to remove smarts from the isel.  This is currently disabled by
default (uncomment setOperationAction(ISD::GlobalAddress to enable it).
tblgen needs to become smarter about tglobaladdr nodes and bigger patterns
needed to be added to the .td file.  However, we can currently emit stuff like
this:  :)

        li r2, lo16(L_x$non_lazy_ptr)
        lis r3, ha16(L_x$non_lazy_ptr)
        lwzx r2, r3, r2

The obvious improvements will follow.

llvm-svn: 24390
2005-11-17 07:30:41 +00:00
Chris Lattner
5f605f3c12 LI could theoretically be used for the lo-part of a global address, just like
lis can be used for the high part.

llvm-svn: 24388
2005-11-17 07:04:43 +00:00
Nate Begeman
684381a73b Patch to clean up function call pseudos and support the BLA instruction,
which branches to an absolute address.  This is required to support objc
direct dispatch.

llvm-svn: 24370
2005-11-16 00:48:01 +00:00
Chris Lattner
379e078ee6 add support for branch on ordered/unordered.
llvm-svn: 24067
2005-10-28 20:32:44 +00:00
Chris Lattner
e2df44dbb7 autogen undef
llvm-svn: 23991
2005-10-25 21:03:41 +00:00
Chris Lattner
a701ef16fc Allow pseudos to have patterns, no functionality change
llvm-svn: 23988
2005-10-25 20:58:43 +00:00
Chris Lattner
fb373ddb69 Autogen fsel
llvm-svn: 23987
2005-10-25 20:55:47 +00:00
Chris Lattner
aaf22bf5c5 Autogen a few new ppc-specific nodes
llvm-svn: 23985
2005-10-25 20:41:46 +00:00
Chris Lattner
ffa76df587 Instead of aborting if not a case we can handle specially, break out and
let the generic code handle it.  This fixes CodeGen/Generic/2005-10-21-longlonggtu.ll on ppc.

Also, reindent this code.

llvm-svn: 23874
2005-10-21 21:17:10 +00:00
Nate Begeman
d633b875bb Match rotate. This does actually match the rotates in an rc5 cipher, but I
haven't seen it fire on our testsuite.

llvm-svn: 23863
2005-10-21 06:36:18 +00:00
Nate Begeman
bbce8c042c Add some more patterns for i64 on ppc
llvm-svn: 23842
2005-10-20 07:51:08 +00:00
Jim Laskey
514a74d946 Added InstrSchedClass to each of the PowerPC Instructions.
Note that when adding new instructions you should refer to the table at the
bottom of PPCSchedule.td.

llvm-svn: 23830
2005-10-19 19:51:16 +00:00
Nate Begeman
83f0f34140 Write patterns for the various shl and srl patterns that don't involve
doing something clever.

llvm-svn: 23824
2005-10-19 18:42:01 +00:00
Chris Lattner
61ae05f5dd now that tblgen is smarter, use integers directly. This should help Andrew too
llvm-svn: 23818
2005-10-19 04:32:04 +00:00
Chris Lattner
73379995ab Convert these cases to patterns
llvm-svn: 23811
2005-10-19 01:38:02 +00:00
Nate Begeman
fccb39f398 Woo, it kinda works. We now generate this atrociously bad, but correct,
code for long long foo(long long a, long long b) { return a + b; }

_foo:
        or r2, r3, r3
        or r3, r4, r4
        or r4, r5, r5
        or r5, r6, r6
        rldicr r2, r2, 32, 31
        rldicl r3, r3, 0, 32
        rldicr r4, r4, 32, 31
        rldicl r5, r5, 0, 32
        or r2, r3, r2
        or r3, r5, r4
        add r4, r3, r2
        rldicl r2, r4, 32, 32
        or r4, r4, r4
        or r3, r2, r2
        blr

llvm-svn: 23809
2005-10-19 01:12:32 +00:00
Nate Begeman
722531ea21 Make a new reg class for 64 bit regs that aliases the 32 bit regs. This
will have to tide us over until we get real subreg support, but it prevents
the PrologEpilogInserter from spilling 8 byte GPRs on a G4 processor.

Add some initial support for TRUNCATE and ANY_EXTEND, but they don't
currently work due to issues with ScheduleDAG.  Something wll have to be
figured out.

llvm-svn: 23803
2005-10-19 00:05:37 +00:00
Chris Lattner
5edea4e9cd Fix the JIT encoding of LWA, LD, STD, and STDU.
llvm-svn: 23787
2005-10-18 16:51:22 +00:00
Nate Begeman
b0e319a7c7 First bits of 64 bit PowerPC stuff, currently disabled. A lot of this is
purely mechanical.

llvm-svn: 23778
2005-10-18 00:28:58 +00:00
Chris Lattner
114941504c Add a pattern for FSQRTS
llvm-svn: 23750
2005-10-15 21:44:15 +00:00
Chris Lattner
11127fcf98 Rename PowerPC*.td -> PPC*.td
llvm-svn: 23740
2005-10-14 23:40:39 +00:00