llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-27 22:12:47 +01:00

Author	SHA1	Message	Date
Dan Gohman	955fdc7a4c	Add explicit keywords. llvm-svn: 53179	2008-07-07 18:00:37 +00:00
Evan Cheng	3f664b6fd3	Split scheduling from instruction selection. llvm-svn: 52923	2008-06-30 20:45:06 +00:00
Duncan Sands	d634afe3aa	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Dan Gohman	938e74654b	Convert the last remaining users of the non-APInt form of ComputeMaskedBits to use the APInt form, and remove the non-APInt form. llvm-svn: 47654	2008-02-27 01:23:58 +00:00
Anton Korobeynikov	4f6e612973	Remove bunch of gcc 4.3-related warnings from Target llvm-svn: 47369	2008-02-20 11:22:39 +00:00
Dan Gohman	cabaec582f	Rename MRegisterInfo to TargetRegisterInfo. llvm-svn: 46930	2008-02-10 18:45:23 +00:00
Evan Cheng	1c67dcaae7	Dwarf requires variable entries to be in the source order. Right now, since we are recording variable information at isel time this means parameters would appear in the reverse order. The short term fix is to issue recordVariable() at asm printing time instead. llvm-svn: 46724	2008-02-04 23:06:48 +00:00
Evan Cheng	c57ec111f2	SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameter, etc. Added ISD::DECLARE node type to represent llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into a SDNode and lives on through out the codegen passes. For now, since all the debugging information recording is done at isel time, when a ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time. llvm-svn: 46659	2008-02-02 04:07:54 +00:00
Dan Gohman	13d1327796	Factor the addressing mode and the load/store VT out of LoadSDNode and StoreSDNode into their common base class LSBaseSDNode. Member functions getLoadedVT and getStoredVT are replaced with the common getMemoryVT to simplify code that will handle both loads and stores. llvm-svn: 46538	2008-01-30 00:15:11 +00:00
Chris Lattner	cafc567fb7	Finally implement correct ordered comparisons for PPC, even though the code generated is not wonderful. This turns a miscompilation into a code quality bug (noted in the ppc readme). This fixes PR642, which is over 2 years old (!). Nate, please review this. llvm-svn: 45742	2008-01-08 06:46:30 +00:00
Chris Lattner	f83aae613c	rename TargetInstrDescriptor -> TargetInstrDesc. Make MachineInstr::getDesc return a reference instead of a pointer, since it can never be null. llvm-svn: 45695	2008-01-07 07:27:27 +00:00
Chris Lattner	9d38dfa4a5	Move a bunch more accessors from TargetInstrInfo to TargetInstrDescriptor llvm-svn: 45680	2008-01-07 03:13:06 +00:00
Chris Lattner	f7f96d818f	Rename MachineInstr::getInstrDescriptor -> getDesc(), which reflects that it is cheap and efficient to get. Move a variety of predicates from TargetInstrInfo into TargetInstrDescriptor, which makes it much easier to query a predicate when you don't have TII around. Now you can use MI->getDesc()->isBranch() instead of going through TII, and this is much more efficient anyway. Not all of the predicates have been moved over yet. Update old code that used MI->getInstrDescriptor()->Flags to use the new predicates in many places. llvm-svn: 45674	2008-01-07 01:56:04 +00:00
Chris Lattner	96167aa93c	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Evan Cheng	0590c75f18	Temporary solution: added a different set of BCTRL_Macho / BCTRL_ELF with right callee-saved defs set for ppc64. llvm-svn: 43248	2007-10-23 06:42:42 +00:00
Evan Cheng	13b846b1ad	Prevent PPC::BCC first operand, the PRED number, from being isel'd into a LI instruction. llvm-svn: 37790	2007-06-29 01:25:06 +00:00
Dan Gohman	a62327ea40	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Chris Lattner	dbce9ef4b8	Fix a bug which caused us to never be able to use signed comparisons for equality comparisons of a constant. This allows us to codegen the 'sintzero' loop in PR1288 as: LBB1_1: ;cond_next li r4, 0 addi r2, r2, 1 stw r4, 0(r3) addi r3, r3, 4 cmpwi cr0, r2, -1 bne cr0, LBB1_1 ;cond_next instead of: LBB1_1: ;cond_next addi r2, r2, 1 li r4, 0 xoris r5, r2, 65535 stw r4, 0(r3) addi r3, r3, 4 cmplwi cr0, r5, 65535 bne cr0, LBB1_1 ;cond_next This implements CodeGen/PowerPC/compare-simm.ll, and also cuts 74 instructions out of kc++. llvm-svn: 35590	2007-04-02 05:59:42 +00:00
Chris Lattner	8896b6cb46	eliminate static ctors for Statistic objects. llvm-svn: 32703	2006-12-19 22:59:26 +00:00
Jim Laskey	18b1edb10d	Reduce number of instructions to load 64-bit constants. llvm-svn: 32481	2006-12-12 13:23:43 +00:00
Bill Wendling	f13d78d3b8	What should be the last unnecessary <iostream>s in the library. llvm-svn: 32333	2006-12-07 22:21:48 +00:00
Chris Lattner	a531ce882e	Detemplatize the Statistic class. The only type it is instantiated with is 'unsigned'. llvm-svn: 32279	2006-12-06 17:46:33 +00:00
Evan Cheng	98fa7ab4d7	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Chris Lattner	0d88b19f2f	convert PPC::BCC to use the 'pred' operand instead of separate predicate value and CR reg #. This requires swapping the order of these everywhere that touches BCC and requires us to write custom matching logic for PPCcondbranch :( llvm-svn: 31835	2006-11-17 22:37:34 +00:00
Chris Lattner	73329ae80d	rename PPC::COND_BRANCH to PPC::BCC llvm-svn: 31834	2006-11-17 22:14:47 +00:00
Chris Lattner	1527483a15	start using PPC predicates more consistently. llvm-svn: 31833	2006-11-17 22:10:59 +00:00
Chris Lattner	4edb6f09fe	add patterns for ppc32 preinc stores. ppc64 next. llvm-svn: 31775	2006-11-16 00:41:37 +00:00
Chris Lattner	9bc55a6c38	fix ldu/stu jit encoding. Swith 64-bit preinc load instrs to use memri addrmodes. llvm-svn: 31757	2006-11-15 19:55:13 +00:00
Chris Lattner	10b528d5c3	remove a ton of custom selection logic no longer needed llvm-svn: 31733	2006-11-14 18:43:11 +00:00
Chris Lattner	1409347c13	allow the offset of a preinc'd load to be the low-part of a global. This produces this clever code: _millisecs: lis r2, ha16(_Time.1182) lwzu r3, lo16(_Time.1182)(r2) lwz r2, 4(r2) addic r4, r2, 1 addze r3, r3 blr instead of this: _millisecs: lis r2, ha16(_Time.1182) la r3, lo16(_Time.1182)(r2) lwz r2, lo16(_Time.1182)(r2) lwz r3, 4(r3) addic r4, r3, 1 addze r3, r2 blr for: long %millisecs() { %tmp = load long* %Time.1182 ; <long> [#uses=1] %tmp1 = add long %tmp, 1 ; <long> [#uses=1] ret long %tmp1 } llvm-svn: 31673	2006-11-11 04:53:30 +00:00
Chris Lattner	1aaa5f904c	implement preinc support for r+i loads on ppc64 llvm-svn: 31654	2006-11-10 23:58:45 +00:00
Chris Lattner	1604b6a873	add an initial cut at preinc loads for ppc32. This is broken for ppc64 (because the 64-bit reg target versions aren't implemented yet), doesn't support r+r addr modes, and doesn't handle stores, but it works otherwise. :) This is disabled unless -enable-ppc-preinc is passed to llc for now. llvm-svn: 31621	2006-11-10 02:08:47 +00:00
Evan Cheng	736a8eb3cd	Match tblegen changes. llvm-svn: 31571	2006-11-08 20:34:28 +00:00
Chris Lattner	bd39c99fd1	Refactor all the addressing mode selection stuff into the isel lowering class, where it can be used for preinc formation. llvm-svn: 31536	2006-11-08 02:15:41 +00:00
Reid Spencer	4bafa71dc1	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Chris Lattner	ed0b1b4b24	fix miscompilation of llvm.isunordered, where we branched on the opposite condition. This fixes miscompilation of Olden/bh and many others. llvm-svn: 31301	2006-10-30 23:02:25 +00:00
Nate Begeman	7bcce1a7f6	Fold AND and ROTL more often llvm-svn: 30577	2006-09-22 05:01:56 +00:00
Chris Lattner	4d97247875	Improve PPC64 equality comparisons like PPC32 comparisons. llvm-svn: 30510	2006-09-20 04:33:27 +00:00
Chris Lattner	69390a3f80	Two improvements: 1. Codegen this comparison: if (X == 0x8000) as: cmplwi cr0, r3, 32768 bne cr0, LBB1_2 ;cond_next instead of: lis r2, 0 ori r2, r2, 32768 cmpw cr0, r3, r2 bne cr0, LBB1_2 ;cond_next 2. Codegen this comparison: if (X == 0x12345678) as: xoris r2, r3, 4660 cmplwi cr0, r2, 22136 bne cr0, LBB1_2 ;cond_next instead of: lis r2, 4660 ori r2, r2, 22136 cmpw cr0, r3, r2 bne cr0, LBB1_2 ;cond_next llvm-svn: 30509	2006-09-20 04:25:47 +00:00
Chris Lattner	33bd5dcfb7	s\|llvm/Support/Visibility.h\|llvm/Support/Compiler.h\| llvm-svn: 29911	2006-08-27 12:54:02 +00:00
Evan Cheng	a6f81f1863	Do not use getTargetNode() and SelectNodeTo() which takes more than 3 SDOperand arguments. Use the variants which take an array and number instead. llvm-svn: 29907	2006-08-27 08:14:06 +00:00
Evan Cheng	1c3d571e4b	SelectNodeTo now returns a SDNode*. llvm-svn: 29901	2006-08-26 08:00:10 +00:00
Evan Cheng	2db7799507	Select() no longer require Result operand by reference. llvm-svn: 29898	2006-08-26 05:34:46 +00:00
Evan Cheng	b246ad7b2f	Match tblgen changes. llvm-svn: 29895	2006-08-26 01:07:58 +00:00
Chris Lattner	2c12a719c0	Fix PowerPC/2006-08-15-SelectionCrash.ll and simplify selection code. llvm-svn: 29715	2006-08-15 23:48:22 +00:00
Evan Cheng	6053206580	Match tablegen changes. llvm-svn: 29604	2006-08-11 09:08:15 +00:00
Chris Lattner	7b1362fa52	Start eliminating temporary vectors used to create DAG nodes. Instead, pass in the start of an array and a count of operands where applicable. In many cases, the number of operands is known, so this static array can be allocated on the stack, avoiding the heap. In many other cases, a SmallVector can be used, which has the same benefit in the common cases. I updated a lot of code calling getNode that takes a vector, but ran out of time. The rest of the code should be updated, and these methods should be removed. We should also do the same thing to eliminate the methods that take a vector of MVT::ValueTypes. It would be extra nice to convert the dagiselemitter to avoid creating vectors for operands when calling getTargetNode. llvm-svn: 29566	2006-08-08 02:23:42 +00:00
Evan Cheng	d18be1d9c1	Match tablegen isel changes. llvm-svn: 29549	2006-08-07 22:28:20 +00:00
Evan Cheng	3b5f1c6248	Remove InFlightSet hack. No longer needed. llvm-svn: 29373	2006-07-28 00:47:19 +00:00
Evan Cheng	e869883c2d	Remove NodeDepth llvm-svn: 29338	2006-07-27 06:40:15 +00:00
Chris Lattner	26f1985fdc	shrink libllvmgcc.dylib another 25K llvm-svn: 28971	2006-06-28 22:00:36 +00:00
Chris Lattner	852423b469	Don't match 64-bit bitfield inserts into rlwimi's. todo add rldimi. :) llvm-svn: 28944	2006-06-27 21:08:52 +00:00
Chris Lattner	d7b1f61e72	Fix ppc64 jump tables llvm-svn: 28941	2006-06-27 20:46:17 +00:00
Chris Lattner	a572f110b4	Fix variable shadowing issue llvm-svn: 28922	2006-06-27 00:10:13 +00:00
Chris Lattner	494f476ca7	Implement a bunch of 64-bit cleanliness work. With this, treeadd builds (but doesn't work right). llvm-svn: 28921	2006-06-27 00:04:13 +00:00
Chris Lattner	7bc8eae1f0	Work around a nasty tblgen bug where it doesn't add operands for varargs nodes correctly. llvm-svn: 28745	2006-06-10 01:15:02 +00:00
Chris Lattner	cbcad040b3	Fix build failure of povray llvm-svn: 28473	2006-05-25 18:06:16 +00:00
Chris Lattner	e3059fb8bd	Fix Benchmarks/MallocBench/cfrac llvm-svn: 28471	2006-05-25 16:54:16 +00:00
Evan Cheng	09942d3f8b	Assert if InflightSet is not cleared after instruction selecting a BB. llvm-svn: 28459	2006-05-25 00:24:28 +00:00
Evan Cheng	b040dd86af	Clear HandleMap and ReplaceMap after instruction selection. Or it may cause non-deterministic behavior. llvm-svn: 28454	2006-05-24 20:46:25 +00:00
Chris Lattner	2208c3214c	Make PPC call lowering more aggressive, making the isel matching code simple enough to be autogenerated. llvm-svn: 28354	2006-05-17 19:00:46 +00:00
Chris Lattner	03c70b7f27	Switch PPC over to a call-selection model where the lowering code creates the copyto/fromregs instead of making the PPCISD::CALL selection code create them. This vastly simplifies the selection code, and moves the ABI handling parts into one place. llvm-svn: 28346	2006-05-17 06:01:33 +00:00
Chris Lattner	a36579803f	implement passing/returning vector regs to calls, at least non-varargs calls. llvm-svn: 28341	2006-05-16 23:54:25 +00:00
Chris Lattner	bcd2c4f32d	Fix PowerPC/2006-05-12-rlwimi-crash.ll Nate, please verify that if InsertMask is 0, rlwimi shouldn't be used. This fixes the crash and causes no PPC testsuite regressions. llvm-svn: 28243	2006-05-12 16:29:37 +00:00
Nate Begeman	a706539a72	Fold more shifts into inserts, and update the README llvm-svn: 28168	2006-05-08 17:38:32 +00:00
Nate Begeman	591488077e	Update some stuff now that the new rlwimi code has gone in llvm-svn: 28162	2006-05-08 02:52:38 +00:00
Nate Begeman	dc94b738d0	New rlwimi implementation, which is superior to the old one. There are still a couple missed optimizations, but we now generate all the possible rlwimis for multiple inserts into the same bitfield. More regression tests to come. llvm-svn: 28156	2006-05-07 00:23:38 +00:00
Nate Begeman	7ed816f900	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Chris Lattner	2ffa288a23	Add VRRC select support llvm-svn: 27543	2006-04-08 22:45:08 +00:00
Chris Lattner	65a455b060	Codegen vector predicate compares. llvm-svn: 27151	2006-03-26 10:06:40 +00:00
Chris Lattner	e199d55073	#include Intrinsics.h into all dag isels llvm-svn: 27109	2006-03-25 06:47:10 +00:00
Chris Lattner	3133dafd4b	Like the comment says, prefer to use the implicit add done by [r+r] addressing modes than emitting an explicit add and using a base of r0. This implements Regression/CodeGen/PowerPC/mem-rr-addr-mode.ll llvm-svn: 27068	2006-03-24 17:58:06 +00:00
Chris Lattner	cfbce5186a	Add support for "ri" addressing modes where the immediate is a 14-bit field which is shifted left two bits before use. Instructions like STD use this addressing mode. llvm-svn: 26942	2006-03-22 05:26:03 +00:00
Chris Lattner	6417236c41	With Evan's latest tblgen patch, this code is obsolete, thanks Evan! llvm-svn: 26917	2006-03-21 06:37:40 +00:00
Chris Lattner	cdc4657988	Handle constant addresses more efficiently, folding the low bits into the disp field of the load/store if possible. This compiles CodeGen/PowerPC/load-constant-addr.ll to: _test: lis r2, 2838 lfs f1, 26848(r2) blr instead of: _test: lis r2, 2838 ori r2, r2, 26848 lfs f1, 0(r2) blr llvm-svn: 26908	2006-03-20 22:38:22 +00:00
Chris Lattner	5c994b8c63	reenable this hack, the tblgen version isn't quite ready llvm-svn: 26902	2006-03-20 17:54:43 +00:00
Evan Cheng	57da1afbc8	Use tblgen'd VECTOR_SHUFFLE selection code. llvm-svn: 26900	2006-03-20 08:14:16 +00:00
Chris Lattner	dc3605efdb	Add support for generating vspltw, instead of a vperm instruction with a constant pool load. This generates significantly nicer code for splats. When tblgen gets bugfixed, we can remove the custom selection code. llvm-svn: 26898	2006-03-20 06:51:10 +00:00
Nate Begeman	42736d46b2	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Chris Lattner	b5d0896994	Save/restore VRSAVE once per function, not once per block. llvm-svn: 26793	2006-03-16 18:25:23 +00:00
Chris Lattner	392087f5bd	Fix an off by one error that caused PPC LLC failures last night. llvm-svn: 26758	2006-03-14 17:56:49 +00:00
Evan Cheng	7ec94f2ff7	Added getTargetLowering() to TargetMachine. Refactored targets to support this. llvm-svn: 26742	2006-03-13 23:20:37 +00:00
Chris Lattner	d0505331d2	For functions that use vector registers, save VRSAVE, mark used registers, and update it on entry to each function, then restore it on exit. This compiles: void func(vfloat a, vfloat b, vfloat c) { a = b c + c; } to this: _func: mfspr r2, 256 oris r6, r2, 49152 mtspr 256, r6 lvx v0, 0, r5 lvx v1, 0, r4 vmaddfp v0, v1, v0, v0 stvx v0, 0, r3 mtspr 256, r2 blr GCC produces this (which has additional stack accesses): _func: mfspr r0,256 stw r0,-4(r1) oris r0,r0,0xc000 mtspr 256,r0 lvx v0,0,r5 lvx v1,0,r4 lwz r12,-4(r1) vmaddfp v0,v0,v1,v0 stvx v0,0,r3 mtspr 256,r12 blr llvm-svn: 26733	2006-03-13 21:52:10 +00:00
Chris Lattner	a278639f29	Several big changes: 1. Use flags on the instructions in the .td file to indicate the PPC970 unit type instead of a table in the .cpp file. Much cleaner. 2. Change the hazard recognizer to build d-groups according to the actual algorithm used, not my flawed understanding of it. 3. Model "must be in the first slot" and "must be the only instr in a group" accurately. llvm-svn: 26719	2006-03-12 09:13:49 +00:00
Chris Lattner	3f23d22d3f	Change the interface for getting a target HazardRecognizer to be more clean. llvm-svn: 26608	2006-03-08 04:25:59 +00:00
Chris Lattner	4cd6cd499d	Implement a very very simple hazard recognizer for LSU rejects and ctr set/read flushes llvm-svn: 26587	2006-03-07 06:32:48 +00:00
Chris Lattner	317021b6c4	Implement CodeGen/PowerPC/or-addressing-mode.ll, which is also PR668. llvm-svn: 26450	2006-03-01 07:14:48 +00:00
Chris Lattner	3d451516ec	Implement selection of inline asm memory operands llvm-svn: 26348	2006-02-24 02:13:12 +00:00
Nate Begeman	9c0ab71f4a	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Evan Cheng	131901cbb8	If the false case is the current basic block, then this is a self loop. We do not want to emit "Loop: ... brcond Out; br Loop", as it adds an extra instruction in the loop. Instead, invert the condition and emit "Loop: ... br!cond Loop; br Out. Generalize the fix by moving it from PPCDAGToDAGISel to SelectionDAGLowering. llvm-svn: 26231	2006-02-16 08:27:56 +00:00
Evan Cheng	6bd0f9c4ba	Match getTargetNode() changes (now return SDNode* instead of SDOperand). llvm-svn: 26085	2006-02-09 07:17:49 +00:00
Evan Cheng	521e5a1bfe	Change Select() from SDOperand Select(SDOperand N); to void Select(SDOperand &Result, SDOperand N); llvm-svn: 26067	2006-02-09 00:37:58 +00:00
Evan Cheng	9fb67ea859	Complex pattern isel code shouldn't select nodes. llvm-svn: 26010	2006-02-05 08:45:01 +00:00
Evan Cheng	fb902782e8	Use SelectRoot() as entry of any tblgen based isel. llvm-svn: 25997	2006-02-05 06:46:41 +00:00
Chris Lattner	6a5d2450a3	Use PPCISD::CALL instead of ISD::CALL llvm-svn: 25717	2006-01-27 23:34:02 +00:00
Chris Lattner	aafc339b4e	Add explicit #includes of <iostream> llvm-svn: 25515	2006-01-22 23:41:00 +00:00
Chris Lattner	4d2c4cb7a7	Use the default impl of DYNAMIC_STACKALLOC, allowing us to delete some code. llvm-svn: 25334	2006-01-15 09:02:48 +00:00
Chris Lattner	452a84e2b6	these cases are autogenerated llvm-svn: 25238	2006-01-12 02:01:45 +00:00
Chris Lattner	861897037b	remove dead code llvm-svn: 25237	2006-01-12 01:54:15 +00:00
Chris Lattner	5488b43338	Fix a compile crash building MultiSource/Applications/d with the new front-end. The PPC backend was generating random shift counts in this case, due to an uninitialized variable. llvm-svn: 25114	2006-01-05 18:32:49 +00:00
Nate Begeman	96c7e22231	Fix one of the things in the todo file, and get a bit closer to folding constant offsets from statics into the address arithmetic. llvm-svn: 24999	2005-12-24 01:00:15 +00:00
Nate Begeman	a114534620	Pattern-match return. Includes gross hack! llvm-svn: 24874	2005-12-20 00:26:01 +00:00
Nate Begeman	d4562971b3	Fix a couple of the FIXMEs, thanks to suggestion from Chris. This allows us to load and store vectors directly at a pointer (offset of zero) by using r0 as the base register. This also requires some asm printer work to satisfy the darwin assembler. For void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float> *%a ret void } We now produce: _foo: lvx v0, 0, r3 vaddfp v0, v0, v0 stvx v0, 0, r3 blr Instead of: _foo: li r2, 0 lvx v0, r2, r3 vaddfp v0, v0, v0 stvx v0, r2, r3 blr llvm-svn: 24872	2005-12-19 23:40:42 +00:00
Nate Begeman	9c7dce88b5	Convert load/store over to being pattern matched llvm-svn: 24871	2005-12-19 23:25:09 +00:00
Chris Lattner	0124442495	This is handled by the autogen'd code llvm-svn: 24834	2005-12-18 21:06:11 +00:00
Nate Begeman	f5ac708070	Remove a now unused statistic. llvm-svn: 24720	2005-12-14 22:56:16 +00:00
Nate Begeman	fe7a3f28e3	Use the new predicate support that Evan Cheng added to remove some code from the DAGToDAG cpp file. This adds pattern support for vector and scalar fma, which passes test/Regression/CodeGen/PowerPC/fma.ll, and does the right thing in the presence of -disable-excess-fp-precision. Allows us to match: void %foo(<4 x float> * %a) { entry: %tmp1 = load <4 x float> * %a; %tmp2 = mul <4 x float> %tmp1, %tmp1 %tmp3 = add <4 x float> %tmp2, %tmp1 store <4 x float> %tmp3, <4 x float> *%a ret void } As: _foo: li r2, 0 lvx v0, r2, r3 vmaddfp v0, v0, v0, v0 stvx v0, r2, r3 blr Or, with llc -disable-excess-fp-precision, _foo: li r2, 0 lvx v0, r2, r3 vxor v1, v1, v1 vmaddfp v1, v0, v0, v1 vaddfp v0, v1, v0 stvx v0, r2, r3 blr llvm-svn: 24719	2005-12-14 22:54:33 +00:00
Nate Begeman	a0e26b25f4	Add support for TargetConstantPool nodes to the dag isel emitter, and use them in the PPC backend, to simplify some logic out of Select and SelectAddr. llvm-svn: 24657	2005-12-10 02:36:00 +00:00
Chris Lattner	46ca9774fc	Silence another annoying GCC warning llvm-svn: 24627	2005-12-06 20:56:18 +00:00
Chris Lattner	e6daa0e5bc	The basic fneg cases are already autogen'd llvm-svn: 24592	2005-12-04 19:04:38 +00:00
Chris Lattner	f38170bbd2	Autogen matching code for ADJCALLSTACK[UP\|DOWN], thanks to Evan's tblgen improvements. llvm-svn: 24591	2005-12-04 19:01:59 +00:00
Chris Lattner	a8af34937b	Finish moving uncond br over to .td file, remove from .cpp file. llvm-svn: 24590	2005-12-04 18:48:01 +00:00
Chris Lattner	046761f312	Make sure these get added into the codegenmap when appropriate llvm-svn: 24566	2005-12-01 18:09:22 +00:00
Chris Lattner	1b8cb77fea	Fix a regression caused by a patch earlier today llvm-svn: 24561	2005-12-01 03:50:19 +00:00
Evan Cheng	363ad8bbc4	Use a getCopyToReg() variant to generate a flaggy CopyToReg node. llvm-svn: 24558	2005-12-01 00:41:50 +00:00
Chris Lattner	06fbfe625c	SelectNodeTo now returns N. Use it instead of return N directly. llvm-svn: 24549	2005-11-30 22:53:06 +00:00
Nate Begeman	31121419c8	First chunk of actually generating vector code for packed types. These changes allow us to generate the following code: _foo: li r2, 0 lvx v0, r2, r3 vaddfp v0, v0, v0 stvx v0, r2, r3 blr for this llvm: void %foo(<4 x float>* %a) { entry: %tmp1 = load <4 x float>* %a %tmp2 = add <4 x float> %tmp1, %tmp1 store <4 x float> %tmp2, <4 x float>* %a ret void } llvm-svn: 24534	2005-11-30 08:22:07 +00:00
Chris Lattner	4581404290	Enable global address legalization, fixing a todo and allowing the removal of some code. This exposes the implicit load from the stubs to the DAG, allowing them to be optimized by the dag combiner. It also moves darwin specific stuff out of the isel into the legalizer, and allows more to be moved to the .td file. llvm-svn: 24397	2005-11-17 18:26:56 +00:00
Chris Lattner	e3c5f4c9d2	Teach the selector to fold lo(g) into load instruction immediate fields llvm-svn: 24396	2005-11-17 18:02:16 +00:00
Chris Lattner	8d04987a39	Add an initial hack at legalizing GlobalAddress into the appropriate nodes on Darwin to remove smarts from the isel. This is currently disabled by default (uncomment setOperationAction(ISD::GlobalAddress to enable it). tblgen needs to become smarter about tglobaladdr nodes and bigger patterns needed to be added to the .td file. However, we can currently emit stuff like this: :) li r2, lo16(L_x$non_lazy_ptr) lis r3, ha16(L_x$non_lazy_ptr) lwzx r2, r3, r2 The obvious improvements will follow. llvm-svn: 24390	2005-11-17 07:30:41 +00:00
Chris Lattner	7fdf96ed96	When lowering direct calls, lower them to use a targetglobaladress directly instead of a globaladdress. This has no effect on the generated code at all. llvm-svn: 24386	2005-11-17 05:56:14 +00:00
Nate Begeman	684381a73b	Patch to clean up function call pseudos and support the BLA instruction, which branches to an absolute address. This is required to support objc direct dispatch. llvm-svn: 24370	2005-11-16 00:48:01 +00:00
Chris Lattner	2338614087	Don't emit "32" for unordered comparison llvm-svn: 24073	2005-10-28 22:58:07 +00:00
Chris Lattner	28552d8cc8	add a hack to get code with ordered comparisons working. This hack is tracked as PR642 llvm-svn: 24068	2005-10-28 20:49:47 +00:00
Chris Lattner	379e078ee6	add support for branch on ordered/unordered. llvm-svn: 24067	2005-10-28 20:32:44 +00:00
Chris Lattner	e2df44dbb7	autogen undef llvm-svn: 23991	2005-10-25 21:03:41 +00:00
Chris Lattner	fb373ddb69	Autogen fsel llvm-svn: 23987	2005-10-25 20:55:47 +00:00
Chris Lattner	aaf22bf5c5	Autogen a few new ppc-specific nodes llvm-svn: 23985	2005-10-25 20:41:46 +00:00
Chris Lattner	a050c43068	The dag isel generator generates this now llvm-svn: 23984	2005-10-25 20:36:10 +00:00
Chris Lattner	115fa976bf	Be a bit more paranoid about calling SelectNodeTo llvm-svn: 23982	2005-10-25 20:26:41 +00:00
Chris Lattner	1fe1eab5a8	Fix a couple of minor bugs. The first fixes povray, the second fixes things if the dag combiner isn't run llvm-svn: 23981	2005-10-25 19:32:37 +00:00
Chris Lattner	ffa76df587	Instead of aborting if not a case we can handle specially, break out and let the generic code handle it. This fixes CodeGen/Generic/2005-10-21-longlonggtu.ll on ppc. also, reindent this code llvm-svn: 23874	2005-10-21 21:17:10 +00:00
Nate Begeman	6c42f509bc	Invert the TargetLowering flag that controls divide by consant expansion. Add a new flag to TargetLowering indicating if the target has really cheap signed division by powers of two, make ppc use it. This will probably go away in the future. Implement some more ISD::SDIV folds in the dag combiner Remove now dead code in the x86 backend. llvm-svn: 23853	2005-10-21 00:02:42 +00:00
Nate Begeman	dc1a2a1f19	Move the target constant divide optimization up into the dag combiner, so that the nodes can be folded with other nodes, and we can not duplicate code in every backend. Alpha will probably want this too. llvm-svn: 23835	2005-10-20 02:15:44 +00:00
Nate Begeman	83f0f34140	Write patterns for the various shl and srl patterns that don't involve doing something clever. llvm-svn: 23824	2005-10-19 18:42:01 +00:00
Chris Lattner	73379995ab	Convert these cases to patterns llvm-svn: 23811	2005-10-19 01:38:02 +00:00
Nate Begeman	fccb39f398	Woo, it kinda works. We now generate this atrociously bad, but correct, code for long long foo(long long a, long long b) { return a + b; } _foo: or r2, r3, r3 or r3, r4, r4 or r4, r5, r5 or r5, r6, r6 rldicr r2, r2, 32, 31 rldicl r3, r3, 0, 32 rldicr r4, r4, 32, 31 rldicl r5, r5, 0, 32 or r2, r3, r2 or r3, r5, r4 add r4, r3, r2 rldicl r2, r4, 32, 32 or r4, r4, r4 or r3, r2, r2 blr llvm-svn: 23809	2005-10-19 01:12:32 +00:00
Nate Begeman	722531ea21	Make a new reg class for 64 bit regs that aliases the 32 bit regs. This will have to tide us over until we get real subreg support, but it prevents the PrologEpilogInserter from spilling 8 byte GPRs on a G4 processor. Add some initial support for TRUNCATE and ANY_EXTEND, but they don't currently work due to issues with ScheduleDAG. Something wll have to be figured out. llvm-svn: 23803	2005-10-19 00:05:37 +00:00
Nate Begeman	ee581735d9	Add the ability to lower return instructions to TargetLowering. This allows us to lower legal return types to something else, to meet ABI requirements (such as that i64 be returned in two i32 regs on Darwin/ppc). llvm-svn: 23802	2005-10-18 23:23:37 +00:00
Nate Begeman	b0e319a7c7	First bits of 64 bit PowerPC stuff, currently disabled. A lot of this is purely mechanical. llvm-svn: 23778	2005-10-18 00:28:58 +00:00
Nate Begeman	723637974b	More PPC32 -> PPC changes, as well as merging some classes that were redundant after the change. llvm-svn: 23759	2005-10-16 05:39:50 +00:00
Chris Lattner	4893eb3b04	Remove some dead code: the ORI/ORIS cases are autogen'd. This makes SelectIntImmediateExpr dead. llvm-svn: 23753	2005-10-15 22:06:18 +00:00
Chris Lattner	6f97b0bb81	These instructions are now autogenerated llvm-svn: 23751	2005-10-15 21:44:56 +00:00
Chris Lattner	c9479e899a	remove dead code llvm-svn: 23749	2005-10-15 21:40:12 +00:00
Chris Lattner	d3946bbea6	Rename PPC32.h to PPC.h This completes the grand PPC file renaming llvm-svn: 23745	2005-10-14 23:59:06 +00:00
Chris Lattner	0405f3388f	Rename PowerPC.h to PPC.h llvm-svn: 23743	2005-10-14 23:51:18 +00:00
Chris Lattner	612940b7b0	Eliminate PowerPC.td and PPC32.td, consolidating them into PPC.td llvm-svn: 23738	2005-10-14 23:37:35 +00:00
Chris Lattner	102b6e76e6	These are now autogenerated llvm-svn: 23731	2005-10-14 06:26:29 +00:00
Chris Lattner	97ee5c7f18	Disable formation of rlwinm instructions from SRA bases. This fixes the 177.mesa failure from last night, and fixes the CodeGen/PowerPC/2005-10-08-ArithmeticRotate.ll regression test I added. If this code cannot be fixed, it should be removed for good, but I'll leave it to Nate to decide its fate. llvm-svn: 23670	2005-10-09 05:36:17 +00:00
Chris Lattner	6809d07aa5	When preselecting, favor things that have low depth to select first. This is faster and uses less stack space. This reduces our stack requirement enough to compile sixtrack, and though it's a hack, should be enough until we switch to iterative isel llvm-svn: 23664	2005-10-07 22:10:27 +00:00
Chris Lattner	f572bec811	Pull out Call, reducing stack frame size from 6032 bytes to 5184 bytes. llvm-svn: 23650	2005-10-06 19:07:45 +00:00
Chris Lattner	0e0c65b680	Pull out setcc, this reduces stack frame size from 7520 to 6032 bytes llvm-svn: 23649	2005-10-06 19:03:35 +00:00
Chris Lattner	0e97719aa3	Pull two more methods out, reducing stack frame size from 8224 -> 7520 bytes llvm-svn: 23648	2005-10-06 18:56:10 +00:00
Chris Lattner	4b1b6161ef	Add a recursive-iterative hybrid stage to attempt to reduce stack space, this helps but not enough. Start pulling cases out of PPC32DAGToDAGISel::Select. With GCC 4, this function required 8512 bytes of stack space for each invocation (GCC 3 required less than 700 bytes). Pulling this first function out gets us down to 8224. More to come :( llvm-svn: 23647	2005-10-06 18:45:51 +00:00
Chris Lattner	00a39b4fd5	another solution to the fsel issue. Instead of having 4 variants, just force the comparison to be 64-bits. This is fine because extensions from float to double are free. llvm-svn: 23589	2005-10-02 07:07:49 +00:00
Chris Lattner	efc0d24037	fsel can take a different FP type for the comparison and for the result. As such split the FSEL family into 4 things instead of just two. llvm-svn: 23588	2005-10-02 06:58:23 +00:00
Chris Lattner	b66cf00015	Minor tweak to the branch selector. When emitting a two-way branch, and if we're in a single-mbb loop, make sure to emit the backwards branch as the conditional branch instead of the uncond branch. For example, emit this: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 ble cr0, LBBl29_z__44 b LBBl29_z__48 * NOT PART OF LOOP Instead of: LBBl29_z__44: stw r9, 0(r15) stw r9, 4(r15) stw r9, 8(r15) stw r9, 12(r15) addi r15, r15, 16 addi r8, r8, 1 cmpw cr0, r8, r28 bgt cr0, LBBl29_z__48 * PART OF LOOP! b LBBl29_z__44 The former sequence has one fewer dispatch group for the loop body. llvm-svn: 23582	2005-10-01 23:06:26 +00:00
Chris Lattner	2f63a0f7c6	fix typo llvm-svn: 23578	2005-10-01 02:51:36 +00:00
Chris Lattner	50411b1026	Modify the ppc backend to use two register classes for FP: F8RC and F4RC. These are used to represent float and double values, and the two regclasses contain the same physical registers. llvm-svn: 23577	2005-10-01 01:35:02 +00:00
Jim Laskey	94e6b5c648	Should be using flag and not chain. llvm-svn: 23572	2005-09-30 23:43:37 +00:00
Chris Lattner	47999915ad	Remove code for patterns that are autogenerated llvm-svn: 23532	2005-09-29 23:33:31 +00:00
Chris Lattner	3c9cb55f13	Never rely on ReplaceAllUsesWith when selecting, use CodeGenMap instead. ReplaceAllUsesWith does not replace scalars SDOperand floating around on the stack, permitting things to be selected multiple times. llvm-svn: 23515	2005-09-29 00:59:32 +00:00
Chris Lattner	04dae1b9ce	Autogen MUL, move FP cases together llvm-svn: 23512	2005-09-28 22:53:16 +00:00
Chris Lattner	6f2adbd5b0	disentangle FP from INT versions of div/mul llvm-svn: 23511	2005-09-28 22:50:24 +00:00
Chris Lattner	95601538e4	Use the autogenerated matcher for ADD/SUB llvm-svn: 23510	2005-09-28 22:47:28 +00:00
Chris Lattner	9a2fb006e4	Add FP versions of the binary operators, keeping the int and fp worlds seperate. llvm-svn: 23506	2005-09-28 22:29:58 +00:00
Chris Lattner	64a7b30379	All (xor *) cases are autogenerated now llvm-svn: 23497	2005-09-28 18:12:37 +00:00
Chris Lattner	82eb231e2f	Implement PowerPC/eqv-andc-orc-nor.ll:EQV3 llvm-svn: 23494	2005-09-28 18:04:52 +00:00
Chris Lattner	dad2994a3a	These nodes are all autogenerated llvm-svn: 23489	2005-09-28 17:07:09 +00:00
Chris Lattner	6d1a2716a4	Make sure to clear the CodeGenMap after each basic block is selected to avoid cross MBB pollution. llvm-svn: 23470	2005-09-27 17:45:33 +00:00
Chris Lattner	c7fb54af0b	we don't need this proto any longer llvm-svn: 23342	2005-09-13 22:05:21 +00:00
Chris Lattner	ffab20ddeb	move the #include for the generated code into the isel class body so we can use/define class methods llvm-svn: 23339	2005-09-13 22:03:06 +00:00
Chris Lattner	00e9278551	PowerPC cannot truncstore i1 natively llvm-svn: 23304	2005-09-10 00:21:06 +00:00
Chris Lattner	1769be4699	Remove some cases handled by the generated portion of the isel llvm-svn: 23262	2005-09-07 23:45:15 +00:00
Nate Begeman	718cae4eba	Implement i64<->fp using the fctidz/fcfid instructions on PowerPC when we are allowed to generate 64-bit-only PowerPC instructions for 32 bit hosts, such as the PowerPC 970. This speeds up 189.lucas from 81.99 to 32.64 seconds. llvm-svn: 23250	2005-09-06 22:03:27 +00:00
Chris Lattner	dcb9830e25	include the dag isel fragment llvm-svn: 23239	2005-09-03 01:17:22 +00:00
Chris Lattner	cbfe5d4180	Change the isel to not break out of the big giant switch. Instead, the switch should never be exited, so its bottom is now unreachable. llvm-svn: 23234	2005-09-03 00:53:47 +00:00
Chris Lattner	b6d26dc675	Implement dynamic allocas correctly. In particular, because we were copying directly out of R1 (without using a CopyFromReg, which uses a chain), multiple allocas were getting CSE'd together, producing bogus code. For this: int %foo(bool %X, int %A, int %B) { br bool %X, label %T, label %F F: %G = alloca int %H = alloca int store int %A, int* %G store int %B, int* %H %R = load int* %G ret int %R T: ret int 0 } We were generating: _foo: stwu r1, -16(r1) stw r31, 4(r1) or r31, r1, r1 stw r1, 12(r31) cmpwi cr0, r3, 0 bne cr0, .LBB_foo_2 ; T .LBB_foo_1: ; F li r2, 16 subf r2, r2, r1 ;; One alloca or r1, r2, r2 or r3, r1, r1 or r1, r2, r2 or r2, r1, r1 stw r4, 0(r3) stw r5, 0(r2) lwz r3, 0(r3) lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr .LBB_foo_2: ; T li r3, 0 lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr Now we generate: _foo: stwu r1, -16(r1) stw r31, 4(r1) or r31, r1, r1 stw r1, 12(r31) cmpwi cr0, r3, 0 bne cr0, .LBB_foo_2 ; T .LBB_foo_1: ; F or r2, r1, r1 li r3, 16 subf r2, r3, r2 ;; Alloca 1 or r1, r2, r2 or r2, r1, r1 or r6, r1, r1 subf r3, r3, r6 ;; Alloca 2 or r1, r3, r3 or r3, r1, r1 stw r4, 0(r2) stw r5, 0(r3) lwz r3, 0(r2) lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr .LBB_foo_2: ; T li r3, 0 lwz r1, 12(r31) lwz r31, 4(r31) lwz r1, 0(r1) blr This fixes Povray and SPASS with the dag isel, the last two failing cases. Tommorow we will hopefully turn it on by default! :) llvm-svn: 23190	2005-09-01 21:31:30 +00:00
Chris Lattner	88cc0407e3	Fix a bug where we were useing HA to get the high part, which seems like it could cause a miscompile. Fixing this didn't fix the two programs that fail though. :( This also changes the implementation to follow the pattern selector more closely, causing us to select 0 to li instead of lis. llvm-svn: 23189	2005-09-01 19:38:28 +00:00
Chris Lattner	0dacf023bf	Do not select the operands being passed into SelectCC. IT does this itself and selecting early prevents folding immediates into the cmpw* instructions llvm-svn: 23188	2005-09-01 19:20:44 +00:00
Chris Lattner	914a0dbba1	Move FCTIWZ handling out of the instruction selectors and into legalization, getting them out of the business of making stack slots. llvm-svn: 23180	2005-08-31 21:09:52 +00:00
Chris Lattner	ed72c03aa1	Remove dead code llvm-svn: 23179	2005-08-31 20:25:15 +00:00
Chris Lattner	eef2e52921	add assert zext/sext to the dag isel llvm-svn: 23171	2005-08-31 18:08:46 +00:00
Chris Lattner	3d6bf9e384	Fix 'ret long' to return the high and lo parts in the right registers. This fixes crafty and probably others. llvm-svn: 23167	2005-08-31 01:34:29 +00:00
Chris Lattner	f08ec1bc4f	now that physregs can exist in the same dag with multiple types, remove some ugly hacks llvm-svn: 23162	2005-08-30 22:59:48 +00:00
Chris Lattner	0739f5a9da	Fix type mismatches when passing f32 values to calls llvm-svn: 23159	2005-08-30 21:28:19 +00:00
Chris Lattner	2f4de6af75	Fix some indentation (first hunks). Remove code (last hunk) that miscompiled immediate and's, such as and uint %tmp.30, 4294958079 into andi. r8, r8, 56319 andis. r8, r8, 65535 instead of: li r9, -9217 and r8, r8, r9 The first always generates zero. This fixes espresso. llvm-svn: 23155	2005-08-30 18:37:48 +00:00
Chris Lattner	fd2708cce1	Fix a problem Nate found where we swapped the operands of SHL/SHR_PARTS. This fixes fourinarow llvm-svn: 23153	2005-08-30 17:42:59 +00:00
Chris Lattner	6a4bdd85ad	codegen ADD_PARTS correctly: put the results in the right registers! This fixes fhourstones llvm-svn: 23152	2005-08-30 17:40:13 +00:00
Chris Lattner	025f964997	add operands in the right order, fixing McCat/18-imp with the dag isel llvm-svn: 23150	2005-08-30 17:13:58 +00:00
Chris Lattner	0e6343b2e4	Make sure the selector emits register register copies with flag operands linking them to calls when appropriate, this prevents the scheduler from pulling these copies away from the call. This fixes Ptrdist/yacr2 llvm-svn: 23143	2005-08-30 01:57:02 +00:00
Chris Lattner	66ff5ca72d	The first operand to AND does not always have more than two operands. This fixes MediaBench/toast with the dag selector llvm-svn: 23141	2005-08-30 00:59:16 +00:00
Chris Lattner	874adc990b	emit FMR instructions to convert f64<->f32 instructions, so things like STOREs, know the right type to store. llvm-svn: 23139	2005-08-30 00:30:43 +00:00
Chris Lattner	5c81e4b034	fix a crash in cfrac llvm-svn: 23137	2005-08-29 23:49:25 +00:00
Chris Lattner	7a6dfa3f21	Implement DYNAMIC_STACKALLOC, wrap some long lines llvm-svn: 23136	2005-08-29 23:30:11 +00:00
Chris Lattner	238fbfcf67	Fix a dumb bug of mine where we were mishandling the PPC ABI (undef handling). This fixes voronoi and bh in Olden, allowing all of olden to pass! llvm-svn: 23133	2005-08-29 22:22:57 +00:00
Chris Lattner	bc3fb85efd	Fix a bug the last patch exposed in treeadd among others llvm-svn: 23127	2005-08-29 01:07:02 +00:00
Chris Lattner	9821903d65	A hack to fix a problem folding immedaites. This fixes Olden/power. llvm-svn: 23126	2005-08-29 01:01:01 +00:00
Chris Lattner	eda3f49562	Fix order of operands for copytoreg node when emitting calls. This fixes Olden/msFix order of operands for copytoreg node when emitting calls. This fixes Olden/mstt. llvm-svn: 23125	2005-08-29 00:26:57 +00:00

... 2 3 4 5 6 ...

398 Commits