llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Kalle Raiskila	15993a5d28	Mark 'branch indirect' instruction as an indirect branch. Not having it confused assembly printing of jumptables. llvm-svn: 141862	2011-10-13 11:40:03 +00:00
Kalle Raiskila	7c154fe467	Pass signed (not unsigned) 10 bit field to SPU 'ori' instruction. llvm-svn: 139004	2011-09-02 10:05:01 +00:00
Kalle Raiskila	6e33c92ffb	Allow vector shifts (shl,lshr,ashr) on SPU. There was a previous implementation with patterns that would have matched e.g. shl <v4i32> <i32>, but this is not valid LLVM IR so they never were selected. llvm-svn: 126998	2011-03-04 13:19:18 +00:00
Kalle Raiskila	cc5b703c81	Add branch hinting for SPU. The implemented algorithm is overly simplistic (just speculate all branches are taken)- this is work in progress. llvm-svn: 126651	2011-02-28 14:08:24 +00:00
David Greene	0db8e64017	Fix vector sign extend to put the source and destination types in the correct places. llvm-svn: 124601	2011-01-31 20:39:01 +00:00
Kalle Raiskila	7401b2a1db	Split up RotateShift itinerary in SPU. 'rotq' and 'shlq' instructions go to the odd pipeline, whereas the inter-vector equivalents 'rot', 'shl' go to the even. llvm-svn: 123622	2011-01-17 13:33:19 +00:00
Kalle Raiskila	457fa0b3bc	Add a "nop filler" pass to SPU. Filling no-ops is done just before emitting of assembly, when the instruction stream is final. No-ops are inserted to align the instructions so the dual-issue of the pipeline is utilized. This speeds up generated code with a minimum of 1% on a select set of algorithms. This pass may be redundant if the instruction scheduler and all subsequent passes that modify the instruction stream (prolog+epilog inserter, register scavenger, are there others?) are made aware of the instruction alignments. llvm-svn: 123226	2011-01-11 09:07:54 +00:00
Kalle Raiskila	71dec6ff42	Handle lshr for i128 correctly on SPU also when shiftamount > 7. llvm-svn: 120288	2010-11-29 14:44:28 +00:00
Kalle Raiskila	64f85ff7b3	Allow machine LICM to do its job on SPU. -return a sensible value for register pressure -add pattern to 'ila' instruction llvm-svn: 120285	2010-11-29 10:08:09 +00:00
Kalle Raiskila	b017eaea7b	Allow for 'fcmp ogt' in SPU. Fix by Visa Putkinen! llvm-svn: 120090	2010-11-24 11:42:17 +00:00
Kalle Raiskila	f89d0d0389	Fix memory access lowering on SPU, adding support for the case where alignment<value size. These cases were silently miscompiled before this patch. Now they are overly verbose -especially storing is- and any front-end should still avoid misaligned memory accesses as much as possible. The bit juggling algorithm added here probably has some room for improvement still. llvm-svn: 118889	2010-11-12 10:14:03 +00:00
Kalle Raiskila	c6bdc97934	Zap some redundant 'ori $?, $?, 0' from SPU. Also remove some code that died in the process. One now non-existent ori is checked for. llvm-svn: 115306	2010-10-01 09:20:01 +00:00
Kalle Raiskila	68e2c15954	Change SPU register re-interpretations from OR to COPY_TO_REGCLASS instruction. This cleans up after the mess r108567 left in the CellSPU backend. ORCvt instructions were used to reinterpret registers, and the ORs were then removed by isMoveInstr(). This patch now removes 350 instructions of format: or $3, $3, $3 (from the 52 testcases in CodeGen/CellSPU). One case of a nonexistent or is checked for. Some moves of the form 'ori $., $., 0' and 'ai $., $., 0' still remain. llvm-svn: 114074	2010-09-16 12:29:33 +00:00
Kalle Raiskila	8b6f5df4ae	Remove all traces of v2[i,f]32 on SPU. The "half vectors" are now widened to full size by the legalizer. The only exception is in parameter passing, where half vectors are expanded. This causes changes to some dejagnu tests. llvm-svn: 111360	2010-08-18 10:04:39 +00:00
Kalle Raiskila	e2c0e66ff1	Have SPU handle halfvec stores aligned by 8 bytes. llvm-svn: 110576	2010-08-09 16:33:00 +00:00
Kalle Raiskila	ce1e4d80cb	Make SPU backend handle insertelement and store for "half vectors" llvm-svn: 110198	2010-08-04 13:59:48 +00:00
Kalle Raiskila	014c93befb	More SPU v2f32 stuff added: insertelement and shuffle. llvm-svn: 110038	2010-08-02 11:22:10 +00:00
Kalle Raiskila	766fd434df	Add preliminary v2f32 support for SPU. Like with v2i32, we just duplicate the instructions and operate on half vectors. Also reorder code in SPUInstrInfo.td for better coherency. llvm-svn: 110037	2010-08-02 10:25:47 +00:00
Kalle Raiskila	21615cb06e	Add preliminary v2i32 support for SPU backend. As there are no such registers in SPU, this support boils down to "emulating" them by duplicating instructions on the general purpose registers. This adds the most basic operations on v2i32: passing parameters, addition, subtraction, multiplication and a few others. llvm-svn: 110035	2010-08-02 08:54:39 +00:00
Kalle Raiskila	61289abcda	Fix encoding of 'sf' and 'sfh' instructions. llvm-svn: 103399	2010-05-10 08:13:49 +00:00
Chris Lattner	0530bbf7ea	fix a typo, bitconvert from node to itself isn't valid. llvm-svn: 99755	2010-03-28 08:36:45 +00:00
Chris Lattner	ac16bb9827	stop using vnot_conv llvm-svn: 99750	2010-03-28 07:48:17 +00:00
Chris Lattner	fe2e2b9e57	remove some damaged sign extend patterns that can never match. llvm-svn: 98932	2010-03-19 04:53:47 +00:00
Chris Lattner	28d2398af5	do some serious surgery on CellSPU to get it back into a world where it uses types consistently. llvm-svn: 98532	2010-03-15 05:53:47 +00:00
Chris Lattner	014fa780b4	disambiguate some types, add a fixme about some inconsistent intrinsics. llvm-svn: 97959	2010-03-08 18:59:49 +00:00
Dan Gohman	b5ec39e2dc	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Dan Gohman	b937f9d590	Don't mark conditional branch instructions as control barriers. llvm-svn: 86732	2009-11-10 22:16:57 +00:00
Dan Gohman	5d566d918b	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Scott Michel	a023598ad3	CellSPU: Revert inadvertent mis-fix of fneg. llvm-svn: 67084	2009-03-17 16:45:16 +00:00
Scott Michel	2c4ac99ef8	CellSPU: - Fix fabs, fneg for f32 and f64. - Use BuildVectorSDNode.isConstantSplat, now that the functionality exists - Continue to improve i64 constant lowering. Lower certain special constants to the constant pool when they correspond to SPU's shufb instruction's special mask values. This avoids the overhead of performing a shuffle on a zero-filled vector just to get the special constant when the memory load suffices. llvm-svn: 67067	2009-03-17 01:15:45 +00:00
Scott Michel	2e2bccf754	CellSPU: Incorporate Tilmann's 128-bit operation patch. Evidently, it gets the llvm-gcc bootstrap a bit further along. llvm-svn: 67048	2009-03-16 18:47:25 +00:00
Scott Michel	e00d746487	CellSPU: - Update DWARF debugging support. llvm-svn: 63059	2009-01-26 22:33:37 +00:00
Scott Michel	af51520775	Untabify code. llvm-svn: 62991	2009-01-26 03:37:41 +00:00
Scott Michel	da9360e77e	CellSPU: - Rename fcmp.ll test to fcmp32.ll, start adding new double tests to fcmp64.ll - Fix select_bits.ll test - Capitulate to the DAGCombiner and move i64 constant loads to instruction selection (SPUISelDAGtoDAG.cpp). <rant>DAGCombiner will insert all kinds of 64-bit optimizations after operation legalization occurs and now we have to do most of the work that instruction selection should be doing twice (once to determine if v2i64 build_vector can be handled by SelectCode(), which then runs all of the predicates a second time to select the necessary instructions.) But, CellSPU is a good citizen.</rant> llvm-svn: 62990	2009-01-26 03:31:40 +00:00
Scott Michel	c80e71ac35	CellSPU: - Ensure that (operation) legalization emits proper FDIV libcall when needed. - Fix various bugs encountered during llvm-spu-gcc build, along with various cleanups. - Start supporting double precision comparisons for remaining libgcc2 build. Discovered interesting DAGCombiner feature, which is currently solved via custom lowering (64-bit constants are not legal on CellSPU, but DAGCombiner insists on inserting one anyway.) - Update README. llvm-svn: 62664	2009-01-21 04:58:48 +00:00
Scott Michel	b4699590f0	- Convert remaining i64 custom lowering into custom instruction emission sequences in SPUDAGToDAGISel.cpp and SPU64InstrInfo.td, killing custom DAG node types as needed. - i64 mul is now a legal instruction, but emits an instruction sequence that stretches tblgen and the imagination, as well as violating laws of several small countries and most southern US states (just kidding, but looking at a function with 80+ parameters is really weird and just plain wrong.) - Update tests as needed. llvm-svn: 62254	2009-01-15 04:41:47 +00:00
Scott Michel	54f7f6d67f	CellSPU: - Add preliminary support for v2i32; load/store generates the right code but there's a lot of work to be done to make this vector type operational. llvm-svn: 61829	2009-01-06 23:10:38 +00:00
Scott Michel	c30557841b	CellSPU: - Fix bugs 3194, 3195: i128 load/stores produce correct code (although, we need to ensure that i128 is 16-byte aligned in real life), and 128 zero-extends are supported. - New td file: SPU128InstrInfo.td: this is where all new i128 support should be put in the future. - Continue to hammer on i64 operations and test cases; ensure that the only remaining problem will be i64 mul. llvm-svn: 61784	2009-01-06 03:36:14 +00:00
Scott Michel	0d9d939406	CellSPU: - Fix (brcond (setq ...)) bug, where BRNZ should have been used vice BRZ. - Kill unused/unnecessary nodes in SPUNodes.td - Beef out the i64operations.c test harness to use a lot of unaligned loads, test loops and LLVM loop/basic block optimizations; run the test harness successfully on real Cell hardware. llvm-svn: 61664	2009-01-05 01:34:35 +00:00
Scott Michel	cdcae67887	- Start moving target-dependent nodes that could be represented by an instruction sequence and cannot ordinarily be simplified by DAGcombine into the various target description files or SPUDAGToDAGISel.cpp. This makes some 64-bit operations legal. - Eliminate target-dependent ISD enums. - Update tests. llvm-svn: 61508	2008-12-30 23:28:25 +00:00
Scott Michel	e555efe94d	- Various '#if 0' cleanups. - Move v4i32, i32 mul into SPUInstrInfo.td, with a few more instruction cleanups there as well. - Make SMUL_LOHI, UMUL_LOHI completely illegal for Cell SPU, to better assist Chris to see the problem in bug 3101. llvm-svn: 61464	2008-12-29 03:23:36 +00:00
Scott Michel	bf224860c8	- Remove Tilmann's custom truncate lowering: it completely hosed over DAGcombine's ability to find reasons to remove truncates when they were not needed. Consequently, the CellSPU backend would produce correct, but _really slow and horrible_, code. Replaced with instruction sequences that do the equivalent truncation in SPUInstrInfo.td. - Re-examine how unaligned loads and stores work. Generated unaligned load code has been tested on the CellSPU hardware; see the i32operations.c and i64operations.c in CodeGen/CellSPU/useful-harnesses. (While they may be toy test code, it does prove that some real world code does compile correctly.) - Fix truncating stores in bug 3193 (note: unpack_df.ll will still make llc fault because i64 ult is not yet implemented.) - Added i64 eq and neq for setcc and select/setcc; started new instruction information file for them in SPU64InstrInfo.td. Additional i64 operations should be added to this file and not to SPUInstrInfo.td. llvm-svn: 61447	2008-12-27 04:51:36 +00:00
Scott Michel	0b5c67e1e0	CellSPU: - Fix bug 3185, with misc other cleanups. - Needed to implement SPUInstrInfo::InsertBranch(). CAUTION: Not sure what gets or needs to get passed to InsertBranch() to insert a conditional branch. This will abort for now until a good test case shows up. llvm-svn: 60811	2008-12-10 00:15:19 +00:00
Scott Michel	6e9747d2d6	CellSPU: Fix bug 3055 - Add v4f32, v2f64 to LowerVECTOR_SHUFFLE - Look for vector rotate in shuffle elements, generate a vector rotate instead of a full-blown shuffle when opportunity presents itself. - Generate larger test harness and fix a few interesting but obscure bugs. llvm-svn: 60552	2008-12-04 21:01:44 +00:00
Scott Michel	1f907dd784	CellSPU: - First patch from Nehal Desai, a new contributor at Aerospace. Nehal's patch fixes sign/zero/any-extending loads for integers and floating point. Example code, compiled w/o debugging or optimization where he first noticed the bug: int main(void) { float a = 99.0; printf("%d\n", a); return 0; } Verified that this code actually works on a Cell SPU. Changes by Scott Michel: - Fix bug in the value type list constructed by SPUISD::LDRESULT to include both the load result's result and chain, not just the chain alone. - Simplify LowerLOAD and remove extraneous and unnecessary chains. - Remove unused SPUISD pseudo instructions. llvm-svn: 60526	2008-12-04 03:02:42 +00:00
Dan Gohman	5dad0993a9	Rename isSimpleLoad to canFoldAsLoad, to better reflect its meaning. llvm-svn: 60487	2008-12-03 18:15:48 +00:00
Scott Michel	e0bbe7afb7	CellSPU: - Incorporate Tilmann Scheller's ISD::TRUNCATE custom lowering patch - Update SPU calling convention info, even if it's not used yet (but can be at some point or another) - Ensure that any-extended f32 loads are custom lowered, especially when they're promoted for use in printf. llvm-svn: 60438	2008-12-02 19:53:53 +00:00
Scott Michel	cf677b5a67	CellSPU: - Fix v2[if]64 vector insertion code before IBM files a bug report. - Ensure that zero (0) offsets relative to $sp don't trip an assert (add $sp, 0 gets legalized to $sp alone, tripping an assert) - Shuffle masks passed to SPUISD::SHUFB are now v16i8 or v4i32 llvm-svn: 60358	2008-12-01 17:56:02 +00:00
Scott Michel	a37c52f255	CellSPU: Fix mnemonic typo in pattern; "shlqbyi" -> "shlqby". llvm-svn: 59998	2008-11-25 00:23:16 +00:00
Scott Michel	c3965308a4	CellSPU: (a) Improve the extract element code: there's no need to do gymnastics with rotates into the preferred slot if a shuffle will do the same thing. (b) Rename a couple of SPUISD pseudo-instructions for readability and better semantic correspondence. (c) Fix i64 sign/any/zero extension lowering. llvm-svn: 59965	2008-11-24 17:11:17 +00:00

75 Commits