llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Chris Lattner	c745aa59b3	add more and significantly better comments to the rest of the machineinstr flags that can be set. Add predicates for the ones lacking it, and switch some clients over to using the predicates instead of Flags directly. llvm-svn: 45690	2008-01-07 06:21:53 +00:00
Chris Lattner	9b987de2c5	rename hasVariableOperands() -> isVariadic(). Add some comments. Evan, please review the comments I added to getNumDefs to make sure that they are accurate, thx. llvm-svn: 45687	2008-01-07 05:19:29 +00:00
Chris Lattner	9d38dfa4a5	Move a bunch more accessors from TargetInstrInfo to TargetInstrDescriptor llvm-svn: 45680	2008-01-07 03:13:06 +00:00
Chris Lattner	96d0a93f8e	remove some uses of MachineOpCode, move getSchedClass into TargetInstrDescriptor from TargetInstrInfo. llvm-svn: 45678	2008-01-07 02:46:03 +00:00
Chris Lattner	93e1e6ee12	Add predicates methods to TargetOperandInfo, and switch all clients over to using them, instead of diddling Flags directly. Change the various flags from const variables to enums. llvm-svn: 45677	2008-01-07 02:39:19 +00:00
Chris Lattner	f7f96d818f	Rename MachineInstr::getInstrDescriptor -> getDesc(), which reflects that it is cheap and efficient to get. Move a variety of predicates from TargetInstrInfo into TargetInstrDescriptor, which makes it much easier to query a predicate when you don't have TII around. Now you can use MI->getDesc()->isBranch() instead of going through TII, and this is much more efficient anyway. Not all of the predicates have been moved over yet. Update old code that used MI->getInstrDescriptor()->Flags to use the new predicates in many places. llvm-svn: 45674	2008-01-07 01:56:04 +00:00
Owen Anderson	2e866e9cdf	Update CodeGen for MRegisterInfo --> TargetInstrInfo changes. llvm-svn: 45673	2008-01-07 01:35:56 +00:00
Gordon Henriksen	88a41c672b	Enabling the target-independent garbage collection infrastructure by hooking it up to the various compiler pipelines. This doesn't actually add support for any GC algorithms, which means it temporarily breaks a few tests. To be fixed shortly. llvm-svn: 45669	2008-01-07 01:30:38 +00:00
Nate Begeman	a6e9de35ae	If custom lowering of insert element fails, the result Val will be 0. Don't overwrite a variable used by the fallthrough code path in this case. llvm-svn: 45630	2008-01-05 20:47:37 +00:00
Duncan Sands	8a4882564a	Fix PR1833 - eh.exception and eh.selector return two values, which means doing extra legalization work. It would be easier to get this kind of thing right if there was some documentation... llvm-svn: 45472	2007-12-31 18:35:50 +00:00
Owen Anderson	ae7e2c1e03	Move copyRegToReg from MRegisterInfo to TargetInstrInfo. This is part of the Machine-level API cleanup instigated by Chris. llvm-svn: 45470	2007-12-31 06:32:00 +00:00
Chris Lattner	de54e62962	update a couple of references to SSARegMap. llvm-svn: 45468	2007-12-31 04:16:08 +00:00
Chris Lattner	96167aa93c	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	f3f074dfdd	fix typo duncan noticed! llvm-svn: 45459	2007-12-30 21:21:10 +00:00
Chris Lattner	c2f0543beb	use simplified operand addition methods. llvm-svn: 45436	2007-12-30 00:57:42 +00:00
Chris Lattner	1b0ad80f53	use simplified operand addition methods. llvm-svn: 45435	2007-12-30 00:51:11 +00:00
Chris Lattner	19dd6c4eac	Start using the simplified methods for adding operands. llvm-svn: 45432	2007-12-30 00:41:17 +00:00
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Chris Lattner	8193d4af33	remove attribution from lib Makefiles. llvm-svn: 45415	2007-12-29 20:09:26 +00:00
Chris Lattner	cd147e5596	Fold comparisons against a constant nan, and optimize ORD/UNORD comparisons with a constant. This allows us to compile isnan to: _foo: fcmpu cr7, f1, f1 mfcr r2 rlwinm r3, r2, 0, 31, 31 blr instead of: LCPI1_0: ; float .space 4 _foo: lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr7, f1, f0 mfcr r2 rlwinm r3, r2, 0, 31, 31 blr llvm-svn: 45405	2007-12-29 08:37:08 +00:00
Chris Lattner	38687aa2b2	make sure not to zap volatile stores, thanks a lot to Dale for noticing this! llvm-svn: 45402	2007-12-29 07:15:45 +00:00
Chris Lattner	78ae7ff876	don't fold fp_round(fp_extend(load)) -> fp_round(extload) llvm-svn: 45400	2007-12-29 06:55:23 +00:00
Chris Lattner	7cb2de8e48	Delete a store whose input is a load from the same pointer: x = load p store x -> p llvm-svn: 45398	2007-12-29 06:26:16 +00:00
Chris Lattner	084002dcb0	initial code for forming an FGETSIGN node. This is disabled until legalizer support goes in. llvm-svn: 45323	2007-12-22 21:35:38 +00:00
Chris Lattner	24b2a6de5c	improve support for fgetsign llvm-svn: 45322	2007-12-22 21:26:52 +00:00
Chris Lattner	a8f6fac7a3	Tell TargetLoweringOpt whether it is running before or after legalize. llvm-svn: 45321	2007-12-22 20:56:36 +00:00
Chris Lattner	f2aa80e0de	Add a new FGETSIGN operation, which defaults to expand on all targets. llvm-svn: 45320	2007-12-22 20:47:56 +00:00
Evan Cheng	8fee10d116	More accurate checks for two-address constraints. llvm-svn: 45259	2007-12-20 09:25:31 +00:00
Evan Cheng	1994e3038e	Bring back a burr scheduling heuristic that's still needed. llvm-svn: 45252	2007-12-20 02:22:36 +00:00
Duncan Sands	b2e0a67cc0	Simplify LowerCallTo by using a callsite. llvm-svn: 45198	2007-12-19 09:48:52 +00:00
Duncan Sands	09250d2dff	The C++ exception handling personality function wants to know about calls that cannot throw ('nounwind'): if such a call does throw for some reason then the personality will terminate the program. The distinction between an ordinary call and a nounwind call is that an ordinary call gets an entry in the exception table but a nounwind call does not. This patch sets up the exception table appropriately. One oddity is that I've chosen to bracket nounwind calls with labels (like invokes) - the other choice would have been to bracket ordinary calls with labels. While bracketing ordinary calls is more natural (because bracketing by labels would then correspond exactly to getting an entry in the exception table), I didn't do it because introducing labels impedes some optimizations and I'm guessing that ordinary calls occur more often than nounwind calls. This fixes the gcc filter2 eh test, at least at -O0 (the inliner needs some tweaking at higher optimization levels). llvm-svn: 45197	2007-12-19 07:36:31 +00:00
Evan Cheng	694994ba7b	Don't leave newly created nodes around if it turns out they are not needed. llvm-svn: 45186	2007-12-19 01:34:38 +00:00
Evan Cheng	8824950e8f	Fix PR1872: SrcValue and SrcValueOffset should not be used to compute load / store node id. llvm-svn: 45167	2007-12-18 19:38:14 +00:00
Evan Cheng	e8de683a74	Also print alignment and volatileness. llvm-svn: 45164	2007-12-18 19:06:30 +00:00
Evan Cheng	36bfae49e3	FIX for PR1799: When a load is unfolded from an instruction, check if it is a new node. If not, do not create a new SUnit. llvm-svn: 45157	2007-12-18 08:42:10 +00:00
Evan Cheng	e6c89b6120	SelectionDAG::dump() should print SrcValue of LoadSDNode and StoreSDNode. llvm-svn: 45151	2007-12-18 07:02:08 +00:00
Duncan Sands	3a0d757bd5	Make invokes of inline asm legal. Teach codegen how to lower them (with no attempt made to be efficient, since they should only occur for unoptimized code). llvm-svn: 45108	2007-12-17 18:08:19 +00:00
Evan Cheng	713b398a35	Bug fix. Must also match ResNo when matching an operand with a user. llvm-svn: 45028	2007-12-14 08:25:15 +00:00
Dan Gohman	0075ea1f5f	Allow vector integer constants to be created with SelectionDAG::getConstant, in the same way as vector floating-point constants. This allows the legalize expansion code for @llvm.ctpop and friends to be usable with vector types. llvm-svn: 44954	2007-12-12 22:21:26 +00:00
Evan Cheng	b5ceb5ae06	Pretty print shuffle mask operand. llvm-svn: 44837	2007-12-11 02:08:35 +00:00
Chris Lattner	ac2bfb78d9	Duncan points out that the subtraction is unneeded since hte code knows the vector is not pow2 llvm-svn: 44740	2007-12-09 17:56:34 +00:00
Chris Lattner	84241dd771	Add support for splitting the operand of a return instruction. llvm-svn: 44728	2007-12-09 00:06:19 +00:00
Chris Lattner	164d423f91	add many new cases to SplitResult. SplitResult now handles all the cases that LegalizeDAG does. llvm-svn: 44726	2007-12-08 23:58:27 +00:00
Chris Lattner	6db5e91e16	Implement splitting support for store, allowing us to compile: %f8 = type <8 x float> define void @test_f8(%f8* %P, %f8* %Q, %f8* %S) { %p = load %f8* %P ; <%f8> [#uses=1] %q = load %f8* %Q ; <%f8> [#uses=1] %R = add %f8 %p, %q ; <%f8> [#uses=1] store %f8 %R, %f8* %S ret void } into: _test_f8: movaps 16(%rdi), %xmm0 addps 16(%rsi), %xmm0 movaps (%rdi), %xmm1 addps (%rsi), %xmm1 movaps %xmm0, 16(%rdx) movaps %xmm1, (%rdx) ret llvm-svn: 44725	2007-12-08 23:24:26 +00:00
Chris Lattner	b073361c3e	implement vector splitting of load, undef, and binops. llvm-svn: 44724	2007-12-08 23:08:49 +00:00
Chris Lattner	8942654140	implement some methods. llvm-svn: 44723	2007-12-08 22:40:18 +00:00
Chris Lattner	cd7fc69c7c	add scaffolding for splitting of vectors. llvm-svn: 44722	2007-12-08 22:37:41 +00:00
Chris Lattner	558d101dbd	reorganize header to separate into functional blocks. llvm-svn: 44719	2007-12-08 21:59:32 +00:00
Chris Lattner	5e6bf4bc6f	split scalarization out to its own file. llvm-svn: 44718	2007-12-08 20:30:28 +00:00
Chris Lattner	b5925fda97	Split expansion out into its own file. llvm-svn: 44717	2007-12-08 20:27:32 +00:00
Chris Lattner	633c76f607	Split promotion support out to its own file. llvm-svn: 44716	2007-12-08 20:24:38 +00:00
Chris Lattner	39571d55bc	Rename LegalizeDAGTypes.cpp -> LegalizeTypes.cpp llvm-svn: 44715	2007-12-08 20:17:13 +00:00
Chris Lattner	424de81c2e	Split the class definition of DAGTypeLegalizer out into a header. Leave it visibility hidden, but not in an anon namespace. llvm-svn: 44714	2007-12-08 20:16:06 +00:00
Dale Johannesen	80dd0c5141	Redo previous patch so optimization only done for i1. Simpler and safer. llvm-svn: 44663	2007-12-06 17:53:31 +00:00
Chris Lattner	64a1a9f502	third time around: instead of disabling this completely, only disable it if we don't know it will be obviously profitable. Still fixme, but less so. :) llvm-svn: 44658	2007-12-06 07:47:55 +00:00
Chris Lattner	bb5fb18af8	Actually, disable this code for now. More analysis and improvements to the X86 backend are needed before this should be enabled by default. llvm-svn: 44657	2007-12-06 07:44:31 +00:00
Chris Lattner	c467b49c96	implement a readme entry, compiling the code into: _foo: movl $12, %eax andl 4(%esp), %eax movl _array(%eax), %eax ret instead of: _foo: movl 4(%esp), %eax shrl $2, %eax andl $3, %eax movl _array(,%eax,4), %eax ret As it turns out, this triggers all the time, in a wide variety of situations, for example, I see diffs like this in various programs: - movl 8(%eax), %eax - shll $2, %eax - andl $1020, %eax - movl (%esi,%eax), %eax + movzbl 8(%eax), %eax + movl (%esi,%eax,4), %eax - shll $2, %edx - andl $1020, %edx - movl (%edi,%edx), %edx + andl $255, %edx + movl (%edi,%edx,4), %edx Unfortunately, I also see stuff like this, which can be fixed in the X86 backend: - andl $85, %ebx - addl _bit_count(,%ebx,4), %ebp + shll $2, %ebx + andl $340, %ebx + addl _bit_count(%ebx), %ebp llvm-svn: 44656	2007-12-06 07:33:36 +00:00
Chris Lattner	91251d1337	implement the rest of the functionality from SelectionDAGLegalize::ScalarizeVectorOp llvm-svn: 44654	2007-12-06 05:53:43 +00:00
Dale Johannesen	8bc5d4be6a	Fix PR1842. llvm-svn: 44649	2007-12-06 01:43:46 +00:00
Chris Lattner	a43f6b4a08	more scalarization llvm-svn: 44608	2007-12-05 07:45:02 +00:00
Chris Lattner	e1b35fb035	scalarize vector binops llvm-svn: 44607	2007-12-05 07:36:58 +00:00
Chris Lattner	914befbc8a	Implement framework for scalarizing node results. This is sufficient to codegen this: define float @test_extract_elt(<1 x float> * %P) { %p = load <1 x float>* %P %R = extractelement <1 x float> %p, i32 0 ret float %R } llvm-svn: 44570	2007-12-04 07:48:46 +00:00
Chris Lattner	eead86bbc5	start providing framework for scalarizing vectors. llvm-svn: 44569	2007-12-04 07:29:51 +00:00
Duncan Sands	1e2e4972ff	Rather than having special rules like "intrinsics cannot throw exceptions", just mark intrinsics with the nounwind attribute. Likewise, mark intrinsics as readnone/readonly and get rid of special aliasing logic (which didn't use anything more than this anyway). llvm-svn: 44544	2007-12-03 20:06:50 +00:00
Duncan Sands	1b0feb42e2	Add some convenience methods for querying attributes, and use them. llvm-svn: 44403	2007-11-28 17:07:01 +00:00
Nate Begeman	4278967588	Support returning non-power-of-2 vectors to unblock some work llvm-svn: 44371	2007-11-27 19:28:48 +00:00
Duncan Sands	3602011bec	Fix PR1146: parameter attributes are longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	ab5cd0b1c5	err, no really. llvm-svn: 44352	2007-11-27 06:14:32 +00:00
Chris Lattner	a2be558b75	don't depend on ADL. llvm-svn: 44351	2007-11-27 06:14:12 +00:00
Dan Gohman	a9f8208852	Don't lower srem/urem X%C to X-X/C*C unless the division is actually optimized. This avoids creating illegal divisions when the combiner is running after legalize; this fixes PR1815. Also, it produces better code in the included testcase by avoiding the subtract and multiply when the division isn't optimized. llvm-svn: 44341	2007-11-26 23:46:11 +00:00
Chris Lattner	424aad9995	Implement expand support for MERGE_VALUEs that only produces one result. llvm-svn: 44304	2007-11-24 19:12:15 +00:00
Chris Lattner	c06f8366b4	Implement support for custom legalization in DAGTypeLegalizer::ExpandOperand. Improve a comment. Unbreak Duncan's carefully written path compression where I didn't realize what was happening! llvm-svn: 44301	2007-11-24 18:11:42 +00:00
Chris Lattner	28262fbaf2	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Duncan Sands	7a8a7099b1	Fix a bug in which node A is replaced by node B, but later node A gets back into the DAG again because it was hiding in one of the node maps: make sure that node replacement happens in those maps too. llvm-svn: 44263	2007-11-21 16:43:19 +00:00
Chris Lattner	21c243d051	ExpandUnalignedLoad doesn't handle vectors right at all apparently. Fix a couple of problems: 1. Don't assume the VT-1 is a VT that is half the size. 2. Treat vectors of FP in the vector path, not the FP path. This has a couple of remaining problems before it will work with the code in PR1811: the code below this change assumes that it can use extload/shift/or to construct the result, which isn't right for vectors. This also doesn't handle vectors of 1 or vectors that aren't pow-2. llvm-svn: 44243	2007-11-19 21:38:03 +00:00
Chris Lattner	c0f2f19744	Implement vector expand support for shuffle_vector. This fixes PR1811. llvm-svn: 44242	2007-11-19 21:16:54 +00:00
Chris Lattner	30a6eee34c	Implement splitting of UNDEF nodes. This is the first step towards fixing PR1811 llvm-svn: 44239	2007-11-19 20:21:32 +00:00
Dan Gohman	0f62120b01	Add support in SplitVectorOp for remainder operators. llvm-svn: 44233	2007-11-19 15:15:03 +00:00
Nate Begeman	2a8ef3f29a	Add support for vectors to int <-> float casts. llvm-svn: 44204	2007-11-17 03:58:34 +00:00
Anton Korobeynikov	b6c3255d80	Implement necessary bits for flt_rounds gcc builtin. Codegen bits and llvm-gcc support will follow. llvm-svn: 44182	2007-11-15 23:25:33 +00:00
Nate Begeman	a42e2c2272	Basic non-power-of-2 vector support llvm-svn: 44181	2007-11-15 21:15:26 +00:00
Duncan Sands	895e6284a9	This assertion was bogus. llvm-svn: 44167	2007-11-15 09:54:37 +00:00
Bill Wendling	cc75435ebf	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Duncan Sands	edf7e3b5f4	Move MinAlign to MathExtras.h. llvm-svn: 43944	2007-11-09 13:41:39 +00:00
Duncan Sands	7df7c7aed1	Fix some load/store logic that would be wrong for apints on big-endian machines if the bitwidth is not a multiple of 8. Introduce a new helper, MVT::getStoreSizeInBits, and use it. llvm-svn: 43934	2007-11-09 08:57:19 +00:00
Evan Cheng	7d8deec92f	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Evan Cheng	781ed42681	Didn't mean to check these in. llvm-svn: 43923	2007-11-09 01:28:33 +00:00
Evan Cheng	7e1d8b99ab	Bug fix. Passive nodes are not in SUnitMap. llvm-svn: 43922	2007-11-09 01:27:11 +00:00
Evan Cheng	d9bab93a44	If both parts of smul_lohi, etc. are used, don't simplify. If only one part is used, try simplify it. llvm-svn: 43888	2007-11-08 09:25:29 +00:00
Dan Gohman	ff12f4602f	Remainder operations must be either integer or floating-point. llvm-svn: 43781	2007-11-06 22:11:54 +00:00
Evan Cheng	3764ad2bac	Add pseudo dependency to force two-address instruction to be scheduled after other uses. There was a overly restricted check that prevented some obvious cases. llvm-svn: 43762	2007-11-06 08:44:59 +00:00
Dan Gohman	6255ce9f5d	Add support for vector remainder operations. llvm-svn: 43744	2007-11-05 23:35:22 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Dale Johannesen	1f70f86c7a	Make labels work in asm blocks; allow labels as parameters. Rename ValueRefList to ParamList in AsmParser, since its only use is for parameters. llvm-svn: 43734	2007-11-05 21:20:28 +00:00
Dan Gohman	19d88d511b	Add std:: to sort calls. llvm-svn: 43652	2007-11-02 22:24:01 +00:00
Dan Gohman	26c8800fbd	Change illegal uses of ++ to uses of STLExtra.h's next function. llvm-svn: 43651	2007-11-02 22:22:02 +00:00
Duncan Sands	281da5e25f	Fix a thinko. llvm-svn: 43639	2007-11-02 15:18:06 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Duncan Sands	b86535ad9a	Promotion of sdiv/srem/udiv/urem. llvm-svn: 43551	2007-10-31 08:57:43 +00:00
Dale Johannesen	9bc04ae496	Make i64=expand_vector_elt(v2i64) work in 32-bit mode. llvm-svn: 43535	2007-10-31 00:32:36 +00:00

1 2 3 4 5 ...

2083 Commits