llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 07:22:55 +01:00

Author	SHA1	Message	Date
Chris Lattner	8f2d079b36	Fix lowering of ctlz, so now UnitTests/2005-05-11-Popcount-ffs-fls passes with the CBE llvm-svn: 21875	2005-05-11 20:24:12 +00:00
Chris Lattner	303ac68c80	Fix lowering of cttz to work with signed values llvm-svn: 21874	2005-05-11 20:02:14 +00:00
Chris Lattner	330f44f3b6	fix and concisify intinsic lowering for ctpop. Unfortunately, this code looks completely untested. :( llvm-svn: 21873	2005-05-11 19:42:05 +00:00
Chris Lattner	eeeaf45bba	Fix the last remaining bug preventing us from switching the X86 BE over from the simple isel to the pattern isel. This forces inserted libcalls to serialize against other function calls, which was breaking UnitTests/2005-05-12-Int64ToFP. Hopefully this will fix issues on other targets as well. llvm-svn: 21872	2005-05-11 19:02:11 +00:00
Chris Lattner	296754995e	Do not memoize ADJCALLSTACKDOWN nodes, provide a method to hack on them. llvm-svn: 21871	2005-05-11 18:57:39 +00:00
Chris Lattner	74763db128	wrap long line llvm-svn: 21870	2005-05-11 18:57:06 +00:00
Chris Lattner	d76582b540	Make sure to legalize generated ctpop nodes, convert tabs to spaces llvm-svn: 21868	2005-05-11 18:35:21 +00:00
Duraid Madina	8ad9786fcd	expand count-leading/trailing-zeros; the test 2005-05-11-Popcount-ffs-fls.c should now pass (the "LLVM" and "REF" results should be identical) llvm-svn: 21866	2005-05-11 08:45:08 +00:00
Chris Lattner	b452b5aa42	Add some notes for expanding clz/ctz llvm-svn: 21862	2005-05-11 05:27:09 +00:00
Chris Lattner	4f05136f61	Simplify this code, use the proper shift amount llvm-svn: 21861	2005-05-11 05:21:31 +00:00
Duraid Madina	b9062e56cf	add the popcount instruction and support this in the isel the primary user of this will probably end up being find-first-set-bit/find- last-set-bit, which i'll get around to... llvm-svn: 21860	2005-05-11 05:16:09 +00:00
Chris Lattner	3edc8ecb53	Legalize this correctly llvm-svn: 21859	2005-05-11 05:09:47 +00:00
Chris Lattner	d5d2886ee7	No really IA64 :) llvm-svn: 21858	2005-05-11 05:03:56 +00:00
Chris Lattner	e358ac532b	X86 has more than just 32-bit registers llvm-svn: 21857	2005-05-11 05:00:34 +00:00
Chris Lattner	457996c4a6	implement expansion of ctpop nodes, implementing CodeGen/Generic/llvm-ct-intrinsics.ll llvm-svn: 21856	2005-05-11 04:51:16 +00:00
Chris Lattner	ce84b90a3d	Print bit count nodes correctly llvm-svn: 21855	2005-05-11 04:50:30 +00:00
Chris Lattner	593c0e8957	Do not use "" as a sentinal for a missing argument! This fixes PR560. llvm-svn: 21850	2005-05-10 23:20:17 +00:00
Misha Brukman	4fda633c59	Why output multiple strings, let the compiler concatenate them for us for free llvm-svn: 21845	2005-05-10 22:03:50 +00:00
Misha Brukman	d264be8e32	* Convert tabs to spaces, fix code alignment * Remove trailing whitespace * Wrap long lines llvm-svn: 21844	2005-05-10 22:02:28 +00:00
Chris Lattner	8230bddde2	Convert feature of the simple isel over for the pattern isel to use. llvm-svn: 21840	2005-05-10 03:53:18 +00:00
Chris Lattner	758f2fe1a3	Fix Reassociate/shifttest.ll llvm-svn: 21839	2005-05-10 03:39:25 +00:00
Jeff Cohen	afc58006b7	Silence some VC++ warnings llvm-svn: 21838	2005-05-10 02:22:38 +00:00
Chris Lattner	f221558c21	If a function contains no allocas, all of the calls in it are trivially suitable for tail calls. llvm-svn: 21836	2005-05-09 23:51:13 +00:00
Chris Lattner	5edb4c4af6	The semantics of cast X to bool are a comparison against zero, not a truncation! llvm-svn: 21833	2005-05-09 22:17:13 +00:00
Chris Lattner	d96aea21d7	Implement READPORT/WRITEPORT, implementing the last X86 regression tests that were failing with the pattern selector. Note that the support that existed in the simple selector was clearly broken in several ways though (which has also been fixed). llvm-svn: 21831	2005-05-09 21:17:38 +00:00
Chris Lattner	6a55b1d4dd	do not emit illegal instructions llvm-svn: 21830	2005-05-09 21:06:04 +00:00
Chris Lattner	7ba0699b05	Fix the syntax of the i/o instructions, these are obviously unused. llvm-svn: 21829	2005-05-09 20:49:20 +00:00
Chris Lattner	46b51ab388	legalize readio/writeio into load/stores, fixing CodeGen/X86/io.llx with the pattern isel. llvm-svn: 21828	2005-05-09 20:37:29 +00:00
Chris Lattner	95c836384b	legalize readio/writeio into a load/store if requested llvm-svn: 21827	2005-05-09 20:36:57 +00:00
Chris Lattner	7cc8edfc30	legalize READPORT, WRITEPORT, READIO, WRITEIO, at least in the basic cases where they are directly supported by the architecture. Wrap a bunch of long lines :( llvm-svn: 21826	2005-05-09 20:23:03 +00:00
Chris Lattner	af6bde0db6	Add support for matching the READPORT, WRITEPORT, READIO, WRITEIO intrinsics llvm-svn: 21825	2005-05-09 20:22:36 +00:00
Chris Lattner	eee649df34	Add support for READPORT, WRITEPORT, READIO, WRITEIO llvm-svn: 21824	2005-05-09 20:22:17 +00:00
Chris Lattner	b28f865865	restore some non-dead code I removed last night breaking double casts to uint llvm-svn: 21821	2005-05-09 18:37:02 +00:00
Chris Lattner	333ae3d837	fold and (shl X, C1), C2 -> rlwinm when possible. Many other cases are possible, include and (srl) and the inverses (shl and) etc. llvm-svn: 21820	2005-05-09 17:39:48 +00:00
Chris Lattner	c3fa88e7c8	Fold shifts into subsequent SHL's. These shifts often arise due to addrses arithmetic lowering. llvm-svn: 21818	2005-05-09 17:06:45 +00:00
Duraid Madina	64a52fc615	fix and cleanup constmul code a bit, this fixes mediabench/toast and probably a couple of other tests. llvm-svn: 21814	2005-05-09 13:18:34 +00:00
Chris Lattner	3094cec3c9	Wrap long lines, remove dead code that is now handled by legalize llvm-svn: 21811	2005-05-09 05:40:26 +00:00
Chris Lattner	5d291fa443	Fix FP -> bool casts llvm-svn: 21810	2005-05-09 05:33:18 +00:00
Chris Lattner	d3bb28d97a	implement and.ll:test33 llvm-svn: 21809	2005-05-09 04:58:36 +00:00
Chris Lattner	a1e633ef7a	Don't use the load/store instruction as the source pointer, use the pointer being stored/loaded through! llvm-svn: 21806	2005-05-09 04:28:51 +00:00
Chris Lattner	bfbefe0837	memoize all nodes, even null Value* nodes. Do not add two token chain outputs llvm-svn: 21805	2005-05-09 04:14:13 +00:00
Chris Lattner	b85030373d	wrap long lines llvm-svn: 21804	2005-05-09 04:08:33 +00:00
Chris Lattner	6ffae1a3ec	Print SrcValue nodes correctly llvm-svn: 21803	2005-05-09 04:08:27 +00:00
Chris Lattner	65d61d9d44	Fix X86/2005-05-08-FPStackifierPHI.ll: ugly gross hack. llvm-svn: 21801	2005-05-09 03:36:39 +00:00
Chris Lattner	a2edd7e449	Preserve CC's when linking modules llvm-svn: 21799	2005-05-09 01:09:39 +00:00
Chris Lattner	2d9c054f4e	Preserve calling conventions when doing IPO llvm-svn: 21798	2005-05-09 01:05:50 +00:00
Chris Lattner	eff214d7de	wrap long lines, preserve calling conventions when cloning functions and turning calls into invokes llvm-svn: 21797	2005-05-09 01:04:34 +00:00
Chris Lattner	5a7f1642b7	By definition, 'tail' calls cannot access the stack frame of their caller. Expose this as a simple form of mod/ref information. This implements BasicAA/tailcall-modref.ll llvm-svn: 21796	2005-05-08 23:58:12 +00:00
Chris Lattner	fb4a99b117	Verify that varargs functions all have ccc llvm-svn: 21792	2005-05-08 22:27:09 +00:00
Chris Lattner	b57ab2e975	Convert non-address taken functions with C calling conventions to fastcc. llvm-svn: 21791	2005-05-08 22:18:06 +00:00
Chris Lattner	d5a353a675	Implement Reassociate/mul-neg-add.ll llvm-svn: 21788	2005-05-08 21:41:35 +00:00
Chris Lattner	f535f6e808	Bail out earlier llvm-svn: 21786	2005-05-08 21:33:47 +00:00
Chris Lattner	39f74def7f	Teach reassociate that 0-X === X*-1 llvm-svn: 21785	2005-05-08 21:28:52 +00:00
Chris Lattner	319ac8f822	Fix PR557 and basictest[34].ll. This makes reassociate realize that loads should be treated as unmovable, and gives distinct ranks to distinct values defined in the same basic block, allowing reassociate to do its thing. llvm-svn: 21783	2005-05-08 20:57:04 +00:00
Chris Lattner	b5de308c5f	Add debugging information llvm-svn: 21781	2005-05-08 20:09:57 +00:00
Chris Lattner	e74082156b	eliminate gotos llvm-svn: 21780	2005-05-08 19:48:43 +00:00
Chris Lattner	6d85b91b24	Wrap long lines. Fix "warning: conflicting types for built-in function 'memset'" warning from the CBE+GCC. llvm-svn: 21779	2005-05-08 19:46:29 +00:00
Chris Lattner	a9d5fdd4fd	Improve reassociation handling of inverses, implementing inverses.ll. llvm-svn: 21778	2005-05-08 18:59:37 +00:00
Chris Lattner	afbdc0b969	clean up and modernize this pass. llvm-svn: 21776	2005-05-08 18:45:26 +00:00
Chris Lattner	7b41539f32	Strength reduce SAR into SHR if there is no way sign bits could be shifted in. This tends to get cases like this: X = cast ubyte to int Y = shr int X, ... Tested by: shift.ll:test24 llvm-svn: 21775	2005-05-08 17:34:56 +00:00
Chris Lattner	c2670a0da6	Refactor some code llvm-svn: 21772	2005-05-08 00:19:31 +00:00
Chris Lattner	cd7caaa866	Handle some simple cases where we can see that values get annihilated. llvm-svn: 21771	2005-05-08 00:08:33 +00:00
Chris Lattner	1e84d885b7	Fix a miscompilation of crafty by clobbering the "A" variable. llvm-svn: 21770	2005-05-07 23:49:08 +00:00
Chris Lattner	5662127ed6	Rewrite the guts of the reassociate pass to be more efficient and logical. Instead of trying to do local reassociation tweaks at each level, only process an expression tree once (at its root). This does not improve the reassociation pass in any real way. llvm-svn: 21768	2005-05-07 21:59:39 +00:00
Reid Spencer	b4fdf14d34	* Add two strlen optimizations: strlen(x) != 0 -> x != 0 strlen(x) == 0 -> x == 0 * Change nested statistics to use style of other LLVM statistics so that only the name of the optimization (simplify-libcalls) is used as the statistic name, and the description indicates which specific all is optimized. Cuts down on some redundancy and saves a few bytes of space. * Make note of stpcpy optimization that could be done. llvm-svn: 21766	2005-05-07 20:15:59 +00:00
Reid Spencer	65d553cd03	Don't increment the counter unless the debug flag is set. llvm-svn: 21762	2005-05-07 04:59:45 +00:00
Chris Lattner	3edf09a5eb	Convert shifts to muls to assist reassociation. This implements Reassociate/shifttest.ll llvm-svn: 21761	2005-05-07 04:24:13 +00:00
Chris Lattner	b1ea71fbcd	Simplify the code and rearrange it. No major functionality changes here. llvm-svn: 21759	2005-05-07 04:08:02 +00:00
Jeff Cohen	eafa15885e	Silence VC++ warnings about unsafe mixing of ints and bools with the \| operator. llvm-svn: 21758	2005-05-07 02:44:04 +00:00
Chris Lattner	f6775e16bf	remove some dead (always dynamically false) flags llvm-svn: 21752	2005-05-06 22:35:09 +00:00
Chris Lattner	1f6d3b2344	encode calling conventions for call/invoke instructions. llvm-svn: 21751	2005-05-06 22:34:01 +00:00
Chris Lattner	494f3da7b3	encode function calling convs in the bytecode file. invoke and call are still to come. llvm-svn: 21749	2005-05-06 20:42:57 +00:00
Chris Lattner	562734e130	parse new calling conv specifiers llvm-svn: 21748	2005-05-06 20:27:19 +00:00
Chris Lattner	de5b492521	wrap a longline llvm-svn: 21747	2005-05-06 20:27:03 +00:00
Chris Lattner	26a44493ef	add support for explicit calling conventions llvm-svn: 21746	2005-05-06 20:26:43 +00:00
Chris Lattner	0995b3da02	use splice instead of remove/insert for a minor speedup llvm-svn: 21743	2005-05-06 19:58:35 +00:00
Chris Lattner	146014b748	remove some ugly hacks that are no longer needed since andrew removed the varargs munging code llvm-svn: 21742	2005-05-06 19:49:51 +00:00
Chris Lattner	c9be572154	BAD typeo which caused many testsuite failures last night. Note to self, do not change code after testing it without retesting! llvm-svn: 21741	2005-05-06 17:13:16 +00:00
Chris Lattner	1bc2753d69	clean up the CBE output a bit llvm-svn: 21740	2005-05-06 06:58:42 +00:00
Chris Lattner	f70b2785b7	add tail marker as a comment llvm-svn: 21739	2005-05-06 06:53:07 +00:00
Chris Lattner	4e9d804f1d	Make the stub functions be tail calls llvm-svn: 21738	2005-05-06 06:48:54 +00:00
Chris Lattner	146447f57a	Preserve tail marker llvm-svn: 21737	2005-05-06 06:48:21 +00:00
Chris Lattner	0187977904	Implement Transforms/Inline/inline-tail.ll llvm-svn: 21736	2005-05-06 06:47:52 +00:00
Chris Lattner	3d4098b1e0	preserve the tail marker llvm-svn: 21734	2005-05-06 06:46:58 +00:00
Chris Lattner	47c5cd63f6	lex tail llvm-svn: 21729	2005-05-06 06:20:33 +00:00
Chris Lattner	59d23baab1	add bytecode reader support for tail calls llvm-svn: 21727	2005-05-06 06:13:34 +00:00
Chris Lattner	72ffd7e7d5	Add a 'tail' marker for call instructions, patch contributed by Alexander Friedman. llvm-svn: 21722	2005-05-06 05:51:46 +00:00
Chris Lattner	99db0ab3df	Wrap long lines llvm-svn: 21720	2005-05-06 05:34:40 +00:00
Chris Lattner	b953e27f85	DCE intrinsic instructions without side effects. llvm-svn: 21719	2005-05-06 05:27:34 +00:00
Chris Lattner	4f7bba1106	These intrinsics do not access memory llvm-svn: 21718	2005-05-06 05:21:04 +00:00
Chris Lattner	2b4c801d10	Teach instcombine propagate zeroness through shl instructions, implementing and.ll:test31 llvm-svn: 21717	2005-05-06 04:53:20 +00:00
Chris Lattner	ead76729cc	Implement shift.ll:test23. If we are shifting right then immediately truncating the result, turn signed shift rights into unsigned shift rights if possible. This leads to later simplification and happens often in 176.gcc. For example, this testcase: struct xxx { unsigned int code : 8; }; enum codes { A, B, C, D, E, F }; int foo(struct xxx P) { if ((enum codes)P->code == A) bar(); } used to be compiled to: int %foo(%struct.xxx %P) { %tmp.1 = getelementptr %struct.xxx* %P, int 0, uint 0 ; <uint> [#uses=1] %tmp.2 = load uint %tmp.1 ; <uint> [#uses=1] %tmp.3 = cast uint %tmp.2 to int ; <int> [#uses=1] %tmp.4 = shl int %tmp.3, ubyte 24 ; <int> [#uses=1] %tmp.5 = shr int %tmp.4, ubyte 24 ; <int> [#uses=1] %tmp.6 = cast int %tmp.5 to sbyte ; <sbyte> [#uses=1] %tmp.8 = seteq sbyte %tmp.6, 0 ; <bool> [#uses=1] br bool %tmp.8, label %then, label %UnifiedReturnBlock Now it is compiled to: %tmp.1 = getelementptr %struct.xxx* %P, int 0, uint 0 ; <uint> [#uses=1] %tmp.2 = load uint %tmp.1 ; <uint> [#uses=1] %tmp.2 = cast uint %tmp.2 to sbyte ; <sbyte> [#uses=1] %tmp.8 = seteq sbyte %tmp.2, 0 ; <bool> [#uses=1] br bool %tmp.8, label %then, label %UnifiedReturnBlock which is the difference between this: foo: subl $4, %esp movl 8(%esp), %eax movl (%eax), %eax shll $24, %eax sarl $24, %eax testb %al, %al jne .LBBfoo_2 and this: foo: subl $4, %esp movl 8(%esp), %eax movl (%eax), %eax testb %al, %al jne .LBBfoo_2 This occurs 3243 times total in the External tests, 215x in povray, 6x in each f2c'd program, 1451x in 176.gcc, 7x in crafty, 20x in perl, 25x in gap, 3x in m88ksim, 25x in ijpeg. Maybe this will cause a little jump on gcc tommorow :) llvm-svn: 21715	2005-05-06 04:18:52 +00:00
Chris Lattner	20b5bce229	Implement xor.ll:test22 llvm-svn: 21713	2005-05-06 02:07:39 +00:00
Chris Lattner	27f6e62cac	implement and.ll:test30 and set.ll:test21 llvm-svn: 21712	2005-05-06 01:53:19 +00:00
Chris Lattner	d38c600c9d	implement or.ll:test20 llvm-svn: 21709	2005-05-06 00:58:50 +00:00
Misha Brukman	1996bf6ea5	* Order #includes alphabetically * Remove commented-out debug printouts llvm-svn: 21707	2005-05-05 23:45:17 +00:00
Misha Brukman	d29b27d73b	Remove extra blank line llvm-svn: 21706	2005-05-05 23:43:47 +00:00
Misha Brukman	f52511fcc6	Remove vim settings from source code; people should use llvm/utils/vim/vimrc llvm-svn: 21704	2005-05-05 22:33:09 +00:00
Chris Lattner	64134a43a1	add support for undef values of opaque type, addressing PR541 llvm-svn: 21701	2005-05-05 22:21:19 +00:00
Chris Lattner	c390fbea0d	Add some extra checks. Opaque types don't have a null marker. llvm-svn: 21700	2005-05-05 20:57:00 +00:00
Chris Lattner	6e8167d1c2	When hitting an unsupported intrinsic, actually print it Lower debug info to noops. llvm-svn: 21698	2005-05-05 17:55:17 +00:00
Andrew Lenharth	09c3c4add4	ctpop lowering in legalize llvm-svn: 21697	2005-05-05 15:55:21 +00:00
Chris Lattner	adcc532d05	Fix a bug compimling Ruby, fixing this testcase: LowerSetJmp/2005-05-05-OldUses.ll llvm-svn: 21696	2005-05-05 15:47:43 +00:00
Andrew Lenharth	8e2beec4d1	fix typo llvm-svn: 21693	2005-05-04 19:25:37 +00:00
Andrew Lenharth	58ff51b153	Well, add support for ct* for 21264 only. 21164 is broken until expand works. llvm-svn: 21692	2005-05-04 19:12:09 +00:00
Andrew Lenharth	9282d00d4f	Make promoteOp work for CT* Proof? ubyte %bar(ubyte %x) { entry: %tmp.1 = call ubyte %llvm.ctlz( ubyte %x ) ret ubyte %tmp.1 } ==> zapnot $16,1,$0 CTLZ $0,$0 subq $0,56,$0 zapnot $0,1,$0 ret $31,($26),1 llvm-svn: 21691	2005-05-04 19:11:05 +00:00
Chris Lattner	1c462db06f	Instcombine: cast (X != 0) to int, cast (X == 1) to int -> X iff X has only the low bit set. This implements set.ll:test20. This triggers 2x on povray, 9x on mesa, 11x on gcc, 2x on crafty, 1x on eon, 6x on perlbmk and 11x on m88ksim. It allows us to compile these two functions into the same code: struct s { unsigned int bit : 1; }; unsigned foo(struct s p) { if (p->bit) return 1; else return 0; } unsigned bar(struct s p) { return p->bit; } llvm-svn: 21690	2005-05-04 19:10:26 +00:00
Reid Spencer	c564fd819c	Implement the IsDigitOptimization for simplifying calls to the isdigit library function: isdigit(chr) -> 0 or 1 if chr is constant isdigit(chr) -> chr - '0' <= 9 otherwise Although there are many calls to isdigit in llvm-test, most of them are compiled away by macros leaving only this: 2 MultiSource/Applications/hexxagon llvm-svn: 21688	2005-05-04 18:58:28 +00:00
Reid Spencer	8d2736401b	* Correct the function prototypes for some of the functions to match the actual spec (int -> uint) * Add the ability to get/cache the strlen function prototype. * Make sure generated values are appropriately named for debugging purposes * Add the SPrintFOptimiation for 4 casts of sprintf optimization: sprintf(str,cstr) -> llvm.memcpy(str,cstr) (if cstr has no %) sprintf(str,"") -> store sbyte 0, str sprintf(str,"%s",src) -> llvm.memcpy(str,src) (if src is constant) sprintf(str,"%c",chr) -> store chr, str ; store sbyte 0, str+1 The sprintf optimization didn't fire as much as I had hoped: 2 MultiSource/Applications/SPASS 5 MultiSource/Benchmarks/McCat/18-imp 22 MultiSource/Benchmarks/Prolangs-C/TimberWolfMC 1 MultiSource/Benchmarks/Prolangs-C/assembler 6 MultiSource/Benchmarks/Prolangs-C/unix-smail 2 MultiSource/Benchmarks/mediabench/mpeg2/mpeg2dec llvm-svn: 21679	2005-05-04 03:20:21 +00:00
Andrew Lenharth	8b64bd0fd5	Implement count leading zeros (ctlz), count trailing zeros (cttz), and count population (ctpop). Generic lowering is implemented, however only promotion is implemented for SelectionDAG at the moment. More coming soon. llvm-svn: 21676	2005-05-03 17:19:30 +00:00
Chris Lattner	9620dd281d	fix a bug in the 1 index GEP handling code llvm-svn: 21670	2005-05-03 16:44:45 +00:00
Reid Spencer	f52c228416	Implement optimizations for the strchr and llvm.memset library calls. Neither of these activated as many times as was hoped: strchr: 9 MultiSource/Applications/siod 1 MultiSource/Applications/d 2 MultiSource/Prolangs-C/archie-client 1 External/SPEC/CINT2000/176.gcc/176.gcc llvm.memset: no hits llvm-svn: 21669	2005-05-03 07:23:44 +00:00
Chris Lattner	e53a188512	add direct support for making GEP instrs with one index llvm-svn: 21665	2005-05-03 05:43:30 +00:00
Jeff Cohen	d33e8df701	Use ANSI-approved way of getting the value infinity (otherwise VC++ won't compile it) llvm-svn: 21662	2005-05-03 03:13:01 +00:00
Reid Spencer	0c484ea7de	Avoid garbage output in the statistics display by ensuring that the strings passed to Statistic's constructor are not destructable. The stats are printed during static destruction and the SimplifyLibCalls module was getting destructed before the statistics. llvm-svn: 21661	2005-05-03 02:54:54 +00:00
Reid Spencer	123f4e393f	Add the StrNCmpOptimization which is similar to strcmp. Unfortunately, this optimization didn't trigger on any llvm-test tests. llvm-svn: 21660	2005-05-03 01:43:45 +00:00
Reid Spencer	a5fcd1660f	Implement the fprintf optimization which converts calls like this: fprintf(F,"hello") -> fwrite("hello",strlen("hello"),1,F) fprintf(F,"%s","hello") -> fwrite("hello",strlen("hello"),1,F) fprintf(F,"%c",'x') -> fputc('c',F) This optimization fires severals times in llvm-test: 313 MultiSource/Applications/Burg 302 MultiSource/Benchmarks/Prolangs-C/TimberWolfMC 189 MultiSource/Benchmarks/Prolangs-C/mybison 175 MultiSource/Benchmarks/Prolangs-C/football 130 MultiSource/Benchmarks/Prolangs-C/unix-tbl llvm-svn: 21657	2005-05-02 23:59:26 +00:00
Andrew Lenharth	d46211fc03	fold fp div by 0 to inf, the way gcc does. This is legal according to the FP spec llvm-svn: 21655	2005-05-02 21:25:47 +00:00
Andrew Lenharth	1e1117ed7a	Remove support for 1.0 style varargs amusing of course, because we will have to go back to those semantics soon llvm-svn: 21654	2005-05-02 19:07:27 +00:00
John Criswell	d1933cb2e4	Fixed a comment. llvm-svn: 21653	2005-05-02 14:47:42 +00:00
Duraid Madina	4d9c8f8dce	support multiplication by constant negative integers this constmul code is still buggy though, so beware. mul by 7427 is currently broken, for example. will fix it when I get a moment :) llvm-svn: 21652	2005-05-02 07:27:14 +00:00
Duraid Madina	7a185a79a5	add support for bools to SELECT, this fixes Prolangs-C/bison from the testsuite, however 09-vor is still dead (hopefully for other reasons!) llvm-svn: 21651	2005-05-02 06:41:13 +00:00
Chris Lattner	7db64049a6	Implement getelementptr.ll:test11 llvm-svn: 21647	2005-05-01 04:42:15 +00:00
Chris Lattner	cee86a7095	Check for volatile loads only once. Implement load.ll:test7 llvm-svn: 21645	2005-05-01 04:24:53 +00:00
Tanya Lattner	845b0cc908	SMS for superblocks. llvm-svn: 21643	2005-05-01 01:27:47 +00:00
Tanya Lattner	b9da851880	Added extra constructor for superblocks. llvm-svn: 21642	2005-05-01 01:25:53 +00:00
Tanya Lattner	425f215095	Fixed bug in searchPath function for finding nodes between two recurrences. Changed dependence analyzer to only use dep distances of 2 or less. This is experimental. Changed MSchedGraph to be able to represent more then one BB (first steps). llvm-svn: 21641	2005-04-30 23:07:59 +00:00
Andrew Lenharth	936709ad19	I was sure I had thought about this and there was a reason it should work. But it is entirely possible I am just crazy. llvm-svn: 21640	2005-04-30 14:19:13 +00:00
Alkis Evlogimenos	66f1632de8	Do not use deprecated APIs llvm-svn: 21639	2005-04-30 07:13:31 +00:00
Reid Spencer	f7511e4fe2	Fix a comment that stated the wrong thing. llvm-svn: 21638	2005-04-30 06:45:47 +00:00
Chris Lattner	b0f53013d6	Eliminate some random whitespace llvm-svn: 21637	2005-04-30 04:44:07 +00:00
Chris Lattner	fe72cdf838	Codegen and legalize sin/cos/llvm.sqrt as FSIN/FCOS/FSQRT calls. This patch was contributed by Morten Ofstad, with some minor tweaks and bug fixes added by me. llvm-svn: 21636	2005-04-30 04:43:14 +00:00
Chris Lattner	b0af0dd919	Doesn't support these nodes llvm-svn: 21634	2005-04-30 04:26:56 +00:00
Chris Lattner	ce0d8c2408	This target doesn't support the FSIN/FCOS/FSQRT nodes yet llvm-svn: 21633	2005-04-30 04:26:06 +00:00
Chris Lattner	15d29b0220	Add support for FSIN/FCOS when unsafe math ops are enabled. Patch contributed by Morten Ofstad! llvm-svn: 21632	2005-04-30 04:25:35 +00:00
Chris Lattner	663664d10c	Add support for llvm.sqrt and sin/cos if unsafe math optimizations are enabled. llvm-svn: 21631	2005-04-30 04:12:40 +00:00
Chris Lattner	05d8a36ba7	Expose an option allowing unsafe math optimizations. Patch contributed by Morten Ofstad! llvm-svn: 21630	2005-04-30 04:09:52 +00:00
Chris Lattner	0366e4c0d3	Lower llvm.sqrt -> fsqrt/sqrt llvm-svn: 21629	2005-04-30 04:07:50 +00:00
Chris Lattner	234ffe2395	Add llvm.sqrt intrinsic, patch contributed by Morten Ofstad llvm-svn: 21627	2005-04-30 03:44:07 +00:00
Reid Spencer	cc551c4345	* Don't depend on "guessing" what a FILE* is, just require that the actual type be obtained from a CallInst we're optimizing. * Make it possible for getConstantStringLength to return the ConstantArray that it extracts in case the content is needed by an Optimization. * Implement the strcmp optimization * Implement the toascii optimization This pass is now firing several to many times in the following MultiSource tests: Applications/Burg - 7 (strcat,strcpy) Applications/siod - 13 (strcat,strcpy,strlen) Applications/spiff - 120 (exit,fputs,strcat,strcpy,strlen) Applications/treecc - 66 (exit,fputs,strcat,strcpy) Applications/kimwitu++ - 34 (strcmp,strcpy,strlen) Applications/SPASS - 588 (exit,fputs,strcat,strcpy,strlen) llvm-svn: 21626	2005-04-30 03:17:54 +00:00
Reid Spencer	a32eb179ed	Implement the optimizations for "pow" and "fputs" library calls. llvm-svn: 21618	2005-04-29 09:39:47 +00:00
Reid Spencer	ff5cc3cb16	Remove optimizations that don't require both operands to be constant. These are moved to simplify-libcalls pass. llvm-svn: 21614	2005-04-29 05:55:35 +00:00
Jeff Cohen	6dccb593c9	Consistently use 'class' to silence VC++ llvm-svn: 21612	2005-04-29 03:05:44 +00:00
Reid Spencer	fb6e0590a8	* Add constant folding for additional floating point library calls such as sinh, cosh, etc. * Make the name comparisons for the fp libcalls a little more efficient by switching on the first character of the name before doing comparisons. llvm-svn: 21611	2005-04-28 23:01:59 +00:00
Chris Lattner	27a534f181	Add support for FSQRT node, patch contributed by Morten Ofstad llvm-svn: 21610	2005-04-28 22:07:18 +00:00
Chris Lattner	fb0d0ea349	These functions can set errno! llvm-svn: 21609	2005-04-28 21:52:31 +00:00
Chris Lattner	236cef3563	Add some new X86 instrs, patch contributed by Morten Ofstad llvm-svn: 21608	2005-04-28 21:50:05 +00:00
Chris Lattner	2f7a83ffbf	Codegen fabs/fabsf as FABS. Patch contributed by Morten Ofstad llvm-svn: 21607	2005-04-28 21:48:42 +00:00
Chris Lattner	6ec8bb9e8d	Legalize FSQRT, FSIN, FCOS nodes, patch contributed by Morten Ofstad llvm-svn: 21606	2005-04-28 21:44:33 +00:00
Chris Lattner	4678a790e6	Add FSQRT, FSIN, FCOS nodes, patch contributed by Morten Ofstad llvm-svn: 21605	2005-04-28 21:44:03 +00:00
Reid Spencer	e7eb17c64b	Remove from the TODO list those optimizations that are already handled by constant folding implemented in lib/Transforms/Utils/Local.cpp. llvm-svn: 21604	2005-04-28 18:05:16 +00:00
Reid Spencer	b5d4b854ea	Document additional libcall transformations that need to be written. Help Wanted! There's a lot of them to write. llvm-svn: 21603	2005-04-28 04:40:06 +00:00
Reid Spencer	49cfe25457	Doxygenate. llvm-svn: 21602	2005-04-27 21:29:20 +00:00
Chris Lattner	96704dee49	remove 'statement with no effect' warning llvm-svn: 21600	2005-04-27 20:12:17 +00:00
Andrew Lenharth	2a00530fa7	Implement Value* tracking for loads and stores in the selection DAG. This enables one to use alias analysis in the backends. (TRUNK)Stores and (EXT\|ZEXT\|SEXT)Loads have an extra SDOperand which is a SrcValueSDNode which contains the Value. Note that if the operation is introduced by the backend, it will still have the operand, but the value will be null. llvm-svn: 21599	2005-04-27 20:10:01 +00:00
Chris Lattner	11f6bc02a9	Unbreak the sparc backend. llvm-svn: 21598	2005-04-27 18:57:15 +00:00
Reid Spencer	b7cff5d9d1	More Cleanup: * Name the instructions by appending to name of original * Factor common part out of a switch statement. llvm-svn: 21597	2005-04-27 17:46:54 +00:00
Duraid Madina	2f8f3f018d	clean up some warnings llvm-svn: 21590	2005-04-27 11:57:39 +00:00
Reid Spencer	1eb67fef62	This is a cleanup commit: * Correct stale documentation in a few places * Re-order the file to better associate things and reduce line count * Make the pass thread safe by caching the Function* objects needed by the optimizers in the pass object instead of globally. * Provide the SimplifyLibCalls pass object to the optimizer classes so they can access cached Function* objects and TargetData info * Make sure the pass resets its cache if the Module passed to runOnModule changes * Rename CallOptimizer LibCallOptimization. All the classes are named Optimization while the objects are Optimizer. * Don't cache Function* in the optimizer objects because they could be used by multiple PassManager's running in multiple threads * Add an optimization for strcpy which is similar to strcat * Add a "TODO" list at the end of the file for ideas on additional libcall optimizations that could be added (get ideas from other compilers). Sorry for the huge diff. Its mostly reorganization of code. That won't happen again as I believe the design and infrastructure for this pass is now done or close to it. llvm-svn: 21589	2005-04-27 07:54:40 +00:00
Chris Lattner	792ae155ad	detect functions that never return, and turn the instruction following a call to them into an 'unreachable' instruction. This triggers a bunch of times, particularly on gcc: gzip: 36 gcc: 601 eon: 12 bzip: 38 llvm-svn: 21587	2005-04-27 04:52:23 +00:00
Reid Spencer	e3b60245eb	Prefix the debug statistics so they group together. llvm-svn: 21583	2005-04-27 00:20:23 +00:00
Reid Spencer	27f80b8c96	In debug builds, make a statistic for each kind of call optimization. This helps track down what gets triggered in the pass so its easier to identify good test cases. llvm-svn: 21582	2005-04-27 00:05:45 +00:00
Chris Lattner	bd077a1945	This analysis doesn't take 'throwing' into consideration, it looks at 'unwinding' llvm-svn: 21581	2005-04-26 23:53:25 +00:00
Reid Spencer	ddef064121	Fix up the debug statement to actually use a newline .. radical concept. llvm-svn: 21580	2005-04-26 23:07:08 +00:00
Reid Spencer	7f06064798	Uh, this isn't argpromotion. llvm-svn: 21579	2005-04-26 23:05:17 +00:00
Reid Spencer	42906defb1	Add some debugging output so we can tell which calls are getting triggered llvm-svn: 21578	2005-04-26 23:02:16 +00:00
Reid Spencer	47a20efcb0	No, seriously folks, memcpy really does return void. llvm-svn: 21575	2005-04-26 22:49:48 +00:00
Reid Spencer	270f03e49e	memcpy returns void!!!!! llvm-svn: 21574	2005-04-26 22:46:23 +00:00
Chris Lattner	6b2ebc1531	don't let Reid build void*'s :) llvm-svn: 21571	2005-04-26 20:03:33 +00:00
Reid Spencer	303c65cea6	Fix some bugs found by running on llvm-test: * MemCpyOptimization can only be optimized if the 3rd and 4th arguments are constants and we weren't checking for that. * The result of llvm.memcpy (and llvm.memmove) is void* not sbyte*, put in a cast. llvm-svn: 21570	2005-04-26 19:55:57 +00:00
Reid Spencer	27afdaf88f	Changes From Review Feedback: * Have the SimplifyLibCalls pass acquire the TargetData and pass it down to the optimization classes so they can use it to make better choices for the signatures of functions, etc. * Rearrange the code a little so the utility functions are closer to their usage and keep the core of the pass near the top of the files. * Adjust the StrLen pass to get/use the correct prototype depending on the TargetData::getIntPtrType() result. The result of strlen is size_t which could be either uint or ulong depending on the platform. * Clean up some coding nits (cast vs. dyn_cast, remove redundant items from a switch, etc.) * Implement the MemMoveOptimization as a twin of MemCpyOptimization (they only differ in name). llvm-svn: 21569	2005-04-26 19:13:17 +00:00
Chris Lattner	5dc0b9e938	Make interval partition print correctly, patch contributed by Vladimir Prus! llvm-svn: 21566	2005-04-26 14:48:28 +00:00
Chris Lattner	f6199ef63a	Fix the compile failures from last night. llvm-svn: 21565	2005-04-26 14:40:41 +00:00
Duraid Madina	90bcae7fd2	constmul bugfix: multiply by 27611 was broken llvm-svn: 21564	2005-04-26 09:42:50 +00:00
Duraid Madina	675a0b9769	clean up the code! (oops) lots more cleaning left, however. llvm-svn: 21563	2005-04-26 08:43:47 +00:00
Reid Spencer	5590c48202	* Merge get_GVInitializer and getCharArrayLength into a single function named getConstantStringLength. This is the common part of StrCpy and StrLen optimizations and probably several others, yet to be written. It performs all the validity checks for looking at constant arrays that are supposed to be null-terminated strings and then computes the actual length of the string. * Implement the MemCpyOptimization class. This just turns memcpy of 1, 2, 4 and 8 byte data blocks that are properly aligned on those boundaries into a load and a store. Much more could be done here but alignment restrictions and lack of knowledge of the target instruction set prevent use from doing significantly more. That will have to be delegated to the code generators as they lower llvm.memcpy calls. llvm-svn: 21562	2005-04-26 07:45:18 +00:00
Duraid Madina	ee826ec8f6	* Add code to reduce multiplies by constant integers to shifts, adds and subtracts. This is a very rough and nasty implementation of Lefevre's "pattern finding" algorithm. With a few small changes though, it should end up beating most other methods in common use, regardless of the size of the constant (currently, it's often one or two shifts worse) TODO: rewrite it so it's not hideously ugly (this is a translation from perl, which doesn't help ;) bypass most of it for multiplies by 2^n+1 (eventually) teach it that some combinations of shift+add are cheaper than others (e.g. shladd on ia64, scaled adds on alpha) get it to try multiple booth encodings in search of the cheapest routine make it work for negative constants This is hacked up as a DAG->DAG transform, so once I clean it up I hope it'll be pulled out of here and put somewhere else. The only thing backends should really have to worry about for now is where to draw the line between using this code vs. going ahead and doing an integer multiply anyway. llvm-svn: 21560	2005-04-26 07:23:02 +00:00
Reid Spencer	584e662d19	* Implement StrLenOptimization * Factor out commonalities between StrLenOptimization and StrCatOptimization * Make sure that signatures return sbyte* not void* llvm-svn: 21559	2005-04-26 05:24:00 +00:00
Reid Spencer	6a1c238029	Incorporate feedback from Chris: * Change signatures of OptimizeCall and ValidateCalledFunction so they are non-const, allowing the optimization object to be modified. This is in support of caching things used across multiple calls. * Provide two functions for constructing and caching function types * Modify the StrCatOptimization to cache Function objects for strlen and llvm.memcpy so it doesn't regenerate them on each call site. Make sure these are invalidated each time we start the pass. * Handle both a GEP Instruction and a GEP ConstantExpr * Add additional checks to make sure we really are dealing with an arary of sbyte and that all the element initializers are ConstantInt or ConstantExpr that reduce to ConstantInt. * Make sure the GlobalVariable is constant! * Don't use ConstantArray::getString as it can fail and it doesn't give us the right thing. We must check for null bytes in the middle of the array. * Use llvm.memcpy instead of memcpy so we can factor alignment into it. * Don't use void* types in signatures, replace with sbyte* instead. llvm-svn: 21555	2005-04-26 03:26:15 +00:00
Chris Lattner	15bcc5273b	Fold (X > -1) \| (Y > -1) --> (X&Y > -1) llvm-svn: 21552	2005-04-26 01:18:33 +00:00
Reid Spencer	5fcce35fa8	Changes due to code review and new implementation: * Don't use std::string for the function names, const char* will suffice * Allow each CallOptimizer to validate the function signature before doing anything * Repeatedly loop over the functions until an iteration produces no more optimizations. This allows one optimization to insert a call that is optimized by another optimization. * Implement the ConstantArray portion of the StrCatOptimization * Provide a template for the MemCpyOptimization * Make ExitInMainOptimization split the block, not delete everything after the return instruction. (This covers revision 1.3 and 1.4, as the 1.3 comments were botched) llvm-svn: 21548	2005-04-25 21:20:38 +00:00
Chris Lattner	d8ac4da793	implement some more logical compares with constants, so that: int foo1(int x, int y) { int t1 = x >= 0; int t2 = y >= 0; return t1 & t2; } int foo2(int x, int y) { int t1 = x == -1; int t2 = y == -1; return t1 & t2; } produces: _foo1: or r2, r4, r3 srwi r2, r2, 31 xori r3, r2, 1 blr _foo2: and r2, r4, r3 addic r2, r2, 1 li r2, 0 addze r3, r2 blr instead of: _foo1: srwi r2, r4, 31 xori r2, r2, 1 srwi r3, r3, 31 xori r3, r3, 1 and r3, r2, r3 blr _foo2: addic r2, r4, 1 li r2, 0 addze r2, r2 addic r3, r3, 1 li r3, 0 addze r3, r3 and r3, r2, r3 blr llvm-svn: 21547	2005-04-25 21:20:28 +00:00
Reid Spencer	9b66533e40	Lots of changes based on review and new functionality: * Use a llvm-svn: 21546	2005-04-25 21:11:48 +00:00
Chris Lattner	7931b75a81	Codegen x < 0 \| y < 0 as (x\|y) < 0. This allows us to compile this to: _foo: or r2, r4, r3 srwi r3, r2, 31 blr instead of: _foo: srwi r2, r4, 31 srwi r3, r3, 31 or r3, r2, r3 blr llvm-svn: 21544	2005-04-25 21:03:25 +00:00
Chris Lattner	3aff97254e	Make dominates(A,B) work with post dominators. Patch contributed by Naveen Neelakantam, thanks! llvm-svn: 21543	2005-04-25 20:50:33 +00:00
Chris Lattner	3f22e5ba5d	implement getelementptr.ll:test10 llvm-svn: 21541	2005-04-25 20:17:30 +00:00
Chris Lattner	bab9c90db4	Correctly handle global-argument aliases induced in main llvm-svn: 21537	2005-04-25 19:16:31 +00:00
Chris Lattner	e39652d21c	Don't mess up SCC traversal when a node has null edges out of it. llvm-svn: 21536	2005-04-25 19:16:17 +00:00
Reid Spencer	4b4864684a	Post-Review Cleanup: * Fix comments at top of file * Change algorithm for running the call optimizations from nn to something closer to n. Use a hash_map to store and lookup the optimizations since there will eventually (or potentially) be a large number of them. This gets lookup based on the name of the function to O(1). Each CallOptimizer now has a std::string member named func_name that tracks the name of the function that it applies to. It is this string that is entered into the hash_map for fast comparison against the function names encountered in the module. * Cleanup some style issues pertaining to iterator invalidation * Don't pass the Function pointer to the OptimizeCall function because if the optimization needs it, it can get it from the CallInst passed in. * Add the skeleton for a new CallOptimizer, StrCatOptimizer which will eventually replace strcat's of constant strings with direct copies. llvm-svn: 21526	2005-04-25 03:59:26 +00:00
Reid Spencer	e952b16f37	Shut GCC 4.0 up about classes that have virtual functions but a non-virtual destructor. Just add the do-nothing virtual destructor. llvm-svn: 21524	2005-04-25 02:55:55 +00:00
Reid Spencer	95a0d8af78	A new pass to provide specific optimizations for certain well-known library calls. The pass visits all external functions in the module and determines if such function calls can be optimized. The optimizations are specific to the library calls involved. This initial version only optimizes calls to exit(3) when they occur in main(): it changes them to ret instructions. llvm-svn: 21522	2005-04-25 02:53:12 +00:00
Reid Spencer	27134f31f2	Older compilers won't like the inline virtual destructor in the header file so we put the destructor in Pass.cpp and make it non-inline. llvm-svn: 21520	2005-04-25 01:01:35 +00:00
Reid Spencer	c206223e65	Shut GCC 4.0 up about classes with virtual functions but no virtual destructor. llvm-svn: 21510	2005-04-24 22:27:20 +00:00
Chris Lattner	e78ae0e1b1	Eliminate cases where we could << by 64, which is undefined in C. llvm-svn: 21500	2005-04-24 17:46:05 +00:00
Chris Lattner	5fdcc49858	Implement xor.ll:test21: select (not C), A, B -> select C, B, A llvm-svn: 21495	2005-04-24 07:30:14 +00:00
Chris Lattner	a9f3e89328	Allow these methods to take a generic Value* to simplify clients. Use const_cast instead of c casts. llvm-svn: 21493	2005-04-24 07:28:37 +00:00
Chris Lattner	26c5e79151	Use getPrimitiveSizeInBits() instead of getPrimitiveSize()*8 Completely rework the 'setcc (cast x to larger), y' code. This code has the advantage of implementing setcc.ll:test19 (being more general than the previous code) and being correct in all cases. This allows us to unxfail 2004-11-27-SetCCForCastLargerAndConstant.ll, and close PR454. llvm-svn: 21491	2005-04-24 06:59:08 +00:00
Chris Lattner	dfae677997	Fix a bug in my previous checkin llvm-svn: 21485	2005-04-23 22:01:39 +00:00
Chris Lattner	d10f1f55f9	Add a method, remove last use of Type.def llvm-svn: 21483	2005-04-23 22:00:09 +00:00
Jeff Cohen	6c42217055	Eliminate tabs and trailing spaces llvm-svn: 21480	2005-04-23 21:38:35 +00:00

... 2 3 4 5 6 ...

10221 Commits