llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-29 23:12:55 +01:00

Author	SHA1	Message	Date
Chris Lattner	5d9ecff961	encode rdtsc correctly llvm-svn: 24435	2005-11-20 22:13:18 +00:00
Chris Lattner	f4f66fafd9	use chain operands to ensure the copies don't wander from the rdtsc instruction. llvm-svn: 24434	2005-11-20 22:01:40 +00:00
Andrew Lenharth	a369904fc5	The second patch of X86 support for read cycle counter. llvm-svn: 24430	2005-11-20 21:41:10 +00:00
Chris Lattner	af79013023	Teach the x86 backend about the register constraints of its addressing mode. Patch by Evan Cheng llvm-svn: 24423	2005-11-19 07:01:30 +00:00
Chris Lattner	6e0171ba8b	Add load and other support to the dag-dag isel. Patch contributed by Evan Cheng! llvm-svn: 24419	2005-11-19 02:11:08 +00:00
Chris Lattner	72fa26a85b	add more patterns, patch by Evan Cheng. llvm-svn: 24406	2005-11-18 01:04:42 +00:00
Chris Lattner	f829636c6b	Add patterns for some 16-bit immediate instructions, patch contributed by Evan Cheng. llvm-svn: 24384	2005-11-17 02:01:55 +00:00
Chris Lattner	fec54e57a0	Add patterns for several simple instructions that take i32 immediates. Patch contributed by Evan Cheng! llvm-svn: 24382	2005-11-16 22:59:19 +00:00
Chris Lattner	9caa214f72	initial step at adding a dag-to-dag isel for X86 backend. Patch contributed by Evan Cheng! llvm-svn: 24371	2005-11-16 01:54:32 +00:00
Chris Lattner	792ac11aee	Separate X86ISelLowering stuff out from the X86ISelPattern.cpp file. Patch contributed by Evan Cheng. llvm-svn: 24358	2005-11-15 00:40:23 +00:00
Chris Lattner	3fdc97d460	Add a new option to indicate we want the code generator to emit code quickly,not spending tons of time microoptimizing it. This is useful for an -O0style of build. llvm-svn: 24233	2005-11-08 02:11:51 +00:00
Chris Lattner	bdcb2a99b6	add a note that Nate mentioned last week llvm-svn: 23898	2005-10-23 21:44:59 +00:00
Chris Lattner	11c044d7cf	Put some of my random notes somewhere public llvm-svn: 23897	2005-10-23 19:52:42 +00:00
Nate Begeman	6c42f509bc	Invert the TargetLowering flag that controls divide by consant expansion. Add a new flag to TargetLowering indicating if the target has really cheap signed division by powers of two, make ppc use it. This will probably go away in the future. Implement some more ISD::SDIV folds in the dag combiner Remove now dead code in the x86 backend. llvm-svn: 23853	2005-10-21 00:02:42 +00:00
Nate Begeman	0eeb198d81	Remove some dead code now that the dag combiner exists. llvm-svn: 23754	2005-10-15 22:08:02 +00:00
Nate Begeman	3b6c2df603	Properly split f32 and f64 into separate register classes for scalar sse fp fixing a bunch of nasty hackery llvm-svn: 23735	2005-10-14 22:06:00 +00:00
Chris Lattner	462fe8b2cc	silence some warnings llvm-svn: 23594	2005-10-02 16:29:36 +00:00
Chris Lattner	f70bf81bb6	simplify this code using the new regclass info passed in llvm-svn: 23557	2005-09-30 17:12:38 +00:00
Chris Lattner	a1266f8ed5	Pass extra regclasses into spilling code llvm-svn: 23537	2005-09-30 01:29:42 +00:00
Chris Lattner	d3b3d07c41	Add FP versions of the binary operators, keeping the int and fp worlds seperate. Though I have done extensive testing, it is possible that this will break things in configs I can't test. Please let me know if this causes a problem and I'll fix it ASAP. llvm-svn: 23505	2005-09-28 22:29:17 +00:00
Chris Lattner	4a8f6d97ff	Implement the isLoadFromStackSlot interface llvm-svn: 23387	2005-09-19 05:23:44 +00:00
Chris Lattner	54139f0b83	give all operands names llvm-svn: 23356	2005-09-14 21:10:24 +00:00
Chris Lattner	fa644b391f	fix a major regression from my patch this afternoon llvm-svn: 23347	2005-09-14 06:06:45 +00:00
Chris Lattner	00bfb7812d	This code is no longer needed, it is moved to the target-indep code llvm-svn: 23332	2005-09-13 19:31:44 +00:00
Chris Lattner	70e3e44ec4	Handle any_extend like zext llvm-svn: 23202	2005-09-02 00:16:09 +00:00
Jim Laskey	f32ef9a37f	1. Use SubtargetFeatures in llc/lli. 2. Propagate feature "string" to all targets. 3. Implement use of SubtargetFeatures in PowerPCTargetSubtarget. llvm-svn: 23192	2005-09-01 21:38:21 +00:00
Reid Spencer	a4acd20ae2	Adjust to member variable name change. llvm-svn: 23119	2005-08-27 19:09:48 +00:00
Chris Lattner	439ef36320	Fix a bug in my previous checkin llvm-svn: 23082	2005-08-26 17:18:44 +00:00
Chris Lattner	a31708e6b3	Change ConstantPoolSDNode to actually hold the Constant itself instead of putting it into the constant pool. This allows the isel machinery to create constants that it will end up deciding are not needed, without them ending up in the resultant function constant pool. llvm-svn: 23081	2005-08-26 17:15:30 +00:00
Chris Lattner	fc59d17656	Fix a warning llvm-svn: 23031	2005-08-25 00:05:15 +00:00
Chris Lattner	0444d66753	Adjust to new livevars interface llvm-svn: 22991	2005-08-23 23:41:14 +00:00
Chris Lattner	51bad854a7	Simplify this code by using LiveVariables::KillsRegister llvm-svn: 22988	2005-08-23 22:49:55 +00:00
Chris Lattner	2ac3fd08d2	Split RegisterClass 'Methods' into MethodProtos and MethodBodies llvm-svn: 22929	2005-08-19 19:13:20 +00:00
Chris Lattner	f86654ffde	Put register classes into namespaces llvm-svn: 22925	2005-08-19 18:51:57 +00:00
Chris Lattner	e894de1791	The simple isel being gone makes this dead! llvm-svn: 22914	2005-08-19 18:32:03 +00:00
Chris Lattner	d7bd59d77e	add a few missing cases llvm-svn: 22891	2005-08-19 00:41:29 +00:00
Chris Lattner	f62a66a21c	Give ADJCALLSTACKDOWN/UP the correct operands. Give a whole bunch of other stuff variable operands, particularly FP. The FP stackifier is playing fast and loose with operands here, so we have to mark them all as variable. This will have to be fixed before we can dag->dag the X86 backend. The solution is for the pre-stackifier and post-stackifier instructions to all be disjoint. llvm-svn: 22890	2005-08-19 00:38:22 +00:00
Chris Lattner	abad70eaf8	The variable SAR's only take one operand too llvm-svn: 22888	2005-08-19 00:31:37 +00:00
Chris Lattner	8ce7dd449a	Stop adding bogus operands to variable shifts on X86. These instructions only take one operand. The other comes implicitly in through CL. llvm-svn: 22887	2005-08-19 00:16:17 +00:00
Nate Begeman	a978ae8b7d	Remove the X86 and PowerPC Simple instruction selectors; their time has passed. llvm-svn: 22886	2005-08-18 23:53:15 +00:00
Chris Lattner	583658a766	update the backends to work with the new CopyFromReg/CopyToReg/ImplicitDef nodes llvm-svn: 22807	2005-08-16 21:56:37 +00:00
Nate Begeman	f6b6378f23	Implement BR_CC and BRTWOWAY_CC. This allows the removal of a rather nasty fixme from the PowerPC backend. Emit slightly better code for legalizing select_cc. llvm-svn: 22805	2005-08-16 19:49:35 +00:00
Nate Begeman	6e0168fe5f	Fix last night's X86 regressions by putting code for SSE in the if(SSE) block. nur. llvm-svn: 22788	2005-08-14 18:37:02 +00:00
Nate Begeman	89f12b7721	Fix FP_TO_UINT with Scalar SSE2 now that the legalizer can handle it. We now generate the relatively good code sequences: unsigned short foo(float a) { return a; } _foo: movss 4(%esp), %xmm0 cvttss2si %xmm0, %eax movzwl %ax, %eax ret and unsigned bar(float a) { return a; } _bar: movss .CPI_bar_0, %xmm0 movss 4(%esp), %xmm1 movapd %xmm1, %xmm2 subss %xmm0, %xmm2 cvttss2si %xmm2, %eax xorl $-2147483648, %eax cvttss2si %xmm1, %ecx ucomiss %xmm0, %xmm1 cmovb %ecx, %eax ret llvm-svn: 22786	2005-08-14 04:36:51 +00:00
Chris Lattner	1277703a48	Update the targets to the new SETCC/CondCodeSDNode interfaces. llvm-svn: 22729	2005-08-09 20:21:10 +00:00
Chris Lattner	07af090121	adjust to change in getSubtarget() api llvm-svn: 22687	2005-08-05 21:54:27 +00:00
Nate Begeman	09997f1012	Add Subtarget support to PowerPC. Next up, using it. llvm-svn: 22644	2005-08-04 07:12:09 +00:00
Nate Begeman	6cd034da8e	Scalar SSE: load +0.0 -> xorps/xorpd Scalar SSE: a < b ? c : 0.0 -> cmpss, andps Scalar SSE: float -> i16 needs to be promoted llvm-svn: 22637	2005-08-03 23:26:28 +00:00
Chris Lattner	cc8ae687e1	Update to use the new MathExtras.h support for log2 computation. Patch contributed by Jim Laskey! llvm-svn: 22594	2005-08-02 19:26:06 +00:00
Jeff Cohen	019104459d	Keep tabs and trailing spaces out. llvm-svn: 22565	2005-07-30 18:33:25 +00:00
Chris Lattner	11be5c11d5	fix a typeo llvm-svn: 22561	2005-07-30 00:43:00 +00:00
Chris Lattner	a681fc64d6	Change the fp to integer code to not perform 2-byte stores followed by 1 byte loads and other operations. This is bad for store-forwarding on common CPUs. We now do this: fnstcw WORD PTR [%ESP] mov %AX, WORD PTR [%ESP] instead of: fnstcw WORD PTR [%ESP] mov %AL, BYTE PTR [%ESP + 1] llvm-svn: 22559	2005-07-30 00:17:52 +00:00
Chris Lattner	cf208334d9	Use a custom expander for all FP to int conversions, as the X86 only has FP-to-int-in-memory: this exposes the load from the stored slot to the selection dag, allowing it to be folded into other operaions. llvm-svn: 22556	2005-07-30 00:05:54 +00:00
Andrew Lenharth	3a7dc9f0bd	turn off GOT on archs that didn't use it (not that it appeard to harm them much with it on) llvm-svn: 22553	2005-07-29 23:32:02 +00:00
Chris Lattner	2aa847898d	Implement a FIXME: move a bunch of cruft for handling FP_TO_*INT operations that the X86 does not support to the legalizer. This allows it to be better optimized, etc, and will help with SSE support. llvm-svn: 22551	2005-07-29 01:00:29 +00:00
Chris Lattner	9db78c43c5	Don't forget to diddle with the control word when performing an FISTP64. llvm-svn: 22550	2005-07-29 00:54:34 +00:00
Chris Lattner	a30d4be57d	Use a custom expander to compile this: long %test4(double %X) { %tmp.1 = cast double %X to long ; <long> [#uses=1] ret long %tmp.1 } to this: _test4: sub %ESP, 12 fld QWORD PTR [%ESP + 16] fistp QWORD PTR [%ESP] mov %EDX, DWORD PTR [%ESP + 4] mov %EAX, DWORD PTR [%ESP] add %ESP, 12 ret instead of this: _test4: sub %ESP, 28 fld QWORD PTR [%ESP + 32] fstp QWORD PTR [%ESP] call ___fixdfdi add %ESP, 28 ret llvm-svn: 22549	2005-07-29 00:40:01 +00:00
Jeff Cohen	bd51ec7461	Eliminate all remaining tabs and trailing spaces. llvm-svn: 22523	2005-07-27 06:12:32 +00:00
Jeff Cohen	81980781a1	Eliminate tabs and trailing spaces. llvm-svn: 22520	2005-07-27 05:53:44 +00:00
Andrew Lenharth	0e1c0e7c79	update interface llvm-svn: 22498	2005-07-22 20:49:37 +00:00
Reid Spencer	40c5ebe4eb	For: memory operations -> stores This is the first incremental patch to implement this feature. It adds no functionality to LLVM but setup up the information needed from targets in order to implement the optimization correctly. Each target needs to specify the maximum number of store operations for conversion of the llvm.memset, llvm.memcpy, and llvm.memmove intrinsics into a sequence of store operations. The limit needs to be chosen at the threshold of performance for such an optimization (generally smallish). The target also needs to specify whether the target can support unaligned stores for multi-byte store operations. This helps ensure the optimization doesn't generate code that will trap on an alignment errors. More patches to follow. llvm-svn: 22468	2005-07-19 04:52:44 +00:00
Nate Begeman	160c12d896	Teach the legalizer how to promote SINT_TO_FP to a wider SINT_TO_FP that the target natively supports. This eliminates some special-case code from the x86 backend and generates better code as well. For an i8 to f64 conversion, before & after: _x87 before: subl $2, %esp movb 6(%esp), %al movsbw %al, %ax movw %ax, (%esp) filds (%esp) addl $2, %esp ret _x87 after: subl $2, %esp movsbw 6(%esp), %ax movw %ax, (%esp) filds (%esp) addl $2, %esp ret _sse before: subl $12, %esp movb 16(%esp), %al movsbl %al, %eax cvtsi2sd %eax, %xmm0 addl $12, %esp ret _sse after: subl $12, %esp movsbl 16(%esp), %eax cvtsi2sd %eax, %xmm0 addl $12, %esp ret llvm-svn: 22452	2005-07-16 02:02:34 +00:00
Nate Begeman	7a1bc7318d	Teach the register allocator that movaps is also a move instruction llvm-svn: 22451	2005-07-16 02:00:20 +00:00
Nate Begeman	c93c1c5148	A couple more darwinisms llvm-svn: 22450	2005-07-16 01:59:47 +00:00
Chris Lattner	79573b1a93	Remove all knowledge of UINT_TO_FP from the X86 backend, relying on the legalizer to eliminate them. With this comes the expected code quality improvements, such as, for this: double foo(unsigned short X) { return X; } we now generate this: _foo: subl $4, %esp movzwl 8(%esp), %eax movl %eax, (%esp) fildl (%esp) addl $4, %esp ret instead of this: _foo: subl $4, %esp movw 8(%esp), %ax movzwl %ax, %eax ;; Load not folded into this. movl %eax, (%esp) fildl (%esp) addl $4, %esp ret -Chris llvm-svn: 22449	2005-07-16 00:28:20 +00:00
Nate Begeman	957e0e7c9e	Get closer to fully working scalar FP in SSE regs. This gets singlesource working, and Olden/power. llvm-svn: 22441	2005-07-15 00:38:55 +00:00
Nate Begeman	8c2dadc92e	Add support for printing the sse scalar comparison instruction mnemonics. llvm-svn: 22440	2005-07-14 22:52:25 +00:00
Nate Begeman	7330d9cd80	Check in the last of the darwin-specific code necessary to get shootout working before modifying the asm printer to use the subtarget info. llvm-svn: 22408	2005-07-12 18:34:58 +00:00
Nate Begeman	4d96f2769c	Clean up the TargetSubtarget class a bit, removing an unnecessary argument to the constructor. llvm-svn: 22392	2005-07-12 02:41:19 +00:00
Chris Lattner	b383dec36f	Minor changes to improve comments and fix the build on _WIN32 systems. llvm-svn: 22391	2005-07-12 02:36:10 +00:00
Chris Lattner	855fe2ea0c	Add a note llvm-svn: 22390	2005-07-12 02:35:36 +00:00
Nate Begeman	626fb671c8	Implement Subtarget support Implement the X86 Subtarget. This consolidates the checks for target triple, and setting options based on target triple into one place. This allows us to convert the asm printer and isel over from being littered with "forDarwin", "forCygwin", etc. into just having the appropriate flags for each subtarget feature controlling the code for that feature. This patch also implements indirect external and weak references in the X86 pattern isel, for darwin. Next up is to convert over the asm printers to use this new interface. llvm-svn: 22389	2005-07-12 01:41:54 +00:00
Nate Begeman	faf9b5b763	Commit some pending darwin changes before subtarget support. llvm-svn: 22388	2005-07-12 01:37:28 +00:00
Chris Lattner	cace336deb	Output .size directives to tell the assembler the size of each function. llvm-svn: 22381	2005-07-11 06:29:14 +00:00
Chris Lattner	af83621722	Fix crazy indentation llvm-svn: 22380	2005-07-11 06:25:47 +00:00
Chris Lattner	a556fc183f	Refactor things a bit to allow the ELF code emitter to run the X86 machine code emitter after itself. llvm-svn: 22376	2005-07-11 05:17:48 +00:00
Chris Lattner	41dbb3993d	Remove prototype for non-existant function llvm-svn: 22372	2005-07-11 04:20:55 +00:00
Chris Lattner	ffaf40a143	Change *EXTLOAD to use an VTSDNode operand instead of being an MVTSDNode. This is the last MVTSDNode. This allows us to eliminate a bunch of special case code for handling MVTSDNodes. Also, remove some uses of dyn_cast that should really be cast (which is cheaper in a release build). llvm-svn: 22368	2005-07-10 01:56:13 +00:00
Chris Lattner	273b81e0c0	Change TRUNCSTORE to use a VTSDNode operand instead of being an MVTSTDNode llvm-svn: 22366	2005-07-10 00:29:18 +00:00
Nate Begeman	70532b9f00	Add support for assembling .s files on mac os x for intel Add support for running bugpoint on mac os x for intel llvm-svn: 22351	2005-07-08 00:23:26 +00:00
Chris Lattner	e45dd7850d	Restore some code that was accidentally removed by Nate's patch yesterday. This fixes the regressions from last night. llvm-svn: 22344	2005-07-07 17:12:53 +00:00
Nate Begeman	667588d524	Fix a typo in my checkin today that caused regressions. Oops! llvm-svn: 22341	2005-07-07 06:32:01 +00:00
Nate Begeman	e5314eb2c2	First round of support for doing scalar FP using the SSE2 ISA extension and XMM registers. There are many known deficiencies and fixmes, which will be addressed ASAP. The major benefit of this work is that it will allow the LLVM register allocator to allocate FP registers across basic blocks. The x86 backend will still default to x87 style FP. To enable this work, you must pass -enable-sse-scalar-fp and either -sse2 or -sse3 to llc. An example before and after would be for: double foo(double *P) { double Sum = 0; int i; for (i = 0; i < 1000; ++i) Sum += P[i]; return Sum; } The inner loop looks like the following: x87: .LBB_foo_1: # no_exit fldl (%esp) faddl (%eax,%ecx,8) fstpl (%esp) incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit SSE2: addsd (%eax,%ecx,8), %xmm0 incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit llvm-svn: 22340	2005-07-06 18:59:04 +00:00
Chris Lattner	199560c668	Make several cleanups to Andrews varargs change: 1. Pass Value's into lowering methods so that the proper pointers can be added to load/stores from the valist 2. Intrinsics that return void should only return a token chain, not a token chain/retval pair. 3. Rename LowerVAArgNext -> LowerVAArg, because VANext is long gone. 4. Now that we have Value's available in the lowering methods, pass them into any load/stores from the valist that are emitted llvm-svn: 22339	2005-07-05 19:58:54 +00:00
Chris Lattner	f8faaa8a39	Fit to 80 columns llvm-svn: 22336	2005-07-05 17:50:16 +00:00
Chris Lattner	93a2576078	Percolate the call up to the right superclass llvm-svn: 22330	2005-07-03 17:34:39 +00:00
Nate Begeman	12d2961fd2	The statistic needs to be in the correct namespace. llvm-svn: 22327	2005-07-01 23:56:38 +00:00
Chris Lattner	b299933fc7	Refactor X86AsmPrinter.cpp into multiple files. Patch contributed by Aaron Gray, cleaned up by me. llvm-svn: 22324	2005-07-01 22:44:09 +00:00
Nate Begeman	31e3028bac	Make the x86 asm printer darwin-aware. This mostly entails doing the same thing as cygwin most of the time, and printing our alignments in log2 rather than number of bytes. llvm-svn: 22316	2005-06-30 00:53:20 +00:00
Nate Begeman	032a94775d	Initial set of .td file changes necessary to get scalar fp in xmm registers working. The instruction selector changes will hopefully be coming later this week once they are debugged. This is necessary to support the darwin x86 FP model, and is recommended by intel as the replacement for x87. As a bonus, the register allocator knows how to deal with these registers across basic blocks, unliky the FP stackifier. This leads to significantly better codegen in several cases. llvm-svn: 22300	2005-06-27 21:20:31 +00:00
Chris Lattner	cd6b5c5011	Add support to the X86 backend for emitting ELF files. To use this, we currently use: llc t.bc --filetype=obj This will produce a t.o file which is dumpable with readelf. Currently the file produced is empty, but the scaffolding to do more is now in place. llvm-svn: 22292	2005-06-27 06:30:12 +00:00
Chris Lattner	06282f51cf	Refactor the addPassesToEmitAssembly interface into a addPassesToEmitFile interface. llvm-svn: 22282	2005-06-25 02:48:37 +00:00
Andrew Lenharth	4fd2bde906	If we support structs as va_list, we must pass pointers to them to va_copy See last commit for LangRef, this implements it on all targets. llvm-svn: 22273	2005-06-22 21:04:42 +00:00
John Criswell	00c8485c77	Fixed indentation. llvm-svn: 22270	2005-06-20 19:59:22 +00:00
Andrew Lenharth	a9214fec08	core changes for varargs llvm-svn: 22254	2005-06-18 18:34:52 +00:00
Chris Lattner	dce633f2dd	silence a bogus warning llvm-svn: 22245	2005-06-17 13:23:32 +00:00
Nate Begeman	023a21ea32	Fix lli linking on Mac OS X 10.4.1 for Intel. llvm-svn: 22200	2005-06-08 01:02:38 +00:00
Reid Spencer	966f750ad2	Make sure that Cygwin assembly includes _ as part of function names. llvm-svn: 22190	2005-06-02 21:33:19 +00:00
Nate Begeman	f6e0f54d70	C'mon everybody, let's modify X86JITInfo.cpp. This time, we add <iostream> so that the shiny new use of std::cerr is defined. llvm-svn: 22156	2005-05-20 21:29:24 +00:00
Misha Brukman	48ca8224da	Since everyone else has "fixed" this file, might as well join in the fun. * Change assert() to std::cerr printout, as it will not appear in opt builds * Add comments to clarify what #ifdef/#else/#endif match what condition(s) llvm-svn: 22154	2005-05-20 19:46:50 +00:00
Chris Lattner	f1479ae390	Fix this a 3rd time :) llvm-svn: 22151	2005-05-20 17:00:21 +00:00
Andrew Lenharth	46b9d601f5	fix compilation error due to no abort being defined. There is probably a better way to do this llvm-svn: 22150	2005-05-20 16:34:44 +00:00
Duraid Madina	c043395fe8	this seems dead (and broke the ia64 build, so..) llvm-svn: 22147	2005-05-20 06:21:59 +00:00
Jeff Cohen	a0f463a25c	Fix tail call support in VC++ builds llvm-svn: 22143	2005-05-20 01:35:39 +00:00
Chris Lattner	c1d9b43fc1	Fastcc passes arguments in EAX and EDX, make sure the JIT doesn't clobber them llvm-svn: 22137	2005-05-19 06:49:17 +00:00
Chris Lattner	6dcae1c96e	Tailcalls require stubs to be emitted. Otherwise, the compilation callback doesn't know who 'called' it. llvm-svn: 22136	2005-05-19 05:54:33 +00:00
Chris Lattner	95650a26b2	don't reserve space for tailcall arg areas. It explicitly managed. llvm-svn: 22050	2005-05-15 06:07:10 +00:00
Chris Lattner	2fd2e53f6a	Teach reginfo how to deal with ADJSTACKPTRri, allowing us to generate: add %ESP, 20 jmp %EDX # TAIL CALL instead of: add %ESP, -8 add %ESP, 28 jmp %EDX # TAIL CALL llvm-svn: 22047	2005-05-15 05:49:58 +00:00
Chris Lattner	77f8c2cece	Implement proper tail calls in the X86 backend for all fastcc->fastcc tail calls. llvm-svn: 22046	2005-05-15 05:46:45 +00:00
Chris Lattner	7327c042b4	Add markers in the asm file for tail calls, add a new ADJSTACKPTRri sorta-pseudo-instruction llvm-svn: 22042	2005-05-15 03:10:37 +00:00
Chris Lattner	64232a8480	Yes, calltarget is the operand of the day. llvm-svn: 22040	2005-05-15 01:10:30 +00:00
Chris Lattner	adb31a99fa	When emitting the function epilog, check to see if there already a stack adjustment. If so, we merge the adjustment into the existing one. This allows us to generate: caller2: sub %ESP, 12 mov DWORD PTR [%ESP], 0 mov %EAX, 1234567890 mov %EDX, 0 call func2 add %ESP, 8 ret 4 intead of: caller2: sub %ESP, 12 mov DWORD PTR [%ESP], 0 mov %EAX, 1234567890 mov %EDX, 0 call func2 sub %ESP, 4 add %ESP, 12 ret 4 for X86/fast-cc-merge-stack-adj.ll llvm-svn: 22038	2005-05-14 23:53:43 +00:00
Chris Lattner	37e226fa9b	Add some new instructions llvm-svn: 22036	2005-05-14 23:35:21 +00:00
Chris Lattner	9d106c705d	Pass i64 values correctly split in reg/mem to fastcc calls. This fixes fourinarow with -enable-x86-fastcc. llvm-svn: 22022	2005-05-14 12:03:10 +00:00
Chris Lattner	adde8b1a71	Use target-specific nodes for calls. This allows the fastcc code to not have to do ugly hackery to avoid emitting code like this: call foo mov vreg, EAX adjcallstackup ... If foo is a fastcc call and if vreg gets spilled, we might end up with this: call foo mov [ESP+offset], EAX ;; Offset doesn't consider the 12! sub ESP, 12 Which is bad. The previous hacky code to deal with this was A) gross B) not good enough. In particular, it could miss cases and emit the bad code above. Now we always emit this: call foo adjcallstackup ... mov vreg, EAX directly. This makes fastcc with callees poping the stack work much better. Next stop (finally!) really is tail calls. llvm-svn: 22021	2005-05-14 08:48:15 +00:00
Chris Lattner	c0c6680744	use a target-specific node and custom expander to lower long->FP to FILD64m. This should fix some missing symbols problems on BSD and improve performance of programs that use that operation. llvm-svn: 22012	2005-05-14 06:52:07 +00:00
Chris Lattner	2210f7d6e9	Make sure the start of the arg area and the end (after the RA is pushed) is always 8-byte aligned for fastcc llvm-svn: 21995	2005-05-13 23:49:10 +00:00
Chris Lattner	1634435c77	fix typo llvm-svn: 21991	2005-05-13 22:46:57 +00:00
Chris Lattner	0d4b08e470	Fix the problems with callee popped argument lists llvm-svn: 21988	2005-05-13 22:13:49 +00:00
Chris Lattner	e667c34ef1	Don't emit SAR X, 0 in the case of sdiv Y, 2 llvm-svn: 21986	2005-05-13 21:50:27 +00:00
Chris Lattner	fc630bb4f0	Fix UnitTests/2005-05-13-SDivTwo.c llvm-svn: 21985	2005-05-13 21:48:20 +00:00
Chris Lattner	62593e4e66	switch to having the callee pop stack operands for fastcc. This is currently buggy do not use llvm-svn: 21984	2005-05-13 21:44:04 +00:00
Chris Lattner	e9944b033d	allow RETI llvm-svn: 21980	2005-05-13 20:46:35 +00:00
Chris Lattner	2c9d871f9b	Build TAILCALL nodes in LowerCallTo, treat them like normal calls everywhere. llvm-svn: 21976	2005-05-13 20:29:13 +00:00
Chris Lattner	9d788e93a6	Add an isTailCall flag to LowerCallTo llvm-svn: 21958	2005-05-13 18:50:42 +00:00
Chris Lattner	83d7e55471	add 'ret imm' instruction llvm-svn: 21945	2005-05-13 17:56:48 +00:00
Chris Lattner	7c3b219c28	Do not CopyFromReg physregs for live-in values. Instead, create a vreg for each live in, and copy the regs from the vregs. As the very first thing we do in the function, insert copies from the pregs to the vregs. This fixes problems where the token chain of CopyFromReg was not enough to allow reordering of the copyfromreg nodes and other unchained nodes (e.g. div, which clobbers eax on intel). llvm-svn: 21932	2005-05-13 07:38:09 +00:00
Chris Lattner	094bbfcebb	rename the ADJCALLSTACKDOWN/ADJCALLSTACKUP nodes to be CALLSEQ_START/BEGIN. llvm-svn: 21915	2005-05-12 23:24:06 +00:00
Chris Lattner	3106dfa185	Add a new -enable-x86-fastcc option that enables passing the first two integer values in registers for the fastcc calling conv. llvm-svn: 21912	2005-05-12 23:06:28 +00:00
Chris Lattner	7e08dd591c	Pass in Calling Convention to use into LowerCallTo llvm-svn: 21899	2005-05-12 19:56:45 +00:00
Chris Lattner	d8766cdae2	Enable pattern isel by default llvm-svn: 21898	2005-05-12 19:56:09 +00:00
Chris Lattner	e358ac532b	X86 has more than just 32-bit registers llvm-svn: 21857	2005-05-11 05:00:34 +00:00
Chris Lattner	8230bddde2	Convert feature of the simple isel over for the pattern isel to use. llvm-svn: 21840	2005-05-10 03:53:18 +00:00
Jeff Cohen	afc58006b7	Silence some VC++ warnings llvm-svn: 21838	2005-05-10 02:22:38 +00:00
Chris Lattner	d96aea21d7	Implement READPORT/WRITEPORT, implementing the last X86 regression tests that were failing with the pattern selector. Note that the support that existed in the simple selector was clearly broken in several ways though (which has also been fixed). llvm-svn: 21831	2005-05-09 21:17:38 +00:00
Chris Lattner	6a55b1d4dd	do not emit illegal instructions llvm-svn: 21830	2005-05-09 21:06:04 +00:00
Chris Lattner	7ba0699b05	Fix the syntax of the i/o instructions, these are obviously unused. llvm-svn: 21829	2005-05-09 20:49:20 +00:00
Chris Lattner	46b51ab388	legalize readio/writeio into load/stores, fixing CodeGen/X86/io.llx with the pattern isel. llvm-svn: 21828	2005-05-09 20:37:29 +00:00
Chris Lattner	b28f865865	restore some non-dead code I removed last night breaking double casts to uint llvm-svn: 21821	2005-05-09 18:37:02 +00:00
Chris Lattner	3094cec3c9	Wrap long lines, remove dead code that is now handled by legalize llvm-svn: 21811	2005-05-09 05:40:26 +00:00
Chris Lattner	5d291fa443	Fix FP -> bool casts llvm-svn: 21810	2005-05-09 05:33:18 +00:00
Chris Lattner	65d61d9d44	Fix X86/2005-05-08-FPStackifierPHI.ll: ugly gross hack. llvm-svn: 21801	2005-05-09 03:36:39 +00:00
Andrew Lenharth	8e2beec4d1	fix typo llvm-svn: 21693	2005-05-04 19:25:37 +00:00
Andrew Lenharth	8b64bd0fd5	Implement count leading zeros (ctlz), count trailing zeros (cttz), and count population (ctpop). Generic lowering is implemented, however only promotion is implemented for SelectionDAG at the moment. More coming soon. llvm-svn: 21676	2005-05-03 17:19:30 +00:00
Chris Lattner	15d29b0220	Add support for FSIN/FCOS when unsafe math ops are enabled. Patch contributed by Morten Ofstad! llvm-svn: 21632	2005-04-30 04:25:35 +00:00
Chris Lattner	663664d10c	Add support for llvm.sqrt and sin/cos if unsafe math optimizations are enabled. llvm-svn: 21631	2005-04-30 04:12:40 +00:00
Chris Lattner	27a534f181	Add support for FSQRT node, patch contributed by Morten Ofstad llvm-svn: 21610	2005-04-28 22:07:18 +00:00
Chris Lattner	236cef3563	Add some new X86 instrs, patch contributed by Morten Ofstad llvm-svn: 21608	2005-04-28 21:50:05 +00:00
Chris Lattner	2f7a83ffbf	Codegen fabs/fabsf as FABS. Patch contributed by Morten Ofstad llvm-svn: 21607	2005-04-28 21:48:42 +00:00
Andrew Lenharth	2a00530fa7	Implement Value* tracking for loads and stores in the selection DAG. This enables one to use alias analysis in the backends. (TRUNK)Stores and (EXT\|ZEXT\|SEXT)Loads have an extra SDOperand which is a SrcValueSDNode which contains the Value. Note that if the operation is introduced by the backend, it will still have the operand, but the value will be null. llvm-svn: 21599	2005-04-27 20:10:01 +00:00
Misha Brukman	bf3f6181fd	* Remove trailing whitespace * Convert tabs to spaces llvm-svn: 21426	2005-04-21 23:38:14 +00:00
Chris Lattner	8d813a7d51	Handle stores of global address as stores of immediates. Instead of: test1: movl $N, %eax movl %eax, G ret emit: test1: movl $N, G ret llvm-svn: 21407	2005-04-21 19:11:03 +00:00
Chris Lattner	706775467b	Handle (store &GV -> mem) as a store immediate. This often occurs for printf format strings and other stuff. Instead of generating this: movl $l1__2E_str_1, %eax movl %eax, (%esp) we now emit: movl $l1__2E_str_1, (%esp) llvm-svn: 21406	2005-04-21 19:03:24 +00:00
Nate Begeman	ecb5b5c028	Make pattern isel default for ppc Add new ppc beta option related to using condition registers Make pattern isel control flag (-enable-pattern-isel) global and tristate 0 == off 1 == on 2 == target default llvm-svn: 21309	2005-04-15 22:12:16 +00:00
Chris Lattner	85c6a7bed0	Fix some mysteriously missing {}'s which cause the miscompilation of Olden/mst, Ptrdist/bc, Obsequi, etc. llvm-svn: 21274	2005-04-13 03:29:53 +00:00
Chris Lattner	f25fefd9cf	Z_E_I is gone llvm-svn: 21267	2005-04-13 02:39:05 +00:00
Chris Lattner	fd4d6e0847	Use live out sets for return values instead of imp_defs, which is cleaner and faster. llvm-svn: 21181	2005-04-09 15:23:56 +00:00
Chris Lattner	39f963f968	This target does not support/want ISD::BRCONDTWOWAY llvm-svn: 21164	2005-04-09 03:22:37 +00:00
Chris Lattner	583dcc66f2	X86 zero extends setcc results llvm-svn: 21146	2005-04-07 19:41:46 +00:00
Chris Lattner	40ee8a062b	Fix SingleSource/Regression/C/2005-05-06-LongLongSignedShift.c, we were not properly sign extending the top of the result of a 64-bit shift right by a constant > 32. llvm-svn: 21120	2005-04-06 20:59:35 +00:00
Chris Lattner	c87ff88efb	Add (untested) support for MULHS and MULHU. llvm-svn: 21107	2005-04-06 04:21:07 +00:00
Chris Lattner	ba7cdbebb1	add signed versions of the extra precision multiplies llvm-svn: 21106	2005-04-06 04:19:22 +00:00
Chris Lattner	8f30d63c1a	add support for FABS and FNEG llvm-svn: 21015	2005-04-02 05:30:17 +00:00
Chris Lattner	a5d4718875	This target doesn't support fabs/fneg yet. llvm-svn: 21010	2005-04-02 05:03:24 +00:00
Chris Lattner	71434aa2dd	add an fabs instr llvm-svn: 21006	2005-04-02 04:31:56 +00:00
Chris Lattner	8ee783d9f0	Add support for 64-bit shifts. llvm-svn: 21005	2005-04-02 04:01:14 +00:00
Chris Lattner	67da3fdb70	Add support for ISD::UNDEF to the X86 be llvm-svn: 20990	2005-04-01 22:46:45 +00:00
Chris Lattner	46c4246df1	don't depend on the cfg being set up yet llvm-svn: 20936	2005-03-30 01:10:00 +00:00
Nate Begeman	f821401825	Change interface to LowerCallTo to take a boolean isVarArg argument. llvm-svn: 20842	2005-03-26 01:29:23 +00:00
Chris Lattner	ad07b1bc54	eliminate dead variables, patch contributed by Gabor Greif! llvm-svn: 20812	2005-03-24 17:32:20 +00:00
Nate Begeman	175a9f1cc6	Remove comments that are now meaningless from the pattern ISels, at Chris's request. llvm-svn: 20804	2005-03-24 04:39:54 +00:00
Chris Lattner	79ba9d58fd	Don't emit two comparisons when comparing a FP value against zero! llvm-svn: 20651	2005-03-17 16:29:26 +00:00
Chris Lattner	c9a3ea81bf	Fix the missing symbols problem Bill was hitting. Patch contributed by Bill Wendling!! llvm-svn: 20649	2005-03-17 15:38:16 +00:00
Chris Lattner	4b688a1c70	This mega patch converts us from using Function::a{iterator\|begin\|end} to using Function::arg_{iterator\|begin\|end}. Likewise Module::g* -> Module::global_*. This patch is contributed by Gabor Greif, thanks! llvm-svn: 20597	2005-03-15 04:54:21 +00:00
Reid Spencer	b0ca4aa8cd	Patch to make assembly output compatible with mingw compilation (identical to cygwin) llvm-svn: 20520	2005-03-08 17:02:05 +00:00
Chris Lattner	a024984017	Fix spelling, patch contributed by Gabor Greif! llvm-svn: 20343	2005-02-27 06:18:25 +00:00
Chris Lattner	9838ab1271	Silence some uninit variable warnings. llvm-svn: 20284	2005-02-23 05:57:21 +00:00
Chris Lattner	ab92b92bc5	We can fold promoted and non-promoted loads into divs also! llvm-svn: 19835	2005-01-25 20:35:10 +00:00
Chris Lattner	a9a0369879	Fold promoted loads into binary ops for FP, allowing us to generate m32 forms of FP ops. llvm-svn: 19834	2005-01-25 20:03:11 +00:00
Chris Lattner	6ff85c3152	Silence a warning. llvm-svn: 19798	2005-01-23 23:20:06 +00:00
Chris Lattner	94952e0947	Allow the FP stackifier to completely ignore functions that do not use FP at all. This should speed up the X86 backend fairly significantly on integer codes. Now if only we didn't have to compute livevar still... ;-) llvm-svn: 19796	2005-01-23 23:13:59 +00:00
Reid Spencer	5c7b6e83f0	Support Cygwin assembly generation. The cygwin version of Gnu ASsembler doesn't support certain directives and symbols on cygwin are prefixed with an underscore. This patch makes the necessary adjustments to the output. llvm-svn: 19775	2005-01-23 03:52:14 +00:00
Chris Lattner	b4cf4ffb04	Speed up folding operations into loads. llvm-svn: 19733	2005-01-21 21:43:02 +00:00
Chris Lattner	fd4d7f71ae	The ever-important vanity pass name :) llvm-svn: 19731	2005-01-21 21:35:14 +00:00
Chris Lattner	5f2fbeaa69	Fix a FIXME: realize that argument stores are all independent (don't alias) llvm-svn: 19728	2005-01-21 19:46:38 +00:00
Chris Lattner	febeb380ae	Implement ADD_PARTS/SUB_PARTS so that 64-bit integer add/sub work. This fixes most of the remaining llc-beta failures. llvm-svn: 19716	2005-01-20 18:53:00 +00:00
Chris Lattner	8b0a2a3251	Fix a crash compiling 134.perl. llvm-svn: 19711	2005-01-20 16:50:16 +00:00
Chris Lattner	6534e1ede3	Fix a problem where were were literally selecting for INCREASED register pressure, not decreases register pressure. Fix problem where we accidentally swapped the operands of SHLD, which caused fourinarow to fail. This fixes fourinarow. llvm-svn: 19697	2005-01-19 17:24:34 +00:00
Chris Lattner	b75589131d	When commuting these instructions, make sure to actually swap the operands too. llvm-svn: 19694	2005-01-19 16:55:52 +00:00
Chris Lattner	fde1a5688b	Implement Regression/CodeGen/X86/rotate.ll: emit rotate instructions (which typically cost 1 cycle) instead of shld/shrd instruction (which are typically 6 or more cycles). This also saves code space. For example, instead of emitting: rotr: mov %EAX, DWORD PTR [%ESP + 4] mov %CL, BYTE PTR [%ESP + 8] shrd %EAX, %EAX, %CL ret rotli: mov %EAX, DWORD PTR [%ESP + 4] shrd %EAX, %EAX, 27 ret Emit: rotr32: mov %CL, BYTE PTR [%ESP + 8] mov %EAX, DWORD PTR [%ESP + 4] ror %EAX, %CL ret rotli32: mov %EAX, DWORD PTR [%ESP + 4] ror %EAX, 27 ret We also emit byte rotate instructions which do not have a sh[lr]d counterpart at all. llvm-svn: 19692	2005-01-19 08:07:05 +00:00
Chris Lattner	34757ff939	Add rotate instructions. llvm-svn: 19690	2005-01-19 07:50:03 +00:00
Chris Lattner	e539ce8223	Match 16-bit shld/shrd instructions as well, implementing shift-double.llx:test5 llvm-svn: 19689	2005-01-19 07:37:26 +00:00
Chris Lattner	9d5ee289d7	Improve coverage of the X86 instruction set by adding 16-bit shift doubles. llvm-svn: 19687	2005-01-19 07:31:24 +00:00
Chris Lattner	c03f360215	Teach the code generator that shrd/shld is commutable if it has an immediate. This allows us to generate this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %EDX, DWORD PTR [%ESP + 8] shld %EDX, %EDX, 2 shl %EAX, 2 ret instead of this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] mov %EDX, %EAX shrd %EDX, %ECX, 30 shl %EAX, 2 ret Note the magically transmogrifying immediate. llvm-svn: 19686	2005-01-19 07:11:01 +00:00
Chris Lattner	575e912fcf	Codegen long >> 2 to this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %EDX, DWORD PTR [%ESP + 8] shrd %EAX, %EDX, 2 sar %EDX, 2 ret instead of this: test1: mov %ECX, DWORD PTR [%ESP + 4] shr %ECX, 2 mov %EDX, DWORD PTR [%ESP + 8] mov %EAX, %EDX shl %EAX, 30 or %EAX, %ECX sar %EDX, 2 ret and long << 2 to this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] * mov %EDX, %EAX shrd %EDX, %ECX, 30 shl %EAX, 2 ret instead of this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, %EAX shr %ECX, 30 mov %EDX, DWORD PTR [%ESP + 8] shl %EDX, 2 or %EDX, %ECX shl %EAX, 2 ret The extra copy (marked *) can be eliminated when I teach the code generator that shrd32rri8 is really commutative. llvm-svn: 19681	2005-01-19 06:18:43 +00:00
Chris Lattner	419a5d213b	X86 shifts mask the amount. llvm-svn: 19678	2005-01-19 03:36:30 +00:00
Chris Lattner	6dec8cb829	Code to handle FP_EXTEND is dead now. X86 doesn't support any data types to FP_EXTEND from! llvm-svn: 19674	2005-01-18 20:05:56 +00:00
Chris Lattner	798e9c85d6	Remove more dead code. llvm-svn: 19673	2005-01-18 19:50:08 +00:00
Chris Lattner	401814508f	The selection dag code handles the promotions from F32 to F64 for us, so we don't need to even think about F32 in the X86 code anymore. llvm-svn: 19672	2005-01-18 19:46:54 +00:00
Chris Lattner	dc09e52b3e	Fix 124.m88ksim. llvm-svn: 19667	2005-01-18 17:35:28 +00:00
Chris Lattner	a04b1ee7a8	Do not emit loads multiple times, potentially in the wrong places. llvm-svn: 19661	2005-01-18 04:18:32 +00:00
Chris Lattner	722ddeb86e	Eliminate bad assertions. llvm-svn: 19659	2005-01-18 04:00:54 +00:00
Chris Lattner	8f3a8d96e2	* Eliminate the TokenSet and just use the ExprMap for both tokens and values. * Insert some really pedantic assertions that will notice when we emit the same loads more than one time, exposing bugs. This turns a miscompilation in bzip2 into a compile-fail. yaay. llvm-svn: 19658	2005-01-18 03:51:59 +00:00
Chris Lattner	b3edb09ede	Rely on the code in MatchAddress to do this work. Otherwise we fail to match (X+Y)+(Z << 1), because we match the X+Y first, consuming the index register, then there is no place to put the Z. llvm-svn: 19652	2005-01-18 02:25:52 +00:00
Chris Lattner	ce2e0125dc	Fix a problem where probing for addressing modes caused expressions to be emitted too early. In particular, this fixes Regression/CodeGen/X86/regpressure.ll:regpressure3. This also improves the 2nd basic block in 164.gzip:flush_block, which went from .LBBflush_block_1: # loopentry.1.i movzx %EAX, WORD PTR [dyn_ltree + 20] movzx %ECX, WORD PTR [dyn_ltree + 16] mov DWORD PTR [%ESP + 32], %ECX movzx %ECX, WORD PTR [dyn_ltree + 12] movzx %EDX, WORD PTR [dyn_ltree + 8] movzx %EBX, WORD PTR [dyn_ltree + 4] mov DWORD PTR [%ESP + 36], %EBX movzx %EBX, WORD PTR [dyn_ltree] add DWORD PTR [%ESP + 36], %EBX add %EDX, DWORD PTR [%ESP + 36] add %ECX, %EDX add DWORD PTR [%ESP + 32], %ECX add %EAX, DWORD PTR [%ESP + 32] movzx %ECX, WORD PTR [dyn_ltree + 24] add %EAX, %ECX mov %ECX, 0 mov %EDX, %ECX to .LBBflush_block_1: # loopentry.1.i movzx %EAX, WORD PTR [dyn_ltree] movzx %ECX, WORD PTR [dyn_ltree + 4] add %ECX, %EAX movzx %EAX, WORD PTR [dyn_ltree + 8] add %EAX, %ECX movzx %ECX, WORD PTR [dyn_ltree + 12] add %ECX, %EAX movzx %EAX, WORD PTR [dyn_ltree + 16] add %EAX, %ECX movzx %ECX, WORD PTR [dyn_ltree + 20] add %ECX, %EAX movzx %EAX, WORD PTR [dyn_ltree + 24] add %ECX, %EAX mov %EAX, 0 mov %EDX, %EAX ... which results in less spilling in the function. This change alone speeds up 164.gzip from 37.23s to 36.24s on apoc. The default isel takes 37.31s. llvm-svn: 19650	2005-01-18 01:06:26 +00:00
Chris Lattner	a78f9ced61	Fix indentation. llvm-svn: 19649	2005-01-17 23:25:45 +00:00
Chris Lattner	dff1e3e86f	Don't bother using max here. llvm-svn: 19647	2005-01-17 23:02:13 +00:00
Chris Lattner	2d86b43318	Do not give token factor nodes outrageous weights llvm-svn: 19645	2005-01-17 22:56:09 +00:00
Chris Lattner	f2878ce8ba	Two changes: 1. Fold [mem] += (1\|-1) into inc [mem]/dec [mem] to save some icache space. 2. Do not let token factor nodes prevent forming '[mem] op= val' folds. llvm-svn: 19643	2005-01-17 22:10:42 +00:00
Chris Lattner	40c0fca632	Refactor load/op/store folding into it's own method, no functionality changes. llvm-svn: 19641	2005-01-17 19:25:26 +00:00
Chris Lattner	2348abc421	Fix a major regression last night that prevented us from producing [mem] op= reg operations. The body of the if is less indented but unmodified in this patch. llvm-svn: 19638	2005-01-17 17:49:14 +00:00
Chris Lattner	adb669ab1f	Codegen this: int %foo(int %X) { %T = add int %X, 13 %S = mul int %T, 3 ret int %S } as this: mov %ECX, DWORD PTR [%ESP + 4] lea %EAX, DWORD PTR [%ECX + 2*%ECX + 39] ret instead of this: mov %ECX, DWORD PTR [%ESP + 4] mov %EAX, %ECX add %EAX, 13 imul %EAX, %EAX, 3 ret llvm-svn: 19633	2005-01-17 06:48:02 +00:00
Chris Lattner	51590b615c	Fix test/Regression/CodeGen/X86/2005-01-17-CycleInDAG.ll and 132.ijpeg. Do not fold a load into an operation if it will induce a cycle in the DAG. Repeat after me: dAg. llvm-svn: 19631	2005-01-17 06:26:58 +00:00
Chris Lattner	f1e85bec5a	Do not fold a load into a comparison that is used by more than one place. The comparison will probably be folded, so this is not ok to do. This fixed 197.parser. llvm-svn: 19624	2005-01-17 01:34:14 +00:00
Chris Lattner	1b8c8fe020	Do not codegen 'xor bool, true' as 'not reg'. not reg inverts the upper bits of the bytereg. This fixes yacr2, 300.twolf and probably others. llvm-svn: 19622	2005-01-17 00:23:16 +00:00
Chris Lattner	46dac4394c	Set up the shift and setcc types. If we emit a load because we followed a token chain to get to it, try to fold it into its single user if possible. llvm-svn: 19620	2005-01-17 00:00:33 +00:00
Chris Lattner	9ffc59287e	* Adjust to changes in TargetLowering interfaces. * Remove custom promotion for bool and byte select ops. Legalize now promotes them for us. * Allow folding ConstantPoolIndexes into EXTLOAD's, useful for float immediates. * Declare which operations are not supported better. * Add some hacky code for TRUNCSTORE to pretend that we have truncstore for i16 types. This is useful for testing promotion code because I can just remove 16-bit registers all together and verify that programs work. llvm-svn: 19614	2005-01-16 07:34:08 +00:00
Chris Lattner	f3d950e816	Add support for truncstore and *extload. llvm-svn: 19566	2005-01-15 05:22:24 +00:00
Chris Lattner	27c91fac94	Adjust to CopyFromREg changes. llvm-svn: 19561	2005-01-14 22:37:41 +00:00
Chris Lattner	7a8788c9ac	Add new ImplicitDef node, rename CopyRegSDNode class to RegSDNode. llvm-svn: 19535	2005-01-13 20:50:02 +00:00
Chris Lattner	fce6a5439d	Codegen factor nodes more intelligently according to perceived register pressure. llvm-svn: 19532	2005-01-13 19:56:00 +00:00
Chris Lattner	cb4359465a	Initial trivial (but stupid) codegen for this node. llvm-svn: 19529	2005-01-13 18:01:36 +00:00
Chris Lattner	9a70166615	Add some really pedantic assertions to the load folding code. Fix a bunch of cases where we accidentally emitted a load folded once and unfolded elsewhere. llvm-svn: 19522	2005-01-13 05:53:16 +00:00
Chris Lattner	2ab70aafe0	We can only fold a load into an op if there is exactly one use of the value. Checking to see if the load has two uses is not equivalent, as the chain value may have zero uses. llvm-svn: 19518	2005-01-12 18:38:26 +00:00
Chris Lattner	4b03f0f99e	Try both ways to fold an add together. This allows us to generate this code imul %EAX, %EAX, 400 add %ECX, %EAX add %ESI, DWORD PTR [%ECX + 4*%EDX] inc %EDX cmp %EDX, 100 instead of this: imul %EAX, %EAX, 400 add %ECX, %EAX mov %EAX, %EDX shl %EAX, 2 add %ECX, %EAX add %ESI, DWORD PTR [%ECX] inc %EDX cmp %EDX, 100 llvm-svn: 19513	2005-01-12 18:08:53 +00:00
Chris Lattner	61c572eb7f	Fix a major miscompilation where we were overwriting the scale reg. llvm-svn: 19511	2005-01-12 07:33:20 +00:00
Chris Lattner	5816f1a302	Do not use the type of the RHS constant to determine the type of the operation. This fails for shifts because the constant is always 8 bits. llvm-svn: 19508	2005-01-12 05:22:07 +00:00
Chris Lattner	89d6b21ae6	Do not lose the offset from teh global when peephole optimizing instructions. This fixes FreeBench/pcompress llvm-svn: 19507	2005-01-12 05:17:28 +00:00
Jeff Cohen	614a5ec22a	Fix C++ more compilatiom errors llvm-svn: 19504	2005-01-12 04:29:05 +00:00
Chris Lattner	5ef92f3a40	Fix a compile error with VC++, which things that static const arrays need to be dynamically initialized. :( llvm-svn: 19503	2005-01-12 04:23:22 +00:00
Chris Lattner	627c64e5e5	Fix a bug that caused us to crash on povray. We weren't emitting an FP_REG_KILL into a block that had a successor with a FP PHI node. llvm-svn: 19502	2005-01-12 04:21:28 +00:00
Chris Lattner	a5f0ba59a0	Print a load of a null pointer (in intel mode) like this: mov %AX, WORD PTR [0] instead of like this: mov %AX, WORD PTR [] llvm-svn: 19501	2005-01-12 04:07:11 +00:00
Chris Lattner	360988bae2	Print a load of a null pointer like this: movw 0, %ax instead of like this: movw , %ax llvm-svn: 19500	2005-01-12 04:05:19 +00:00
Chris Lattner	3c85c67c97	Fix a crash compiling povray on UINT_TO_FP from i16. llvm-svn: 19499	2005-01-12 04:00:00 +00:00
Chris Lattner	4e72a2a000	There are no [mem] op= reg instructions for FP, so remove their entries. llvm-svn: 19496	2005-01-12 03:16:09 +00:00
Chris Lattner	00cb0ace9b	Fix a bug where we didn't insert FP_REG_KILL instructions into MBB's that contain FP PHI nodes but no other FP defining instructions. This fixes 183.equake llvm-svn: 19495	2005-01-12 02:57:10 +00:00
Chris Lattner	92166ed1df	Fold TRUNCATE (LOAD P) into a smaller load from P. llvm-svn: 19494	2005-01-12 02:19:06 +00:00
Chris Lattner	258b23bd9d	Be more careful about order of arg evalution for CopyToReg nodes. This shrinks 256.bzip2 from 7142 to 7103 lines of .s file. Second, add initial support for folding loads into compares, though this code is dynamically dead for now. :( llvm-svn: 19493	2005-01-12 02:02:48 +00:00
Chris Lattner	604416e8f4	Fold some more [mem] op= val operators. This allows us to things like this several times in 256.bzip2: mov %EAX, DWORD PTR [%ESP + 204] - mov %EAX, DWORD PTR [%EAX] - or %EAX, 2097152 - mov %ECX, DWORD PTR [%ESP + 204] - mov DWORD PTR [%ECX], %EAX + or DWORD PTR [%EAX], 2097152 llvm-svn: 19492	2005-01-12 01:28:00 +00:00
Chris Lattner	e83ae1063f	Fold loads into sign/zero extends. instead of: mov %AL, BYTE PTR [%EDX + l18_length_code] movzx %EAX, %AL Emit: movzx %EAX, BYTE PTR [%EDX + l18_length_code] llvm-svn: 19489	2005-01-11 23:33:00 +00:00
Chris Lattner	87a38bd4a8	Comment out debug code :) Select [mem] += Val operations. For constants, we used to get: mov %ECX, -32768 add %ECX, DWORD PTR [l4_match_start] mov DWORD PTR [l4_match_start], %ECX Now we get: add DWORD PTR [l4_match_start], -32768 For other values we used to get: mov %EBP, %EDI ;; because the add destroys the value add %EBP, DWORD PTR [l4_input_len] mov DWORD PTR [l4_input_len], %EBP now we get: add DWORD PTR [l4_input_len], %EDI Both of these use less registers than the alternative, are faster and smaller. llvm-svn: 19488	2005-01-11 23:21:30 +00:00
Chris Lattner	282473a25d	Handle the global address case here, not just the offset case. llvm-svn: 19487	2005-01-11 22:58:43 +00:00
Chris Lattner	9eb2cc700b	Treat int constants as not requiring a register, since they are almost always folded into an instruction. llvm-svn: 19486	2005-01-11 22:29:12 +00:00
Chris Lattner	7cb2220907	* Factor a bunch of binary operator cases into shared code. * Fold loads into Add, sub, and, or, xor and mul when possible. * Codegen shl X, 1 as add X, X llvm-svn: 19483	2005-01-11 21:19:59 +00:00
Chris Lattner	b838c9748e	Fold multiplies by 3,5,9 into addressing modes when possible. llvm-svn: 19480	2005-01-11 19:37:02 +00:00
Chris Lattner	e7b1130b01	Instead of generating stuff like this: mov %ECX, %EAX add %ECX, 32768 mov %SI, WORD PTR [2%ECX + l13_prev] Generate this: mov %SI, WORD PTR [2%ECX + l13_prev + 65536] This occurs when you have a GEP instruction where an index is "something + imm". llvm-svn: 19472	2005-01-11 06:36:20 +00:00
Chris Lattner	bb63a09cd1	Implement MEMCPY natively in terms of rep movs* llvm-svn: 19468	2005-01-11 06:19:26 +00:00
Chris Lattner	b2b08a8bc1	Implement memset -> rep stos* llvm-svn: 19467	2005-01-11 06:14:36 +00:00
Chris Lattner	58816a9e81	Announce that we don't support mem ops yet. llvm-svn: 19466	2005-01-11 05:57:36 +00:00
Chris Lattner	f867443d7e	Teach the address selector to make 'reg+reg' addressing modes. llvm-svn: 19457	2005-01-11 04:40:19 +00:00
Chris Lattner	edf06be50e	Emit NOT instructions. llvm-svn: 19455	2005-01-11 04:31:30 +00:00
Chris Lattner	4e4bef2d6c	Fix a bug emitting branches that broke a lot of programs. llvm-svn: 19452	2005-01-11 04:06:27 +00:00
Chris Lattner	4b51297a94	Be more careful where we set ContainsFPCode. We were missing a set in the int -> FP casting code. Note that we don't have to set it for FP operations that take FP values as operands: whatever produces the FP value will set the flag. llvm-svn: 19451	2005-01-11 03:50:45 +00:00
Chris Lattner	0c4c4094e3	Fix a major bug in setcc/cmov folding, where we accidentally inverted the sense of the comparison. llvm-svn: 19450	2005-01-11 03:37:59 +00:00
Chris Lattner	d188e03011	Take register pressure into account when we have to decide whether to evaluate the LHS or the RHS of an operation first. This causes good things to happen. For example, instead of compiling a loop to this: .LBBstrength_result7_1: # loopentry movl 16(%esp), %edi movl (%edi), %edi ;;; LOAD movl (%ecx), %ebx movl $2, (%eax,%ebx,4) movl (%edx), %ebx movl %esi, %ebp addl $21, %ebp addl $42, %esi cmpl $0, %edi ;;; USE cmovne %esi, %ebp cmpl %ebp, %ebx movl %ebp, %esi jg .LBBstrength_result7_1 We now compile it to this: .LBBstrength_result7_1: # loopentry movl %edi, %ebx addl $42, %ebx addl $21, %edi movl (%ecx), %ebp ;; LOAD cmpl $0, %ebp ;; USE cmovne %ebx, %edi movl (%edx), %ebx movl $2, (%eax,%ebx,4) movl (%esi), %ebx cmpl %edi, %ebx jg .LBBstrength_result7_1 Which reduces register pressure enough (in this case) to avoid spilling in the loop. As another example, consider the CodeGen/X86/regpressure.ll testcase. We used to generate this code for both cases: regpressure1: subl $32, %esp movl %esi, 12(%esp) movl %edi, 8(%esp) movl %ebx, 4(%esp) movl %ebp, (%esp) movl 36(%esp), %ecx movl (%ecx), %eax movl 4(%ecx), %edx movl %edx, 24(%esp) movl 8(%ecx), %edx movl %edx, 16(%esp) movl 12(%ecx), %edx movl 16(%ecx), %esi movl 20(%ecx), %edi movl 24(%ecx), %ebx movl %ebx, 28(%esp) movl 28(%ecx), %ebx movl 32(%ecx), %ebp movl %ebp, 20(%esp) movl 36(%ecx), %ecx imull 24(%esp), %eax imull 16(%esp), %eax imull %edx, %eax imull %esi, %eax imull %edi, %eax imull 28(%esp), %eax imull %ebx, %eax imull 20(%esp), %eax imull %ecx, %eax movl (%esp), %ebp movl 4(%esp), %ebx movl 8(%esp), %edi movl 12(%esp), %esi addl $32, %esp ret This code is basically trying to do all of the loads first, then execute all of the multiplies. Because we run out of registers, lots of spill code happens. We now generate this code for both cases: regpressure1: movl 4(%esp), %ecx movl (%ecx), %eax movl 4(%ecx), %edx imull %edx, %eax movl 8(%ecx), %edx imull %edx, %eax movl 12(%ecx), %edx imull %edx, %eax movl 16(%ecx), %edx imull %edx, %eax movl 20(%ecx), %edx imull %edx, %eax movl 24(%ecx), %edx imull %edx, %eax movl 28(%ecx), %edx imull %edx, %eax movl 32(%ecx), %edx imull %edx, %eax movl 36(%ecx), %ecx imull %ecx, %eax ret which is much nicer (when we fold loads into the muls it will be even better). The old instruction selector used to produce the good code for regpressure1 but not for regpressure2, as it depended on the order of operations in the LLVM code. llvm-svn: 19449	2005-01-11 03:11:44 +00:00
Chris Lattner	497e24c885	Fold setcc instructions into selects. llvm-svn: 19438	2005-01-10 22:10:13 +00:00
Chris Lattner	65d007ab62	Add conditional moves for the parity flag. llvm-svn: 19437	2005-01-10 22:09:33 +00:00
Chris Lattner	d61491dea2	Implement 8-bit multiply for X86. llvm-svn: 19435	2005-01-10 20:55:48 +00:00
Chris Lattner	fcab5f75c0	Codegen (Reg\|imm)+&GV as an LEA, because we cannot put it into the immediate field of an ADDri (due to current restrictions on MachineOperand :( ). This allows us to generate: leal Data+16000, %edx instead of: movl $Data, %edx addl $16000, %edx llvm-svn: 19420	2005-01-09 20:20:29 +00:00
Chris Lattner	35375c11bf	Fix copy and pasto's for FP -> Int. This fixes fldry llvm-svn: 19418	2005-01-09 19:49:59 +00:00
Chris Lattner	45155a3dee	Initial implementation of FP->INT and INT->FP casts Also, fix zero_extend from bool to i8, which fixes Shootout/objinst. llvm-svn: 19414	2005-01-09 18:52:44 +00:00
Chris Lattner	9ca9b20447	Fix a subtle bug involving constant expr casts from int to fp llvm-svn: 19410	2005-01-09 01:49:29 +00:00
Chris Lattner	c5e53c07fd	Implement varargs and returnaddress/frameaddress intrinsics. With this patch, all of SingleSource/UnitTests passes. llvm-svn: 19408	2005-01-09 00:01:27 +00:00
Chris Lattner	ca81756527	Okay 15th time is the charm. Looking at the vector size is useless as it gets clobbered by a previous statement. This fixes all calls finally. llvm-svn: 19399	2005-01-08 20:51:36 +00:00
Chris Lattner	85816cff9a	Okay, my off by one was actually off by two. This fixes Generic/2003-07-07-BadLongConst.ll llvm-svn: 19398	2005-01-08 20:39:31 +00:00
Chris Lattner	2d68cb6cf4	Fix off by one error llvm-svn: 19396	2005-01-08 20:31:34 +00:00
Chris Lattner	c4d075cfa3	Adjust to changes in LowerCallTo interface Minor bugfixes llvm-svn: 19376	2005-01-08 19:28:19 +00:00
Chris Lattner	6c7d3bd8ea	Wrap long line. llvm-svn: 19367	2005-01-08 06:59:50 +00:00
Chris Lattner	473ec492f7	The X86 instruction selector already handles codegen of: store float 123.45, float* %P as an integer store. This adds handling of float immediate stores as integers for arguments passed function calls. This is now tested by CodeGen/X86/store-fp-constant.ll llvm-svn: 19364	2005-01-08 05:45:24 +00:00
Chris Lattner	2c398fc8f6	Allow the selection-dag based selector to be diabled with -disable-pattern-isel. For now, this is the default, as the current selector is missing some big pieces. To enable the new selector, pass -disable-pattern-isel=false to llc or lli. llvm-svn: 19335	2005-01-07 07:50:50 +00:00
Chris Lattner	216198574d	Reimplementation of the X86 pattern isel. This is still missing many large pieces, but can already do amazing things in some cases. llvm-svn: 19334	2005-01-07 07:49:41 +00:00
Chris Lattner	74019f517a	This file is now dead. llvm-svn: 19333	2005-01-07 07:49:05 +00:00
Chris Lattner	079b497982	Add a new prototype llvm-svn: 19332	2005-01-07 07:48:33 +00:00
Chris Lattner	608dd77d6b	Codegen -1 and -0.0 more efficiently. This implements CodeGen/X86/negatize_zero.ll llvm-svn: 19313	2005-01-06 21:19:16 +00:00
Chris Lattner	6d651234d6	1. If a double FP constant must be put into a constant pool, but it can be precisely represented as a float, put it into the constant pool as a float. 2. Use the cbw/cwd/cdq instructions instead of an explicit SAR for signed division. llvm-svn: 19291	2005-01-05 16:30:14 +00:00
Chris Lattner	b438f5251f	Minor optimization to allocate R8 registers in a better order. llvm-svn: 19289	2005-01-05 16:09:16 +00:00
Jeff Cohen	36968ed8c1	Revert elimination of global variable hack... still needed. llvm-svn: 19273	2005-01-03 16:34:19 +00:00
Chris Lattner	1aaf8cccb2	ADC and IMUL are also commutable. llvm-svn: 19264	2005-01-03 01:27:59 +00:00
Jeff Cohen	1087b72875	Eliminate the use of the global variable hack in the X86 target that was used to get Visual Studio to link in X86.lib to the executables that need it. There is another way of doing it. llvm-svn: 19252	2005-01-02 04:23:12 +00:00
Chris Lattner	a78fd4726e	Disable 2->3 address promotion of add and inc instructions to LEA's. In addition to being three address, LEA's don't set the flags. This fixes 186.crafty. llvm-svn: 19251	2005-01-02 04:18:17 +00:00
Chris Lattner	3ef32da6c3	Add a new method. llvm-svn: 19249	2005-01-02 02:38:18 +00:00
Chris Lattner	95f1e628ed	Add support for SETNPr to lower to memory form. llvm-svn: 19248	2005-01-02 02:37:46 +00:00
Chris Lattner	d6bc921fa8	Implement the convertToThreeAddress method, add support for inverting JP/JNP branches. llvm-svn: 19247	2005-01-02 02:37:07 +00:00
Chris Lattner	0d6f03e52b	Two changes here: 1. Add new instructions for checking parity flags: JP, JNP, SETP, SETNP. 2. Set the isCommutable and isPromotableTo3Address bits on several instructions. llvm-svn: 19246	2005-01-02 02:35:46 +00:00
Chris Lattner	3b78513843	Remove unused enum value llvm-svn: 19024	2004-12-17 22:41:46 +00:00
Chris Lattner	d11ba51208	Change the sentinal llvm-svn: 19007	2004-12-17 00:46:51 +00:00
Chris Lattner	59d0c02d2b	Create a stack slot for the return address lazily instead of eagerly. This save small amounts of time for functions that don't call llvm.returnaddress or llvm.frameaddress (which is almost all functions). llvm-svn: 19006	2004-12-17 00:07:46 +00:00
Chris Lattner	4b1d58bf4b	Adjust to changes in asmwriter filenames llvm-svn: 18987	2004-12-16 17:33:24 +00:00
Chris Lattner	4136428410	Set the rounding mode for the X86 FPU to 64-bits instead of 80-bits. We don't support long double anyway, and this gives us FP results closer to other targets. This also speeds up 179.art from 41.4s to 18.32s, by eliminating a problem with extra precision that causes an FP == comparison to fail (leading to extra loop iterations). llvm-svn: 18895	2004-12-13 17:23:11 +00:00
Chris Lattner	6131b06f73	Use the target triple to pick this target. llvm-svn: 18830	2004-12-12 17:40:28 +00:00
Chris Lattner	99c8cf8ef8	Fix a regression caused by the previous patch llvm-svn: 18449	2004-12-03 05:13:15 +00:00
Chris Lattner	316f923a9c	Spill/restore X86 floating point stack registers with 64-bits of precision instead of 80-bits of precision. This fixes PR467. This change speeds up fldry on X86 with LLC from 7.32s on apoc to 4.68s. llvm-svn: 18433	2004-12-02 18:17:31 +00:00
Chris Lattner	dfdd49b7af	Consider 64-bit registers to be FP as well. llvm-svn: 18432	2004-12-02 17:57:21 +00:00
Tanya Lattner	893f987574	Reverting this patch: http://mail.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20041122/021428.html It broke Mutlisource/Applications/obsequi llvm-svn: 18407	2004-12-01 18:27:03 +00:00
Chris Lattner	9c400f3b28	Revamp long/ulong comparisons to use a much more efficient sequence (thanks to Brian and the Sun compiler for pointing out that the obvious works :) This also enables folding all long comparisons into setcc and branch instructions: before we could only do == and != For example, for: void test(unsigned long long A, unsigned long long B) { if (A < B) foo(); } We now generate: test: subl $4, %esp movl %esi, (%esp) movl 8(%esp), %eax movl 12(%esp), %ecx movl 16(%esp), %edx movl 20(%esp), %esi subl %edx, %eax sbbl %esi, %ecx jae .LBBtest_2 # UnifiedReturnBlock .LBBtest_1: # then call foo movl (%esp), %esi addl $4, %esp ret .LBBtest_2: # UnifiedReturnBlock movl (%esp), %esi addl $4, %esp ret Instead of: test: subl $12, %esp movl %esi, 8(%esp) movl %ebx, 4(%esp) movl 16(%esp), %eax movl 20(%esp), %ecx movl 24(%esp), %edx movl 28(%esp), %esi cmpl %edx, %eax setb %al cmpl %esi, %ecx setb %bl cmove %ax, %bx testb %bl, %bl je .LBBtest_2 # UnifiedReturnBlock .LBBtest_1: # then call foo movl 4(%esp), %ebx movl 8(%esp), %esi addl $12, %esp ret .LBBtest_2: # UnifiedReturnBlock movl 4(%esp), %ebx movl 8(%esp), %esi addl $12, %esp ret llvm-svn: 18330	2004-11-29 05:55:24 +00:00
Chris Lattner	7a34cbf266	Do not push two return addresses on the stack when we call external functions who have their addresses taken. This fixes test-call.ll llvm-svn: 18134	2004-11-22 22:25:30 +00:00
Chris Lattner	f4c8575535	There is no reason to emit function stubs for direct calls. llvm-svn: 18082	2004-11-21 03:46:06 +00:00
Chris Lattner	6d1fb33657	ignore generated files llvm-svn: 18073	2004-11-21 00:01:54 +00:00
Chris Lattner	4a340e281e	Remove all JIT specific code and switch the code generator over to emitting relocations for global references. llvm-svn: 18068	2004-11-20 23:55:15 +00:00
Chris Lattner	b9a44893e9	Implement the X86 JIT interfaces llvm-svn: 18067	2004-11-20 23:54:33 +00:00
Chris Lattner	8e33311566	Describe the X86 target-specific relocations. llvm-svn: 18066	2004-11-20 23:54:19 +00:00
Chris Lattner	3c20464ad7	We implement these interfaces llvm-svn: 18065	2004-11-20 23:53:56 +00:00
Chris Lattner	0c79788bc4	Dont' forget to switch back to decimal output llvm-svn: 18010	2004-11-19 20:57:07 +00:00
Chris Lattner	a7eec14b04	Fix a major bug in the signed shr code, which apparently only breaks 134.perl! llvm-svn: 17902	2004-11-16 18:40:52 +00:00
Chris Lattner	41d31d7461	Remove a dead function, which died when we got GAS emission working (phwew, hold your nose!) llvm-svn: 17869	2004-11-16 04:34:29 +00:00
Chris Lattner	b378786c97	Implement a simple FIXME: if we are emitting a basic block address that has already been emitted, we don't have to remember it and deal with it later, just emit it directly. llvm-svn: 17868	2004-11-16 04:30:51 +00:00
Chris Lattner	3f73c77ace	* Merge some win32 ifdefs together * Get rid of "emitMaybePCRelativeValue", either we want to emit a PC relative value or not: drop the maybe BS. As it turns out, the only places where the bool was a variable coming in, the bool was a dynamic constant. llvm-svn: 17867	2004-11-16 04:21:18 +00:00
Chris Lattner	3ed3e8669f	Add debug-only=jit printout, so we see when lazily resolved symbols are set up. llvm-svn: 17862	2004-11-15 23:16:55 +00:00
Chris Lattner	9ef34d44e1	Simplify and rearrange long shift code llvm-svn: 17861	2004-11-15 23:16:34 +00:00
Misha Brukman	8c1b4a5b9d	GhostLinkage should not reach asm printing stage llvm-svn: 17750	2004-11-14 21:03:49 +00:00
Chris Lattner	09b7f968e0	Don't print unneeded labels llvm-svn: 17714	2004-11-13 23:27:11 +00:00
Chris Lattner	1cde11aa95	shld is a very high latency operation. Instead of emitting it for shifts of two or three, open code the equivalent operation which is faster on athlon and P4 (by a substantial margin). For example, instead of compiling this: long long X2(long long Y) { return Y << 2; } to: X3_2: movl 4(%esp), %eax movl 8(%esp), %edx shldl $2, %eax, %edx shll $2, %eax ret Compile it to: X2: movl 4(%esp), %eax movl 8(%esp), %ecx movl %eax, %edx shrl $30, %edx leal (%edx,%ecx,4), %edx shll $2, %eax ret Likewise, for << 3, compile to: X3: movl 4(%esp), %eax movl 8(%esp), %ecx movl %eax, %edx shrl $29, %edx leal (%edx,%ecx,8), %edx shll $3, %eax ret This matches icc, except that icc open codes the shifts as adds on the P4. llvm-svn: 17707	2004-11-13 20:48:57 +00:00
Chris Lattner	c531e090db	Add missing check llvm-svn: 17706	2004-11-13 20:04:38 +00:00
Chris Lattner	d1381380ae	Compile: long long X3_2(long long Y) { return Y+Y; } int X(int Y) { return Y+Y; } into: X3_2: movl 4(%esp), %eax movl 8(%esp), %edx addl %eax, %eax adcl %edx, %edx ret X: movl 4(%esp), %eax addl %eax, %eax ret instead of: X3_2: movl 4(%esp), %eax movl 8(%esp), %edx shldl $1, %eax, %edx shll $1, %eax ret X: movl 4(%esp), %eax shll $1, %eax ret llvm-svn: 17705	2004-11-13 20:03:48 +00:00
John Criswell	402e338f11	Correct the name of stosd for the AT&T syntax: It's stosl (l for long == 32 bit). llvm-svn: 17658	2004-11-10 04:48:15 +00:00
John Criswell	97da76178c	Fix compilation problem; make the cast and the LHS be the same type. llvm-svn: 17488	2004-11-05 16:17:06 +00:00
Chris Lattner	499e1b16a7	Quiet VC++ warnings llvm-svn: 17484	2004-11-05 04:50:59 +00:00
Chris Lattner	d9696aa7b8	Fix a warning llvm-svn: 17431	2004-11-02 15:27:57 +00:00
Chris Lattner	10de12fd46	Add placeholder variable to make Win32 work, applied for Morten Ofstad llvm-svn: 17406	2004-11-01 20:10:20 +00:00
Reid Spencer	d3f7233495	Change Library Names Not To Conflict With Others When Installed llvm-svn: 17286	2004-10-27 23:18:45 +00:00
Reid Spencer	019621a1ea	Adjust to changes in Makefile.rules llvm-svn: 17167	2004-10-22 21:02:08 +00:00
Reid Spencer	e48ba34fd4	We won't use automake llvm-svn: 17155	2004-10-22 03:35:04 +00:00
Reid Spencer	ce514b1c2c	Initial automake generated Makefile template llvm-svn: 17136	2004-10-18 23:55:41 +00:00
Chris Lattner	cac643c78f	Improve compatibility with VC++, patch contributed by Morten Ofstad! llvm-svn: 17126	2004-10-18 15:54:17 +00:00
Chris Lattner	f96fb0c946	Don't print stuff out from the code generator. This broke the JIT horribly last night. :) bork! llvm-svn: 17093	2004-10-17 17:40:50 +00:00
Chris Lattner	63e6bdd207	Rewrite support for cast uint -> FP. In particular, we used to compile this: double %test(uint %X) { %tmp.1 = cast uint %X to double ; <double> [#uses=1] ret double %tmp.1 } into: test: sub %ESP, 8 mov %EAX, DWORD PTR [%ESP + 12] mov %ECX, 0 mov DWORD PTR [%ESP], %EAX mov DWORD PTR [%ESP + 4], %ECX fild QWORD PTR [%ESP] add %ESP, 8 ret ... which basically zero extends to 8 bytes, then does an fild for an 8-byte signed int. Now we generate this: test: sub %ESP, 4 mov %EAX, DWORD PTR [%ESP + 8] mov DWORD PTR [%ESP], %EAX fild DWORD PTR [%ESP] shr %EAX, 31 fadd DWORD PTR [.CPItest_0 + 4*%EAX] add %ESP, 4 ret .section .rodata .align 4 .CPItest_0: .quad 5728578726015270912 This does a 32-bit signed integer load, then adds in an offset if the sign bit of the integer was set. It turns out that this is substantially faster than the preceeding sequence. Consider this testcase: unsigned a[2]={1,2}; volatile double G; void main() { int i; for (i=0; i<100000000; ++i ) G += a[i&1]; } On zion (a P4 Xeon, 3Ghz), this patch speeds up the testcase from 2.140s to 0.94s. On apoc, an athlon MP 2100+, this patch speeds up the testcase from 1.72s to 1.34s. Note that the program takes 2.5s/1.97s on zion/apoc with GCC 3.3 -O3 -fomit-frame-pointer. llvm-svn: 17083	2004-10-17 08:01:28 +00:00
Chris Lattner	bf114f32c0	Unify handling of constant pool indexes with the other code paths, allowing us to use index registers for CPI's llvm-svn: 17082	2004-10-17 07:49:45 +00:00
Chris Lattner	892b15538d	Give the asmprinter the ability to print memrefs with a constant pool index, index reg and scale llvm-svn: 17081	2004-10-17 07:16:32 +00:00
Chris Lattner	2fdca0bc02	fold: %X = and Y, constantint %Z = setcc %X, 0 instead of emitting: and %EAX, 3 test %EAX, %EAX je .LBBfoo2_2 # UnifiedReturnBlock We now emit: test %EAX, 3 je .LBBfoo2_2 # UnifiedReturnBlock This triggers 581 times on 176.gcc for example. llvm-svn: 17080	2004-10-17 06:10:40 +00:00
Chris Lattner	ae2e5f4de1	Teach the X86 backend about unreachable and undef. Among other things, we now compile: 'foo() {}' into "ret" instead of "mov EAX, 0; ret" llvm-svn: 17049	2004-10-16 18:13:05 +00:00
Chris Lattner	25b5777485	Instruction select globals with offsets better. For example, on this test case: int C[100]; int foo() { return C[4]; } We now codegen: foo: mov %EAX, DWORD PTR [C + 16] ret instead of: foo: mov %EAX, OFFSET C mov %EAX, DWORD PTR [%EAX + 16] ret Other impressive features may be coming later. This patch is contributed by Jeff Cohen! llvm-svn: 17011	2004-10-15 05:05:29 +00:00
Chris Lattner	38de76365d	Give the X86 JIT the ability to encode global+disp constants. Patch contributed by Jeff Cohen! llvm-svn: 17010	2004-10-15 04:53:13 +00:00
Chris Lattner	812d56631a	Give the X86 asm printer the ability to print out addressing modes that have constant displacements from global variables. Patch by Jeff Cohen! llvm-svn: 17009	2004-10-15 04:44:53 +00:00
Chris Lattner	1b9a284e54	Allow X86 addressing modes to represent globals with offsets. Patch contributed by Jeff Cohen! llvm-svn: 17008	2004-10-15 04:43:20 +00:00
Reid Spencer	e6418ec30f	Update to reflect changes in Makefile rules. llvm-svn: 16950	2004-10-13 11:46:52 +00:00
Reid Spencer	1b7459b29d	Initial version of automake Makefile.am file. llvm-svn: 16893	2004-10-10 22:20:40 +00:00
Chris Lattner	2419e1d27e	The person who was planning to add SSE support isn't anymore, so disable the -sse* options (to avoid misleading people). Also, the stack alignment of the target doesn't depend on whether SSE is eventually implemented, so remove a comment. llvm-svn: 16860	2004-10-08 22:41:46 +00:00
Chris Lattner	1291307d27	Fix a major regression from the bugfix for 2004-10-08-SelectSetCCFold.llx, which prevented setcc's from being folded into branches. It appears that conditional branchinst's CC operand is actually operand(2), not operand(0) as we might expect. :( llvm-svn: 16859	2004-10-08 22:24:31 +00:00
Chris Lattner	1ac1e54bf9	Fix bug: 2004-10-08-SelectSetCCFold.llx. Normally this is hidden by the instcombine xform, which is why we didn't notice it before. llvm-svn: 16840	2004-10-08 16:34:13 +00:00
Chris Lattner	82aa8544a5	Remove debugging code, fix encoding problem. This fixes the problems the JIT had last night. llvm-svn: 16766	2004-10-06 14:31:50 +00:00
Chris Lattner	b0e465f0cb	Codegen signed mod by 2 or -2 more efficiently. Instead of generating: t: mov %EDX, DWORD PTR [%ESP + 4] mov %ECX, 2 mov %EAX, %EDX sar %EDX, 31 idiv %ECX mov %EAX, %EDX ret Generate: t: mov %ECX, DWORD PTR [%ESP + 4] * mov %EAX, %ECX cdq and %ECX, 1 xor %ECX, %EDX sub %ECX, %EDX * mov %EAX, %ECX ret Note that the two marked moves are redundant, and should be eliminated by the register allocator, but aren't. Compare this to GCC, which generates: t: mov %eax, DWORD PTR [%esp+4] mov %edx, %eax shr %edx, 31 lea %ecx, [%edx+%eax] and %ecx, -2 sub %eax, %ecx ret or ICC 8.0, which generates: t: movl 4(%esp), %ecx #3.5 movl $-2147483647, %eax #3.25 imull %ecx #3.25 movl %ecx, %eax #3.25 sarl $31, %eax #3.25 addl %ecx, %edx #3.25 subl %edx, %eax #3.25 addl %eax, %eax #3.25 negl %eax #3.25 subl %eax, %ecx #3.25 movl %ecx, %eax #3.25 ret #3.25 We would be in great shape if not for the moves. llvm-svn: 16763	2004-10-06 05:01:07 +00:00
Chris Lattner	09b6b3f514	Fix a scary bug with signed division by a power of two. We used to generate: s: ;; X / 4 mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, %EAX sar %ECX, 1 shr %ECX, 30 mov %EDX, %EAX add %EDX, %ECX sar %EAX, 2 ret When we really meant: s: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, %EAX sar %ECX, 1 shr %ECX, 30 add %EAX, %ECX sar %EAX, 2 ret Hey, this also reduces register pressure too :) llvm-svn: 16761	2004-10-06 04:19:43 +00:00
Chris Lattner	9258948b08	Codegen signed divides by 2 and -2 more efficiently. In particular instead of: s: ;; X / 2 movl 4(%esp), %eax movl %eax, %ecx shrl $31, %ecx movl %eax, %edx addl %ecx, %edx sarl $1, %eax ret t: ;; X / -2 movl 4(%esp), %eax movl %eax, %ecx shrl $31, %ecx movl %eax, %edx addl %ecx, %edx sarl $1, %eax negl %eax ret Emit: s: movl 4(%esp), %eax cmpl $-2147483648, %eax sbbl $-1, %eax sarl $1, %eax ret t: movl 4(%esp), %eax cmpl $-2147483648, %eax sbbl $-1, %eax sarl $1, %eax negl %eax ret llvm-svn: 16760	2004-10-06 04:02:39 +00:00
Chris Lattner	acd213fba3	Add some new instructions. Fix the asm string for sbb32rr llvm-svn: 16759	2004-10-06 04:01:02 +00:00
Chris Lattner	0228f228df	* Prune #includes * Update comments * Rearrange code a bit * Finally ELIMINATE the GAS workaround emitter for Intel mode. woot! llvm-svn: 16647	2004-10-04 07:31:08 +00:00
Chris Lattner	581948c8f6	Add support for emitting AT&T style .s files, and make it the default. Users may now choose their output format with the -x86-asm-syntax={intel\|att} flag. llvm-svn: 16646	2004-10-04 07:24:48 +00:00
Chris Lattner	5959f4a108	Convert some missed patterns to support AT&T style llvm-svn: 16645	2004-10-04 07:23:07 +00:00
Chris Lattner	a05d9f53bb	Apparently the GNU assembler has a HUGE hack to be compatible with really old and broken AT&T syntax assemblers. The problem with this hack is that SOME forms of the fdiv and fsub instructions have the 'r' bit inverted. This was a real pain to figure out, but is trivially easy to support: thus we are now bug compatible with gas and gcc. llvm-svn: 16644	2004-10-04 07:08:46 +00:00
Chris Lattner	08098895db	Fix incorrect suffix llvm-svn: 16642	2004-10-04 05:20:16 +00:00
Chris Lattner	c2fc9597bd	Fix some more missed suffixes and swapped operands llvm-svn: 16641	2004-10-04 01:38:10 +00:00
Chris Lattner	7b15a84728	Add missing suffixes to FP instructions for AT&T mode llvm-svn: 16640	2004-10-04 00:43:31 +00:00
Chris Lattner	8d44dcca97	Add support for the -x86-asm-syntax flag, which can be used to choose between Intel and AT&T style assembly language. The ultimate goal of this is to eliminate the GasBugWorkaroundEmitter class, but for now AT&T style emission is not fully operational. llvm-svn: 16639	2004-10-03 20:36:57 +00:00
Chris Lattner	94780713a8	Add support to the instruction patterns for AT&T style output, which will hopefully lead to the death of the 'GasBugWorkaroundEmitter'. This also includes changes to wrap the whole file to 80 columns! Woot! :) Note that the AT&T style output has not been tested at all. llvm-svn: 16638	2004-10-03 20:35:00 +00:00
Alkis Evlogimenos	3f8f30bcb8	The real x87 floating point registers should not be allocatable. They are only used by the stackifier when transforming FPn register allocations to the real stack file x87 registers. llvm-svn: 16472	2004-09-21 21:22:11 +00:00
Misha Brukman	e877aacbaa	s/ISel/X86ISel/ to have unique class names for debugging via gdb because the C++ front-end in gcc does not mangle classes in anonymous namespaces correctly. llvm-svn: 16469	2004-09-21 18:21:21 +00:00
Reid Spencer	c6a8d70cff	Convert code to compile with vc7.1. Patch contributed by Paolo Invernizzi. Thanks Paolo! llvm-svn: 16368	2004-09-15 17:06:42 +00:00
Misha Brukman	9e5af08ef9	Fit long lines into 80 cols via creative space elimination llvm-svn: 16353	2004-09-15 01:40:18 +00:00
Chris Lattner	aee36bb527	Revamp the Register class, and allow the use of the RegisterGroup class to specify aliases directly in register definitions. Patch contributed by Jason Eckhardt! llvm-svn: 16330	2004-09-14 04:17:02 +00:00
Misha Brukman	ec87a61944	Fix filename: Printer.cpp has become X86AsmPrinter.cpp llvm-svn: 16299	2004-09-12 21:26:04 +00:00
Alkis Evlogimenos	4d2b0a2b5b	Use a shorter form to express implicit use/defs in FpGETRESULT and FpSETRESULT. llvm-svn: 16247	2004-09-08 18:29:31 +00:00
Alkis Evlogimenos	3540de9ea6	A call instruction should implicitely define ST0 since the return value is returned in that register. The pseudo instructions FpGETRESULT and FpSETRESULT shold also have an implicity use and def of ST0 repsecitvely. llvm-svn: 16246	2004-09-08 16:54:54 +00:00
Reid Spencer	c4abcbefb1	Changes For Bug 352 Move include/Config and include/Support into include/llvm/Config, include/llvm/ADT and include/llvm/Support. From here on out, all LLVM public header files must be under include/llvm/. llvm-svn: 16137	2004-09-01 22:55:40 +00:00
Reid Spencer	59cb27bcdc	Reduce the number of arguments in the instruction builder and make some improvements on instruction selection that account for register and frame index bases. Patch contributed by Jeff Cohen. Thanks Jeff! llvm-svn: 16110	2004-08-30 00:13:26 +00:00
Chris Lattner	6781bb48eb	Add -sse[,2,3] arguments to LLC llvm-svn: 16018	2004-08-24 08:18:44 +00:00
Chris Lattner	28c7ae5697	Nuke commented out stuff llvm-svn: 16017	2004-08-24 08:18:27 +00:00
Chris Lattner	64d3bc5e85	Switch from bytes to bits for alignment for consistency llvm-svn: 15974	2004-08-21 20:14:13 +00:00
Chris Lattner	4427fd9a3c	Reduce uses of getRegClass llvm-svn: 15973	2004-08-21 20:13:52 +00:00
Chris Lattner	8c5096d223	Rename var llvm-svn: 15897	2004-08-18 02:22:55 +00:00
Chris Lattner	d3d5c1d2a2	Start using alignment output routines from AsmPrinter. Changes to make this more similar to the ppc asmprinter llvm-svn: 15890	2004-08-17 19:25:42 +00:00
Chris Lattner	052cebe33c	Use the AsmPrinter emitGlobalConstant. llvm-svn: 15872	2004-08-17 06:48:55 +00:00
Chris Lattner	bf5dba50c5	Start using the AsmPrinter to emit our first class constants. This also drops our half-assed support for cygwin, which noone uses and doesn't work anyway. llvm-svn: 15839	2004-08-16 23:16:06 +00:00
Chris Lattner	3383506bcc	Disable the pattern isel llvm-svn: 15787	2004-08-15 23:02:17 +00:00
Chris Lattner	555a585fd8	Code insertion methods now return void instead of an int. llvm-svn: 15780	2004-08-15 22:15:11 +00:00
Chris Lattner	e58190f5f6	These methods no longer take a TargetRegisterClass* operand. llvm-svn: 15774	2004-08-15 21:56:44 +00:00
Nate Begeman	fabece673b	Eliminate MachineFunction& argument from eliminateFrameIndex in x86 Target. Get MachineFunction from MachineInstruction's parent's parent llvm-svn: 15739	2004-08-14 22:05:10 +00:00
Chris Lattner	5e7e9b6c26	Remove a bunch of ad-hoc target-specific flags that were only used by the old asmprinter. llvm-svn: 15660	2004-08-11 07:12:04 +00:00
Chris Lattner	b09bc9d4e3	Remove a dead method llvm-svn: 15659	2004-08-11 07:07:14 +00:00
Chris Lattner	3fc9d4490c	Finally, the entire instruction asmprinter is now generated from tblgen, woo! llvm-svn: 15658	2004-08-11 07:02:04 +00:00
Chris Lattner	3cef2f82ff	Add asmprintergen support for the last X86 instruction that needs it: pcrelative calls. llvm-svn: 15657	2004-08-11 06:59:12 +00:00
Chris Lattner	309873fed0	This file is long dead llvm-svn: 15656	2004-08-11 06:55:12 +00:00
Chris Lattner	9c171be048	Scrunch memoperands, add a few more for floating point memops Eliminate the FPI*m classes, converting them to use FPI instead. llvm-svn: 15655	2004-08-11 06:50:10 +00:00
Chris Lattner	f34003128d	Move hacks up llvm-svn: 15654	2004-08-11 06:09:55 +00:00
Chris Lattner	b287047c3f	Make FPI take asm string and operand list llvm-svn: 15653	2004-08-11 05:54:16 +00:00
Chris Lattner	c304bf7e03	Nuke the Imi patterns, by asmprintergenifying all users. llvm-svn: 15652	2004-08-11 05:31:07 +00:00
Chris Lattner	65ab459759	X86 instructions that read-modify-write memory are not LLVM two-address instructions. llvm-svn: 15651	2004-08-11 05:07:25 +00:00
Chris Lattner	384711a69c	Get rid of the Im8, Im16, Im32 classes, converting more instructions over to asmprintergeneration llvm-svn: 15650	2004-08-11 04:31:00 +00:00
Chris Lattner	24279a8ac8	Remove dead method llvm-svn: 15647	2004-08-11 02:26:39 +00:00
Chris Lattner	b66b9cd4a9	Convert asmprinter to new style of instruction printer Start asmprintergen'ifying machine instrs with memory operands. llvm-svn: 15646	2004-08-11 02:25:00 +00:00
Chris Lattner	5cf0a20d4f	This is purely a formatting patch that gets us closer to the mecca of fitting X86InstrInfo.td into 80 columns llvm-svn: 15629	2004-08-10 21:21:30 +00:00
Chris Lattner	f6c4de46e0	Drop the first argument of FPI, and asmprinterify fxch llvm-svn: 15628	2004-08-10 21:02:13 +00:00
Chris Lattner	97abe28059	This purely mechanical patch gives the "I" tblgen class operand list and asm string operands, and adjusts all users to pass them in instead of using II. llvm-svn: 15624	2004-08-10 20:17:41 +00:00
Chris Lattner	332fa9be1c	Convert Ii32 instructions over to use the asmprinter generator llvm-svn: 15621	2004-08-10 19:06:36 +00:00
Chris Lattner	068209661a	Convert the Ii16 instructions over llvm-svn: 15606	2004-08-10 16:22:02 +00:00
Chris Lattner	315782f0ac	Convert all Ii8 instructions over to the autogenerated asmprinter. llvm-svn: 15605	2004-08-10 16:09:54 +00:00
Alkis Evlogimenos	f853362a44	Stop using getValues(). llvm-svn: 15487	2004-08-04 08:44:43 +00:00
Chris Lattner	2677b71f64	Fix a warning llvm-svn: 15409	2004-08-01 19:31:30 +00:00
Chris Lattner	df7c9d0339	Convert all I<> instructions to asmformat. Delete the 'name' field of all instructions that have asmformats. llvm-svn: 15403	2004-08-01 09:52:59 +00:00
Chris Lattner	90a4b737dd	Eliminate 3 of the X86 printImplicit* flags. llvm-svn: 15398	2004-08-01 08:23:17 +00:00
Chris Lattner	de4844f84d	Get rid of 3 of the 4 'printimplicit' flags. Implicit operands are now explicitly listed in the asm string. llvm-svn: 15397	2004-08-01 08:22:29 +00:00
Chris Lattner	0c5ab21dcd	Convert more instructions over to the asmprinter llvm-svn: 15396	2004-08-01 08:13:11 +00:00
Chris Lattner	0a6fedb451	Handle registers a bit more efficiently llvm-svn: 15395	2004-08-01 08:12:41 +00:00
Chris Lattner	c40aa40525	give FP stack registers names llvm-svn: 15394	2004-08-01 08:12:13 +00:00
Chris Lattner	6c596faddb	Switch more instructions over to using the asmprinter. Fix bugs in the emission of in/out instructions (missing %'s on registers). llvm-svn: 15393	2004-08-01 07:44:35 +00:00
Chris Lattner	3a928f8119	The tblgen'erated asmparser wants a way to print operands. llvm-svn: 15392	2004-08-01 07:43:46 +00:00
Chris Lattner	e4c868ffa0	Rename the Printer class -> X86AsmPrinter. Include the tablegenerated assembly writer. llvm-svn: 15389	2004-08-01 06:02:08 +00:00
Chris Lattner	a02166d28b	Factor a bunch of the rules and add support for generating the asmwriter. llvm-svn: 15388	2004-08-01 06:01:32 +00:00
Chris Lattner	9a7b050ebb	Specify an asm string and operands lists for a bunch of instructions. This only really covers no-operand instructions so far. llvm-svn: 15387	2004-08-01 06:01:00 +00:00
Chris Lattner	101dccd430	Completely disable the pattern isel until it is more substantial. llvm-svn: 15380	2004-08-01 03:28:02 +00:00
Chris Lattner	9bce44c8cc	Entirely eliminate all patterns and expanders from this file. We shall go with an incremental approach rather than a revolutionary approach. llvm-svn: 15379	2004-08-01 03:25:01 +00:00
Chris Lattner	0717ef353d	Remove obsolete file llvm-svn: 15377	2004-08-01 03:19:28 +00:00
Alkis Evlogimenos	cdcb1c62e5	Align breaks. llvm-svn: 15371	2004-07-31 10:05:44 +00:00
Chris Lattner	0d66480e9e	Add breaks llvm-svn: 15365	2004-07-31 09:53:31 +00:00
Alkis Evlogimenos	1eb8a5dc09	Simplify code a bit. llvm-svn: 15364	2004-07-31 09:44:32 +00:00
Alkis Evlogimenos	de150fb74b	Correctly spell 'unconditional'. llvm-svn: 15363	2004-07-31 09:41:44 +00:00
Alkis Evlogimenos	bc3d550391	Implement insertGoto and reverseBranchCondition for the X86. llvm-svn: 15362	2004-07-31 09:38:47 +00:00
Chris Lattner	9a23ab1e63	Mark barrier instructions. Execution does not fall through uncond branches or return intructions. llvm-svn: 15356	2004-07-31 02:10:53 +00:00
Misha Brukman	3e7a88e9db	Fix indentation: should be 2 spaces. llvm-svn: 15240	2004-07-26 18:48:58 +00:00
Misha Brukman	61ff8a374f	Fix file header as it has been renamed. llvm-svn: 15239	2004-07-26 18:45:48 +00:00
Misha Brukman	ccd1114518	Renamed files to have the `X86' prefix for uniqueness purposes. All CVS history was renamed, the *,v were copied over. No worries. llvm-svn: 15238	2004-07-26 18:43:11 +00:00
Chris Lattner	093d84c480	Remove some (LARGE) abandoned code for the release. If this is ever needed again in the future, it can be resurrected out of CVS llvm-svn: 15112	2004-07-22 21:30:35 +00:00
Chris Lattner	e3d3cd3e71	Fix cases where we generated horrible code like this: mov %EDI, 12 add %EDI, %ECX mov %ECX, 12 add %ECX, %EDX mov %EDX, 12 add %EDX, %ESI instead (really!) generate this: add %ECX, 12 add %EDX, 12 add %ESI, 12 llvm-svn: 15090	2004-07-21 21:28:26 +00:00
Chris Lattner	e8b9b58454	While I'm at it, don't break codegen of mul by 3,5,9. llvm-svn: 15013	2004-07-19 23:50:57 +00:00
Chris Lattner	f668465840	Generate better code for multiplies by negative constants like -4, -1, -9, etc. llvm-svn: 15012	2004-07-19 23:47:21 +00:00
Reid Spencer	14243817ec	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Minimize redundant isa<GlobalValue> usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14950	2004-07-18 00:38:32 +00:00
Chris Lattner	9bcf258cc3	Make sure to emit the immediate byte for instructions like: shrd [mem], reg, imm This fixes the jit-ls failure on 186.crafty. llvm-svn: 14914	2004-07-17 20:26:14 +00:00
Chris Lattner	d7905d828b	Reserve the correct amt of space. llvm-svn: 14913	2004-07-17 20:24:05 +00:00
Chris Lattner	c4888ccda7	Patches towards fixing PR341 llvm-svn: 14841	2004-07-15 02:14:30 +00:00
Chris Lattner	210ffe4b77	Improve codegen for the LLVM offsetof/sizeof "operator". Before we compiled this LLVM function: int %foo() { ret int cast (int getelementptr (int null, int 1) to int) } into: foo: mov %EAX, 0 lea %EAX, DWORD PTR [%EAX + 4] ret now we compile it into: foo: mov %EAX, 4 ret This sequence is frequently generated by the MSIL front-end, and soon the malloc lowering pass and Java front-ends as well.. -Chris llvm-svn: 14834	2004-07-15 00:58:53 +00:00
Chris Lattner	6331eb6bbe	Delete the allocate*TargetMachine function, which is now dead . The shared command line options are now in a header that makes sense. llvm-svn: 14756	2004-07-11 04:17:10 +00:00
Chris Lattner	b67e3b01bc	Make these format a bit nicer llvm-svn: 14747	2004-07-11 03:27:42 +00:00
Chris Lattner	2ada866a78	Auto-registrate target llvm-svn: 14745	2004-07-11 02:48:49 +00:00
Reid Spencer	50ec3f9325	Add #include <iostream> since Value.h does not #include it any more. llvm-svn: 14622	2004-07-04 12:19:56 +00:00
Chris Lattner	6da0499f4b	Remove dead blocks llvm-svn: 14564	2004-07-02 05:46:41 +00:00
Misha Brukman	9e015dddb8	Fix associativity of parameters to assert(): now it actually makes sense. llvm-svn: 14483	2004-06-29 19:43:20 +00:00
Misha Brukman	b3e4179f42	Convert tabs to spaces. llvm-svn: 14482	2004-06-29 19:28:53 +00:00
Chris Lattner	2abf0134d0	I believe that the code generator now properly handles dead basic blocks. If not, this is a bug, and should be fixed. llvm-svn: 14476	2004-06-29 07:17:12 +00:00
Chris Lattner	cd1a39bbec	Fix a regression from r1.224. In particular, codegen a cast from double -> float as a truncation by going through memory. This truncation was being skipped, which caused 175.vpr to fail after aggressive register promotion. llvm-svn: 14473	2004-06-29 00:14:38 +00:00
Tanya Lattner	da38dc5180	Made a fix so that you can print out MachineInstrs that belong to a MachineBasicBlock that is not yet attached to a MachineFunction. This change includes changing the third operand (TargetMachine) to a pointer for the MachineInstr::print function. llvm-svn: 14389	2004-06-25 00:13:11 +00:00
Misha Brukman	e38f7ed2cc	Spell out `NoFramePointerElim' for readability. llvm-svn: 14299	2004-06-21 21:17:44 +00:00
Misha Brukman	a2ac4e4345	Use the common `NoFPElim' setting instead of our own. llvm-svn: 14298	2004-06-21 21:10:24 +00:00
Chris Lattner	cc465361d9	Move the IntrinsicLowering header into the CodeGen directory, as per PR346 llvm-svn: 14266	2004-06-20 07:49:54 +00:00
Chris Lattner	9e1bbe86ba	Codegen sub C, X a little bit better for register pressure. Instead of mov REG, C sub REG, X generate: neg X add X, C which uses one less reg llvm-svn: 14213	2004-06-18 00:50:37 +00:00
Chris Lattner	a5750b975a	Fold setcc instructions into select and branches that are not in the same BB as the setcc. llvm-svn: 14212	2004-06-18 00:29:22 +00:00
Chris Lattner	f815117481	Do not fold loads into instructions if it is used more than once. In particular we do not want to fold the load in cases like this: X = load = add A, X = add B, X llvm-svn: 14204	2004-06-17 22:15:25 +00:00
Chris Lattner	0cd29ae2cd	Rename Type::PrimitiveID to TypeId and ::getPrimitiveID() to ::getTypeID() llvm-svn: 14201	2004-06-17 18:19:28 +00:00
Chris Lattner	9bb0083d16	Remove support for llvm.isnan. Alkis wins :) llvm-svn: 14189	2004-06-15 21:48:07 +00:00
Chris Lattner	d11493d8c4	Add basic support for the isunordered intrinsic. The isnan stuff still needs to go llvm-svn: 14185	2004-06-15 21:36:44 +00:00
Chris Lattner	3a8e675c03	By far, one of the most common uses of isnan is to make 'isunordered' comparisons. In an 'isunordered' predicate, which looks like this at the LLVM level: %a = call bool %llvm.isnan(double %X) %b = call bool %llvm.isnan(double %Y) %COM = or bool %a, %b We used to generate this code: fxch %ST(1) fucomip %ST(0), %ST(0) setp %AL fucomip %ST(0), %ST(0) setp %AH or %AL, %AH With this patch, we generate this code: fucomip %ST(0), %ST(1) fstp %ST(0) setp %AL Which should make alkis happy. Tested as X86/compare_folding.llx:test1 llvm-svn: 14148	2004-06-11 05:33:49 +00:00
Chris Lattner	f78e3e7f63	Fix bug in previous checkin llvm-svn: 14146	2004-06-11 05:22:44 +00:00
Chris Lattner	7d8093efb1	No really, these are dead now llvm-svn: 14145	2004-06-11 04:50:14 +00:00

... 7 8 9 10 11 ...

1547 Commits