llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 06:22:56 +02:00

Author	SHA1	Message	Date
Tom Stellard	715b7811c2	Revert "Build script changes for R600/SI Codegen v6" This reverts commit e3013202259ed1e006c21817c63cf25d75982721. llvm-svn: 160301	2012-07-16 18:19:46 +00:00
Tom Stellard	9dc4728c5c	Revert "Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h>" This reverts commit 0258a6bdd30802f5cc0e8e57c8e768fde2aef590. llvm-svn: 160299	2012-07-16 18:19:41 +00:00
Tom Stellard	5013977c33	Revert "Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen." This reverts commit ebc934ba32ee71abbb8f0f2eb6a0fbaa613ba0d2. llvm-svn: 160298	2012-07-16 18:19:40 +00:00
Tom Stellard	9c4f5d8855	Revert "Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator..." This reverts commit 29f28bc14ad5a907f5dc849f004fafeec0aab33a. llvm-svn: 160297	2012-07-16 18:19:38 +00:00
Tom Stellard	428cc1034f	Revert "Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0)." This reverts commit 4ba4acc1bc2561b944a571edbb6a2dc78e357dfe. llvm-svn: 160296	2012-07-16 18:19:37 +00:00
Tom Stellard	5637c04c6b	Revert "Target/AMDGPU: Fix includes, or msvc build failed." This reverts commit fef4aa1b16fcf7a472559abbbcf4c1adc9eb5ca6. llvm-svn: 160295	2012-07-16 18:19:32 +00:00
Chad Rosier	16c9db9ad6	With r160248 in place this code is no longer needed. llvm-svn: 160293	2012-07-16 17:42:13 +00:00
NAKAMURA Takumi	cd72e724ac	Target/AMDGPU: Fix includes, or msvc build failed. llvm-svn: 160280	2012-07-16 15:43:50 +00:00
NAKAMURA Takumi	48743bc036	Target/AMDGPU/AMDILIntrinsicInfo.cpp: Use llvm_unreachable() in nonreturn function, instead of assert(0). llvm-svn: 160279	2012-07-16 15:43:09 +00:00
NAKAMURA Takumi	877e9fac64	Target/AMDGPU/R600KernelParameters.cpp: Don't use "and", "or" as conditional operator... llvm-svn: 160278	2012-07-16 15:42:35 +00:00
Jack Carter	f2bf098c4f	Doubleword Shift Left Logical Plus 32 Mips shift instructions DSLL, DSRL and DSRA are transformed into DSLL32, DSRL32 and DSRA32 respectively if the shift amount is between 32 and 63 Here is a description of DSLL: Purpose: Doubleword Shift Left Logical Plus 32 To execute a left-shift of a doubleword by a fixed amount--32 to 63 bits Description: GPR[rd] <- GPR[rt] << (sa+32) The 64-bit doubleword contents of GPR rt are shifted left, inserting zeros into the emptied bits; the result is placed in GPR rd. The bit-shift amount in the range 0 to 31 is specified by sa. This patch implements the direct object output of these instructions. llvm-svn: 160277	2012-07-16 15:14:51 +00:00
NAKAMURA Takumi	2d04e559df	Target/AMDGPU: [CMake] Fix dependencies. 1) Add intrinsics_gen. Add AMDGPUCommonTableGen. llvm-svn: 160276	2012-07-16 15:09:11 +00:00
NAKAMURA Takumi	4fd62f7458	Target/AMDGPU/R600KernelParameters.cpp: Fix two includes, <llvm/IRBuilder.h> and <llvm/TypeBuilder.h> llvm-svn: 160275	2012-07-16 15:08:47 +00:00
Tom Stellard	c75d49d526	Build script changes for R600/SI Codegen v6 llvm-svn: 160272	2012-07-16 14:17:16 +00:00
Tom Stellard	9f326179fc	AMDGPU: Add core backend files for R600/SI codegen v6 llvm-svn: 160270	2012-07-16 14:17:08 +00:00
Nadav Rotem	67ff66bd0c	Fix a bug in the 3-address conversion of LEA when one of the operands is an undef virtual register. The problem is that ProcessImplicitDefs removes the definition of the register and marks all uses as undef. If we lose the undef marker then we get a register which has no def, is not marked as undef. The live interval analysis does not collect information for these virtual registers and we crash in later passes. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160260	2012-07-16 10:52:25 +00:00
Alexey Samsonov	c68bb48704	This CL changes the function prologue and epilogue emitted on X86 when stack needs realignment. It is intended to fix PR11468. Old prologue and epilogue looked like this: push %rbp mov %rsp, %rbp and $alignment, %rsp push %r14 push %r15 ... pop %r15 pop %r14 mov %rbp, %rsp pop %rbp The problem was to reference the locations of callee-saved registers in exception handling: locations of callee-saved had to be re-calculated regarding the stack alignment operation. It would take some effort to implement this in LLVM, as currently MachineLocation can only have the form "Register + Offset". Funciton prologue and epilogue are now changed to: push %rbp mov %rsp, %rbp push %14 push %15 and $alignment, %rsp ... lea -$size_of_saved_registers(%rbp), %rsp pop %r15 pop %r14 pop %rbp Reviewed by Chad Rosier. llvm-svn: 160248	2012-07-16 06:54:09 +00:00
Nadav Rotem	a09775b875	Teach getTargetVShiftNode about TargetConstant nodes. llvm-svn: 160234	2012-07-15 20:27:43 +00:00
Nadav Rotem	0377e0d234	Rename VBROADCASTSDrm into VBROADCASTSDYrm to match the naming convention. Allow the folding of vbroadcastRR to vbroadcastRM, where the memory operand is a spill slot. PR12782. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160230	2012-07-15 12:26:30 +00:00
Nadav Rotem	c40d85dda5	AVX: Fix a bug in getTargetVShiftNode. The shift amount has to be a 128bit vector with the same element type as the input vector. This is needed because of the patterns we have for the VP[SLL/SRA/SRL][W/D/Q] instructions. llvm-svn: 160222	2012-07-14 22:26:05 +00:00
Joel Jones	12ea066486	This is one of the first steps at moving to replace target-dependent intrinsics with target-indepdent intrinsics. The first instruction(s) to be handled are the vector versions of count leading zeros (ctlz). The changes here are to clang so that it generates a target independent vector ctlz when it sees an ARM dependent vector ctlz. The changes in llvm are to match the target independent vector ctlz and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector ctlzs with target-independent ctlzs. There are also changes to an existing test case in llvm for ARM vector count instructions and a new test for the bitcode upgrade. <rdar://problem/11831778> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160200	2012-07-13 23:25:25 +00:00
Jakob Stoklund Olesen	b8af245a15	Remove variable_ops from call instructions in most targets. Call instructions are no longer required to be variadic, and variable_ops should only be used for instructions that encode a variable number of arguments, like the ARM stm/ldm instructions. llvm-svn: 160189	2012-07-13 20:44:29 +00:00
Jakob Stoklund Olesen	6944c56847	Remove variable_ops from ARM call instructions. Function argument registers are added to the call SDNode, but InstrEmitter now knows how to make those operands implicit, and the call instruction doesn't have to be variadic. Explicit register operands should only be those that are encoded in the instruction, implicit register operands are for extra dependencies like call argument and return values. llvm-svn: 160188	2012-07-13 20:27:00 +00:00
Jack Carter	b8e3cf5fbc	The Mips specific relocation R_MIPS_GOT_DISP is used in cases where global symbols are directly represented in the GOT and we use an offset into the global offset table. This patch adds direct object support for R_MIPS_GOT_DISP. llvm-svn: 160183	2012-07-13 19:15:47 +00:00
Benjamin Kramer	308eb1b4c0	Make helper functions static. llvm-svn: 160173	2012-07-13 13:25:15 +00:00
Craig Topper	a75da664ff	Mark VINSERTI128rm as MayLoad=1. Fixes PR13348. llvm-svn: 160162	2012-07-13 05:46:28 +00:00
Benjamin Kramer	558c56f216	Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and MachineLICM don't touch it. I already had the necessary things in place for IR-level passes but missed the machine passes. llvm-svn: 160137	2012-07-12 18:14:57 +00:00
Benjamin Kramer	f8e67a04f4	Add intrinsics for Ivy Bridge's rdrand instruction. The rdrand/cmov sequence is the same that is emitted by both GCC and ICC. Fixes PR13284. llvm-svn: 160117	2012-07-12 09:31:43 +00:00
Craig Topper	6eef81b65b	Update GATHER instructions to support 2 read-write operands. Patch from myself and Manman Ren. llvm-svn: 160110	2012-07-12 06:52:41 +00:00
Manman Ren	6b6d3e2854	ARM: fix typo in comments llvm-svn: 160093	2012-07-11 23:47:00 +00:00
Manman Ren	0479bff92d	ARM: Fix optimizeCompare to correctly check safe condition. It is safe if CPSR is killed or re-defined. When we are done with the basic block, check whether CPSR is live-out. Do not optimize away cmp if CPSR is live-out. llvm-svn: 160090	2012-07-11 22:51:44 +00:00
Jack Carter	7eebd50da0	Patch for Mips direct object generation. When WriteFragmentData() case FT_align called Asm.getBackend().writeNopData() is called, nothing is done since Mips implementation of writeNopData just returned "true". For some reason this has not caused problems in 32 bit mode, but in 64 bit mode it caused an assert when processing multiple function units. The test case included will assert without this patch. It runs twice with different flags to prevent false positives due to changes in code generation over time. llvm-svn: 160084	2012-07-11 22:17:39 +00:00
Jack Carter	d3a7595f31	This change removes an "initialization" warning. Even though variable in question could not be initialized before use, the code was such that the compiler had no way of knowing that. llvm-svn: 160081	2012-07-11 21:41:49 +00:00
Akira Hatanaka	e0fbd238e2	In register classes in MipsRegisterInfo.td, list the registers in ascending order of binary encoding. Patch by Vladimir Medic. llvm-svn: 160073	2012-07-11 20:51:50 +00:00
Chad Rosier	75817295b6	[x86 fast-isel] Per discussion with Eric, add all cases to switch with verbose comments. llvm-svn: 160069	2012-07-11 19:58:38 +00:00
Manman Ren	93ef864f3c	X86: Update to peephole optimization to move Movr0 before (Sub, Cmp) pair. When Movr0 is between sub and cmp, we move Movr0 before sub if it enables removal of Cmp. llvm-svn: 160066	2012-07-11 19:35:12 +00:00
Akira Hatanaka	2e26e543b9	Implement MipsTargetLowering::LowerSELECT_CC to custom lower SELECT_CC. llvm-svn: 160064	2012-07-11 19:32:27 +00:00
Chad Rosier	aaccad80b4	[x86 fast-isel] Rather then call llvm_unreachable() have fast-isel fall back to Selection DAG isel. Patch by Andrew Kaylor <andrew.kaylor@intel.com>. llvm-svn: 160055	2012-07-11 17:23:17 +00:00
Nadav Rotem	22652c85bc	When ext-loading and trunc-storing vectors to memory, on x86 32bit systems, allow loads/stores of 64bit values from xmm registers. llvm-svn: 160044	2012-07-11 13:27:05 +00:00
Akira Hatanaka	aad21ac7f2	Lower RETURNADDR node in Mips backend. Patch by Sasa Stankovic. llvm-svn: 160031	2012-07-11 00:53:32 +00:00
Jack Carter	639a740a15	Mips specific inline asm operand modifier 'L'. Low order register of a double word register operand. Operands are defined by the name of the variable they are marked with in the inline assembler code. This is a way to specify that the operand just refers to the low order register for that variable. It is the opposite of modifier 'D' which specifies the high order register. Example: main() { long long ll_input = 0x1111222233334444LL; long long ll_val = 3; int i_result = 0; __asm__ __volatile__( "or %0, %L1, %2" : "=r" (i_result) : "r" (ll_input), "r" (ll_val)); } Which results in: lui $2, %hi(_gp_disp) addiu $2, $2, %lo(_gp_disp) addiu $sp, $sp, -8 addu $2, $2, $25 sw $2, 0($sp) lui $2, 13107 ori $3, $2, 17476 <-- Low 32 bits of ll_input lui $2, 4369 ori $4, $2, 8738 <-- High 32 bits of ll_input addiu $5, $zero, 3 <-- Low 32 bits of ll_val addiu $2, $zero, 0 <-- High 32 bits of ll_val #APP or $3, $4, $5 <-- or i_result, high 32 ll_input, low 32 of ll_val #NO_APP addiu $sp, $sp, 8 jr $ra If not direction is done for the long long for 32 bit variables results in using the low 32 bits as ll_val shows. There is an existing bug if 'L' or 'D' is used for the destination register for 32 bit long longs in that the target value will be updated incorrectly for the non-specified part unless explicitly set within the inline asm code. llvm-svn: 160028	2012-07-10 22:41:20 +00:00
Chad Rosier	3273667edf	Move [get\|set]BasePtrStackAdjustment() from MachineFrameInfo to X86MachineFunctionInfo as this is currently only used by X86. If this ever becomes an issue on another arch (e.g., ARM) then we can hoist it back out. llvm-svn: 160009	2012-07-10 18:27:15 +00:00
Chad Rosier	5395ec6ee4	Add support for dynamic stack realignment in the presence of dynamic allocas on X86. Basically, this is a reapplication of r158087 with a few fixes. Specifically, (1) the stack pointer is restored from the base pointer before popping callee-saved registers and (2) in obscure cases (see comments in patch) we must cache the value of the original stack adjustment in the prologue and apply it in the epilogue. rdar://11496434 llvm-svn: 160002	2012-07-10 17:45:53 +00:00
Nadav Rotem	5f6e9d5ffe	Improve the loading of load-anyext vectors by allowing the codegen to load multiple scalars and insert them into a vector. Next, we shuffle the elements into the correct places, as before. Also fix a small dagcombine bug in SimplifyBinOpWithSameOpcodeHands, when the migration of bitcasts happened too late in the SelectionDAG process. llvm-svn: 159991	2012-07-10 13:25:08 +00:00
Richard Barton	2bacde8589	Fix instruction description of VMOV (between two ARM core registers and two single-precision resiters) (and do it properly this time! llvm-svn: 159989	2012-07-10 12:51:09 +00:00
Craig Topper	b346ce8240	Reverse assembler/disassembler operand order for gather instructions. llvm-svn: 159983	2012-07-10 06:38:33 +00:00
Jim Grosbach	83589b60dc	ARM: Allow more flexible patterns in NEON formats. Some NEON instructions want to match against normal SDNodes for some operand types and Intrinsics for others. For example, CTLZ. To enable this, switch from explicitly requiring Intrinsic on the class templates to using SDPatternOperator instead. llvm-svn: 159974	2012-07-10 00:51:13 +00:00
Akira Hatanaka	96b3eb563a	Make register Mips::RA allocatable if not in mips16 mode. llvm-svn: 159971	2012-07-10 00:19:06 +00:00
Chad Rosier	b986265e3b	Revert r159938 (and r159945) to appease the buildbots. llvm-svn: 159960	2012-07-09 20:43:34 +00:00
Manman Ren	dc41586be4	X86: implement functions to analyze & synthesize CMOV\|SET\|Jcc getCondFromSETOpc, getCondFromCMovOpc, getSETFromCond, getCMovFromCond No functional change intended. If we want to update the condition code of CMOV\|SET\|Jcc, we first analyze the opcode to get the condition code, then update the condition code, finally synthesize the new opcode form the new condition code. llvm-svn: 159955	2012-07-09 18:57:12 +00:00
Akira Hatanaka	3d2bcefaf1	Reapply r158846. Access mips register classes via MCRegisterInfo's functions instead of via the TargetRegisterClasses defined in MipsGenRegisterInfo.inc. llvm-svn: 159953	2012-07-09 18:46:47 +00:00
Richard Barton	de6e2755f9	Some formatting to keep Clang happy llvm-svn: 159948	2012-07-09 18:30:56 +00:00
Richard Barton	1f07c6525e	Oops - correct broken disassembly for VMOV llvm-svn: 159945	2012-07-09 18:20:02 +00:00
Richard Barton	cb28956a79	Fix instruction description of VMOV (between two ARM core registers and two single-precision resiters) llvm-svn: 159938	2012-07-09 16:41:33 +00:00
Richard Barton	58c6ccbb1c	Prevent ARM assembler from losing a right shift by #32 applied to a register llvm-svn: 159937	2012-07-09 16:31:14 +00:00
Richard Barton	957a588c71	Spelling! llvm-svn: 159936	2012-07-09 16:14:28 +00:00
Richard Barton	2ca50f6513	Teach the assembler to use the narrow thumb encodings of various three-register dp instructions where permissable. llvm-svn: 159935	2012-07-09 16:12:24 +00:00
Andrew Trick	b9c8074dcd	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraties for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex contraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Manman Ren	eca5886e50	X86: Fix optimizeCompare to correctly check safe condition. It is safe if EFLAGS is killed or re-defined. When we are done with the basic block, check whether EFLAGS is live-out. Do not optimize away cmp if EFLAGS is live-out. llvm-svn: 159888	2012-07-07 03:34:46 +00:00
Chad Rosier	a9d216beac	Fix the naming of ensureAlignment. Per the coding standard function names should be camel case, and start with a lower case letter. llvm-svn: 159877	2012-07-06 23:13:38 +00:00
Jim Grosbach	b9fd88619e	ARM: Add test cleanup entry to the README. llvm-svn: 159864	2012-07-06 21:52:04 +00:00
Akira Hatanaka	37565e70b6	revert r159851. llvm-svn: 159854	2012-07-06 20:16:48 +00:00
Akira Hatanaka	4320724cc5	Reapply r158846. Include file MipsGenRegisterInfo.inc. llvm-svn: 159851	2012-07-06 19:29:11 +00:00
Manman Ren	8cbff1360f	X86: peephole optimization to remove cmp instruction For each Cmp, we check whether there is an earlier Sub which make Cmp redundant. We handle the case where SUB operates on the same source operands as Cmp, including the case where the two source operands are swapped. llvm-svn: 159838	2012-07-06 17:36:20 +00:00
NAKAMURA Takumi	bd5a31d598	Revert r159804, "[arm-fast-isel] Add support for vararg function calls." It broke LLVM :: CodeGen/Thumb2/large-call.ll on several hosts. llvm-svn: 159817	2012-07-06 11:12:44 +00:00
Jush Lu	4fdc23801c	[arm-fast-isel] Add support for vararg function calls. llvm-svn: 159804	2012-07-06 03:02:37 +00:00
Jack Carter	69077073a6	Changes per review of commit 159787 Mips specific inline asm operand modifier D. Comment changes and predicate change. llvm-svn: 159802	2012-07-06 02:44:22 +00:00
Jack Carter	d22e7550f9	Mips specific inline asm operand modifier D. Print the second half of a double word operand. The include list was cleaned up a bit as well. Also the test case was modified to test for both big and little patterns. llvm-svn: 159787	2012-07-05 23:58:21 +00:00
Akira Hatanaka	af5cc4a41d	Enclose instruction rdhwr with directives, which are needed when target is mips32 rev1 (the directives are emitted when target is mips32r2 too). llvm-svn: 159770	2012-07-05 19:26:38 +00:00
Jakob Stoklund Olesen	6edf66ffe8	Make X86 call and return instructions non-variadic. Function argument and return value registers aren't part of the encoding, so they should be implicit operands. llvm-svn: 159728	2012-07-04 23:53:27 +00:00
Jakob Stoklund Olesen	795083115c	Ensure CopyToReg nodes are always glued to the call instruction. The CopyToReg nodes that set up the argument registers before a call must be glued to the call instruction. Otherwise, the scheduler may emit the physreg copies long before the call, causing long live ranges for the fixed registers. Besides disabling good register allocation, that can also expose problems when EmitInstrWithCustomInserter() splits a basic block during the live range of a physreg. llvm-svn: 159721	2012-07-04 19:28:31 +00:00
Jakob Stoklund Olesen	79846e5c9b	Add early if-conversion support to X86. Implement the TII hooks needed by EarlyIfConversion to create cmov instructions and estimate their latency. Early if-conversion is still not enabled by default. llvm-svn: 159695	2012-07-04 00:09:58 +00:00
Craig Topper	4644d3577c	Remove extra space. llvm-svn: 159647	2012-07-03 06:48:58 +00:00
Craig Topper	60bbc2fde8	Change i128mem/i256mem to f128mem/f256mem on some floating point vector instructions. llvm-svn: 159646	2012-07-03 06:11:06 +00:00
Craig Topper	6fcb4454a0	Add aliases for pblendvb, blendvpd, and blendvps instructions with the implicit xmm0 operand specified. Fixes PR13252. llvm-svn: 159644	2012-07-03 05:49:45 +00:00
Jack Carter	0e58c3f697	mips32 long long register inline asm constraint support. inlineasm-cnstrnt-bad-r-1.ll is NOT supposed to fail, so it was removed. This resulted in the removal of a negative test (inlineasm-cnstrnt-bad-r-1.ll) llvm-svn: 159625	2012-07-02 23:35:23 +00:00
Eric Christopher	07e1aa6bfe	Revert " mips32 long long register inline asm constraint support." as it appears to be breaking the bots. This reverts commit 1b055ce320fa13f6f1ac81670d11b45e01f79876. llvm-svn: 159619	2012-07-02 23:22:25 +00:00
Evan Cheng	6196c5f5f3	Target option DisableJumpTables is a gross hack. Move it to TargetLowering instead. llvm-svn: 159611	2012-07-02 22:39:56 +00:00
Jack Carter	64aeffc069	mips32 long long register inline asm constraint support. inlineasm-cnstrnt-bad-r-1.ll is NOT supposed to fail, so it was removed. This resulted in the removal of a negative test (inlineasm-cnstrnt-bad-r-1.ll) llvm-svn: 159610	2012-07-02 22:39:45 +00:00
Jack Carter	4355dfdc86	Pass the correct ELFOSABI enumeration to the MipsELFObjectWriter constructor Contributer: Sasa Stankovic llvm-svn: 159574	2012-07-02 20:04:43 +00:00
Bob Wilson	a848f156de	Extend TargetPassConfig to allow running only a subset of the normal passes. This is still a work in progress but I believe it is currently good enough to fix PR13122 "Need unit test driver for codegen IR passes". For example, you can run llc with -stop-after=loop-reduce to have it dump out the IR after running LSR. Serializing machine-level IR is not yet supported but we have some patches in progress for that. The plan is to serialize the IR to a YAML file, containing separate sections for the LLVM IR, machine-level IR, and whatever other info is needed. Chad suggested that we stash the stop-after pass in the YAML file and use that instead of the start-after option to figure out where to restart the compilation. I think that's a great idea, but since it's not implemented yet I put the -start-after option into this patch for testing purposes. llvm-svn: 159570	2012-07-02 19:48:45 +00:00
Bob Wilson	7d344104a7	Consistently use AnalysisID types in TargetPassConfig. This makes it possible to just use a zero value to represent "no pass", so the phony NoPassID global variable is no longer needed. llvm-svn: 159568	2012-07-02 19:48:37 +00:00
Bob Wilson	0a1ef38836	Add all codegen passes to the PassManager via TargetPassConfig. This is a preliminary step toward having TargetPassConfig be able to start and stop the compilation at specified passes for unit testing and debugging. No functionality change. llvm-svn: 159567	2012-07-02 19:48:31 +00:00
Andrew Trick	6c5c71b8be	Revert accidental checkin. My last checkin was apparently not the branch I intended. It was missing one change (added by chandlerc), and contained a spurious change. llvm-svn: 159548	2012-07-02 19:12:29 +00:00
Andrew Trick	baf8a62800	Reapply "Make NumMicroOps a variable in the subtarget's instruction itinerary." Reapplies r159406 with minor cleanup. The regressions appear to have been spurious. llvm-svn: 159541	2012-07-02 18:10:42 +00:00
Bob Wilson	8564204a8c	Do not attempt to use ROR for Thumb1. Patch by Matt Fischer! llvm-svn: 159538	2012-07-02 17:22:47 +00:00
Elena Demikhovsky	0617b5a56c	Optimization of shuffle node that can fit to the register form of VBROADCAST instruction on AVX2. llvm-svn: 159504	2012-07-01 06:12:26 +00:00
Craig Topper	4fc5342fc7	Reduce code size by using a second switch statement to avoid extra calls to SelectAtomic64. Also catch cases where SelectAtomic64 fails. llvm-svn: 159503	2012-07-01 02:55:34 +00:00
Craig Topper	80279ea39f	Add a break to the end of case statement missed in r159501. llvm-svn: 159502	2012-07-01 02:18:18 +00:00
Craig Topper	8b795d08a5	Fix a crash on release builds if gather intrinsics are passed a non-constant value for the last argument. llvm-svn: 159501	2012-07-01 02:17:08 +00:00
Craig Topper	b2a94bd61c	Use a second switch statement to reduce number of calls to SelectGather in code. Reduces code size a bit. llvm-svn: 159500	2012-07-01 02:05:52 +00:00
Manman Ren	01e752886b	ARM: Clean up optimizeCompare in peephole, no functional change. Use getUniqueVRegDef. Replace a loop with existing interfaces: modifiesRegister and readsRegister. Factor out code into inline functions and simplify the code. llvm-svn: 159470	2012-06-29 22:06:19 +00:00
Manman Ren	125c1ee4e9	Add SrcReg2 to analyzeCompare and optimizeCompareInstr to handle Compare instructions with two register operands. llvm-svn: 159465	2012-06-29 21:33:59 +00:00
Chandler Carruth	4b51f99c87	Move llvm/Support/IRBuilder.h -> llvm/IRBuilder.h This was always part of the VMCore library out of necessity -- it deals entirely in the IR. The .cpp file in fact was already part of the VMCore library. This is just a mechanical move. I've tried to go through and re-apply the coding standard's preferred header sort, but at 40-ish files, I may have gotten some wrong. Please let me know if so. I'll be committing the corresponding updates to Clang and Polly, and Duncan has DragonEgg. Thanks to Bill and Eric for giving the green light for this bit of cleanup. llvm-svn: 159421	2012-06-29 12:38:19 +00:00
Andrew Trick	251f64f946	Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." This reverts commit r159406. I noticed a performance regression so I'll back out for now. llvm-svn: 159411	2012-06-29 07:10:41 +00:00
Rafael Espindola	53e0eee9de	In the initial exec mode we always do a load to find the address of a variable. Before this patch in pic 32 bit code we would add the global base register and not load from that address. This is a really old bug, but before the introduction of the tls attributes we would never select initial exec for pic code. llvm-svn: 159409	2012-06-29 04:22:35 +00:00
Andrew Trick	52238a0ce5	Make NumMicroOps a variable in the subtarget's instruction itinerary. The TargetInstrInfo::getNumMicroOps API does not change, but soon it will be used by MachineScheduler. Now each subtarget can specify the number of micro-ops per itinerary class. For ARM, this is currently always dynamic (-1), because it is used for load/store multiple which depends on the number of register operands. Zero is now a valid number of micro-ops. This can be used for nop pseudo-instructions or instructions that the hardware can squash during dispatch. llvm-svn: 159406	2012-06-29 03:23:18 +00:00
Manman Ren	63bf58865a	X86: add more GATHER intrinsics in LLVM Corrected type for index of llvm.x86.avx2.gather.d.pd.256 from 256-bit to 128-bit. Corrected types for src\|dst\|mask of llvm.x86.avx2.gather.q.ps.256 from 256-bit to 128-bit. Support the following intrinsics: llvm.x86.avx2.gather.d.q, llvm.x86.avx2.gather.q.q llvm.x86.avx2.gather.d.q.256, llvm.x86.avx2.gather.q.q.256 llvm.x86.avx2.gather.d.d, llvm.x86.avx2.gather.q.d llvm.x86.avx2.gather.d.d.256, llvm.x86.avx2.gather.q.d.256 llvm-svn: 159402	2012-06-29 00:54:20 +00:00
Jack Carter	14b317545e	Changed the formatting sequence of a curly brace to the comment per code review feedback. llvm-svn: 159376	2012-06-28 20:46:26 +00:00
Bill Wendling	a04b6f6de5	Remove layering violation #include. llvm-svn: 159372	2012-06-28 20:17:05 +00:00
Jack Carter	50778bd9cc	The Mips specific inline asm operand modifier 'z' has the following description in the gnu sources: Print $0 if operand is zero otherwise print the op normally. llvm-svn: 159324	2012-06-28 01:33:40 +00:00
Bill Wendling	e8949ecfa6	Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h. The reasoning is because the DebugInfo module is simply an interface to the debug info MDNodes and has nothing to do with analysis. llvm-svn: 159312	2012-06-28 00:05:13 +00:00
Jack Carter	156781dada	This allows hello world to be compiled for Mips 64 direct object. It takes advantage of r159299 which introduces relocation support for N64. elf-dump needed to be upgraded to support N64 relocations as well. This passes make check. Jack llvm-svn: 159302	2012-06-27 23:13:42 +00:00
Jack Carter	dc890e3c25	This allows hello world to be compiled for Mips 64 direct object. It takes advantage of r159299 which introduces relocation support for N64. elf-dump needed to be upgraded to support N64 relocations as well. This passes make check. Jack llvm-svn: 159301	2012-06-27 22:48:25 +00:00
Chad Rosier	32642a8292	Whitespace. llvm-svn: 159300	2012-06-27 22:34:28 +00:00
Jack Carter	dc0ebcb076	The ELF relocation record format is different for N64 which many Mips 64 ABIs use than for O64 which many if not all other target ABIs use. Most architectures have the following 64 bit relocation record format: typedef struct { Elf64_Addr r_offset; /* Address of reference / Elf64_Xword r_info; / Symbol index and type of relocation / } Elf64_Rel; typedef struct { Elf64_Addr r_offset; Elf64_Xword r_info; Elf64_Sxword r_addend; } Elf64_Rela; Whereas N64 has the following format: typedef struct { Elf64_Addr r_offset;/ Address of reference / Elf64_Word r_sym; / Symbol index / Elf64_Byte r_ssym; / Special symbol / Elf64_Byte r_type3; / Relocation type / Elf64_Byte r_type2; / Relocation type / Elf64_Byte r_type; / Relocation type / } Elf64_Rel; typedef struct { Elf64_Addr r_offset;/ Address of reference / Elf64_Word r_sym; / Symbol index / Elf64_Byte r_ssym; / Special symbol / Elf64_Byte r_type3; / Relocation type / Elf64_Byte r_type2; / Relocation type / Elf64_Byte r_type; / Relocation type */ Elf64_Sxword r_addend; } Elf64_Rela; The structure is the same size, but the r_info data element is now 5 separate elements. Besides the content aspects, endian byte reordering will be different for the area with each element being endianized separately. I treat this as generic and continue to pass r_type as an integer masking and unmasking the byte sized N64 values for N64 mode. I've implemented this and it causes no affect on other current targets. This passes make check. Jack llvm-svn: 159299	2012-06-27 22:28:30 +00:00
Richard Barton	7d5dedd329	Teach assembler to handle capitalised operation values for DSB instructions llvm-svn: 159259	2012-06-27 09:48:23 +00:00
Richard Barton	b71aab9d7f	Prevent ARM Assembler crashing on unrecognised assembly format for DSB instruction llvm-svn: 159257	2012-06-27 09:36:19 +00:00
Akira Hatanaka	309f178268	Silence uninitialized variable warning in MipsISelDAGToDAG.cpp. llvm-svn: 159243	2012-06-27 00:49:46 +00:00
Akira Hatanaka	d7a4867791	Fix bug in computation of stack size in MipsFrameLowering.cpp. llvm-svn: 159240	2012-06-27 00:20:39 +00:00
Evan Cheng	079b7aa2f3	Add a missing check to avoid dereference null. No sensible test case possible. Sorry. rdar://11745134 llvm-svn: 159236	2012-06-26 22:54:59 +00:00
Manman Ren	6be46b7b4c	X86: add GATHER intrinsics (AVX2) in LLVM Support the following intrinsics: llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256 llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256 Modified Disassembler to handle VSIB addressing mode. llvm-svn: 159221	2012-06-26 19:47:59 +00:00
Jack Carter	0d53f88926	There are a number of generic inline asm operand modifiers that up to r158925 were handled as processor specific. Making them generic and putting tests for these modifiers in the CodeGen/Generic directory caused a number of targets to fail. This commit addresses that problem by having the targets call the generic routine for generic modifiers that they don't currently have explicit code for. For now only generic print operands 'c' and 'n' are supported.vi Affected files: test/CodeGen/Generic/asm-large-immediate.ll lib/Target/PowerPC/PPCAsmPrinter.cpp lib/Target/NVPTX/NVPTXAsmPrinter.cpp lib/Target/ARM/ARMAsmPrinter.cpp lib/Target/XCore/XCoreAsmPrinter.cpp lib/Target/X86/X86AsmPrinter.cpp lib/Target/Hexagon/HexagonAsmPrinter.cpp lib/Target/CellSPU/SPUAsmPrinter.cpp lib/Target/Sparc/SparcAsmPrinter.cpp lib/Target/MBlaze/MBlazeAsmPrinter.cpp lib/Target/Mips/MipsAsmPrinter.cpp MSP430 isn't represented because it did not even run with the long existing 'c' modifier and it was not apparent what needs to be done to get it inline asm ready. Contributer: Jack Carter llvm-svn: 159203	2012-06-26 13:49:27 +00:00
Elena Demikhovsky	613f5cf14e	Removed unused variable llvm-svn: 159197	2012-06-26 10:50:07 +00:00
Bill Wendling	1419f7c697	Rename to match other X86_64* names. llvm-svn: 159196	2012-06-26 10:05:06 +00:00
Elena Demikhovsky	832f074a32	Shuffle optimization for AVX/AVX2. The current patch optimizes frequently used shuffle patterns and gives these instruction sequence reduction. Before: vshufps $-35, %xmm1, %xmm0, %xmm2 ## xmm2 = xmm0[1,3],xmm1[1,3] vpermilps $-40, %xmm2, %xmm2 ## xmm2 = xmm2[0,2,1,3] vextractf128 $1, %ymm1, %xmm1 vextractf128 $1, %ymm0, %xmm0 vshufps $-35, %xmm1, %xmm0, %xmm0 ## xmm0 = xmm0[1,3],xmm1[1,3] vpermilps $-40, %xmm0, %xmm0 ## xmm0 = xmm0[0,2,1,3] vinsertf128 $1, %xmm0, %ymm2, %ymm0 After: vshufps $13, %ymm0, %ymm1, %ymm1 ## ymm1 = ymm1[1,3],ymm0[0,0],ymm1[5,7],ymm0[4,4] vshufps $13, %ymm0, %ymm0, %ymm0 ## ymm0 = ymm0[1,3,0,0,5,7,4,4] vunpcklps %ymm1, %ymm0, %ymm0 ## ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5] llvm-svn: 159188	2012-06-26 08:04:10 +00:00
Craig Topper	5c8bdeb3f3	Remove some duplicate instructions that exist only to given different mnemonics for the assembler. Use InstAlias instead. llvm-svn: 159184	2012-06-26 04:12:49 +00:00
Eli Friedman	a3ccee4b33	Make some ugly hacks for inline asm operands which name a specific register a bit more thorough. PR13196. llvm-svn: 159176	2012-06-25 23:42:33 +00:00
Manman Ren	bd339c27e1	ARM: update peephole optimization. More condition codes are included when deciding whether to remove cmp after a sub instruction. Specifically, we extend from GE\|LT\|GT\|LE to GE\|LT\|GT\|LE\|HS\|LS\|HI\|LO\|EQ\|NE. If we have "sub a, b; cmp b, a; movhs", we should be able to replace with "sub a, b; movls". rdar: 11725965 llvm-svn: 159166	2012-06-25 21:49:38 +00:00
Craig Topper	df4e56ebc1	Add SSE2 predicate to CVTPS2PD instructions. Doesn't matter much because there are no patterns in the instruction. llvm-svn: 159127	2012-06-25 06:51:42 +00:00
Craig Topper	2959047de1	Remove codegen only instruction in favor of one that has the same definition. Make some pattern operands more explicit about types. llvm-svn: 159126	2012-06-25 06:16:00 +00:00
Jakob Stoklund Olesen	76fcb51532	%RCX is not a function live-out in eh.return functions. The function live-out registers must be live at all function returns, and %RCX is only used by eh.return. When a function also has a normal return, only %RAX holds a return value. This fixes PR13188. llvm-svn: 159116	2012-06-24 15:53:01 +00:00
NAKAMURA Takumi	4599dee67a	llvm/lib: [CMake] Add explicit dependency to intrinsics_gen. llvm-svn: 159112	2012-06-24 13:32:01 +00:00
Craig Topper	3f09003ac6	Remove intrinsic specific instructions for (V)CVTPS2DQ and replace with patterns. llvm-svn: 159109	2012-06-24 07:07:16 +00:00
Craig Topper	bbce66a591	Remove intrinsic specific instructions for (V)CVTPS2DQ and replace with patterns. llvm-svn: 159108	2012-06-24 06:55:37 +00:00
Craig Topper	677692bc32	Fix build failures from r159106. llvm-svn: 159107	2012-06-24 06:08:31 +00:00
Craig Topper	58ede28106	Remove intrinsic specific instructions for CVTPD2PS and replace with just patterns. llvm-svn: 159106	2012-06-24 05:44:31 +00:00
Craig Topper	7485c05a49	Remove intrinsic specific instructions for CVTPD2DQ. Replace with patterns. llvm-svn: 159105	2012-06-24 05:33:24 +00:00
Pete Cooper	27cd6c8b19	Remove code i'd been testing with but didn't mean to commit. Oops llvm-svn: 159094	2012-06-24 00:08:36 +00:00
Pete Cooper	9f89f00988	DAG legalisation can now handle illegal fma vector types by scalarisation llvm-svn: 159092	2012-06-24 00:05:44 +00:00
Craig Topper	e824497e82	Remove intrinsic specific instructions for (V)CVTDQ2PS. Use a Pat instead instead. llvm-svn: 159090	2012-06-23 22:33:14 +00:00
Craig Topper	59fcd68657	Make CVTDQ2PS instruction use SSE2 predicate instead of SSE1. No functional change because there are no patterns in the instructions. Also fix a typo in a comment. llvm-svn: 159087	2012-06-23 20:52:45 +00:00
Craig Topper	caf5a8e7aa	Move CVTPD2DQ to use SSE2 predicate instead of SSE3. Move DQ2PD and PD2DQ to the SSE2 section of the file. llvm-svn: 159086	2012-06-23 20:15:42 +00:00
Benjamin Kramer	35d213e1c2	Add a microoptimization note. llvm-svn: 159082	2012-06-23 15:19:31 +00:00
Hans Wennborg	8c011bd43a	Extend the IL for selecting TLS models (PR9788) This allows the user/front-end to specify a model that is better than what LLVM would choose by default. For example, a variable might be declared as @x = thread_local(initialexec) global i32 42 if it will not be used in a shared library that is dlopen'ed. If the specified model isn't supported by the target, or if LLVM can make a better choice, a different model may be used. llvm-svn: 159077	2012-06-23 11:37:03 +00:00
Craig Topper	7067c92fbb	Use correct memory types for (V)CVTDQ2PD instructions. llvm-svn: 159075	2012-06-23 08:30:27 +00:00
Craig Topper	3f4f2125fc	Silence an unused variable warning on release builds. llvm-svn: 159074	2012-06-23 08:09:30 +00:00
Craig Topper	0c7acb290b	Compress flags in X86 op folding to reduce space in static tables. llvm-svn: 159073	2012-06-23 08:01:18 +00:00
Craig Topper	0dc1a9bf89	Make helper method static since it doesn't use anything in the class. llvm-svn: 159071	2012-06-23 04:58:41 +00:00
Craig Topper	b05bd41aed	Remove intrinsic specific instructions for 128-bit (V)CVTDQ2PD. Replace with intrinsic patterns. Mem forms omitted because the load size is only 64-bits. llvm-svn: 159070	2012-06-23 04:23:36 +00:00
Rafael Espindola	048a927ab5	Handle aliases to tls variables in all architectures, not just x86. llvm-svn: 159058	2012-06-23 00:30:03 +00:00
Evan Cheng	2d498dc096	(sub X, imm) gets canonicalized to (add X, -imm) There are patterns to handle immediates when they fit in the immediate field. e.g. %sub = add i32 %x, -123 => sub r0, r0, #123 Add patterns to catch immediates that do not fit but should be materialized with a single movw instruction rather than movw + movt pair. e.g. %sub = add i32 %x, -65535 => movw r1, #65535 sub r0, r0, r1 rdar://11726136 llvm-svn: 159057	2012-06-23 00:29:06 +00:00
Jim Grosbach	92de1a3f58	ARM: Add a better diagnostic for some out of range immediates. As an example of how the custom DiagnosticType can be used to provide better operand-mismatch diagnostics, add a custom diagnostic for the imm0_15 operand class used for several system instructions. Update the tests to expect the improved diagnostic. rdar://8987109 llvm-svn: 159051	2012-06-22 23:56:48 +00:00
Hal Finkel	ebe9ea8bd7	Add support for the PPC isel instruction. The isel (integer select) instruction is supported on the 440 and A2 embedded cores and on the POWER7. llvm-svn: 159045	2012-06-22 23:10:08 +00:00
Chad Rosier	25837f2c81	Whitespace. llvm-svn: 159035	2012-06-22 22:07:19 +00:00
Hal Finkel	db4f1462bf	Revert r158679 - use case is unclear (and it increases the memory footprint). Original commit message: Allow up to 64 functional units per processor itinerary. This patch changes the type used to hold the FU bitset from unsigned to uint64_t. This will be needed for some upcoming PowerPC itineraries. llvm-svn: 159027	2012-06-22 20:27:13 +00:00
Andrew Trick	279bd30bbc	Use "NoItineraries" for processors with no itineraries. This makes it explicit when ScoreboardHazardRecognizer will be used. "GenericItineraries" would only make sense if it contained real itinerary values and still required ScoreboardHazardRecognizer. llvm-svn: 158963	2012-06-22 03:58:51 +00:00
Jakob Stoklund Olesen	3efab18404	Functions calling __builtin_eh_return must have a frame pointer. The code in X86TargetLowering::LowerEH_RETURN() assumes that a frame pointer exists, but the frame pointer was forced by the presence of llvm.eh.unwind.init which isn't guaranteed. If llvm.eh.unwind.init is actually required in functions calling eh.return (is it?), we should diagnose that instead of emitting bad machine code. This should fix the dragonegg-x86_64-linux-gcc-4.6-test bot. llvm-svn: 158961	2012-06-22 03:04:27 +00:00
Andrew Trick	d1966640b2	ARM scheduling fix: don't guess at implicit operand latency. This is a minor drive-by fix with no robust way to unit test. As an example see neon-div.ll: SU(16): %Q8<def> = VMOVLsv4i32 %D17, pred:14, pred:%noreg, %Q8<imp-use,kill> val SU(1): Latency=2 Reg=%Q8 ...should be latency=1 llvm-svn: 158960	2012-06-22 02:50:33 +00:00
Andrew Trick	764fe3bfef	ARM scheduling fix: compute predicated implicit use properly. Minor drive by fix to cleanup latency computation. Calling getOperandLatency with a deliberately incorrect operand index does not give you the latency you want. llvm-svn: 158959	2012-06-22 02:50:31 +00:00
Lang Hames	68cf87e3ef	Rename -allow-excess-fp-precision flag to -fuse-fp-ops, and switch from a boolean flag to an enum: { Fast, Standard, Strict } (default = Standard). This option controls the creation by optimizations of fused FP ops that store intermediate results in higher precision than IEEE allows (E.g. FMAs). The behavior of this option is intended to match the behaviour specified by a soon-to-be-introduced frontend flag: '-ffuse-fp-ops'. Fast mode - allows formation of fused FP ops whenever they're profitable. Standard mode - allow fusion only for 'blessed' FP ops. At present the only blessed op is the fmuladd intrinsic. In the future more blessed ops may be added. Strict mode - allow fusion only if/when it can be proven that the excess precision won't effect the result. Note: This option only controls formation of fused ops by the optimizers. Fused operations that are explicitly requested (e.g. FMA via the llvm.fma.* intrinsic) will always be honored, regardless of the value of this option. Internally TargetOptions::AllowExcessFPPrecision has been replaced by TargetOptions::AllowFPOpFusion. llvm-svn: 158956	2012-06-22 01:09:09 +00:00
Hal Finkel	2eb4a5326e	Convert the PPC backend to use the new FMA infrastructure. The existing contraction patterns are replaced with fma/fneg. Overall functionality should be the same. llvm-svn: 158955	2012-06-22 00:49:52 +00:00
Akira Hatanaka	021bc1c8a2	1. fix null program output after some other changes 2. re-enable null.ll test 3. fix some minor style violations Patch by Reed Kotler. llvm-svn: 158935	2012-06-21 20:39:10 +00:00
Hal Finkel	bc9be7c0e5	Treat TargetGlobalAddress as a constant for the purpose of matching pre-inc stores on PPC. Thanks to Tobias von Koch for pointing out this problem. llvm-svn: 158932	2012-06-21 20:10:48 +00:00
Jack Carter	533bef32ae	The inline asm operand modifier 'c' is suppose to be generic across architectures. It has the following description in the gnu sources: Substitute immediate value without immediate syntax Several Architectures such as x86 have local implementations of operand modifier 'c' which go beyond the above description slightly. To make use of the generic modifiers without overriding local implementation one can make a call to the base class method for AsmPrinter::PrintAsmOperand() in the locally derived method's "default" case in the switch statement. That way if it is already defined locally the generic version will never get called. This change is needed when test/CodeGen/generic/asm-large-immediate.ll failed on a native Mips board. The test was assuming a generic implementation was in place. Affected files: lib/Target/Mips/MipsAsmPrinter.cpp: Changed the default case to call the base method. lib/CodeGen/AsmPrinter/AsmPrinterInlineAsm.cpp Added 'c' to the switch cases. test/CodeGen/Mips/asm-large-immediate.ll Mips compiled version of the generic one Contributer: Jack Carter llvm-svn: 158925	2012-06-21 17:14:46 +00:00
Lang Hames	662801dbc8	Add a missing llvm.fma -> VFNMS pattern to the ARM backend. llvm-svn: 158902	2012-06-21 06:10:00 +00:00
Akira Hatanaka	9a6df0f613	Revert r158846. llvm-svn: 158855	2012-06-20 21:19:39 +00:00
Akira Hatanaka	f8ce377e38	In MipsDisassembler.cpp, instead of defining register class tables, use the ones that are generated by TableGen and are already available in MipsGenRegisterInfo.inc. Suggested by Jakob Stoklund Olesen. Also, fix bug in function DecodeAFGR64RegisterClass. Patch by Vladimir Medic. llvm-svn: 158846	2012-06-20 20:39:23 +00:00
Hal Finkel	a94da28a6d	Add support for generating reg+reg (indexed) pre-inc loads on PPC. llvm-svn: 158823	2012-06-20 15:43:03 +00:00
Chandler Carruth	6f8cc37074	Remove 'static' from inline functions defined in header files. There is a pretty staggering amount of this in LLVM's header files, this is not all of the instances I'm afraid. These include all of the functions that (in my build) are used by a non-static inline (or external) function. Specifically, these issues were caught by the new '-Winternal-linkage-in-inline' warning. I'll try to just clean up the remainder of the clearly redundant "static inline" cases on functions (not methods!) defined within headers if I can do so in a reliable way. There were even several cases of a missing 'inline' altogether, or my personal favorite "static bool inline". Go figure. ;] llvm-svn: 158800	2012-06-20 08:39:33 +00:00
Craig Topper	f19d6cef51	Add predicate check around some patterns. llvm-svn: 158797	2012-06-20 07:30:23 +00:00
Craig Topper	54d8fe551b	Add predicate check around some patterns. llvm-svn: 158795	2012-06-20 07:01:11 +00:00
Craig Topper	d63e429d68	Don't insert 128-bit UNDEF into 256-bit vectors. Just keep the 256-bit vector. Original patch by Elena Demikhovsky. Tweaked by me to allow possibility of covering more cases. llvm-svn: 158792	2012-06-20 05:39:26 +00:00
Lang Hames	f0b9601a6d	Add DAG-combines for aggressive FMA formation. This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or FSUB + FMUL. The combines are performed when: (a) Either AllowExcessFPPrecision option (-enable-excess-fp-precision for llc) OR UnsafeFPMath option (-enable-unsafe-fp-math) are set, and (b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of the FADD/FSUB, and (c) The FMUL only has one user (the FADD/FSUB). If your target has fast FMA instructions you can make use of these combines by overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for types supported by your FMA instruction, and adding patterns to match ISD::FMA to your FMA instructions. llvm-svn: 158757	2012-06-19 22:51:23 +00:00
Jakob Stoklund Olesen	66e7517610	Implement PPCInstrInfo::isCoalescableExtInstr(). The PPC::EXTSW instruction preserves the low 32 bits of its input, just like some of the x86 instructions. Use it to reduce register pressure when the low 32 bits have multiple uses. This requires a small change to PeepholeOptimizer since EXTSW takes a 64-bit input register. This is related to PR5997. llvm-svn: 158743	2012-06-19 21:14:34 +00:00
Jan Wen Voung	fa15c02364	Have ARM ELF use correct reloc for "b" instr. The condition code didn't actually matter for arm "b" instructions, unlike "bl". It should just use the R_ARM_JUMP24 reloc. llvm-svn: 158722	2012-06-19 16:03:02 +00:00
Hal Finkel	12c1b6478a	Mark most PPC register classes to avoid write-after-write. For processors with the G5-like instruction-grouping scheme, this helps avoid early group termination due to a write-after-write dependency within the group. It should also help on pipelined embedded cores. On POWER7, over the test suite, this gives an average 0.5% speedup. The largest speedups are: SingleSource/Benchmarks/Stanford/Quicksort - 33% MultiSource/Applications/d/make_dparser - 21% MultiSource/Benchmarks/FreeBench/analyzer/analyzer - 12% MultiSource/Benchmarks/MiBench/telecomm-FFT/telecomm-fft - 12% Largest slowdowns: SingleSource/Benchmarks/Stanford/Bubblesort - 23% MultiSource/Benchmarks/Prolangs-C++/city/city - 21% MultiSource/Benchmarks/BitBench/uuencode/uuencode - 16% MultiSource/Benchmarks/mediabench/mpeg2/mpeg2dec/mpeg2decode - 13% llvm-svn: 158719	2012-06-19 13:57:17 +00:00
Akira Hatanaka	b98d06727e	Make MipsLongBranch::runOnMachineFunction return true. llvm-svn: 158702	2012-06-19 03:45:29 +00:00
Akira Hatanaka	2c0d5a881d	Use MachineBasicBlock::instr_iterator instead of MachineBasicBlock::iterator in MipsCodeEmitter.cpp. llvm-svn: 158701	2012-06-19 03:39:45 +00:00
Hal Finkel	42b797225a	Add support for generating reg+reg preinc stores on PPC. PPC will now generate STWUX and friends. llvm-svn: 158698	2012-06-19 02:34:32 +00:00
Rafael Espindola	38c45a939d	Move the support for using .init_array from ARM to the generic TargetLoweringObjectFileELF. Use this to support it on X86. Unlike ARM, on X86 it is not easy to find out if .init_array should be used or not, so the decision is made via TargetOptions and defaults to off. Add a command line option to llc that enables it. llvm-svn: 158692	2012-06-19 00:48:28 +00:00
Manman Ren	6d2895c506	ARM: use NOEN loads and stores if possible when handling struct byval. This change is to be enabled in clang. rdar://9877866 llvm-svn: 158684	2012-06-18 22:23:48 +00:00
Hal Finkel	56f4d93767	Allow up to 64 functional units per processor itinerary. This patch changes the type used to hold the FU bitset from unsigned to uint64_t. This will be needed for some upcoming PowerPC itineraries. llvm-svn: 158679	2012-06-18 21:08:18 +00:00
Jim Grosbach	6ea9efb4e5	ARM: Define generic HINT instruction. The NOP, WFE, WFI, SEV and YIELD instructions are all hints w/ a different immediate value in bits [7,0]. Define a generic HINT instruction and refactor NOP, WFI, WFI, SEV and YIELD to be assembly aliases of that. rdar://11600518 llvm-svn: 158674	2012-06-18 19:45:50 +00:00
Joel Jones	3d5ae56be4	This change handles a another case for generating the bic instruction when a compile time constant is known. This occurs when implicitly zero extending function arguments from 16 bits to 32 bits. The 8 bit case doesn't need to be handled, as the 8 bit constants are encoded directly, thereby not needing a separate load instruction to form the constant into a register. <rdar://problem/11481151> llvm-svn: 158659	2012-06-18 14:51:32 +00:00
Chandler Carruth	d2716ae111	Temporarily revert r158087. This patch causes problems when both dynamic stack realignment and dynamic allocas combine in the same function. With this patch, we no longer build the epilog correctly, and silently restore registers from the wrong position in the stack. Thanks to Matt for tracking this down, and getting at least an initial test case to Chad. I'm going to try to check a variation of that test case in so we can easily track the fixes required. llvm-svn: 158654	2012-06-18 07:03:12 +00:00
Hal Finkel	40483bafbf	Cleanup trip-count finding for PPC CTR loops (and some bug fixes). This cleans up the method used to find trip counts in order to form CTR loops on PPC. This refactoring allows the pass to find loops which have a constant trip count but also happen to end with a comparison to zero. This also adds explicit FIXMEs to mark two different classes of loops that are currently ignored. In addition, we now search through all potential induction operations instead of just the first. Also, we check the predicate code on the conditional branch and abort the transformation if the code is not EQ or NE, and we then make sure that the branch to be transformed matches the condition register defined by the comparison (multiple possible comparisons will be considered). llvm-svn: 158607	2012-06-16 20:34:07 +00:00
Kay Tiong Khoo	7247ab8114	*no need to pollute Intel syntax with bonus mnemonics; operand size is explicitly specified llvm-svn: 158603	2012-06-16 17:19:49 +00:00
NAKAMURA Takumi	b10b335713	Mips/AsmParser/CMakeLists.txt: Fix dependency. llvm-svn: 158602	2012-06-16 15:33:52 +00:00
Kevin Enderby	4964b6a4e2	Fix the encoding of the armv7m (MClass) for MSR registers other than aspr, iaspr, espr and xpsr which also needed to have 0b10 in their mask encoding bits. llvm-svn: 158560	2012-06-15 22:14:44 +00:00
Manman Ren	7ffcd63dea	ARM: optimization for sub+abs. This patch will optimize abs(x-y) FROM sub, movs, rsbmi TO subs, rsbmi For abs, we will use cmp instead of movs. This is necessary because we already have an existing peephole pass which optimizes away cmp following sub. rdar: 11633193 llvm-svn: 158551	2012-06-15 21:32:12 +00:00
Kay Tiong Khoo	a419828b83	*fixed to separate mnemonic from operands with tab llvm-svn: 158543	2012-06-15 21:04:21 +00:00
Jakob Stoklund Olesen	6fd22231ba	Preserve <undef> flags in ARMExpandPseudo. This probably mostly shows up in bugpoint-generated code. llvm-svn: 158527	2012-06-15 17:46:54 +00:00
Craig Topper	19cfb998fd	Move AVX version of convert instructions that write to GPRs to the Op1 table. llvm-svn: 158497	2012-06-15 07:02:58 +00:00
Pete Cooper	e1c5e7bf9f	Move X86::VCVTTSD2SIrr from the 2 operand to 1 operand MemRegOp table. Can someone with more knowledge of this please look at other entries to see if others need moved. llvm-svn: 158474	2012-06-14 22:12:58 +00:00
Akira Hatanaka	5e9724637e	Fix coding style violations. Remove white spaces and tabs. llvm-svn: 158471	2012-06-14 21:10:56 +00:00
Akira Hatanaka	d1b2b96ed5	1. introduce MipsPat in place of Pat in order to exclude those from being used by Mips16 or Micro Mips 2. clean up a few lines too long encountered Patch by Reed Kotler. llvm-svn: 158470	2012-06-14 21:03:23 +00:00
NAKAMURA Takumi	cf2652ae8c	MipsLongBranch.cpp: Tweak llvm::next() to appease msvc. llvm-svn: 158446	2012-06-14 12:29:48 +00:00
Richard Barton	2a7d06a53e	Replace assertion failure for badly formatted CPS instrution with error message. llvm-svn: 158445	2012-06-14 10:48:04 +00:00
Jush Lu	6dd02e5fe3	Cleanup whitespace. llvm-svn: 158443	2012-06-14 06:08:19 +00:00
Akira Hatanaka	012069bb89	Fix Mips/CMakeLists.txt. llvm-svn: 158437	2012-06-14 01:23:55 +00:00
Akira Hatanaka	70ebace503	Add file MipsLongBranch.cpp. llvm-svn: 158436	2012-06-14 01:22:24 +00:00
Akira Hatanaka	7ea45292fb	Remove code in MipsAsmPrinter and MipsMCInstLower. llvm-svn: 158434	2012-06-14 01:20:12 +00:00
Akira Hatanaka	0d20b51ff7	Add long branch expansion pass for MIPS. llvm-svn: 158433	2012-06-14 01:19:35 +00:00
Akira Hatanaka	19512459e6	Add AT to the list of registers clobbered by branches so that it is available as a scratch register when they are expanded to long branches. llvm-svn: 158432	2012-06-14 01:17:59 +00:00
Akira Hatanaka	fb3c87c739	In MipsRegisterInfo::eliminateFrameIndex, call Mips::loadImmediate to load an immediate that does not fit into 16-bit. llvm-svn: 158431	2012-06-14 01:17:36 +00:00
Akira Hatanaka	415903692b	In MipsFrameLowering::emitPrologue and emitEpilogue, call Mips::loadImmediate to load an immediate that does not fit into 16-bit. Also, take into consideration the global base register slot on the stack when computing the stack size. llvm-svn: 158430	2012-06-14 01:17:13 +00:00
Akira Hatanaka	8f2f845215	Define function MipsInstrInfo::GetInstSizeInBytes, which will be called to compute the size of basic blocks in a function. Also, define a function which emits a series of instructions to load an immediate. llvm-svn: 158429	2012-06-14 01:16:45 +00:00
Akira Hatanaka	afa4622baf	In MipsISelDAGToDAG.cpp, store the global base register to a stack frame object. Long-branches need access to the global base register to get the destination address. llvm-svn: 158428	2012-06-14 01:16:15 +00:00
Akira Hatanaka	2784db9e87	Add methods to MipsFunctionInfo for initializing and accessing the stack frame object for the global base register. This is the first of a series of patches which implements long branch expansion for MIPS. llvm-svn: 158427	2012-06-14 01:15:36 +00:00

... 2 3 4 5 6 ...

21852 Commits