llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 06:22:56 +02:00

Author	SHA1	Message	Date
Akira Hatanaka	f8ce377e38	In MipsDisassembler.cpp, instead of defining register class tables, use the ones that are generated by TableGen and are already available in MipsGenRegisterInfo.inc. Suggested by Jakob Stoklund Olesen. Also, fix bug in function DecodeAFGR64RegisterClass. Patch by Vladimir Medic. llvm-svn: 158846	2012-06-20 20:39:23 +00:00
Hal Finkel	a94da28a6d	Add support for generating reg+reg (indexed) pre-inc loads on PPC. llvm-svn: 158823	2012-06-20 15:43:03 +00:00
Chandler Carruth	6f8cc37074	Remove 'static' from inline functions defined in header files. There is a pretty staggering amount of this in LLVM's header files, this is not all of the instances I'm afraid. These include all of the functions that (in my build) are used by a non-static inline (or external) function. Specifically, these issues were caught by the new '-Winternal-linkage-in-inline' warning. I'll try to just clean up the remainder of the clearly redundant "static inline" cases on functions (not methods!) defined within headers if I can do so in a reliable way. There were even several cases of a missing 'inline' altogether, or my personal favorite "static bool inline". Go figure. ;] llvm-svn: 158800	2012-06-20 08:39:33 +00:00
Craig Topper	f19d6cef51	Add predicate check around some patterns. llvm-svn: 158797	2012-06-20 07:30:23 +00:00
Craig Topper	54d8fe551b	Add predicate check around some patterns. llvm-svn: 158795	2012-06-20 07:01:11 +00:00
Craig Topper	d63e429d68	Don't insert 128-bit UNDEF into 256-bit vectors. Just keep the 256-bit vector. Original patch by Elena Demikhovsky. Tweaked by me to allow possibility of covering more cases. llvm-svn: 158792	2012-06-20 05:39:26 +00:00
Lang Hames	f0b9601a6d	Add DAG-combines for aggressive FMA formation. This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or FSUB + FMUL. The combines are performed when: (a) Either AllowExcessFPPrecision option (-enable-excess-fp-precision for llc) OR UnsafeFPMath option (-enable-unsafe-fp-math) are set, and (b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of the FADD/FSUB, and (c) The FMUL only has one user (the FADD/FSUB). If your target has fast FMA instructions you can make use of these combines by overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for types supported by your FMA instruction, and adding patterns to match ISD::FMA to your FMA instructions. llvm-svn: 158757	2012-06-19 22:51:23 +00:00
Jakob Stoklund Olesen	66e7517610	Implement PPCInstrInfo::isCoalescableExtInstr(). The PPC::EXTSW instruction preserves the low 32 bits of its input, just like some of the x86 instructions. Use it to reduce register pressure when the low 32 bits have multiple uses. This requires a small change to PeepholeOptimizer since EXTSW takes a 64-bit input register. This is related to PR5997. llvm-svn: 158743	2012-06-19 21:14:34 +00:00
Jan Wen Voung	fa15c02364	Have ARM ELF use correct reloc for "b" instr. The condition code didn't actually matter for arm "b" instructions, unlike "bl". It should just use the R_ARM_JUMP24 reloc. llvm-svn: 158722	2012-06-19 16:03:02 +00:00
Hal Finkel	12c1b6478a	Mark most PPC register classes to avoid write-after-write. For processors with the G5-like instruction-grouping scheme, this helps avoid early group termination due to a write-after-write dependency within the group. It should also help on pipelined embedded cores. On POWER7, over the test suite, this gives an average 0.5% speedup. The largest speedups are: SingleSource/Benchmarks/Stanford/Quicksort - 33% MultiSource/Applications/d/make_dparser - 21% MultiSource/Benchmarks/FreeBench/analyzer/analyzer - 12% MultiSource/Benchmarks/MiBench/telecomm-FFT/telecomm-fft - 12% Largest slowdowns: SingleSource/Benchmarks/Stanford/Bubblesort - 23% MultiSource/Benchmarks/Prolangs-C++/city/city - 21% MultiSource/Benchmarks/BitBench/uuencode/uuencode - 16% MultiSource/Benchmarks/mediabench/mpeg2/mpeg2dec/mpeg2decode - 13% llvm-svn: 158719	2012-06-19 13:57:17 +00:00
Akira Hatanaka	b98d06727e	Make MipsLongBranch::runOnMachineFunction return true. llvm-svn: 158702	2012-06-19 03:45:29 +00:00
Akira Hatanaka	2c0d5a881d	Use MachineBasicBlock::instr_iterator instead of MachineBasicBlock::iterator in MipsCodeEmitter.cpp. llvm-svn: 158701	2012-06-19 03:39:45 +00:00
Hal Finkel	42b797225a	Add support for generating reg+reg preinc stores on PPC. PPC will now generate STWUX and friends. llvm-svn: 158698	2012-06-19 02:34:32 +00:00
Rafael Espindola	38c45a939d	Move the support for using .init_array from ARM to the generic TargetLoweringObjectFileELF. Use this to support it on X86. Unlike ARM, on X86 it is not easy to find out if .init_array should be used or not, so the decision is made via TargetOptions and defaults to off. Add a command line option to llc that enables it. llvm-svn: 158692	2012-06-19 00:48:28 +00:00
Manman Ren	6d2895c506	ARM: use NEON loads and stores if possible when handling struct byval. This change is to be enabled in clang. rdar://9877866 llvm-svn: 158684	2012-06-18 22:23:48 +00:00
Hal Finkel	56f4d93767	Allow up to 64 functional units per processor itinerary. This patch changes the type used to hold the FU bitset from unsigned to uint64_t. This will be needed for some upcoming PowerPC itineraries. llvm-svn: 158679	2012-06-18 21:08:18 +00:00
Jim Grosbach	6ea9efb4e5	ARM: Define generic HINT instruction. The NOP, WFE, WFI, SEV and YIELD instructions are all hints w/ a different immediate value in bits [7,0]. Define a generic HINT instruction and refactor NOP, WFE, WFI, SEV and YIELD to be assembly aliases of that. rdar://11600518 llvm-svn: 158674	2012-06-18 19:45:50 +00:00
Joel Jones	3d5ae56be4	This change handles another case for generating the bic instruction when a compile time constant is known. This occurs when implicitly zero extending function arguments from 16 bits to 32 bits. The 8 bit case doesn't need to be handled, as the 8 bit constants are encoded directly, thereby not needing a separate load instruction to form the constant into a register. <rdar://problem/11481151> llvm-svn: 158659	2012-06-18 14:51:32 +00:00
Chandler Carruth	d2716ae111	Temporarily revert r158087. This patch causes problems when both dynamic stack realignment and dynamic allocas combine in the same function. With this patch, we no longer build the epilog correctly, and silently restore registers from the wrong position in the stack. Thanks to Matt for tracking this down, and getting at least an initial test case to Chad. I'm going to try to check a variation of that test case in so we can easily track the fixes required. llvm-svn: 158654	2012-06-18 07:03:12 +00:00
Hal Finkel	40483bafbf	Cleanup trip-count finding for PPC CTR loops (and some bug fixes). This cleans up the method used to find trip counts in order to form CTR loops on PPC. This refactoring allows the pass to find loops which have a constant trip count but also happen to end with a comparison to zero. This also adds explicit FIXMEs to mark two different classes of loops that are currently ignored. In addition, we now search through all potential induction operations instead of just the first. Also, we check the predicate code on the conditional branch and abort the transformation if the code is not EQ or NE, and we then make sure that the branch to be transformed matches the condition register defined by the comparison (multiple possible comparisons will be considered). llvm-svn: 158607	2012-06-16 20:34:07 +00:00
Kay Tiong Khoo	7247ab8114	*no need to pollute Intel syntax with bonus mnemonics; operand size is explicitly specified llvm-svn: 158603	2012-06-16 17:19:49 +00:00
NAKAMURA Takumi	b10b335713	Mips/AsmParser/CMakeLists.txt: Fix dependency. llvm-svn: 158602	2012-06-16 15:33:52 +00:00
Kevin Enderby	4964b6a4e2	Fix the encoding of the armv7m (MClass) for MSR registers other than aspr, iaspr, espr and xpsr which also needed to have 0b10 in their mask encoding bits. llvm-svn: 158560	2012-06-15 22:14:44 +00:00
Manman Ren	7ffcd63dea	ARM: optimization for sub+abs. This patch will optimize abs(x-y) FROM "sub, movs, rsbmi" TO "subs, rsbmi". For abs, we will use cmp instead of movs. This is necessary because we already have an existing peephole pass which optimizes away cmp following sub. rdar: 11633193 llvm-svn: 158551	2012-06-15 21:32:12 +00:00
Kay Tiong Khoo	a419828b83	*fixed to separate mnemonic from operands with tab llvm-svn: 158543	2012-06-15 21:04:21 +00:00
Jakob Stoklund Olesen	6fd22231ba	Preserve <undef> flags in ARMExpandPseudo. This probably mostly shows up in bugpoint-generated code. llvm-svn: 158527	2012-06-15 17:46:54 +00:00
Craig Topper	19cfb998fd	Move AVX version of convert instructions that write to GPRs to the Op1 table. llvm-svn: 158497	2012-06-15 07:02:58 +00:00
Pete Cooper	e1c5e7bf9f	Move X86::VCVTTSD2SIrr from the 2 operand to 1 operand MemRegOp table. Can someone with more knowledge of this please look at other entries to see if others need moved. llvm-svn: 158474	2012-06-14 22:12:58 +00:00
Akira Hatanaka	5e9724637e	Fix coding style violations. Remove white spaces and tabs. llvm-svn: 158471	2012-06-14 21:10:56 +00:00
Akira Hatanaka	d1b2b96ed5	1. Introduce MipsPat in place of Pat in order to exclude those patterns from being used by Mips16 or Micro Mips. 2. Clean up a few lines that were too long. Patch by Reed Kotler. llvm-svn: 158470	2012-06-14 21:03:23 +00:00
NAKAMURA Takumi	cf2652ae8c	MipsLongBranch.cpp: Tweak llvm::next() to appease msvc. llvm-svn: 158446	2012-06-14 12:29:48 +00:00
Richard Barton	2a7d06a53e	Replace assertion failure for badly formatted CPS instruction with error message. llvm-svn: 158445	2012-06-14 10:48:04 +00:00
Jush Lu	6dd02e5fe3	Cleanup whitespace. llvm-svn: 158443	2012-06-14 06:08:19 +00:00
Akira Hatanaka	012069bb89	Fix Mips/CMakeLists.txt. llvm-svn: 158437	2012-06-14 01:23:55 +00:00
Akira Hatanaka	70ebace503	Add file MipsLongBranch.cpp. llvm-svn: 158436	2012-06-14 01:22:24 +00:00
Akira Hatanaka	7ea45292fb	Remove code in MipsAsmPrinter and MipsMCInstLower. llvm-svn: 158434	2012-06-14 01:20:12 +00:00
Akira Hatanaka	0d20b51ff7	Add long branch expansion pass for MIPS. llvm-svn: 158433	2012-06-14 01:19:35 +00:00
Akira Hatanaka	19512459e6	Add AT to the list of registers clobbered by branches so that it is available as a scratch register when they are expanded to long branches. llvm-svn: 158432	2012-06-14 01:17:59 +00:00
Akira Hatanaka	fb3c87c739	In MipsRegisterInfo::eliminateFrameIndex, call Mips::loadImmediate to load an immediate that does not fit into 16-bit. llvm-svn: 158431	2012-06-14 01:17:36 +00:00
Akira Hatanaka	415903692b	In MipsFrameLowering::emitPrologue and emitEpilogue, call Mips::loadImmediate to load an immediate that does not fit into 16-bit. Also, take into consideration the global base register slot on the stack when computing the stack size. llvm-svn: 158430	2012-06-14 01:17:13 +00:00
Akira Hatanaka	8f2f845215	Define function MipsInstrInfo::GetInstSizeInBytes, which will be called to compute the size of basic blocks in a function. Also, define a function which emits a series of instructions to load an immediate. llvm-svn: 158429	2012-06-14 01:16:45 +00:00
Akira Hatanaka	afa4622baf	In MipsISelDAGToDAG.cpp, store the global base register to a stack frame object. Long-branches need access to the global base register to get the destination address. llvm-svn: 158428	2012-06-14 01:16:15 +00:00
Akira Hatanaka	2784db9e87	Add methods to MipsFunctionInfo for initializing and accessing the stack frame object for the global base register. This is the first of a series of patches which implements long branch expansion for MIPS. llvm-svn: 158427	2012-06-14 01:15:36 +00:00
Akira Hatanaka	2f3e3d6ece	Bundle jump/branch instructions with the instructions in the delay slot in delay slot filler pass of MIPS, per suggestion of Jakob Stoklund Olesen. This change, along with the fix in r158154, enables machine verification to be run after delay slot filling. llvm-svn: 158426	2012-06-13 23:25:52 +00:00
Akira Hatanaka	0435101a38	Implement a DAGCombine in MipsISelLowering.cpp which transforms the following pattern: (add v0, (add v1, abs_lo(tjt))) => (add (add v0, v1), abs_lo(tjt)) "tjt" is a TargetJumpTable node. llvm-svn: 158419	2012-06-13 20:33:18 +00:00
Akira Hatanaka	fef6359b1c	Set a higher value for maxStoresPerMemcpy in MipsISelLowering.cpp. llvm-svn: 158414	2012-06-13 19:33:32 +00:00
Akira Hatanaka	cf4210f6d7	Simplify CreateLoadLR and CreateStoreLR in MipsISelLowering.cpp. llvm-svn: 158413	2012-06-13 19:06:08 +00:00
Akira Hatanaka	25f2f1feba	Implement fastcc calling convention for MIPS. llvm-svn: 158410	2012-06-13 18:06:00 +00:00
Richard Osborne	96c0be7351	Fix pattern for MKMSK instruction. llvm-svn: 158409	2012-06-13 17:59:12 +00:00
Kay Tiong Khoo	b631f7fd59	*typo: Cyles changed to Cycles llvm-svn: 158404	2012-06-13 15:53:04 +00:00

21545 Commits