llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 14:02:52 +02:00

Author	SHA1	Message	Date
Chandler Carruth	35ed75156a	Hoist the insertVector helper to be a static helper. This will allow its use inside of memcpy rewriting as well. This routine is more complex than extractVector, and some of its uses are not 100% where I want them to be so there is still some work to do here. While this can technically change the output in some cases, it shouldn't be a change that matters -- IE, it can leave some dead code lying around that prior versions did not, etc. Yet another step in the refactorings leading up to the solution to the last component of PR14478. llvm-svn: 170328	2012-12-17 13:41:21 +00:00
Richard Osborne	92f0d25122	Add instruction encodings for ZEXT and SEXT. Previously these were marked with the wrong format. llvm-svn: 170327	2012-12-17 13:20:37 +00:00
Chandler Carruth	bbd2c1ab94	Lift the extractVector helper all the way out to a static helper function. The method helpers all implicitly act upon the alloca, and what we really want is a fully generic helper. Doing memcpy rewrites is more special than all other rewrites because we are at times rewriting instructions which touch pointers other than the alloca. As a consequence all of the helpers needed by memcpy rewriting of sub-vector copies will need to be generalized fully. Note that all of these helpers ({insert,extract}{Integer,Vector}) are woefully uncommented. I'm going to go back through and document them once I get the factoring correct. No functionality changed. llvm-svn: 170325	2012-12-17 13:07:30 +00:00
Chandler Carruth	af6b524242	Factor the vector load rewriting into a more generic form. This makes it suitable for use in rewriting memcpy in the presence of subvector memcpy intrinsics. No functionality changed. llvm-svn: 170324	2012-12-17 12:50:21 +00:00
Richard Osborne	cccafe2726	Add instruction encodings / disassembly support for 2r instructions. llvm-svn: 170323	2012-12-17 12:29:31 +00:00
Richard Osborne	815fca9724	Add instruction encodings / disassembly support for 0r instructions. llvm-svn: 170322	2012-12-17 12:26:29 +00:00
Richard Osborne	eb9aa13f77	Simplify assertion in XCoreInstPrinter. llvm-svn: 170321	2012-12-17 12:13:46 +00:00
Richard Osborne	dea0ec7732	Update comments to match recommended doxygen style. llvm-svn: 170320	2012-12-17 12:13:41 +00:00
Richard Osborne	ff9dff6f74	Remove unnecessary include. llvm-svn: 170319	2012-12-17 12:13:32 +00:00
Duncan Sands	eb1689f725	Fix typo that results in new landing pads not getting a name, fixing PR14617. Patch by Chris Toshok. llvm-svn: 170318	2012-12-17 12:02:36 +00:00
Duncan Sands	bea16337de	Fix comment typo. llvm-svn: 170317	2012-12-17 11:43:15 +00:00
Craig Topper	3949d5f59c	Remove EFLAGS from the BLSI/BLSMSK/BLSR patterns. The nodes created by DAG combine don't contain an EFLAGS def. llvm-svn: 170308	2012-12-17 06:13:48 +00:00
Craig Topper	1c6c752718	Simplify BMI ANDN matching to use patterns instead of a DAG combine. Also add ANDN to isDefConvertible. llvm-svn: 170305	2012-12-17 05:12:30 +00:00
Craig Topper	8069d5b1e9	Add rest of BMI/BMI2 instructions to the folding tables as well as popcnt and lzcnt. llvm-svn: 170304	2012-12-17 05:02:29 +00:00
Craig Topper	fc6a3b9038	Remove store forms of DEC/INC from isDefConvertible. Since they are stores they don't have a register def. llvm-svn: 170303	2012-12-17 04:55:07 +00:00
Chandler Carruth	a079cd0144	Fix the first part of PR14478: memset now works. PR14478 highlights a serious problem in SROA that simply wasn't being exercised due to a lack of vector input code mixed with C-library function calls. Part of SROA was written carefully to handle subvector accesses via memset and memcpy, but the rewriter never grew support for this. Fixing it required refactoring the subvector access code in other parts of SROA so it could be shared, and then fixing the splat formation logic and using subvector insertion (this patch). The PR isn't quite fixed yet, as memcpy is still broken in the same way. I'm starting on that series of patches now. Hopefully this will be enough to bring the bullet benchmark back to life with the bb-vectorizer enabled, but that may require fixing memcpy as well. llvm-svn: 170301	2012-12-17 04:07:37 +00:00
Chandler Carruth	dae722c19d	Extract the logic for inserting a subvector into a vector alloca. No functionality changed. Another step of refactoring toward solving PR14487. llvm-svn: 170300	2012-12-17 04:07:35 +00:00
Chandler Carruth	fefd557661	Lift the integer splat computation into a helper function. No functionality changed. Refactoring leading up to the fix for PR14478 which requires some significant changes to the memset and memcpy rewriting. llvm-svn: 170299	2012-12-17 04:07:30 +00:00
Craig Topper	0b33954436	Add debug prints for when optimizeLoadInstr folds a load. llvm-svn: 170298	2012-12-17 03:56:00 +00:00
Richard Osborne	91f0d743ac	Add tests for disassembly of 1r XCore instructions. llvm-svn: 170295	2012-12-16 18:06:30 +00:00
Richard Osborne	5c2cbafeba	Add instruction encodings and disassembly for 1r instructions. llvm-svn: 170293	2012-12-16 17:37:34 +00:00
Richard Osborne	4209832082	Add XCore disassembler. Currently there is no instruction encoding info and XCoreDisassembler::getInstruction() always returns Fail. I intend to add instruction encodings and tests in follow on commits. llvm-svn: 170292	2012-12-16 17:29:14 +00:00
Richard Osborne	ab845841d3	Remove invalid instruction encodings. llvm-svn: 170291	2012-12-16 16:46:31 +00:00
Richard Osborne	a3ef745900	Mark anything deriving from PseudoInstXCore as a pseudo instruction. llvm-svn: 170290	2012-12-16 16:46:28 +00:00
Richard Osborne	b8e1ae4f91	Set instruction size correctly in XCoreInstrFormats.td llvm-svn: 170289	2012-12-16 16:46:24 +00:00
Richard Osborne	cf3638ba13	Change XCoreAsmPrinter to lower MachineInstrs to MCInsts before emission. This change adds XCoreMCInstLower to do the lowering to MCInst and XCoreInstPrinter to print the MCInsts. llvm-svn: 170288	2012-12-16 16:20:48 +00:00
Richard Osborne	514735415d	Replace ${:comment} with the comment symbol. llvm-svn: 170286	2012-12-16 15:59:02 +00:00
Dmitri Gribenko	79a7f51c9a	Declare class DwarfDebug before use instead of relying on a forward declaration from some other unrelated header. Patch by Kai. llvm-svn: 170284	2012-12-16 12:57:36 +00:00
NAKAMURA Takumi	327cb9bfbd	MCPureStreamer.cpp: Try to fix build, pruning EmitDebugLabel(). llvm-svn: 170280	2012-12-16 04:23:20 +00:00
Reed Kotler	7ee48929d5	This patch is needed to make c++ exceptions work for mips16. Mips16 is really a processor decoding mode (ala thumb 1) and in the same program, mips16 and mips32 functions can exist and can call each other. If a jal type instruction encounters an address with the lower bit set, then the processor switches to mips16 mode (if it is not already in it). If the lower bit is not set, then it switches to mips32 mode. The linker knows which functions are mips16 and which are mips32. When relocation is performed on code labels, this lower order bit is set if the code label is a mips16 code label. In general this works just fine, however when creating exception handling tables and dwarf, there are cases where you don't want this lower order bit added in. This has been traditionally distinguished in gas assembly source by using a different syntax for the label. lab1: ; this will cause the lower order bit to be added lab2=. ; this will not cause the lower order bit to be added In some cases, it does not matter because in dwarf and debug tables the difference of two labels is used and in that case the lower order bits subtract each other out. To fix this, I have added to mcstreamer the notion of a debuglabel. The default is for label and debug label to be the same. So calling EmitLabel and EmitDebugLabel produce the same result. For various reasons, there is only one set of labels that needs to be modified for the mips exceptions to work. These are the "$eh_func_beginXXX" labels. Mips overrides the debug label suffix from ":" to "=." . This initial patch fixes exceptions. More changes most likely will be needed to DwarfCFException to make all of this work for actual debugging. These changes will be to emit debug labels in some places where a simple label is emitted now. Some historical discussion on this from gcc can be found at: http://gcc.gnu.org/ml/gcc-patches/2008-08/msg00623.html http://gcc.gnu.org/ml/gcc-patches/2008-11/msg01273.html llvm-svn: 170279	2012-12-16 04:00:45 +00:00
Benjamin Kramer	8d191a2586	X86: Add a couple of target-specific dag combines that turn VSELECTS into psubus if possible. We match the pattern "x >= y ? x-y : 0" into "subus x, y" and two special cases if y is a constant. DAGCombiner canonicalizes those so we first have to undo the canonicalization for those cases. The pattern occurs in gzip when the loop vectorizer is enabled. Part of PR14613. llvm-svn: 170273	2012-12-15 16:47:44 +00:00
Chandler Carruth	1855fbe7b0	Add a corollary test for PR14572. We got this code path correct already. llvm-svn: 170271	2012-12-15 09:31:54 +00:00
Chandler Carruth	0fa6260bcf	Relax an overly aggressive assert to fix PR14572. The alloca width is based on the alloc size, not the type size. llvm-svn: 170270	2012-12-15 09:26:06 +00:00
Chandler Carruth	63f4e5f8b9	Make '-mtune=x86_64' assume fast unaligned memory accesses. Not all chips targeted by x86_64 have this feature, but a dramatically increasing number do. Specifying a chip-specific tuning parameter will continue to turn the feature on or off as appropriate for that particular chip, but the generic flag should try to achieve the best performance on the most widely available hardware. Today, the number of chips with fast UA access dwarfs those without in the x86-64 space. Note that this also brings LLVM's code generation for this '-march' flag more in line with that of modern GCCs. Reviewed by Dan Gohman. llvm-svn: 170269	2012-12-15 09:01:13 +00:00
Chandler Carruth	977cde06e9	Actually update the grammar of this sentence to reflect the removal of CellSPU. llvm-svn: 170268	2012-12-15 08:56:20 +00:00
NAKAMURA Takumi	7c233e129c	Revert r170246, "Enable the loop vectorizer by default." llvm-svn: 170267	2012-12-15 06:11:13 +00:00
Reed Kotler	18187e671e	This code implements most of mips16 hardfloat as it is done by gcc. In this case, essentially it is soft float with different library routines. The next step will be to make this fully interoperational with mips32 floating point and that requires creating stubs for functions with signatures that contain floating point types. I have a more sophisticated design for mips16 hardfloat which I hope to implement at a later time that directly does floating point without the need for function calls. The mips16 encoding has no floating point instructions so one needs to switch to mips32 mode to execute floating point instructions. llvm-svn: 170259	2012-12-15 00:20:05 +00:00
Eric Christopher	9e773fa45e	To simplify some code move the unit emission into the holders. Make emitDIE public accordingly. No functional change. llvm-svn: 170258	2012-12-15 00:04:07 +00:00
Eric Christopher	5ee6b38b4a	Use begin and end label names from the section for info. llvm-svn: 170257	2012-12-15 00:04:04 +00:00
Kevin Enderby	d775b42d99	Make sure the alternate PC+imm syntax of LDR instruction with a small immediate generates the narrow version. Needed when doing round-trip assemble/disassemble testing using the alternate syntax that specifies 'pc' directly. llvm-svn: 170255	2012-12-14 23:04:25 +00:00
Michael Ilseman	5cd248cc09	Add back FoldOpIntoPhi optimizations with fix. Included test cases to help catch these errors and to test the presence of the optimization itself llvm-svn: 170248	2012-12-14 22:08:26 +00:00
Nadav Rotem	80850ab50b	Enable the loop vectorizer by default. llvm-svn: 170246	2012-12-14 21:30:23 +00:00
Nadav Rotem	92f9e3ce2c	TypeLegalizer: Do not generate target specific nodes with illegal types, because we cant type-legalize them. llvm-svn: 170245	2012-12-14 21:20:37 +00:00
Duncan Sands	b0a9b453a5	Release notes for dragonegg 3.2. llvm-svn: 170243	2012-12-14 21:10:59 +00:00
Nadav Rotem	4e0ac4b552	Fix a crash in ValueTracking on vectors of pointers. llvm-svn: 170240	2012-12-14 20:43:49 +00:00
Bill Schmidt	ac9760fe81	This patch removes some nondeterminism from direct object file output for TLS dynamic models on 64-bit PowerPC ELF. The default sort routine for relocations only sorts on the r_offset field; but with TLS, there can be two relocations with the same r_offset. For PowerPC, this patch sorts secondarily on descending r_type, which matches the behavior expected by the linker. llvm-svn: 170237	2012-12-14 20:28:38 +00:00
Pedro Artigas	11d1271302	Add more reset methods to make all objects that the backend may use for outputting code have a reset, some are not used but were declared for completeness llvm-svn: 170227	2012-12-14 18:52:11 +00:00
Shuxin Yang	fc24901300	rdar://12753946 Implement rule : "x * (select cond 1.0, 0.0) -> select cond x, 0.0" llvm-svn: 170226	2012-12-14 18:46:06 +00:00
NAKAMURA Takumi	bdad03688b	[CMake] Move libxml2 stuff from clang to llvm/cmake. llvm-svn: 170225	2012-12-14 18:30:20 +00:00
Bill Schmidt	c895316923	This patch improves the 64-bit PowerPC InitialExec TLS support by providing for a wider range of GOT entries that can hold thread-relative offsets. This matches the behavior of GCC, which was not documented in the PPC64 TLS ABI. The ABI will be updated with the new code sequence. Former sequence: ld 9,x@got@tprel(2) add 9,9,x@tls New sequence: addis 9,2,x@got@tprel@ha ld 9,x@got@tprel@l(9) add 9,9,x@tls Note that a linker optimization exists to transform the new sequence into the shorter sequence when appropriate, by replacing the addis with a nop and modifying the base register and relocation type of the ld. llvm-svn: 170209	2012-12-14 17:02:38 +00:00

1 2 3 4 5 ...

87472 Commits