llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Chandler Carruth	bb1db0e66a	Cleanup and relax a restriction on the matching of global offsets into x86 addressing modes. This allows PIE-based TLS offsets to fit directly into an addressing mode immediate offset, which is the last remaining code quality issue from PR12380. With this patch, that PR is completely fixed. To understand why this patch is correct to match these offsets into addressing mode immediates, break it down by cases: 1) 32-bit is trivially correct, and unmodified here. 2) 64-bit non-small mode is unchanged and never matches. 3) 64-bit small PIC code which is RIP-relative is handled specially in the match to try to fit RIP into the base register. If it fails, it now early exits. This behavior is unchanged by the patch. 4) 64-bit small non-PIC code which is not RIP-relative continues to work as it did before. The reason these immediates are safe is because the ABI ensures they fit in small mode. This behavior is unchanged. 5) 64-bit small PIC code which is not using RIP-relative addressing. This is the only case changed by the patch, and the primary place you see it is in TLS, either the win64 section offset TLS or Linux local-exec TLS model in a PIC compilation. Here the ABI again ensures that the immediates fit because we are in small mode, and any other operations required due to the PIC relocation model have been handled externally to the Wrapper node (extra loads etc are made around the wrapper node in ISelLowering). I've tested this as much as I can comparing it with GCC's output, and everything appears safe. I discussed this with Anton and it made sense to him at least at face value. That said, if there are issues with PIC code after this patch, yell and we can revert it. llvm-svn: 154304	2012-04-09 02:13:06 +00:00
Chandler Carruth	11c412fd2c	Teach LLVM about a PIE option which, when enabled on top of PIC, makes optimizations which are valid for position independent code being linked into a single executable, but not for such code being linked into a shared library. I discussed the design of this with Eric Christopher, and the decision was to support an optional bit rather than a completely separate relocation model. Fundamentally, this is still PIC relocation, its just that certain optimizations are only valid under a PIC relocation model when the resulting code won't be in a shared library. The simplest path to here is to expose a single bit option in the TargetOptions. If folks have different/better designs, I'm all ears. =] I've included the first optimization based upon this: changing TLS models to the *Exec models when PIE is enabled. This is the LLVM component of PR12380 and is all of the hard work. llvm-svn: 154294	2012-04-08 17:51:45 +00:00
Chandler Carruth	233e7232ae	Move the TLSModel information into the TargetMachine rather than hiding in TargetLowering. There was already a FIXME about this location being odd. The interface is simplified as a consequence. This will also make it easier to change TLS models when compiling with PIE. llvm-svn: 154292	2012-04-08 17:20:55 +00:00
Nadav Rotem	8957364ae5	AVX2: Build splat vectors by broadcasting a scalar from the constant pool. Previously we used three instructions to broadcast an immediate value into a vector register. On Sandybridge we continue to load the broadcasted value from the constant pool. llvm-svn: 154284	2012-04-08 12:54:54 +00:00
Craig Topper	a6412fb8c0	Turn avx2 vinserti128 intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove patterns for selecting the intrinsic. Similar was already done for avx1. llvm-svn: 154272	2012-04-07 22:32:29 +00:00
Craig Topper	1ddf62dc2c	Move vinsertf128 patterns near the instruction definitions. Add AddedComplexity to AVX2 vextracti128 patterns to give them priority over the integer versions of vextractf128 patterns. llvm-svn: 154268	2012-04-07 21:57:43 +00:00
Bob Wilson	059dbb715f	Fix Thumb __builtin_longjmp with integrated assembler. <rdar://problem/11203543> The tLDRr instruction with the last register operand set to the zero register prints in assembly as if no register was specified, and the assembler encodes it as a tLDRi instruction with a zero immediate. With the integrated assembler, that zero register gets emitted as "r0", so we get "ldr rx, [ry, r0]" which is broken. Emit the instruction as tLDRi with a zero immediate. I don't know if there's a good way to write a testcase for this. Suggestions welcome. Opportunities for follow-up work: 1) The asm printer should complain if a non-optional register operand is set to the zero register, instead of silently dropping it. 2) The integrated assembler should complain in the same situation, instead of silently emitting the operand as "r0". llvm-svn: 154261	2012-04-07 16:51:59 +00:00
NAKAMURA Takumi	1bfc716b7d	Target/X86/MCTargetDesc/X86MCAsmInfo.cpp: Enable DwarfCFI (aka DW2) on Cygming. Cygwin-1.7 supports dw2. Some recent mingw distros support one, too. I have confirmed test-suite/SingleSource/Benchmarks/Shootout-C++/except.cpp can pass on Cygwin. llvm-svn: 154247	2012-04-07 02:24:20 +00:00
Alexis Hunt	5c14769849	Output UTF-8-encoded characters as identifier characters into assembly by default. This is a behaviour configurable in the MCAsmInfo. I've decided to turn it on by default in (possibly optimistic) hopes that most assemblers are reasonably sane. If this proves a problem, switching to default seems reasonable. I'm not sure if this is the opportune place to test, but it seemed good to make sure it was tested somewhere. llvm-svn: 154235	2012-04-07 00:37:53 +00:00
Jim Grosbach	249356cbf3	Tidy up. 80 columns. llvm-svn: 154226	2012-04-06 23:43:50 +00:00
Jakob Stoklund Olesen	446611ae2a	ARMPat is equivalent to Requires<[IsARM]>. llvm-svn: 154210	2012-04-06 21:21:59 +00:00
Jakob Stoklund Olesen	ce15da8935	Eliminate iOS-specific tail call instructions. After register masks were introdruced to represent the call clobbers, it is no longer necessary to have duplicate instruction for iOS. llvm-svn: 154209	2012-04-06 21:17:42 +00:00
Chandler Carruth	55fe352a8c	There is no portable std::abs overload for int64_t, use the llvm::abs64 which exists for this purpose. llvm-svn: 154199	2012-04-06 20:10:52 +00:00
Jakob Stoklund Olesen	bb7b631def	Allow negative immediates in ARM and Thumb2 compares. ARM and Thumb2 mode can use cmn instructions to compare against negative immediates. Thumb1 mode can't. llvm-svn: 154183	2012-04-06 17:45:04 +00:00
Benjamin Kramer	103f74e9f8	Fix narrowing conversion. llvm-svn: 154171	2012-04-06 13:33:52 +00:00
Craig Topper	ffae2f8986	Allow 256-bit shuffles to be split if a 128-bit lane contains elements from a single source. This is a rewrite of the 256-bit shuffle splitting code based on similar code from legalize types. Fixes PR12413. llvm-svn: 154166	2012-04-06 07:45:23 +00:00
Jakob Stoklund Olesen	96c573a6c4	Deduplicate ARM call-related instructions. We had special instructions for iOS because r9 is call-clobbered, but that is represented dynamically by the register mask operands now, so there is no need for the pseudo-instructions. llvm-svn: 154144	2012-04-06 00:04:58 +00:00
Jim Grosbach	e1c687cc0a	ARM: Don't form a t2LDRi8 or t2STRi8 with an offset of zero. The load/store optimizer splits LDRD/STRD into two instructions when the register pairing doesn't work out. For negative offsets in Thumb2, it uses t2STRi8 to do that. That's fine, except for the case when the offset is in the range [-4,-1]. In that case, we'll also form a second t2STRi8 with the original offset plus 4, resulting in a t2STRi8 with a non-negative offset, which ends up as if it were an STRT, which is completely bogus. Similarly for loads. No testcase, unfortunately, as any I've been able to construct is both large and extremely fragile. rdar://11193937 llvm-svn: 154141	2012-04-05 23:51:24 +00:00
Jim Grosbach	2169e1d55c	ARM assembly aliases for add negative immediates using sub. 'add r2, #-1024' should just use 'sub r2, #1024' rather than erroring out. Thumb1 aliases for adding a negative immediate to the stack pointer, also. rdar://11192734 llvm-svn: 154123	2012-04-05 20:57:13 +00:00
Silviu Baranga	f376e00699	Added support for unpredictable ADC/SBC instructions on ARM, and also fixed some corner cases involving the PC register as an operand for these instructions. llvm-svn: 154101	2012-04-05 16:19:29 +00:00
Silviu Baranga	1c2668f700	Added support for handling unpredictable arithmetic instructions on ARM. llvm-svn: 154100	2012-04-05 16:13:15 +00:00
Jim Grosbach	5d11d38750	ARM assembly aliases for two-operand V[R]SHR instructions. rdar://11189467 llvm-svn: 154087	2012-04-05 07:23:53 +00:00
Jim Grosbach	64f4e8d5b3	ARM assembly parsing for 'msr' plain 'cpsr' operand. Plain 'cpsr' is an alias for 'cpsr_fc'. rdar://11153753 llvm-svn: 154080	2012-04-05 03:17:53 +00:00
Akira Hatanaka	e5ea70212f	Reapply 154038 without the failing test. llvm-svn: 154062	2012-04-04 22:16:36 +00:00
Owen Anderson	f6f930a990	Revert r154038. It was causing make check failures. llvm-svn: 154054	2012-04-04 21:18:58 +00:00
Akira Hatanaka	4df2267566	Fix LowerGlobalAddress to produce instructions with the correct relocation types for N32 ABI. Add new test case and update existing ones. llvm-svn: 154038	2012-04-04 19:02:38 +00:00
Akira Hatanaka	f9e02ac6e1	Fix LowerJumpTable to produce instructions with the correct relocation types for N32 ABI. Test case will be updated after the patch that fixes TargetLowering::getPICJumpTableRelocBase is checked in. llvm-svn: 154036	2012-04-04 18:31:32 +00:00
Akira Hatanaka	c8028e2551	Fix LowerConstantPool to produce instructions with the correct relocation types for N32 ABI and update test case. llvm-svn: 154034	2012-04-04 18:26:12 +00:00
Jakob Stoklund Olesen	0419ed395c	Implement ARMBaseInstrInfo::commuteInstruction() for MOVCCr. A MOVCCr instruction can be commuted by inverting the condition. This can help reduce register pressure and remove unnecessary copies in some cases. <rdar://problem/11182914> llvm-svn: 154033	2012-04-04 18:23:42 +00:00
Akira Hatanaka	913d78a99c	Fix LowerBlockAddress to produce instructions with the correct relocation types for N32 ABI and update test case. llvm-svn: 154031	2012-04-04 18:22:53 +00:00
Rafael Espindola	88a1aeb123	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Dylan Noblesmith	8ab4926be7	ARMDisassembler: drop bogus dependency on ARMCodeGen And indirectly, a dependency on most of the core LLVM optimization libraries. llvm-svn: 153957	2012-04-03 15:48:14 +00:00
Anton Korobeynikov	e70c37c738	Make PPCCompilationCallbackC function to be static, so there will be no need to issue call via PLT when LLVM is built as shared library. This mimics the X86 backend towards the approach. llvm-svn: 153938	2012-04-03 06:59:28 +00:00
Craig Topper	ce6c05e0df	Add support for AVX enhanced comparison predicates. Patch from Kay Tiong Khoo. llvm-svn: 153935	2012-04-03 05:20:24 +00:00
Akira Hatanaka	c5bbe0b434	Revert r153924. Delete test/MC/Disassembler/Mips and lib/Target/Mips/Disassembler. llvm-svn: 153926	2012-04-03 03:01:13 +00:00
Akira Hatanaka	cecb440c11	Revert r153924. There were buildbot failures. llvm-svn: 153925	2012-04-03 02:51:09 +00:00
Akira Hatanaka	058b0cfb55	MIPS disassembler support. Patch by Vladimir Medic. llvm-svn: 153924	2012-04-03 02:20:58 +00:00
Akira Hatanaka	f37a1c4323	Initial 64 bit direct object support. This patch allows llvm to recognize that a 64 bit object file is being produced and that the subsequently generated ELF header has the correct information. The test case checks for both big and little endian flavors. Patch by Jack Carter. llvm-svn: 153889	2012-04-02 19:25:22 +00:00
Hal Finkel	63edfabaaf	The binutils for the IBM BG/P are too old to support CFI. llvm-svn: 153886	2012-04-02 19:09:04 +00:00
Roman Divacky	2460282f66	Implement the SVR4 byval alignment for aggregates. Fixing a FIXME. llvm-svn: 153876	2012-04-02 15:49:30 +00:00
Benjamin Kramer	2f6189e2a5	Move getOpcodeName from the various target InstPrinters into the superclass MCInstPrinter. All implementations used the same code. llvm-svn: 153866	2012-04-02 08:32:38 +00:00
Craig Topper	fe02cb5e8b	Remove getInstructionName from MCInstPrinter implementations in favor of using the instruction name table from MCInstrInfo. Reduces static data in the InstPrinter implementations. llvm-svn: 153863	2012-04-02 07:01:04 +00:00
Craig Topper	dbc259a436	Make MCInstrInfo available to the MCInstPrinter. This will be used to remove getInstructionName and the static data it contains since the same tables are already in MCInstrInfo. llvm-svn: 153860	2012-04-02 06:09:36 +00:00
Hal Finkel	e54b93886a	Fix some 80-col. violations I introduced with the A2 PPC64 core. llvm-svn: 153852	2012-04-01 21:20:14 +00:00
Hal Finkel	1c045f6845	Enable prefetch generation on PPC64. llvm-svn: 153851	2012-04-01 20:08:17 +00:00
Hal Finkel	415234aaa4	Add LdStSTD* itin. for the PPC64 A2 core. llvm-svn: 153850	2012-04-01 20:08:08 +00:00
Nadav Rotem	2729f54295	This commit contains a few changes that had to go in together. 1. Simplify xor/and/or (bitcast(A), bitcast(B)) -> bitcast(op (A,B)) (and also scalar_to_vector). 2. Xor/and/or are indifferent to the swizzle operation (shuffle of one src). Simplify xor/and/or (shuff(A), shuff(B)) -> shuff(op (A, B)) 3. Optimize swizzles of shuffles: shuff(shuff(x, y), undef) -> shuff(x, y). 4. Fix an X86ISelLowering optimization which was very bitcast-sensitive. Code which was previously compiled to this: movd (%rsi), %xmm0 movdqa .LCPI0_0(%rip), %xmm2 pshufb %xmm2, %xmm0 movd (%rdi), %xmm1 pshufb %xmm2, %xmm1 pxor %xmm0, %xmm1 pshufb .LCPI0_1(%rip), %xmm1 movd %xmm1, (%rdi) ret Now compiles to this: movl (%rsi), %eax xorl %eax, (%rdi) ret llvm-svn: 153848	2012-04-01 19:31:22 +00:00
Hal Finkel	ff17f29a1f	Set the default PPC node scheduling preference to ILP (for the embedded cores). The 440 and A2 cores have detailed itineraries, and this allows them to be fully used to maximize throughput. llvm-svn: 153845	2012-04-01 19:23:08 +00:00
Hal Finkel	71772b9747	Add ppc440 itin. entries for LdStSTD* llvm-svn: 153844	2012-04-01 19:23:04 +00:00
Hal Finkel	f74994d731	Use full anti-dep. breaking with post-ra sched. on the embedded ppc cores. Post-RA scheduling gives a significant performance improvement on the embedded cores, so turn it on. Using full anti-dep. breaking is important for FP-intensive blocks, so turn it on (just on the embedded cores for now; this should also be good on the 970s because post-ra scheduling is all that we have for now, but that should have more testing first). llvm-svn: 153843	2012-04-01 19:22:57 +00:00
Hal Finkel	fd26145bc6	Add instruction itinerary for the PPC64 A2 core. This adds a full itinerary for IBM's PPC64 A2 embedded core. These cores form the basis for the CPUs in the new IBM BG/Q supercomputer. llvm-svn: 153842	2012-04-01 19:22:40 +00:00
Hal Finkel	42a487282a	Split the LdStGeneral PPC itin. class into LdStLoad and LdStStore. Loads and stores can have different pipeline behavior, especially on embedded chips. This change allows those differences to be expressed. Except for the 440 scheduler, there are no functionality changes. On the 440, the latency adjustment is only by one cycle, and so this probably does not affect much. Nevertheless, it will make a larger difference in the future and this removes a FIXME from the 440 itin. llvm-svn: 153821	2012-04-01 04:44:16 +00:00
Hal Finkel	548d6f1ad0	Fix dynamic linking on PPC64. Dynamic linking on PPC64 has had problems since we had to move the top-down hazard-detection logic post-ra. For dynamic linking to work there needs to be a nop placed after every call. It turns out that it is really hard to guarantee that nothing will be placed in between the call (bl) and the nop during post-ra scheduling. Previous attempts at fixing this by placing logic inside the hazard detector only partially worked. This is now fixed in a different way: call+nop codegen-only instructions. As far as CodeGen is concerned the pair is now a single instruction and cannot be split. This solution works much better than previous attempts. The scoreboard hazard detector is also renamed to be more generic, there is currently no cpu-specific logic in it. llvm-svn: 153816	2012-03-31 14:45:15 +00:00
Akira Hatanaka	4ef4aae332	Select static relocation model if it is jitting. llvm-svn: 153795	2012-03-31 02:38:36 +00:00
Jakob Stoklund Olesen	728984c476	Add a 2 byte safety margin in offset computations. ARMConstantIslandPass still has bugs where jump table compression can cause constant pool entries to go out of range. Add a safety margin of 2 bytes when placing constant islands, but use the real max displacement for verification. <rdar://problem/11156595> llvm-svn: 153789	2012-03-31 00:06:44 +00:00
Jakob Stoklund Olesen	91f86a31e7	Add more debugging output to ARMConstantIslandPass. llvm-svn: 153788	2012-03-31 00:06:42 +00:00
Benjamin Kramer	dbd6a33c45	Rip out emission of the regIsInRegClass function for the asm printer. It's slow, bloated and completely redundant with MCRegisterClass::contains. llvm-svn: 153782	2012-03-30 23:13:40 +00:00
Jim Grosbach	ab2d3b5529	ARM fix encoding fixup resolution for ldrd and friends. The 8-bit payload is not contiguous in the opcode. Move the upper nibble over 4 bits into the correct place. rdar://11158641 llvm-svn: 153780	2012-03-30 21:54:22 +00:00
Jim Grosbach	37853d6216	ARM assembler should prefer non-aliases encoding of cmp. When an immediate is both a value [t2_]so_imm and a [t2_]so_imm_neg, we want to use the non-negated form to make sure we prefer the normal encoding, not the aliased encoding via the negation of, e.g., 'cmp.w'. llvm-svn: 153770	2012-03-30 19:59:02 +00:00
Jim Grosbach	92ee2a8454	ARM encoding for VSWP got the second operand incorrect. Make the non-tied register operand names line up with what the base class encoding handler expects. rdar://11157236 llvm-svn: 153766	2012-03-30 18:53:01 +00:00
Jim Grosbach	472cefe371	ARM can only use narrow encoding for low regs. llvm-svn: 153765	2012-03-30 18:39:43 +00:00
Jim Grosbach	2536615bab	ARM integrated assembler should encoding choice for add/sub imm. For 'adds r2, r2, #56' outside of an IT block, the 16-bit encoding T2 can be used for this syntax. Prefer the narrow encoding when possible. rdar://11156277 llvm-svn: 153759	2012-03-30 17:20:40 +00:00
Jim Grosbach	9b185a753c	ARM assembly parsing needs to be paranoid about negative immediates. Make sure to treat immediates as unsigned when doing relative comparisons. rdar://11153621 llvm-svn: 153753	2012-03-30 16:31:31 +00:00
Benjamin Kramer	0365dc97a8	Add a note about a missed cmov -> sbb opportunity. llvm-svn: 153741	2012-03-30 13:02:58 +00:00
James Molloy	70a6f5ebc7	Ensure conditional BL instructions for ARM are given the fixup fixup_arm_condbranch. Patch by Tim Northover! llvm-svn: 153737	2012-03-30 09:15:32 +00:00
Evan Cheng	f3c23907f5	ARM target should allow codegenprep to duplicate ret instructions to enable tailcall opt. rdar://11140249 llvm-svn: 153717	2012-03-30 01:24:39 +00:00
Jakob Stoklund Olesen	8fe088c0ee	Invalidate liveness in ARMConstantIslandPass. This pass splits basic blocks to insert constant islands, and it doesn't recompute the live-in lists. No later passes depend on accurate liveness information. This fixes PR12410 where the machine code verifier was complaining. llvm-svn: 153700	2012-03-29 23:14:26 +00:00
Jakob Stoklund Olesen	d9c6469e9a	Prefer even-odd D-register pairs. We are sometimes allocatinog from the DPair register class which contains odd-even pairs in addition to the Q registers. Place the Q registers first in the DPair allocation order as they can be copied with a single instruction. The odd-even pairs should only be allocated as a last resort. llvm-svn: 153699	2012-03-29 22:54:32 +00:00
Lang Hames	1a0d0ec699	Try using vmov.i32 to materialize FP32 constants that can't be materialized by vmov.f32. llvm-svn: 153696	2012-03-29 21:56:11 +00:00
Jim Grosbach	ab639b8c36	ARM assembly 'cmp lr, #0 ' should not encode using 'cmn'. The CMP->CMN alias was matching for an immediate of zero when it should only match for negative values. rdar://11129224 llvm-svn: 153689	2012-03-29 21:19:52 +00:00
Jakob Stoklund Olesen	2cbfc41270	Handle register copies for the new ARM register classes. ARM recently gained DPair, DTriple, and DQuad register classes. Update copyPhysReg() to handle copies in these register classes. No test case, it is difficult to make the register allocator emit the odd copies reliably. The missing DPair copy caused a failure on partialsums in the nightly test suite. <rdar://problem/11147997> llvm-svn: 153686	2012-03-29 21:10:40 +00:00
Lang Hames	94d892c492	Make x86 REP_MOV* and REP_STO instructions use the correct operand sizes in 64-bit mode. llvm-svn: 153680	2012-03-29 19:54:28 +00:00
Akira Hatanaka	fa2f5577e9	Expand FREM. llvm-svn: 153671	2012-03-29 18:43:11 +00:00
Benjamin Kramer	e3b0c81c27	Replace assert(0) with llvm_unreachable to avoid warnings about dropping off the end of a non-void function in Release builds. llvm-svn: 153643	2012-03-29 12:37:26 +00:00
Craig Topper	9a00ba461c	Only allow symbolic names for (v)cmpss/sd/ps/pd encodings 8-31 to be used with 'v' version of instructions. llvm-svn: 153636	2012-03-29 07:11:23 +00:00
Joel Jones	486c38b0cf	For X86, change load/dec-or-inc/store into dec-or-inc, respectively. This is a code change to add support for changing instruction sequences of the form: load inc/dec of 8/16/32/64 bits store into the appropriate X86 inc/dec through memory instruction: inc[qlwb] / dec[qlwb] The checks that were in X86DAGToDAGISel::Select(SDNode *Node)>>ISD::STORE have been extracted to isLoadIncOrDecStore and reworked to use the better named wrappers for getOperand(unsigned) (e.g. getOffset()) and replaced Chain.getNode() with LoadNode. The comments have also been expanded. llvm-svn: 153635	2012-03-29 05:45:48 +00:00
Joel Jones	32f97db4b2	Reverted to revision 153616 to unblock build llvm-svn: 153623	2012-03-29 01:20:56 +00:00
Joel Jones	b4477ee31f	For X86, change load/dec-or-inc/store into dec-or-inc, respectively. This is a code change to add support for changing instruction sequences of the form: load inc/dec of 8/16/32/64 bits store into the appropriate X86 inc/dec through memory instruction: inc[qlwb] / dec[qlwb] The checks that were in X86DAGToDAGISel::Select(SDNode *Node)>>ISD::STORE have been extracted to isLoadIncOrDecStore and reworked to use the better named wrappers for getOperand(unsigned) (e.g. getOffset()) and replaced Chain.getNode() with LoadNode. The comments have also been expanded. llvm-svn: 153617	2012-03-29 00:37:47 +00:00
Jakob Stoklund Olesen	753b1e33e0	Enable machine code verification in the entire code generator. Some targets still mess up the liveness information, but that isn't verified after MRI->invalidateLiveness(). The verifier can still check other useful things like register classes and CFG, so it should be enabled after all passes. llvm-svn: 153615	2012-03-28 23:54:28 +00:00
Jakob Stoklund Olesen	e6574db283	Don't kill the base register when expanding strd. When an strd instruction doesn't get the registers it wants, it can be expanded into two str instructions. Make sure the first str doesn't kill the base register in the case where the base and data registers are identical: t2STRi12 %R0<kill>, %R0, 4, pred:14, pred:%noreg t2STRi12 %R2<kill>, %R0, 8, pred:14, pred:%noreg <rdar://problem/11101911> llvm-svn: 153611	2012-03-28 23:07:03 +00:00
Jakob Stoklund Olesen	ebee7e5cff	Preserve implicit defs in ARMLoadStoreOptimizer. When a number of sub-register VLRDS instructions are combined into a VLDM, preserve any super-register implicit defs. This is required to keep the register scavenger and machine code verifier happy. Enable machine code verification after ARMLoadStoreOptimizer. ARM/2012-01-26-CopyPropKills.ll was failing because of this. llvm-svn: 153610	2012-03-28 22:50:56 +00:00
Jakob Stoklund Olesen	7623979dd6	Spill DPair registers, not just QPR. The arm_neon intrinsics can create virtual registers from the DPair register class which allows both even-odd and odd-even D-register pairs. This fixes PR12389. llvm-svn: 153603	2012-03-28 21:20:32 +00:00
Jakob Stoklund Olesen	2c29e5d7f9	Revert r153516: "Invalidate liveness in Thumb2ITBlockPass." Revert r153519: "ARMLoadStoreOptimizer invalidates register liveness." These patches caused miscompilations in povray by turning off branch folding's updating of live-in lists. It turns out the the late scheduler depends on the live-in lists, even if it doesn't need correct kill flags. <rdar://problem/11139228> llvm-svn: 153593	2012-03-28 20:11:44 +00:00
Benjamin Kramer	b3055f03e1	Add another note about a missed compare with nsw arithmetic instcombine. llvm-svn: 153574	2012-03-28 10:50:18 +00:00
Richard Barton	201661d4bc	Fixup VST1.32 with writeback instruction. Also re-factor non-writeback version. llvm-svn: 153573	2012-03-28 10:18:11 +00:00
Akira Hatanaka	ef68ecdecc	Turn off post-RA scheduler by default. llvm-svn: 153557	2012-03-28 00:52:23 +00:00
Akira Hatanaka	d2ce66b138	Turn on post register allocation scheduler. llvm-svn: 153554	2012-03-28 00:24:17 +00:00
Akira Hatanaka	fcd8108a1d	Sort relocation entries before they are written out to a file. MIPS ABI imposes a constraint that GOT16 referring to a local symbol or HI16 has to be followed immediately by a matching LO16 relocation. llvm-svn: 153553	2012-03-28 00:23:33 +00:00
Akira Hatanaka	a495c4cbaf	Emit all directives except for ".cprestore" during asm printing rather than emit them as machine instructions. Directives ".set noat" and ".set at" are now emitted only at the beginning and end of a function except in the case where they are emitted to enclose .cpload with an immediate operand that doesn't fit in 16-bit field or unaligned load/stores. Also, make the following changes: - Remove function isUnalignedLoadStore and use a switch-case statement to determine whether an instruction is an unaligned load or store. - Define helper function CreateMCInst which generates an instance of an MCInst from an opcode and a list of operands. llvm-svn: 153552	2012-03-28 00:22:50 +00:00
Akira Hatanaka	8f8ee2351c	Mark flag neverHasSideEffects of pattern-less instructions that do not have any side effects. llvm-svn: 153551	2012-03-28 00:21:37 +00:00
Benjamin Kramer	4be5a4bca4	Add a note about a cute little fabs optimization. llvm-svn: 153543	2012-03-27 22:42:42 +00:00
Benjamin Kramer	cc6159f7d9	Add two missed instcombines related to compares with nsw arithmetic. llvm-svn: 153542	2012-03-27 22:03:19 +00:00
Akira Hatanaka	3853dcca58	Remove trailing white space. llvm-svn: 153536	2012-03-27 20:35:51 +00:00
Akira Hatanaka	0a613888c3	Add member EmitNOAT and its setter and getter functions to class MipsFunctionInfo. If EmitNOAT is true, directives ".set noat" and ".set at" are emitted at the beginning and end of a function. llvm-svn: 153528	2012-03-27 19:08:42 +00:00
Jakob Stoklund Olesen	48d7c2b088	ARMLoadStoreOptimizer invalidates register liveness. This pass tries to update kill flags, but there are still many bugs. Passes after the load/store optimizer don't need accurate liveness, so don't even try. <rdar://problem/11101911> llvm-svn: 153519	2012-03-27 17:33:52 +00:00
Jakob Stoklund Olesen	1f52931c1b	Invalidate liveness in Thumb2ITBlockPass. llvm-svn: 153516	2012-03-27 17:06:06 +00:00
Craig Topper	bf6a47d0ec	Prune some includes llvm-svn: 153502	2012-03-27 07:54:11 +00:00
Craig Topper	6bb276ae72	Remove unnecessary llvm:: qualifications llvm-svn: 153500	2012-03-27 07:21:54 +00:00
Akira Hatanaka	e41c1a7f91	Pass the llvm IR pointer value and offset to the constructor of MachinePointerInfo when getStore is called to create a node that stores an argument passed in register to the stack. Without this change, the post RA scheduler will fail to discover the dependencies between the stores instructions and the instructions that load from a structure passed by value. The link to the related discussion is here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-March/048055.html llvm-svn: 153499	2012-03-27 03:13:56 +00:00
Akira Hatanaka	55149b37e0	Fix bug in LowerConstantPool. llvm-svn: 153498	2012-03-27 02:55:31 +00:00
Akira Hatanaka	1ae82e61f1	Add T9 to the list of live-in registers of the entry basic block. llvm-svn: 153497	2012-03-27 02:46:25 +00:00
Akira Hatanaka	ccaa02d6ed	Retrieve and add the offset of a symbol in applyFixup rather than retrieve and set it in MipsMCCodeEmitter::getMachineOpValue. Assert in getMachineOpValue if MachineOperand MO is of an unexpected type. llvm-svn: 153494	2012-03-27 02:33:05 +00:00
Akira Hatanaka	b286eb94cf	Define function MipsGetSymAndOffset which returns a fixup's symbol and the offset applied to it. llvm-svn: 153493	2012-03-27 02:04:18 +00:00
Akira Hatanaka	88d8bcdbea	Rewrite computation of Value in adjustFixupValue so that the upper 48-bits are cleared. No functionality change. llvm-svn: 153491	2012-03-27 01:50:08 +00:00
Akira Hatanaka	57250eff2a	Reserve hardware registers. llvm-svn: 153486	2012-03-27 00:40:56 +00:00
Evan Cheng	72cf12e416	ARM has a peephole optimization which looks for a def / use pair. The def produces a 32-bit immediate which is consumed by the use. It tries to fold the immediate by breaking it into two parts and fold them into the immmediate fields of two uses. e.g movw r2, #40885 movt r3, #46540 add r0, r0, r3 => add.w r0, r0, #3019898880 add.w r0, r0, #30146560 ; However, this transformation is incorrect if the user produces a flag. e.g. movw r2, #40885 movt r3, #46540 adds r0, r0, r3 => add.w r0, r0, #3019898880 adds.w r0, r0, #30146560 Note the adds.w may not set the carry flag even if the original sequence would. rdar://11116189 llvm-svn: 153484	2012-03-26 23:31:00 +00:00
Craig Topper	76f7896f49	Prune some includes and forward declarations. llvm-svn: 153429	2012-03-26 06:58:25 +00:00
Craig Topper	bdc6d71a24	Prune includes and replace uses of ARMRegisterInfo.h with ARMBaeRegisterInfo.h llvm-svn: 153422	2012-03-26 00:45:15 +00:00
Craig Topper	519813bca2	Replace uses of ARMBaseInstrInfo and ARMTargetMachine with the Base versions. llvm-svn: 153421	2012-03-25 23:49:58 +00:00
Craig Topper	95a56ca1da	Prune some includes and forward declarations. llvm-svn: 153415	2012-03-25 18:10:17 +00:00
Hal Finkel	30d4df9f6d	Fix small-integer VAARG on SVR4 ABI PPC64. The PPC64 SVR4 ABI requires integer stack arguments, and thus the var. args., that are smaller than 64 bits be zero extended to 64 bits. llvm-svn: 153373	2012-03-24 03:53:55 +00:00
Justin Holewinski	715c9b53b3	PTX: Fix predicate logic bug Code such as: %vreg100 = setcc %vreg10, -1, SETNE brcond %vreg10, %tgt was being incorrectly morphed into %vreg100 = and %vreg10, 1 brcond %vreg10, %tgt where the 'and' instruction could be eliminated since such logic is on 1-bit types in the PTX back-end, leaving us with just: brcond %vreg10, %tgt which essentially gives us inverted branch conditions. llvm-svn: 153364	2012-03-24 01:23:20 +00:00
Jim Grosbach	aefb0f06a9	ARM tidy up ARMConstantIsland.cpp. No functional change, just tidy up the code and nomenclature a bit. llvm-svn: 153347	2012-03-23 23:07:03 +00:00
Benjamin Kramer	a41ae12bd5	Include cstdio in a few place that depended on getting it transitively through StringExtras.h llvm-svn: 153328	2012-03-23 11:35:30 +00:00
Benjamin Kramer	01e4003c0f	Move ftostr into its last user (cppbackend) and simplify it a bit. New code should use raw_ostream. llvm-svn: 153326	2012-03-23 11:26:29 +00:00
Eric Christopher	3839c1ffd3	Remove the C backend. llvm-svn: 153307	2012-03-23 05:50:46 +00:00
Silviu Baranga	d197baa066	Added soft fail checks for the disassembler when decoding some corner cases of the STRD, STRH, LDRD, LDRH, LDRSH and LDRSB instructions on ARM. llvm-svn: 153252	2012-03-22 14:14:49 +00:00
Silviu Baranga	7bdfb9e34d	Added soft fail cases for the disassembler when decoding LDRSBT, LDRHT or LDRSHT instruction on ARM llvm-svn: 153251	2012-03-22 13:24:43 +00:00
Silviu Baranga	c03971d4b1	Added soft fail cases for the disassembler when decoding MUL instructions on ARM. llvm-svn: 153250	2012-03-22 13:14:39 +00:00
Craig Topper	a2a674effc	Remove some unnecessary forward declarations. llvm-svn: 153245	2012-03-22 06:52:14 +00:00
Hal Finkel	84b247a2ef	PPC::DBG_VALUE must use Reg+Imm frame-index elimination even for large offsets. Fixes PR12203. I don't have a small test case yet, but I'll try to construct one. llvm-svn: 153240	2012-03-22 05:28:19 +00:00
Kevin Enderby	e64335b34a	Fix ARM disassembly of VST1 and VST2 instructions with writeback. And add test case for all opcodes handed by DecodeVSTInstruction() in ARMDisassembler.cpp . llvm-svn: 153218	2012-03-21 20:54:32 +00:00
Joerg Sonnenberger	4df2738e5f	Put Is64BitMemOperand into !defined(NDEBUG) for now. llvm-svn: 153185	2012-03-21 14:09:26 +00:00
Benjamin Kramer	ad9527ea4c	Use a signed value for this enum to avoid spuriuos warnings from gcc. llvm-svn: 153184	2012-03-21 13:48:11 +00:00
Joerg Sonnenberger	82af1c8704	Fix generation of the address size override prefix. Add assertions for the invalid cases. At least 16bit operand in 64bit mode is currently not rejected in the parser. llvm-svn: 153166	2012-03-21 05:48:07 +00:00
Craig Topper	ee63bb1ffd	Add typecast to silence -Wswitch warning introduced by r153153. llvm-svn: 153155	2012-03-21 02:28:53 +00:00
Craig Topper	32b2c8fecc	Spacing fixes and using 'unsigned' instead of 'int' to index to select shuffle elements for consistency with other shuffle code in X86 backend. llvm-svn: 153154	2012-03-21 02:14:01 +00:00
Akira Hatanaka	cfaddf5c18	Incremental big endian patch by Jack Carter. These changes allow us to compile big endian from the command line for 32 bit Mips targets. This patch will result in code and data actually being produced in the correct endianess. llvm-svn: 153153	2012-03-21 00:52:01 +00:00
Chad Rosier	17f25ea47b	[avx] Add patterns for combining vextractf128 + vmovaps/vmovups/vmobdqu to vextractf128 with 128-bit mem dest. Combines vextractf128 $0, %ymm0, %xmm0 vmovaps %xmm0, (%rdi) to vextractf128 $0, %ymm0, (%rdi) rdar://11082570 llvm-svn: 153139	2012-03-20 21:43:40 +00:00
Evan Cheng	4dfb298aac	Change conditional instructions definitions, e.g. ANDCC, ARMPseudoExpand and t2PseudoExpand. llvm-svn: 153135	2012-03-20 21:28:05 +00:00
Matt Beaumont-Gay	0702af872e	remove unused variable llvm-svn: 153116	2012-03-20 19:52:05 +00:00
Chad Rosier	f6d522341c	[avx] Add the AddedComplexity to the VINSERTI128 avx2 patterns to give precedence over the VINSERTF128 avx1 patterns. llvm-svn: 153114	2012-03-20 19:45:07 +00:00
Bob Wilson	52b3ad9532	Require a base pointer for stack realignment when SP may vary dynamically. ARMBaseRegisterInfo::canRealignStack was checking for variable-sized objects but not for stack adjustments around calls. Use hasReservedCallFrame() to check for both. The hasBasePointer function was already correctly checking both conditions, so the effect of this was that a base pointer would be used without checking whether the base pointer register could be reserved. I don't have a small testcase for this. <rdar://problem/11075906> llvm-svn: 153110	2012-03-20 19:28:25 +00:00
Bob Wilson	4fb4d4c6e0	Remove some redundant checks. ARMFrameLowering::hasReservedCallFrame is already checking for variable sized objects, so there's no point in checking it twice. llvm-svn: 153109	2012-03-20 19:28:22 +00:00
Chad Rosier	73d8191b27	Whitespace. llvm-svn: 153105	2012-03-20 18:38:33 +00:00
Chad Rosier	ffd2dbd676	[avx] Move the vextractf128 patterns closer to the vextractf128 def. Remove whitespace from test case. No functional change intended. llvm-svn: 153103	2012-03-20 18:24:55 +00:00
Kevin Enderby	b87e1e0bfd	Fix assembling ARM vst2 instructions with double-spaced registers. llvm-svn: 153099	2012-03-20 17:41:51 +00:00
Jim Grosbach	b562c4f2fa	ARM non-scattered MachO relocations for movw/movt. Needed when building -mdynamic-no-pic code. rdar://10459256 llvm-svn: 153097	2012-03-20 17:25:45 +00:00
Chad Rosier	143f33dc92	[avx] Adjust the VINSERTF128rm pattern to allow for unaligned loads. This results in things such as vmovups 16(%rdi), %xmm0 vinsertf128 $1, %xmm0, %ymm0, %ymm0 to be combined to vinsertf128 $1, 16(%rdi), %ymm0, %ymm0 rdar://11076953 llvm-svn: 153092	2012-03-20 17:08:51 +00:00
Silviu Baranga	d20ed770e5	The ARM instructions that have an unpredictable behavior when the pc register operand is given now fail with soft fail. Modified the regression tests to reflect this. llvm-svn: 153089	2012-03-20 15:54:56 +00:00
Richard Barton	25f44c807b	Test Commit - add a newline llvm-svn: 153083	2012-03-20 10:50:35 +00:00
Craig Topper	61aa773498	Remove code that prevented lowering shuffles if they are used by load and themselves used by a extract_vector_elt. This was done to allow the DAG combiner to collapse to a single element load. Unfortunately, sometimes the extract_vector_elt would disappear before DAG combine could do the transformation leaving a vector_shuffle that isel couldn't handle. New code lets the shuffle be converted to a target specific node, but then adds a combine routine that can convert target specific nodes back to vector_shuffles if the folding criteria are met. llvm-svn: 153080	2012-03-20 07:17:59 +00:00
Craig Topper	de938c64eb	Factor out target shuffle mask decoding from getShuffleScalarElt and use a SmallVector of int instead of unsigned for shuffle mask in decode functions. Preparation for another change. llvm-svn: 153079	2012-03-20 06:42:26 +00:00
Chris Lattner	9d0338734a	fix a build failure with libc++ llvm-svn: 153063	2012-03-19 23:31:01 +00:00
Jim Grosbach	0ca6b4a15d	ARM branch relaxation for unconditional t1 branches. rdar://11059157 llvm-svn: 153055	2012-03-19 21:32:32 +00:00
Jim Grosbach	65c7d4dab9	ARM assembly, accept optional '#' on lane index number. rdar://11057160 llvm-svn: 153053	2012-03-19 20:39:53 +00:00
Anton Korobeynikov	ccc669ff8f	Perform mul combine when multiplying wiht negative constants. Patch by Weiming Zhao! This fixes PR12212 llvm-svn: 153049	2012-03-19 19:19:50 +00:00
Preston Gurd	d1ae391210	This patch adds X86 instruction itineraries for non-pseudo opcodes in X86InstrCompiler.td. It also adds –mcpu-generic to the legalize-shift-64.ll test so the test will pass if run on an Intel Atom CPU, which would otherwise produce an instruction schedule which differs from that which the test expects. llvm-svn: 153033	2012-03-19 14:10:12 +00:00
Benjamin Kramer	080ccc13a6	Add a note for -ffast-math optimization of vector norm. llvm-svn: 153031	2012-03-19 00:43:34 +00:00
Craig Topper	34891f519c	isCommutedMOVLMask should only look at 128-bit vectors to match isMOVLMask. llvm-svn: 153027	2012-03-18 22:50:10 +00:00
Craig Topper	b1f171a213	Reorder includes in Target backends to following coding standards. Remove some superfluous forward declarations. llvm-svn: 152997	2012-03-17 18:46:09 +00:00
Craig Topper	bb72c24507	Fix some copy and paste remnants of Cell and SPU in Hexagon files. llvm-svn: 152981	2012-03-17 09:39:20 +00:00
Craig Topper	41786f284d	Fix typo in file header. llvm-svn: 152980	2012-03-17 09:28:37 +00:00
Craig Topper	3d39a3e5ba	Pass TargetOptions to HexagonTargetMachine constructor by reference to match other targets and the base class. llvm-svn: 152979	2012-03-17 09:24:09 +00:00
Craig Topper	0534d071b7	Reorder includes to match coding standards. Fix an issue or two exposed by that. llvm-svn: 152978	2012-03-17 07:33:42 +00:00
Bill Wendling	b427a9f177	Check if we can handle the arguments of a call (and therefore the call) in fast-isel before emitting code. If the program bails after code was emitted, then it could lead to the stack being adjusted more than once (two CALLSEQ_BEGINs emitted) but being adjuste back only once after the call. This leads to general badness and gnashing of teeth. <rdar://problem/11050630> llvm-svn: 152959	2012-03-16 23:11:07 +00:00
Jim Grosbach	7ab12a079f	ARM fix silly typo in optional operand alias. rdar://11065671 llvm-svn: 152954	2012-03-16 22:18:29 +00:00
Jim Grosbach	99aef428f3	ARM divided syntax fmrx/fmxr mnemonics. llvm-svn: 152946	2012-03-16 21:06:13 +00:00
Jim Grosbach	af19922301	ARM ldm/stm register lists can be out of order. It's not a good style idea, as the registers will be laid down in memory in numerical order, not the order they're in the list, but it's legal. vldm/vstm are stricter. rdar://11064740 llvm-svn: 152943	2012-03-16 20:48:38 +00:00
Jim Grosbach	a5d57ea09e	ARM optional operand on MRC/MCR assembly instructions. rdar://11058464 llvm-svn: 152883	2012-03-16 00:45:58 +00:00
Jim Grosbach	77151885af	ARM vmrs system registers mvfr0 and mvfr1 handling. rdar://11058464 llvm-svn: 152881	2012-03-16 00:27:18 +00:00
Jim Grosbach	e821ce3e5f	Remove inadvertant commit. llvm-svn: 152870	2012-03-15 23:00:30 +00:00
Chad Rosier	e007850778	[fast-isel] Address Eli's comments for r152847. Specifically, add a test case and still allow immediate encoding, just not with cmn. rdar://11038907 llvm-svn: 152869	2012-03-15 22:54:20 +00:00
Chad Rosier	103b276e74	[fast-isel] Don't try to encode LONG_MIN using cmn instructions. rdar://11038907 llvm-svn: 152847	2012-03-15 21:40:23 +00:00
Jim Grosbach	3812c82b92	ARM case-insensitive checking for APSR_nzcv. rdar://11056591 llvm-svn: 152846	2012-03-15 21:34:14 +00:00
Jim Grosbach	04f671dced	ARM aliases for pre-unified syntax fcmpz[sd] mnemonics. rdar://11056647 llvm-svn: 152834	2012-03-15 20:48:18 +00:00
Lang Hames	7918b0b225	Use vmov.f32 to materialize f32 consts on ARM. This relaxes constraints on register allocation by allowing all 32 D-registers to be used. Patch by Cameron Zwarich. llvm-svn: 152824	2012-03-15 18:49:02 +00:00
Kristof Beyls	5f7d669c67	Fix VCVT decoding (between floating-point and fixed-point, Floating-point). Patch by Richard Barton. llvm-svn: 152814	2012-03-15 17:50:29 +00:00
Chad Rosier	bd3e55d39c	[avx] Add patterns for VINSERTF128rm. This results in things such as vmovaps -96(%rbx), %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 to be combined to vinsertf128 $1, -96(%rbx), %ymm0, %ymm0 rdar://10643481 llvm-svn: 152762	2012-03-15 00:45:30 +00:00
Kevin Enderby	b5413ed6cc	Change the X86 assembler to not require a segment register on string instruction's destination operand like it does for the source operand. Also fix a typo in the comment for X86AsmParser::isSrcOp(). llvm-svn: 152654	2012-03-13 19:47:55 +00:00
Kevin Enderby	9f26c75ab5	Added a missing error check for X86 assembly with mismatched base and index registers not both being 64-bit or both being 32-bit registers. llvm-svn: 152580	2012-03-12 21:32:09 +00:00
Bob Wilson	a0d2185fe0	Switch to unified syntax for VFP instructions in inline assembly. <rdar://problem/11024696> llvm-svn: 152548	2012-03-12 06:15:36 +00:00
Benjamin Kramer	702df4f3ee	Remove global map. This code isn't even hot. llvm-svn: 152544	2012-03-11 18:12:04 +00:00
Craig Topper	df2bf795d6	Convert more static tables of registers used by calling convention to uint16_t to reduce space. llvm-svn: 152538	2012-03-11 07:57:25 +00:00
Craig Topper	c83a2b1ac0	Use uint16_t to store registers and opcode in static tables in the target specific backends. llvm-svn: 152537	2012-03-11 07:16:55 +00:00
Craig Topper	682445688d	Remove unused functions getArgRegs and getNumArgRegs. llvm-svn: 152535	2012-03-11 06:46:40 +00:00
Stepan Dyatkovskiy	72fdcabd4d	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Kay Tiong Khoo	aaa4140718	*fix typo in comment; test of commit access llvm-svn: 152507	2012-03-10 21:29:49 +00:00
Benjamin Kramer	ae4fd1b853	C files in llvm still have to be C89 compliant, remove C++-style comments. llvm-svn: 152495	2012-03-10 15:10:06 +00:00
Bill Wendling	1a3f2619a7	Fix disasm of iret, sysexit, and sysret when displayed with Intel syntax. Patch by Kay Tiong Khoo! llvm-svn: 152487	2012-03-10 07:37:27 +00:00
Akira Hatanaka	8cbaf16390	Do not custom lower i64 nodes if i64 is not a legal type. Move lines that set operation action of nodes. llvm-svn: 152452	2012-03-10 00:03:50 +00:00
Akira Hatanaka	93f6fbc38b	Lower SETCC nodes during legalization. Previously, it was lowered in DAG combine pass. llvm-svn: 152450	2012-03-09 23:46:03 +00:00
Akira Hatanaka	8d12a0fbf8	Remove unused header files. llvm-svn: 152447	2012-03-09 23:28:30 +00:00
Kevin Enderby	15f974a5a4	Add the missing call to Error when a bad X86 scale expression is parsed. llvm-svn: 152443	2012-03-09 22:24:10 +00:00
Kevin Enderby	1a3b6570f8	Fix the x86 disassembler to at least print the lock prefix if it is the first prefix. Added a FIXME to remind us this still does not work when it is not the first prefix. llvm-svn: 152414	2012-03-09 17:52:49 +00:00
Craig Topper	3dee24b8c3	Use uint16_t to store opcodes in static tables in X86 backend. llvm-svn: 152391	2012-03-09 07:45:21 +00:00
Ahmed Charles	cc93a16f47	Fix undefined behavior in the Mips backend. llvm-svn: 152390	2012-03-09 06:36:45 +00:00
Chad Rosier	a10cf5e1b9	Fix a regression from r147481. Original commit message from r147481: DAGCombine for transforming 128->256 casts into a vmovaps, rather then a vxorps + vinsertf128 pair if the original vector came from a load. Fix: Unaligned loads need to generate a vmovups. rdar://10974078 llvm-svn: 152366	2012-03-09 02:00:48 +00:00
Craig Topper	79f1e75059	Use uint16_t to store instruction implicit uses and defs. Reduces static data. llvm-svn: 152301	2012-03-08 08:22:45 +00:00
Stepan Dyatkovskiy	79f3dd93b7	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows to change case successor and case value. Usage of iterator allows totally remove resolveXXXX methods. All indexing convertions done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst SI = ... // intialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Akira Hatanaka	7390c7f0b4	Invoke setTargetDAGCombine for SELECT. llvm-svn: 152290	2012-03-08 03:26:37 +00:00
Akira Hatanaka	9de051a22a	Swap the operands of a select node if the false (the second) operand is 0. For example, this pattern (select (setcc lhs, rhs, cc), true, 0) is transformed to this one: (select (setcc lhs, rhs, inverse(cc)), 0, true) This enables MipsDAGToDAGISel::ReplaceUsesWithZeroReg (added in r152280) to replace 0 with $zero. llvm-svn: 152285	2012-03-08 02:14:24 +00:00
Akira Hatanaka	3440b8840f	Set minimum function alignment to 3 if target is Mips64. llvm-svn: 152282	2012-03-08 01:59:33 +00:00
Akira Hatanaka	f500c76435	This patch eliminates redundant instructions that produce 0. For example, the first instruction in the code below can be eliminated if the use of $vr0 is replaced with $zero: addiu $vr0, $zero, 0 add $vr2, $vr1, $vr0 add $vr2, $vr1, $zero llvm-svn: 152280	2012-03-08 01:51:59 +00:00
Jim Grosbach	6c4a2c8050	ARM don't use MCRelaxAll, as it's not safe on ARM. The ARM code generator makes aggressive assumptions about the encodings being selected for branches which MCRelaxAll invalidates. rdar://11006355 llvm-svn: 152268	2012-03-08 00:07:52 +00:00
Chad Rosier	07b6e758e1	[fast-isel] ARMEmitCmp generates FMSTAT, which transfers the floating-point condition flags to CPSR. This allows us to simplify SelectCmp. Patch by Zonr Chang <zonr.xchg@gmail.com>. llvm-svn: 152243	2012-03-07 20:59:26 +00:00
Jim Grosbach	d0770582f9	ARM pre-v6 assembly parsing for umull/smull. llvm-svn: 152188	2012-03-07 01:09:17 +00:00
Jim Grosbach	dbeec050c2	ARM pre-v6 alias for 'nop' to 'mov r0, r0' llvm-svn: 152185	2012-03-07 00:52:41 +00:00
Jim Grosbach	9ef7f069b5	Tidy up. Remove dead code that slipped into previous commit. llvm-svn: 152184	2012-03-07 00:52:39 +00:00
Jim Grosbach	3b5f99f716	ARM more NEON VLD/VST composite physical register refactoring. Register pair, all lanes subscripting. llvm-svn: 152157	2012-03-06 23:10:38 +00:00

... 2 3 4 5 6 ...

21179 Commits