llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Justin Bogner	265688dec4	Support: Functions for consuming endian specific data from a buffer. This adds a function to Endian.h that reads from and updates a pointer into a buffer with endian specific data. This is more convenient for stream-like reading of data than endian::read. llvm-svn: 204693	2014-03-25 01:04:44 +00:00
Manman Ren	e9c89df128	Register Allocator: check other options before using a CSR for the first time. When register allocator's stage is RS_Spill, we choose spill over using the CSR for the first time, if the spill cost is lower than CSRCost. When register allocator's stage is < RS_Split, we choose pre-splitting over using the CSR for the first time, if the cost of splitting is lower than CSRCost. CSRCost is set with command-line option "regalloc-csr-first-time-cost". The default value is 0 to generate the same codes as before this commit. With a value of 15 (1 << 14 is the entry frequency), I measured performance gain of 3% on 253.perlbmk and 1.7% on 197.parser, with instrumented PGO, on an arm device. rdar://16162005 llvm-svn: 204690	2014-03-25 00:16:25 +00:00
Kevin Enderby	780eb96e7a	Fix crashes when assembler directives are used that are not for Mach-O object files by generating an error instead. rdar://16335232 llvm-svn: 204687	2014-03-25 00:05:50 +00:00
Manman Ren	99187faec4	Register Allocator: refactoring (no functionality change). Factor out two functions calculateRegionSplitCost and doRegionSplit from tryRegionSplit. These two functions will be used in coming patches. rdar://16162005 llvm-svn: 204684	2014-03-24 23:23:42 +00:00
David Blaikie	155b9ac89b	DebugInfo: Simplify debug loc list handling by keeping separate lists Rather than using a flat list with "empty" entries (ala the actual on-disk format), keep separate lists for each variable. llvm-svn: 204680	2014-03-24 22:38:38 +00:00
David Blaikie	6c35a2d755	DwarfDebug: Simplify debug_loc merging No functional change intended. Merging up-front rather than delaying this task until later. This just seems simpler and more efficient (avoiding growing the debug loc list only to have to skip over those post-merged entries, etc). llvm-svn: 204679	2014-03-24 22:27:06 +00:00
Adrian Prantl	f42452e60a	Get rid of an unnecessary use of the * and & operators. llvm-svn: 204673	2014-03-24 21:33:01 +00:00
David Blaikie	a69e0d8494	DebugInfo: Add DW_AT_GNU_ranges_base to skeleton CUs This is used to avoid relocations in the dwo file by allowing DW_AT_ranges specified in debug_info.dwo to be relative to this base address. (r204667 implements the base-relative DW_AT_ranges side of this) llvm-svn: 204672	2014-03-24 21:31:35 +00:00
Justin Bogner	d178b46013	Support: Document Endian.h functions llvm-svn: 204671	2014-03-24 21:30:55 +00:00
David Blaikie	982d05712f	DebugInfo: Implement relative addressing for DW_AT_ranges under fission This removes the debug_ranges relocations from debug_info.dwo (but doesn't implement the DW_AT_GNU_ranges_base which is also necessary for correct functioning) llvm-svn: 204668	2014-03-24 21:07:27 +00:00
David Blaikie	4dbc173911	DebugInfo: Don't emit relocations to abbreviations in debug_info.dwo llvm-svn: 204667	2014-03-24 20:53:02 +00:00
David Blaikie	2f7fbac157	DwarfDebug: Remove an unused parameter llvm-svn: 204665	2014-03-24 20:31:01 +00:00
Matt Arsenault	ca20d1a0f2	R600: Don't viewCFG() under DEBUG() except on failure. Having these popping up every time you use -debug is really irritating. llvm-svn: 204664	2014-03-24 20:29:02 +00:00
David Blaikie	65a7d20254	Remove unused parameter llvm-svn: 204663	2014-03-24 20:28:10 +00:00
Matt Arsenault	94cdf74a4b	R600/SI: Fix extra mov from legalizing 64-bit SALU ops. Check the register class of each operand individually to avoid an extra copy to a vgpr. llvm-svn: 204662	2014-03-24 20:08:13 +00:00
Matt Arsenault	3436234471	R600/SI: Sub-optimial fix for 64-bit immediates with SALU ops. No longer asserts, but now you get moves loading legal immediates into the split 32-bit operations. llvm-svn: 204661	2014-03-24 20:08:09 +00:00
Matt Arsenault	ed12a24627	R600/SI: Fix 64-bit bit ops that require the VALU. Try to match scalar and first like the other instructions. Expand 64-bit ands to a pair of 32-bit ands since that is not available on the VALU. llvm-svn: 204660	2014-03-24 20:08:05 +00:00
Yaron Keren	3d4f34c936	In Release modes, Visual Studio complains that the Operator destructor in User.cpp never returns, which is true by design. Initially assumed that the reason is llvm_unreachable being dependent on NDEBUG. However, even if llvm_unreachable is replaced by __assume(false), VC still warns in Release modes but not in Debug modes... The real reason turned out to be optimization flags. With /Od in Debug modes the warning is not issued whereas with /O1 it is. I could not find any documentation to this effect, but it is reproducable: Try compiling http://msdn.microsoft.com/en-us/library/khwfyc5d(v=vs.90).aspx with /O1 and then with /Od. llvm-svn: 204659	2014-03-24 19:48:13 +00:00
Matt Arsenault	7ae7f52221	R600: Implement isNarrowingProfitable. llvm-svn: 204658	2014-03-24 19:43:31 +00:00
Matt Arsenault	d1a3190fc4	R600/SI: Move splitting 64-bit immediates to separate function. llvm-svn: 204651	2014-03-24 18:26:52 +00:00
Aaron Ballman	663fd6696b	Adding some very nascent information about the clang tablegen backends, with a promise to add more information later. llvm-svn: 204635	2014-03-24 18:18:31 +00:00
Ulrich Weigand	f2e33e8135	[PowerPC] Generate little-endian object files As a first step towards real little-endian code generation, this patch changes the PowerPC MC layer to actually generate little-endian object files. This involves passing the little-endian flag through the various layers, including down to createELFObjectWriter so we actually get basic little-endian ELF objects, emitting instructions in little-endian order, and handling fixups and relocations as appropriate for little-endian. The bulk of the patch is to update most test cases in test/MC/PowerPC to verify both big- and little-endian encodings. (The only test cases not updated are those that create actual big-endian ABI code, like the TLS tests.) Note that while the object files are now little-endian, the generated code itself is not yet updated, in particular, it still does not adhere to the ELFv2 ABI. llvm-svn: 204634	2014-03-24 18:16:09 +00:00
Quentin Colombet	ac3c109b60	[X86][ISelDAG] Add missing fallback patterns for avx2 broadcast instructions. Those patterns are used when the load cannot be folded into the related broadcast during the select phase. This happens when the load gets additional uses that were not anticipated during the previous lowering phases (constant vector to constant load, then constant load reused) or when selection DAG is not able to prove that folding the load will not create a cycle in the DAG. <rdar://problem/16074331> llvm-svn: 204631	2014-03-24 17:54:19 +00:00
Matt Arsenault	e063f39ed3	R600/SI: Fix 64-bit private loads. llvm-svn: 204630	2014-03-24 17:50:46 +00:00
Hans Wennborg	9751c92fd8	VS integration installer: set SUCCESS=1 if we find VS 2013 Previously we would print an error message on machines where the only VS version we find is 2013, even though we successfully install the integration files for it. Also, we shouldn't have two END labels. llvm-svn: 204629	2014-03-24 17:33:22 +00:00
Eli Bendersky	9d3cb5eed7	Add test to test/CodeGen/NVPTX for "alloca buffer" arguments. Make sure such IR gets properly lowered to PTX. llvm-svn: 204624	2014-03-24 16:52:30 +00:00
Adam Nemet	7a62bae9d5	[X86] Fix non-determinism in LowerVectorAllZeroTest This can be observed with the old testcase of CodeGen/X86/pr12312.ll: 47c47 < vorps %ymm0, %ymm1, %ymm0 --- > vorps %ymm1, %ymm0, %ymm0 97c97 < vorps %ymm1, %ymm0, %ymm0 --- > vorps %ymm0, %ymm1, %ymm0 The vector VecIns is populated with all the values from VecInMap. This is done while iterating VecInMap. VecInMap uses a hash of pointer values so the resulting order can vary depending on the memory layout. The fix is to populate the vector VecIns earlier as VecInMap is populated. This is done in DAG traversal order. Fixes <rdar://problem/16398806> llvm-svn: 204623	2014-03-24 16:52:08 +00:00
Daniel Sanders	8e41bea37f	[mips] Add error message when trying to use $at in '.set noat' mode. Summary: Patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3158 llvm-svn: 204621	2014-03-24 16:48:01 +00:00
Eli Bendersky	cdebee3013	Removes the NVPTXSplitBBatBar pass. This pass is a historic remnant and actually causes less efficient code to be generated in some cases. llvm-svn: 204620	2014-03-24 16:36:39 +00:00
Tom Stellard	419d4b7ff4	R600/SI: Fix warning with gcc 4.8.2 llvm-svn: 204618	2014-03-24 16:12:34 +00:00
Tom Stellard	ca1096aa07	R600/SI: Promote fp64 SELECT to i64 This type promotion is replacing a Tablegen pattern and it is already covered by existing tests. llvm-svn: 204617	2014-03-24 16:07:30 +00:00
Tom Stellard	e7bae2dc06	SelectionDAG: Allow promotion of SELECT nodes from float to int types And vice-versa, as long as the types are the same width. There are a few R600 tests that will cover this. llvm-svn: 204616	2014-03-24 16:07:28 +00:00
Tom Stellard	219968c351	R600: Reorganize tablegen instruction definitions Each GPU family now has its own file. llvm-svn: 204615	2014-03-24 16:07:25 +00:00
Will Schmidt	8fe019f872	[PPC64LE] ELFv2 ABI updates for the .opd section [PPC64LE] ELFv2 ABI updates for the .opd section The PPC64 Little Endian (PPC64LE) target supports the ELFv2 ABI, and as such, does not have a ".opd" section. This is keyed off a _CALL_ELF=2 macro check. The CALL_ELF check is not clearly documented at this time. The basis for usage in this patch is from the gcc thread here: http://gcc.gnu.org/ml/gcc-patches/2013-11/msg01144.html > Adding comment from Uli: Looks good to me. I think the old-style JIT doesn't really work anyway for 64-bit, but at least with this patch LLVM will compile and link again on a ppc64le host ... llvm-svn: 204614	2014-03-24 16:04:15 +00:00
Daniel Sanders	09c5facb40	[mips] Add regression tests for parenthetic expressions in MIPS assembly. Summary: These expressions already worked but weren't tested. Patch by Robert N. M. Watson and David Chisnall (it was originally two patches) Their work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3156 llvm-svn: 204612	2014-03-24 15:42:21 +00:00
Daniel Sanders	318ec4f378	[mips] Allow dsubu to take an immediate as an alias for dsubiu. Summary: Patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3155 llvm-svn: 204611	2014-03-24 15:38:00 +00:00
Hal Finkel	feee356b52	[PowerPC] Mark many instructions as commutative I'm under the impression that we used to infer the isCommutable flag from the instruction-associated pattern. Regardless, we don't seem to do this (at least by default) any more. I've gone through all of our instruction definitions, and marked as commutative all of those that should be trivial to commute (by exchanging the first two operands). There has been special code for the RL* instructions, and that's not changed. Before this change, we had the following commutative instructions: RLDIMI RLDIMIo RLWIMI RLWIMI8 RLWIMI8o RLWIMIo XSADDDP XSMULDP XVADDDP XVADDSP XVMULDP XVMULSP After: ADD4 ADD4o ADD8 ADD8o ADDC ADDC8 ADDC8o ADDCo ADDE ADDE8 ADDE8o ADDEo AND AND8 AND8o ANDo CRAND CREQV CRNAND CRNOR CROR CRXOR EQV EQV8 EQV8o EQVo FADD FADDS FADDSo FADDo FMADD FMADDS FMADDSo FMADDo FMSUB FMSUBS FMSUBSo FMSUBo FMUL FMULS FMULSo FMULo FNMADD FNMADDS FNMADDSo FNMADDo FNMSUB FNMSUBS FNMSUBSo FNMSUBo MULHD MULHDU MULHDUo MULHDo MULHW MULHWU MULHWUo MULHWo MULLD MULLDo MULLW MULLWo NAND NAND8 NAND8o NANDo NOR NOR8 NOR8o NORo OR OR8 OR8o ORo RLDIMI RLDIMIo RLWIMI RLWIMI8 RLWIMI8o RLWIMIo VADDCUW VADDFP VADDSBS VADDSHS VADDSWS VADDUBM VADDUBS VADDUHM VADDUHS VADDUWM VADDUWS VAND VAVGSB VAVGSH VAVGSW VAVGUB VAVGUH VAVGUW VMADDFP VMAXFP VMAXSB VMAXSH VMAXSW VMAXUB VMAXUH VMAXUW VMHADDSHS VMHRADDSHS VMINFP VMINSB VMINSH VMINSW VMINUB VMINUH VMINUW VMLADDUHM VMULESB VMULESH VMULEUB VMULEUH VMULOSB VMULOSH VMULOUB VMULOUH VNMSUBFP VOR VXOR XOR XOR8 XOR8o XORo XSADDDP XSMADDADP XSMAXDP XSMINDP XSMSUBADP XSMULDP XSNMADDADP XSNMSUBADP XVADDDP XVADDSP XVMADDADP XVMADDASP XVMAXDP XVMAXSP XVMINDP XVMINSP XVMSUBADP XVMSUBASP XVMULDP XVMULSP XVNMADDADP XVNMADDASP XVNMSUBADP XVNMSUBASP XXLAND XXLNOR XXLOR XXLXOR This is a by-inspection change, and I'm not sure how to write a reliable test case. I would like advice on this, however. llvm-svn: 204609	2014-03-24 15:07:28 +00:00
Daniel Sanders	504788912a	[mips] Implement shorthand add / sub forms for MIPS. Summary: - If only two registers are passed to a three-register operation, then the first argument is both source and destination register. - If a non-register is passed as the last argument, generate the immediate version of the instruction. Also mark DADD commutative and add scheduling information (to the generic scheduler), and implement DSUB. Patch by David Chisnall His work was sponsored by: DARPA, AFRL CC: theraven Differential Revision: http://llvm-reviews.chandlerc.com/D3148 llvm-svn: 204605	2014-03-24 14:05:39 +00:00
Justin Holewinski	dd04498e61	[NVPTX] Add isel patterns for addrspacecast llvm-svn: 204600	2014-03-24 11:17:53 +00:00
Renato Golin	0c2e68784a	Update release notes with EHABI current behaviour llvm-svn: 204598	2014-03-24 11:02:38 +00:00
Hal Finkel	829bfc7c99	[PowerPC] Don't schedule VSX copy legalization unless VSX is enabled There is no need to schedule this extra pass if it will have nothing to do. llvm-svn: 204594	2014-03-24 09:51:41 +00:00
Hal Finkel	d4fe687c4a	[PowerPC] Update comment re: VSX copy-instruction selection I've done some experimentation with this, and it looks like using the lower-latency (but lower throughput) copy instruction is essentially always the right thing to do. My assumption is that, in order to be relatively sure that the higher-latency copy will increase throughput, we'd want to have it unlikely to be in-flight with its use. On the P7, the global completion table (GCT) can hold a maximum of 120 instructions, shared among all active threads (up to 4), giving 30 instructions per thread. So specifically, I'd require at least that many instructions between the copy and the use before the high-latency variant is used. Trying this, however, over the entire test suite resulted in zero cases where the high-latency form would be preferable. This may be a consequence of the fact that the scheduler views copies as free, and so they tend to end up close to their uses. For this experiment I created a function: unsigned chooseVSXCopy(MachineBasicBlock &MBB, MachineBasicBlock::iterator I, unsigned DestReg, unsigned SrcReg, unsigned StartDist = 1, unsigned Depth = 3) const; with an implementation like: if (!Depth) return PPC::XXLOR; const unsigned MaxDist = 30; unsigned Dist = StartDist; for (auto J = I, JE = MBB.end(); J != JE && Dist <= MaxDist; ++J) { if (J->isTransient() && !J->isCopy()) continue; if (J->isCall() \|\| J->isReturn() \|\| J->readsRegister(DestReg, TRI)) return PPC::XXLOR; ++Dist; } // We've exceeded the required distance for the high-latency form, use it. if (Dist > MaxDist) return PPC::XVCPSGNDP; // If this is only an exit block, use the low-latency form. if (MBB.succ_empty()) return PPC::XXLOR; // We've reached the end of the block, check the successor blocks (up to some // depth), and use the high-latency form if that is okay with all successors. for (auto J = MBB.succ_begin(), JE = MBB.succ_end(); J != JE; ++J) { if (chooseVSXCopy(*J, (J)->begin(), DestReg, SrcReg, Dist, --Depth) == PPC::XXLOR) return PPC::XXLOR; } // All of our successor blocks seem okay with the high-latency variant, so // we'll use it. return PPC::XVCPSGNDP; and then changed the copy opcode selection from: Opc = PPC::XXLOR; to: Opc = chooseVSXCopy(MBB, std::next(I), DestReg, SrcReg); In conclusion, I'm removing the FIXME from the comment, because I believe that there is, at least absent other examples, nothing to fix. llvm-svn: 204591	2014-03-24 09:36:36 +00:00
Rafael Espindola	592a9a42e8	Teach llvm-readobj to print human friendly description of reserved sections. llvm-svn: 204584	2014-03-24 05:00:34 +00:00
Karthik Bhat	d11430a31d	Allow constant folding of ceil function whenever feasible llvm-svn: 204583	2014-03-24 04:36:06 +00:00
Rafael Espindola	d210a6a3dc	Add back tests that were reverted in r204203. They pass again with the fix in r204581. llvm-svn: 204582	2014-03-24 03:48:15 +00:00
Rafael Espindola	561b2c23ab	Propagate section from base to derived symbol. We were already propagating the section in a = b With this patch we also propagate it for a = b + 1 llvm-svn: 204581	2014-03-24 03:43:21 +00:00
Duncan P. N. Exon Smith	6b6cc94659	InstrProf: Silence spurious warnings in GCC 4.8 No functionality change. llvm-svn: 204580	2014-03-24 00:47:18 +00:00
NAKAMURA Takumi	207fc57ce7	SupportTests.LockFileManagerTest: Add assertions for Win32. - create_link doesn't work for nonexistent file. - remove cannot remove working directory. llvm-svn: 204579	2014-03-23 23:55:57 +00:00
Arnaud A. de Grandmaison	9c4ecc2b1f	ARM: no need to update SplatBits as it is not used llvm-svn: 204575	2014-03-23 21:14:32 +00:00
Justin Bogner	92a3f4949e	llvm-profdata: Check for bad data in the show command llvm-svn: 204573	2014-03-23 20:55:53 +00:00

1 2 3 4 5 ...

101408 Commits