llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 02:52:53 +02:00

Author	SHA1	Message	Date
Rafael Espindola	e0fa1f9b53	Expose getFlags via ELFSectionRef. llvm-svn: 240779	2015-06-26 12:44:10 +00:00
Rafael Espindola	4ef84ab21f	Add a ELFSectionRef class and use it to expose getSectionType. llvm-svn: 240778	2015-06-26 12:33:37 +00:00
Rafael Espindola	400aa8ffe6	Simplify getSymbolType. This is still a really odd function. Most calls are in object format specific contexts and should probably be replaced with a more direct query, but at least now this is not too obnoxious to use. llvm-svn: 240777	2015-06-26 12:18:49 +00:00
Javed Absar	d4144d8279	[ARM] Cortex-R4F is not VFPOnlySP Cortex-R4F TRM states that fpu supports both single and double precision. This patch corrects the information in ARM.td file and corresponding test. Reviewers: rengolin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10763 llvm-svn: 240776	2015-06-26 12:14:56 +00:00
Rafael Espindola	386eef5084	Make getOther ELF only. No other format has this field. llvm-svn: 240774	2015-06-26 11:39:57 +00:00
Rafael Espindola	9b4c1d87c6	Optimize the creation of mapping symbols. No need to create two symbols just to assign one to the other. llvm-svn: 240773	2015-06-26 11:31:13 +00:00
David Majnemer	f499207c44	[X86] Cleanup X86WindowsTargetObjectFile::getSectionForConstant No functionality changed, just keeping things clean. llvm-svn: 240762	2015-06-26 07:03:12 +00:00
Hao Liu	bfe90ecb2e	[InterleavedAccess] Fix failures "undefined type 'llvm::raw_ostream'" on windows. llvm-svn: 240760	2015-06-26 04:38:21 +00:00
Hao Liu	fc6114fe0f	[ARM] Lower interleaved memory accesses to vldN/vstN intrinsics. This patch also adds a function to calculate the cost of interleaved memory accesses. E.g. Lower an interleaved load: %wide.vec = load <8 x i32>, <8 x i32>* %ptr, align 4 %v0 = shuffle %wide.vec, undef, <0, 2, 4, 6> %v1 = shuffle %wide.vec, undef, <1, 3, 5, 7> into: %vld2 = { <4 x i32>, <4 x i32> } call llvm.arm.neon.vld2(%ptr, 4) %vec0 = extractelement { <4 x i32>, <4 x i32> } %vld2, i32 0 %vec1 = extractelement { <4 x i32>, <4 x i32> } %vld2, i32 1 E.g. Lower an interleaved store: %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr, align 4 into: %sub.v0 = shuffle <8 x i32> %v0, <8 x i32> v1, <0, 1, 2, 3> %sub.v1 = shuffle <8 x i32> %v0, <8 x i32> v1, <4, 5, 6, 7> %sub.v2 = shuffle <8 x i32> %v0, <8 x i32> v1, <8, 9, 10, 11> call void llvm.arm.neon.vst3(%ptr, %sub.v0, %sub.v1, %sub.v2, 4) Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240755	2015-06-26 02:45:36 +00:00
Hao Liu	1dc438bc4e	[AArch64] Lower interleaved memory accesses to ldN/stN intrinsics. This patch also adds a function to calculate the cost of interleaved memory accesses. E.g. Lower an interleaved load: %wide.vec = load <8 x i32>, <8 x i32>* %ptr %v0 = shuffle %wide.vec, undef, <0, 2, 4, 6> %v1 = shuffle %wide.vec, undef, <1, 3, 5, 7> into: %ld2 = { <4 x i32>, <4 x i32> } call llvm.aarch64.neon.ld2(%ptr) %vec0 = extractelement { <4 x i32>, <4 x i32> } %ld2, i32 0 %vec1 = extractelement { <4 x i32>, <4 x i32> } %ld2, i32 1 E.g. Lower an interleaved store: %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr into: %sub.v0 = shuffle <8 x i32> %v0, <8 x i32> v1, <0, 1, 2, 3> %sub.v1 = shuffle <8 x i32> %v0, <8 x i32> v1, <4, 5, 6, 7> %sub.v2 = shuffle <8 x i32> %v0, <8 x i32> v1, <8, 9, 10, 11> call void llvm.aarch64.neon.st3(%sub.v0, %sub.v1, %sub.v2, %ptr) Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240754	2015-06-26 02:32:07 +00:00
Hao Liu	00aff8cc17	[InterleavedAccess] Add a pass InterleavedAccess to identify interleaved memory accesses and transform into target specific intrinsics. E.g. An interleaved load (Factor = 2): %wide.vec = load <8 x i32>, <8 x i32>* %ptr %v0 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <0, 2, 4, 6> %v1 = shuffle <8 x i32> %wide.vec, <8 x i32> undef, <1, 3, 5, 7> It can be transformed into a ld2 intrinsic in AArch64 backend or a vld2 intrinsic in ARM backend. E.g. An interleaved store (Factor = 3): %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr It can be transformed into a st3 intrinsic in AArch64 backend or a vst3 intrinsic in ARM backend. Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240751	2015-06-26 02:10:27 +00:00
Matthias Braun	917f1e4a7f	Revert "X86: Reject register operands with obvious type mismatches." Revert until http://llvm.org/PR23955 is investigated. This reverts commit r239309. llvm-svn: 240746	2015-06-26 00:26:49 +00:00
Alexey Samsonov	4c3b8a043f	[ASan] Use llvm::getDISubprogram() to get function entry debug location. It can be more robust than copying debug info from first non-alloca instruction in the entry basic block. We use the same strategy in coverage instrumentation. llvm-svn: 240738	2015-06-26 00:00:47 +00:00
Duncan P. N. Exon Smith	b8e0101e11	AsmPrinter: Use an intrusively linked list for DIE::Children Replace the `std::vector<>` for `DIE::Children` with an intrusively linked list. This is a strict memory improvement: it requires no auxiliary storage, and reduces `sizeof(DIE)` by one pointer. It also factors out the DIE-related malloc traffic. This drops llc memory usage from 735 MB down to 718 MB, or ~2.3%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240736	2015-06-25 23:52:10 +00:00
Duncan P. N. Exon Smith	249d680189	AsmPrinter: Convert DIE::Values to a linked list Change `DIE::Values` to a singly linked list, where each node is allocated on a `BumpPtrAllocator`. In order to support `push_back()`, the list is circular, and points at the tail element instead of the head. I abstracted the core list logic out to `IntrusiveBackList` so that it can be reused for `DIE::Children`, which also cares about `push_back()`. This drops llc memory usage from 799 MB down to 735 MB, about 8%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 240733	2015-06-25 23:46:41 +00:00
NAKAMURA Takumi	a5b2580063	PPCISelLowering.cpp: Appease PR23956. [-Wdocumentation] llvm-svn: 240727	2015-06-25 23:38:44 +00:00
Anna Zaks	faa9b1561e	[asan] Do not instrument special purpose LLVM sections. Do not instrument globals that are placed in sections containing "__llvm" in their name. This fixes a bug in ASan / PGO interoperability. ASan interferes with LLVM's PGO, which places its globals into a special section, which is memcpy-ed by the linker as a whole. When those goals are instrumented, ASan's memcpy wrapper reports an issue. http://reviews.llvm.org/D10541 llvm-svn: 240723	2015-06-25 23:35:48 +00:00
Anna Zaks	494d337bdd	[asan] Don't run stack malloc on functions containing inline assembly. It makes LLVM run out of registers even on 64-bit platforms. For example, the following test case fails on darwin. clang -cc1 -O0 -triple x86_64-apple-macosx10.10.0 -emit-obj -fsanitize=address -mstackrealign -o ~/tmp/ex.o -x c ex.c error: inline assembly requires more registers than available void TestInlineAssembly(const unsigned char S, unsigned int pS, unsigned char D, unsigned int pD, unsigned int h) { unsigned int sr = 4, pDiffD = pD - 5; unsigned int pDiffS = (pS << 1) - 5; char flagSA = ((pS & 15) == 0), flagDA = ((pD & 15) == 0); asm volatile ( "mov %0, %%"PTR_REG("si")"\n" "mov %2, %%"PTR_REG("cx")"\n" "mov %1, %%"PTR_REG("di")"\n" "mov %8, %%"PTR_REG("ax")"\n" : : "m" (S), "m" (D), "m" (pS), "m" (pDiffS), "m" (pDiffD), "m" (sr), "m" (flagSA), "m" (flagDA), "m" (h) : "%"PTR_REG("si"), "%"PTR_REG("di"), "%"PTR_REG("ax"), "%"PTR_REG("cx"), "%"PTR_REG("dx"), "memory" ); } http://reviews.llvm.org/D10719 llvm-svn: 240722	2015-06-25 23:35:45 +00:00
Matt Arsenault	f547928c2c	DAGCombiner: Use pop_back_val() llvm-svn: 240709	2015-06-25 22:15:05 +00:00
Rafael Espindola	9195b6922c	Add an ELFSymbolRef type. This allows user code to say Sym.getSize() instead of having to manually fetch the object. llvm-svn: 240708	2015-06-25 22:10:04 +00:00
Frederic Riss	a084fef0e8	IAS: Use the root macro instanciation for location r224810 fixed the handling of macro debug locations in AsmParser. This patch fixes the logic to actually do what was intended: it uses the first macro of the macro stack instead of the last one. The updated testcase shows that the current scheme doesn't work when macro instanciations are nested and multiple files are used. Reviewers: compnerd Differential Revision: http://reviews.llvm.org/D10463 llvm-svn: 240705	2015-06-25 21:57:33 +00:00
Sanjay Patel	9c692291d3	fix typos; NFC llvm-svn: 240699	2015-06-25 21:11:08 +00:00
Pete Cooper	c2ffa0891f	Use foreach loop over constant operands. NFC. A number of places had explicit loops over Constant::operands(). Just use foreach loops where possible. llvm-svn: 240694	2015-06-25 20:51:38 +00:00
Jingyue Wu	35a6e27706	[InstCombine] call SimplifyICmpInst with correct context Summary: Fixes PR23809. Without passing the context to SimplifyICmpInst, we would use the assume to prove that the condition feeding the assume is trivially true (see isValidAssumeForContext in ValueTracking.cpp), causing the removal of the assume which may be useful for later optimizations. Test Plan: pr23800.ll Reviewers: hfinkel, majnemer Reviewed By: hfinkel Subscribers: henryhu, llvm-commits, wengxt, broune, meheff, eliben Differential Revision: http://reviews.llvm.org/D10695 llvm-svn: 240683	2015-06-25 20:14:47 +00:00
Rafael Espindola	c53c13d76f	Diagnose undefined temporary symbols. We already disallowed .global .Lfoo so this is reasonable. This is a small cherry pick from r240130. llvm-svn: 240681	2015-06-25 20:10:45 +00:00
Yaron Keren	dce18afe3b	Rangify for loop in Inliner.cpp. NFC. llvm-svn: 240678	2015-06-25 19:28:24 +00:00
Matt Arsenault	33b322c0c3	DAGCombiner: Remove redundant check MemIntrinsicSDNode is already a subclass of MemSDNode, so the MemSDNode check is sufficient. llvm-svn: 240672	2015-06-25 18:47:02 +00:00
Peter Collingbourne	3b0fac198c	GVN: If a branch has two identical successors, we cannot declare either dead. This previously caused miscompilations as a result of phi nodes receiving undef incoming values from blocks dominated by such successors. Differential Revision: http://reviews.llvm.org/D10726 llvm-svn: 240670	2015-06-25 18:32:02 +00:00
Kit Barton	230223268c	[PPC] Implement vmrgew and vmrgow instructions This patch adds support for the vector merge even word and vector merge odd word instructions introduced in POWER8. Phabricator review: http://reviews.llvm.org/D10704 llvm-svn: 240650	2015-06-25 15:17:40 +00:00
Bruno Cardoso Lopes	43fa3a4dea	[AsmPrinter] Fix crash in handleIndirectSymViaGOTPCRel Check for symbols in MCValue before using them. Bail out early in case they are null. This fixes PR23779. Differential Revision: http://reviews.llvm.org/D10712 rdar://problem/21532830 llvm-svn: 240649	2015-06-25 15:17:23 +00:00
Rafael Espindola	2c1319e668	Use computeSymbolSizes in llvm-symbolize. llvm-svn: 240646	2015-06-25 15:06:38 +00:00
Benjamin Kramer	6586644fa4	[PPC] Replace debug value skipping with getLastNonDebugInstr. No functionality change intended. llvm-svn: 240641	2015-06-25 13:39:03 +00:00
Benjamin Kramer	4ed07455af	Replace copy-pasted debug value skipping with MBB::getLastNonDebugInstr No functional change intended. llvm-svn: 240639	2015-06-25 13:28:24 +00:00
Toma Tabacu	f1219360be	[mips] [IAS] Refactor the emitDirectiveModuleFP() functions. NFC. Summary: Simplify emitDirectiveModuleFP() by having it just print the current information from MipsABIFlagsSection and doing an updateABIInfo() before such calls. This prevents us from forgetting to update the STI.FeatureBits, because updateABIInfo() uses those to update the MipsABIFlagsSection object, and also makes sure we use the update mechanism from MipsABIFlagsSection. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits, mpf Differential Revision: http://reviews.llvm.org/D10642 llvm-svn: 240637	2015-06-25 12:44:38 +00:00
Artur Pilipenko	ccbbd4db82	Take alignment into account in isSafeToLoadUnconditionally Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D10475 llvm-svn: 240636	2015-06-25 12:18:43 +00:00
Ulrich Weigand	580894b7f9	[SystemZ] Only attempt RxSBG optimization for integer types As pointed out by Justin Bogner (see r240520), SystemZDAGToDAGISel::Select currently attempts to convert boolean operations into RxSBG even on some non-integer types (in particular, vector types). This would not work in any case, and it happened to trigger undefined behaviour in allOnes. This patch verifies that we have a (<= 64-bit) integer type before attempting to perform this optimization. llvm-svn: 240634	2015-06-25 11:52:36 +00:00
Toma Tabacu	b5146d10c2	[mips] [IAS] Refactor the emitDirectiveModuleOddSPReg() functions. NFC. Summary: We can simplify emitDirectiveModuleOddSPReg() by having it print the current OddSPReg information from MipsABIFlagsSection and doing an updateABIInfo() before such calls. This prevents us from forgetting to update the STI.FeatureBits, because updateABIInfo() uses those to update the MipsABIFlagsSection object, and also makes sure we use the update mechanism from MipsABIFlagsSection. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits, mpf Differential Revision: http://reviews.llvm.org/D10641 llvm-svn: 240630	2015-06-25 10:56:57 +00:00
Jay Foad	ccb29917f1	Teach LLVM about the PPC64 memory sanitizer implementation. Summary: This is the LLVM part of the PPC memory sanitizer implementation in D10648. Reviewers: kcc, samsonov, willschm, wschmidt, eugenis Reviewed By: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10649 llvm-svn: 240627	2015-06-25 10:34:29 +00:00
Toma Tabacu	5af17cfb8c	[mips] [IAS] Fix parsing of memory offset expressions with parenthesis depth >1. Summary: In an expression such as "(((a+b)+c)+d)", parseParenExpression() would only parse the "a+b)+c", which would result in an error later on in the parser. This means that we can only parse one level of inner parentheses. In order to fix this, I added a new function called parseParenExprOfDepth(), which parses a specified number of trailing parenthesis expressions (except for the outermost parenthesis), and changed MipsAsmParser to use it in parseMemOffset instead of parseParenExpression(). Reviewers: dsanders, rafael Reviewed By: dsanders, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9742 llvm-svn: 240625	2015-06-25 09:52:02 +00:00
Ahmed Bougacha	9f949b3f29	[X86] Accept hasAVX512() as well as hasFMA() when generating FMA. We don't always have FMA, for example when using 'clang -mavx512f' without an explicit CPU. Also check for an explicit +avx512f instead of CPUs in a couple related tests. llvm-svn: 240616	2015-06-25 00:44:46 +00:00
Swaroop Sridhar	c6d536f7a9	Enable StackMap Serialization for COFF Summary This change turns on the emission of __LLVM_Stackmaps section when generating COFF binaries. Test Plan Added a scenario to the test case: test\CodeGen\X86\statepoint-stackmap-format.ll. Code Review: http://reviews.llvm.org/D10680 llvm-svn: 240613	2015-06-25 00:28:42 +00:00
Rui Ueyama	89c36fab4b	libObject/COFF: Add a function to get pointers to relocation entries. llvm-svn: 240610	2015-06-25 00:07:39 +00:00
Duncan P. N. Exon Smith	bf3519f494	Add simplify_type<const WeakVH>; simplify IndVarSimplify r240214 fixed some UB in IndVarSimplify, and it needed a temporary `WeakVH` to do it. Add `simplify_type<const WeakVH>` so that this temporary isn't necessary. llvm-svn: 240599	2015-06-24 22:23:21 +00:00
Douglas Katzman	73a319e633	[X86] Simplify some stuff in X86DisassemblerDecoder. NFC - Deciding that insn->sibIndex is SIB_INDEX_NONE does not require another check beyond the fully decoded bits being equal to 0x4. The expression insn->sibIndex == SIB_INDEX_sib could not have been true unless index were 0x4, because SIB_INDEX_sib is merely the range base (SIB_INDEX_EAX) plus 4. Respectively SIB_INDEX_sib64. - Don't use a switch statement to perform left-shift. Differential Revision: http://reviews.llvm.org/D9762 llvm-svn: 240598	2015-06-24 22:04:55 +00:00
David Majnemer	cda72ee99b	[GVN] Intersect the IR flags when CSE'ing two instructions We performed a simple, but incomplete, intersection when it came time to CSE instructions. It didn't handle, for example, the 'exact' flag. This fixes PR23922. llvm-svn: 240595	2015-06-24 21:52:25 +00:00
David Majnemer	0a9ab36033	[Reassociate] Don't propogate flags when creating negations Reassociate mutated existing instructions in order to form negations which would create additional reassociate opportunities. This fixes PR23926. llvm-svn: 240593	2015-06-24 21:27:36 +00:00
Sanjay Patel	a334472ec5	fix typos; NFC llvm-svn: 240592	2015-06-24 20:42:33 +00:00
Sanjay Patel	43eef8bba0	don't repeat function names in comments; NFC llvm-svn: 240591	2015-06-24 20:40:57 +00:00
Akira Hatanaka	15c1a020a2	[If Converter] Convert recursion to iteration. This commit makes changes to IfConverter::AnalyzeBlock to use iteration instead of recursion. Previously, this function would get called recursively a large number of times and eventually segfault when a function with the following CFG was compiled: BB0: if (condition0) goto BB1 goto BB2 BB1: goto BB2 BB2: if (condition1) goto BB3 goto BB4 BB3: ... (repeat until BB7488) rdar://problem/21386145 Differential Revision: http://reviews.llvm.org/D10587 llvm-svn: 240589	2015-06-24 20:34:35 +00:00
Pete Cooper	cbf9b4e67d	Devirtualize Instruction::clone_impl llvm-svn: 240588	2015-06-24 20:22:23 +00:00

1 2 3 4 5 ...

80783 Commits