llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Krzysztof Parzyszek	9b5d973061	[Hexagon] Add target feature to generate long calls llvm-svn: 276638	2016-07-25 14:42:11 +00:00
Sam Parker	00a189f236	[ARM] Improve longMAC codegen test Added thumb targets and dataflow checks to the longMAC test. Differential Revision: https://reviews.llvm.org/D22684 llvm-svn: 276629	2016-07-25 10:11:00 +00:00
Simon Dardis	ec8cfdd35b	[mips] Optimize materialization of i64 constants Avoid MipsAnalyzeImmediate usage if the constant fits in an 32-bit integer. This allows us to generate the same instructions for the materialization of the same constants regardless the width of their type. Patch by: Vasileios Kalintiris Contributions by: Simon Dardis Reviewers: Daniel Sanders Differential Review: https://reviews.llvm.org/D21689 llvm-svn: 276628	2016-07-25 09:57:28 +00:00
Sam Parker	4b5db242de	[ARM] Enable ISel of SMMLS for ARM and Thumb2 Use ISelDAGToDAG to recognise the SMMLS instruction pattern. Differential Revision: https://reviews.llvm.org/D22562 llvm-svn: 276624	2016-07-25 09:20:20 +00:00
Craig Topper	7dbacc292f	[AVX512] Add load folding support for the unmasked forms of the FMA instructions. llvm-svn: 276615	2016-07-25 07:20:35 +00:00
Craig Topper	70f4ea4f4f	[AVX512] Add some additional patterns so that we can fold broadcast loads in the first argument of an FMADD/FMSUB/FNMADD/FNMSUB/FMADDSUB/FMSUBADD node. Also add patterns to support all combinations of the broadcast input and the preserved input for masked versions. llvm-svn: 276614	2016-07-25 07:20:31 +00:00
Craig Topper	f835e69f53	[AVX512] Cleanup FMA operand order in patterns to match the VEX versions and to really be 213, 231, and 132. llvm-svn: 276613	2016-07-25 07:20:28 +00:00
Sean Silva	4efed6f1fe	Fix : Partial Inliner requires AssumptionCacheTracker The public InlineFunction utility assumes that the passed in InlineFunctionInfo has a valid AssumptionCacheTracker. Patch by River Riddle! Differential Revision: https://reviews.llvm.org/D22706 llvm-svn: 276609	2016-07-25 05:00:00 +00:00
David Majnemer	d5810d0df0	[GVNHoist] Merge metadata on hoisted instructions less conservatively We can combine metadata from multiple instructions intelligently for certain metadata nodes. llvm-svn: 276602	2016-07-25 02:21:25 +00:00
David Majnemer	78f31dbf94	[GVNHoist] Properly merge alignments when hoisting If we two loads of two different alignments, we must use the minimum of the two alignments when hoisting. Same deal for stores. For allocas, use the maximum of the two allocas. llvm-svn: 276601	2016-07-25 02:21:23 +00:00
Simon Pilgrim	23ec68a853	[X86][SSE] Added PR27854 tests llvm-svn: 276571	2016-07-24 16:39:50 +00:00
Simon Pilgrim	3f978a6166	[X86] Add shift double tests for PR14593 llvm-svn: 276570	2016-07-24 16:10:21 +00:00
Simon Pilgrim	d864569c65	[X86] Add 'FeatureSlowSHLD' to cpu 'bdver4' As with all AMD CPUs, excavator has poor SHLD/SHRD performance. Also added bdver3 to the test as it was missing. llvm-svn: 276569	2016-07-24 16:00:53 +00:00
Simon Pilgrim	b026212a71	[X86] Add SHRD shift combine tests llvm-svn: 276568	2016-07-24 15:47:44 +00:00
Simon Pilgrim	c7bf081a3d	[X86] Regenerate shift by parts tests llvm-svn: 276567	2016-07-24 15:38:51 +00:00
Simon Pilgrim	8f7dac2bb5	[X86][SSE] Regenerate shifts tests llvm-svn: 276566	2016-07-24 15:25:36 +00:00
Simon Pilgrim	a9ad8ea0d7	[X86][SSE] Regenerate SSE copysign tests llvm-svn: 276565	2016-07-24 15:17:50 +00:00
Simon Pilgrim	2c10ea1638	[X86][AVX512VL] Added AVX512VL half2float vector conversions tests to demonstrate PR23941 llvm-svn: 276563	2016-07-24 13:01:51 +00:00
Craig Topper	39f037759a	[X86] Make the FMA3 instruction names consistent between VEX and EVEX encoded versions. This places the 132/213/231 form number in front of the SS/SD/PS/PD. Move the Y for 256-bit versions to be after the PS/PD. Change the AVX512 scalar forms to include a Z in the their name. This new format should be consistent with the general naming of instructions. llvm-svn: 276559	2016-07-24 08:26:38 +00:00
Elena Demikhovsky	6936839d54	[Loop Vectorizer] Handling loops FP induction variables. Allowed loop vectorization with secondary FP IVs. Like this: float *A; float x = init; for (int i=0; i < N; ++i) { A[i] = x; x -= fp_inc; } The auto-vectorization is possible when the induction binary operator is "fast" or the function has "unsafe" attribute. Differential Revision: https://reviews.llvm.org/D21330 llvm-svn: 276554	2016-07-24 07:24:54 +00:00
Simon Pilgrim	c5f18a6328	[X86][SSE] Added float widened broadcast tests llvm-svn: 276535	2016-07-23 21:24:02 +00:00
Simon Pilgrim	beb527ac55	[X86][SSE] Added more widened broadcast tests Added more vXi16 and vXi8 tests llvm-svn: 276534	2016-07-23 21:15:31 +00:00
Simon Pilgrim	9d71867ec5	[X86][SSE] Added tests where we should be trying to widen a load+splat into a broadcast llvm-svn: 276527	2016-07-23 16:19:17 +00:00
Simon Pilgrim	74ecadba24	[X86][SSE] Regenerated uitofp <2 x i32> -> <2 x float> conversion tests Demonstrate difference in codegen discussed on PR14760 llvm-svn: 276526	2016-07-23 15:55:42 +00:00
Sanjay Patel	1195d29a39	[InstCombine] allow icmp (bit-manipulation-intrinsic(), C) folds for vectors llvm-svn: 276523	2016-07-23 13:06:49 +00:00
Craig Topper	2b1e6a9706	[AVX512] Implement commuting support for EVEX encoded FMA3 instructions. llvm-svn: 276521	2016-07-23 07:16:56 +00:00
Xinliang David Li	a223f694cc	[Profile] Use explicit flag to enable IR PGO Patch by Jake VanAdrighem Differential Revision: http://reviews.llvm.org/D22607 llvm-svn: 276516	2016-07-23 04:28:52 +00:00
Sanjoy Das	d06246dfe7	[SCEV] Make isImpliedCondOperandsViaRanges smarter This change lets us prove things like "{X,+,10} s< 5000" implies "{X+7,+,10} does not sign overflow" It does this by replacing replacing getConstantDifference by computeConstantDifference (which is smarter) in isImpliedCondOperandsViaRanges. llvm-svn: 276505	2016-07-23 00:54:36 +00:00
Sanjay Patel	b0b41be26d	auto-generate checks llvm-svn: 276501	2016-07-23 00:09:54 +00:00
Tom Stellard	09e97d1cbb	Revert "[AMDGPU] Emit read-only data to .rodata for hsa" This reverts commit r276298. Data stored in .rodata can have a negative offset from .text, but we don't support negative values in relocations yet. This caused a regression in one of the amp conformance tests: 5_Data_Cont/5_2_a_v/5_2_3_m/Assignment/Test.02.01 llvm-svn: 276498	2016-07-22 23:46:40 +00:00
Adam Nemet	1c839e0506	[LoopDataPrefetch] Include hotness of region in opt remark llvm-svn: 276488	2016-07-22 22:53:17 +00:00
Sanjay Patel	dcaeebff4f	add tests for icmp vector folds llvm-svn: 276482	2016-07-22 22:19:52 +00:00
Tim Northover	4c332c49f1	GlobalISel: allow multiple types on MachineInstrs. llvm-svn: 276481	2016-07-22 22:13:36 +00:00
Vitaly Buka	fe894453fe	Unpoison stack before resume instruction Summary: Clang inserts cleanup code before resume similar way as before return instruction. This makes asan poison local variables causing false use-after-scope reports. __asan_handle_no_return does not help here as it was executed before llvm.lifetime.end inserted into resume block. To avoid false report we need to unpoison stack for resume same way as for return. PR27453 Reviewers: kcc, eugenis Differential Revision: https://reviews.llvm.org/D22661 llvm-svn: 276480	2016-07-22 22:04:38 +00:00
Alina Sbirlea	587d579c84	Add flag to PassManagerBuilder to disable GVN Hoist Pass. Summary: Adding a flag to diable GVN Hoisting by default. Note: The GVN Hoist Pass causes some Halide tests to hang. Halide will disable the pass while investigating. Reviewers: llvm-commits, chandlerc, spop, dberlin Subscribers: mehdi_amini Differential Revision: https://reviews.llvm.org/D22639 llvm-svn: 276479	2016-07-22 22:02:19 +00:00
Michael Kuperstein	cf125cd0ce	[SLPVectorizer] Vectorize reverse-order loads in horizontal reductions When vectorizing a tree rooted at a store bundle, we currently try to sort the stores before building the tree, so that the stores can be vectorized. For other trees, the order of the root bundle - which determines the order of all other bundles - is arbitrary. That is bad, since if a leaf bundle of consecutive loads happens to appear in the wrong order, we will not vectorize it. This is partially mitigated when the root is a binary operator, by trying to build a "reversed" tree when that's considered profitable. This patch extends the workaround we have for binops to trees rooted in a horizontal reduction. This fixes PR28474. Differential Revision: https://reviews.llvm.org/D22554 llvm-svn: 276477	2016-07-22 21:28:48 +00:00
Sanjay Patel	cefd5ab11e	add tests for icmp vector folds llvm-svn: 276476	2016-07-22 21:28:20 +00:00
Sanjay Patel	262eed8e23	add tests for icmp vector folds llvm-svn: 276475	2016-07-22 21:13:08 +00:00
Sanjay Patel	42b108b535	add tests for icmp vector folds llvm-svn: 276472	2016-07-22 21:02:33 +00:00
Vedant Kumar	fa48858cb0	[llvm-cov] Don't copy stylesheets into index files Just link in the stylesheet from the toplevel dir of the report. llvm-svn: 276468	2016-07-22 20:49:23 +00:00
Sanjay Patel	b927c04180	update to use FileCheck and auto-generate checks llvm-svn: 276466	2016-07-22 20:39:07 +00:00
Sanjay Patel	fe9aad543e	add tests for icmp vector folds llvm-svn: 276464	2016-07-22 20:11:08 +00:00
Tim Northover	e35b03e144	GlobalISel: implement legalization pass, with just one transformation. This adds the actual MachineLegalizeHelper to do the work and a trivial pass wrapper that legalizes all instructions in a MachineFunction. Currently the only transformation supported is splitting up a vector G_ADD into one acting on smaller vectors. llvm-svn: 276461	2016-07-22 20:03:43 +00:00
Teresa Johnson	6813bf2821	[ThinLTO/gold] Remove thin archive part of new test due to bot failures I am getting a bot failure from the thin archive part of this test: From http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/40468/steps/test_llvm/logs/LLVM%20%3A%3A%20tools__gold__X86__thinlto_emit_linked_objects.ll: Command Output (stderr): -- /home/bb/cmake-llvm-x86_64-linux/build/./bin/llvm-ar: creating /home/bb/cmake-llvm-x86_64-linux/build/test/tools/gold/X86/Output/thinlto_emit_linked_objects.ll.tmp2.a /usr/bin/ld.gold: internal error in add_writer, at ../../gold/token.h:124 -- This appears to be an issue with an older version of gold. The test case passes for me locally when I use the gold v1.12 I was testing with, but when I tried the gold installed on my system which is v1.11 I get the same error. Remove the thin archive version of the test, since there isn't a way to predicate it on gold version. llvm-svn: 276453	2016-07-22 18:32:30 +00:00
Jun Bum Lim	f7d4b99429	Recommit - [DSE]Enhance shorthening MemIntrinsic based on OverlapIntervals Recommiting r275571 after fixing crash reported in PR28270. Now we erase elements of IOL in deleteDeadInstruction(). Original Summary: This change use the overlap interval map built from partial overwrite tracking to perform shortening MemIntrinsics. Add test cases which was missing opportunities before. llvm-svn: 276452	2016-07-22 18:27:24 +00:00
Sanjay Patel	a2baf75a4a	add tests for vector bit manipulation intrinsics llvm-svn: 276451	2016-07-22 18:22:25 +00:00
Teresa Johnson	00cb55d094	[ThinLTO/gold] Support for getting list of included objects from gold Summary: In the distributed backend case, the ThinLink step and the final native object link are separate processes. This can be problematic when archive libraries are involved in the link (e.g. via --start-lib/--end-lib pairs). The linker only includes objects from libraries when there is a strong reference to them, and depending on the intervening ThinLTO backend processes' importing/inlining, the strong references may appear different in the two link steps. See D22356 and D22467 for two scenarios where this causes issues. To ensure that the final link includes the same objects, this patch adds support for an "=filename" form of the thinlto-index-only plugin option, in which case objects gold included in the link are emitted to the given filename. This should be used as input to the final link (e.g. via the @filename option to gold), instead of listing all the objects within --start-lib/--end-lib pairs again. Note that the support for the gold callback that identifies included objects was added in gold version 1.12. Reviewers: davidxl, mehdi_amini Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D22677 llvm-svn: 276450	2016-07-22 18:20:22 +00:00
Wei Mi	95c770abfc	[PM] Port BreakCriticalEdges to the new PM. Differential Revision: https://reviews.llvm.org/D22688 llvm-svn: 276449	2016-07-22 18:04:25 +00:00
Anna Thomas	6f5ce86e80	Invariant start/end intrinsics overloaded for address space Summary: The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this change, these intrinsics are overloaded for any adddress space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. Reviewers: apilipenko, reames Subscribers: llvm-commits llvm-svn: 276447	2016-07-22 17:49:40 +00:00
Matt Arsenault	9fbc4e603d	AMDGPU: Remove redundant test llvm-svn: 276439	2016-07-22 17:01:36 +00:00

1 2 3 4 5 ...

38243 Commits