llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Lang Hames	16cfb8aacc	[ORC] Assert that the query argument to VSO::lookup must be non-null. llvm-svn: 334723	2018-06-14 15:32:59 +00:00
Lang Hames	5928b3c18f	[ORC] Add a WaitUntilReady argument to blockingLookup. If WaitUntilReady is set to true then blockingLookup will return once all requested symbols are ready. If WaitUntilReady is set to false then blockingLookup will return as soon as all requested symbols have been resolved. In the latter case, if any error occurs in finalizing the symbols it will be reported to the ExecutionSession, rather than returned by blockingLookup. llvm-svn: 334722	2018-06-14 15:32:58 +00:00
Lang Hames	68858690e9	[ORC] Strip the Materializing flag off finalized symbols in VSOs. Finalized symbols are no longer in the materializing state. llvm-svn: 334721	2018-06-14 15:32:56 +00:00
Simon Dardis	fa9e150420	[docs] Update CompilerWriterInfo.rst for MIPS Update the URL of where the documentation can be found. llvm-svn: 334720	2018-06-14 15:16:37 +00:00
Simon Pilgrim	ede5da6b40	[EarlyCSE] Fix MSVC build. NFCI. MSVC doesn't let you assign different lambdas through a ternary operator. llvm-svn: 334715	2018-06-14 14:22:03 +00:00
Simon Pilgrim	3567ed310e	[CostModel][AArch64] Add cost tests for ALTERNATE/SELECT style shuffle masks Precursor to fixing a regression with SLP vectorizer for supporting SELECT shuffles (vs the current ALTERNATE) llvm-svn: 334714	2018-06-14 14:20:20 +00:00
Sam Clegg	d86fc826ae	[MC] Move MCAssembler::dump into the correct cpp file. NFC Differential Revision: https://reviews.llvm.org/D46556 llvm-svn: 334713	2018-06-14 14:04:23 +00:00
Paul Robinson	fc9b585c42	[DWARFv5] Tolerate files not all having an MD5 checksum. In some cases, for example when compiling a preprocessed file, the front-end is not able to provide an MD5 checksum for all files. When that happens, omit the MD5 checksums from the final DWARF, because DWARF doesn't have a way to indicate that some but not all files have a checksum. When assembling a .s file, and some but not all .file directives provide an MD5 checksum, issue a warning and don't emit MD5 into the DWARF. Fixes PR37623. Differential Revision: https://reviews.llvm.org/D48135 llvm-svn: 334710	2018-06-14 13:38:20 +00:00
Simon Dardis	ee8aa2671a	[mips] Correct predicates for MSA pseudo instructions llvm-svn: 334708	2018-06-14 13:03:53 +00:00
Max Kazantsev	a1bb2064f1	[EarlyCSE] Propagate conditions of AND and OR instructions This patches teaches EarlyCSE to figure out that if `and i1 %x, %y` is true then both `%x` and `%y` are true in the taken branch, and if `or i1 %x, %y` is false then both `%x` and `%y` are false in non-taken branch. Fix for PR37635. Differential Revision: https://reviews.llvm.org/D47574 Reviewed By: reames llvm-svn: 334707	2018-06-14 13:02:13 +00:00
Florian Hahn	bd074cfa10	[TableGen] Move some shared_ptrs to avoid unnecessary copies (NFC). Those changes were suggested post-commit for D47463. llvm-svn: 334706	2018-06-14 11:56:19 +00:00
Bjorn Pettersson	8b01057380	[DebugInfo] Check size of variable in ConvertDebugDeclareToDebugValue Summary: Do not convert a DbgDeclare to DbgValue if the store instruction only refer to a fragment of the variable described by the DbgDeclare. Problem was seen when for example having an alloca for an array or struct, and there were stores to individual elements. In the past we inserted a DbgValue intrinsics for each store, just as if the store wrote the whole variable. When handling store instructions we insert a DbgValue that indicates that the variable is "undefined", as we do not know which part of the variable that is updated by the store. When ConvertDebugDeclareToDebugValue is used with a load/phi instruction we assert that the referenced value is large enough to cover the whole variable. Afaict this should be true for all scenarios where those methods are used on trunk. If the assert blows in the future I guess we could simply skip to insert a dbg.value instruction. In the future I think we should examine which part of the variable that is accessed, and add a DbgValue instrinsic with an appropriate DW_OP_LLVM_fragment expression. Reviewers: dblaikie, aprantl, rnk Reviewed By: aprantl Subscribers: JDevlieghere, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D48024 llvm-svn: 334704	2018-06-14 11:23:42 +00:00
Simon Pilgrim	2890f72ad6	[SLPVectorizer] Remove RawInstructionsData/getMainOpcode and merge into getSameOpcode This is part of the work to cleanup use of 'alternate' ops so we can use the more general SK_Select shuffle type. Only getSameOpcode calls getMainOpcode and much of the logic is repeated in both functions. This will require some reworking of D28907 but that patch has hit trouble and is unlikely to be completed anytime soon. Differential Revision: https://reviews.llvm.org/D48120 llvm-svn: 334701	2018-06-14 10:25:19 +00:00
Simon Pilgrim	dc51dd555f	[CostModel] Cleanup isSingleSourceVectorMask to match other shuffle matchers. NFCI. llvm-svn: 334699	2018-06-14 09:48:19 +00:00
Simon Pilgrim	e3436b5ceb	[CostModel] Recognise REVERSE shuffle mask if the elements come from the second src llvm-svn: 334698	2018-06-14 09:35:00 +00:00
Clement Courbet	abd471f672	[llvm-exegesis] Use BenchmarkResult::Instructions instead of OpcodeName Summary: Get rid of OpcodeName. To remove the opcode name from an old file: ``` cat old_file \| sed '/opcode_name.*/d' ``` Reviewers: gchatelet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D48121 llvm-svn: 334691	2018-06-14 06:57:52 +00:00
Hiroshi Inoue	51b6fc883d	[NFC] fix trivial typos in comments llvm-svn: 334687	2018-06-14 05:41:49 +00:00
Craig Topper	ad3af3e8a7	[x86] fix mappings of cvttp2si/cvttp2ui x86 intrinsics to x86-specific nodes and isel patterns (PR37551) Summary: The tests in: https://bugs.llvm.org/show_bug.cgi?id=37751 ...show miscompiles because we wrongly mapped and folded x86-specific intrinsics into generic DAG nodes. This patch corrects the mappings in X86IntrinsicsInfo.h and adds isel matching corresponding to the new patterns. The complete tests for the failure cases should be in avx-cvttp2si.ll and sse-cvttp2si.ll and avx512-cvttp2i.ll Reviewers: RKSimon, gbedwell, spatel Reviewed By: spatel Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D47993 llvm-svn: 334685	2018-06-14 03:16:58 +00:00
Matt Davis	0594746649	[llvm-mca] Introduce the ExecuteStage (was originally the Scheduler class). Summary: This patch transforms the Scheduler class into the ExecuteStage. Most of the logic remains. Reviewers: andreadb, RKSimon, courbet Reviewed By: andreadb Subscribers: mgorny, javed.absar, tschuett, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47246 llvm-svn: 334679	2018-06-14 01:20:18 +00:00
Tom Stellard	bec61a1866	AMDGPU/GlobalISel: Implement select() for 32-bit G_FADD and G_FMUL Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46171 llvm-svn: 334665	2018-06-13 22:30:47 +00:00
Zachary Turner	4453f047a5	Revert "Enable ThreadPool to queue tasks that return values." This is failing to compile when LLVM_ENABLE_THREADS is false, and the fix is not immediately obvious, so reverting while I look into it. llvm-svn: 334658	2018-06-13 21:24:19 +00:00
Francis Visoiu Mistrih	b200370ed3	Reland: [Timers] Use the pass argument name for JSON keys in time-passes When using clang --save-stats -mllvm -time-passes, both timers and stats end up in the same json file. We could end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.Virtual Register Map.wall": 2.9015541076660156e-04, "time.pass.Virtual Register Map.user": 2.0500000000000379e-04, "time.pass.Virtual Register Map.sys": 8.5000000000001741e-05, } This patch makes use of the pass argument name (if available) in the JSON key to end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.virtregmap.wall": 2.9015541076660156e-04, "time.pass.virtregmap.user": 2.0500000000000379e-04, "time.pass.virtregmap.sys": 8.5000000000001741e-05, } This also helps avoiding to write another JSON printer to handle all the cases that we could have in our pass names. Fixed test instead of adding a new one originally from r334649. Differential Revision: https://reviews.llvm.org/D48109 llvm-svn: 334657	2018-06-13 21:03:56 +00:00
Florian Hahn	34430c5d61	[TableGen] Make getOnlyTree return a const ref (NFC) This avoids some unnecessary copies of shared_ptrs. Those changes were suggested post-commit for D47463. llvm-svn: 334656	2018-06-13 20:59:53 +00:00
George Karpenkov	d748d090c7	Update comments of CheckedArithmetic API based on Philip Reames feedback. llvm-svn: 334655	2018-06-13 20:48:53 +00:00
Reid Kleckner	45e3558bd2	[WinASan] Don't instrument globals in sections containing '$' Such globals are very likely to be part of a sorted section array, such the .CRT sections used for dynamic initialization. The uses its own sorted sections called ATL$__a, ATL$__m, and ATL$__z. Instead of special casing them, just look for the dollar sign, which is what invokes linker section sorting for COFF. Avoids issues with ASan and the ATL uncovered after we started instrumenting comdat globals on COFF. llvm-svn: 334653	2018-06-13 20:47:21 +00:00
Francis Visoiu Mistrih	11b5edffb3	Revert r334649 "[Timers] Use the pass argument name for JSON keys in time-passes" This reverts commit r334649. This breaks a test. llvm-svn: 334651	2018-06-13 20:44:02 +00:00
Francis Visoiu Mistrih	b88ba4150b	[Timers] Use the pass argument name for JSON keys in time-passes When using clang --save-stats -mllvm -time-passes, both timers and stats end up in the same json file. We could end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.Virtual Register Map.wall": 2.9015541076660156e-04, "time.pass.Virtual Register Map.user": 2.0500000000000379e-04, "time.pass.Virtual Register Map.sys": 8.5000000000001741e-05, } This patch makes use of the pass argument name (if available) in the JSON key to end up with things like: { "asm-printer.EmittedInsts": 1, "time.pass.virtregmap.wall": 2.9015541076660156e-04, "time.pass.virtregmap.user": 2.0500000000000379e-04, "time.pass.virtregmap.sys": 8.5000000000001741e-05, } This also helps avoiding to write another JSON printer to handle all the cases that we could have in our pass names. Differential Revision: https://reviews.llvm.org/D48109 llvm-svn: 334649	2018-06-13 20:09:59 +00:00
Craig Topper	726af1b5f7	[X86] Move RCPSSr_Int, RSQRTSSr_Int, SQRTSDr_Int, SQRTSSr_Int to the correct load folding table. They were in the operand 1 folding table, but their foldable operand is operand 2. llvm-svn: 334648	2018-06-13 20:03:42 +00:00
Zachary Turner	9f6c09accc	Add missing #include. llvm-svn: 334644	2018-06-13 19:37:41 +00:00
Zachary Turner	1f1b9ca909	Enable ThreadPool to support tasks that return values. Previously ThreadPool could only queue async "jobs", i.e. work that was done for its side effects and not for its result. It's useful occasionally to queue async work that returns a value. From an API perspective, this is very intuitive. The previous API just returned a shared_future<void>, so all we need to do is make it return a shared_future<T>, where T is the type of value that the operation returns. Making this work required a little magic, but ultimately it's not too bad. Instead of keeping a shared queue<packaged_task<void()>> we just keep a shared queue<unique_ptr<TaskBase>>, where TaskBase is a class with a pure virtual execute() method, then have a templated derived class that stores a packaged_task<T()>. Everything else works out pretty cleanly. Differential Revision: https://reviews.llvm.org/D48115 llvm-svn: 334643	2018-06-13 19:29:16 +00:00
Stanislav Mekhanoshin	3138aeda82	[AMDGPU] Corrected computeKnownBits for V_PERM_B32 Differential Revision: https://reviews.llvm.org/D48133 llvm-svn: 334640	2018-06-13 18:52:54 +00:00
George Karpenkov	0a576bc0e3	Add checkMulAdd helper function to CheckedArithmetic Multiplication followed by addition (https://en.wikipedia.org/wiki/Multiply–accumulate_operation) is a sufficiently common use-case to warrant a separate helper. Differential Revision: https://reviews.llvm.org/D48138 llvm-svn: 334635	2018-06-13 18:32:02 +00:00
George Karpenkov	8af3008c20	Change checked arithmetic functions API to return Optional Returning optional is much safer. The previous API had potential to cause use of undefined variables, if the value passed by pointer was accidentally read afterwards. Differential Revision: https://reviews.llvm.org/D48137 llvm-svn: 334634	2018-06-13 18:31:43 +00:00
Andrea Di Biagio	7cc34df875	[llvm-mca] Fixed a bug in the logic that checks if a memory operation is ready to execute. Fixes PR37790. In some (very rare) cases, the LSUnit (Load/Store unit) was wrongly marking a load (or store) as "ready to execute" effectively bypassing older memory barrier instructions. To reproduce this bug, the memory barrier must be the first instruction in the input assembly sequence, and it doesn't have to perform any register writes. llvm-svn: 334633	2018-06-13 18:30:14 +00:00
Jordan Rose	04df7e8cda	[CMake] Handle 'libtool' being at a path with spaces in it. This can happen on macOS if the user's Xcode is at a path with spaces in it. llvm-svn: 334632	2018-06-13 18:21:47 +00:00
Peter Collingbourne	b08ed8f952	LTO: Keep file handles open for memory mapped files. On Windows we've observed that if you open a file, write to it, map it into memory and close the file handle, the contents of the memory mapping can sometimes be incorrect. That was what we did when adding an entry to the ThinLTO cache using the TempFile and MemoryBuffer classes, and it was causing intermittent build failures on Chromium's ThinLTO bots on Windows. More details are in the associated Chromium bug (crbug.com/786127). We can prevent this from happening by keeping a handle to the file open while the mapping is active. So this patch changes the mapped_file_region class to duplicate the file handle when mapping the file and close it upon unmapping it. One gotcha is that the file handle that we keep open must not have been created with FILE_FLAG_DELETE_ON_CLOSE, as otherwise the operating system will prevent other processes from opening the file. We can achieve this by avoiding the use of FILE_FLAG_DELETE_ON_CLOSE altogether. Instead, we use SetFileInformationByHandle with FileDispositionInfo to manage the delete-on-close bit. This lets us remove the hack that we used to use to clear the delete-on-close bit on a file opened with FILE_FLAG_DELETE_ON_CLOSE. A downside of using SetFileInformationByHandle/FileDispositionInfo as opposed to FILE_FLAG_DELETE_ON_CLOSE is that it prevents us from using CreateFile to open the file while the flag is set, even within the same process. This doesn't seem to matter for almost every client of TempFile, except for LockFileManager, which calls sys::fs::create_link to create a hard link from the lock file, and in the process of doing so tries to open the file. To prevent this change from breaking LockFileManager I changed it to stop using TempFile by effectively reverting r318550. Differential Revision: https://reviews.llvm.org/D48051 llvm-svn: 334630	2018-06-13 18:03:14 +00:00
Craig Topper	c64d2b3586	[X86] Add one more intrinsic and test cases to avx512-cvttp2i.ll. spatel noticed it was missing in D47993. llvm-svn: 334629	2018-06-13 17:55:13 +00:00
Saleem Abdulrasool	ee3f16f91b	IR: fix documentation markup Use `\brief` instead of `\Brief`. NFC. llvm-svn: 334627	2018-06-13 17:51:27 +00:00
Yaxun Liu	166d7e51aa	[AMDGPU] Change enqueue kernel handle type Currently the handle type is a global pointer which holds 8 bytes. We need a larger type which hold 16 bytes, therefore change it to [i64 x 2]. Differential Revision: https://reviews.llvm.org/D48094 llvm-svn: 334625	2018-06-13 17:31:51 +00:00
Simon Pilgrim	98546e502b	[CostModel][X86] Test showing failure to recognise REVERSE shuffle mask if the elements come from the second src llvm-svn: 334623	2018-06-13 17:12:11 +00:00
Dmitry Preobrazhensky	f3bc07c9d5	[AMDGPU][MC] Enabled parsing of relocations on VALU instructions See bug 37566: https://bugs.llvm.org/show_bug.cgi?id=37566 Reviewers: artem.tamazov, arsenm, nhaehnle Differential Revision: https://reviews.llvm.org/D47884 llvm-svn: 334622	2018-06-13 17:02:03 +00:00
Simon Pilgrim	21d47e6a7f	[CostModel] Recognise BROADCAST shuffle mask if the elements come from the second src llvm-svn: 334620	2018-06-13 16:52:02 +00:00
Andrea Di Biagio	b297f1e738	Revert: [llvm-mca] Flush the output stream before we start the analysis of a new code region. NFC Not sure why, but it breaks buildbot clang-cmake-armv8-full. It causes a failure in TEST 'Xray-armhf-linux :: TestCases/Posix/profiling-single-threaded.cc'. llvm-svn: 334617	2018-06-13 16:33:52 +00:00
Simon Pilgrim	39a11cf87c	[CostModel][X86] Test showing failure to recognise BROADCAST shuffle mask if the elements come from the second src llvm-svn: 334616	2018-06-13 16:33:42 +00:00
Andrea Di Biagio	24185018fb	[llvm-mca] Flush the output stream before we start the analysis of a new code region. NFC llvm-svn: 334610	2018-06-13 15:43:56 +00:00
Dmitry Preobrazhensky	224cb6c72d	[AMDGPU][MC][GFX8][GFX9] Allow LDS direct reads for BUFFER_LOAD_DWORDX2/X3/X4 See bug 37653: https://bugs.llvm.org/show_bug.cgi?id=37653 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D47885 llvm-svn: 334609	2018-06-13 15:32:46 +00:00
Sanjay Patel	59eabed7b6	[DAGCombiner] remove hasOneUse() check from fadd constants transform We're constant folding here, so we shouldn't check uses. This matches the IR optimizer behavior. The x86 test shows the expected win. The AArch64 test shows something else. This only seems to happen if the "generic" AArch64 CPU model is used by MachineCombiner, so I'll file a bug report to follow-up. llvm-svn: 334608	2018-06-13 15:22:48 +00:00
Tom Stellard	968b81b26b	AMDGPU: Move isSDNodeSourceOfDivergence() implementation to SITargetLowering Summary: The code that handles ISD:Register and ISD::CopyFromReg assumes the target is amdgcn, so this is broken on r600. We don't need this analysis on r600 anyway so we can safely move it to SITargetLowering. Reviewers: alex-t, arsenm, nhaehnle Reviewed By: arsenm Subscribers: msearles, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46298 llvm-svn: 334607	2018-06-13 15:06:37 +00:00
Sanjay Patel	aac81f364a	[x86] add test for fadd with more than one use; NFC The equivalent AArch64 test added at rL334556 isn't showing the expected output from the DAGCombiner code change that would fix this example. That's a machine combiner bug from what I see. llvm-svn: 334605	2018-06-13 15:01:07 +00:00
Cameron McInally	4fffc097ce	[FPEnv] Expand constrained FP operations Add a helper function to expand constrained FP operations as needed. Note that the Strict POWI operation is not handled in this patch since the format is slightly different from the others. Differential Revision: https://reviews.llvm.org/D47491 llvm-svn: 334603	2018-06-13 14:32:12 +00:00

... 3 4 5 6 7 ...

165509 Commits