llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Craig Topper	1e3a243a15	[X86] Add test case to show failure to promote i16 subtract when the LHS is a load and the result is stored to a different address. We mistakenly believe we might be able to fold this as a RMW operation, but that doesn't end up happening. llvm-svn: 328929	2018-04-01 06:29:27 +00:00
Craig Topper	35a599747e	[X86] Allow i16 subtracts to be promoted if the load is on the LHS and its not being stored. llvm-svn: 328928	2018-04-01 06:29:25 +00:00
Craig Topper	4f8f0b27b1	[X86] Add test case to show failure to promote i16 subtract because we mistakenly believe the load can be folded. NFC The left hand side of the subtract is a load, but we cna't fold those unless we also have a store. llvm-svn: 328927	2018-04-01 06:29:23 +00:00
Craig Topper	aa0e70c281	[X86] Remove unneeded temporary variable. NFC This Promote flag was alwasys set to true except in the default case. But in the default case we don't need to set PVT and can just return false. llvm-svn: 328926	2018-04-01 06:29:21 +00:00
Mandeep Singh Grang	fe0ec8aeab	[Analysis] Change std::sort to llvm::sort in response to r327219 Summary: r327219 added wrappers to std::sort which randomly shuffle the container before sorting. This will help in uncovering non-determinism caused due to undefined sorting order of objects having the same key. To make use of that infrastructure we need to invoke llvm::sort instead of std::sort. Note: This patch is one of a series of patches to replace all std::sort to llvm::sort. Refer D44363 for a list of all the required patches. Reviewers: sanjoy, dexonsmith, hfinkel, RKSimon Reviewed By: dexonsmith Subscribers: david2050, llvm-commits Differential Revision: https://reviews.llvm.org/D44944 llvm-svn: 328925	2018-04-01 01:46:51 +00:00
Sanjay Patel	559aa51573	[DAGCombine] (float)((int) f) --> ftrunc (PR36617) fptosi / fptoui round towards zero, and that's the same behavior as ISD::FTRUNC, so replace a pair of casts with the equivalent node. We don't have to account for special cases (NaN, INF) because out-of-range casts are undefined. Differential Revision: https://reviews.llvm.org/D44909 llvm-svn: 328921	2018-03-31 17:55:44 +00:00
Lang Hames	06147e9592	[llvm-rtdyld] Fix the InputFileList cl::opt description: it accepts multiple input files. llvm-svn: 328920	2018-03-31 16:01:01 +00:00
Simon Pilgrim	2c66585426	[X86][Btver2] Add MMX_PSHUFB to the JWritePSHUFB InstRW entries llvm-svn: 328918	2018-03-31 09:15:54 +00:00
Simon Pilgrim	8db72856f9	Fix trailing whitespace. NFCI. llvm-svn: 328917	2018-03-31 09:14:14 +00:00
Benjamin Kramer	024c9be02b	Unbreak the build of the go bindings after r328839. llvm-svn: 328916	2018-03-31 07:41:25 +00:00
Puyan Lotfi	b93beb070b	[MIR-Canon] Adding support for local idempotent instruction hoisting. llvm-svn: 328915	2018-03-31 05:48:51 +00:00
Craig Topper	08af09d478	[X86] Add SchedRW for PMULLD Summary: It seems many CPUs don't implement this instruction as well as the other vector multiplies. Often using a multi uop flow. Silvermont in particular has a 7 uop flow with 11 cycle throughput. Sandy Bridge implements it as a single uop with 5 cycle latency and 1 cycle throughput. But Haswell and later use 2 uops with 10 cycle latency and 2 cycle throughput. This patch adds a new X86SchedWritePair we can use to tag this instruction separately. I've provided correct information for Silvermont, Btver2, and Sandy Bridge. I've removed the InstRWs for SandyBridge. I've left Haswell/Broadwell/Skylake InstRWs in place because I wasn't sure how to account for the different load latency between 128 and 256 bits. I also left Znver1 InstRWs in place because the existing values don't match Agner's spreadsheet. I also left a FIXME in the SandyBridge model because it being used for the "generic" model is too optimistic for the 256/512-bit versions since those are multiple uops on all known CPUs. Reviewers: RKSimon, GGanesh, courbet Reviewed By: RKSimon Subscribers: gchatelet, gbedwell, andreadb, llvm-commits Differential Revision: https://reviews.llvm.org/D44972 llvm-svn: 328914	2018-03-31 04:54:32 +00:00
Teresa Johnson	e23f2d8bdb	[ThinLTO] Add an option to force summary call edges cold for debugging Summary: Useful to selectively disable importing into specific modules for debugging/triaging/workarounds. Reviewers: eraman Subscribers: inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D45062 llvm-svn: 328909	2018-03-31 00:18:08 +00:00
Fangrui Song	cd23b12dcc	Fix a bunch of typoes. NFC llvm-svn: 328907	2018-03-30 22:22:31 +00:00
Ekaterina Romanova	2f41ac8718	Prevent data races in concurrent ThinLTO processes. Make sure ThinLTO with caching doesn't use non-atomic writes to the cache file (to prevent data races and cache files corruption). 1. Place temp file to the same place where the caching directory is (instead of creating it the directory pointed to by TMP/TEMP variable). This will help to prevent using non-atomic rename and falling back to non-atomic "direct" write to the cache file. 2. if rename failed do not write to the cache file directly (direct write to the file is non-atomic and could cause data race conditions). 3. if cache file doesn't exist (e.g., because 'rename' failed or because some other reasons), bypass using the cache altogether. Differential Revision: https://reviews.llvm.org/D45076 llvm-svn: 328904	2018-03-30 21:35:42 +00:00
Jacob Gravelle	39f4890682	[WebAssembly] Register wasm passes with the PassRegistry Summary: This exposes WebAssembly passes for use on the command line (as arguments to -print-before and the like). Reviewers: dschuff, sunfish Subscribers: MatzeB, jfb, sbc100, llvm-commits, aheejin Differential Revision: https://reviews.llvm.org/D45103 llvm-svn: 328901	2018-03-30 20:36:58 +00:00
Krzysztof Parzyszek	df9bb5dabc	[Hexagon] Fix testcase llvm-svn: 328899	2018-03-30 19:46:28 +00:00
Krzysztof Parzyszek	13724f3334	[Hexagon] Reduce excessive indentation in .s output llvm-svn: 328898	2018-03-30 19:30:28 +00:00
Krzysztof Parzyszek	8e14c7cb4e	[Hexagon] Avoid creating invalid offsets in packetizer Two memory instructions with a dependency only on the address register between the two (the first one of them being post-incrememnt) can be packetized together after the offset on the second was updated to the incremement value. Make sure that the new offset is valid for the instruction. llvm-svn: 328897	2018-03-30 19:28:37 +00:00
Andrea Di Biagio	48fd4afb0f	[X86][BtVer2] Fixed the number of micro opcodes for AVX vector converts and VSQRT instructions. There were still a few AVX instructions with an incorrect number of opcodes. These should be fixed now. llvm-svn: 328892	2018-03-30 18:53:47 +00:00
Peter Collingbourne	b46b507da2	DataFlowSanitizer: wrappers of functions with local linkage should have the same linkage as the function being wrapped This patch resolves link errors when the address of a static function is taken, and that function is uninstrumented by DFSan. This change resolves bug 36314. Patch by Sam Kerner! Differential Revision: https://reviews.llvm.org/D44784 llvm-svn: 328890	2018-03-30 18:37:55 +00:00
Puyan Lotfi	262cfa2993	[MIR] Adding support for Named Virtual Registers in MIR. llvm-svn: 328887	2018-03-30 18:15:54 +00:00
Andrea Di Biagio	fda2de332f	[X86][BtVer2] Fix the number of uOps for horizontal operations. llvm-svn: 328886	2018-03-30 18:15:30 +00:00
Tim Shen	fa134803cc	[NVPTX] Enable StructuredCFG for NVPTX Summary: Make NVPTX require structured CFG. Added a temporary flag to "roll back" the behavior for easy deployment. Combined with D45008, this fixes several internal Nvidia GPU test failures that we suspect to be ptxas miscompiles (PR27738). Reviewers: jlebar Subscribers: jholewinski, sanjoy, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D45070 llvm-svn: 328885	2018-03-30 17:51:03 +00:00
Tim Shen	5321f45406	[BlockPlacement] Disable block placement tail duplciation in structured CFG. Summary: Tail duplication easily breaks the structure of CFG, e.g. duplicating on a region entry. If the structure is intended to be preserved, then we may want to configure tail duplication, or disable it for structured CFG. From our benchmark results disabling it doesn't cause performance regression. Notice that this currently affects AMDGPU backend. In the next patch, I also plan to turn on requiresStructuredCFG for NVPTX. All unit tests still pass. Reviewers: jlebar, arsenm Subscribers: jholewinski, sanjoy, wdng, tpr, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D45008 llvm-svn: 328884	2018-03-30 17:51:00 +00:00
Robert Widmann	76c792ee28	[LLVM-C] Finish exception instruction bindings - Round 2 Summary: Previous revision caused a leak in the echo test that got caught by the ASAN bots because of missing free of the handlers array and was reverted in r328759. Resubmitting the patch with that correction. Add support for cleanupret, catchret, catchpad, cleanuppad and catchswitch and their associated accessors. Test is modified from SimplifyCFG because it contains many diverse usages of these instructions. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits, vlad.tsyrklevich Differential Revision: https://reviews.llvm.org/D45100 llvm-svn: 328883	2018-03-30 17:49:53 +00:00
Zachary Turner	040818cf1c	Fix some signed / unsigned conversion problems. llvm-svn: 328881	2018-03-30 17:28:35 +00:00
Zachary Turner	d9045e6999	[llvm-pdbutil] Dig deeper into the PDB and DBI streams when explaining. This will show more detail when using `llvm-pdbutil explain` on an offset in the DBI or PDB streams. Specifically, it will dig into individual header fields and substreams to give a more precise description of what the byte represents. llvm-svn: 328878	2018-03-30 17:16:50 +00:00
Derek Schuff	cecc270486	[WebAssembly] Refactor tablegen for store instructions (NFC) Summary: Add patterns similar to loads. Differential Revision: https://reviews.llvm.org/D45064 llvm-svn: 328876	2018-03-30 17:02:50 +00:00
Krzysztof Parzyszek	181b5e8164	Revert "peel loops with runtime small trip counts" This reverts commit r328854, it breaks some Hexagon tests. llvm-svn: 328875	2018-03-30 16:55:44 +00:00
Stanislav Mekhanoshin	2d1635e9be	[AMDGPU] Fixed some instructions latencies Differential Revision: https://reviews.llvm.org/D45073 llvm-svn: 328874	2018-03-30 16:19:13 +00:00
Sanjay Patel	b36dd6007b	[SelectionDAG] Removing FABS folding from DAGCombiner The code has bugs dealing with -0.0. Since D44550 introduced FABS pattern folding in InstCombine, this patch removes the now-redundant code that causes https://bugs.llvm.org/show_bug.cgi?id=36600. Patch by Mikhail Dvoretckii! Differential Revision: https://reviews.llvm.org/D44683 llvm-svn: 328872	2018-03-30 15:42:52 +00:00
Krzysztof Parzyszek	38f471d0d5	[Hexagon] Recognize and handle :endloop01 llvm-svn: 328870	2018-03-30 15:29:47 +00:00
Krzysztof Parzyszek	62f3aefde3	[Hexagon] Fix printing :mem_noshuf on compiler-generated packets llvm-svn: 328869	2018-03-30 15:09:05 +00:00
Krzysztof Parzyszek	616f663345	[Hexagon] Fix flags for store-related intrinsics llvm-svn: 328868	2018-03-30 14:57:01 +00:00
Andrea Di Biagio	f6df479dce	[X86][BtVer2] Add missing ReadAfterLd to RM variants of AVX horizontal adds and most vector logic instructions. Fixed a few InstRW that forgot to specify a ReadAfterLd for the register input operand. llvm-svn: 328867	2018-03-30 14:48:08 +00:00
Krzysztof Parzyszek	9cfcd4e292	[Hexagon] Remove unused scheduling classes llvm-svn: 328866	2018-03-30 14:34:32 +00:00
Andrea Di Biagio	0279640faa	[X86][BtVer2] Add tests that show how ReadAfterLd is missing for some instructions. In the Btver2 model, there are a few InstRW overrides that don't specify a ReadAfterLd for the register input operand. As a result, a few AVX variants of horizontal operations and most vector logic operations with a folded memory operand don't have a ReadAdvance info associated to their input register operands. llvm-svn: 328865	2018-03-30 14:29:33 +00:00
Krzysztof Parzyszek	3ed598fcbe	[Hexagon] Pass pointer to SelectionDAG to dump functions llvm-svn: 328864	2018-03-30 14:29:15 +00:00
Andrea Di Biagio	cc5850e51f	[X86] Add llvm-mca tests for r328834. Verify that the ReadAfterLd is correctly applied to FMA and 4-ops variable blend instructions. As Craig pointed out in D44726, some Intel models still have to be fixed. llvm-svn: 328861	2018-03-30 13:38:37 +00:00
Andrea Di Biagio	10ec4daa38	[X86] Add tests to verify the presence of "ReadAfterLd" after r328823. This change adds a couple of tests to verify the change introduced by revision 328823 ([X86] Correct the placement of ReadAfterLd in BEXTR and BZHI). llvm-svn: 328859	2018-03-30 11:44:48 +00:00
Vlad Tsyrklevich	9736436bca	Revert "[LLVM-C] Finish exception instruction bindings" This reverts commit r328759. It was causing LSan failures on sanitizer-x86_64-linux-bootstrap llvm-svn: 328858	2018-03-30 06:21:28 +00:00
Michael Bedy	a24a5d9000	[AMDGPU] Fix the SDWA Peephole phase to handle src for dst:UNUSED_PRESERVE. Summary: The phase attempts to transform operations that extract a portion of a value into an SDWA src operand in cases where that value is used only once. It was not prepared for this use to be the preserved portion of a value for dst:UNUSED_PRESERVE, resulting in a crash or assert. This change either rejects the illegal SDWA attempt, or in the case where dst:WORD_1 and the src_sel would be WORD_0, removes the unneeded extract instruction. Reviewers: arsenm, #amdgpu Reviewed By: arsenm, #amdgpu Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D44364 llvm-svn: 328856	2018-03-30 05:03:36 +00:00
Ikhlas Ajbar	b9b12d73b4	[Hexagon] add missing lit config file llvm-svn: 328855	2018-03-30 03:32:24 +00:00
Ikhlas Ajbar	908875cd9d	peel loops with runtime small trip counts For Hexagon, peeling loops with small runtime trip count is beneficial for our benchmarks. We set PeelCount in HexagonTargetInfo.cpp and we use PeelCount set by the target for computing the desired peel count. Differential Revision: https://reviews.llvm.org/D44880 llvm-svn: 328854	2018-03-30 03:05:34 +00:00
Eli Friedman	3c02b623fd	[MachineCopyPropagation] Handle COPY with overlapping source/dest. MachineCopyPropagation::CopyPropagateBlock has a bunch of special handling for COPY instructions. This handling assumes that COPY instructions do not modify the source of the copy; this is wrong if the COPY destination overlaps the source. To fix the bug, check explicitly for this situation, and fall back to the generic instruction handling. This bug can't happen for most register classes because they don't have this sort of overlap, but there are a few register classes where this is possible. The testcase uses the AArch64 QQQQ register class. Differential Revision: https://reviews.llvm.org/D44911 llvm-svn: 328851	2018-03-30 00:56:03 +00:00
Eugene Zelenko	6d05e6705c	[IR] Fix some Clang-tidy modernize-use-auto warnings; other minor fixes (NFC). llvm-svn: 328850	2018-03-30 00:47:31 +00:00
Rafael Espindola	1fe0ee8df0	Style update. NFC. Rename 3 functions to start with lowercase letters. Don't repeat the name in the comments. llvm-svn: 328848	2018-03-29 23:32:54 +00:00
David Blaikie	18c392aa8f	Fix some layering in StripNonLineTableDebugInfo, moving its declaration from IPO.h to Utils.h to match its implementation llvm-svn: 328844	2018-03-29 22:42:08 +00:00
David Blaikie	7ef4aaea10	Remove unused header to fix layering. llvm-svn: 328842	2018-03-29 22:35:59 +00:00

1 2 3 4 5 ...

162246 Commits