llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Sanjay Patel	7662be63cf	[InstCombine] add tests and comments for fdiv X, C; NFC llvm-svn: 325161	2018-02-14 19:54:51 +00:00
Rafael Espindola	710a5e861b	Pass a module reference to CloneModule. It can never be null and most callers were already using references or std::unique_ptr. llvm-svn: 325160	2018-02-14 19:50:40 +00:00
Rafael Espindola	1202040fcc	Update examples for API change. NFC. llvm-svn: 325157	2018-02-14 19:23:27 +00:00
Rafael Espindola	7713c013a2	Pass a reference to a module to the bitcode writer. This simplifies most callers as they are already using references or std::unique_ptr. llvm-svn: 325155	2018-02-14 19:11:32 +00:00
Craig Topper	2f7995c340	[RegisterClassInfo] Invalidate the register pressure set limit cache when reserved regs or callee saved regs change Previously we only invalidated the pressure set limit cache when the TargetRegisterInfo pointer changed. But as reserved regs and callee saved regs are used as part of calculating the limits we should invalidate when those change too. I encountered this when reverting a patch from the 6.0 branch. One of the x86 test files had a function that used rbp as a frame pointer, making it reserved. It was followed by another function which didn't use rbp but had the same TRI so the pressure set limit cache was not invalidated. If I removed the function that used rbp as a frame pointer from the file, the remaining function then got a different register pressure limit for the GR16 pressure set. This caused the machine scheduler to change the scheduling for the function. This was an unexpected change from just deleting a function. I don't have a test case for trunk because the particular x86 test case is different enough from the 6.0 branch to not be affected now. Differential Revision: https://reviews.llvm.org/D43274 llvm-svn: 325153	2018-02-14 18:53:29 +00:00
David Green	ad8913af27	Move llvm::computeLoopSafetyInfo from LICM.cpp to LoopUtils.cpp. NFC Move computeLoopSafetyInfo, defined in Transforms/Utils/LoopUtils.h, into the corresponding LoopUtils.cpp, as opposed to LICM where it resides at the moment. This will allow other functions from Transforms/Utils to reference it. llvm-svn: 325151	2018-02-14 18:34:53 +00:00
Simon Pilgrim	0d2a33eaff	[X86][SSE] truncateVectorWithPACK - Use src type instead of dst to select between PACKSDW/PACKSWB Try to keep PACKSDW/PACKSWB as wide as possible, this helps ComputeNumSignBits as it can only peek through bitcasts to wider types, pre-AVX2 codegen was already doing this as it could peek through bitcasts/subvectors more easily than AVX2 could through shuffles. This shouldn't affect existing results as calls to truncateVectorWithPACK ensure we have enough sign bits to pack to the same value, but it should make it possible to use truncateVectorWithPACK chains to perform saturation in combineTruncateWithSat with a future patch. llvm-svn: 325149	2018-02-14 18:23:58 +00:00
Craig Topper	2e34d16069	[InstCombine] Don't fold select(C, Z, binop(select(C, X, Y), W)) -> select(C, Z, binop(Y, W)) if the binop is rem or div. The select may have been preventing a division by zero or INT_MIN/-1 so removing it might not be safe. Fixes PR36362. Differential Revision: https://reviews.llvm.org/D43276 llvm-svn: 325148	2018-02-14 18:08:33 +00:00
Stanislav Mekhanoshin	5f83290ecf	[AMDGPU] Remove non-temporal flag from argument loads Kernel arguments likely read by all workitems and should not bypass cache. Fixes performance hit in sub-dword argument loads. Differential Revision: https://reviews.llvm.org/D43249 llvm-svn: 325146	2018-02-14 18:05:14 +00:00
Sanjay Patel	8c5ba41434	[InstCombine] regenerate checks; NFC llvm-svn: 325144	2018-02-14 17:37:32 +00:00
Paul Robinson	222ff14ae7	[DWARF] Fix incorrect prologue end line record. The prologue-end line record must be emitted after the last instruction that is part of the function frame setup code and before the instruction that marks the beginning of the function body. Patch by Carlos Alberto Enciso! Differential Revision: https://reviews.llvm.org/D41762 llvm-svn: 325143	2018-02-14 17:35:52 +00:00
Sanjay Patel	5547176d2d	[InstCombine] simplify isFMulOrFDivWithConstant(); NFCI llvm-svn: 325142	2018-02-14 17:16:33 +00:00
Sanjay Patel	2b94e8c2c3	[InstCombine] replace isa/cast with dyn_cast; NFC llvm-svn: 325141	2018-02-14 16:56:44 +00:00
Sanjay Patel	f8ec782c8c	[InstCombine] refactor folds for mul with negated operands; NFCI This keeps with our current usage of 'match' and is easier to see that the optional NSW only applies in the non-constant operand case. llvm-svn: 325140	2018-02-14 16:50:55 +00:00
Rafael Espindola	848a52f758	Store defined macros in MCContext. So that macros defined in inline assembly blocks are available to the whole file. This provides a consistent behavior with other assembly directives, since equations for example are already preserved between inline assembly blocks. PR: 36110 Patch by Roger! llvm-svn: 325139	2018-02-14 16:34:27 +00:00
Sanjay Patel	f030049c69	[x86] add baseline vector compare tests for D42948; NFC llvm-svn: 325138	2018-02-14 16:15:15 +00:00
Alexander Ivchenko	833b4ee927	[SelectionDAG][X86] Fix incorrect offset generated for VMASKMOV When creating high MachineMemOperand for MSTORE/MLOAD we supply it with the original PointerInfo, while the pointer itself had been incremented. The patch adds the proper offset to the PointerInfo. llvm-svn: 325135	2018-02-14 15:55:24 +00:00
Alexey Bataev	81e6c94efb	[SLP] Allow vectorization of reversed loads. Summary: Reversed loads are handled as gathering. But we can just reshuffle these values. Patch adds support for vectorization of reversed loads. Reviewers: RKSimon, spatel, mkuper, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43022 llvm-svn: 325134	2018-02-14 15:29:15 +00:00
Sjoerd Meijer	1eab012a78	[ARM] f16 stack spill/reloads This adds support for handling f16 stack spills/reloads. Differential Revision: https://reviews.llvm.org/D43280 llvm-svn: 325130	2018-02-14 15:09:09 +00:00
Simon Pilgrim	0438b3f961	Fix GCC -Wlogical-op-parentheses warning. NFCI. llvm-svn: 325129	2018-02-14 15:07:36 +00:00
Lama Saba	cc71753981	[X86] Reduce Store Forward Block issues in HW - Recommit after fixing Bug 36346 If a load follows a store and reloads data that the store has written to memory, Intel microarchitectures can in many cases forward the data directly from the store to the load. This "store forwarding" saves cycles by enabling the load to directly obtain the data instead of accessing the data from cache or memory. A "store forward block" occurs when a store cannot be forwarded to the load. The most typical case of a store forward block on the Intel Core microarchitecture is that a small store cannot be forwarded to a large load. The estimated penalty for a store forward block is ~13 cycles. This pass tries to recognize and handle cases where a "store forward block" is created by the compiler when lowering memcpy calls to a sequence of a load and a store. The pass currently only handles cases where memcpy is lowered to XMM/YMM registers; it tries to break the memcpy into smaller copies. Breaking the memcpy should be possible since there is no atomicity guarantee for loads and stores to XMM/YMM. Change-Id: Ic41aa9ade6512e0478db66e07e2fde41b4fb35f9 llvm-svn: 325128	2018-02-14 14:58:53 +00:00
Simon Pilgrim	ba2f6e6405	[X86][SSE] Relax type legality for combineTruncateWithSat PACKSS/PACKUS truncation While the AVX512 VTRUNCS/VTRUNCUS instructions require legal types, truncateVectorWithPACK handles cases with multiples of legal types through splitting/concatenation. So we just need to ensure that the src/dst scalar types are correct and leave truncateVectorWithPACK to handle the rest of it. llvm-svn: 325127	2018-02-14 14:14:29 +00:00
Florian Hahn	6252c9b891	Recommit r325001: [CallSiteSplitting] Support splitting of blocks with instrs before call. For basic blocks with instructions between the beginning of the block and a call we have to duplicate the instructions before the call in all split blocks and add PHI nodes for uses of the duplicated instructions after the call. Currently, the threshold for the number of instructions before a call is quite low, to keep the impact on binary size low. Reviewers: junbuml, mcrosier, davidxl, davide Reviewed By: junbuml Differential Revision: https://reviews.llvm.org/D41860 llvm-svn: 325126	2018-02-14 13:59:12 +00:00
Florian Hahn	0c62c8e2fa	[LoopInterchange] Incrementally update the dominator tree. We can use incremental dominator tree updates to avoid re-calculating the dominator tree after interchanging 2 loops. Reviewers: dmgreen, kuhar Reviewed By: kuhar Differential Revision: https://reviews.llvm.org/D43176 llvm-svn: 325122	2018-02-14 13:13:15 +00:00
Momchil Velikov	da0ab3b6ed	Use EXPECT_FALSE instead of EXPECT_EQ(false, ... Commit https://reviews.llvm.org/rL324489 added EXPECT_EQ(false, N->isUnsigned()); which older GCC versions dislike for some reason. Anyway, it looks like the proper GTest way is to use EXPECT_FALSE, etc. Differential Revision: https://reviews.llvm.org/D43233 llvm-svn: 325121	2018-02-14 13:11:56 +00:00
Petar Jovanovic	f2a43da3d7	[Utils] Salvage the debug info of DCE'ed 'and' instructions Preserve debug info from a dead 'and' instruction with a constant. Patch by Djordje Todorovic. Differential Revision: https://reviews.llvm.org/D43163 llvm-svn: 325119	2018-02-14 13:10:35 +00:00
Pavel Labath	ff70972dde	Revert r325107 (case folding DJB hash) and subsequent build fix The "knownValuesUnicode" test in the patch fails on ppc64 and arm64 bots. Reverting while I investigate. llvm-svn: 325115	2018-02-14 11:06:39 +00:00
Eugene Leviant	f2917f50a9	[IRMover] Move type name extraction to a separate function. NFC llvm-svn: 325110	2018-02-14 10:32:47 +00:00
Pavel Labath	621a8f0c9c	Fix build broken by r325107 Older gcc versions need an extra pair of {}s to convert a string literal into llvm::StringLiteral. llvm-svn: 325109	2018-02-14 10:25:32 +00:00
Pavel Labath	2bf9f58339	Implement a case-folding version of DJB hash Summary: This patch implements a variant of the DJB hash function which folds the input according to the algorithm in the Dwarf 5 specification (Section 6.1.1.4.5), which in turn references the Unicode Standard (Section 5.18, "Case Mappings"). To achieve this, I have added a llvm::sys::unicode::foldCharSimple function, which performs this mapping. The implementation of this function was generated from the CaseMatching.txt file from the Unicode spec using a python script (which is also included in this patch). The script tries to optimize the function by coalescing adjacent mappings with the same shift and stride (terms I made up). Theoretically, it could be made a bit smarter and merge adjacent blocks that were interrupted by only one or two characters with exceptional mapping, but this would save only a couple of branches, while it would greatly complicate the implementation, so I deemed it was not worth it. Since we assume that the vast majority of the input characters will be US-ASCII, the folding hash function has a fast-path for handling these, and only whips out the full decode+fold+encode logic if we encounter a character outside of this range. It might be possible to implement the folding directly on utf8 sequences, but this would also bring a lot of complexity for the few cases where we will actually need to process non-ascii characters. Reviewers: JDevlieghere, aprantl, probinson, dblaikie Subscribers: mgorny, hintonda, echristo, clayborg, vleschuk, llvm-commits Differential Revision: https://reviews.llvm.org/D42740 llvm-svn: 325107	2018-02-14 10:05:09 +00:00
Elena Demikhovsky	966bb89b1a	Adding a width of the GEP index to the Data Layout. Making a width of GEP Index, which is used for address calculation, to be one of the pointer properties in the Data Layout. p[address space]:size:memory_size:alignment:pref_alignment:index_size_in_bits. The index size parameter is optional; if not specified, it is equal to the pointer size. Till now, the InstCombiner normalized GEPs and extended the Index operand to the pointer width. It works fine if you can convert pointer to integer for address calculation and all registered targets do this. But some ISAs have a very restricted instruction set for the pointer calculation. During discussions it was decided to retrieve information for the GEP index from the Data Layout. http://lists.llvm.org/pipermail/llvm-dev/2018-January/120416.html I added an interface to the Data Layout and I changed the InstCombiner and some other passes to take the Index width into account. This change does not affect any in-tree target. I added tests to cover data layouts with explicitly specified index size. Differential Revision: https://reviews.llvm.org/D42123 llvm-svn: 325102	2018-02-14 06:58:08 +00:00
Craig Topper	9cacb9b10c	[SelectionDAG] Remove duplicate code from TargetLowering::SimplifySetCC. This exact code already exists a little further up. llvm-svn: 325101	2018-02-14 06:51:57 +00:00
Serge Pavlov	23a9f5e3f8	Use delete[] instead of free llvm-svn: 325100	2018-02-14 06:14:30 +00:00
Serge Pavlov	18df8e9c56	Use delete[] to deallocate array of chars llvm-svn: 325099	2018-02-14 05:14:31 +00:00
Serge Pavlov	4d8a6548e7	Refactor DisassembleInfo in MachODump.cpp The change implements constructor of DisassembleInfo to avoid duplication of initialization code and gets rid of malloc/free where possible. Differential Revision: https://reviews.llvm.org/D43003 llvm-svn: 325098	2018-02-14 03:26:27 +00:00
Reid Kleckner	71205eed30	[X86] Remove dead code from retpoline thunk generation Follow-up to r325049 llvm-svn: 325085	2018-02-14 00:24:29 +00:00
Vitaly Buka	0e20395dac	[gold] Fix error report in thinlto_emit_linked_objects.ll test Summary: It's just cleanup after r323818 to avoid irrelevant error message inside the test. Existing version of test passed but generated unrelated error report about symbol redefinition. llvm-svn: 325080	2018-02-14 00:00:43 +00:00
Sam McCall	41b574ca17	Fix off-by-one in set_thread_name which causes truncation to fail on Linux llvm-svn: 325069	2018-02-13 23:23:59 +00:00
Daniel Sanders	e6e2436fba	[globalisel][legalizerinfo] Follow up on post-commit review comments after r323681 * Document most API's * Delete a useless function call * Fix a discrepancy between the single and multi-opcode variants of getActionDefinitions(). The multi-opcode variant now requires that more than one opcode is requested. Previously it acted much like the single-opcode form but unnecessarily enforced the requirements of the multi-opcode form. llvm-svn: 325067	2018-02-13 23:02:44 +00:00
Sanjay Patel	a08a14762d	[InstCombine] put tests of mul with neg operand(s) together; NFC llvm-svn: 325066	2018-02-13 23:02:12 +00:00
Vedant Kumar	91ac0024df	[GVN] Salvage debug info from dead insts This preserves an additional 581 unique source variables in a stage2 build of clang (according to `llvm-dwarfdump --statistics`). It increases the size of the .debug_loc section by 0.1% (or 87139 bytes). Differential Revision: https://reviews.llvm.org/D43255 llvm-svn: 325063	2018-02-13 22:27:17 +00:00
Sanjay Patel	dc99a6d842	[InstCombine] (lshr X, 31) * Y --> (ashr X, 31) & Y This replaces the bit-tracking based fold that did the same thing, but it only worked for scalars and not directly. There is no evidence in existing regression tests that the greater power of bit-tracking was needed here, but we should be aware of this potential loss of optimization. llvm-svn: 325062	2018-02-13 22:24:37 +00:00
Sanjay Patel	2b7d51f23d	[InstCombine] add vector tests, fix comments; NFC The scalar folds are done indirectly and use potentially expensive value tracking calls. That can be improved along with the enhancement to support vector types. llvm-svn: 325051	2018-02-13 21:19:42 +00:00
Reid Kleckner	f0fe7234f4	[X86] Use EDI for retpoline when no scratch regs are left Summary: Instead of solving the hard problem of how to pass the callee to the indirect jump thunk without a register, just use a CSR. At a call boundary, there's nothing stopping us from using a CSR to hold the callee as long as we save and restore it in the prologue. Also, add tests for this mregparm=3 case. I wrote execution tests for __llvm_retpoline_push, but they never got committed as lit tests, either because I never rewrote them or because they got lost in merge conflicts. Reviewers: chandlerc, dwmw2 Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D43214 llvm-svn: 325049	2018-02-13 20:47:49 +00:00
Sanjay Patel	1f92362a35	[InstCombine] (bool X) * Y --> X ? Y : 0 This is both a functional improvement for vectors and an efficiency improvement for scalars. The existing code below the new folds does the same thing for scalars, but in an indirect and expensive way. llvm-svn: 325048	2018-02-13 20:41:22 +00:00
Reid Kleckner	c5216dadb8	[LLD] Implement /guard:[no]longjmp Summary: This protects calls to longjmp from transferring control to arbitrary program points. Instead, longjmp calls are limited to the set of registered setjmp return addresses. This also implements /guard:nolongjmp to allow users to link in object files that call setjmp that weren't compiled with /guard:cf. In this case, the linker will approximate the set of address taken functions, but it will leave longjmp unprotected. I used the following program to test, compiling it with different -guard flags: $ cl -c t.c -guard:cf $ lld-link t.obj -guard:cf #include <setjmp.h> #include <stdio.h> jmp_buf buf; void g() { printf("before longjmp\n"); fflush(stdout); longjmp(buf, 1); } void f() { if (setjmp(buf)) { printf("setjmp returned non-zero\n"); return; } g(); } int main() { f(); printf("hello world\n"); } In particular, the program aborts when the code is compiled without -guard:cf and linked with -guard:cf. That indicates that longjmps are protected. Reviewers: ruiu, inglorion, amccarth Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43217 llvm-svn: 325047	2018-02-13 20:32:53 +00:00
Aditya Nandakumar	fdb604ca7c	[GISel]: Add Pattern Matcher for G_FMUL. https://reviews.llvm.org/D43206 llvm-svn: 325044	2018-02-13 20:09:13 +00:00
Aditya Nandakumar	2444dedb25	[GISel]: Make Pattern matcher for FADD commutative llvm-svn: 325043	2018-02-13 20:09:11 +00:00
Stanislav Mekhanoshin	f1be30ca6b	[AMDGPU] Cleanup in memory legalizer tests. NFC. llvm-svn: 325042	2018-02-13 20:03:32 +00:00
Adrian Prantl	e07824ace6	Document the shortcomings of DwarfExpression::addMachineReg(). Also make a drive-by-fix of a bug in the subregister scan code that only triggers with an incomplete or otherwise very irregular machine description. rdar://problem/37404493 This re-applies r324972 with an early exit in the case of a complete failure to make this commit NFC again as intended. llvm-svn: 325041	2018-02-13 19:54:00 +00:00
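
Two of the InstCombine commits listed above (dc99a6d842 and 1f92362a35) describe algebraic folds that can be checked by hand. The following is a minimal sketch, not LLVM code: it models 32-bit integers in Python with an explicit mask, and the helper names (lshr, ashr, mul, select) are hypothetical stand-ins for the IR operations named in the commit subjects. It only illustrates why the rewritten forms compute the same value as the originals for scalar i32 inputs.

```python
# Sketch only: models i32 arithmetic with a mask; helper names are hypothetical.
MASK32 = 0xFFFFFFFF

def lshr(x, n):
    """Logical shift right on a 32-bit value (zero fill)."""
    return (x & MASK32) >> n

def ashr(x, n):
    """Arithmetic shift right on a 32-bit value (sign fill)."""
    x &= MASK32
    signed = x - (1 << 32) if x & 0x80000000 else x
    return (signed >> n) & MASK32

def mul(a, b):
    """Wrapping 32-bit multiply."""
    return (a * b) & MASK32

def select(cond, a, b):
    """Models the IR select instruction for scalar operands."""
    return a if cond else b

def sign_bit_mul(x, y):
    # Original pattern: (lshr X, 31) * Y
    return mul(lshr(x, 31), y)

def sign_bit_and(x, y):
    # Folded pattern: (ashr X, 31) & Y
    return ashr(x, 31) & (y & MASK32)

def bool_mul(b, y):
    # Original pattern: (bool X) * Y, with the i1 zero-extended to i32
    return mul(1 if b else 0, y)

def bool_select(b, y):
    # Folded pattern: X ? Y : 0
    return select(b, y & MASK32, 0)

samples = [0, 1, 2, 0x7FFFFFFF, 0x80000000, 0xFFFFFFFF, 0x12345678, 0xDEADBEEF]
for x in samples:
    for y in samples:
        assert sign_bit_mul(x, y) == sign_bit_and(x, y)
for y in samples:
    for b in (False, True):
        assert bool_mul(b, y) == bool_select(b, y)
```

The first fold works because shifting right by 31 leaves only the sign bit: the logical shift gives 0 or 1, so the product is 0 or Y, while the arithmetic shift gives all-zeros or all-ones, so the mask is likewise 0 or Y.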

160201 Commits