llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Craig Topper	652ef79124	[InstCombine] Remove one use restriction on the shift for calls to foldICmpAndShift. If this transformation succeeds, we're going to remove our dependency on the shift by rewriting the and. So it doesn't matter how many uses the shift has. This distributes the one use check to other transforms in foldICmpAndConstConst that do need it. Differential Revision: https://reviews.llvm.org/D38206 llvm-svn: 314233	2017-09-26 18:47:25 +00:00
Sam Clegg	f05301ced0	[WebAssembly] Use function/global index space in WasmSymbol It is useful for the symbol to contain the index of the function or global it represents in the function/global index space. For imports we also store the import index so that the linker can find, for example, the signature of the corresponding function, which is defined by the import. In the long run we need to decide whether this API surface should be closer to binary (where imported functions are separate) or the wasm spec (where the function index space is unified). Differential Revision: https://reviews.llvm.org/D38189 llvm-svn: 314230	2017-09-26 18:21:12 +00:00
Jake Ehrlich	f096324571	[llvm-objcopy] Add support for dynamic relocations This change adds support for dynamic relocations (allocated SHT_REL/SHT_RELA sections with a dynamic symbol table as their link). The binary I added for the test is here: https://drive.google.com/file/d/0B3gtIAmiMwZXSjJUZE9pUjd4M0k/view?usp=sharing Unless support for dynamic symbol tables in yaml2obj is added this is needed. Differential Revision: https://reviews.llvm.org/D37915 llvm-svn: 314227	2017-09-26 18:02:25 +00:00
Artem Belevich	66a9be35a9	[NVPTX] added match.{any,all}.sync instructions, intrinsics & builtins. Differential Revision: https://reviews.llvm.org/D38191 llvm-svn: 314223	2017-09-26 17:07:23 +00:00
Simon Atanasyan	0bba282e33	[mips] Use llvm-dwarfdump to simplify the test. NFC llvm-svn: 314222	2017-09-26 17:02:35 +00:00
Craig Topper	69fd9c7ef1	[X86] Add support for v16i32 UMUL_LOHI/SMUL_LOHI Summary: This patch extends the v8i32/v4i32 custom lowering to support v16i32 Reviewers: zvi, RKSimon Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38274 llvm-svn: 314221	2017-09-26 16:43:57 +00:00
Krzysztof Parzyszek	1d4cc25005	[Hexagon] Fix a typo: #ifndef DEBUG -> #ifndef NDEBUG llvm-svn: 314216	2017-09-26 15:31:15 +00:00
Krzysztof Parzyszek	b9efefedbd	[Hexagon] Fix initialization of HexagonSubtarget Make sure that "initializeSubtargetDependencies" sets all members that InstrInfo and the like may depend on. llvm-svn: 314214	2017-09-26 15:06:37 +00:00
Jonas Devlieghere	57be864d99	[dwarfdump] Skip 'stripped' sections When dsymutil generates the companion file, it strips all unnecessary sections by omitting their body and setting the offset in their corresponding load command to zero. One such section is the .eh_frame section, as it contains runtime information rather than debug information and is part of the __TEXT segment. When reading this section, we would just read the number of bytes specified in the load command, starting from offset 0 (i.e. the beginning of the file). Rather than trying to parse this obviously invalid section, dwarfdump now skips this. Differential revision: https://reviews.llvm.org/D38135 llvm-svn: 314208	2017-09-26 14:22:35 +00:00
Simon Pilgrim	2625776de3	[X86][XOP] Merge rotation opcodes with AVX512 equivalents. NFCI. The XOP rotations act as ROTL with +ve values and ROTR with -ve values, which means that we can treat them all as ROTL with unsigned modulo. We already check that we're only trying to lower as ROTL for XOP rotations. Differential Revision: https://reviews.llvm.org/D37949 llvm-svn: 314207	2017-09-26 14:12:50 +00:00
Sanjay Patel	13ff833337	[DSE] Merge stores when the later store only writes to memory locations the early store also wrote to (2nd try) This is a 2nd attempt at: https://reviews.llvm.org/rL310055 ...which was reverted at rL310123 because of PR34074: https://bugs.llvm.org/show_bug.cgi?id=34074 In this version, we break out of the inner loop after we successfully merge and kill a pair of stores. In the earlier rev, we were continuing instead, which meant we could process the invalid info from a now dead store. Original commit message (authored by Filipe Cabecinhas): This fixes PR31777. If both stores' values are ConstantInt, we merge the two stores (shifting the smaller store appropriately) and replace the earlier (and larger) store with an updated constant. In the future we should also support vectors of integers. And maybe float/double if we can. Differential Revision: https://reviews.llvm.org/D30703 llvm-svn: 314206	2017-09-26 13:54:28 +00:00
Coby Tayree	a898c4daa9	[x86] fix pr29061 https://bugs.llvm.org//show_bug.cgi?id=29061 Don't try referencing REX-needed regs when not in 64-bit mode Aligns to GCC Differential Revision: https://reviews.llvm.org/D37801 llvm-svn: 314203	2017-09-26 13:28:05 +00:00
Simon Pilgrim	3f431534a8	Tidyup P->getComplexPatternInfo call by moving it inside if( != NULL) test. NFCI. llvm-svn: 314202	2017-09-26 12:59:01 +00:00
Sylvestre Ledru	76a8248dbb	Don't move llvm.localescape outside the entry block in the GCOV profiling pass Summary: This fixes https://bugs.llvm.org/show_bug.cgi?id=34714. Patch by Marco Castelluccio Reviewers: rnk Reviewed By: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38224 llvm-svn: 314201	2017-09-26 11:56:43 +00:00
Benjamin Kramer	1560b4082a	Revert "[X86] Make all the NOREX CodeGenOnly instructions into postRA pseudos like the NOREX version of TEST." Makes llc crash. This reverts commit r314151. llvm-svn: 314199	2017-09-26 10:25:27 +00:00
Jonas Devlieghere	6225068807	[dsymutil] Better support for symbol aliases This patch adds logic to follow a symbol's aliases when the symbol name cannot be found in the current object file. It checks the main binary for the symbol's address and queries the current object for its aliases (symbols with the same address) before printing out a warning. Differential revision: https://reviews.llvm.org/D38230 llvm-svn: 314198	2017-09-26 08:17:28 +00:00
Uriel Korach	ba4020d436	[X86] Finishing broadcastf32x2 and broadcasti32x2 intrinsics lowering to IR. llvm side. Removing X86 broadcast(f/i)32x2 intrinsics from llvm. Adding autoUpgrade support. Moving matching tests from avx512dq-intrinsics.ll to avx512dq-intrinsics-upgrade.ll and from avx512dqvl-intrinsics.ll to avx512dqvl-intrinsics-upgrade.ll. Differential Revision: https://reviews.llvm.org/D38220 llvm-svn: 314195	2017-09-26 07:39:39 +00:00
Matthias Braun	d2ac8bfeac	CMake: Add option to set LLVM_ENABLE_DUMP Differential Revision: https://reviews.llvm.org/D38267 llvm-svn: 314186	2017-09-26 02:36:58 +00:00
Matthias Braun	bd0cb32d95	TargetLibraryInfo: Stop guessing wchar_t size Usually the frontend communicates the size of wchar_t via metadata and we can optimize wcslen (and possibly other calls in the future). In cases without the wchar_size metadata we would previously try to guess the correct size based on the target triple; however this is fragile to keep up to date and may miss users manually changing the size via flags. Better be safe and stop guessing and optimizing if the frontend didn't communicate the size. Differential Revision: https://reviews.llvm.org/D38106 llvm-svn: 314185	2017-09-26 02:36:57 +00:00
Dylan McKay	4b6ad2c769	[AVR] Fix the build after setting alignment to 1 in r314179 Changing all types to be byte-aligned broke a small number of tests. llvm-svn: 314183	2017-09-26 02:07:54 +00:00
Dylan McKay	6558a725d9	[AVR] Prefer BasicBlock::getIterator over Function::begin() Thanks to Eli Friedman for the suggestion. llvm-svn: 314182	2017-09-26 01:37:53 +00:00
Dylan McKay	8d828ca86f	[AVR] When lowering shifts into loops, put newly generated MBBs in the same spot as the original MBB Discovered in avr-rust/rust#62 https://github.com/avr-rust/rust/issues/62 Patch by Gergo Erdi. llvm-svn: 314180	2017-09-26 00:51:03 +00:00
Dylan McKay	443349b426	[AVR] Use 1-byte alignment for all data types This was an oversight in the original backend data layout. The AVR architecture does not have the concept of unaligned loads - all loads/stores from all addresses are aligned to one byte. Discovered in avr-rust issue #64 https://github.com/avr-rust/rust/issues/64 Patch By Gergo Erdi. llvm-svn: 314179	2017-09-26 00:45:27 +00:00
Vedant Kumar	227a1e42db	[docs] llvm-cov: Make docs for boolean options more consistent llvm-svn: 314176	2017-09-25 23:10:04 +00:00
Vedant Kumar	d0c6992660	[llvm-cov] Warn if -show-functions is used without query files llvm-cov's report mode does not print any output when -show-functions is specified and no source files are specified. This can be surprising, so the tool should at least print out an error message when this happens. rdar://problem/34636859 llvm-svn: 314175	2017-09-25 23:10:03 +00:00
Adrian Prantl	2d690975d9	Modernize comments llvm-svn: 314174	2017-09-25 22:51:26 +00:00
Adrian Prantl	81130e1ff4	Modernize comments llvm-svn: 314173	2017-09-25 22:51:15 +00:00
Vlad Tsyrklevich	0b4abdf3cc	Add section headers to SpecialCaseLists Summary: Sanitizer blacklist entries currently apply to all sanitizers--there is no way to specify that an entry should only apply to a specific sanitizer. This is important for Control Flow Integrity since there are several different CFI modes that can be enabled at once. For maximum security, CFI blacklist entries should be scoped to only the specific CFI mode(s) that entry applies to. Adding section headers to SpecialCaseLists allows users to specify more information about list entries, like sanitizer names or other metadata, like so: [section1] fun:fun1 [section2|section3] fun:fun23 The section headers are regular expressions. For backwards compatibility, blacklist entries entered before a section header are put into the '[*]' section so that blacklists without sections retain the same behavior. SpecialCaseList has been modified to also accept a section name when matching against the blacklist. It has also been modified so the follow-up change to clang can define a derived class that allows matching sections by SectionMask instead of by string. Reviewers: pcc, kcc, eugenis, vsk Reviewed By: eugenis, vsk Subscribers: vitalybuka, llvm-commits Differential Revision: https://reviews.llvm.org/D37924 llvm-svn: 314170	2017-09-25 22:11:11 +00:00
Eli Friedman	4fd32329a6	Revert r312724 ("[ARM] Remove redundant vcvt patterns."). It leads to some improvements, but also a regression for the simple case, so it's not clearly a good idea. test/CodeGen/ARM/vcvt.ll now has test coverage to show the difference. Ultimately, the right solution is probably to custom-lower fp-to-int conversions, to something like ARMISD::VCVT_F32_S32 plus a bitcast. It's hard to do the right thing when the implicit bitcast isn't visible to DAG transforms. llvm-svn: 314169	2017-09-25 22:07:33 +00:00
Quentin Colombet	ed9a033b66	[GlobalISel] Update the documentation and comment for G_[UN]MERGE_VALUES In r296921, we added the G_[UN]MERGE_VALUES node, but did not update the documentation. Fixing that. NFC. llvm-svn: 314168	2017-09-25 22:03:06 +00:00
Quentin Colombet	f124af2f5e	[GlobalISel] Update the documentation for G_SEQUENCE This instruction has been removed in r306120. NFC. llvm-svn: 314167	2017-09-25 22:03:05 +00:00
Quentin Colombet	2e2793f727	[GlobalISel] Update the documentation and comments for G_EXTRACT In r297100, G_EXTRACT changed from a multiple results instruction to a single result one. Update the documentation accordingly. NFC. llvm-svn: 314166	2017-09-25 22:03:01 +00:00
Saleem Abdulrasool	6256c01685	X86: remove R12 from CSR on Windows x64 SwiftCC R12 is used for the SwiftError parameter. It is no longer a CSR as it is used to transfer the SwiftError, and the caller must preserve it if they need to. llvm-svn: 314165	2017-09-25 22:00:17 +00:00
Eli Friedman	08f50597ed	[ARM] Fix tests for vcvt+store to return void. This is what I meant to do in r314161; I didn't realize I'd messed up because the generated assembly is currently identical. llvm-svn: 314163	2017-09-25 21:55:27 +00:00
Eli Friedman	2da10bf0b6	[ARM] Add tests for vcvt followed by store. llvm-svn: 314161	2017-09-25 21:37:52 +00:00
Eli Friedman	f06e0331f8	[ARM] Regenerate vcvt test checks. llvm-svn: 314160	2017-09-25 21:34:29 +00:00
Craig Topper	b5e7463711	[InstCombine] Move an optimization from foldICmpAndConstConst to foldICmpUsingKnownBits All this optimization cares about is knowing how many low bits of LHS are known to be zero and whether that means that the result is 0 or greater than the RHS constant. It doesn't matter where the zeros in the low bits came from. So we don't need to specifically look for an AND. Instead we can use known bits. Differential Revision: https://reviews.llvm.org/D38195 llvm-svn: 314153	2017-09-25 21:15:00 +00:00
Craig Topper	854c77fbb7	[X86] Don't select anyext GR32->GR64 to SUBREG_TO_REG. Use INSERT_SUBREG instead. As far as I know SUBREG_TO_REG is stating that the upper bits are 0. But if we are just converting the GR32 with no checks, then we have no reason to say the upper bits are 0. I don't really know how to test this today since I can't find anything that looks that closely at SUBREG_TO_REG. The test changes here seem to be some perturbance of register allocation. Differential Revision: https://reviews.llvm.org/D38001 llvm-svn: 314152	2017-09-25 21:14:59 +00:00
Craig Topper	34dfaa97d8	[X86] Make all the NOREX CodeGenOnly instructions into postRA pseudos like the NOREX version of TEST. llvm-svn: 314151	2017-09-25 21:14:55 +00:00
Jake Ehrlich	2e6e774fe6	[llvm-objcopy] Refactor code to include initialize method This change refactors some of the code to allow for some code deduplication in later diffs as well as just to make adding a new section type more self contained to the class itself. The idea for this was first mentioned by James in D 37915 and will be used in that change as recommended. This change follows changes for dynamic sections but precedes support for dynamic relocations. Differential Revision: https://reviews.llvm.org/D38008 llvm-svn: 314148	2017-09-25 20:37:28 +00:00
Sanjay Patel	115f2aa1d5	[InstCombine] remove extract-of-select vector transform (2nd try) The 1st attempt at this: https://reviews.llvm.org/rL314117 was reverted at: https://reviews.llvm.org/rL314118 because of bot fails for clang tests that were checking optimized IR. That should be fixed with: https://reviews.llvm.org/rL314144 ...so try again. Original commit message: The transform to convert an extract-of-a-select-of-vectors was added at: https://reviews.llvm.org/rL194013 And a question about the validity of this transform was raised in the review: https://reviews.llvm.org/D1539: ...but not answered AFAICT. Most of the motivating cases in that patch are now handled by other combines. These are the tests that were added with the original commit, but they are not regressing even after we remove the transform in this patch. The diffs we see after removing this transform cause us to avoid increasing the instruction count, so we don't want to do those transforms as canonicalizations. The motivation for not turning a vector-select-of-vectors into a scalar operation is shown in PR33301: https://bugs.llvm.org/show_bug.cgi?id=33301 ...in those cases, we'll get vector ops with this patch rather than the vector/scalar mix that we currently see. Differential Revision: https://reviews.llvm.org/D38006 llvm-svn: 314147	2017-09-25 20:30:53 +00:00
Benjamin Kramer	85cf27c697	[Hexagon] Avoid unused variable warnings in Release builds. No functionality change intended. llvm-svn: 314143	2017-09-25 19:42:20 +00:00
Justin Lebar	a89ad847a6	Revert "[NVPTX] added match.{any,all}.sync instructions, intrinsics & builtins.", rL314135. Causing assertion failures on macos: > Assertion failed: (Num < NumOperands && "Invalid child # of SDNode!"), > function getOperand, file > /Users/buildslave/jenkins/workspace/clang-stage1-cmake-RA-incremental/llvm/include/llvm/CodeGen/SelectionDAGNodes.h, > line 835. http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/42739/testReport/LLVM/CodeGen_NVPTX/surf_read_cuda_ll/ llvm-svn: 314142	2017-09-25 19:41:56 +00:00
Konstantin Belochapka	e991d1a8a7	[X86] [ASM INTEL SYNTAX] fix for incorrect assembler code generation when x86-asm-syntax=intel (PR34617). Fix for incorrect code generation when x86-asm-syntax=intel. Differential Revision: https://reviews.llvm.org/D37945 llvm-svn: 314140	2017-09-25 19:26:48 +00:00
Craig Topper	1cf3918f57	[SelectionDAG] Teach simplifyDemandedBits to handle shifts by constant splat vectors This teaches simplifyDemandedBits to handle constant splat vector shifts. This required changing some uses of getZExtValue to getLimitedValue since we can't rely on legalization using getShiftAmountTy for the shift amount. I believe there may have been a bug in the ((X << C1) >>u ShAmt) handling where we didn't check if the inner shift was too large. I've fixed that here. I had to add new patterns to ARM because the zext/sext the patterns were trying to look for got turned into an any_extend with this patch. Happy to split that out too, but not sure how to test without this change. Differential Revision: https://reviews.llvm.org/D37665 llvm-svn: 314139	2017-09-25 19:26:08 +00:00
Alexey Bataev	85ed5e891b	[SLP] Add a test for PR32086, NFC. llvm-svn: 314137	2017-09-25 19:12:59 +00:00
Krzysztof Parzyszek	2aac4f7aa4	[Hexagon] Better determination of register classes in bit tracker Add two callbacks to MachineEvaluator, so that specific implementations can specify more details about register classes: - composeWithSubRegIndex(RC,Idx), to provide the register class for a register from RC used in conjunction with a subregister index Idx. - getPhysRegBitWidth(Reg), to provide the size in bits of the given physical register. llvm-svn: 314136	2017-09-25 19:12:55 +00:00
Artem Belevich	8a55fc8708	[NVPTX] added match.{any,all}.sync instructions, intrinsics & builtins. Differential Revision: https://reviews.llvm.org/D38191 llvm-svn: 314135	2017-09-25 18:53:57 +00:00
Krzysztof Parzyszek	7b0a224a6e	[Hexagon] Make getHexagonSubRegIndex take reference instead of pointer llvm-svn: 314134	2017-09-25 18:49:42 +00:00
Craig Topper	2a8db7c0ba	[AVX-512] Replace large number of explicit patterns that check for insert_subvector with zero after masked compares with fewer patterns with predicate This replaces the large number of patterns that handle every possible case of zeroing after a masked compare with a few simpler patterns that use a predicate to check for a masked compare producer. This is similar to what we do for detecting free GR32->GR64 zero extends and free xmm->ymm/zmm zero extends. This shrinks the isel table from ~590k to ~531k. This is a roughly 10% reduction in size. Differential Revision: https://reviews.llvm.org/D38217 llvm-svn: 314133	2017-09-25 18:43:13 +00:00
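
Note on commit 2625776de3 ([X86][XOP] Merge rotation opcodes with AVX512 equivalents): the message states that a rotation which goes left for positive amounts and right for negative amounts can be treated as ROTL with an unsigned modulo amount. The Python sketch below only models that arithmetic identity on a single lane; it is not LLVM's lowering code, and the function names (rotl, rotr, signed_rotate) and the 8-bit default width are hypothetical choices made for illustration.

```python
def rotl(value: int, amount: int, width: int = 8) -> int:
    """Rotate `value` left by `amount` bits inside a `width`-bit lane."""
    mask = (1 << width) - 1
    amount %= width          # Python's % is non-negative for a positive width
    value &= mask
    return ((value << amount) | (value >> (width - amount))) & mask


def rotr(value: int, amount: int, width: int = 8) -> int:
    """Rotate `value` right by `amount` bits inside a `width`-bit lane."""
    mask = (1 << width) - 1
    amount %= width
    value &= mask
    return ((value >> amount) | (value << (width - amount))) & mask


def signed_rotate(value: int, signed_amount: int, width: int = 8) -> int:
    """Hypothetical model of the rule in the commit message:
    positive amounts rotate left, negative amounts rotate right."""
    if signed_amount >= 0:
        return rotl(value, signed_amount, width)
    return rotr(value, -signed_amount, width)


def as_rotl_only(value: int, signed_amount: int, width: int = 8) -> int:
    """Same result using only ROTL with the amount reduced modulo the width."""
    return rotl(value, signed_amount % width, width)


if __name__ == "__main__":
    # Rotating 0x81 right by one bit equals rotating it left by seven bits.
    print(hex(signed_rotate(0x81, -1)))  # 0xc0
    print(hex(as_rotl_only(0x81, -1)))   # 0xc0
```

The equivalence holds because rotating right by k bits in a w-bit lane is the same permutation of bits as rotating left by (w - k) mod w, which is what the unsigned modulo of a negative amount produces.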


154739 Commits