llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Reid Kleckner	0aa90f844e	[MC] Properly diagnose badly scoped .cfi_ directives Removes two report_fatal_errors. Implement this by removing EmitCFICommon, and do the checking in getCurrentDwarfFrameInfo. Have the callers check for null before dereferencing it. llvm-svn: 315264	2017-10-10 01:49:21 +00:00
Reid Kleckner	cb7ebedc26	Give a test a triple llvm-svn: 315263	2017-10-10 01:34:31 +00:00
Reid Kleckner	eb8a8f9e31	[SEH] Use reportError instead of report_fatal_error for bad directives This makes the .seh_ directives slightly more usable from standalone assembly files. This removes a large number of report_fatal_errors and recovers from the error by ignoring the directive. llvm-svn: 315262	2017-10-10 01:26:25 +00:00
Lang Hames	887aa34325	[MC] Plumb unique_ptr<MCWasmObjectTargetWriter> through createWasmObjectWriter to WasmObjectWriter's constructor. Fixes the same ownership issue for COFF that r315245 did for MachO: WasmObjectWriter takes ownership of its MCWasmObjectTargetWriter, so we want to pass this through to the constructor via a unique_ptr, rather than a raw ptr. llvm-svn: 315260	2017-10-10 01:15:10 +00:00
Reid Kleckner	bfd9be426f	[MC] Suppress .Lcfi labels when emitting textual assembly Summary: This suppresses the generation of .Lcfi labels in our textual assembler. It was annoying that this generated cascading .Lcfi labels: llc foo.ll -o - \| llvm-mc \| llvm-mc After three trips through MCAsmStreamer, we'd have three labels in the output when none are necessary. We should only bother creating the labels and frame data when making a real object file. This supercedes D38605, which moved the entire .seh_ implementation into MCObjectStreamer. This has the advantage that we do more checking when emitting textual assembly, as a minor efficiency cost. Outputting textual assembly is not performance critical, so this shouldn't matter. Reviewers: majnemer, MatzeB Subscribers: qcolombet, nemanjai, javed.absar, eraman, hiraditya, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D38638 llvm-svn: 315259	2017-10-10 00:57:36 +00:00
Reid Kleckner	94d7eeac4a	Fix Wasm build after r315254 llvm-svn: 315258	2017-10-10 00:52:40 +00:00
Lang Hames	f80af1598f	[MC] Plumb unique_ptr<MCWinCOFFObjectTargetWriter> through createWinCOFFObjectWriter to WinCOFFObjectWriter's constructor. Fixes the same ownership issue for COFF that r315245 did for MachO: WinCOFFObjectWriter takes ownership of its MCWinCOFFObjectTargetWriter, so we want to pass this through to the constructor via a unique_ptr, rather than a raw ptr. llvm-svn: 315257	2017-10-10 00:50:29 +00:00
Lang Hames	735ed4f2bd	[MC] Plumb unique_ptr<MCELFObjectTargetWriter> through createELFObjectWriter to ELFObjectWriter's constructor. Fixes the same ownership issue for ELF that r315245 did for MachO: ELFObjectWriter takes ownership of its MCELFObjectTargetWriter, so we want to pass this through to the constructor via a unique_ptr, rather than a raw ptr. llvm-svn: 315254	2017-10-09 23:53:15 +00:00
Adam Nemet	ec7409d86a	Rename OptimizationDiagnosticInfo.* to OptimizationRemarkEmitter.* Sync it up with the name of the class actually defined here. This has been bothering me for a while... llvm-svn: 315249	2017-10-09 23:19:02 +00:00
Zachary Turner	dff3bfd63c	[llvm-rc] Try again to fix errors on big endian systems. llvm-svn: 315248	2017-10-09 22:59:40 +00:00
Eugene Zemtsov	b8093853d2	Fix LLDB build for Android. Currently libstdc++ on Android doesn't support std::to_string(). Differential Revision: https://reviews.llvm.org/D38701 llvm-svn: 315246	2017-10-09 22:43:35 +00:00
Lang Hames	976ebea58c	[MC] Plumb unique_ptr<MCMachObjectTargetWriter> through createMachObjectWriter to MCObjectWriter's constructor. MCObjectWriter takes ownership of its MCMachObjectTargetWriter argument -- this patch plumbs that ownership relationship through the constructor (which previously took raw MCMachObjectTargetWriter*) and the createMachObjectWriter function. llvm-svn: 315245	2017-10-09 22:38:13 +00:00
Jonas Devlieghere	67b84bcdf0	[DWARF] DW_TAG_imported_unit is not a unit type. As pointed out by David in D38453 and confirmed with the DWARF mailing list, DW_TAG_imported_unit is not a valid unit type. llvm-svn: 315244	2017-10-09 22:33:53 +00:00
Francis Ricci	a55e7b5513	[llvm-objdump] Use initializer list for scoped xar api constructors llvm-svn: 315243	2017-10-09 20:27:14 +00:00
Aditya Nandakumar	a8d5bff9e7	[GISel]: Fix generation of illegal COPYs during CallLowering We end up creating COPY's that are either truncating/extending and this should be illegal. https://reviews.llvm.org/D37640 Patch for X86 and ARM by igorb, rovka llvm-svn: 315240	2017-10-09 20:07:43 +00:00
Aditya Nandakumar	e577183c62	Add a helper to build Copy instructions in MachineIRBuilder llvm-svn: 315239	2017-10-09 20:07:41 +00:00
Zvi Rackover	9391fb0d26	[X86] Unsigned saturation subtraction canonicalization [the backend part] Summary: On behalf of julia.koval@intel.com The patch transforms canonical version of unsigned saturation, which is sub(max(a,b),a) or sub(a,min(a,b)) to special psubus insturuction on targets, which support it(8bit and 16bit uints). umax(a,b) - b -> subus(a,b) a - umin(a,b) -> subus(a,b) There is also extra case handled, when right part of sub is 32 bit and can be truncated, using UMIN(this transformation was discussed in https://reviews.llvm.org/D25987). The example of special case code: ``` void foo(unsigned short p, int max, int n) { int i; unsigned m; for (i = 0; i < n; i++) { m = --p; p = (unsigned short)(m >= max ? m-max : 0); } } ``` Max in this example is truncated to max_short value, if it is greater than m, or just truncated to 16 bit, if it is not. It is vaid transformation, because if max > max_short, result of the expression will be zero. Here is the table of types, I try to support, special case items are bold: \| Size \| 128 \| 256 \| 512 \| ----- \| ----- \| ----- \| ----- \| i8 \| v16i8 \| v32i8 \| v64i8 \| i16 \| v8i16 \| v16i16 \| v32i16 \| i32 \| \| v8i32* \| v16i32 \| i64 \| \| \| v8i64 Reviewers: zvi, spatel, DavidKreitzer, RKSimon Reviewed By: zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37534 llvm-svn: 315237	2017-10-09 20:01:10 +00:00
Alexey Bataev	1bb39f50ae	[SLP] Add test for reversed load, NFC. llvm-svn: 315232	2017-10-09 19:08:15 +00:00
Zachary Turner	c7deaade4e	Fix some C++ value / reference semantics issues. Some functions were taking Twine's not by const&, these are all fixed to take by const&. We also had a case where some functions were overloaded to accept by const& and &&. Now there is only one version which accepts by value and move's the value. llvm-svn: 315229	2017-10-09 18:50:29 +00:00
Daniel Sanders	6bf61e532c	[globalisel] Add support for ValueType operands in patterns. It's rare but there are a small number of patterns like this: (set i64:$dst, (add i64:$src1, i64:$src2)) These should be equivalent to register classes except they shouldn't check for a specific register bank. This doesn't occur in AArch64/ARM/X86 but does occasionally come up in other in-tree targets such as BPF. llvm-svn: 315226	2017-10-09 18:14:53 +00:00
Lang Hames	5dab814752	[MC] Use a unique_ptr<MCAssembler> for MCObjectStreamer's Assembler member. Removes manual new/delete. llvm-svn: 315225	2017-10-09 18:11:04 +00:00
Sanjay Patel	0c7653cd67	[InstCombine] fix formatting; NFC llvm-svn: 315223	2017-10-09 17:54:46 +00:00
Adrian McCarthy	bc0e80ef8f	Fix after r315079 Microsoft's debug implementation of std::copy checks if the destination is an array and then does some bounds checking. This was causing an assertion failure in fs::rename_internal which copies to a buffer of the appropriate size but that's type-punned to an array of length 1 for API compatibility reasons. Fix is to make make the destination a pointer rather than an array. llvm-svn: 315222	2017-10-09 17:50:01 +00:00
Francis Ricci	c869c90b7c	[dsymutil] Emit valid debug locations when no symbol flags are set Summary: swiftc emits symbols without flags set, which led dsymutil to ignore them when searching for global symbols, causing dwarf location data to be omitted. Xcode's dsymutil handles this case correctly, and emits valid location data. Add this functionality to llvm-dsymutil by allowing parsing of symbols with no flags set. Reviewers: aprantl, friss, JDevlieghere Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38587 llvm-svn: 315218	2017-10-09 17:27:47 +00:00
Alexey Bataev	b7c73fbd84	[SLP] Test for wrongly vectorized set of extractelements, NFC. llvm-svn: 315217	2017-10-09 17:14:03 +00:00
Javed Absar	10e5efdf30	[TableGen] remove make_range where not necessary llvm-svn: 315209	2017-10-09 16:21:25 +00:00
Zachary Turner	738623f975	[llvm-rc] Have the tokenizer discard single & block comments. This allows rc files to have comments. Eventually we should just use clang's c preprocessor, but that's a bit larger effort for minimal gain, and this is straightforward. Differential Revision: https://reviews.llvm.org/D38651 llvm-svn: 315207	2017-10-09 15:46:13 +00:00
Sanjay Patel	34c5f48035	[DAG] combine assertsexts around a trunc This was a suggested follow-up to: D37017 / https://reviews.llvm.org/rL313577 llvm-svn: 315206	2017-10-09 15:22:20 +00:00
Amara Emerson	7210057227	[AArch64] Improve codegen for inverted overflow checking intrinsics E.g. if we have a (xor(overflow-bit), 1) where overflow-bit comes from an intrinsic like llvm.sadd.with.overflow then we can kill the xor and use the inverted condition code for the CSEL. rdar://28495949 Reviewed By: kristof.beyls Differential Revision: https://reviews.llvm.org/D38160 llvm-svn: 315205	2017-10-09 15:15:09 +00:00
Sanjay Patel	ab6a4185c7	[x86] regenerate test checks; NFC llvm-svn: 315204	2017-10-09 15:01:58 +00:00
Sanjay Patel	63ad12c22b	[AArch64] fix typos in test assertions llvm-svn: 315203	2017-10-09 01:29:54 +00:00
Craig Topper	216c83a1d0	[X86] Remove a setLoadExtAction from the AVX512 section that uses an AVX512BW type and is alraedy present in the AVX512BW section. llvm-svn: 315202	2017-10-09 01:05:16 +00:00
Craig Topper	b6f8edc8a6	[X86] Enable extended comparison predicate support for SETUEQ/SETONE when targeting AVX instructions. We believe that despite AMD's documentation, that they really do support all 32 comparision predicates under AVX. Differential Revision: https://reviews.llvm.org/D38609 llvm-svn: 315201	2017-10-09 01:05:15 +00:00
Benjamin Kramer	f499a668fc	Remove unused variables. No functionality change. llvm-svn: 315196	2017-10-08 21:23:02 +00:00
Simon Pilgrim	f157b1c86c	[X86][SSE] Don't call combineTo inside combineX86ShufflesRecursively. NFCI. Return the combined shuffle from combineX86ShufflesRecursively and perform the combineTo in the caller. Makes it easier for future patches to use this in functions that aren't actually shuffles themselves. llvm-svn: 315195	2017-10-08 20:58:14 +00:00
Simon Pilgrim	d9f42bc3c3	Tidyup with clang-format. NFCI. llvm-svn: 315187	2017-10-08 19:24:30 +00:00
Simon Pilgrim	a7e60af32b	[X86][SSE] Add test case for PR27708 llvm-svn: 315186	2017-10-08 19:18:10 +00:00
Benjamin Kramer	477e3c6cab	Remove unused variables. No functionality change. llvm-svn: 315185	2017-10-08 19:11:02 +00:00
Craig Topper	ab81766852	[X86] Regenerate fast-isel-select-pseudo-cmov.ll to prepare for D38609. llvm-svn: 315184	2017-10-08 17:54:50 +00:00
Javed Absar	048b22499c	[TableGen] Simplify, add range_loop in CodeGenSchedule llvm-svn: 315183	2017-10-08 17:23:30 +00:00
Simon Pilgrim	6cfa36331f	[X86] getTargetConstantBitsFromNode - add support for decoding scalar constants llvm-svn: 315182	2017-10-08 17:21:18 +00:00
Craig Topper	20e1cfd79a	[X86] Prefer MOVSS/SD over BLENDI during legalization. Remove BLENDI versions of scalar arithmetic patterns Summary: We currently disable some converting of shuffles to MOVSS/MOVSD during legalization if SSE41 is enabled. But later during shuffle combining we go back to prefering MOVSS/MOVSD. Additionally we have patterns that look for BLENDIs to detect scalar arithmetic operations. I believe due to the combining using MOVSS/MOVSD these are unnecessary. Interestingly, we still codegen blend instructions even though lowering/isel emit movss/movsd instructions. Turns out machine CSE commutes them to blend, and then commuting those blends back into blends that are equivalent to the original movss/movsd. This patch fixes the inconsistency in legalization to prefer MOVSS/MOVSD. The one test change was caused by this change. The problem is that we have integer types and are mostly selecting integer instructions except for the shufps. This shufps forced the execution domain, but the vpblendw couldn't have its domain changed with a naive instruction swap. We could fix this by special casing VPBLENDW based on the immediate to widen the element type. The rest of the patch is removing all the excess scalar patterns. Long term we should probably add isel patterns to make MOVSS/MOVSD emit blends directly instead of relying on the double commute. We may also want to consider emitting movss/movsd for optsize. I also wonder if we should still use the VEX encoded blendi instructions even with AVX512. Blends have better throughput, and that may outweigh the register constraint. Reviewers: RKSimon, zvi Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38023 llvm-svn: 315181	2017-10-08 16:57:23 +00:00
Benjamin Kramer	57794f7063	Make more constructors constexpr or use =default. This lets the compiler reason about the type more easily. No functionality change intended. llvm-svn: 315180	2017-10-08 15:59:35 +00:00
Amara Emerson	0ca815214b	[AArch64][GlobalISel] Add a test case for G_PHI of p0 instruction selection. llvm-svn: 315179	2017-10-08 15:29:35 +00:00
Amara Emerson	291722096e	[AArch64][GlobalISel] Add a test case for G_PHI of p0 regbank selection. llvm-svn: 315178	2017-10-08 15:29:31 +00:00
Amara Emerson	8057ba1d40	[AArch64][GlobalISel] Make G_PHI of p0 types legal. Differential Revision: https://reviews.llvm.org/D38621 llvm-svn: 315177	2017-10-08 15:29:11 +00:00
Simon Pilgrim	0030b42baa	[X86][XOP] Add XOP oddshuffles tests XOP codegen is often different to generic AVX - thank you vpperm! llvm-svn: 315176	2017-10-08 12:58:15 +00:00
Gadi Haber	9b90d5167f	[X86][SKX] Adding the scheduling information for the SKX target. Adding the scheduling information for the SkylakeServer (SKX) target. This patch adds the instruction scheduling information for the SkylakeServer (SKX) architecture target by adding the file X86SchedSkylakeServer.td located under the X86 Target. We used the scheduling information retrieved from the Skylake architects in order to create the file. The scheduling information includes latency, number of micro-Ops and used ports by each SKL instruction. The patch continues the scheduling replacement and insertion effort started with the SNB target in r310792, the HSW target in r311879 and the SkylakeClient (SKL) target in rL313613. Please expect some performance fluctuations due to code alignment effects. Reviewers: zvi, RKSimon, craig.topper, chandlerc, aymanmu Differential Revision: https://reviews.llvm.org/D38443 Change-Id: I5c228fcc09e9e5a99b6116e62b356c4f9b971185 llvm-svn: 315175	2017-10-08 12:52:54 +00:00
Ayman Musa	05575acfe0	[X86] Add missing entries in 'MemoryFoldTable2Addr' to get complete form of the table. Get the folding table 'MemoryFoldTable2Addr' to a complete state as part of the process explained in https://reviews.llvm.org/D38028 Differential Revision: https://reviews.llvm.org/D38500 llvm-svn: 315174	2017-10-08 09:46:50 +00:00
Ayman Musa	56eb606f31	[X86][TableGen] Recommitting the X86 memory folding tables TableGen backend while disabling it by default. After the original commit ([[ https://reviews.llvm.org/rL304088 \| rL304088 ]]) was reverted, a discussion in llvm-dev was opened on 'how to accomplish this task'. In the discussion we concluded that the best way to achieve our goal (which is to automate the folding tables and remove the manually maintained tables) is: # Commit the tablegen backend disabled by default. # Proceed with an incremental updating of the manual tables - while checking the validity of each added entry. # Repeat previous step until we reach a state where the generated and the manual tables are identical. Then we can safely remove the manual tables and include the generated tables instead. # Schedule periodical (1 week/2 weeks/1 month) runs of the pass: - if changes appear (new entries): - make sure the entries are legal - If they are not, mark them as illegal to folding - Commit the changes (if there are any). CMake flag added for this purpose is "X86_GEN_FOLD_TABLES". Building with this flags will run the pass and emit the X86GenFoldTables.inc file under build/lib/Target/X86/ directory which is a good reference for any developer who wants to take part in the effort of completing the current folding tables. Differential Revision: https://reviews.llvm.org/D38028 llvm-svn: 315173	2017-10-08 09:20:32 +00:00

1 2 3 4 5 ...

155170 Commits