llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	676985b151	[llvm-mca][x86] Add 32-bit instruction resource tests These aren't exhaustive, but cover some instructions that are only available in 32-bit mode (where would we be without good BCD math performance?). llvm-svn: 338404	2018-07-31 17:33:08 +00:00
Zachary Turner	b435cc5faa	Resubmit r338340 "[MS Demangler] Better demangling of template arguments." This broke the build with GCC, but has since been fixed. llvm-svn: 338403	2018-07-31 17:16:44 +00:00
Craig Topper	80b2ac8a74	[X86] Add pattern matching for PMADDUBSW Summary: Similar to D49636, but for PMADDUBSW. This instruction has the additional complexity that the addition of the two products saturates to 16-bits rather than wrapping around. And one operand is treated as signed and the other as unsigned. A C example that triggers this pattern ``` static const int N = 128; int8_t A[2*N]; uint8_t B[2*N]; int16_t C[N]; void foo() { for (int i = 0; i != N; ++i) C[i] = MIN(MAX((int16_t)A[2*i]*(int16_t)B[2*i] + (int16_t)A[2*i+1]*(int16_t)B[2*i+1], -32768), 32767); } ``` Reviewers: RKSimon, spatel, zvi Reviewed By: RKSimon, zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49829 llvm-svn: 338402	2018-07-31 17:12:08 +00:00
Craig Topper	845f7d7748	[X86] Add test cases that could use PMADDUBSW. llvm-svn: 338401	2018-07-31 17:12:06 +00:00
Francis Visoiu Mistrih	508d196cc4	[X86] Preserve more liveness information in emitStackProbeInline This commit fixes two issues with the liveness information after the call: 1) The code always spills RCX and RDX if InProlog == true, which results in a use of an undefined phys reg. 2) FinalReg, JoinReg, RoundedReg, SizeReg are not added as live-ins to the basic blocks that use them, therefore they are seen as undefined. https://llvm.org/PR38376 Differential Revision: https://reviews.llvm.org/D50020 llvm-svn: 338400	2018-07-31 16:41:12 +00:00
Hsiangkai Wang	1a1998519f	[DebugInfo] Fix build failed in 'clang-cmake-armv8-full'. Builder clang-cmake-armv8-full failed because the assembly 'comment' notation is not '#' on the target. So, I use CHECK-SAME to avoid checking the comment notation on the same line in the test case. llvm-svn: 338398	2018-07-31 16:22:09 +00:00
Jakub Kuderski	26da35684f	[Dominators] Make slow walks shorter Summary: When DFS numbers are not yet calculated for a dominator tree, we have to walk it up to say whether one node dominates some other. This patch makes the slow walks shorter by only walking until the level of the node we check against is reached. This is because a node cannot possibly dominate something higher in its tree. When running opt with -O3, the patch results in: * 25% fewer loop iterations for `opt` (fullLTO) * 30% fewer loop iterations for sqlite Reviewers: brzycki, asbirlea, chandlerc, NutshellySima, grosser Reviewed By: NutshellySima Subscribers: mehdi_amini, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D49955 llvm-svn: 338396	2018-07-31 15:53:10 +00:00
Ewan Crawford	38ade2ebbf	Fix InstCombine address space assert Workaround bug where the InstCombine pass was asserting on the IR added in lit test, where we have a bitcast instruction after a GEP from an addrspace cast. The second bitcast in the test was getting combined into `bitcast <16 x i32>* %0 to <16 x i32> addrspace(3)*`, which looks like it should be an addrspace cast instruction instead. Otherwise if control flow is allowed to continue as it is now we create a GEP instruction `<badref> = getelementptr inbounds <16 x i32>, <16 x i32>* %0, i32 0`. However because the type of this instruction doesn't match the address space we hit an assert when replacing the bitcast with that GEP. ``` void llvm::Value::doRAUW(llvm::Value*, bool): Assertion `New->getType() == getType() && "replaceAllUses of value with new value of different type!"' failed. ``` Differential Revision: https://reviews.llvm.org/D50058 llvm-svn: 338395	2018-07-31 15:53:03 +00:00
Andrea Di Biagio	159d252dca	[llvm-mca][docs] Always use `llvm-mca` in place of `MCA`. llvm-svn: 338394	2018-07-31 15:29:10 +00:00
Sanjay Patel	7524afd739	[InstCombine] regenerate checks and add tests for D50035; NFC llvm-svn: 338392	2018-07-31 15:07:32 +00:00
Anastasis Grammenos	ab84cfb37d	[DebugInfo][LCSSA] Preserve debug location in lcssa phis Summary: When inserting lcssa Phi Nodes in the exit block make sure to preserve the original instruction's DL. Reviewers: vsk Subscribers: JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D50009 llvm-svn: 338391	2018-07-31 14:54:52 +00:00
Hsiangkai Wang	fa008c0805	[DebugInfo] Generate DWARF debug information for labels. There are two forms for label debug information in DWARF format. 1. Labels in a non-inlined function: DW_TAG_label DW_AT_name DW_AT_decl_file DW_AT_decl_line DW_AT_low_pc 2. Labels in an inlined function: DW_TAG_label DW_AT_abstract_origin DW_AT_low_pc We will collect label information from DBG_LABEL. Before every DBG_LABEL, we will generate a temporary symbol to denote the location of the label. The symbol could be used to get DW_AT_low_pc afterwards. So, we create a mapping between 'inlined label' and DBG_LABEL MachineInstr in DebugHandlerBase. The DBG_LABEL in the mapping is used to query the symbol before it. The AbstractLabels in DwarfCompileUnit is used to process labels in inlined functions. We also keep a mapping between scope and labels in DwarfFile to help to generate correct tree structure of DIEs. It also generates label debug information under global isel. Differential Revision: https://reviews.llvm.org/D45556 llvm-svn: 338390	2018-07-31 14:48:32 +00:00
David Bolvansky	5e6da01e64	Revert Enrich inline messages llvm-svn: 338389	2018-07-31 14:47:22 +00:00
Sanjay Patel	15e5eea781	[InstCombine] auto-generate checks; NFC llvm-svn: 338388	2018-07-31 14:27:30 +00:00
David Bolvansky	c42a835009	Enrich inline messages Summary: This patch improves Inliner to provide causes/reasons for negative inline decisions. 1. It adds one new message field to InlineCost to report causes for Always and Never instances. All Never and Always instantiations must provide a simple message. 2. Several functions that used to return the inlining results as boolean are changed to return InlineResult which carries the cause for negative decision. 3. Changed remark printing and debug output messages to provide the additional messages and related inline cost. 4. Adjusted tests for changed printing. Patch by: yrouban (Yevgeny Rouban) Reviewers: craig.topper, sammccall, sgraenitz, NutshellySima, shchenz, chandlerc, apilipenko, javed.absar, tejohnson, dblaikie, sanjoy, eraman, xbolva00 Reviewed By: tejohnson, xbolva00 Subscribers: xbolva00, llvm-commits, arsenm, mehdi_amini, eraman, haicheng, steven_wu, dexonsmith Differential Revision: https://reviews.llvm.org/D49412 llvm-svn: 338387	2018-07-31 14:25:24 +00:00
Andrea Di Biagio	ec4329b38b	[llvm-mca] Remove README.txt A detailed description of the tool has been recently added by Matt to CommandGuide/llvm-mca.rst. File README.txt is now redundant and can be removed; all the relevant user-guide information has been improved and then moved to llvm-mca.rst. In future, we should add another .rst for the "llvm-mca developer manual" to provide information about: - llvm-mca internals. - How to add custom stages to the simulated pipeline. - How to provide extra processor info in the scheduling model to improve the analysis performed by llvm-mca. llvm-svn: 338386	2018-07-31 14:23:49 +00:00
John Brawn	96b2d39585	[MemDep] Use PhiValuesAnalysis to improve alias analysis results This is being done in order to make GVN able to better optimize certain inputs. MemDep doesn't use PhiValues directly, but does need to notify it when things get invalidated. Differential Revision: https://reviews.llvm.org/D48489 llvm-svn: 338384	2018-07-31 14:19:29 +00:00
David Bolvansky	e66d9fb924	[InstSimplify] Fold another Select with And/Or pattern Summary: Proof: https://rise4fun.com/Alive/L5J Reviewers: lebedev.ri, spatel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49975 llvm-svn: 338383	2018-07-31 14:17:15 +00:00
Matt Arsenault	80000a76ca	DAG: Fix PromoteFloatResult for fcanonicalize llvm-svn: 338382	2018-07-31 14:15:22 +00:00
Matt Arsenault	9f1d74ae90	AMDGPU: Don't handle FP16_TO_FP in isCanonicalized This needs more special handling to do correctly. Fixes test in subsequent commit. llvm-svn: 338381	2018-07-31 14:15:16 +00:00
Alexey Bataev	97ffeb4b92	[SLP] Fix PR38339: Instruction does not dominate all uses! Summary: If the ExtractElement instructions can be optimized out during the vectorization and we need to reshuffle the parent vector, this ShuffleInstruction may be inserted in the wrong place causing compiler to produce incorrect code. Reviewers: spatel, RKSimon, mkuper, hfinkel, javed.absar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49928 llvm-svn: 338380	2018-07-31 14:02:43 +00:00
Matt Arsenault	55c4b0aada	AMDGPU: Fold undef fcanonicalize to qNaN We could choose a free 0 for this, but this matches the behavior for fmul undef, 1.0. Also, the NaN use is more useful for folding use operations although if it's not eliminated it is more expensive in terms of code size. llvm-svn: 338376	2018-07-31 13:34:31 +00:00
Matt Arsenault	4c4fc5d01d	AMDGPU: Fix test check line bugs llvm-svn: 338374	2018-07-31 13:25:23 +00:00
Peter Smith	b4654f42a6	[ARM] Complete enumeration values for Tag_ABI_VFP_args The LLD implementation of Tag_ABI_VFP_args needs to check the rarely seen values of 3 (toolchain specific) and 4 (compatible with both Base and VFP). Add the missing enumeration values so that LLD can refer to them without having to use the raw numbers. Differential Revision: https://reviews.llvm.org/D50049 llvm-svn: 338373	2018-07-31 13:24:49 +00:00
Andrea Di Biagio	0e53532aeb	[llvm-mca][BtVer2] Teach how to identify dependency-breaking idioms. This patch teaches llvm-mca how to identify dependency breaking instructions on btver2. An example of dependency breaking instructions is the zero-idiom XOR (example: `XOR %eax, %eax`), which always generates zero regardless of the actual value of the input register operands. Dependency breaking instructions don't have to wait on their input register operands before executing. This is because the computation is not dependent on the inputs. Not all dependency breaking idioms are also zero-latency instructions. For example, `CMPEQ %xmm1, %xmm1` is independent of the value of XMM1, and it generates a vector of all-ones. That instruction is not eliminated at register renaming stage, and its opcode is issued to a pipeline for execution. So, the latency is not zero. This patch adds a new method named isDependencyBreaking() to the MCInstrAnalysis interface. That method takes as input an instruction (i.e. MCInst) and a MCSubtargetInfo. The default implementation of isDependencyBreaking() conservatively returns false for all instructions. Targets may override the default behavior for specific CPUs, and return a value which better matches the subtarget behavior. In future, we should teach TableGen how to automatically generate the body of isDependencyBreaking from scheduling predicate definitions. This would allow us to expose the knowledge about dependency breaking instructions to the machine schedulers (and, potentially, other codegen passes). Differential Revision: https://reviews.llvm.org/D49310 llvm-svn: 338372	2018-07-31 13:21:43 +00:00
Peter Smith	660c9806fe	[ELF][ARM] Add Arm ABI names for float ABI ELF Header flags The ELF for the Arm architecture document defines, for EF_ARM_EABI_VER5 and above, the flags EF_ARM_ABI_FLOAT_HARD and EF_ARM_ABI_FLOAT_SOFT. These have been defined to be compatible with the existing EF_ARM_VFP_FLOAT and EF_ARM_SOFT_FLOAT used by gcc for EF_ARM_EABI_UNKNOWN. This patch adds the flags in addition to the existing ones so that any code depending on the old names will still work. Differential Revision: https://reviews.llvm.org/D49992 llvm-svn: 338370	2018-07-31 13:03:54 +00:00
Simon Pilgrim	6a5f232f8f	Revert r338365: [X86] Improved sched models for X86 BT*rr instructions. https://reviews.llvm.org/D49243 Contains WIP code that should not have been included. llvm-svn: 338369	2018-07-31 13:00:51 +00:00
Jonas Paulsson	e4386158ff	[SystemZ] Improve decoding in case of instructions with four register operands. Since z13, the max group size will be 2 if any μop has more than 3 register sources. This has been ignored so far in the SystemZHazardRecognizer, but is now handled by recognizing those instructions and adjusting the tracking of decoding and the cost heuristic for grouping. Review: Ulrich Weigand https://reviews.llvm.org/D49847 llvm-svn: 338368	2018-07-31 13:00:42 +00:00
Sanjay Patel	6eedd6a0ef	[InstCombine] simplify code for A & (A ^ B) --> A & ~B This fold was written in an odd way and tried to avoid an endless loop by bailing out on all constants instead of the supposedly problematic case of -1. But (X & -1) should always be simplified before we reach here, so I'm not sure how that is a problem. There were no tests for the commuted patterns, so I added those at rL338364. llvm-svn: 338367	2018-07-31 13:00:03 +00:00
Andrew V. Tischenko	fac48f4efe	[X86] Improved sched models for X86 BT*rr instructions. https://reviews.llvm.org/D49243 llvm-svn: 338365	2018-07-31 12:33:48 +00:00
Sanjay Patel	c0ea535a72	[InstCombine] move/add tests for xor+add fold; NFC llvm-svn: 338364	2018-07-31 12:31:00 +00:00
Andrew V. Tischenko	e6ccc4e407	[X86] Improved sched models for X86 SHLD/SHRD* instructions. Differential Revision: https://reviews.llvm.org/D9611 llvm-svn: 338359	2018-07-31 10:14:43 +00:00
Simon Pilgrim	3ee9dc2580	[X86][SSE] isFNEG - Use getTargetConstantBitsFromNode to handle all constant cases isFNEG was duplicating much of what was done by getTargetConstantBitsFromNode in its own calls to getTargetConstantFromNode. Noticed while reviewing D48467. llvm-svn: 338358	2018-07-31 10:13:17 +00:00
Martin Storsjo	61c7006a9d	[ARM] Allow automatically deducing the thumb instruction size for .inst This matches GAS, that allows unsuffixed .inst for thumb. Differential Revision: https://reviews.llvm.org/D49937 llvm-svn: 338357	2018-07-31 09:27:07 +00:00
Martin Storsjo	a2522f2dc9	[ARM] Support the .inst directive for MachO and COFF targets Contrary to ELF, we don't add any markers that distinguish data generated with .short/.long from normal instructions, so the .inst directive only adds compatibility with assembly that uses it. Differential Revision: https://reviews.llvm.org/D49936 llvm-svn: 338356	2018-07-31 09:27:01 +00:00
Martin Storsjo	2a982f7d52	[AArch64] Support the .inst directive for MachO and COFF targets Contrary to ELF, we don't add any markers that distinguish data generated with .long from normal instructions, so the .inst directive only adds compatibility with assembly that uses it. Differential Revision: https://reviews.llvm.org/D49935 llvm-svn: 338355	2018-07-31 09:26:52 +00:00
Sam Parker	c7c267ed61	[ARM] Revert r337821 Re-enabling ARMCodeGenPrepare by default after failing to reproduce the bootstrap issues that I was concerned it was causing. llvm-svn: 338354	2018-07-31 09:04:14 +00:00
Hsiangkai Wang	8f9880b219	Test commit. llvm-svn: 338352	2018-07-31 06:09:29 +00:00
Hiroshi Inoue	d3a99c79a3	[InstSimplify] tests for D48828, D49981: fold extraction from std::pair Minor touch up in the previous comment. llvm-svn: 338351	2018-07-31 05:29:20 +00:00
Hiroshi Inoue	d639a24157	[InstSimplify] tests for D48828, D49981: fold extraction from std::pair Updated unit tests for D48828 and D49981. llvm-svn: 338350	2018-07-31 05:10:36 +00:00
Max Kazantsev	882b66c3c6	[NFC] Collect statistics in GuardWidening llvm-svn: 338348	2018-07-31 04:37:11 +00:00
Diego Caballero	85914b77e9	[VPlan] Introduce VPLoopInfo analysis. The patch introduces loop analysis (VPLoopInfo/VPLoop) for VPBlockBases. This analysis will be necessary to perform some H-CFG transformations and detect and introduce regions representing a loop in the H-CFG. Reviewers: fhahn, rengolin, mkuper, hfinkel, mssimpso Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D48816 llvm-svn: 338346	2018-07-31 01:57:29 +00:00
Reid Kleckner	a93cf55d3c	Revert r338340 "[MS Demangler] Better demangling of template arguments." Breaks the build with GCC, apparently. llvm-svn: 338344	2018-07-31 01:08:42 +00:00
Craig Topper	37056c888c	[X86] Stop accidentally running the Bonnell LEA fixup path on Goldmont. In one place we checked X86Subtarget.slowLEA() to decide if the pass should run. But to decide what the pass should do we only check isSLM. This resulted in Goldmont going down the Bonnell path. llvm-svn: 338342	2018-07-31 00:43:54 +00:00
Ana Pazos	867e72f9da	[RISCV] Fixed test case failure due to r338047 llvm-svn: 338341	2018-07-31 00:36:28 +00:00
Zachary Turner	41bfb8ed38	[MS Demangler] Better demangling of template arguments. This patch fixes demangling of template aliases as template-template arguments, and also fixes function pointers and references as non-type template parameters. All of these can be properly demangled now, so I've ported over the test clang/test/CodeGenCXX/ms-template-callbacks.cpp. All of these tests pass. llvm-svn: 338340	2018-07-31 00:26:52 +00:00
Amara Emerson	6a47e74b23	[AArch64][GlobalISel] Add isel support for G_BLOCK_ADDR. Also refactors some existing code to materialize addresses for the large code model so it can be shared between G_GLOBAL_VALUE and G_BLOCK_ADDR. This implements PR36390. Differential Revision: https://reviews.llvm.org/D49903 llvm-svn: 338337	2018-07-31 00:09:02 +00:00
Amara Emerson	d6b9e05a84	[AArch64][GlobalISel] Make G_BLOCK_ADDR legal. Differential Revision: https://reviews.llvm.org/D49902 llvm-svn: 338336	2018-07-31 00:08:56 +00:00
Amara Emerson	836debbb53	[GlobalISel] Add a G_BLOCK_ADDR opcode to handle IR blockaddress constants. Differential Revision: https://reviews.llvm.org/D49900 llvm-svn: 338335	2018-07-31 00:08:50 +00:00
Zachary Turner	49c01d48fd	[MS Demangler] Add ms-return-qualifiers.test. This is a copy of the tests from clang/CodeGenCXX/ms-return-qualifiers.cpp converted to demangling tests. llvm-svn: 338330	2018-07-30 23:22:39 +00:00

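The `[Dominators] Make slow walks shorter` entry (26da35684f) in the log above describes stopping the upward walk once the level of the node being checked against is reached. A minimal Python sketch of that idea follows. It is an illustration only, not the LLVM implementation: the `DomTreeNode` class and the `dominates_slow` function are hypothetical names, and each node is assumed to store its immediate dominator and its depth in the tree.

```python
class DomTreeNode:
    """Hypothetical dominator tree node: `idom` is the parent, `level` the depth."""

    def __init__(self, name, idom=None):
        self.name = name
        self.idom = idom
        # The root sits at level 0; every child is one level below its idom.
        self.level = 0 if idom is None else idom.level + 1


def dominates_slow(a, b):
    """Return True if `a` dominates `b`, using only idom links (no DFS numbers).

    The walk starts at `b` and climbs towards the root, but stops as soon as
    it reaches `a`'s level: a node cannot dominate anything higher in the tree.
    """
    node = b
    while node is not None and node.level > a.level:
        node = node.idom
    # Either we landed on `a` itself, or `a` is not on the path to the root.
    return node is a
```

For a chain `entry -> x -> y -> z`, `dominates_slow(x, z)` climbs two links and stops at level 1 instead of continuing to the root.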

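The `[X86] Add pattern matching for PMADDUBSW` entry (80b2ac8a74) in the log above notes that one operand is treated as signed, the other as unsigned, and that the sum of the two products saturates to 16 bits. The sketch below models that arithmetic on plain Python lists. The function name `pmaddubsw_reference` is hypothetical, and the inputs are assumed to already lie in the int8 and uint8 ranges; it mirrors the C loop in the commit message rather than any LLVM code.

```python
INT16_MIN = -32768
INT16_MAX = 32767


def pmaddubsw_reference(signed_bytes, unsigned_bytes):
    """Multiply adjacent signed/unsigned byte pairs, add them, saturate to int16.

    Assumes `signed_bytes` holds values in [-128, 127] and `unsigned_bytes`
    holds values in [0, 255]; both lists have the same even length.
    """
    if len(signed_bytes) != len(unsigned_bytes) or len(signed_bytes) % 2 != 0:
        raise ValueError("inputs must have the same even length")
    result = []
    for i in range(0, len(signed_bytes), 2):
        total = (signed_bytes[i] * unsigned_bytes[i]
                 + signed_bytes[i + 1] * unsigned_bytes[i + 1])
        # Saturate instead of wrapping around, as the commit message describes.
        result.append(max(INT16_MIN, min(INT16_MAX, total)))
    return result
```

With both signed inputs at 127 and both unsigned inputs at 255, the exact sum is 64770, so the result clamps to 32767.
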
167425 Commits
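
The `[InstCombine] simplify code for A & (A ^ B) --> A & ~B` entry (6eedd6a0ef) in the log above relies on a bitwise identity. As a quick sanity check, the sketch below verifies it exhaustively for 8-bit values. The helper names and the 8-bit width are assumptions made for illustration; the fold itself applies per bit, so the width does not affect the result.

```python
WIDTH_MASK = 0xFF  # assumed 8-bit width, chosen only to keep the check small


def fold_holds(a, b):
    """Check that A & (A ^ B) equals A & ~B for one pair of 8-bit values."""
    original = a & (a ^ b)
    folded = a & (~b & WIDTH_MASK)  # mask keeps ~b within the assumed width
    return original == folded


def fold_holds_for_all_8bit():
    """Exhaustively check every pair of 8-bit operands."""
    return all(fold_holds(a, b) for a in range(256) for b in range(256))
```

Per bit, when the bit of A is 0 both sides are 0, and when it is 1 both sides reduce to the inverted bit of B, which is why the exhaustive check succeeds.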