llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Krzysztof Parzyszek	b2def67068	[Hexagon] Implement HVX codegen for vector shifts llvm-svn: 323914	2018-01-31 20:49:24 +00:00
Marek Olsak	47a6a5c619	[SeparateConstOffsetFromGEP] Fix up addrspace in the AMDGPU test llvm-svn: 323913	2018-01-31 20:49:19 +00:00
Krzysztof Parzyszek	ff64aa793a	[Hexagon] Handle ANY_EXTEND_VECTOR_INREG in lowering llvm-svn: 323912	2018-01-31 20:48:11 +00:00
Krzysztof Parzyszek	d369dd4439	[Hexagon] Handle SETCC on vector pairs in lowering llvm-svn: 323911	2018-01-31 20:46:55 +00:00
Eli Friedman	75b50e6d60	[GlobalOpt] Fix exponential compile-time with selects. If you have a long chain of select instructions created from something like `int* p = &g; if (foo()) p += 4; if (foo2()) p += 4;` etc., a naive recursive visitor will recursively visit each select twice, which is O(2^N) in the number of select instructions. Use the visited set to cut off recursion in this case. (No testcase because this doesn't actually change the behavior, just the time.) Differential Revision: https://reviews.llvm.org/D42451 llvm-svn: 323910	2018-01-31 20:42:25 +00:00
Marek Olsak	09e2179bbf	AMDGPU: Fold inline offset for loads properly in moveToVALU on GFX9 Summary: This enables load merging into x2, x4, which is driven by inline offsets. 6500 shaders are affected: Code Size in affected shaders: -15.14 % Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D42078 llvm-svn: 323909	2018-01-31 20:18:11 +00:00
Marek Olsak	1325dff7c0	AMDGPU: Add intrinsics llvm.amdgcn.cvt.{pknorm.i16, pknorm.u16, pk.i16, pk.u16} Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D41663 llvm-svn: 323908	2018-01-31 20:18:04 +00:00
Marek Olsak	2608200700	[SeparateConstOffsetFromGEP] Preserve metadata when splitting GEPs Summary: !amdgpu.uniform needs to be preserved for AMDGPU, otherwise bad things happen. Reviewers: arsenm, nhaehnle, jingyue, broune, majnemer, bjarke.roune, dblaikie Subscribers: wdng, tpr, llvm-commits Differential Revision: https://reviews.llvm.org/D42744 llvm-svn: 323907	2018-01-31 20:17:52 +00:00
Geoff Berry	4f06f37de9	[MachineOutliner] Freeze registers in new functions Summary: Call MRI.freezeReservedRegs() on functions created during outlining so that calls to isReserved() by the verifier called after this pass won't assert. Reviewers: MatzeB, qcolombet, paquette Subscribers: mcrosier, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D42749 llvm-svn: 323905	2018-01-31 20:15:16 +00:00
Sam Clegg	874b9d04f1	[WebAssembly] MC: Remove unused code for handling of wasm globals For now, we are not using wasm globals, except for modeling of the stack points. Also, factor out common struct WasmGlobalType, which matches the name for that tuple in the Wasm spec and rename methods to "isBindingGlobal", "isTypeGlobal" to avoid ambiguity. Patch by Nicholas Wilson! Differential Revision: https://reviews.llvm.org/D42750 llvm-svn: 323901	2018-01-31 19:50:14 +00:00
Sam Clegg	16ab5ea30d	[WebAssembly] MC: Resolve aliases when creating provisional table entries This change is useful for the upcoming addition of the symbol table (D41954) since in that world aliases for a given function all share the same function index. This change does not affect lld because it essentially ignores the wasm "table". The table exists only so that the wasm objects will validate and disassemble meaningfully. Patch by Nicholas Wilson! Differential Revision: https://reviews.llvm.org/D42095 llvm-svn: 323900	2018-01-31 19:28:47 +00:00
Amaury Sechet	d76ae2bede	[X86] Generate testl instruction through truncates. Summary: This was introduced in D42646 but ended up being reverted because the original implementation was buggy. Depends on D42646 Reviewers: craig.topper, niravd, spatel, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42741 llvm-svn: 323899	2018-01-31 19:20:06 +00:00
Chih-Hung Hsieh	598a327133	[Analysis] Disable calls to *_finite and other glibc-only functions on Android. Since r322087, glibc's finite lib calls are generated when possible. However, they are not supported on Android. This change also disables other functions not available on Android. Differential Revision: http://reviews.llvm.org/D42668 llvm-svn: 323898	2018-01-31 19:12:50 +00:00
Max Moroz	556771566c	[llvm-cov] Improvements for summary report generated in HTML format. Summary: This commit adds the following changes: 1) coverage numbers are aligned to the left and padded with spaces in order to provide better readability for percentage values, e.g.: ``` file1 \| 89.13% (123 / 2323) \| 100.00% (55 / 55) \| 9.33% (14545 / 234234) file_asda \| 1.78% ( 23 / 4323) \| 32.31% (555 / 6555) \| 67.89% (1545 / 2234) fileXXX \| 100.00% (12323 / 12323) \| 100.00% (555 / 555) \| 100.00% (12345 / 12345) ``` 2) added "hover" attribute to CSS for highlighting table row under mouse cursor see screenshot attached to the phabricator review page {F5764813} 3) table title row and "totals" row now use bold text Reviewers: vsk, morehouse Reviewed By: vsk Subscribers: kcc, llvm-commits Differential Revision: https://reviews.llvm.org/D42093 llvm-svn: 323892	2018-01-31 17:37:21 +00:00
Daniel Neilson	ba6797e185	[CodeGenPrepare] Improve source and dest alignments of memory intrinsics independently Summary: This change is part of step five in the series of changes to remove alignment argument from memcpy/memmove/memset in favour of alignment attributes. In particular, this changes the CodeGenPrepare pass to be more aggressive in improving the source and destination alignments of memcpy/memmove/memset by exploiting our new ability to record independent alignments for each argument. Steps: Step 1) Remove alignment parameter and create alignment parameter attributes for memcpy/memmove/memset. ( rL322965, rC322964, rL322963 ) Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing source and dest alignments. ( rL323597 ) Step 3) Update Clang to use the new IRBuilder API. ( rC323617 ) Step 4) Update Polly to use the new IRBuilder API. ( rL323618 ) Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API, and those that use MemIntrinsicInst::[get\|set]Alignment() to use [get\|set]DestAlignment() and [get\|set]SourceAlignment() instead. ( rL323886 ) Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the MemIntrinsicInst::[get\|set]Alignment() methods. Reference http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html llvm-svn: 323891	2018-01-31 17:24:53 +00:00
Krzysztof Parzyszek	b0c5fda27c	[Hexagon] Handle BUILD_VECTOR from undef values in buildHvxVectorReg llvm-svn: 323889	2018-01-31 16:52:15 +00:00
Amaury Sechet	33cce86cf0	[X86] Avoid using high register trick for test instruction Summary: It seems its main effect is to create additional copies when values are in registers that do not support this trick, which increases register pressure and makes the code bigger. Reviewers: craig.topper, niravd, spatel, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42646 llvm-svn: 323888	2018-01-31 16:48:54 +00:00
Krzysztof Parzyszek	f31df17b9d	[Hexagon] Only process bitcasts of vsplats when selecting const vectors Selecting of constant HVX vectors involves some "manual processing", which mishandled an unrelated BITCAST operation causing a selection error. llvm-svn: 323887	2018-01-31 16:48:20 +00:00
Daniel Neilson	dd97f56777	[Lint] Upgrade uses of MemoryIntrinic::getAlignment() to new API. (NFCI) Summary: This change is part of step five in the series of changes to remove alignment argument from memcpy/memmove/memset in favour of alignment attributes. In particular, this changes the Lint analysis to cease using the old getAlignment() API of MemoryIntrinsic in favour of getting source & dest specific alignments through the new API. Steps: Step 1) Remove alignment parameter and create alignment parameter attributes for memcpy/memmove/memset. ( rL322965, rC322964, rL322963 ) Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing source and dest alignments. ( rL323597 ) Step 3) Update Clang to use the new IRBuilder API. ( rC323617 ) Step 4) Update Polly to use the new IRBuilder API. ( rL323618 ) Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API, and those that use MemIntrinsicInst::[get\|set]Alignment() to use [get\|set]DestAlignment() and [get\|set]SourceAlignment() instead. Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the MemIntrinsicInst::[get\|set]Alignment() methods. Reference http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html llvm-svn: 323886	2018-01-31 16:42:15 +00:00
Petar Jovanovic	f59a462ea6	[DWARF] Allow duplication of tails with CFI instructions This commit came as a result of the revert of patch r317579 (originally committed as r317100). The patch made CFI instructions duplicable, because their existence in the epilogue block was affecting the Tail duplication pass. However, duplicating blocks with CFI instructions was an issue for compact unwind info on Darwin, which is why the patch was reverted. This patch allows duplicating tails with CFI instructions, though they are not duplicable, by copying them 'manually'. Patch by Djordje Kovacevic. Differential Revision: https://reviews.llvm.org/D40979 llvm-svn: 323883	2018-01-31 15:57:57 +00:00
Sanjay Patel	2ffd9394fb	[InstCombine] move related tests into the same file; NFC llvm-svn: 323882	2018-01-31 15:47:59 +00:00
Sanjay Patel	2cc357b128	[InstCombine] add tests to show limit of canEvaluate* ; NFC llvm-svn: 323881	2018-01-31 15:28:39 +00:00
Nirav Dave	0e452c9531	[DAG] Prevent NodeId pruning of TokenFactors in Instruction Selection. Summary: Instruction Selection preserves relative orders of all nodes save TokenFactors which we treat specially. As a result Node Ids for TokenFactors may violate the topological ordering and should not be considered as valid pruning candidates in predecessor search. Fixes PR35316. Reviewers: RKSimon, hfinkel Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D42701 llvm-svn: 323880	2018-01-31 15:23:17 +00:00
Diana Picus	079ff9a80c	Fix formatting for r323876. NFC llvm-svn: 323878	2018-01-31 15:16:17 +00:00
Sanjay Patel	d02cedcdd4	[InstCombine] reduce code duplication for canEvaluate* functions; NFCI We'd have to make the change suggested in D42536 3x otherwise. llvm-svn: 323877	2018-01-31 14:55:53 +00:00
Diana Picus	102045d8d1	[ARM GlobalISel] Modernize LegalizerInfo. NFCI Start using the new LegalizerInfo API introduced in r323681. Keep the old API for opcodes that need Lowering in some circumstances (G_FNEG and G_UREM/G_SREM). llvm-svn: 323876	2018-01-31 14:55:07 +00:00
Marina Yatsina	abc8a611c3	Fix build error in r323870 Change-Id: I15a8b27764a4d817cfbe48836bf09dc6520934b7 llvm-svn: 323874	2018-01-31 14:18:37 +00:00
Florian Hahn	23d2ec44ef	[MachineCombiner] Add check for optimal pattern order. In D41587, @mssimpso discovered that the order of some patterns for AArch64 was sub-optimal. I thought a bit about how we could avoid that case in the future. I do not think there is a need for evaluating all patterns for now. But this patch adds an extra (expensive) check, that evaluates the latencies of all patterns, and ensures that the latency saved decreases for subsequent patterns. This catches the sub-optimal order fixed in D41587, but I am not entirely happy with the check, as it only applies to sub-optimal patterns seen while building with EXPENSIVE_CHECKS on. It did not discover any other sub-optimal pattern ordering. Reviewers: Gerolf, spatel, mssimpso Reviewed By: Gerolf, mssimpso Differential Revision: https://reviews.llvm.org/D41766 llvm-svn: 323873	2018-01-31 13:54:30 +00:00
Marina Yatsina	13c308caeb	Take into account the cost of local intervals when selecting split candidate. When selecting a split candidate for region splitting, the register allocator tries to predict which candidate will have the cheapest spill cost. Global splitting may cause the creation of local intervals, and they might spill. This patch makes RA take into account the spill cost of local split intervals in use blocks (we already take into account the spill cost in through blocks). A flag ("-condsider-local-interval-cost") controls whether we do this advanced cost calculation (it's on by default for X86 target, off for the rest). Differential Revision: https://reviews.llvm.org/D41585 Change-Id: Icccb8ad2dbf13124f5d97a18c67d95aa6be0d14d llvm-svn: 323870	2018-01-31 13:31:08 +00:00
Pablo Barrio	b3f8010487	[ARM] Lower lower saturate to 0 and lower saturate to -1 using bit-operations Summary: Expressions of the form x < 0 ? 0 : x; and x < -1 ? -1 : x can be lowered using bit-operations instead of branching or conditional moves. In thumb-mode this results in a two-instruction sequence, a shift followed by a bic or or, while in ARM/thumb2 mode, which has a flexible second operand, the shift can be folded into a single bic/or instruction. In most cases this results in smaller code and possibly fewer branches, and in no case larger than before. Patch by Marten Svanfeldt. Reviewers: fhahn, pbarrio Reviewed By: pbarrio Subscribers: efriedma, rogfer01, aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42574 llvm-svn: 323869	2018-01-31 13:20:10 +00:00
Amaury Sechet	a8f98a46ba	Add a regression test for problems caused by D42646 . NFC llvm-svn: 323868	2018-01-31 13:02:01 +00:00
Jonas Paulsson	4da9407bd0	[SystemZ] Check the bitwidth before calling isInt/isUInt. Since these methods will assert if the integer does not fit into 64 bits, it is necessary to do this check before calling them in supportedAddressingMode(). Review: Ulrich Weigand. llvm-svn: 323866	2018-01-31 12:41:25 +00:00
Amjad Aboud	7938af2141	[AggressiveInstCombine] Make TruncCombine class ignore unreachable basic blocks. Because dead code may contain non-standard IR that causes infinite looping or crashes in underlying analysis. See PR36134 for more details. Differential Revision: https://reviews.llvm.org/D42683 llvm-svn: 323862	2018-01-31 10:41:31 +00:00
Sjoerd Meijer	a52592d874	[ARM] Armv8.2-A FP16 code generation (part 2/3) Half-precision arguments and return values are passed as if it were an int or float for ARM. This results in truncates and bitcasts to/from i16 and f16 values, which are legalized very early to stack stores/loads. When FullFP16 is enabled, we want to avoid codegen for these bitcasts as it is unnecessary and inefficient. Differential Revision: https://reviews.llvm.org/D42580 llvm-svn: 323861	2018-01-31 10:18:29 +00:00
Jonas Paulsson	04a61f5c33	[PowerPC] Return true in enableMultipleCopyHints(). Enable multiple COPY hints to eliminate more COPYs during register allocation. Note that this is something all targets should do, see https://reviews.llvm.org/D38128. Review: Nemanja Ivanovic llvm-svn: 323858	2018-01-31 09:26:51 +00:00
Roger Ferrer Ibanez	a8d946115e	[ARM] Allow the scheduler to clone a node with glue to avoid a copy CPSR ↔ GPR. In Thumb 1, with the new ADDCARRY / SUBCARRY the scheduler may need to do copies CPSR ↔ GPR but not all Thumb1 targets implement them. The scheduler can attempt, before attempting a copy, to clone the instructions but it does not currently do that for nodes with input glue. In this patch we introduce a target-hook to let the hook decide if a glued machinenode is still eligible for copying. In this case these are ARM::tADCS and ARM::tSBCS . As a follow-up of this change we should actually implement the copies for the Thumb1 targets that do implement them and restrict the hook to the targets that can't really do such a copy, as these clones are not ideal. This change fixes PR35836. Differential Revision: https://reviews.llvm.org/D42051 llvm-svn: 323857	2018-01-31 09:23:43 +00:00
Justin Bogner	4c0185c682	Mark two tests REQUIRES: x86-registered-backend These were introduced in r323783 and use an X86 triple. I'll follow up on the list to check if it would make more sense to remove the triple and mark them REQUIRES: default_triple instead. llvm-svn: 323847	2018-01-31 07:32:03 +00:00
Sam Clegg	12d8c60c4d	[WebAssembly] Remove some unused code and tidy logging. NFC. Patch by Nicholas Wilson! Differential Revision: https://reviews.llvm.org/D42540 llvm-svn: 323846	2018-01-31 04:21:44 +00:00
Peter Collingbourne	3ce04ea33e	LTO: Drop comdats when converting definitions to declarations. Differential Revision: https://reviews.llvm.org/D42715 llvm-svn: 323844	2018-01-31 02:51:03 +00:00
Eli Friedman	1e5d5a9573	Revert r323559 due to EXPENSIVE_CHECKS regression. I have a fix for the issue (https://reviews.llvm.org/D42655) but it's taking a while to get reviewed, so reverting in the meantime. llvm-svn: 323841	2018-01-31 00:40:42 +00:00
Craig Topper	fba47d3dbf	[X86] Add more madd reduction tests with wider vectors. We had no test case exercising 512-bit vpmaddwd usage. llvm-svn: 323840	2018-01-31 00:30:32 +00:00
Saleem Abdulrasool	dbc48913a8	docs: wordsmith some of the linker option extension Apply the grammatical improvements suggested by Rafael Avila de Espindola in post-commit review. llvm-svn: 323839	2018-01-31 00:16:23 +00:00
Kevin Enderby	d55cebfaec	llvm-nm should show a symbol type of T for symbols in the (__TEXT_EXEC,__text) section. When the Apple link editor builds a kext bundle file type and the value of the -miphoneos-version-min argument is significantly current (like 11.0) then the (__TEXT,__text) section is changed to the (__TEXT_EXEC,__text) section. So it would be nice for llvm-nm to show symbols in that section with a type of T instead of the generic type of S for some section other than text, data, etc. rdar://36262205 llvm-svn: 323836	2018-01-31 00:00:41 +00:00
Tony Tye	8b0cbc46f7	[AMDGPU] Update relocation documentation and elf flag machine architecture numbers Differential Revision: https://reviews.llvm.org/D42714 llvm-svn: 323835	2018-01-30 23:59:43 +00:00
Rui Ueyama	01de3d8363	Rename path to libpath in .linker-options. "path" is too generic a name for -L or --library-path because a lot of linker options take paths as arguments. This change renames the option to avoid confusion. Differential Revision: https://reviews.llvm.org/D42705 llvm-svn: 323833	2018-01-30 23:49:27 +00:00
Krzysztof Parzyszek	9608029db8	[RDF] Clear the renamable flag when copy propagating reserved registers llvm-svn: 323831	2018-01-30 23:19:44 +00:00
Tony Tye	b2b28fd7ec	[AMDGPU] Clarify ReqdWorkGroupSize and MaxFlatWorkGroupSize metadata - If ReqdWorkGroupSize is present it must have all elements >=1. - MaxFlatWorkGroupSize must be consistent with ReqdWorkGroupSize. - Remove FixedWorkGroupSize as now equivalent to ReqdWorkGroupSize. llvm-svn: 323829	2018-01-30 23:07:10 +00:00
Yaxun Liu	8dc80efd4f	LLParser: add an argument for overriding data layout and do not check alloca addr space Sometimes users do not specify data layout in LLVM assembly and let llc set the data layout by target triple after loading the LLVM assembly. Currently the parser checks alloca address space no matter whether the LLVM assembly contains data layout definition, which causes a false alarm since the default data layout does not contain the correct alloca address space. The parser also calls the verifier to check debug info and update invalid debug info. Currently there is no way to let the verifier check debug info only. If the verifier finds non-debug-info issues the parser will fail. For llc, the fix is to remove the check of alloca addr space in the parser and disable updating debug info, and defer the updating of debug info and verification to be after setting data layout of the IR by target. For other llvm tools, since they do not override data layout by target but instead can override data layout by a command line option, an argument for overriding data layout is added to the parser. In cases where data layout overriding is necessary for the parser, the data layout can be provided by command line. Differential Revision: https://reviews.llvm.org/D41832 llvm-svn: 323826	2018-01-30 22:32:39 +00:00
Krzysztof Parzyszek	3dc186ecf5	[Hexagon] Handle truncates in polynomial multiply idiom recognition This is in anticipation of https://reviews.llvm.org/D42424, which would otherwise break one of the pmpy testcases. llvm-svn: 323824	2018-01-30 22:03:59 +00:00
Craig Topper	40bb1d34f5	[X86] Remove redundant check for hasAVX512 before calling hasBWI. NFC hasBWI implies hasAVX512. llvm-svn: 323823	2018-01-30 21:53:35 +00:00
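
The visited-set fix described in r323910 ([GlobalOpt] Fix exponential compile-time with selects) can be illustrated with a small standalone sketch. This is not the GlobalOpt code: `Node`, `countVisitsNaive`, `countVisitsMemo`, and `buildChain` are hypothetical names, and the chain is a simplified model in which both operands of each select lead back to the previous select.

```cpp
#include <cstddef>
#include <unordered_set>
#include <vector>

// Hypothetical stand-in for a select instruction. Both operands may lead to
// the same earlier node, as in a chain of `if (foo()) p += 4;` statements.
struct Node {
  const Node *TrueVal = nullptr;
  const Node *FalseVal = nullptr;
};

// Naive recursive visitor: shared operands are walked again on every path,
// so the number of visits roughly doubles with each level of the chain.
std::size_t countVisitsNaive(const Node *N) {
  if (!N)
    return 0;
  return 1 + countVisitsNaive(N->TrueVal) + countVisitsNaive(N->FalseVal);
}

// Visitor with a visited set: a node that was already seen is not expanded
// again, so the total work is linear in the number of nodes.
std::size_t countVisitsMemo(const Node *N,
                            std::unordered_set<const Node *> &Visited) {
  if (!N || !Visited.insert(N).second)
    return 0;
  return 1 + countVisitsMemo(N->TrueVal, Visited) +
         countVisitsMemo(N->FalseVal, Visited);
}

// Builds a chain of Depth nodes in which node I uses node I-1 for both
// operands. The vector is sized up front, so the stored pointers stay valid.
std::vector<Node> buildChain(std::size_t Depth) {
  std::vector<Node> Chain(Depth);
  for (std::size_t I = 1; I < Depth; ++I) {
    Chain[I].TrueVal = &Chain[I - 1];
    Chain[I].FalseVal = &Chain[I - 1];
  }
  return Chain;
}
```

For a chain of 10 nodes, the naive visitor performs 2^10 - 1 = 1023 visits starting from the last node, while the version with the visited set performs 10.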

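The bit-operation lowering described in r323869 ([ARM] Lower lower saturate to 0 and lower saturate to -1) can be written out at the source level as follows. This is an illustrative sketch, not the ARM backend code: the function names are hypothetical, and it assumes that `>>` on a negative `std::int32_t` is an arithmetic shift (guaranteed since C++20, implementation-defined before that).

```cpp
#include <cstdint>

// Assumption: X >> 31 is an arithmetic shift, giving -1 (all ones) when X is
// negative and 0 otherwise.

// x < 0 ? 0 : x, as a shift followed by an and-not (bic).
constexpr std::int32_t saturateLowToZero(std::int32_t X) {
  return X & ~(X >> 31);
}

// x < -1 ? -1 : x, as a shift followed by an or.
constexpr std::int32_t saturateLowToMinusOne(std::int32_t X) {
  return X | (X >> 31);
}

// Compile-time checks of the identities on a few representative values.
static_assert(saturateLowToZero(-7) == 0, "negative input clamps to 0");
static_assert(saturateLowToZero(0) == 0, "zero is unchanged");
static_assert(saturateLowToZero(42) == 42, "positive input is unchanged");
static_assert(saturateLowToMinusOne(-7) == -1, "input below -1 clamps to -1");
static_assert(saturateLowToMinusOne(-1) == -1, "-1 is unchanged");
static_assert(saturateLowToMinusOne(42) == 42, "positive input is unchanged");
```

Both functions are branch-free, which matches the commit's stated goal of avoiding branches and conditional moves.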

159602 Commits