llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Max Kazantsev	632293dfd7	[InstCombine][Test] Test for fix of replacing select with Phis when branch has the same labels An additional test that allows to check the correctness of handling the case of the same branch labels in the dominator when trying to replace select with phi-node. Patch By: Kirill Polushin Differential Revision: https://reviews.llvm.org/D84006 Reviewed By: mkazantsev	2020-07-17 17:16:28 +07:00
Jay Foad	3f23d4b8c3	[MachineScheduler] Fix the TopDepth/BotHeightReduce latency heuristics tryLatency compares two sched candidates. For the top zone it prefers the one with lesser depth, but only if that depth is greater than the total latency of the instructions we've already scheduled -- otherwise its latency would be hidden and there would be no stall. Unfortunately it only tests the depth of one of the candidates. This can lead to situations where the TopDepthReduce heuristic does not kick in, but a lower priority heuristic chooses the other candidate, whose depth is greater than the already scheduled latency, which causes a stall. The fix is to apply the heuristic if the depth of either candidate is greater than the already scheduled latency. All this also applies to the BotHeightReduce heuristic in the bottom zone. Differential Revision: https://reviews.llvm.org/D72392	2020-07-17 11:02:13 +01:00
Florian Hahn	42cb8128e2	[ScheduleDAG] Move DBG_VALUEs after first term forward. MBBs are not allowed to have non-terminator instructions after the first terminator. Currently in some cases (see the modified test), EmitSchedule can add DBG_VALUEs after the last terminator, for example when referring to a debug value that gets folded into a TCRETURN instruction on ARM. This patch updates EmitSchedule to move inserted DBG_VALUEs just before the first terminator. I am not sure if there are terminators that produce values that can in turn be used by a DBG_VALUE. In that case, moving the DBG_VALUE might result in referencing an undefined register. But in any case, it seems like currently there is no way to insert proper DBG_VALUEs for such registers anyways. Alternatively it might make sense to just remove those extra DBG_VALUES. I am not too familiar with the details of debug info in the backend and would appreciate any suggestions on how to address the issue in the best possible way. Reviewers: vsk, aprantl, jpaquette, efriedma, paquette Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D83561	2020-07-17 10:27:43 +01:00
Kai Luo	71e1d22d77	[PowerPC] Precommit test case for PR46759. NFC.	2020-07-17 08:41:15 +00:00
Marco Elver	155722cf56	[TSan] Add option for emitting compound read-write instrumentation This adds option -tsan-compound-read-before-write to emit different instrumentation for the write if the read before that write is omitted from instrumentation. The default TSan runtime currently does not support the different instrumentation, and the option is disabled by default. Alternative runtimes, such as the Kernel Concurrency Sanitizer (KCSAN) can make use of the feature. Indeed, the initial motivation is for use in KCSAN as it was determined that due to the Linux kernel having a large number of unaddressed data races, it makes sense to improve performance and reporting by distinguishing compounded operations. E.g. the compounded instrumentation is typically emitted for compound operations such as ++, +=, |=, etc. By emitting different reports, such data races can easily be noticed, and also automatically bucketed differently by CI systems. Reviewed By: dvyukov, glider Tags: #llvm Differential Revision: https://reviews.llvm.org/D83867	2020-07-17 10:24:20 +02:00
Simon Wallis	071d6b613d	[ARM] halfword store hits llvm_unreachable with big-endian Summary: [ARM] halfword store hits llvm_unreachable with big-endian Provide missing case in getFixupKindContainerSizeBytes(). This stops execution reaching llvm_unreachable("Unknown fixup kind!") D83947 Reviewers: olista01, ostannard Reviewed By: ostannard Subscribers: ostannard, kristof.beyls, hiraditya, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83947 Change-Id: I598aa1fb51fd1c6f424c557c85d6df6d1958bc62	2020-07-17 08:56:44 +01:00
Max Kazantsev	e9d1a19025	[InstCombine] Fix replace select with Phis when branch has the same labels ``` define i32 @test(i1 %cond) { entry: br i1 %cond, label %exit, label %exit exit: %result = select i1 %cond, i32 123, i32 456 ret i32 %result } ``` In this test, after applying transformation of replacing select with Phis, the result will be: ``` define i32 @test(i1 %cond) { entry: br i1 %cond, label %exit, label %exit exit: %result = i32 phi [123, %exit], [123, %exit] ret i32 %result } ``` That is, select is transformed into an invalid Phi, which will then be reduced to 123 and the second value will be lost. But it is worth noting that this problem will arise only if the select comes before the branch in the InstCombine worklist. Otherwise, InstCombine will replace the branch condition with false and the transformation will not be applied. The fix is to check the target labels in the branch condition for equality. Patch By: Kirill Polushin Differential Revision: https://reviews.llvm.org/D84003 Reviewed By: mkazantsev	2020-07-17 14:04:58 +07:00
hsmahesha	95efa5a025	Revert "[AMDGPU/MemOpsCluster] Implement new heuristic for computing max mem ops cluster size" This reverts commit cc9d69385659be32178506a38b4f2e112ed01ad4.	2020-07-17 12:20:37 +05:30
Igor Kudrin	da3ec54200	[DebugInfo] Fix a misleading usage of DWARF forms with DIEExpr. NFCI. For now, DIEExpr is used only in two places: 1) in the debug info library unit test suite to emit a DW_AT_str_offsets_base attribute with the DW_FORM_sec_offset form, see dwarfgen::DIE::addStrOffsetsBaseAttribute(); 2) in DwarfCompileUnit::addLocationAttribute() to generate the location attribute for a TLS variable. The latter case used an incorrect DWARF form of DW_FORM_udata, which implies storing an uleb128 value, not a 4/8 byte constant. The generated result was as expected because DIEExpr::SizeOf() did not handle the used form, but returned the size of the code pointer by default. The patch fixes the issue by using more appropriate DWARF forms for the problematic case and making DIEExpr::SizeOf() more straightforward. Differential Revision: https://reviews.llvm.org/D83958	2020-07-17 13:49:27 +07:00
Craig Topper	5326734b74	[X86] Change the scheduler model for 'pentium4' to SandyBridgeModel. I meant to do this in D83913, but missed it while updating the feature list. Interestingly I think this is disabling the postRA scheduler. But it does match our default 64-bit behavior. Reviewed By: echristo Differential Revision: https://reviews.llvm.org/D83996	2020-07-16 22:04:29 -07:00
Craig Topper	270ec5454e	[X86] Reorder how the subtarget map key is created. We use a SmallString<512> and attempted to reserve enough space for CPU plus Features, but that doesn't account for all the things that get added to the string. Reorder the string so the shortest things go first which shouldn't exceed the small size. Finally add the feature string at the end which might be long. This should ensure at most one heap allocation without needing to use reserve. I don't know if this matters much in practice, but I was looking into something else that will require more code here and noticed the odd reserve call.	2020-07-16 21:41:45 -07:00
Jonas Devlieghere	16a18d9f67	[llvm] Add RISCVTargetParser.def to the module map This fixes the modules build.	2020-07-16 21:39:13 -07:00
Juneyoung Lee	567f280d45	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison consider noundef This patch adds support for noundef arguments. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D83752	2020-07-17 12:53:08 +09:00
Juneyoung Lee	65e5933ae8	Add a test for D83752	2020-07-17 12:50:40 +09:00
Xing GUO	9e9b375895	[DWARFYAML] Merge forms that use same encodings. NFC.	2020-07-17 11:31:49 +08:00
Juneyoung Lee	a4af7cf1e9	[LangRef] Mention that freeze does not consider aggregate's paddings Make explicit that freeze does not touch paddings of an aggregate. (Relevant comment: https://reviews.llvm.org/D83752#2152550) This implies that `v = freeze(load p); store v, q` may still leave undef bits or poison in memory if `v` is an aggregate, but it still happens for non-byte integers such as i1. Differential Revision: https://reviews.llvm.org/D83927	2020-07-17 11:53:26 +09:00
Carl Ritson	679bb4a3ab	[AMDGPU] Translate s_and/s_andn2 to s_mov in vcc optimisation When SCC is dead, but VCC is required then replace s_and / s_andn2 with s_mov into VCC when mask value is 0 or -1. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D83850	2020-07-17 11:48:57 +09:00
LLVM GN Syncbot	9476f32c02	[gn build] Port 9870f77441c	2020-07-16 23:07:46 +00:00
LLVM GN Syncbot	12bd51a094	[gn build] Port 0f6220ddd6c	2020-07-16 23:07:45 +00:00
LLVM GN Syncbot	ff4f09f824	[gn build] Port 0e940d55f8a	2020-07-16 23:07:45 +00:00
Nico Weber	754d8450bf	[gn build] (manually) merge 9870f77441c	2020-07-16 19:07:28 -04:00
Lang Hames	6cda7d4909	[ORC] Switch from initializer lists to named arguments to work around MSVC. MSVC doesn't like some of the initializer list uses in 0e940d55f8a. Switch to named arguments to work around this.	2020-07-16 15:58:31 -07:00
Lang Hames	0a5b56b9cd	[ORC] Add more explicit casts to fix narrowing conversion errors.	2020-07-16 15:37:18 -07:00
Lang Hames	9358e00666	[ORC] Add explicit cast to fix a narrowing conversion error.	2020-07-16 15:33:02 -07:00
Albion Fung	b7134015be	[PowerPC][Power10] Add 128-bit Binary Integer Operation instruction definitions and MC Tests This patch adds the instruction definitions and MC tests for the 128-bit Binary Integer Operation instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D83516	2020-07-16 17:16:43 -05:00
Lang Hames	757f986da8	[ORC] Add TargetProcessControl and TPCIndirectionUtils APIs. TargetProcessControl is a new API for communicating with JIT target processes. It supports memory allocation and access, and inspection of some process properties, e.g. the target process triple and page size. Centralizing these APIs allows utilities written against TargetProcessControl to remain independent of the communication protocol with the target process (which may be direct memory access/allocation for in-process JITing, or may involve some form of IPC or RPC). An initial set of TargetProcessControl-based utilities for lazy compilation is provided by the TPCIndirectionUtils class. An initial implementation of TargetProcessControl for in-process JITing is provided by the SelfTargetProcessControl class. An example program showing how the APIs can be used is provided in llvm/examples/OrcV2Examples/LLJITWithTargetProcessControl.	2020-07-16 15:09:13 -07:00
Jon Roelofs	7f5ac9d171	[SimplifyCFG] Fix crash in the EXPENSIVE_CHECKS build SimplifyCFG was incorrectly reporting to the pass manager that it had not made changes after folding away a PHI. This is detected in the EXPENSIVE_CHECKS build when the function's hash changes. Differential Revision: https://reviews.llvm.org/D83985	2020-07-16 15:34:41 -06:00
Wouter van Oortmerssen	dbcc53b15e	[WebAssembly] 64-bit (function) pointer fixes. Accounting for the fact that Wasm function indices are 32-bit, but in wasm64 we want uniform 64-bit pointers. Includes reloc types for 64-bit table indices. Differential Revision: https://reviews.llvm.org/D83729	2020-07-16 14:10:22 -07:00
Roman Lebedev	656d8c2c2e	[NFC][PhaseOrdering] Add a test demonstrating pitfalls of common code hoisting on loop rotation Depending on the -rotation-max-header-size=?, hoisting common code early makes loop rotation impossible.	2020-07-16 23:53:26 +03:00
Craig Topper	555575ea6b	[X86] Move integer hadd/hsub formation into a helper function shared by combineAdd and combineSub. There was a lot of duplicate code here for checking the VT and subtarget. Moving it into a helper avoids that. It also fixes a bug that combineAdd reused Op0/Op1 after a call to isHorizontalBinOp may have changed it. The new helper function has its own local version of Op0/Op1 that aren't shared by other code. Fixes PR46455. Reviewed By: spatel, bkramer Differential Revision: https://reviews.llvm.org/D83971	2020-07-16 13:27:27 -07:00
Denis Antrushin	009dc84f1e	[Statepoint] Fix bug found by sanitizer. Statepoint has no static operands, so it cannot be verified against MCInstrDescr. Revert NumDefs change introduced by ef658ebd629.	2020-07-16 23:06:53 +03:00
serge-sans-paille	c5e705c5f8	Harmonize Python shebang Differential Revision: https://reviews.llvm.org/D83857	2020-07-16 21:53:45 +02:00
Matt Arsenault	6bfa8ac3f9	Fix incorrect file path in documentation	2020-07-16 15:53:11 -04:00
Matt Arsenault	39e76f9342	AMDGPU: Add a few more missing tests for AGPR tuple copying	2020-07-16 15:53:11 -04:00
Craig Topper	ab11c1e42d	[X86] Change the tuning settings for pentium4 to be more modern since it's the default 32-bit cpu in clang Alternative to D83897. I believe the big change here is that I removed slow unaligned memory 16. The downside is that it may adversely affect tuning if someone explicitly targets -march=pentium4 and expects pentium4 tuned code. Of course pentium4 is so old our default behavior with the previous settings may not have been the best either. Reviewed By: echristo, RKSimon Differential Revision: https://reviews.llvm.org/D83913	2020-07-16 12:51:25 -07:00
Matt Arsenault	92817d9c2a	AMDGPU: Add missing tests for copyPhysReg AGPR tuples	2020-07-16 15:27:57 -04:00
Mircea Trofin	5f8ff6c8ac	[llvm] Moved InlineSizeEstimatorAnalysis test to .ll Summary: Following guidance in https://llvm.org/docs/TestingGuide.html#testing-analysis Reviewers: mehdi_amini Subscribers: mgorny, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83918	2020-07-16 12:25:16 -07:00
LLVM GN Syncbot	207caa9df8	[gn build] Port 5e8b4be9f85	2020-07-16 19:08:09 +00:00
Wouter van Oortmerssen	53d8604583	[WebAssembly] Triple::wasm64 related cleanup Differential Revision: https://reviews.llvm.org/D83713	2020-07-16 12:01:10 -07:00
Eric Christopher	3a125536d7	Temporarily Revert "[AssumeBundles] Use operand bundles to encode alignment assumptions" due to the performance bugs filed in https://bugs.llvm.org/show_bug.cgi?id=46753. An SROA change soon may obviate some of these problems. This reverts commit 8d09f20798ac180b1749276bff364682ce0196ab.	2020-07-16 11:54:04 -07:00
Nadav Rotem	0cf1290686	[InjectTLIMappings] Use StringRef instead of std::string for FN name. https://reviews.llvm.org/D83797	2020-07-16 11:53:04 -07:00
Zakk Chen	4f4e8c6413	[RISCV] Add support for -mcpu option. Summary: 1. gcc uses the `-march` and `-mtune` flags to choose the arch and pipeline model, but clang does not have a `-mtune` flag, so we use `-mcpu` to choose both. 2. Add SiFive e31 and u54 cpu which have default march and pipeline model. 3. Specifying `-mcpu` with rocket-rv[32|64] would select the pipeline model only, and use the driver's arch choosing logic to get the default arch. Reviewers: lenary, asb, evandro, HsiangKai Reviewed By: lenary, asb, evandro Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D71124	2020-07-16 11:46:22 -07:00
Nadav Rotem	875d6beaad	[LiveVariables] Replace std::vector with SmallVector. Replace std::vector with SmallVector to reduce the number of mallocs. This method is frequently executed, and the number of elements in the vector is typically small. https://reviews.llvm.org/D83920	2020-07-16 11:39:54 -07:00
Thomas Lively	9ac94b78c6	[WebAssembly] Implement v128.select Although the SIMD spec proposal does not specifically include a select instruction, the select instruction in MVP WebAssembly is polymorphic over the selected types, so it is able to work on v128 values when they are enabled. This patch introduces a new variant of the select instruction for each legal vector type. Additional ISel patterns are adapted from the SELECT_I32 and SELECT_I64 patterns. Depends on D83736. Differential Revision: https://reviews.llvm.org/D83737	2020-07-16 11:37:25 -07:00
Nadav Rotem	8c87e8c6cf	[TableGen] Change std::vector to SmallVector The size of VTList that is pushed into this container is usually 1, but often 6 or 7. Change the vector to SmallVector to eliminate frequent mallocs. This happens hundreds of thousands of times in each tablegen execution during the LLVM/clang build. https://reviews.llvm.org/D83849	2020-07-16 11:32:44 -07:00
Matt Arsenault	51efd8682f	AMDGPU: Move handling of AGPR copies to a separate function This is in preparation for fixing multiple problems with the way AGPR copies are handled, but this change is NFC itself. First, it's relying on recursively calling copyPhysReg, which is losing information necessary to get correct super register handling. Second, it's constructing a new RegScavenger and doing a O(N^2) walk on every single sub-spill for every AGPR tuple copy. Third, it's using the forward form of the scavenger, and not using the preferred backwards scan.	2020-07-16 14:32:24 -04:00
Arthur Eubanks	09cfe7939a	[SCEV] Fix ScalarEvolution tests under NPM Many tests use opt's -analyze feature, which does not translate well to NPM and has better alternatives. The alternative here is to explicitly add a pass that calls ScalarEvolution::print(). The legacy pass manager RUNs aren't changing, but they are now pinned to the legacy pass manager. For each legacy pass manager RUN, I added a corresponding NPM RUN using the 'print<scalar-evolution>' pass. For compatibility with update_analyze_test_checks.py and existing test CHECKs, 'print<scalar-evolution>' now prints what -analyze prints per function. This was generated by the following Python script and failures were manually fixed up: import sys for i in sys.argv: with open(i, 'r') as f: s = f.read() with open(i, 'w') as f: for l in s.splitlines(): if "RUN:" in l and ' -analyze ' in l and '\\' not in l: f.write(l.replace(' -analyze ', ' -analyze -enable-new-pm=0 ')) f.write('\n') f.write(l.replace(' -analyze ', ' -disable-output ').replace(' -scalar-evolution ', ' "-passes=print<scalar-evolution>" ').replace(" | ", " 2>&1 | ")) f.write('\n') else: f.write(l) There are a couple failures still in ScalarEvolution under NPM, but those are due to other unrelated naming conflicts. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D83798	2020-07-16 11:24:07 -07:00
Thomas Lively	ea404b6fcd	[WebAssembly] Autogenerate tests for simd-select.ll Updating the simd-select.ll tests manually with consistent named regexps for the register numbers was taking more time than it was worth, so this patch updates that test file to have autogenerated output. This is not a significant readability regression because the tests in that file are all very small. Depends on D83734. Differential Revision: https://reviews.llvm.org/D83736	2020-07-16 11:19:09 -07:00
Thomas Lively	1cc997c0a7	[WebAssembly] Lower vselect to v128.bitselect We were previously expanding vselect and matching on the expansion to generate bitselects, but in some cases the expansion would be further combined and a bitselect would not get generated. This patch improves codegen in those cases by legalizing vselect and lowering it to v128.bitselect. The old pattern that matches the expansion is still useful for lowering IR that already uses the expansion rather than a select operation. Differential Revision: https://reviews.llvm.org/D83734	2020-07-16 11:11:19 -07:00
Craig Topper	00bbcf7ab4	[X86] Add test case for PR46455.	2020-07-16 11:06:55 -07:00
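
The Python script quoted in commit 09cfe7939a ([SCEV] Fix ScalarEvolution tests under NPM) appears in the log flattened onto one line. The sketch below lays out the same RUN-line rewrite with indentation restored. It is a reconstruction, not the committed script: the quoted version rewrites the files named on the command line in place, while this one wraps the logic in a pure function so it can be tried on a string. The function name `rewrite_run_lines` and the sample test lines are hypothetical and were chosen only for illustration.

```python
def rewrite_run_lines(text: str) -> str:
    """Pin each legacy `-analyze` RUN line and add a new-pass-manager twin.

    Hypothetical helper: mirrors the string replacements quoted in the
    commit message, but works on a string instead of rewriting files.
    """
    out = []
    for line in text.splitlines():
        # Like the quoted script, skip RUN lines continued with a backslash.
        if "RUN:" in line and " -analyze " in line and "\\" not in line:
            # Keep the original RUN line, pinned to the legacy pass manager.
            out.append(line.replace(" -analyze ", " -analyze -enable-new-pm=0 "))
            # Add an NPM RUN line that uses the printer pass instead.
            out.append(
                line.replace(" -analyze ", " -disable-output ")
                .replace(" -scalar-evolution ", ' "-passes=print<scalar-evolution>" ')
                .replace(" | ", " 2>&1 | ")
            )
        else:
            out.append(line)
    return "\n".join(out) + "\n"


# Hypothetical test-file content, used only to exercise the function.
sample = "; RUN: opt < %s -analyze -scalar-evolution | FileCheck %s\n; CHECK: foo\n"
result = rewrite_run_lines(sample)
print(result, end="")
```

Each matching RUN line becomes two lines: the legacy one with `-enable-new-pm=0` added, followed by one that passes `"-passes=print<scalar-evolution>"` and redirects stderr into the pipe. All other lines pass through unchanged.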

200285 Commits