llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 12:41:49 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	d70b0f4887	[X86ISelLowering] combineCMov - cleanup CMOV->LEA codegen. NFCI. Only compute the diff once and we don't need the truncation code (assert the bitwidth is correct just to be safe). llvm-svn: 370583	2019-08-31 14:18:26 +00:00
Simon Pilgrim	33669f6fcc	[X86ISelLowering] LowerSELECT - remove duplicate value type. NFCI. VT of SELECT result and selection ops will be the same. llvm-svn: 370581	2019-08-31 13:14:52 +00:00
Simon Pilgrim	7282dc840a	Fix cppcheck shadow variable and variable scope warnings. NFCI. llvm-svn: 370580	2019-08-31 12:30:19 +00:00
Amaury Sechet	1746c5a1e3	[DAGCombiner] Match (add X, X) as (shl X, 1) when detecting rotate. Summary: The combiner transforms (shl X, 1) into (add X, X). Reviewers: craig.topper, efriedma, RKSimon, lebedev.ri Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66882 llvm-svn: 370578	2019-08-31 11:40:02 +00:00
Fangrui Song	c10555e1fa	[llvm-objcopy] Simplify alignToAddr with llvm::alignTo llvm-svn: 370577	2019-08-31 10:48:09 +00:00
James Molloy	7299753dbd	[DAGCombiner] Don't create illegal narrow stores Narrowing stores when the target doesn't support the narrow version forces the target to expand into a load-modify-store sequence, which is highly suboptimal. The information narrowing throws away (legality of the inverse transform) is hard to re-analyze. If the target doesn't support a store of the narrow type, don't narrow even in pre-legalize mode. No test as this is DAGCombiner and depends on target bits. llvm-svn: 370576	2019-08-31 10:46:16 +00:00
Nikita Popov	f80ed591c4	[LVI] Extract solveBlockValueExtractValue(); NFC Extract this method in preparation for additional extractvalue support. llvm-svn: 370575	2019-08-31 09:58:50 +00:00
Nikita Popov	3625f144e6	[CVP] Add tests for simplified with.overflow + icmp; NFC These tests are based on D19867. llvm-svn: 370574	2019-08-31 09:58:42 +00:00
Nikita Popov	1b68cdbab8	[CVP] Generate simpler code for elided with.overflow intrinsics Use a { iN undef, i1 false } struct as the base, and only insert the first operand, instead of using { iN undef, i1 undef } as the base and inserting both. This is the same as what we do in InstCombine. Differential Revision: https://reviews.llvm.org/D67034 llvm-svn: 370573	2019-08-31 09:58:37 +00:00
Bjorn Pettersson	de3fdb020d	[CodeGen] Refactor DAGTypeLegalizer::ExpandIntRes_MULFIX. NFC Restructured the code a little bit in preparation for adding UMULFIXSAT. I think it will be easier to understand the code if not interleaving the codegen for signed/unsigned/saturated cases that much. llvm-svn: 370569	2019-08-31 09:28:50 +00:00
Bjorn Pettersson	99b90b9e5c	[LangRef] Update saturating examples for llvm.smul.fix.sat. NFC Some saturation examples for llvm.smul.fix.sat were not showing the correct result. I've adjusted the operands to make sure that we actually trigger overflow in those examples. llvm-svn: 370566	2019-08-31 09:01:16 +00:00
Wei Mi	f59cc38992	Fix some errors introduced by rL370563 which were not exposed on my local machine. 1. zlib::compress accepts &size_t but the param is a uint64_t. 2. Some systems don't have zlib installed. Don't use compression by default. llvm-svn: 370564	2019-08-31 03:17:49 +00:00
Wei Mi	47e2f8a30e	[SampleFDO] Add profile symbol list section to discriminate function being cold versus function being newly added. This is the second half of https://reviews.llvm.org/D66374. Profile symbol list is the collection of function symbols showing up in the binary which generates the current profile. It is used to discriminate function being cold versus function being newly added. Profile symbol list is only added for profile with ExtBinary format. During profile use compilation, when profile-sample-accurate is enabled, a function without profile will be regarded as cold only when it is contained in that list. Differential Revision: https://reviews.llvm.org/D66766 llvm-svn: 370563	2019-08-31 02:27:26 +00:00
David Blaikie	1f0c809cef	llvm-dwarfdump: Cache CU low_pc when computing statistics. llvm-svn: 370559	2019-08-31 01:05:46 +00:00
Thomas Lively	41a3b659ef	[WebAssembly] Add SIMD QFMA/QFMS Summary: Adds clang builtins and LLVM intrinsics for these experimental instructions. They are not implemented in engines yet, but that is ok because the user must opt into using them by calling the builtins. Reviewers: aheejin, dschuff Reviewed By: aheejin Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D67020 llvm-svn: 370556	2019-08-31 00:12:29 +00:00
Jonas Devlieghere	8742846f67	[lit] Only set DYLD_LIBRARY_PATH for shared builds In r370135 I committed a temporary workaround for the sanitized bot to not set (DY)LD_LIBRARY_PATH when (DY)LD_INSERT_LIBRARIES was set. Setting (DY)LD_LIBRARY_PATH is only necessary for (standalone) shared-library builds, so a better solution is to only set the environment variable when necessary. Differential revision: https://reviews.llvm.org/D67012 llvm-svn: 370549	2019-08-30 23:16:02 +00:00
Alina Sbirlea	5ec60553a7	[MemorySSA] Rename all phi entries. When renaming Phis incoming values, there may be multiple edges incoming from the same block (switch). Rename all. llvm-svn: 370548	2019-08-30 23:02:53 +00:00
Wei Mi	e25039a021	[GVN] Verify value equality before doing phi translation for call instruction This is an updated version of https://reviews.llvm.org/D66909 to fix PR42605. Basically, current phi translation translates an old value number to a new value number for a call instruction based on the literal equality of call expression, without verifying there is no clobber in between. This is incorrect. To get a fine-grained check, use MemoryDependence analysis to do the job. However, this is still not ideal. Given a call instruction, `MemoryDependenceResults::getCallDependencyFrom` returns identical call instructions without clobber in between using MemDepResult with its DepType set to `Def`. However, identical is too strict here and we want it to be relaxed a little to consider phi-translation -- callee is the same, param operands can be different. That means changing the semantics of `MemDepResult::Def` and I don't know the potential impact. So currently the patch is still conservative and only handles MemDepResult::NonFuncLocal, which means the current call has no function local clobber. If there is a clobber, even if the clobber doesn't stand in between the current call and the call with the new value, we won't do phi-translate. Differential Revision: https://reviews.llvm.org/D67013 llvm-svn: 370547	2019-08-30 23:01:22 +00:00
Reid Kleckner	0d5e874345	Fix SEH_NoReturn machine verifier error llvm-svn: 370543	2019-08-30 22:40:51 +00:00
Reid Kleckner	af87464af0	[MC] Avoid crashes from improperly nested or wrong target .seh_handlerdata directives llvm-svn: 370540	2019-08-30 22:25:55 +00:00
Reid Kleckner	4a1ffa80e1	[X86] Print register names in .seh_* directives Also improve assembler parser register validation for .seh_ directives. This requires moving X86-specific seh directive handling into the x86 backend, which addresses some assembler FIXMEs. Differential Revision: https://reviews.llvm.org/D66625 llvm-svn: 370533	2019-08-30 21:23:05 +00:00
Sanjay Patel	7dfc9a34d0	[x86] add tests for shift-logic-shift; NFC llvm-svn: 370529	2019-08-30 20:51:51 +00:00
Sanjay Patel	2796605a46	[AArch64] add tests for shift-logic-shift; NFC llvm-svn: 370528	2019-08-30 20:48:43 +00:00
Reid Kleckner	e996333540	[Windows] Disable TrapUnreachable for Win64, add SEH_NoReturn Users have complained llvm.trap produces two ud2 instructions on Win64, one for the trap, and one for unreachable. This change fixes that. TrapUnreachable was added and enabled for Win64 in r206684 (April 2014) to avoid poorly understood issues with the Windows unwinder. There seem to be two major things in play: - the unwinder - C++ EH, _CxxFrameHandler3 & co The unwinder disassembles forward from the return address to scan for epilogues. Inserting a ud2 had the effect of stopping the unwinder, and ensuring that it ran the EH personality function for the current frame. However, it's not clear what the unwinder does when the return address happens to be the last address of one function and the first address of the next function. The Visual C++ EH personality, _CxxFrameHandler3, needs to figure out what the current EH state number is. It does this by consulting the ip2state table, which maps from PC to state number. This seems to go wrong when the return address is the last PC of the function or catch funclet. I'm not sure precisely which system is involved here, but in order to address these real or hypothetical problems, I believe it is enough to insert int3 after a call site if it would otherwise be the last instruction in a function or funclet. I was able to reproduce some similar problems locally by arranging for a noreturn call to appear at the end of a catch block immediately before an unrelated function, and I confirmed that the problems go away when an extra trailing int3 instruction is added. MSVC inserts int3 after every noreturn function call, but I believe it's only necessary to do it if the call would be the last instruction. This change inserts a pseudo instruction that expands to int3 if it is in the last basic block of a function or funclet. I did what I could to run the Microsoft compiler EH tests, and the ones I was able to run showed no behavior difference before or after this change. Differential Revision: https://reviews.llvm.org/D66980 llvm-svn: 370525	2019-08-30 20:46:39 +00:00
Puyan Lotfi	31a7e50db7	[IFS][NFC] llvm-ifs: Fixing build bot build break: revert r370517 and r370510. llvm-svn: 370522	2019-08-30 20:25:46 +00:00
Sanjay Patel	b9b05584b8	[Thumb2] tighten CHECK lines in test; NFC The sequence between the function call and the asm start may change without affecting what this test is looking for, but we should have a better idea about what that sequence looks like. llvm-svn: 370518	2019-08-30 20:15:01 +00:00
Puyan Lotfi	d63fc8cc9e	[IFS][NFC] llvm-ifs: Fixing build bot error due to commit conflicts. r370510 and r370504 Again only on gcc. llvm-svn: 370517	2019-08-30 20:09:55 +00:00
Nico Weber	d452d2ab8b	gn build: Merge r370512 llvm-svn: 370516	2019-08-30 20:06:44 +00:00
Craig Topper	a77efde36c	[X86] Fix mul test cases in avx512-broadcast-unfold.ll to not get canonicalized to fadd. Remove the fsub test cases which were also testing fadd. Not sure how to prevent an fsub by constant getting turned into an fadd by negative constant. llvm-svn: 370515	2019-08-30 20:04:23 +00:00
Puyan Lotfi	0d053c126a	[IFS][NFC] llvm-ifs: Fixing build errors for bots using GCC. gcc produces the error: error: specialization of ‘template<class T, class Enable> struct llvm::yaml::ScalarTraits’ in different namespace For all specializations outside of llvm::yaml. So I added llvm::yaml to these specializations to fix the errors on the bots building with gcc (/usr/bin/c++). llvm-svn: 370510	2019-08-30 19:54:46 +00:00
James Molloy	74fb57df4f	[DFAPacketizer] Allow namespacing of automata per-itinerary The Hexagon itineraries are cunningly crafted such that functional units between itineraries do not clash. Because all itineraries are bundled into the same DFA, a functional unit index clash would cause an incorrect DFA to be generated. A workaround for this is to ensure all itineraries declare the universe of all possible functional units, but this isn't ideal for three reasons: 1) We only have a limited number of FUs we can encode in the packetizer, and using the universe causes us to hit the limit without care. 2) Silent codegen faults are bad, and careful triage of the FU list shouldn't be required. 3) Smooshing all itineraries into the same automaton allows combinations of instruction classes that cannot exist, which bloats the table. A simple solution is to allow "namespacing" packetizers. Differential Revision: https://reviews.llvm.org/D66940 llvm-svn: 370508	2019-08-30 19:50:49 +00:00
Craig Topper	699fc8965e	[X86] Regenerate the test cases added in r370506. Something weird happened with the v2i64/v2f64 test cases which don't use broadcast. So they should already be hoisted, but weren't in the version I submitted in r370506. This fixes that. Not sure if something changed or I screwed up. llvm-svn: 370507	2019-08-30 19:42:48 +00:00
Craig Topper	6a49ebb3c6	[X86] Add test cases for opportunities for machine LICM to unfold broadcasted constant pool loads. MachineLICM is able to unfold loads to move an invariant load out of a loop, but X86 infrastructure currently lacks the ability to do this when avx512 embedded broadcasting is used. This test adds examples for the basic floating point operations, add, mul, and, or, and xor. llvm-svn: 370506	2019-08-30 19:26:06 +00:00
Jinsong Ji	8c5e437e76	[PowerPC][NFC] Avoid checking non-relevant .cfi instructions Summary: This is brought up in https://reviews.llvm.org/D64662?id=209923#inline-599490 CFI information is not relevant to quite a few testcases, so we should get rid of checking it when it's unnecessary. This patch avoids generating cfi info in testcases that are not testing prolog/epilog or exception handling. Reviewers: kbarton, hfinkel, nemanjai, #powerpc Reviewed By: hfinkel Subscribers: MaskRay, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67016 llvm-svn: 370505	2019-08-30 19:24:25 +00:00
Michael Liao	ab94c7a42e	Fix compilation warnings. NFC. llvm-svn: 370504	2019-08-30 19:23:28 +00:00
Nico Weber	99225dd97f	gn build: Merge r370500 llvm-svn: 370501	2019-08-30 18:55:11 +00:00
James Molloy	e93cbbf303	[MachinePipeliner] Separate schedule emission, NFC This is the first stage in refactoring the pipeliner and making it more accessible for backends to override and control. This separates the logic and state required to emit a schedule from the logic that computes and validates a schedule. This will enable (a) new schedule emitters and (b) new modulo scheduling implementations to coexist. NFC. Differential Revision: https://reviews.llvm.org/D67006 llvm-svn: 370500	2019-08-30 18:49:50 +00:00
Puyan Lotfi	e0c741cd7e	[llvm-ifs][IFS] llvm Interface Stubs merging + object file generation tool. This tool merges interface stub files to produce a merged interface stub file or a stub library. Currently, for stub library generation, it can produce an ELF .so stub file or a TBD file (experimental). It will be used by the clang -emit-interface-stubs compilation pipeline to merge and assemble the per-CU stub files into a stub library. The new IFS format is as follows: --- !experimental-ifs-v1 IfsVersion: 1.0 Triple: <llvm triple> ObjectFileFormat: <ELF | TBD> Symbols: _ZSymbolName: { Type: <type>, etc... } ... Differential Revision: https://reviews.llvm.org/D66405 llvm-svn: 370499	2019-08-30 18:26:05 +00:00
Simon Pilgrim	8503002d16	[DAGCombine] ReduceLoadWidth - remove duplicate SDLoc. NFCI. SDLoc(N0) and SDLoc(cast<LoadSDNode>(N0)) should be equivalent. llvm-svn: 370498	2019-08-30 18:19:02 +00:00
Simon Pilgrim	4727ea4c99	[TargetLowering] SimplifyDemandedBits ADD/SUB/MUL - correctly inherit SDNodeFlags from the original node. Just disable NSW/NUW flags. This matches what we're already doing for the other situations for these nodes, it was just missed for the demanded constant case. Noticed by inspection - confirmed in offline discussion with @spatel. I've checked we have test coverage in the x86 extract-bits.ll and extract-lowbits.ll tests llvm-svn: 370497	2019-08-30 17:58:55 +00:00
Matt Arsenault	c176b0704a	GlobalISel: Fix missing pass dependency llvm-svn: 370496	2019-08-30 17:41:58 +00:00
Craig Topper	44eac5c3b3	[X86] Pass v32i16/v64i8 in zmm registers on KNL target. gcc and icc pass these types in zmm registers. This patch implements a quick hack to override the register type before calling convention handling to one that is legal. Longer term we might want to do something similar to 256-bit integer registers on AVX1 where we just split all the operations. Fixes PR42957 Differential Revision: https://reviews.llvm.org/D66708 llvm-svn: 370495	2019-08-30 17:35:08 +00:00
Craig Topper	89128644b0	[ValueTypes] Add v16f16 and v32f16 to EVT::getEVTString and Tablegen's getEnumName Missed these when I added the enum entries llvm-svn: 370494	2019-08-30 17:34:29 +00:00
Nico Weber	2bd08cce34	gn build: Merge r370490 llvm-svn: 370492	2019-08-30 17:30:08 +00:00
Evgeniy Stepanov	74f96070c3	MemTag: unchecked load/store optimization. Summary: MTE allows memory access to bypass tag check iff the address argument is [SP, #imm]. This change takes advantage of this to demote uses of tagged addresses to regular FrameIndex operands, reducing register pressure in large functions. MO_TAGGED target flag is used to signal that the FrameIndex operand refers to memory that might be tagged, and needs to be handled with care. Such operand must be lowered to [SP, #imm] directly, without a scratch register. The transformation pass attempts to predict when the offset will be out of range and disable the optimization. AArch64RegisterInfo::eliminateFrameIndex has an escape hatch in case this prediction has been wrong, but it is quite inefficient and should be avoided. Reviewers: pcc, vitalybuka, ostannard Subscribers: mgorny, javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66457 llvm-svn: 370490	2019-08-30 17:23:02 +00:00
Simon Pilgrim	dfd3a1d37d	[DAGCombine] visitVSELECT - remove equivalent getValueType() call. NFCI. llvm-svn: 370489	2019-08-30 17:21:20 +00:00
Whitney Tsang	abb8dd65f9	[INSTRUCTIONS] Add support of const for getLoadStorePointerOperand() and getLoadStorePointerOperand(). Reviewer: hsaito, sebpop, reames, hfinkel, mkuper, bogner, haicheng, arsenm, lattner, chandlerc, grosser, rengolin Reviewed By: reames Subscribers: wdng, llvm-commits, bmahjour Tag: LLVM Differential Revision: https://reviews.llvm.org/D66595 llvm-svn: 370486	2019-08-30 16:41:35 +00:00
Johannes Doerfert	f5ce2f5349	[Attributor] Fix: do not pretend to preserve the CFG llvm-svn: 370485	2019-08-30 16:35:10 +00:00
Craig Topper	089dd052e5	[X86] Merge X86InstrInfo::loadRegFromAddr/storeRegToAddr into their only call site. I'm looking at unfolding broadcast loads on AVX512 which will require refactoring this code to select broadcast opcodes instead of regular load/stores in some cases. Merging them to avoid further complicating their interfaces. llvm-svn: 370484	2019-08-30 16:05:57 +00:00
Johannes Doerfert	c5b35f8dcf	[Attributor] Use existing function information for the call site Summary: Instead of recomputing information for call sites we now use the function information directly. This is always valid and once we have call site specific information we can improve here. This patch also bootstraps attributes that are created on-demand through an initial update call. Information that is known will then directly be available in the new attribute without causing an iteration delay. The tests show how this improves the iteration count. Reviewers: sstefan1, uenoku Subscribers: hiraditya, bollu, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66781 llvm-svn: 370480	2019-08-30 15:24:52 +00:00
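
Note on 1746c5a1e3 ([DAGCombiner] Match (add X, X) as (shl X, 1) when detecting rotate): the message says the combiner rewrites (shl X, 1) into (add X, X), so a rotate-by-one pattern may show up with an add in place of the shift. The sketch below is only a small Python model of that arithmetic identity on fixed-width integers. It is not LLVM code, and the function names and the 32-bit default width are assumptions chosen for illustration.

```python
def rotl1_via_shl(x: int, bits: int = 32) -> int:
    """Rotate left by one, written with an explicit shift: (x << 1) | (x >> (bits - 1))."""
    mask = (1 << bits) - 1
    x &= mask  # model a fixed-width unsigned value
    return ((x << 1) & mask) | (x >> (bits - 1))


def rotl1_via_add(x: int, bits: int = 32) -> int:
    """The same rotate after the shift has been rewritten as an add: (x + x) | (x >> (bits - 1))."""
    mask = (1 << bits) - 1
    x &= mask  # model a fixed-width unsigned value
    return ((x + x) & mask) | (x >> (bits - 1))


# Both forms agree on every sampled input, which is why a rotate matcher
# can treat (add X, X) as (shl X, 1).
samples = [0, 1, 0x7FFFFFFF, 0x80000000, 0x80000001, 0xDEADBEEF, 0xFFFFFFFF]
all_equal = all(rotl1_via_shl(v) == rotl1_via_add(v) for v in samples)
```

For a wrapping N-bit value, x + x and x << 1 produce the same bits, so the two helpers differ only in how the left half of the rotate is spelled.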

184186 Commits
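
Note on c10555e1fa ([llvm-objcopy] Simplify alignToAddr with llvm::alignTo): the log does not show how alignToAddr calls llvm::alignTo, so the following is only a Python model of the rounding rule that llvm::alignTo is documented to implement (round a value up to the next multiple of an alignment, optionally offset by a skew). The name align_to and the error handling are assumptions for illustration.

```python
def align_to(value: int, align: int, skew: int = 0) -> int:
    """Return the smallest integer >= value that is congruent to skew modulo align.

    With skew == 0 this is plain round-up to a multiple of align.
    Assumes value is non-negative.
    """
    if align <= 0:
        raise ValueError("align must be positive")
    skew %= align
    # Shift by the skew, round up to a multiple of align, then shift back.
    return (value + align - 1 - skew) // align * align + skew


aligned_plain = align_to(5, 8)       # 8
aligned_skewed = align_to(5, 8, 7)   # 7
```

The skew argument is what lets a caller ask for an offset that matches a given address modulo the alignment rather than a plain multiple of it.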