llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Craig Topper	be4feb18ce	[X86] Don't store register and memory FMA3 opcodes in the same X86InstrFMA3Group. Nothing was using this relationship. By splitting them we no longer need to worry about register or memory entries being empty in a group. The memory folding tables in X86InstrInfo.cpp can be used to access this relationship if needed. llvm-svn: 335694	2018-06-27 00:42:24 +00:00
Vedant Kumar	9b42ff11c0	[Debugify] Diagnose mis-sized dbg.values Report an error in -check-debugify when the size of a dbg.value operand doesn't match up with the size of the variable it describes. Eventually this check should be moved into the IR verifier. For the moment, it's useful to include the check in -check-debugify as a means of catching regressions and finding existing bugs. Here are some instances of bugs the new check finds in the -O2 pipeline (all in InstCombine): 1) A float is used where a double is expected: ERROR: dbg.value operand has size 32, but its variable has size 64: call void @llvm.dbg.value(metadata float %expf, metadata !12, metadata !DIExpression()), !dbg !15 2) An i8 is used where an i32 is expected: ERROR: dbg.value operand has size 8, but its variable has size 32: call void @llvm.dbg.value(metadata i8 %t4, metadata !14, metadata !DIExpression()), !dbg !24 3) A <4 x i32> is used where something twice as large is expected (perhaps a <4 x i64>, I haven't double-checked): ERROR: dbg.value operand has size 128, but its variable has size 256: call void @llvm.dbg.value(metadata <4 x i32> %4, metadata !40, metadata !DIExpression()), !dbg !95 Differential Revision: https://reviews.llvm.org/D48408 llvm-svn: 335682	2018-06-26 22:46:41 +00:00
Evgeniy Stepanov	eb9e3a4c7e	Revert "[asan] Instrument comdat globals on COFF targets" Causes false positive ODR violation reports on __llvm_profile_raw_version. llvm-svn: 335681	2018-06-26 22:43:48 +00:00
Lang Hames	f82a6ed2d5	[ORC] Don't call isa<> on a null value. This should fix the recent builder failures in the test-global-ctors.ll testcase. llvm-svn: 335680	2018-06-26 22:43:01 +00:00
Lang Hames	db185d0f92	[ORC] Fix a missing return value. llvm-svn: 335677	2018-06-26 22:30:42 +00:00
Michael Zolotukhin	2e643205bc	[JumpThreading] Don't try to rewrite a use if it's already valid. Summary: When recording uses we need to rewrite after cloning a loop we need to check if the use is not dominated by the original def. The initial assumption was that the cloned basic block will introduce a new path and thus the original def will only dominate the use if they are in the same BB, but as the reproducer from PR37745 shows it's not always the case. This fixes PR37745. Reviewers: haicheng, Ka-Ka Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D48111 llvm-svn: 335675	2018-06-26 22:19:48 +00:00
Lang Hames	59191307e6	[ORC] Add a dependence on MC to LLVMBuild.txt llvm-svn: 335673	2018-06-26 22:12:02 +00:00
Simon Pilgrim	3a39f0dbcf	[X86] Add test for SDIV by sign bit (minsigned) value llvm-svn: 335671	2018-06-26 22:03:00 +00:00
Lang Hames	857b27372d	[ORC] Add LLJIT and LLLazyJIT, and replace OrcLazyJIT in LLI with LLLazyJIT. LLJIT is a prefabricated ORC based JIT class that is meant to be the go-to replacement for MCJIT. Unlike OrcMCJITReplacement (which will continue to be supported) it is not API or bug-for-bug compatible, but targets the same use cases: Simple, non-lazy compilation and execution of LLVM IR. LLLazyJIT extends LLJIT with support for function-at-a-time lazy compilation, similar to what was provided by LLVM's original (now long deprecated) JIT APIs. This commit also contains some simple utility classes (CtorDtorRunner2, LocalCXXRuntimeOverrides2, JITTargetMachineBuilder) to support LLJIT and LLLazyJIT. Both of these classes are works in progress. Feedback from JIT clients is very welcome! llvm-svn: 335670	2018-06-26 21:35:48 +00:00
Konstantin Zhuravlyov	4a6d2992e9	AMDGPU: Silence unused warnings in waitcnt insertion pass in release build Differential Revision: https://reviews.llvm.org/D48607 llvm-svn: 335669	2018-06-26 21:33:38 +00:00
Jessica Paquette	e5195a2cfc	[X86][AsmParser] Recommit r335658 Recommit of r335658 so that it does not change the behaviour of any existing error output. llvm-svn: 335668	2018-06-26 21:30:34 +00:00
Vedant Kumar	a8f12623d6	Rename skipDebugInfo -> skipDebugIntrinsics, NFC This addresses post-commit feedback about the name 'skipDebugInfo' being misleading. This name could be interpreted as meaning 'a function that skips instructions with debug locations'. The new name, 'skipDebugIntrinsics', makes it clear that this function only skips debug info intrinsics. Thanks to Adrian Prantl for pointing this out! llvm-svn: 335667	2018-06-26 21:16:59 +00:00
Lang Hames	b0117fde0d	[ORC] Allow IRTransformLayer2's transform to be modified after initialization. Also give the constructor's transform parameter a default no-op transform value. llvm-svn: 335665	2018-06-26 20:59:51 +00:00
Lang Hames	6916237279	[ORC] Reset AsynchronousSymbolQuery's NotifySymbolsResolved callback on error. AsynchronousSymbolQuery::canStillFail checks the value of the callback to prevent sending it redundant error notifications, so we need to reset it after running it. llvm-svn: 335664	2018-06-26 20:59:50 +00:00
Lang Hames	8a7e694834	[ORC] Move the VSOList typedef out of VSO. llvm-svn: 335663	2018-06-26 20:59:49 +00:00
Lang Hames	3819bcadf3	[ORC] Add a FIXME. llvm-svn: 335662	2018-06-26 20:59:49 +00:00
Lang Hames	927643685e	[ORC] Fix a FIXME by moving MangleAndInterner to Core.h. llvm-svn: 335661	2018-06-26 20:59:46 +00:00
Jessica Paquette	91e636b4ee	Revert "[X86][AsmParser] Emit an error when RIP-relative instructions are used in 32-bit mode" This reverts commit 4850a9aae8b38c7deadc103d634ec7397e6c323b. It caused MC/X86/x86_errors.s to fail. Will fix and recommit shortly. llvm-svn: 335660	2018-06-26 20:57:19 +00:00
Jessica Paquette	9dc2822253	[X86][AsmParser] Emit an error when RIP-relative instructions are used in 32-bit mode Right now, when we use RIP-relative instructions in 32-bit mode, we'll just assert and crash. This adds an error message which tells the user that they can't do that in 32-bit mode, so that we don't crash (and also can see the issue outside of assert builds). llvm-svn: 335658	2018-06-26 20:33:46 +00:00
Stanislav Mekhanoshin	91c15f9d04	[AMDGPU] Add llvm.amdgcn.fmad.ftz intrinsic This intrinsic selects v_mad_f32 regardless of fp32 denorm support. Differential Revision: https://reviews.llvm.org/D48573 llvm-svn: 335654	2018-06-26 20:04:19 +00:00
Sanjay Patel	ff645d85f5	[DAGCombiner] use isBitwiseNot to simplify code; NFC llvm-svn: 335652	2018-06-26 19:46:56 +00:00
Matt Arsenault	2b0231f519	AMDGPU: Add pass to lower kernel arguments to loads This replaces most argument uses with loads, but for now not all. The code in SelectionDAG for calling convention lowering is actively harmful for amdgpu_kernel. It attempts to split the argument types into register legal types, which results in low quality code for arbitary types. Since all kernel arguments are passed in memory, we just want the raw types. I've tried a couple of methods of mitigating this in SelectionDAG, but it's easier to just bypass this problem alltogether. It's possible to hack around the problem in the initial lowering, but the real problem is the DAG then expects to be able to use CopyToReg/CopyFromReg for uses of the arguments outside the block. Exposing the argument loads in the IR also has the advantage that the LoadStoreVectorizer can merge them. I'm not sure the best approach to dealing with the IR argument list is. The patch as-is just leaves the IR arguments in place, so all the existing code will still compute the same kernarg size and pointlessly lowers the arguments. Arguably the frontend should emit kernels with an empty argument list in the first place. Alternatively a dummy array could be inserted as a single argument just to reserve space. This does have some disadvantages. Local pointer kernel arguments can no longer have AssertZext placed on them as the equivalent !range metadata is not valid on pointer typed loads. This is mostly bad for SI which needs to know about the known bits in order to use the DS instruction offset, so in this case this is not done. More importantly, this skips noalias arguments since this pass does not yet convert this to the equivalent !alias.scope and !noalias metadata. Producing this metadata correctly seems to be tricky, although this logically is the same as inlining into a function which doesn't exist. Additionally, exposing these loads to the vectorizer may result in degraded aliasing information if a pointer load is merged with another argument load. I'm also not entirely sure this is preserving the current clover ABI, although I would greatly prefer if it would stop widening arguments and match the HSA ABI. As-is I think it is extending < 4-byte arguments to 4-bytes but doesn't align them to 4-bytes. llvm-svn: 335650	2018-06-26 19:10:00 +00:00
Matt Arsenault	bf0a0e516e	ConstantFold: Don't fold global address vs. null for addrspace != 0 Not sure why this logic seems to be repeated in 2 different places, one called by the other. On AMDGPU addrspace(3) globals start allocating at 0, so these checks will be incorrect (not that real code actually tries to compare these addresses) llvm-svn: 335649	2018-06-26 18:55:43 +00:00
Vedant Kumar	8d23cb549a	Use a variable to appease a no-asserts bot, NFC Failure URL: http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/22836 llvm-svn: 335648	2018-06-26 18:55:26 +00:00
Vedant Kumar	6431dad8b9	[Debugify] Don't treat missing dbg.values as an error (PR37942) When checking the debug info in a module, don't treat a missing dbg.value as an error. The dbg.value may simply have been DCE'd, in which case the debugger has enough information to display the variable as <optimized out>. llvm-svn: 335647	2018-06-26 18:54:10 +00:00
Tim Shen	a374e2fb2b	[ConstantRange] Add support of mul in makeGuaranteedNoWrapRegion. Summary: This is trying to add support for r334428. Reviewers: sanjoy Subscribers: jlebar, hiraditya, bixia, llvm-commits Differential Revision: https://reviews.llvm.org/D48399 llvm-svn: 335646	2018-06-26 18:54:10 +00:00
Matt Arsenault	ab262a454f	LoopUnroll: Allow analyzing intrinsic call costs I'm not sure why the code here is skipping calls since TTI does try to do something for general calls, but it at least should allow intrinsics. Skip intrinsics that should not be omitted as calls, which is by far the most common case on AMDGPU. llvm-svn: 335645	2018-06-26 18:51:17 +00:00
Vedant Kumar	bc596f8a18	[Local] Add a convenient insertReplacementDbgValues overload, NFC Add an overload for the common case where the replacement dbg.values have the same DIExpressions as the originals. llvm-svn: 335643	2018-06-26 18:44:53 +00:00
Vedant Kumar	c04d6ac6f4	[Local] Sink salvageDI's early exit into helper functions, NFC salvageDebugInfo() performs a check that allows it to exit early without doing a DenseMap lookup. It's a bit neater and marginally more useful to sink this early exit into the findDbg{Addr,Users,Values} helpers. llvm-svn: 335642	2018-06-26 18:44:52 +00:00
Brendon Cahoon	a4831b8a52	[Hexagon] Add a "generic" cpu Add the generic processor for Hexagon so that it can be used with 3rd party programs that create a back-end with the "generic" CPU. This patch also enables the JIT for Hexagon. Differential Revision: https://reviews.llvm.org/D48571 llvm-svn: 335641	2018-06-26 18:44:05 +00:00
Simon Pilgrim	5b2225c1d5	[DAGCombiner] Don't accept -1 sdiv divisors in sdiv-by-pow2 vector expansion (PR37119) Temporary fix until I've managed to get D45806 updated - both +1 and -1 special cases need to be properly supported. llvm-svn: 335637	2018-06-26 17:46:51 +00:00
Fangrui Song	067aab1d40	Move `REQUIRES:` line to the top llvm-svn: 335635	2018-06-26 17:44:23 +00:00
Sanjay Patel	48cffb1157	[InstSimplify] fold shifts by sext bool https://rise4fun.com/Alive/c3Y llvm-svn: 335633	2018-06-26 17:31:38 +00:00
Sanjay Patel	ad9db90157	[InstSimplify] add tests for shifts by sext bool; NFC llvm-svn: 335631	2018-06-26 17:15:07 +00:00
Simon Pilgrim	bda473bd6a	[X86][SSE] Add another sdiv by (nonuniform) minus one test (PR37119) Include a test that divides by -1 but not by 1 (another special case) llvm-svn: 335629	2018-06-26 17:06:05 +00:00
Sanjay Patel	b6f7fd926a	[InstCombine] simplify code for urem fold; NFCI llvm-svn: 335623	2018-06-26 16:39:29 +00:00
Sanjay Patel	9978774cc0	[InstCombine] fold urem with sext bool divisor Similar to other patches in this series: https://reviews.llvm.org/rL335512 https://reviews.llvm.org/rL335527 https://reviews.llvm.org/rL335597 https://reviews.llvm.org/rL335616 ...this is filling a gap in analysis that is exposed by an unrelated select-of-constants transform. I didn't see a way to unify the sext cases because each div/rem opcode results in a different fold. Note that in this case, the backend might want to convert the select into math: Name: sext urem %e = sext i1 %x to i32 %r = urem i32 %y, %e => %c = icmp eq i32 %y, -1 %z = zext i1 %c to i32 %r = add i32 %z, %y llvm-svn: 335622	2018-06-26 16:30:00 +00:00
Simon Pilgrim	2c643d6425	[SLPVectorizer] Recognise non uniform power of 2 constants Since D46637 we are better at handling uniform/non-uniform constant Pow2 detection; this patch tweaks the SLP argument handling to support them. As SLP works with arrays of values I don't think we can easily use the pattern match helpers here. Differential Revision: https://reviews.llvm.org/D48214 llvm-svn: 335621	2018-06-26 16:20:16 +00:00
Sanjay Patel	0fb77c5c0e	[InstCombine] add tests for urem with sext bool divisor; NFC llvm-svn: 335619	2018-06-26 16:01:24 +00:00
Simon Pilgrim	88b1bcbfc3	[DAGCombiner] Pull out VT bitwidth in visitSDIV. NFCI. llvm-svn: 335617	2018-06-26 15:39:16 +00:00
Sanjay Patel	4268f2f25f	[InstSimplify] fold srem with sext bool divisor llvm-svn: 335616	2018-06-26 15:32:54 +00:00
James Henderson	e41aa01cbe	Fix doc title underlining. llvm-svn: 335615	2018-06-26 15:29:09 +00:00
James Henderson	de9948f983	[FileCheck] Add CHECK-EMPTY directive for checking for blank lines Prior to this change, there was no clean way of getting FileCheck to check that a line is completely empty. The expected way of using "CHECK: {{^$}}" does not work because the '^' matches the end of the previous match (this behaviour may be desirable in certain instances). For the same reason, "CHECK-NEXT: {{^$}}" will fail when the previous match was at the end of the line, as the pattern will match there. Using the recommended [[:space:]] to match an explicit new line could also match a space, and thus is not always desired. Literal '\n' matches also do not work. A workaround was suggested in the review, but it is a little clunky. This change adds a new directive that behaves the same as CHECK-NEXT, except that it only matches against empty lines (nothing, not even whitespace, is allowed). As with CHECK-NEXT, it will fail if more than one newline occurs before the next blank line. Example usage: ; test.txt foo bar ; CHECK: foo ; CHECK-EMPTY: ; CHECK-NEXT: bar Differential Revision: https://reviews.llvm.org/D28896 Reviewed by: probinson llvm-svn: 335613	2018-06-26 15:15:45 +00:00
Krzysztof Parzyszek	974db97095	Silence "unused variable" warning in LiveIntervals.cpp after r335607 llvm-svn: 335610	2018-06-26 14:55:04 +00:00
Sanjay Patel	801d9a089b	[InstSimplify] add tests for srem with sext bool divisor; NFC llvm-svn: 335609	2018-06-26 14:47:31 +00:00
Nico Weber	c16b6ae049	Fix LLVM_ENABLE_THREADS=0 builds after r335440. llvm-svn: 335608	2018-06-26 14:42:48 +00:00
Krzysztof Parzyszek	dd966e57ff	Account for undef values from predecessors in extendSegmentsToUses It is legal for a PHI node not to have a live value in a predecessor as long as the end of the predecessor is jointly dominated by an undef value. llvm-svn: 335607	2018-06-26 14:37:16 +00:00
Simon Pilgrim	0c0e1104c8	[TargetLowering] isVectorClearMaskLegal - use ArrayRef<int> instead of const SmallVectorImpl<int>& This is more generic and matches isShuffleMaskLegal. Differential Revision: https://reviews.llvm.org/D48591 llvm-svn: 335605	2018-06-26 14:15:31 +00:00
Than McIntosh	70f3b30d46	[X86,ARM] Retain split-stack prolog check for sibling calls Summary: If a routine with no stack frame makes a sibling call, we need to preserve the stack space check even if the local stack frame is empty, since the call target could be a "no-split" function (in which case the linker needs to be able to fix up the prolog sequence in order to switch to a larger stack). This fixes PR37807. Reviewers: cherry, javed.absar Subscribers: srhines, llvm-commits Differential Revision: https://reviews.llvm.org/D48444 llvm-svn: 335604	2018-06-26 14:11:30 +00:00
Simon Pilgrim	0b0d7a9107	Fix spelling mistakes in comments. NFCI. llvm-svn: 335603	2018-06-26 14:06:23 +00:00

1 2 3 4 5 ...

165880 Commits