llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Chandler Carruth	2993c3a830	[shuffle] Make the seed an optional component and add support for letting the python very directly compute a UUID. llvm-svn: 215533	2014-08-13 10:00:46 +00:00
Chandler Carruth	a501271c18	Revert r215415 which causes MSan to crash on a great deal of C++ code. I've followed up on the original commit as well. llvm-svn: 215532	2014-08-13 09:19:39 +00:00
Chandler Carruth	be6c70dbd4	[shuffle] Teach the shuffle fuzzer to fuzz blends, including forming a tree of inputs to blend iteratively together. This required a pretty substantial rewrite of the innards. The number of shuffle instructions is now bounded in terms of tree-height. There is a flag to disable blends so that it's still possible to test single input shuffles. I've also improved various aspects of how the test program is generated, primarily to simplify the test harness and allow some optimizations to clean up how we actually check the results and build up the inputs. Again, apologies for my likely horrible use of Python... But hey, it works! (Ish?) llvm-svn: 215530	2014-08-13 09:05:59 +00:00
Elena Demikhovsky	cff52c24c9	AVX-512: Fixed a bug in shufflevector lowering. PALIGNR instruction does not exist in AVX-512F set. Added a test. llvm-svn: 215526	2014-08-13 07:58:43 +00:00
Karthik Bhat	d8ea66ecbf	InstCombine: Combine (xor (or %a, %b) (xor %a, %b)) to (and %a, %b) Correctness proof of the transform using CVC3- $ cat t.cvc A, B : BITVECTOR(32); QUERY BVXOR(A | B, BVXOR(A,B) ) = A & B; $ cvc3 t.cvc Valid. llvm-svn: 215524	2014-08-13 05:13:14 +00:00
Hal Finkel	2d0b95ee36	[NVPTX] Remove MemIntrinsicSDNode/MemSDNode duplicate checking As of r214452, isa<MemSDNode> will return true for nodes for which isa<MemIntrinsicSDNode> will return true (classof now respects the actual class hierarchy). So we no longer need to check for both MemIntrinsicSDNode and MemSDNode separately. No functionality change intended. llvm-svn: 215523	2014-08-13 04:59:51 +00:00
Nick Lewycky	8dfcdf9eb2	Fix examples of "named metadata" (some of which isn't named). llvm-svn: 215522	2014-08-13 04:54:05 +00:00
Chandler Carruth	308e776b50	[shuffle] Tweak the shuffle fuzzer to support bigger seeds. I'm currently using UUIDs to seed this in order to scan a bigger range. llvm-svn: 215521	2014-08-13 03:21:11 +00:00
Chandler Carruth	6636ee91c6	[x86] Rewrite a core part of the new vector shuffle lowering to handle one pesky test case correctly. This test case caused the old code to infloop oscillating between solving the low-half and the high-half. The 'side balancing' part of single-input v8 shuffle lowering didn't handle the one pattern which can cause it to oscillate. Fortunately the fuzz testing found this case. Unfortunately it was terrible to handle. I'm really sorry for the amount and density of the code here, I'd love suggestions on how to simplify it. I feel like there must be a simpler form here, but after a lot of days I've not found it. This is the only one I've found that even works. I've added the one pesky test case along with some nice comments explaining the core problem that we have to solve here. So far this has survived approximately 32k test cases. More strenuous fuzzing commencing. llvm-svn: 215519	2014-08-13 01:25:45 +00:00
Hal Finkel	97fb1d4d91	[PowerPC] Implement PPCTargetLowering::getTgtMemIntrinsic This implements PPCTargetLowering::getTgtMemIntrinsic for Altivec load/store intrinsics. As with the construction of the MachineMemOperands for the intrinsic calls used for unaligned load/store lowering, the only slight complication is that we need to represent a larger memory range than the loaded/stored value-type size (because the address is rounded down to an aligned address, and we need to conservatively represent the entire possible range of the actual access). This required adding an extra size field to TargetLowering::IntrinsicInfo, and this was done in a way that required no modifications to other targets (the size defaults to the store size of the provided memory data type). This fixes test/CodeGen/PowerPC/unal-altivec-wint.ll (so it can be un-XFAILed). llvm-svn: 215512	2014-08-13 01:15:40 +00:00
Hal Finkel	ac8c24afbf	Fix classof for ISD::INTRINSIC_W_CHAIN and INTRINSIC_VOID Unfortunately, our use of the SDNode class hierarchy for INTRINSIC_W_CHAIN and INTRINSIC_VOID nodes is somewhat broken right now. These nodes sometimes are used for memory intrinsics (those with MachineMemOperands), and sometimes not. When not, the nodes are not created as instances of MemIntrinsicSDNode, but rather created as some other subclass of SDNode using DAG::getNode. When they are memory intrinsics, they are created using DAG::getMemIntrinsicNode as instances of MemIntrinsicSDNode. MemIntrinsicSDNode is a subclass of MemSDNode, but prior to r214452, we had a non-self-consistent setup whereby MemIntrinsicSDNode::classof on INTRINSIC_W_CHAIN and INTRINSIC_VOID would return true but MemSDNode::classof on INTRINSIC_W_CHAIN and INTRINSIC_VOID would return false. In r214452, MemSDNode::classof was changed to return true for INTRINSIC_W_CHAIN and INTRINSIC_VOID, which is now self-consistent. The problem is that neither the pre-r214452 logic nor the post-r214452 logic is really right. The truth is that not all INTRINSIC_W_CHAIN and INTRINSIC_VOID nodes are instances of MemIntrinsicSDNode (or MemSDNode for that matter), and the return value from classof needs to reflect that. This was broken before r214452 (because MemIntrinsicSDNode::classof always returned true), and was broken afterward (because MemSDNode::classof also always returned true), and will now be correct. The minimal solution is to grab one of the SubclassData bits (there is one left for MemIntrinsicSDNode nodes) and use it to store whether or not a particular INTRINSIC_W_CHAIN or INTRINSIC_VOID is really an instance of MemIntrinsicSDNode. Doing this allows both MemIntrinsicSDNode::classof and MemSDNode::classof to return the correct answer for the underlying object for both the memory-intrinsic and non-memory-intrinsic cases. 
This fixes the problem that r214452 created in the SelectionDAGDumper (thanks to Matt Arsenault for pointing it out). Because PowerPC does not implement getTgtMemIntrinsic, this change breaks test/CodeGen/PowerPC/unal-altivec-wint.ll. I've XFAILed it for now, and will fix it in a follow-up commit. llvm-svn: 215511	2014-08-13 01:15:37 +00:00
Adam Nemet	0585f9d880	[AVX512] Verify the code generated for the intrinsic _mm512_broadcastsd_pd llvm-svn: 215487	2014-08-13 00:30:05 +00:00
David Blaikie	e490f547d3	Fix -Wsign-compare warnings llvm-svn: 215483	2014-08-12 23:23:05 +00:00
Reid Kleckner	3f1f5c1808	APInt: Make self-move-assignment a no-op to fix stage3 clang-cl It's not clear what the semantics of a self-move should be. The consensus appears to be that a self-move should leave the object in a moved-from state, which is what our existing move assignment operator does. However, the MSVC 2013 STL will perform self-moves in some cases. In particular, when doing a std::stable_sort of an already sorted APSInt vector of an appropriate size, one of the merge steps will self-move half of the elements. We don't notice this when building with MSVC, because MSVC will not synthesize the move assignment operator for APSInt. Presumably MSVC does this because APInt, the base class, has user-declared special members that implicitly delete move special members. Instead, MSVC selects the copy-assign operator, which defends against self-assignment. Clang, on the other hand, selects the move-assign operator, and we get garbage APInts. llvm-svn: 215478	2014-08-12 22:01:39 +00:00
Adrian Prantl	7b5b539bcc	Remove a condition that can never be true, as witnessed by the assert above. llvm-svn: 215477	2014-08-12 21:55:58 +00:00
Adam Nemet	6d75e0e06f	[AVX512] Handle valign masking intrinsic via C++ lowering I think that this will scale better in most cases than adding a Pat<> for each mapping from the intrinsic DAG to the instruction (i.e. rri, rrik, rrikz). We can just lower to the SDNode and have the resulting DAG be matched by the DAG patterns. Alternatively (long term), we could keep the Pat<>s but generate them via the new AVX512_masking multiclass. The difficulty is that in order to formulate that we would have to concatenate DAGs. Currently this is only supported if the operators of the input DAGs are identical. llvm-svn: 215473	2014-08-12 21:13:12 +00:00
Matt Arsenault	9822384258	Allow bitcast + struct GEP transform to work with addrspacecast llvm-svn: 215467	2014-08-12 19:46:13 +00:00
Jan Vesely	c9798145af	R600: Use optimized 24bit path in udivrem v2: drop enum keyword use correct extension mode don't bother computing the sign in unsigned case Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215462	2014-08-12 17:31:20 +00:00
Jan Vesely	a72063b855	R600: Remove unused code. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215461	2014-08-12 17:31:19 +00:00
Jan Vesely	73bab311bb	R600: Use i24 optimized path for SREM v2: add tests rename LowerSDIV24 to LowerSDIVREM24 handle the rem part in this function Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215460	2014-08-12 17:31:17 +00:00
Quentin Colombet	384a195a06	Fix a parentheses warning introduced in r215394. llvm-svn: 215459	2014-08-12 17:11:26 +00:00
Duncan P. N. Exon Smith	a26436a32a	Don't upgrade global constructors when reading bitcode An optional third field was added to `llvm.global_ctors` (and `llvm.global_dtors`) in r209015. Most of the code has been changed to deal with both versions of the variables. Users of the C API might create either version, the helper functions in LLVM create the two-field version, and clang now creates the three-field version. However, the BitcodeReader was changed to always upgrade to the three-field version. This created an unnecessary inconsistency in the IR before/after serializing to bitcode. This commit resolves the inconsistency by making the third field truly optional (and not upgrading in the bitcode reader). Since `llvm-link` was relying on this upgrade code, rather than deleting it I've moved it into `ModuleLinker`, where it upgrades these arrays as necessary to resolve inconsistencies between modules. The ideal resolution would be to remove the 2-field version and make the third field required. I filed PR20506 to track that. I changed `test/Bitcode/upgrade-global-ctors.ll` to a negative test and duplicated the `llvm-link` check in `test/Linker/global_ctors.ll` to check both upgrade directions. Since I came across this as part of PR5680 (serializing use-list order), I've also added the missing `verify-uselistorder` RUN line to `test/Bitcode/metadata-2.ll`. llvm-svn: 215457	2014-08-12 16:46:37 +00:00
Sanjay Patel	177d95ef54	fixed typos llvm-svn: 215451	2014-08-12 16:00:06 +00:00
Rafael Espindola	0e5b14c8fa	Make the test a bit more strict. Before it would pass even if @b or @c ended up pointing to a variable named @a123. llvm-svn: 215450	2014-08-12 15:55:27 +00:00
Rafael Espindola	fb9ddaf798	Add a plugin testcase for merging weak variables. I initially thought I could implement COMDATs with aliases by just internalizing GVs instead of dropping them. This is a counter example: Internalizing one of the @a would make @b and @c point to different variables. llvm-svn: 215447	2014-08-12 15:39:14 +00:00
NAKAMURA Takumi	8eeeb20d11	llvm/test/TableGen/Foreach.td: Remove XFAIL:vg_leak. They have not been failing since r215176. llvm-svn: 215445	2014-08-12 14:06:21 +00:00
Toma Tabacu	2ea6736c23	Reverted my "Testing commit access" commit. llvm-svn: 215441	2014-08-12 12:41:44 +00:00
Toma Tabacu	bb28098aab	Testing commit access. llvm-svn: 215440	2014-08-12 12:29:40 +00:00
Tim Northover	3a72195452	llvm-objdump: print contents of MachO __unwind_info sections llvm-svn: 215437	2014-08-12 11:52:59 +00:00
Eric Christopher	fd30e29785	Have MachineRegisterInfo take and store the MachineFunction it was created for rather than the TargetMachine since we only needed the TM for the subtarget and we can get that from the MF. llvm-svn: 215432	2014-08-12 08:00:56 +00:00
Gerolf Hoflehner	59d9e5a101	[MachineCombiner] Fix for ICE bug 20598 The combiner ignored DBG nodes when checking the uses of a virtual register. It combined a sequence like %vreg1 = madd %vreg2, %vreg3,... DBG_VALUE (%vreg1 ...) %vreg4 = add %vreg1,... to %vreg4 = madd %vreg2, %vreg3 leaving behind a dangling DBG_VALUE with a definition. This triggered an assertion in the MachineTraceMetrics.cpp module. llvm-svn: 215431	2014-08-12 07:54:12 +00:00
Justin Bogner	ce04d92d5d	IR: Print a newline when dumping Types Type::dump() doesn't print a newline, which makes for a poor experience in a debugger. This looks like it was an omission considering Value::dump() two lines above, so I've changed Type to add a newline as well. Of the two in-tree callers, one added a newline anyway, and I've updated the other one to use Type::print instead. llvm-svn: 215421	2014-08-12 03:24:59 +00:00
Peter Zotov	637a91873c	[OCaml] Expose Llvm.get_operand_use. Patch by Gabriel Radanne <drupyog@zoho.com> llvm-svn: 215420	2014-08-12 02:55:45 +00:00
Peter Zotov	58cb4e0217	[LLVM-C] Expose User::getOperandUse as LLVMGetOperandUse. Patch by Gabriel Radanne <drupyog@zoho.com> llvm-svn: 215419	2014-08-12 02:55:40 +00:00
Adrian Prantl	c007e0faa6	DebugLocEntry: Restore the comparison predicate from before the refactoring in 215384. This way it can unique multiple entries describing the same piece even if they don't have the exact same location. (The same piece may get merged in and be added from OpenRanges). There ought to be a more elegant solution for this, though. llvm-svn: 215418	2014-08-12 01:07:53 +00:00
Reid Kleckner	0e892accc3	msan: Handle musttail calls First, avoid calling setTailCall(false) on musttail calls. The function prototypes should be "congruent", so the shadow layout should be exactly the same. Second, avoid inserting instrumentation after a musttail call to propagate the return value shadow. We don't need to propagate the result of a tail call, it should already be in the right place. Reviewed By: eugenis Differential Revision: http://reviews.llvm.org/D4331 llvm-svn: 215415	2014-08-12 00:12:43 +00:00
Reid Kleckner	e48bff3246	Move helper for getting a terminating musttail call to BasicBlock No functional change. To be used in future commits that need to look for such instructions. Reviewed By: rafael Differential Revision: http://reviews.llvm.org/D4504 llvm-svn: 215413	2014-08-12 00:05:15 +00:00
David Blaikie	04dd297573	Revert "Partially revert r214761 that asserted that all concrete debug info variables had DIEs, due to a failure on Darwin." I believe this was addressed by r215157 and r215227, so let's have another go at the bots, etc. This reverts commit r214880. llvm-svn: 215412	2014-08-12 00:00:31 +00:00
Quentin Colombet	a09cf4d3ee	[MachineSink] Improve the compile time by preserving the dominance information as long as possible. Context Each time the dominance information is modified, the dominator tree analysis switches to a slow query mode. After a few queries without any modification on the dominator tree, it performs an expensive update of its internal structure to provide fast queries again. Problem Prior to this patch, the MachineSink pass was splitting the critical edges on demand while relying heavily on the dominator tree information. In some cases, this leads to pathological behavior where: - We end up in the slow query mode right after splitting an edge. - We update the dominance information. - We break the dominance information again, thus ending up in the slow query mode and so on. Proposed Solution To mitigate this effect, this patch postpones all the splitting of the edges to the end of each iteration of the main loop. The benefits are: - The dominance information is valid for the lifetime of an iteration. - This simplifies the code as we do not have to specially treat instructions that are sunk on critical edges. Indeed, the related block will be available through the next iteration. The downside is that when edge splitting is required, this incurs an additional iteration of the main loop compared to the previous scheme. Performance Thanks to this patch, the motivating example compiles in 6+ minutes instead of 10+ minutes. No test case added as the motivating example has nothing special about it apart from being huge! I have measured only noise for both the compile time and the runtime on the llvm test-suite + SPECs with Os and O3. Note: The current implementation of MachineBasicBlock::SplitCriticalEdge also uses the dominance information and therefore hits this problem. A subsequent patch will address that. <rdar://problem/17894619> llvm-svn: 215410	2014-08-11 23:52:01 +00:00
Michael J. Spencer	045df0bbd3	[x86] Fold extract_vector_elt of a load into the Load's address computation. llvm-svn: 215409	2014-08-11 23:49:33 +00:00
Adrian Prantl	8221613b32	Add a couple of convenience accessors to DebugLocEntry::Value to further simplify common usage patterns. llvm-svn: 215407	2014-08-11 23:22:59 +00:00
NAKAMURA Takumi	037c4a0074	R600/SIInstrInfo.cpp: Suppress a warning. [-Wunused-variable] llvm-svn: 215406	2014-08-11 23:03:38 +00:00
Quentin Colombet	eb7a255ad0	[ARM] Mark VMOVDRR with the RegSequence property and implement the related target hook. This patch teaches the compiler that: dX = VMOVDRR rY, rZ is the same as: dX = REG_SEQUENCE rY, ssub_0, rZ, ssub_1 <rdar://problem/12702965> llvm-svn: 215404	2014-08-11 22:56:22 +00:00
Adrian Prantl	ed31df5992	Make these DebugLocEntry::Value comparison operators friend functions as suggested by dblaikie in a comment on r215384. llvm-svn: 215403	2014-08-11 22:52:56 +00:00
Jim Grosbach	80cdd8d379	Add missing closing namespace comment. llvm-svn: 215402	2014-08-11 22:42:31 +00:00
Jim Grosbach	6d4234003b	AArch64: Tidy up a few comments. Have the comments match the actual parameter names. Found via clang-tidy. llvm-svn: 215401	2014-08-11 22:42:28 +00:00
David Majnemer	5a64cc5b28	InstCombine: Combine (add (and %a, %b) (or %a, %b)) to (add %a, %b) What follows below is a correctness proof of the transform using CVC3. $ < t.cvc A, B : BITVECTOR(32); QUERY BVPLUS(32, A & B, A | B) = BVPLUS(32, A, B); $ cvc3 < t.cvc Valid. llvm-svn: 215400	2014-08-11 22:32:02 +00:00
Tom Stellard	12111b0735	R600/SI: Add a ComplexPattern for selecting MUBUF _OFFSET variant This saves us from having to copy a 64-bit 0 value into VGPRs for BUFFER_* instruction which only have a 12-bit immediate offset. llvm-svn: 215399	2014-08-11 22:18:17 +00:00
Tom Stellard	fcb2bdc3e4	R600/SI: Add an _OFFEN variant MUBUF_STORE_* and use it for scratch writes llvm-svn: 215398	2014-08-11 22:18:14 +00:00
Tom Stellard	553839dddc	R600/SI: Add check for low 32 bits of encoding to mubuf tests There are no variable values like registers encoded in the low 32 bits of MUBUF instructions, so it is relatively easy to check these bits, and it will help prevent us from introducing encoding bugs. llvm-svn: 215397	2014-08-11 22:18:11 +00:00
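
The two InstCombine entries in this listing (r215524 and r215400) justify their folds with CVC3 queries over 32-bit bitvectors. As a rough cross-check, the same identities can be brute-forced at a smaller width. The sketch below is only an illustration, not part of either commit: the 8-bit width and all function names are assumptions chosen to keep the exhaustive search small. It checks that `(a | b) ^ (a ^ b)` equals `a & b`, as the CVC3 query in r215524 states, and that `(a & b) + (a | b)` equals `a + b` with wrap-around, as in r215400.

```python
from itertools import product

WIDTH = 8  # assumption: 8-bit operands keep the exhaustive search small
MASK = (1 << WIDTH) - 1


def xor_of_or_and_xor(a: int, b: int) -> int:
    """Pattern from r215524: (a | b) ^ (a ^ b)."""
    return ((a | b) ^ (a ^ b)) & MASK


def add_of_and_and_or(a: int, b: int) -> int:
    """Pattern from r215400: (a & b) + (a | b), wrapping at WIDTH bits."""
    return ((a & b) + (a | b)) & MASK


def holds_for_all_pairs(lhs, rhs) -> bool:
    """Compare two binary functions on every pair of WIDTH-bit values."""
    values = range(MASK + 1)
    return all(lhs(a, b) == rhs(a, b) for a, b in product(values, repeat=2))


# The xor pattern folds to a bitwise AND of the operands.
xor_fold_ok = holds_for_all_pairs(xor_of_or_and_xor, lambda a, b: a & b)

# The add pattern folds to a plain (wrapping) addition.
add_fold_ok = holds_for_all_pairs(add_of_and_and_or, lambda a, b: (a + b) & MASK)

# The xor pattern is not equivalent to an addition.
xor_to_add_ok = holds_for_all_pairs(xor_of_or_and_xor, lambda a, b: (a + b) & MASK)
```

An exhaustive 8-bit check is weaker than the 32-bit solver proof quoted in the commit messages, but both identities are bitwise or carry-based and do not depend on the width, so it is a cheap sanity test of which right-hand side a fold should produce.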


106628 Commits