llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 21:13:02 +02:00

Author	SHA1	Message	Date
Nicolai Haehnle	634941ba37	AMDGPU: llvm.SI.fs.constant is a source of divergence Summary: This intrinsic is used to get flat-shaded fragment shader inputs. Those are uniform across a primitive, but a fragment shader wave may process pixels from multiple primitives (as indicated by the prim_mask), and so that's where divergence can arise. Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19747 llvm-svn: 268259	2016-05-02 17:37:01 +00:00
Derek Schuff	f0cc027b2e	[WebAssembly] Rename memory_size intrinsic to current_memory This follows the recent renaming in the wasm spec. llvm-svn: 268255	2016-05-02 17:25:22 +00:00
Hans Wennborg	b1d0774f3f	[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for empty block including lifetime intrinsics Make it possible that TryToSimplifyUncondBranchFromEmptyBlock merges empty basic block including lifetime intrinsics as well as phi nodes and unconditional branch into its successor or predecessor(s). If successor of empty block has single predecessor, all contents including lifetime intrinsics are sinked into the successor. Otherwise, they are hoisted into its predecessor(s) and then merged into the predecessor(s). Patch by Josh Yoon <josh.yoon@samsung.com>! Differential Revision: http://reviews.llvm.org/D19257 llvm-svn: 268254	2016-05-02 17:22:54 +00:00
Mehdi Amini	7b2daaaedb	Move createReversePostOrderFunctionAttrsPass right after the inliner is done This is where it was originally, until LoopVersioningLICM was inserted before in r259986, I don't believe it was on purpose. Differential Revision: http://reviews.llvm.org/D19809 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268252	2016-05-02 16:53:16 +00:00
Pete Cooper	11e1eedc38	Add llvm-pdbdump to the tool substitutions list in lit. NFC. This adds llvm-pdbdump to the list of tools which get printed with the full path in verbose mode. This makes it easier to take the whole run line from verbose output and run it again without prepending with the builds bin directory. llvm-svn: 268250	2016-05-02 16:51:26 +00:00
Chad Rosier	3e660d9f35	Remove extra whitespace. NFC. llvm-svn: 268248	2016-05-02 16:45:00 +00:00
Tom Stellard	7f58d124e5	AMDGPU/SI: Use hazard recognizer to detect DPP hazards Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18603 llvm-svn: 268247	2016-05-02 16:23:09 +00:00
Sanjay Patel	b2a67e59db	remove blank lines llvm-svn: 268246	2016-05-02 15:49:09 +00:00
Sanjay Patel	e2d5365777	[InstCombine] regenerate checks llvm-svn: 268245	2016-05-02 15:32:10 +00:00
Sanjay Patel	a63214a88b	[InstCombine] regenerate checks llvm-svn: 268244	2016-05-02 15:25:49 +00:00
Sanjay Patel	bf009dccb0	[InstCombine] regenerate checks llvm-svn: 268242	2016-05-02 15:21:41 +00:00
Sanjay Patel	552b31c626	[InstCombine] regenerate checks llvm-svn: 268241	2016-05-02 15:18:13 +00:00
Sanjay Patel	125a1032ac	[InstCombine] regenerate checks llvm-svn: 268239	2016-05-02 15:06:55 +00:00
Sanjay Patel	feadfdaa37	[InstCombine] regenerate checks llvm-svn: 268232	2016-05-02 14:21:55 +00:00
David L Kreitzer	5e9178eeb3	Enable the X86 call frame optimization for the 64-bit targets that allow it. Fixes PR27241. Differential Revision: http://reviews.llvm.org/D19688 llvm-svn: 268227	2016-05-02 13:45:25 +00:00
Jonas Paulsson	ee848f9766	[SystemZ] Temporarily disable codegen test int-add-12.ll. This checks for AGSI transformation, which is temporarily disabled. llvm-svn: 268219	2016-05-02 10:42:47 +00:00
Davide Italiano	92ff603010	[llvm-readobj] Dump hash as part of -version-info. llvm-svn: 268210	2016-05-02 02:30:18 +00:00
Davide Italiano	4d35211d79	[GlobalDCE] Modernize. Use FileCheck instead of grep. llvm-svn: 268207	2016-05-01 22:51:14 +00:00
Simon Pilgrim	b50a7dc851	[InstCombine][SSE] Added support to VPERMD/VPERMPS to shuffle combine to accept UNDEF elements. llvm-svn: 268206	2016-05-01 20:43:02 +00:00
Simon Pilgrim	e0944ffa06	Dropped FIXME comment llvm-svn: 268205	2016-05-01 20:33:25 +00:00
Simon Pilgrim	a6ec426d4d	[InstCombine][SSE] Added support to VPERMILVAR to shuffle combine to accept UNDEF elements. llvm-svn: 268204	2016-05-01 20:22:42 +00:00
Simon Pilgrim	1834e02fd6	[InstCombine][AVX] Fixed PERMILVAR identity tests and added additional decode tests llvm-svn: 268203	2016-05-01 20:06:47 +00:00
Simon Pilgrim	f29cf3f78c	[InstCombine][SSE] Added support to PSHUFB to shuffle combine to accept UNDEF elements. llvm-svn: 268202	2016-05-01 19:26:21 +00:00
Simon Pilgrim	a064ae65ee	[InstCombine][SSE] Regenerate MOVSX/MOVZX tests llvm-svn: 268201	2016-05-01 18:28:45 +00:00
Craig Topper	721f6428df	[AVX512] VPACKUSWB/VPACKSSWB should not be encoded with EVEX.W=1. While there fix the execution domain for VPACKSSDW/VPACKUSDW. llvm-svn: 268200	2016-05-01 17:38:32 +00:00
Simon Pilgrim	b8dbbcff7b	[InstCombine][AVX2] Combine VPERMD/VPERMPS intrinsics with constant masks to shufflevector. llvm-svn: 268199	2016-05-01 16:41:22 +00:00
Igor Breger	fa752e801d	getelementptr instruction, support index vector of EVT. Differential Revision: http://reviews.llvm.org/D19775 llvm-svn: 268195	2016-05-01 13:29:12 +00:00
Igor Breger	a0208b4462	Change AVX512 braodcastsd/ss patterns interaction with spilling . New implementation take a scalar register and generate a vector without COPY_TO_REGCLASS (turn it into a VR128 register ) .The issue is that during register allocation we may spill a scalar value using 128-bit loads and stores, wasting cache bandwidth. Differential Revision: http://reviews.llvm.org/D19579 llvm-svn: 268190	2016-05-01 08:40:00 +00:00
Craig Topper	5387c11293	[AVX512] Prefer AVX512 VPACK instructions over AVX/AVX2 instructions when VLX and BWI are supported. llvm-svn: 268189	2016-05-01 06:52:19 +00:00
Sanjoy Das	ab0e0b65fa	[SCEV] When printing via -analysis, dump loop disposition There are currently some bugs in tree around SCEV caching an incorrect loop disposition. Printing out loop dispositions will let us write whitebox tests as those are fixed. The dispositions are printed as a list in "inside out" order, i.e. innermost loop first. llvm-svn: 268177	2016-05-01 04:51:05 +00:00
Simon Pilgrim	599ad3c0a2	[InstCombine][AVX2] Added VPERMD/VPERMPS shuffle combining placeholder tests. For future support for VPERMD/VPERMPS to generic shuffles combines llvm-svn: 268166	2016-04-30 20:41:52 +00:00
Simon Pilgrim	d377dbc835	[InstCombine][AVX] Split off VPERMILVAR tests and added additional tests for UNDEF mask elements llvm-svn: 268159	2016-04-30 07:32:19 +00:00
Tom Stellard	6245c9db08	AMDGPU/SI: Remove wait state handling for SMRD in SIInsertWaits This was supposed to be part of r268143. llvm-svn: 268154	2016-04-30 04:04:48 +00:00
Amjad Aboud	bc689dbffb	Reverting 268054 & 268063 as they caused PR27579. llvm-svn: 268150	2016-04-30 01:44:07 +00:00
Sanjoy Das	0d28098245	[LowerGuardIntrinsics] Keep track of !make.implicit metadata If a guard call being lowered by LowerGuardIntrinsics has the `!make.implicit` metadata attached, then reattach the metadata to the branch in the resulting expanded form of the intrinsic. This allows us to implement null checks as guards and still get the benefit of implicit null checks. llvm-svn: 268148	2016-04-30 00:55:59 +00:00
Lawrence Hu	840b3f8ac8	Reroll loops with multiple IV and negative step part 3 support multiple induction variables This patch enable loop reroll for the following case: for(int i=0; i<N; i += 2) { S += a++; S += a++; }; Differential Revision: http://reviews.llvm.org/D16550 llvm-svn: 268147	2016-04-30 00:51:22 +00:00
Tom Stellard	51b37329c1	AMDGPU/SI: Enable the post-ra scheduler Summary: This includes a hazard recognizer implementation to replace some of the hazard handling we had during frame index elimination. Reviewers: arsenm Subscribers: qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18602 llvm-svn: 268143	2016-04-30 00:23:06 +00:00
Sanjoy Das	cb22cc23b3	[LowerGuardIntrinsics] Preserve calling conv when lowering llvm-svn: 268142	2016-04-30 00:17:47 +00:00
Sanjay Patel	0edaaaca54	add minimal test to show dropped metadata llvm-svn: 268141	2016-04-30 00:12:54 +00:00
Sanjay Patel	4e69043b25	remove the metadata added with r267827 We can demonstrate the 'select' bug and fix with a simpler test case. The merged weight values are already tested in another test. llvm-svn: 268139	2016-04-30 00:02:36 +00:00
Sanjoy Das	42c6e9b9ce	Mark guards on true as "trivially dead" This moves some logic added to EarlyCSE in rL268120 into `llvm::isInstructionTriviallyDead`. Adds a test case for DCE to demonstrate that passes other than EarlyCSE can now pick up on the new information. llvm-svn: 268126	2016-04-29 22:23:16 +00:00
Haicheng Wu	611abb7dc9	[MBP] Use Function::optForSize() instead of checking OptimizeForSize directly. Fix a FIXME. Disable loop alignment if compiled with -Oz now. llvm-svn: 268121	2016-04-29 22:01:10 +00:00
Sanjoy Das	69de617a5f	[EarlyCSE] Simplify guard intrinsics Summary: This change teaches EarlyCSE some basic properties of guard intrinsics: - Guard intrinsics read all memory, but don't write to any memory - After a guard has executed, the condition it was guarding on can be assumed to be true - Guard intrinsics on a constant `true` are no-ops Reviewers: reames, hfinkel Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19578 llvm-svn: 268120	2016-04-29 21:52:58 +00:00
Matt Arsenault	42ea6294ae	AMDGPU: Fix crash with unreachable terminators. If a block has no successors because it ends in unreachable, this was accessing an invalid iterator. Also stop counting instructions that don't emit any real instructions. llvm-svn: 268119	2016-04-29 21:52:13 +00:00
Sriraman Tallam	899df7646a	Differential Revision: http://reviews.llvm.org/D19733 llvm-svn: 268106	2016-04-29 21:19:16 +00:00
Matt Arsenault	87a15c33eb	AMDGPU: Add kernarg.segment.ptr intrinsic llvm-svn: 268105	2016-04-29 21:16:52 +00:00
Chad Rosier	5fffe70b4f	[InstCombine] Determine the result of a select based on a dominating condition. Differential Revision: http://reviews.llvm.org/D19550 llvm-svn: 268104	2016-04-29 21:12:31 +00:00
Matt Arsenault	1e65ead116	DAGCombiner: Reduce truncated shl width llvm-svn: 268094	2016-04-29 19:53:16 +00:00
David Majnemer	8fe9b6cacf	[ValueTracking] matchSelectPattern needs to be more careful around FP matchSelectPattern attempts to see through casts which mask min/max patterns from being more obvious. Under certain circumstances, it would misidentify a sequence of instructions as a min/max because it assumed that folding casts would preserve the result. This is not the case for floating point <-> integer casts. This fixes PR27575. llvm-svn: 268086	2016-04-29 18:40:34 +00:00
Artem Tamazov	478021659a	[AMDGPU][llvm-mc] Add some missing testcases to trap.s Differential Revision: http://reviews.llvm.org/D19602 llvm-svn: 268073	2016-04-29 17:41:44 +00:00

1 2 3 4 5 ...

36107 Commits