llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Matt Arsenault	ea70093fdd	R600/SI: Fix formatting. Avoid weird line wrapping of BuildMI dest register. llvm-svn: 214608	2014-08-02 01:10:28 +00:00
Chandler Carruth	697d6af472	[SDAG] MorphNodeTo recursively deletes dead operands of the old formulation of the node, which isn't really the desired behavior from within the combiner or legalizer, but is necessary within ISel. I've added a hopefully helpful comment and fixed the only two places where this took place. Yet another step toward the combiner and legalizer not needing to use update listeners with virtual calls to manage the worklists behind legalization and combining. llvm-svn: 214574	2014-08-01 22:09:43 +00:00
Tom Stellard	2e31693e97	Revert "R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp" This reverts commit r214566. I did not mean to commit this yet. llvm-svn: 214572	2014-08-01 21:55:50 +00:00
Tom Stellard	db07c33258	R600/SI: Remove leftover debugging code llvm-svn: 214569	2014-08-01 21:51:05 +00:00
Tom Stellard	150fd6c318	R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp SI doesn't use REGISTER_LOAD anymore, but it was still hitting this code path for 8-bit and 16-bit private loads. llvm-svn: 214566	2014-08-01 21:50:47 +00:00
Matt Arsenault	550a31beed	R600/SI: Don't display GDS bit for read2 This isn't displayed for any other instructions anymore, and isn't ever used. llvm-svn: 214523	2014-08-01 17:00:26 +00:00
Tom Stellard	fd915598f1	R600/SI: Fix build warning llvm-svn: 214475	2014-08-01 02:05:57 +00:00
Tom Stellard	827479f1ca	R600/SI: Do abs/neg folding with ComplexPatterns Abs/neg folding has moved out of foldOperands and into the instruction selection phase using complex patterns. As a consequence of this change, we now prefer to select the 64-bit encoding for most instructions and the modifier operands have been dropped from integer VOP3 instructions. llvm-svn: 214467	2014-08-01 00:32:39 +00:00
Tom Stellard	f52d670860	R600/SI: Simplify and fix handling of VOP2 in SIInstrInfo::legalizeOperands We were incorrectly assuming that all VOP2 instructions can read SGPRs in Src0, but this is not true for instructions that read carry-in from VCC. The old logic has been replaced with new logic which checks the defined register classes of the VOP2 instruction to determine whether or not to legalize the operands. llvm-svn: 214465	2014-08-01 00:32:35 +00:00
Tom Stellard	313fcff563	R600/SI: Fold immediates when shrinking instructions This will prevent us from using extra MOV instructions once we prefer selecting 64-bit instructions. llvm-svn: 214464	2014-08-01 00:32:33 +00:00
Tom Stellard	3ac3ae86a9	R600/SI: Fix incorrect commute operation in shrink instructions pass We were commuting the instruction by still shrinking it using the original opcode. NOTE: This is a candidate for the 3.5 branch. llvm-svn: 214463	2014-08-01 00:32:28 +00:00
Louis Gerbarg	8048e52537	Make sure no loads resulting from load->switch DAGCombine are marked invariant Currently when DAGCombine converts loads feeding a switch into a switch of addresses feeding a load the new load inherits the isInvariant flag of the left side. This is incorrect since invariant loads can be reordered in cases where it is illegal to reorder normal loads. This patch adds an isInvariant parameter to getExtLoad() and updates all call sites to pass in the data if they have it or false if they don't. It also changes the DAGCombine to use that data to make the right decision when creating the new load. llvm-svn: 214449	2014-07-31 21:45:05 +00:00
Matt Arsenault	a82949eb53	R600/SI: Remove redundant setting of bits on instructions. neverHasSideEffects is deprecated, and hasSideEffects = 0 is already set on the base classes of the basic ALU instruction classes. The base classes also already set mayLoad = 0 and mayStore = 0 llvm-svn: 214283	2014-07-30 03:18:57 +00:00
Matt Arsenault	d8f2b465a3	R600/SI: Consider adjacent offsets in getLdStBaseRegImmOfs We can treat ds_read2_* as a single offset if the offsets are adjacent. No test since emission of read2 instructions for partially aligned loads isn't implemented yet. llvm-svn: 214269	2014-07-30 01:01:10 +00:00
Matt Arsenault	ccbab61a53	R600/SI: Implement getLdStBaseRegImmOfs llvm-svn: 214225	2014-07-29 21:34:55 +00:00
Matt Arsenault	d64b11dc6c	R600/SI: Enable named operand table for DS instructions llvm-svn: 214217	2014-07-29 21:00:56 +00:00
Matt Arsenault	5d4212bf5b	Remove line with no effect llvm-svn: 214216	2014-07-29 21:00:53 +00:00
Matt Arsenault	001ee9417a	R600/SI: Add isMUBUF / isMTBUF Also add missing comments about how the flags work. llvm-svn: 214195	2014-07-29 18:51:56 +00:00
Matt Arsenault	751b041f67	R600/SI: Set bits on SMRD instructions Set mayStore = 0 and enable named operand table. llvm-svn: 214194	2014-07-29 18:51:54 +00:00
Matt Arsenault	55d94a2290	Fix typos / grammar. llvm-svn: 214147	2014-07-29 00:02:40 +00:00
Matt Arsenault	8eaa31c417	Fix header including itself llvm-svn: 214146	2014-07-29 00:02:37 +00:00
Matt Arsenault	feabef83e1	R600/SI: Fix return type for isMIMG / isSMRD All the others use bool, so these should too. llvm-svn: 214106	2014-07-28 17:59:38 +00:00
Matt Arsenault	1c1d6d00fc	R600/SI: Implement getOptimalMemOpType The default guess uses i32. This needs an address space argument to really do the right thing in all cases. llvm-svn: 214104	2014-07-28 17:49:26 +00:00
Matt Arsenault	ba2df7591d	R600/SI: Make argument loads invariant llvm-svn: 214101	2014-07-28 17:31:39 +00:00
Matt Arsenault	76c7b7a591	Add alignment value to allowsUnalignedMemoryAccess Rename to allowsMisalignedMemoryAccess. On R600, 8 and 16 byte accesses are mostly OK with 4-byte alignment, and don't need to be split into multiple accesses. Vector loads with an alignment of the element type are not uncommon in OpenCL code. llvm-svn: 214055	2014-07-27 17:46:40 +00:00
Matt Arsenault	38386c76f2	R600: Move intrinsic lowering to separate functions llvm-svn: 214023	2014-07-26 06:23:37 +00:00
Matt Arsenault	860caaf266	R600/SI: Allow partial unrolling and increase thresholds. llvm-svn: 213985	2014-07-25 23:02:42 +00:00
Eric Christopher	eb42a5cd1a	Move R600 subtarget dependent variables onto the subtarget. No functional change. llvm-svn: 213982	2014-07-25 22:22:39 +00:00
Chandler Carruth	ad37c4b412	[SDAG] Enable the new assert for out-of-range result numbers in SDValues, fixing the two bugs left in the regression suite. The key for both of these was the use of a single value type rather than a VTList which caused an unintentionally single-result merge-value node. Fix this by getting the appropriate VTList in place. Doing this exposed that the comments in x86's code about how MUL_LOHI operands are handled are wrong. The bug with the use of out-of-range result numbers was hiding the bug about the order of operands here (as best I can tell). There are more places where the code appears to get this backwards still... llvm-svn: 213931	2014-07-25 09:19:23 +00:00
Matt Arsenault	cfdaa492e4	R600: Add FMA instructions for Evergreen llvm-svn: 213882	2014-07-24 17:41:01 +00:00
Matt Arsenault	d70c38a67b	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877	2014-07-24 17:10:35 +00:00
Matt Arsenault	0a2084b376	R600: Match rcp node on pre-SI llvm-svn: 213844	2014-07-24 06:59:24 +00:00
Matt Arsenault	34661ee593	R600: Fix LowerSDIV24 Use ComputeNumSignBits instead of checking for i8 / i16 which only worked when AMDIL was lying about having legal i8 / i16. If an integer is known to fit in 24-bits, we can do division faster with float ops. llvm-svn: 213843	2014-07-24 06:59:20 +00:00
Matt Arsenault	ef91582cd5	R600: Implement enableClusterLoads() llvm-svn: 213831	2014-07-24 02:10:17 +00:00
Saleem Abdulrasool	29ded73977	R600: silence GCC warning GCC believes it may be possible to not return a value from the switch: lib/Target/R600/SIRegisterInfo.cpp:187:1: warning: control reaches end of non-void function [-Wreturn-type] Add an unreachable label to indicate that this is not possible and still permit switch coverage checking. llvm-svn: 213572	2014-07-21 17:52:00 +00:00
Tom Stellard	76787b17cd	R600/SI: Refactor VOP3 instruction definitions llvm-svn: 213571	2014-07-21 17:44:29 +00:00
Tom Stellard	e0e9c08a04	R600/SI: Separate encoding and operand definitions into their own classes llvm-svn: 213570	2014-07-21 17:44:28 +00:00
Tom Stellard	acd6079121	R600/SI: Initialize encoding fields of unused VOP3 modifiers to 0 llvm-svn: 213564	2014-07-21 17:12:40 +00:00
Tom Stellard	1906597940	R600/SI: Initialize unused VOP3 sources to 0 instead of SIOperand.ZERO llvm-svn: 213563	2014-07-21 17:12:37 +00:00
Tom Stellard	7c4b3a94b6	R600/SI: Add instruction shrinking pass This pass converts 64-bit instructions to 32-bit when possible. llvm-svn: 213561	2014-07-21 16:55:33 +00:00
Tom Stellard	e081fd12ae	R600/SI: VOPC instructions explicitly define VCC Therefore we don't need to add it to the implicit defs list. llvm-svn: 213558	2014-07-21 16:27:24 +00:00
Tom Stellard	08b253cba1	R600/SI: Clean up some of the unused REGISTER_{LOAD,STORE} code There are a few more cleanups to do, but I ran into some problems with ext loads and trunc stores, when I tried to change some of the vector loads and stores from custom to legal, so I wasn't able to get rid of everything. llvm-svn: 213552	2014-07-21 15:45:06 +00:00
Tom Stellard	ed0ccca70d	R600/SI: Use scratch memory for large private arrays llvm-svn: 213551	2014-07-21 15:45:01 +00:00
Tom Stellard	85b5d9062d	R600/SI: Specify wavefront size for SI and CI llvm-svn: 213550	2014-07-21 15:44:58 +00:00
Tom Stellard	7a57564546	R600/SI: Remove vaddr operand from BUFFER_LOAD_*_OFFSET instructions This operand is never used. llvm-svn: 213549	2014-07-21 15:44:55 +00:00
Tom Stellard	5bfbb25d6b	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves us the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530	2014-07-21 14:01:14 +00:00
Tom Stellard	cc6c170604	R600/SI: Add isCFDepth0 Predicate to SALU addc pattern llvm-svn: 213529	2014-07-21 14:01:12 +00:00
Tom Stellard	7f35eb40ab	R600/SI: Use VALU for i1 XOR llvm-svn: 213528	2014-07-21 14:01:10 +00:00
Tom Stellard	7efced1747	R600/SI: Use a custom encoding method for simm16 in SOPP branch instructions This allows us to explicitly define the type of fixup that is needed, so we can distinguish this from future fixup types. llvm-svn: 213527	2014-07-21 14:01:08 +00:00
Tom Stellard	cecb51057f	R600/SI: Rename SOPP operands to match the encoding fields llvm-svn: 213526	2014-07-21 14:01:05 +00:00
NAKAMURA Takumi	c89172a790	SIISelLowering.cpp: Define _USE_MATH_DEFINES to let M_PI provided on MS <cmath>. FIXME: Would it be better to move it into configure? llvm-svn: 213477	2014-07-20 11:15:07 +00:00
Matt Arsenault	2c757a31e5	R600: Remove unused function llvm-svn: 213472	2014-07-20 06:31:06 +00:00
Matt Arsenault	2d097d5e02	R600/SI: Remove dead code and add missing tests. This probably was killed by some generic DAGCombiner improvements in checking the TargetBooleanContents instead of just 1. llvm-svn: 213471	2014-07-20 06:11:02 +00:00
Matt Arsenault	056a41085d	Revert accidentally committed r213459 llvm-svn: 213461	2014-07-19 19:17:33 +00:00
Matt Arsenault	893e8b7516	XXX - Increase unroll threshold llvm-svn: 213459	2014-07-19 19:16:34 +00:00
Matt Arsenault	840d57e330	R600/SI: implement range reduction for sin/cos These instructions can only take a limited input range, and return the constant value 1 out of range. We should do range reduction to be able to process arbitrary values. Use a FRACT instruction after normalization to achieve this. Also add a test for constant folding with the lowered code with unsafe-fp-math enabled. v2: use DAG lowering instead of intrinsic, adapt test v3: calculate constant, fold pattern into instruction definition v4: misc style fixes, add sin-fold testcase, cosmetics Patch by Grigori Goronzy llvm-svn: 213458	2014-07-19 18:44:39 +00:00
Matt Arsenault	53e4a5d4a3	R600: Implement a few simple TTI queries. I'm not sure if these have any effect right now. llvm-svn: 213455	2014-07-19 18:15:16 +00:00
Tim Northover	854fe649af	R600: support fpext/fptrunc operations to and from f16. llvm-svn: 213376	2014-07-18 13:01:37 +00:00
Tim Northover	de7867151d	R600: support f16 -> f64 conversion intrinsic. Unfortunately, we don't seem to have a direct truncation, but the extension can be legally split into two operations so we should support that. llvm-svn: 213357	2014-07-18 08:43:24 +00:00
Matt Arsenault	b71d4aab48	R600: Implement TTI:getPopcntSupport The test is just copied from X86, and I don't know of a better way to test it. llvm-svn: 213351	2014-07-18 06:07:13 +00:00
Matt Arsenault	0968995f30	Fix typos llvm-svn: 213285	2014-07-17 17:50:22 +00:00
Tim Northover	eae1f1c8cc	CodeGen: extend f16 conversions to permit types > float. This makes the two intrinsics @llvm.convert.from.f16 and @llvm.convert.to.f16 accept types other than simple "float". This is only strictly needed for the truncate operation, since otherwise double rounding occurs and there's no way to represent the strict IEEE conversion. However, for symmetry we allow larger types in the extend too. During legalization, we can expand an "fp16_to_double" operation into two extends for convenience, but abort when the truncate isn't legal. A new libcall is probably needed here. Even after this commit, various target tweaks are needed to actually use the extended intrinsics. I've put these into separate commits for clarity, so there are no actual tests of f64 conversion here. llvm-svn: 213248	2014-07-17 10:51:23 +00:00
Matt Arsenault	45d9529fe9	Use range for llvm-svn: 213230	2014-07-17 06:19:06 +00:00
Matt Arsenault	54393bb30e	R600: Short circuit alloca check if address space isn't private. Skip calling GetUnderlyingObject in cases where it obviously isn't from an alloca. This should only be a compile time improvement. llvm-svn: 213229	2014-07-17 06:13:41 +00:00
Matt Arsenault	15eb0d54b0	R600/SI: Allow using f32 rcp / rsq when denormals not handled. These are precise enough to use for OpenCL unless denormals are handled. llvm-svn: 213107	2014-07-15 23:50:10 +00:00
Matt Arsenault	c093eee935	R600/SI: Fix select on i1 llvm-svn: 213096	2014-07-15 21:44:37 +00:00
Matt Arsenault	1ceb5e82c1	R600/SI: Implement less wrong f32 fdiv Assuming single precision denormals and accurate sqrt/div are not reported, this passes the OpenCL conformance test. llvm-svn: 213089	2014-07-15 20:18:31 +00:00
Matt Arsenault	42709b4212	R600: Add predicate for UnsafeFPMath llvm-svn: 213088	2014-07-15 20:18:24 +00:00
Matt Arsenault	b916da43e5	R600: Remove intrinsics that appear to be unused llvm-svn: 213087	2014-07-15 20:10:27 +00:00
Jan Vesely	aa9875787e	R600: Implement zero undef variants of ctlz/cttz v2: use ffbh/l if available v3: Rebase on top of Matt's SI patches Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 213072	2014-07-15 15:51:09 +00:00
NAKAMURA Takumi	365309bb8b	Prune Redundant libdeps in CMake's target_link_libraries and LLVMBuild.txt. I checked this with Release+Asserts on x86_64-mingw32. Please restore partially if this were overkill. llvm-svn: 213064	2014-07-15 11:37:03 +00:00
Matt Arsenault	211ccabffb	R600: Add dag combine for copy of an illegal type. This helps avoid redundant instructions to unpack, and repack the vectors. Ideally we could recognize that pattern and eliminate it. Currently v4i8 and other small element type vectors are scalarized, so this has the added bonus of avoiding that. llvm-svn: 213031	2014-07-15 02:06:31 +00:00
Matt Arsenault	24911cb984	R600: Add denormal handling subtarget features. llvm-svn: 213018	2014-07-14 23:40:49 +00:00
Matt Arsenault	62262e12fa	R600/SI: Default to no single precision denormals. llvm-svn: 213017	2014-07-14 23:40:43 +00:00
Matt Arsenault	fcf9b46646	Remove unused include llvm-svn: 212898	2014-07-13 03:08:59 +00:00
Matt Arsenault	66d430cbc3	R600: Use range for and fix missing consts. llvm-svn: 212897	2014-07-13 03:06:43 +00:00
Matt Arsenault	fdf94244f2	R600: Make ShaderType private llvm-svn: 212896	2014-07-13 03:06:39 +00:00
Matt Arsenault	235492a3e5	R600: Add option to disable promote alloca This can make writing some tests harder, so add a flag to disable it. llvm-svn: 212893	2014-07-13 02:08:26 +00:00
Marek Olsak	7757b563f1	R600/SI: Use i32 vectors for resources and samplers This affects new intrinsics only. What surprises me is that v32i8 still works. llvm-svn: 212831	2014-07-11 17:11:52 +00:00
Marek Olsak	6db1789f95	R600/SI: add sample and image intrinsics exposing all instruction fields We need the intrinsics with offsets, so why not just add them all. The R128 parameter will also be useful for reducing SGPR usage. GL_ARB_image_load_store also adds some image GLSL modifiers like "coherent", so Mesa will probably translate those to slc, glc, etc. When LLVM 3.5 is released, I'll switch Mesa to these new intrinsics. llvm-svn: 212830	2014-07-11 17:11:46 +00:00
Marek Olsak	5ce87c6a53	R600/SI: fix shadow mapping for 1D and 2D array textures It was conflicting with def TEX_SHADOW_ARRAY, which also handles them. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 212829	2014-07-11 17:11:39 +00:00
Jan Vesely	f405b95fd6	R600: Implement float to long/ulong Use alg. from LegalizeDAG.cpp Move Expand setting to SIISellowering v2: Extend existing tests instead of creating new ones v3: use separate LowerFPTOSINT function v4: use TargetLowering::expandFP_TO_SINT add comment about using FP_TO_SINT for uints Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 212773	2014-07-10 22:40:21 +00:00
Matt Arsenault	834bcebaa6	R600/SI: Add support for llvm.convert.{to|from}.fp16 llvm-svn: 212676	2014-07-10 03:22:20 +00:00
Matt Arsenault	f39a57f581	R600: Fix mishandling of load / store chains. Fixes various bugs with reordering loads and stores. Scalarized vector loads weren't collecting the chains at all. llvm-svn: 212473	2014-07-07 18:34:45 +00:00
Matt Arsenault	98979d7f5f	Fix typo, weird indentation llvm-svn: 212472	2014-07-07 18:34:42 +00:00
Matt Arsenault	5c86fb1055	Use cast<> instead of dyn_cast + assert llvm-svn: 212380	2014-07-05 21:16:43 +00:00
Matt Arsenault	d1cb139e25	Fix grammar llvm-svn: 212379	2014-07-05 21:16:40 +00:00
Chandler Carruth	fc0fe5064b	[codegen,aarch64] Add a target hook to the code generator to control vector type legalization strategies in a more fine grained manner, and change the legalization of several v1iN types and v1f32 to be widening rather than scalarization on AArch64. This fixes an assertion failure caused by scalarizing nodes like "v1i32 trunc v1i64". As v1i64 is legal it will fail to scalarize v1i32. This also provides a foundation for other targets to have more granular control over how vector types are legalized. Patch by Hao Liu, reviewed by Tim Northover. I'm committing it to allow some work to start taking place on top of this patch as it adds some really important hooks to the backend that I'd like to immediately start using. =] http://reviews.llvm.org/D4322 llvm-svn: 212242	2014-07-03 00:23:43 +00:00
Tom Stellard	e220f55c61	R600: Add a comment that llvm.AMDGPU.trunc is a legacy intrinsic llvm-svn: 212218	2014-07-02 20:53:57 +00:00
Tom Stellard	c4ab9c96da	R600/SI: Use a ComplexPattern for ADDR64 addressing of MUBUF loads llvm-svn: 212217	2014-07-02 20:53:56 +00:00
Tom Stellard	209c137768	R600: Promote i64 loads to v2i32 llvm-svn: 212216	2014-07-02 20:53:54 +00:00
Tom Stellard	5343a390b0	R600/SI: Adjust SGPR live ranges before register allocation SGPRs are written by instructions that sometimes will ignore control flow, which means if you have code like: if (VGPR0) { SGPR0 = S_MOV_B32 0 } else { SGPR0 = S_MOV_B32 1 } The value of SGPR0 will be 1 no matter what the condition is. In order to deal with this situation correctly, we need to view the program as if it were a single basic block when we calculate the live ranges for the SGPRs. The way we actually update the live range is by iterating over all of the segments in each LiveRange object and setting the end of each segment equal to the start of the next segment. So a live range like: [3888r,9312r:0)[10032B,10384B:0) 0@3888r will become: [3888r,10032B:0)[10032B,10384B:0) 0@3888r This change will allow us to use SALU instructions within branches. llvm-svn: 212215	2014-07-02 20:53:48 +00:00
Tom Stellard	1f2dabfbae	R600/SI: Add verifier check for immediates in register operands. llvm-svn: 212214	2014-07-02 20:53:44 +00:00
Matt Arsenault	6aca178583	R600: Fix crashes when an illegal type load or store is not handled. I don't think anything hits this now, but will be exposed in future patches. llvm-svn: 212197	2014-07-02 17:44:53 +00:00
Matt Arsenault	09212562b8	R600: Move mul combine to separate function llvm-svn: 212052	2014-06-30 17:55:48 +00:00
Matt Arsenault	add99aadc1	R600: Remove unused declarations leftover from AMDIL llvm-svn: 212051	2014-06-30 17:37:17 +00:00
Craig Topper	4c15d35f50	Add ops() method to SDNode that returns an ArrayRef<SDUse>. Use it to simplify some code. llvm-svn: 211993	2014-06-29 00:40:57 +00:00
Matt Arsenault	5e70db4151	R600: Move trivial getters into header, use initializer list llvm-svn: 211917	2014-06-27 17:57:00 +00:00
Matt Arsenault	128df7aaf1	R600: Don't crash on unhandled instruction in promote alloca llvm-svn: 211906	2014-06-27 16:52:49 +00:00
Matt Arsenault	61dec09b87	Fix missing newline and simplify debug printing. llvm-svn: 211850	2014-06-27 02:36:59 +00:00
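
The live-range adjustment described in the 5343a390b0 entry above (llvm-svn: 212215) can be illustrated with a small sketch. This is not the LLVM implementation: the function name `extend_segments` is hypothetical, segments are modelled as plain `(start, end)` integer pairs, and the slot-kind suffixes (`r`, `B`) and value numbers from the commit message are dropped as a simplifying assumption.

```python
def extend_segments(segments):
    """Stretch each segment so it ends where the next one starts.

    `segments` is a list of (start, end) pairs sorted by start,
    standing in for the segments of one live range (hypothetical model).
    The last segment is left unchanged because it has no successor.
    """
    extended = []
    for i, (start, end) in enumerate(segments):
        if i + 1 < len(segments):
            # End of this segment becomes the start of the next segment.
            end = segments[i + 1][0]
        extended.append((start, end))
    return extended


# Slot indices taken from the example in the commit message.
before = [(3888, 9312), (10032, 10384)]
after = extend_segments(before)
print(after)
```

With the indices from the commit message, the first segment is extended from 9312 to 10032 and the second is untouched, which matches the before/after live range quoted in that entry.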
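
The sin/cos range reduction mentioned in the 840d57e330 entry above (llvm-svn: 213458) normalizes the input and then takes its fractional part. The sketch below models only that arithmetic, under the assumption that normalization means dividing by 2π; the helper names `fract` and `reduced_sin` are hypothetical, and the code does not reproduce the backend's DAG lowering.

```python
import math


def fract(x):
    """Fractional part in [0, 1), modelling a FRACT-style operation."""
    return x - math.floor(x)


def reduced_sin(x):
    """Sine computed after reducing the argument to a single period."""
    turns = x / (2.0 * math.pi)      # normalize: radians -> revolutions
    reduced = fract(turns)           # keep only the position within one period
    return math.sin(reduced * 2.0 * math.pi)


# The reduced form should agree with the direct computation for large inputs.
print(abs(reduced_sin(1000.0) - math.sin(1000.0)) < 1e-9)
```

Because sine is periodic, discarding the whole number of revolutions leaves the result unchanged while keeping the value passed to the underlying sine within one period.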

1161 Commits