llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 13:02:52 +02:00

Author	SHA1	Message	Date
Matt Arsenault	ba2df7591d	R600/SI: Make argument loads invariant llvm-svn: 214101	2014-07-28 17:31:39 +00:00
Matt Arsenault	76c7b7a591	Add alignment value to allowsUnalignedMemoryAccess Rename to allowsMisalignedMemoryAccess. On R600, 8 and 16 byte accesses are mostly OK with 4-byte alignment, and don't need to be split into multiple accesses. Vector loads with an alignment of the element type are not uncommon in OpenCL code. llvm-svn: 214055	2014-07-27 17:46:40 +00:00
Matt Arsenault	38386c76f2	R600: Move intrinsic lowering to separate functions llvm-svn: 214023	2014-07-26 06:23:37 +00:00
Matt Arsenault	d70c38a67b	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877	2014-07-24 17:10:35 +00:00
Tom Stellard	08b253cba1	R600/SI: Clean up some of the unused REGISTER_{LOAD,STORE} code There are a few more cleanups to do, but I ran into some problems with ext loads and trunc stores, when I tried to change some of the vector loads and stores from custom to legal, so I wasn't able to get rid of everything. llvm-svn: 213552	2014-07-21 15:45:06 +00:00
Tom Stellard	ed0ccca70d	R600/SI: Use scratch memory for large private arrays llvm-svn: 213551	2014-07-21 15:45:01 +00:00
Tom Stellard	5bfbb25d6b	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves use the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530	2014-07-21 14:01:14 +00:00
NAKAMURA Takumi	c89172a790	SIISelLowering.cpp: Define _USE_MATH_DEFINES to let M_PI provided on MS <cmath>. FIXME: Would it be better to move it into configure? llvm-svn: 213477	2014-07-20 11:15:07 +00:00
Matt Arsenault	2d097d5e02	R600/SI: Remove dead code and add missing tests. This probably was killed by some generic DAGCombiner improvements in checking the TargetBooleanContents instead of just 1. llvm-svn: 213471	2014-07-20 06:11:02 +00:00
Matt Arsenault	840d57e330	R600/SI: implement range reduction for sin/cos These instructions can only take a limited input range, and return the constant value 1 out of range. We should do range reduction to be able to process arbitrary values. Use a FRACT instruction after normalization to achieve this. Also add a test for constant folding with the lowered code with unsafe-fp-math enabled. v2: use DAG lowering instead of intrinsic, adapt test v3: calculate constant, fold pattern into instruction definition v4: misc style fixes, add sin-fold testcase, cosmetics Patch by Grigori Goronzy llvm-svn: 213458	2014-07-19 18:44:39 +00:00
Matt Arsenault	15eb0d54b0	R600/SI: Allow using f32 rcp / rsq when denormals not handled. These are precise enough to use for OpenCL unless denormals are handled. llvm-svn: 213107	2014-07-15 23:50:10 +00:00
Matt Arsenault	c093eee935	R600/SI: Fix select on i1 llvm-svn: 213096	2014-07-15 21:44:37 +00:00
Matt Arsenault	1ceb5e82c1	R600/SI: Implement less wrong f32 fdiv Assuming single precision denormals and accurate sqrt/div are not reported, this passes the OpenCL conformance test. llvm-svn: 213089	2014-07-15 20:18:31 +00:00
Matt Arsenault	fdf94244f2	R600: Make ShaderType private llvm-svn: 212896	2014-07-13 03:06:39 +00:00
Jan Vesely	f405b95fd6	R600: Implement float to long/ulong Use alg. from LegalizeDAG.cpp Move Expand setting to SIISellowering v2: Extend existing tests instead of creating new ones v3: use separate LowerFPTOSINT function v4: use TargetLowering::expandFP_TO_SINT add comment about using FP_TO_SINT for uints Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 212773	2014-07-10 22:40:21 +00:00
Matt Arsenault	f39a57f581	R600: Fix mishandling of load / store chains. Fixes various bugs with reordering loads and stores. Scalarized vector loads weren't collecting the chains at all. llvm-svn: 212473	2014-07-07 18:34:45 +00:00
Chandler Carruth	fc0fe5064b	[codegen,aarch64] Add a target hook to the code generator to control vector type legalization strategies in a more fine grained manner, and change the legalization of several v1iN types and v1f32 to be widening rather than scalarization on AArch64. This fixes an assertion failure caused by scalarizing nodes like "v1i32 trunc v1i64". As v1i64 is legal it will fail to scalarize v1i32. This also provides a foundation for other targets to have more granular control over how vector types are legalized. Patch by Hao Liu, reviewed by Tim Northover. I'm committing it to allow some work to start taking place on top of this patch as it adds some really important hooks to the backend that I'd like to immediately start using. =] http://reviews.llvm.org/D4322 llvm-svn: 212242	2014-07-03 00:23:43 +00:00
Tom Stellard	209c137768	R600: Promote i64 loads to v2i32 llvm-svn: 212216	2014-07-02 20:53:54 +00:00
Tom Stellard	840992bb71	R600: Promote i64 stores to v2i32 Now we need only one 64-bit pattern for stores. llvm-svn: 211643	2014-06-24 23:33:04 +00:00
Matt Arsenault	99411cd1b4	R600: Move more out of AMDILISelLowering llvm-svn: 211516	2014-06-23 18:00:44 +00:00
Matt Arsenault	b319678001	R600/SI: Handle i64 sub. We can handle it the same way as add llvm-svn: 211514	2014-06-23 18:00:38 +00:00
Matt Arsenault	2f6589800f	R600: Rename AMDIL file llvm-svn: 211512	2014-06-23 18:00:31 +00:00
Matt Arsenault	48848ba546	R600/SI: Prettier operand printing for 64-bit ops. Copy what is done for 32-bit already so the order is about the same. llvm-svn: 211186	2014-06-18 17:13:51 +00:00
Matt Arsenault	990ee542e5	R600/SI: Temporary fix for f64 fneg This should be a source modifier, but this unblocks most of my math patches. llvm-svn: 211181	2014-06-18 17:05:22 +00:00
Matt Arsenault	a75d166beb	R600/SI: Use v_cvt_f32_ubyte* instructions This eliminates extra extract instructions when loading an i8 vector to a float vector. llvm-svn: 210666	2014-06-11 17:50:44 +00:00
Matt Arsenault	4f96643a42	R600: Use BCNT_INT for evergreen llvm-svn: 210569	2014-06-10 19:18:28 +00:00
Matt Arsenault	6387e9a3dc	R600/SI: Implement i64 ctpop llvm-svn: 210568	2014-06-10 19:18:24 +00:00
Matt Arsenault	8407076508	R600/SI: Use bcnt instruction for ctpop llvm-svn: 210567	2014-06-10 19:18:21 +00:00
Matt Arsenault	5bfef73e00	R600/SI: Handle sign_extend and zero_extend to i64 with patterns. llvm-svn: 210563	2014-06-10 18:54:59 +00:00
Tom Stellard	aab1db4cd9	SelectionDAG: Expand SELECT_CC to SELECT + SETCC This consolidates code from the Hexagon, R600, and XCore targets. No functionality change intended. llvm-svn: 210539	2014-06-10 16:01:22 +00:00
Matt Arsenault	d9ef70c461	Use nullptr llvm-svn: 210222	2014-06-05 00:01:12 +00:00
Matt Arsenault	ad098591b8	Fix typos llvm-svn: 210135	2014-06-03 23:06:13 +00:00
Matt Arsenault	90d0fd2ea0	R600: Add dag combine for BFE llvm-svn: 209461	2014-05-22 18:09:07 +00:00
Tom Stellard	2022c1eb1b	R600/SI: Promote f32 SELECT to i32 llvm-svn: 209024	2014-05-16 20:56:41 +00:00
Matt Arsenault	6a9e6f69e7	Use range for llvm-svn: 208922	2014-05-15 21:44:05 +00:00
Tom Stellard	dbf9b9b7af	R600/SI: Stop using VSrc_* as the default register class for types. We now use SReg_* for integer types and VReg_* for floating-point types. This should help simplify the SIFixSGPRCopies pass and no longer causes ISel to insert a COPY after termiator instuctions that output a value. This change is covered by exisitng tests. llvm-svn: 208888	2014-05-15 14:41:57 +00:00
Vincent Lejeune	03352d8b38	R600/SI: Fold fabs/fneg into src input modifier llvm-svn: 208480	2014-05-10 19:18:39 +00:00
Vincent Lejeune	840594f1e6	R600/SI: Prettier display of input modifiers llvm-svn: 208479	2014-05-10 19:18:33 +00:00
Vincent Lejeune	8467918b43	R600/SI: Use pseudo instruction for fabs/clamp/fneg llvm-svn: 208478	2014-05-10 19:18:25 +00:00
Tom Stellard	83d3208148	R600: Move MIN/MAX matching from LowerOperation() to PerformDAGCombine() llvm-svn: 208429	2014-05-09 16:42:16 +00:00
Tom Stellard	690d34daa4	R600/SI: Use VALU instructions for copying i1 values We can't use SALU instructions for this since they ignore the EXEC mask and are always executed. This fixes several OpenCV tests. llvm-svn: 207661	2014-04-30 15:31:33 +00:00
Tom Stellard	c58951c37c	R600/SI: Custom lower SI_IF and SI_ELSE to avoid machine verifier errors SI_IF and SI_ELSE are terminators which also produce a value. For these instructions ISel always inserts a COPY to move their value to another basic block. This COPY ends up between SI_(IF\|ELSE) and the S_BRANCH* instruction at the end of the block. This breaks MachineBasicBlock::getFirstTerminator() and also the machine verifier which assumes that terminators are grouped together at the end of blocks. To solve this we coalesce the copy away right after ISel to make sure there are no instructions in between terminators at the end of blocks. llvm-svn: 207591	2014-04-29 23:12:53 +00:00
Tom Stellard	cbed2c43ab	R600: Change UDIV/UREM to UDIVREM when legalizing types When legalizing ops, with UDIV/UREM set to expand, they automatically expand to UDIVREM (if legal or custom). We need to do this manually for legalize types. v2: SI should be set to Expand because the type is legal, and it is automatically lowered to UDIVREM if UDIVREM is Legal/Custom R600 should set to UDIV/UREM to Custom because it needs to lower them during type legalization Patch by: Jan Vesely Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 207587	2014-04-29 23:12:43 +00:00
Craig Topper	9683cb114b	Convert more SelectionDAG functions to use ArrayRef. llvm-svn: 207397	2014-04-28 05:57:50 +00:00
Craig Topper	1e0e54db16	Convert SelectionDAG::MorphNodeTo to use ArrayRef. llvm-svn: 207378	2014-04-27 19:21:16 +00:00
Craig Topper	536995c0a7	Convert SelectionDAG::getMergeValues to use ArrayRef. llvm-svn: 207374	2014-04-27 19:20:57 +00:00
Craig Topper	e0741a0fcb	Convert getMemIntrinsicNode to take ArrayRef of SDValue instead of pointer and size. llvm-svn: 207329	2014-04-26 19:29:41 +00:00
Craig Topper	1b1f54bcca	Convert SelectionDAG::getNode methods to use ArrayRef<SDValue>. llvm-svn: 207327	2014-04-26 18:35:24 +00:00
Craig Topper	6d411cb95a	[C++] Use 'nullptr'. Target edition. llvm-svn: 207197	2014-04-25 05:30:21 +00:00
Matt Arsenault	fc2f0d8067	R600/SI: Use address space in allowsUnalignedMemoryAccesses llvm-svn: 207126	2014-04-24 17:08:26 +00:00

1 2 3 4

181 Commits