llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-27 22:12:47 +01:00

Author	SHA1	Message	Date
Elena Demikhovsky	3492489267	X86 intrinsics moved form X86ISelLowering.cpp to X86IntrinsicsInfo.h X86ISelLowering.cpp has a long switch for intrinsics. I moved a part of this long switch to the new intrinsics table in X86IntrinsicsInfo.h. No functional changes, just code and compile time optimization. llvm-svn: 223641	2014-12-08 09:03:08 +00:00
NAKAMURA Takumi	953adcfdf0	Revert a part of r223583, for now. It seems causing different emission between stage2(gcc-clang) and stage3 clang. Investigating. llvm-svn: 223624	2014-12-08 02:07:22 +00:00
Duncan P. N. Exon Smith	a6e04674e3	IR: Revert r223618 behaviour of MDNode::concatenate() r223618 including special handling of `MDNode::intersect()`: if the first operand is a self-reference with the same operands you're trying to return, return it instead. Reuse that handling in `MDNode::concatenate()` in the hopes that it fixes a polly test that seems to rely on the old behaviour [1]. [1]: http://lab.llvm.org:8011/builders/polly-amd64-linux/builds/25167 llvm-svn: 223619	2014-12-07 20:32:11 +00:00
Duncan P. N. Exon Smith	407efd2366	IR: Drop uniquing for self-referencing MDNodes It doesn't make sense to unique self-referencing nodes. Drop uniquing for them. Note that `MDNode::intersect()` occasionally returns self-referencing nodes. Previously these would be returned by `MDNode::get()`. I'm not convinced this was intended behaviour -- to me it seems it should return a node whose only operand is the self-reference -- but I don't know much about alias scopes so I'm preserving it for now. This is part of PR21532. llvm-svn: 223618	2014-12-07 19:52:06 +00:00
Duncan P. N. Exon Smith	7c303fdacc	IR: Add missing tests for function-local metadata Add assembly and bitcode tests that I neglected to add in r223564 (IR: Disallow complicated function-local metadata) and r223574 (IR: Disallow function-local metadata attachments). Found a couple of bugs: - The error message for function-local attachments gave the wrong line number -- it indicated the next token (typically on the next line) instead of the token that started the attachment. Fixed. - Metadata arguments of the form `!{i32 0, i32 %v}` (or with the arguments reversed) fired an assertion in `ValueEnumerator` in LLVM v3.5, so I suppose this never really worked. I suppose this was "fixed" by r223564. (Thanks to dblaikie for pointing out my omission.) Part of PR21532. llvm-svn: 223616	2014-12-07 17:56:16 +00:00
Marek Olsak	156b10f50e	R600/SI: Disable VMEM and SMEM clauses by breaking them with S_NOP This is only a workaround. llvm-svn: 223615	2014-12-07 17:17:43 +00:00
Marek Olsak	06992b08aa	R600/SI: Set 20-bit immediate byte offset for SMRD on VI llvm-svn: 223614	2014-12-07 17:17:38 +00:00
Marek Olsak	15875571f6	R600/SI: Update instruction conversions for VI There are 3 changes: - Convert 32-bit S_LSHL/LSHR/ASHR to their V_*REV variants for VI - Lower RSQ_CLAMP for VI - Don't generate MIN/MAX_LEGACY on VI llvm-svn: 223604	2014-12-07 12:19:03 +00:00
Marek Olsak	73de9fa64e	R600/SI: Add VI instructions llvm-svn: 223603	2014-12-07 12:18:57 +00:00
Marek Olsak	ab8bc1e699	R600/SI: Add SCC Defs/Uses to SOP1 and SOP2 opcodes llvm-svn: 223602	2014-12-07 12:18:45 +00:00
Benjamin Kramer	508d96fe98	Turn some DenseMaps that are only used for set operations into DenseSets. DenseSet has better memory efficiency now. llvm-svn: 223589	2014-12-06 19:22:54 +00:00
Benjamin Kramer	a1a4ea6fcb	Make the DenseMap bucket type configurable and use a smaller bucket for DenseSet. DenseSet used to be implemented as DenseMap<Key, char>, which usually doubled the memory footprint of the map. Now we use a compressed set so the second element uses no memory at all. This required some surgery on DenseMap as all accesses to the bucket now have to go through methods; this should have no impact on the behavior of DenseMap though. The new default bucket type for DenseMap is a slightly extended std::pair as we expose it through DenseMap's iterator and don't want to break any existing users. llvm-svn: 223588	2014-12-06 19:22:44 +00:00
Benjamin Kramer	6f898a8786	Reapply "LLVMContext: Store APInt/APFloat directly into the ConstantInt/FP DenseMaps." This reapplies r223478 with a fix for 32 bit targets. llvm-svn: 223586	2014-12-06 13:12:56 +00:00
David Majnemer	0d1f96063b	ConstantFold: Don't optimize comparisons with weak linkage objects Consider: void f() {} void __attribute__((weak)) g() {} bool b = &f != &g; It's possble for g to resolve to f if --defsym=g=f is passed on to the linker. llvm-svn: 223585	2014-12-06 11:58:33 +00:00
David Majnemer	dbb83f4a33	I didn't intend to commit this change. llvm-svn: 223584	2014-12-06 10:52:32 +00:00
David Majnemer	4388d93ae3	InstSimplify: Optimize away useless unsigned comparisons Code like X < Y && Y == 0 should always be folded away to false. llvm-svn: 223583	2014-12-06 10:51:40 +00:00
NAKAMURA Takumi	842b214603	Reformat. llvm-svn: 223580	2014-12-06 05:57:06 +00:00
Tom Stellard	ef2e57c33a	R600/SI: Restore PrivateGlobalPrefix to the default ELF value of ".L" This was changed in r223323. llvm-svn: 223579	2014-12-06 05:34:34 +00:00
Duncan P. N. Exon Smith	42afe26916	IR: Disallow function-local metadata attachments Metadata attachments to instructions cannot be function-local. This is part of PR21532. llvm-svn: 223574	2014-12-06 02:29:44 +00:00
NAKAMURA Takumi	00ddc9dd6a	LLVMInstrumentation requires MC since r223532. llvm-svn: 223573	2014-12-06 02:22:11 +00:00
Ahmed Bougacha	b46577058a	[X86] Refactor PMOV[SZ]Xrm to add missing AVX2 patterns. Most patterns will go away once the extload legalization changes land. Differential Revision: http://reviews.llvm.org/D6125 llvm-svn: 223567	2014-12-06 01:31:07 +00:00
Hans Wennborg	24cc36324d	SelectionDAG switch lowering: Replace unreachable default with most popular case. This can significantly reduce the size of the switch, allowing for more efficient lowering. I also worked with the idea of exploiting unreachable defaults by omitting the range check for jump tables, but always ended up with a non-neglible binary size increase. It might be worth looking into some more. SimplifyCFG currently does this transformation, but I'm working towards changing that so we can optimize harder based on unreachable defaults. Differential Revision: http://reviews.llvm.org/D6510 llvm-svn: 223566	2014-12-06 01:28:50 +00:00
Duncan P. N. Exon Smith	62ee08db9a	IR: Disallow complicated function-local metadata Disallow complex types of function-local metadata. The only valid function-local metadata is an `MDNode` whose sole argument is a non-metadata function-local value. Part of PR21532. llvm-svn: 223564	2014-12-06 01:26:49 +00:00
Duncan P. N. Exon Smith	98178ba39c	Utils: Style cleanups, NFC llvm-svn: 223556	2014-12-06 00:48:17 +00:00
Duncan P. N. Exon Smith	ad64906ace	Utils: Avoid RAUW on metadata in CloneFunction() llvm-svn: 223555	2014-12-06 00:48:13 +00:00
Nick Lewycky	1e8071caaf	Canonicalize multiplies by looking at whether the operands have any constants themselves. Patch by Tim Murray! llvm-svn: 223554	2014-12-06 00:45:50 +00:00
Tim Northover	2195966715	AArch64: use explicit MVT::i64 when creating EXTRACT_SUBVECTOR nodes. All our patterns use MVT::i64, but the ISelLowering nodes were inconsistent in their choice. No functional change. llvm-svn: 223551	2014-12-06 00:33:37 +00:00
Benjamin Kramer	bc883c90e1	Revert "LLVMContext: Store APInt/APFloat directly into the ConstantInt/FP DenseMaps." Somehow made DenseMap probe on forever on 32 bit machines. This reverts commit r223478. llvm-svn: 223546	2014-12-06 00:02:31 +00:00
Ahmed Bougacha	a614254259	[X86] Cleanup FCOPYSIGN lowering. NFC intended. llvm-svn: 223542	2014-12-05 23:11:36 +00:00
Kuba Brecka	f69571039b	Recommit of r223513 and r223514. Reviewed at http://reviews.llvm.org/D6488 llvm-svn: 223532	2014-12-05 22:19:18 +00:00
Colin LeMahieu	8ca972a974	[Hexagon] Relocating logical instructions and templates later in the td file. llvm-svn: 223523	2014-12-05 21:51:12 +00:00
Colin LeMahieu	07e42127a4	[Hexagon] Adding sub/and/or reg, imm forms llvm-svn: 223522	2014-12-05 21:38:29 +00:00
Rafael Espindola	702e9736ec	Remove dead code. We are only lazy about functions with bodies. llvm-svn: 223521	2014-12-05 21:36:06 +00:00
Kuba Brecka	9027246b4b	Reverting r223513 and r223514. llvm-svn: 223520	2014-12-05 21:32:46 +00:00
Sanjay Patel	88b824a8d3	Optimize merging of scalar loads for 32-byte vectors [X86, AVX] Fix the poor codegen seen in PR21710 ( http://llvm.org/bugs/show_bug.cgi?id=21710 ). Before we crack 32-byte build vectors into smaller chunks (and then subsequently glue them back together), we should look for the easy case where we can just load all elements in a single op. An example of the codegen change is: From: vmovss 16(%rdi), %xmm1 vmovups (%rdi), %xmm0 vinsertps $16, 20(%rdi), %xmm1, %xmm1 vinsertps $32, 24(%rdi), %xmm1, %xmm1 vinsertps $48, 28(%rdi), %xmm1, %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 retq To: vmovups (%rdi), %ymm0 retq Differential Revision: http://reviews.llvm.org/D6536 llvm-svn: 223518	2014-12-05 21:28:14 +00:00
Peter Collingbourne	58b6120d03	[DFSAN][MIPS][LLVM] Defining ShadowPtrMask variable for MIPS64 Patch by Kumar Sukhani! corresponding compiler-rt patch: http://reviews.llvm.org/D6437 clang patch: http://reviews.llvm.org/D6147 Differential Revision: http://reviews.llvm.org/D6459 llvm-svn: 223516	2014-12-05 21:22:32 +00:00
Colin LeMahieu	4f286f5f23	[Hexagon] Updating mux_ir/ri/ii/rr with encoding bits llvm-svn: 223515	2014-12-05 21:09:27 +00:00
Kuba Brecka	fb6741fc3b	AddressSanitizer - Don't instrument globals from cstring_literals sections. (llvm part) Reviewed at http://reviews.llvm.org/D6488 llvm-svn: 223513	2014-12-05 21:04:43 +00:00
Rafael Espindola	6941835aef	Simplify the loop linking function bodies. NFC. llvm-svn: 223512	2014-12-05 21:04:36 +00:00
Jan Wen Voung	b856ac92dc	Use 32-bit ebp for NaCl64 in a limited case: llvm.frameaddress. Summary: Follow up to [x32] "Use ebp/esp as frame and stack pointer": http://reviews.llvm.org/D4617 In that earlier patch, NaCl64 was made to always use rbp. That's needed for most cases because rbp should hold a full 64-bit address within the NaCl sandbox so that load/stores off of rbp don't require sandbox adjustment (zeroing the top 32-bits, then filling those by adding r15). However, llvm.frameaddress returns a pointer and pointers are 32-bit for NaCl64. In this case, use ebp instead, which will make the register copy type check. A similar mechanism may be needed for llvm.eh.return, but is not added in this change. Test Plan: test/CodeGen/X86/frameaddr.ll Reviewers: dschuff, nadav Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D6514 llvm-svn: 223510	2014-12-05 20:55:53 +00:00
Bill Seurer	b4d665d454	[PowerPC]Add VSX loads/stores to fastisel for PPC target This patch adds VSX floating point loads and stores to fastisel. Along with the change to tablegen (D6220), VSX instructions are now fully supported in fastisel. http://reviews.llvm.org/D6274 llvm-svn: 223507	2014-12-05 20:15:56 +00:00
Colin LeMahieu	ea4694aa6c	[Hexagon] Adding tfrih/l instructions. llvm-svn: 223506	2014-12-05 20:07:19 +00:00
Andrea Di Biagio	38a80209d6	[X86] Improved lowering of packed vector shifts to vpsllq/vpsrlq. SSE2/AVX non-constant packed shift instructions only use the lower 64-bit of the shift count. This patch teaches function 'getTargetVShiftNode' how to deal with shifts where the shift count node is of type MVT::i64. Before this patch, function 'getTargetVShiftNode' only knew how to deal with shift count nodes of type MVT::i32. This forced the backend to wrongly truncate the shift count to MVT::i32, and then zero-extend it back to MVT::i64. llvm-svn: 223505	2014-12-05 20:02:22 +00:00
Colin LeMahieu	873ff2f4a3	[Hexagon] Adding add reg, imm form with encoding bits and test. llvm-svn: 223504	2014-12-05 19:51:23 +00:00
Rafael Espindola	21c3ddd148	Remove unused arguments. NFC. llvm-svn: 223503	2014-12-05 19:35:07 +00:00
Eric Christopher	c5e1feb235	These two calls were grabbing the same register info. Unify them. llvm-svn: 223502	2014-12-05 19:23:55 +00:00
Duncan P. N. Exon Smith	c2d918ec4e	BFI: Saturate when combining edges to a successor When a loop gets bundled up, its outgoing edges are quite large, and can just barely overflow 64-bits. If one successor has multiple incoming edges -- and that successor is getting all the incoming mass -- combining just its edges can overflow. Handle that by saturating rather than asserting. This fixes PR21622. llvm-svn: 223500	2014-12-05 19:13:42 +00:00
Colin LeMahieu	5f7eada35a	[Hexagon] Adding DoubleRegs decoder. Moving C2_mux and A2_nop. Adding combine imm-imm form. llvm-svn: 223494	2014-12-05 18:24:06 +00:00
Adrian Prantl	387890f688	Fix a bug when pretty-printing DW_OP_deref. llvm-svn: 223493	2014-12-05 18:19:38 +00:00
Ahmed Bougacha	59bdb1e517	[CodeGenPrepare] Use variables for reused values. NFC. llvm-svn: 223491	2014-12-05 18:04:40 +00:00

1 2 3 4 5 ...

74732 Commits