llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 02:52:53 +02:00

Author	SHA1	Message	Date
Haicheng Wu	354d0a92b6	[InlineCost] Skip volatile loads when looking for repeated loads This is a follow-up fix of r320814. A test case is also added. llvm-svn: 321075	2017-12-19 13:42:58 +00:00
Simon Pilgrim	804c89f41f	[X86][SSE] Add cpu feature for aggressive combining to variable shuffles As mentioned in D38318 and D40865, modern Intel processors prefer to combine multiple shuffles to a variable shuffle mask (PSHUFB/VPERMPS etc.) instead of having multiple stage 'fixed' shuffles which put more pressure on Port 5 (at the expense of extra shuffle mask loads). This patch provides a FeatureFastVariableShuffle target flag for Haswell+ CPUs that prefers combining 2 or more fixed shuffles to a single variable shuffle (default is 3 shuffles). The long term aim is to drive more of this from schedule data (probably via the MC) but we're not close to being ready for that yet. Differential Revision: https://reviews.llvm.org/D41323 llvm-svn: 321074	2017-12-19 13:16:43 +00:00
David Green	d048c49d4c	[ARM] Register the Thumb2SizeReducePass. NFC Also adds a simple test case. llvm-svn: 321072	2017-12-19 12:19:08 +00:00
Pavel Labath	6192ffc4b1	[Support] Add WritableMemoryBuffer class Summary: The motivation here is LLDB, where we need to fixup relocations in mmapped files before their contents can be read correctly. The MemoryBuffer class does exactly what we need, except that it maps the file in read-only mode. WritableMemoryBuffer reuses the existing machinery for opening and mmapping a file. The only difference is in the argument to the mapped_file_region constructor -- we create a private copy-on-write mapping, so that we can make changes to the mapped data, but the changes aren't carried over to the underlying file. This patch is based on an initial version by Zachary Turner. Reviewers: mehdi_amini, rnk, rafael, dblaikie, zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40291 llvm-svn: 321071	2017-12-19 12:15:50 +00:00
Simon Pilgrim	6dd49d11df	[X86][SSE] Use (V)PHMINPOSUW for vXi8 SMAX/SMIN/UMAX/UMIN horizontal reductions (PR32841) Extension to D39729 which performed this for vXi16, with the same bit flipping to handle SMAX/SMIN/UMAX cases, vXi8 UMIN horizontal reductions can be performed. This makes use of the fact that by performing a pair-wise i8 SHUFFLE/UMIN before PHMINPOSUW, we both get the UMIN of each pair but also zero-extend the upper bits ready for v8i16. Differential Revision: https://reviews.llvm.org/D41294 llvm-svn: 321070	2017-12-19 12:02:40 +00:00
Francis Visoiu Mistrih	859334a80d	Fix: [YAML] Always double quote UTF-8 characters llvm-svn: 321069	2017-12-19 11:59:28 +00:00
Francis Visoiu Mistrih	e1d5d5ba4e	[YAML] Always double quote UTF-8 characters llvm-svn: 321068	2017-12-19 11:51:05 +00:00
Simon Dardis	9c259722df	[mips] Handle the emission of microMIPSr6 sll instruction when used as a nop. This instruction is encoded as zero, so we have handle that case when checking for unimplemented opcodes when producing the encoding for an instruction. llvm-svn: 321066	2017-12-19 11:16:22 +00:00
Jonas Devlieghere	c562c1f132	[dwarfdump] Lookup needs to be an unsigned long long parameter. Before this patch, dwarfdump's lookup parameter only accepts unsigned. Given that for many current platforms the load address already exceeds unsigned (e.g. arm64 w/ 0x100000000), dwarfdump needs an unsigned long long parameter. Patch by: Dr. Michael 'Mickey' Lauer <mickey@vanille-media.de> llvm-svn: 321064	2017-12-19 09:45:26 +00:00
Max Kazantsev	73d7a6bc49	[JumpThreading] Restrict PRE across instructions that don't pass control to successors PRE in JumpThreading should not be able to hoist copy of non-speculable loads across instructions that don't always transfer execution to their successors, otherwise they may introduce an unsafe load which otherwise would not be executed. The same problem for GVN was fixed as rL316975. Differential Revision: https://reviews.llvm.org/D40347 llvm-svn: 321063	2017-12-19 09:10:21 +00:00
Igor Laevsky	aba02c9095	[FuzzMutate] Don't crash when mutator is unable to find operation Differential Revision: https://reviews.llvm.org/D41009 llvm-svn: 321062	2017-12-19 08:52:51 +00:00
Bjorn Steinbrink	feb288b9a0	Treat sret arguments as being dereferenceable in getPointerDereferenceableBytes() Reviewers: rnk, hfinkel, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41355 llvm-svn: 321061	2017-12-19 08:46:46 +00:00
Craig Topper	b64e60f4ce	[X86] Don't extend v16i8 non-uniform shifts to v16i32 if we have BWI. Use v16i16 instead. BWI supports shifting by word amounts. Even if VLX isn't support we can still widen to v32i16 and extract the lower half. For SKX its preferrable to not use 512-bit vector if we can. llvm-svn: 321059	2017-12-19 06:59:10 +00:00
Craig Topper	f6641677a8	[X86] Use a specific list of MVTs in combineShiftRightArithmetic instead of iterating over every integer VT and checking their size. Previously, we were checking for MVTs with sizes betwen 8 and 64 which only includes i8, i16, i32, and i64 today. But I don't think we should assume that and should list the types that are legal for x86. I also don't think we need i64 since type legalization is guaranteed to split those up. llvm-svn: 321058	2017-12-19 06:29:00 +00:00
Craig Topper	574e767a20	[X86] Remove unnecessary check for integer VT from combineShiftRightArithmetic. I doubt there's any way to create a ashr for an FP type. llvm-svn: 321057	2017-12-19 06:28:58 +00:00
Craig Topper	14f0f17b4b	[X86] Remove dead code for turning vector shifts by large amounts into a zero vector. Pretty sure these are handled by a target independent DAG combine that turns them into undef these days. llvm-svn: 321056	2017-12-19 05:21:50 +00:00
Craig Topper	20683fd338	[X86] Use ZERO_EXTEND instead of ANY_EXTEND when extending the shift amount for a non-uniform shift. My reading of the SDM says that all bits of the shift amount are used. If the value of the element is larger than the number of bits the result the shift result is zero. So I think we need to zero_extend here to avoid garbage in the upper bits. In reality we lower any_extend as zero_extend so in most cases it would be hard to hit this. llvm-svn: 321055	2017-12-19 04:52:04 +00:00
Serguei Katkov	183e73d6dc	Fix APFloat from string conversion for Inf The method IEEEFloat::convertFromStringSpecials() does not recognize the "+Inf" and "-Inf" strings but these strings are printed for the double Infinities by the IEEEFloat::toString(). This patch adds the "+Inf" and "-Inf" strings to the list of recognized patterns in IEEEFloat::convertFromStringSpecials(). Re-landing after fix. Reviewers: sberg, bogner, majnemer, timshen, rnk, skatkov, gottesmm, bkramer, scanon, anna Reviewed By: anna Subscribers: mkazantsev, FlameTop, llvm-commits, reames, apilipenko Differential Revision: https://reviews.llvm.org/D38030 llvm-svn: 321054	2017-12-19 04:27:39 +00:00
Quentin Colombet	a4783736b2	[TableGen][GlobalISel] Reset the internal map of RuleMatchers just before the emission Between the creation of the last InstructionMatcher and the first emission of the related Rule, we need to clear the internal map of IDs. We used to do that right after the creation of the main InstructionMatcher when building the rule and although that worked, this is fragile because if for some reason some later code decides to create more InstructionMatcher before the final call to emit, then the IDs would be completely messed up. Move that to the beginning of "emit" so that the IDs are guarantee to be consistent. NFC. llvm-svn: 321053	2017-12-19 02:57:23 +00:00
Reid Kleckner	aa37769e1c	Fix Wasm as a follow up to r321035 and the other one This array is tightly coupled with the .def file. Someone should look into fixing that. llvm-svn: 321050	2017-12-19 01:08:53 +00:00
Justin Bogner	837b6ed803	update_mir_test_checks: Accept IR as input as well as MIR We need to handle IR for tests that want to do lowering (or just -stop-after with IR as input). I've run this on one AArch64 test to demonstrate what it looks like. llvm-svn: 321048	2017-12-19 00:49:04 +00:00
Jake Ehrlich	9f580d4e7a	[llvm-objcopy] Add option to add a progbits section from a file This change adds support for adding progbits sections with contents from a file Differential Revision: https://reviews.llvm.org/D41212 llvm-svn: 321047	2017-12-19 00:47:30 +00:00
Matthias Braun	f51f69bd75	TargetLoweringBase: Followup to r321035 I missed some prefixes and the fact that on AArch64 we use "bzero" instead of "__bzero" as on X86 when doing my refactoring in r321035. Improve tests for bzero. llvm-svn: 321046	2017-12-19 00:43:00 +00:00
Matthias Braun	dbb949be3c	TargetLowering: Fix InitLibcallCallingConvs() overriding things set in InitLibcalls() I missed the fact that the later called InitLibcallCallingConvs() overrides some things set in InitLibcalls() when I did the refactoring in r321036. Fix by merging InitLibcallCallingConvs() into InitLibcalls() and doing the initialization earlier. llvm-svn: 321045	2017-12-19 00:20:33 +00:00
Matthias Braun	c94abbea59	TargetLowering: Fix off-by-one error This problem was present for a while, but somehow asan didn't catch it before the refactoring in r321036. llvm-svn: 321043	2017-12-19 00:05:10 +00:00
Sam Clegg	af67e33ea7	[llvm-readobj] Dump wasm init functions llvm-svn: 321042	2017-12-19 00:04:41 +00:00
Matthias Braun	5d839b5ad3	TargetLoweringBase: Remove unnecessary watchos exception; NFC WatchOS isn't report as iOS (as opposed to tvos) so the exception I added in my last commit wasn't necessary after all. llvm-svn: 321041	2017-12-18 23:33:28 +00:00
Justin Bogner	da67fa913e	update_mir_test_checks: Add "mir" to some states and regex names For tests that do lowering we need to support IR as input, so here we clarify some names to avoid ambiguity in upcoming commits. llvm-svn: 321039	2017-12-18 23:31:55 +00:00
Craig Topper	ea43ac79fd	[X86] Don't use NOPL when the assembler is passed an empty CPU string. This recommits the change from r321026. I have a fix for the lld test now. llvm-svn: 321038	2017-12-18 23:31:43 +00:00
Matthias Braun	dcb7646ac9	LiveStacks: Rename LiveStack.{h\|cpp} to LiveStacks.{h\|cpp}; NFC Filenames should match the name of the class they contain. llvm-svn: 321037	2017-12-18 23:19:44 +00:00
Matthias Braun	bc588b0484	X86/AArch64/ARM: Factor out common sincos_stret logic; NFCI Note: - X86ISelLowering: setLibcallName(SINCOS) was superfluous as InitLibcalls() already does it. - ARMISelLowering: Setting libcallnames for sincos/sincosf seemed superfluous as in the darwin case it wouldn't be used while for all other cases InitLibcalls already does it. llvm-svn: 321036	2017-12-18 23:19:42 +00:00
Matthias Braun	0e306ba2be	AArch64/X86: Factor out common bzero logic; NFC llvm-svn: 321035	2017-12-18 23:14:28 +00:00
Krzysztof Parzyszek	dcf36ce3fe	[Hexagon] Cache loads to select to avoid traversing mutating DAG llvm-svn: 321034	2017-12-18 23:13:27 +00:00
Craig Topper	76baa9a7f1	Revert part of r321026 "[X86] Don't use NOPL when the assembler is passed an empty CPU string." while I investigate how to fix an lld test failure. Looks like lld also needs to pass a -mcpu in some of its tests llvm-svn: 321033	2017-12-18 22:20:10 +00:00
Evandro Menezes	7c26890ae9	[AArch64] Expand test coverage of vector element shuffling to Exynos Make sure that all test cases are run for Exynos as well. Otherwise, NFC. llvm-svn: 321032	2017-12-18 22:17:39 +00:00
Quentin Colombet	9a77678e09	[TableGen][GlobalISel] Make the arguments of the Instruction and Operand Matchers consistent Move InsnVarID and OpIdx at the beginning of the list of arguments for all the constructors of the OperandMatcher subclasses. This matches what we do for the InstructionMatcher. NFC. llvm-svn: 321031	2017-12-18 22:12:13 +00:00
Bob Haarman	719ad0b0c5	Fix buffer overrun in WindowsResourceCOFFWriter::writeSymbolTable() Summary: We were using sprintf(..., "$R06X", <some uint32_t>) to create strings that are expected to be exactly length 8, but this results in longer strings if the uint32_t is greater than 0xffffff. This change modifies the behavior as follows: - Uses the loop counter instead of the data offset. This gives us sequential symbol names, avoiding collisions as much as possible. - Masks the value to 0xffffff to avoid generating names longer than 8 bytes. - Uses formatv instead of sprintf. Fixes PR35581. Reviewers: ruiu, zturner Reviewed By: ruiu Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D41270 llvm-svn: 321030	2017-12-18 22:10:14 +00:00
Reid Kleckner	6206654c20	Add test for .req directive starting with 'p' Reduced test case from libjpeg_turbo. llvm-svn: 321029	2017-12-18 22:01:18 +00:00
Jessica Paquette	db3d79d789	[MachineOutliner][NFC] Gardening: use std::any_of instead of bool + loop River Riddle suggested to use std::any_of instead of the bool + loop thing on r320229. This commit does that. llvm-svn: 321028	2017-12-18 21:44:52 +00:00
Craig Topper	a5f511668f	[X86] Don't use NOPL when the assembler is passed an empty CPU string. Update tests to force a CPU with NOPL Empty string should be equivalent to "generic" which doesn't allow NOPL. Force tests to use specificy 'pentiumpro' to guarantee NOPL. Fixes PR35686 llvm-svn: 321026	2017-12-18 21:37:27 +00:00
Quentin Colombet	679adbc865	[TableGen][GlobalISel] Refactor optimizeRules related bit to allow code reuse In theory, reapplying optimizeRules on each group matchers should give us a second nesting level on the matching table. In practice, we need more work to make that happen because all the predicates are actually not directly available through the predicate matchers list. NFC. llvm-svn: 321025	2017-12-18 21:25:53 +00:00
Reid Kleckner	3e1db81509	Revert "[AArch64][SVE] Asm" changes, they broke libjpeg_turbo This reverts changes r320992, r320986, r320973, and r320970. r320970 by itself breaks the test case, and the rest depend on it. Test case will land soon. llvm-svn: 321024	2017-12-18 20:58:25 +00:00
Ivan A. Kosarev	0c51903724	[Analysis] Generate more precise TBAA tags when one access encloses the other There are cases when two tags with different base types denote accesses to the same direct or indirect member of a structure type. Currently, merging of such tags results in a tag that represents an access to an object that has the type of that member. This patch changes this so that if one of the accesses encloses the other, then the generic tag is the one of the enclosed access. Differential Revision: https://reviews.llvm.org/D39557 llvm-svn: 321019	2017-12-18 20:05:20 +00:00
Teresa Johnson	b98b101080	[PGO] Fix handling of cold entry count for instrumented PGO Summary: In r277849, getEntryCount was changed to return None when the entry count was 0, specifically for SamplePGO where it means no samples were recorded. However, for instrumentation PGO a 0 entry count should be returned directly, since it does mean that the function was completely cold. Otherwise we end up treating these functions conservatively in isFunctionEntryCold() and isColdBB(). Instead, for SamplePGO use -1 when there are no samples, and change getEntryCount to return None when the value is -1. Reviewers: danielcdh, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41307 llvm-svn: 321018	2017-12-18 20:02:43 +00:00
Quentin Colombet	f669347b0d	[TableGen][GlobalISel] Optimize MatchTable for faster instruction selection * Context * Prior to this patchw, the table generated for matching instruction was straight forward but highly inefficient. Basically, each pattern generates its own set of self contained checks and actions. E.g., TableGen generated: // First pattern CheckNumOperand 3 CheckOpcode G_ADD ... Build ADDrr // Second pattern CheckNumOperand 3 CheckOpcode G_ADD ... Build ADDri // Third pattern CheckNumOperand 3 CheckOpcode G_SUB ... Build SUBrr * Problem * Because of that generation, a lot of check were redundant between each pattern and were checked every single time until we reach the pattern that matches. E.g., Taking the previous table, let say we are matching a G_SUB, that means we were going to check all the rules for G_ADD before looking at the G_SUB rule. In particular we are going to do: check 3 operands; PASS check G_ADD; FAIL ; Next rule check 3 operands; PASS (but we already knew that!) check G_ADD; FAIL (well it is still not true) ; Next rule check 3 operands; PASS (really!!) check G_SUB; PASS (at last :P) * Proposed Solution * This patch introduces a concept of group of rules (GroupMatcher) that share some predicates and only get checked once for the whole group. This patch only creates groups with one nesting level. Conceptually there is nothing preventing us for having deeper nest level. However, the current implementation is not smart enough to share the recording (aka capturing) of values. That limits its ability to do more sharing. For the given example the current patch will generate: // First group CheckOpcode G_ADD // First pattern CheckNumOperand 3 ... Build ADDrr // Second pattern CheckNumOperand 3 ... Build ADDri // Second group CheckOpcode G_SUB // Third pattern CheckNumOperand 3 ... Build SUBrr But if we allowed several nesting level, it could create a sub group for the checknumoperand 3. (We would need to call optimizeRules on the rules within a group.) * Result * With only one level of nesting, the instruction selection pass is up to 4x faster. For instance, one instruction now takes 500 checks, instead of 24k! With more nesting we could get in the tens I believe. Differential Revision: https://reviews.llvm.org/D39034 rdar://problem/34670699 llvm-svn: 321017	2017-12-18 19:47:41 +00:00
Dimitry Andric	1d6fcfc9f4	Fix more inconsistent line endings. NFC. llvm-svn: 321016	2017-12-18 19:46:56 +00:00
Craig Topper	7c5027d178	[X86] Minor formatting fix to getHostCPUFeatures. NFC llvm-svn: 321015	2017-12-18 19:40:11 +00:00
Jessica Paquette	ffb43a84c6	[MachineOutliner] Recommit r320229 LR was undefined entering outlined functions that contain calls. This made the machine verifier unhappy when expensive checks were enabled. This fixes that. llvm-svn: 321014	2017-12-18 19:33:21 +00:00
Benjamin Kramer	b502174415	[PPC] Also disable the pre-emit version of reg+reg to reg+imm transformation. This has the same issue as the early pass disabled in r321010. llvm-svn: 321013	2017-12-18 19:21:56 +00:00
Don Hinton	d4b6dda211	[cmake] Update experimental target error message Summary: Update this error message indicate this test only ensures experimental targets were passed via LLVM_EXPERIMENTAL_TARGETS_TO_BUILD. Originally, this test validated all targets, but in r184923, it was moved after the LLVMBUILDTOOL test, which also validates all targets, making that part of the test redundant. Differential Revision: https://reviews.llvm.org/D41273 llvm-svn: 321012	2017-12-18 19:15:15 +00:00

1 2 3 4 5 ...

158288 Commits