llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Alexander Richardson	178aaac5c1	[MIPS] Add static_assert that all Fixups are handled in getFixupKind Summary: I recently added a new Fixup kind to our fork of LLVM but forgot to add it to the table in MipsAsmBackend.cpp. With this static_assert the error would have been caught instead of zero-initializing the array entries for the new fixups. Reviewers: sdardis, atanasyan Reviewed By: atanasyan Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44895 llvm-svn: 328616	2018-03-27 10:08:12 +00:00
Max Kazantsev	73bfd6d30a	[LoopUnroll][NFC] Remove redundant canPeel check We check `canPeel` twice: when evaluating the number of iterations to be peeled and within the method `peelLoop` that performs peeling. This method is only executed if the calculated peel count is positive. Thus, the check in `peelLoop` can never fail. This patch replaces this check with an assert. Differential Revision: https://reviews.llvm.org/D44919 Reviewed By: fhahn llvm-svn: 328615	2018-03-27 09:40:51 +00:00
Sam Parker	c40ea02cab	[IRCE] Enable decreasing loops of non-const bound As a follow-up to r328480, this updates the logic for the decreasing safety checks in a similar manner: - CanBeMax is replaced by CannotBeMaxInLoop which queries isLoopEntryGuardedByCond on the maximum value. - SumCanReachMin is replaced by isSafeDecreasingBound which includes some logic from parseLoopStructure and, again, has been updated to use isLoopEntryGuardedByCond on the given bounds. Differential Revision: https://reviews.llvm.org/D44776 llvm-svn: 328613	2018-03-27 08:24:53 +00:00
Max Kazantsev	ba61c0effc	[NFC] Fix comments in getExact() llvm-svn: 328612	2018-03-27 08:13:55 +00:00
Max Kazantsev	a5495e778b	[SCEV] Make exact taken count calculation more optimistic Currently, `getExact` fails if it sees two exit counts in different blocks. There is no solid reason to do so, given that we only calculate exact non-taken count for exiting blocks that dominate latch. Using this fact, we can simply take min out of all exits of all blocks to get the exact taken count. This patch makes the calculation more optimistic with enforcing our assumption with asserts. It allows us to calculate exact backedge taken count in trivial loops like for (int i = 0; i < 100; i++) { if (i > 50) break; . . . } Differential Revision: https://reviews.llvm.org/D44676 Reviewed By: fhahn llvm-svn: 328611	2018-03-27 07:30:38 +00:00
Max Kazantsev	30f89ae145	[SCEV] Add one more case in computeConstantDifference This patch teaches `computeConstantDifference` handle calculation of constant difference between `(X + C1)` and `(X + C2)` which is `(C2 - C1)`. Differential Revision: https://reviews.llvm.org/D43759 Reviewed By: anna llvm-svn: 328609	2018-03-27 04:54:00 +00:00
Craig Topper	069a428012	[MachineScheduler] Add itinerary to schedcover.py. Make default work in the command line filter Summary: This patch adds itinerary support to the schedcover.py script. I've been trying to use this script to figure out why SSE and AVX instructions are ending up in separate tablegen scheduler classes and sometimes its because we are using different itineraries. Rather than using None to indicate the default scheduler model, I now use the string "default". I had to hack around the sorting a little to keep "default" at the beginning. But this also makes it so you can specify "default" on the command line to just get the defaults I also fixed the regular expression code so that the no_default wasn't evaluated twice. Reviewers: RKSimon, atrick, jmolloy, javed.absar Reviewed By: javed.absar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44834 llvm-svn: 328608	2018-03-27 04:26:39 +00:00
Mircea Trofin	b836fe81fc	Revert "Revert "[lit] Generalized /dev/null support on Windows."" Summary: This reverts commit r328596. Checking if the arguments are strings before testing if they contain "/dev/null". Reviewers: rnk Reviewed By: rnk Subscribers: delcypher, llvm-commits Differential Revision: https://reviews.llvm.org/D44914 llvm-svn: 328603	2018-03-27 01:39:17 +00:00
Sanjay Patel	a38015abef	[x86] add RUN for target before roundss; NFC llvm-svn: 328601	2018-03-27 00:32:19 +00:00
Jan Korous	a54a2fbeca	[lit] Temporarily disable shtest-timeout.py on darwin Disabled until fixed in order to avoid random failures on green dragon. rdar://problem/38774530 llvm-svn: 328598	2018-03-27 00:16:28 +00:00
Mircea Trofin	8c42530a18	Revert "[lit] Generalized /dev/null support on Windows." This reverts commit ca7fdbb974384ce5a05528b22a41d46b1cc13e92. llvm-svn: 328596	2018-03-26 23:59:39 +00:00
David Blaikie	4cf99ef51b	Add a build dependency from libMC to libDebugInfoCodeView to match the reality of header dependencies here llvm-svn: 328595	2018-03-26 23:48:52 +00:00
David Blaikie	500ebbec4b	Move CVDebugRecord from CodeView to Object to fix layering llvm-svn: 328593	2018-03-26 23:37:02 +00:00
Sanjay Patel	f98a2054f3	[x86] add tests for ftrunc; NFC llvm-svn: 328592	2018-03-26 23:18:32 +00:00
Aaron Smith	50d10e9805	[DebugInfoPDB] Print the method name along with the variant value Before this change, using dumpProperties() with PDBSymbolData would look like this: get_locationType: 3 1 After this change: get_locationType: 3 get_value: 1 llvm-svn: 328590	2018-03-26 22:53:38 +00:00
Mircea Trofin	d4239dd1a7	[lit] Generalized /dev/null support on Windows. Generalized /dev/null remapping on Windows, and added test. Reviewers: rnk Reviewed By: rnk Subscribers: amccarth, zturner, delcypher, llvm-commits Differential Revision: https://reviews.llvm.org/D44771 llvm-svn: 328589	2018-03-26 22:41:06 +00:00
Aaron Smith	64922c78cd	[DebugInfoPDB] Add methods to get the compiland and line numbers with PDBSymbolData llvm-svn: 328587	2018-03-26 22:17:12 +00:00
Aaron Smith	71285ba943	[DebugInfoPDB] Add DIA implementation of findLineNumbersByRVA This method is used to find line numbers for PDBSymbolData that have an invalid virtual address. llvm-svn: 328586	2018-03-26 22:13:22 +00:00
Aaron Smith	d23690b86e	[DebugInfoPDB] Add DIA implementation of addressForVA and addressForRVA These are used in finding line numbers for PDBSymbolData llvm-svn: 328585	2018-03-26 22:10:02 +00:00
Simon Pilgrim	d1d125665e	Fix newlines. NFCI. llvm-svn: 328583	2018-03-26 21:07:59 +00:00
Simon Pilgrim	2c2260d5e2	[X86] Add WriteCRC32 scheduler class Currently CRC32 instructions use the WriteFAdd class, this patch splits them off into their own, at the moment it is still mostly just a duplicate of WriteFAdd but it can now be tweaked on a target by target basis. Differential Revision: https://reviews.llvm.org/D44647 llvm-svn: 328582	2018-03-26 21:06:14 +00:00
Rafael Espindola	4f64ac1d9f	Use local symbols for creating .stack-size. llvm-svn: 328581	2018-03-26 20:40:22 +00:00
Reid Kleckner	6090866f7b	Fix go bindings test when using goma distributed build tool Goma[1] is a distributed build system similar to distcc and icecc primarily used to compile Chromium. The client is open source, and hopefully soon the server will be as well. The intended usage model is similar to most distributed build systems: prefix gomacc onto your compiler command line, and it transparently distributes compilation. The go lit config wants to determine the host compiler binary, so it needs some extra logic to avoid looking at these prefixes. [1] https://chromium.googlesource.com/infra/goma/client/ llvm-svn: 328580	2018-03-26 20:19:14 +00:00
Paul Robinson	2a7fd4563a	Use correct format specifier. Review comment on r328235 by James Henderson. llvm-svn: 328578	2018-03-26 19:55:01 +00:00
Eli Friedman	052f7792d3	[MemorySSA] Fix exponential compile-time updating MemorySSA. MemorySSAUpdater::getPreviousDefRecursive is a recursive algorithm, for each block, it computes the previous definition for each predecessor, then takes those definitions and combines them. But currently it doesn't remember results which it already computed; this means it can visit the same block multiple times, which adds up to exponential time overall. To fix this, this patch adds a cache. If we computed the result for a block already, we don't need to visit it again because we'll come up with the same result. Well, unless we RAUW a MemoryPHI; in that case, the TrackingVH will be updated automatically. This matches the original source paper for this algorithm. The testcase isn't really a test for the bug, but it adds coverage for the case where tryRemoveTrivialPhi erases an existing PHI node. (It's hard to write a good regression test for a performance issue.) Differential Revision: https://reviews.llvm.org/D44715 llvm-svn: 328577	2018-03-26 19:52:54 +00:00
Krzysztof Parzyszek	ab8e7bb429	[Hexagon] Assertion failure in HexagonSubtarget.cpp In restoreLatency, replace range-for loop with std::find. Patch by Jyotsna Verma. llvm-svn: 328574	2018-03-26 19:04:58 +00:00
Simon Pilgrim	4a798e74b8	[X86][Btver2] Add (U)COMISD/(U)COMISD scheduler costs Account for the "+i" integer pipe transfer cost (1cy use of JALU0 for GPR PRF write) llvm-svn: 328573	2018-03-26 19:01:06 +00:00
Haicheng Wu	6adad202da	[SLP] Add more checks to a test case. NFC. llvm-svn: 328572	2018-03-26 18:59:28 +00:00
Reid Kleckner	bc8c141758	[X86] Fix Windows `i1 zeroext` conventions to use i8 instead of i32 Summary: Re-lands r328386 and r328443, reverting r328482. Incorporates fixes from @mstorsjo in D44876 (thanks!) so that small parameters in i8 and i16 do not end up in the SysV register parameters (EDI, ESI, etc). I added tests for how we receive small parameters, since that is the important part. It's always safe to store more bytes than will be read, but the assumptions you make when loading them are what really matter. I also tested this by self-hosting clang and it passed tests on win64. Reviewers: mstorsjo, hans Subscribers: hiraditya, mstorsjo, llvm-commits Differential Revision: https://reviews.llvm.org/D44900 llvm-svn: 328570	2018-03-26 18:49:48 +00:00
Simon Pilgrim	39a33ace84	[X86] Add WriteBitScan/WriteLZCNT/WriteTZCNT/WritePOPCNT scheduler classes (PR36881) Give the bit count instructions their own scheduler classes instead of forcing them into existing classes. These were mostly overridden anyway, but I had to add in costs from Agner for silvermont and znver1 and the Fam16h SoG for btver2 (Jaguar). Differential Revision: https://reviews.llvm.org/D44879 llvm-svn: 328566	2018-03-26 18:19:28 +00:00
David Blaikie	0fc4dc612a	Remove unused file, ExecutionEngine/MCJIT/ObjectBuffer.h This header also wasn't self contained/modular - but with no users, it didn't seem worth fixing because it'd break so easily again. llvm-svn: 328565	2018-03-26 18:10:31 +00:00
Mandeep Singh Grang	0f54918b14	[XCore] Change std::sort to llvm::sort in response to r327219 Summary: r327219 added wrappers to std::sort which randomly shuffle the container before sorting. This will help in uncovering non-determinism caused due to undefined sorting order of objects having the same key. To make use of that infrastructure we need to invoke llvm::sort instead of std::sort. Note: This patch is one of a series of patches to replace all std::sort to llvm::sort. Refer the comments section in D44363 for a list of all the required patches. Reviewers: dblaikie, RKSimon, robertlytton Reviewed By: robertlytton Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44875 llvm-svn: 328564	2018-03-26 18:08:26 +00:00
Reid Kleckner	5691bcbbc1	[lit] Implement 'cat' command for internal shell Fixes PR36449 Patch by Chamal de Silva Differential Revision: https://reviews.llvm.org/D43501 llvm-svn: 328563	2018-03-26 18:05:12 +00:00
Zachary Turner	54a621de4b	Delete pdbutil diff mode. This has been made obsolete by the fact that almost all of the things it previously checked for are no longer relevant since we can just compare bytes in a lot of places. llvm-svn: 328562	2018-03-26 18:01:07 +00:00
Krzysztof Parzyszek	314e3010b6	[Hexagon] Add more lit tests llvm-svn: 328561	2018-03-26 17:53:48 +00:00
Sanjay Patel	b56e975b24	[InstCombine] improve code comment; NFC llvm-svn: 328560	2018-03-26 17:52:02 +00:00
Lei Huang	2e25caf3b0	[Power9]Legalize and emit code for quad-precision convert from double-precision Legalize and emit code for quad-precision floating point operation xscvdpqp and add option to guard the quad precision operation support. Differential Revision: https://reviews.llvm.org/D44746 llvm-svn: 328558	2018-03-26 17:46:25 +00:00
Stefan Pintilie	c67d695c7f	[PowerPC] Infrastructure work. Implement getting the opcode for a spill in one place. A new function getOpcodeForSpill should now be the only place to get the opcode for a given spilled register. Differential Revision: https://reviews.llvm.org/D43086 llvm-svn: 328556	2018-03-26 17:39:18 +00:00
Zaara Syeda	2463ef7edb	Disable [MachineLICM] Add functions to MachineLICM to hoist invariant stores Disable https://reviews.llvm.org/D40196 with setting option hoist-const-stores to false since failing s390 buildbot. llvm-svn: 328555	2018-03-26 17:22:33 +00:00
Krzysztof Parzyszek	0cdda04d81	[Pipeliner] Several node-ordering fixes First, we change the heuristic that is used to ignore the recurrent node-sets in the node ordering. In certain cases it's not important to focus on the recurrent node-sets. Instead, the algorithm begins by considering all the instructions in the node ordering step. Second, a minor change to the bottom up traversal, which needs to consider loop carried dependences (modeled as anti dependences). Previously, these instructions were skipped, which caused problems because the instruction ends up having both predecessors and sucessors in the schedule. Third, consider anti-dependences as a tie breaker when choosing between instructions in the node ordering. We want to make sure that the source of the anti-dependence does not end up with both predecesssors and sucessors in the final node ordering. Patch by Brendon Cahoon. llvm-svn: 328554	2018-03-26 17:07:41 +00:00
Tim Corringham	6c9d475041	[AMDGPU] Improve disassembler error handling Summary: llvm-objdump now disassembles unrecognised opcodes as data, using the .long directive. We treat unrecognised opcodes as being 32 bit values, so move along 4 bytes rather than the single byte which previously resulted in a cascade of bogus disassembly following an unrecognised opcode. While no solution can always disassemble code that contains embedded data correctly this provides a significant improvement. The disassembler will now cope with an arbitrary length section as it no longer truncates it to a multiple of 4 bytes, and will use the .byte directive for trailing bytes. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D44685 llvm-svn: 328553	2018-03-26 17:06:33 +00:00
Simon Pilgrim	f8bc7da8ea	[X86][Btver2] Add CVTSI2SD/CVTSI2SS scheduler costs We still need to account for how Jaguar passes data from GPR -> XMM, which isn't as clean as XMM -> GPR..... llvm-svn: 328551	2018-03-26 17:02:02 +00:00
Krzysztof Parzyszek	40781d3009	[Pipeliner] Check for affine expression in isLoopCarriedOrder The pipeliner must add a loop carried dependence between two memory operations if the base register is not an affine (linear) exression. The current implementation doesn't check how the base register is defined, which allows non-affine expressions, and then the pipeliner does not add a loop carried dependence when one is needed. This patch adds code to isLoopCarriedOrder that checks if the base register of the memory operations is defined by a phi, and the loop definition for the phi is a constant increment value. This is a very simple check for a linear expression. Patch by Brendon Cahoon. llvm-svn: 328550	2018-03-26 16:58:40 +00:00
David Blaikie	0bf5df5bda	Remove an unneeded (& mislayered) include from Target/TargetLoweringObjectFile on a CodeGen header llvm-svn: 328549	2018-03-26 16:57:31 +00:00
David Blaikie	cccc6ed65b	Remove unneeded (& mislayered) include from TargetMachine.cpp on a CodeGen header llvm-svn: 328548	2018-03-26 16:52:10 +00:00
Krzysztof Parzyszek	d764226f07	[Pipeliner] Add missing loop carried dependences The pipeliner is not adding a dependence edge for a loop carried dependence, and ends up scheduling a load from iteration n prior to an aliased store in iteration n-1. The code that adds the loop carried dependences in the pipeliner doesn't check if the memory objects for loads and stores are "identified" (i.e., distinct) objects. If they are not, then the code that adds the dependences needs to be conservative. The objects can be used to check dependences only when they are distinct objects. The code that checks for loop carried dependences has been updated to classify loads and stores that are not identified as "unknown" values. A store with an "unknown" value can potentially create a loop carried dependence with any pending load. Patch by Brendon Cahoon. llvm-svn: 328547	2018-03-26 16:50:11 +00:00
Haicheng Wu	787124ec5a	[SLP] Add a test case. NFC. llvm-svn: 328546	2018-03-26 16:47:37 +00:00
Krzysztof Parzyszek	8ff15dca37	[Pipeliner] Fix renaming in pipeliner when eliminating phis The phi renaming code in the pipeliner uses the wrong value when rewriting phi uses, which results in an undefined value. In this case, the original phi is no longer needed due to the order of instruction in the pipelined loop. The pipeliner was assuming, in this case, the the phi loop definition should be used to rewrite the uses. However, the pipeliner needs to check to make sure that the loop definition has already been scheduled. If not, then the phi initial value needs to be used instead. Patch by Brendon Cahoon. llvm-svn: 328545	2018-03-26 16:41:36 +00:00
Krzysztof Parzyszek	f3a79c7d73	[Pipeliner] Fix number of phis to generate in the epilog The pipeliner was generating too many phis in the epilog blocks, which caused incorrect code generation when rewriting an instruction that uses the phi. In this case, there 3 prolog and epilog stages. An existing phi was scheduled at stage 1. When generating the code for the 2nd epilog an extra new phi was generated. To fix this, we need to update the code that calculates the maximum number of phis that can be generated, which is based upon the current prolog stage and the stage of the original phi. In this case, when the prolog stage is 1 and the original phi stage is 1, the maximum number of phis to generate is 2. Patch by Brendon Cahoon. llvm-svn: 328543	2018-03-26 16:37:55 +00:00
Krzysztof Parzyszek	c575ffd7a5	[Pipeliner] Use latency to compute RecMII The patch contains severals changes needed to pipeline an example that was transformed so that a Phi with a subreg is converted to copies. The pipeliner wasn't working for a couple of reasons. - The RecMII was 3 instead of 2 due to the extra copies. - Copy instructions contained a latency of 1. - The node order algorithm was not choosing the best "bottom" node, which caused an instruction to be scheduled that had a predecessor and successor already scheduled. - Updated the Hexagon Machine Scheduler to check if the node is latency bound when adding the cost for a 0-latency dependence. The RecMII was 3 because the computation looks at the number of nodes in the recurrence. The extra copy is an extra node but it shouldn't increase the latency. The new RecMII computation looks at the latency of the instructions in the recurrence. We changed the latency of the dependence of a copy to 0. The latency computation for the copy also checks the use of the copy (similar to a reg_sequence). The node order algorithm was not choosing the last instruction in the recurrence for a bottom up traversal. This was when the last instruction is a copy. A check was added when choosing the instruction to check for NodeNum if the maxASAP is the same. This means that the scheduler will not end up with another node in the recurrence that has both a predecessor and successor already scheduled. The cost computation in Hexagon Machine Scheduler adds cost when an instruction can be packetized with a zero-latency instruction. We should only do this if the schedule is latency bound. Patch by Brendon Cahoon. llvm-svn: 328542	2018-03-26 16:33:16 +00:00

1 2 3 4 5 ...

162035 Commits