llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 21:42:54 +02:00

Author	SHA1	Message	Date
Tom Stellard	11a07f331e	R600/SI: Move SIInsertWaits into AMDGPUPassConfig::addPreSched2() This pass needs to be run after PrologEpilogInserter, because that pass may insert spill code which reads or writes memory. llvm-svn: 223253	2014-12-03 18:27:08 +00:00
Tom Stellard	41dd9bba8c	R600/SI: Don't run SI passes on R600 subtargets llvm-svn: 223252	2014-12-03 18:27:05 +00:00
Tim Northover	24557369d4	AArch64: fix wrong-endian parameter passing. The blocked arguments code didn't take account of the hacks needed to support it. llvm-svn: 223247	2014-12-03 17:49:26 +00:00
Colin LeMahieu	ffe6923b15	[NFC] Fixing pedantic warning extra semicolons. llvm-svn: 223246	2014-12-03 17:36:39 +00:00
Colin LeMahieu	6020d0e681	[Hexagon] [NFC] Moving function implementations out of header. Clang-formatting files. llvm-svn: 223245	2014-12-03 17:35:39 +00:00
Nick Lewycky	e77a0e6d56	Fix test to use the right metadata node (reapply r223239 plus a fix) and also to use the correct path to the GCNO file. llvm-svn: 223244	2014-12-03 17:32:44 +00:00
Colin LeMahieu	93c8792d13	[Hexagon] [NFC] Renaming packetStart to packetBegin llvm-svn: 223243	2014-12-03 17:31:43 +00:00
Alexander Potapenko	44a90df4d9	Revert r223239, which broke some bots. llvm-svn: 223240	2014-12-03 16:03:08 +00:00
Alexander Potapenko	80e9b87af0	Fix the metadata number used by llvm.gcov to match the number of the inserted metadata node. llvm-svn: 223239	2014-12-03 15:15:58 +00:00
Aaron Ballman	6dbe89c5fb	Silencing several "multiple copy constructors" warnings from MSVC; NFC. llvm-svn: 223238	2014-12-03 14:44:16 +00:00
Aaron Ballman	73be12a12b	Silencing a 32-bit implicit conversion warning in MSVC; NFC. llvm-svn: 223237	2014-12-03 14:39:58 +00:00
Evgeniy Stepanov	a4177619df	[msan] Add compile-time checks for missing origins. This change makes MemorySanitizer instrumentation a bit more strict about instructions that have no origin id assigned to them. This would have caught the bug that was fixed in r222918. This is re-commit of r222997, reverted in r223211, with 3 more missing origins added. llvm-svn: 223236	2014-12-03 14:15:53 +00:00
Erik Eckstein	b61d06dbaf	InstCombine: simplify signed range checks Try to convert two compares of a signed range check into a single unsigned compare. Examples: (icmp sge x, 0) & (icmp slt x, n) --> icmp ult x, n; (icmp slt x, 0) | (icmp sgt x, n) --> icmp ugt x, n llvm-svn: 223224	2014-12-03 10:39:15 +00:00
Hal Finkel	e95528845d	[PowerPC] Print all inline-asm consts as signed numbers Almost all immediates in PowerPC assembly (both 32-bit and 64-bit) are signed numbers, and it is important that we print them as such. To make sure that happens, we change PPCTargetLowering::LowerAsmOperandForConstraint so that it does all intermediate checks on a sign-extended int64_t value, and then creates the resulting target constant using MVT::i64. This will ensure that all negative values are printed as negative values (mirroring what is done in other backends to achieve the same sign-extension effect). This came up in the context of inline assembly like this: "add%I2 %0,%0,%2", ..., "Ir"(-1ll) where we used to print: addi 3,3,4294967295 and gcc would print: addi 3,3,-1 and gas accepts both forms, but our builtin assembler (correctly) does not. Now we print -1 like gcc does. While here, I replaced a bunch of custom integer checks with isInt<16> and friends from MathExtras.h. Thanks to Paul Hargrove for the bug report. llvm-svn: 223220	2014-12-03 09:37:50 +00:00
Charlie Turner	ab73ef8264	Emit ABI_FP_rounding attribute. LLVM understands a -enable-sign-dependent-rounding-fp-math codegen option. When the user has specified this option, the Tag_ABI_FP_rounding attribute should be emitted with value 1. This option currently does not appear to disable transformations and optimizations that assume default floating point rounding behavior, AFAICT, but the intention should be recorded in the build attributes, regardless of what the compiler actually does with the intention. Change-Id: If838578df3dc652b6f2796b8d152545674bcb30e llvm-svn: 223218	2014-12-03 08:12:26 +00:00
Charlie Turner	7f4e539ba5	Add tests for default value of Tag_ABI_FP_rounding. Change-Id: I051866d073fc6ce87ce3e693a3762da6d81f4393 llvm-svn: 223217	2014-12-03 07:59:50 +00:00
Benjamin Poulain	d00638fd57	Fix a typo in the documentation of LTO Fix defininitions->definitions. Reviewed by David Blaikie. llvm-svn: 223216	2014-12-03 07:32:36 +00:00
Rafael Espindola	02dc2705ae	Ask the module for its identified types. When lazy reading a module, the types used in a function will not be visible to a TypeFinder until the body is read. This patch fixes that by asking the module for its identified struct types. If a materializer is present, the module asks it. If not, it uses a TypeFinder. This fixes pr21374. I will be the first to say that this is ugly, but it was the best I could find. Some of the options I looked at: * Asking the LLVMContext. This could be made to work for gold, but not currently for ld64. ld64 will load multiple modules into a single context before merging them. This causes us to see types from future merges. Unfortunately, MappedTypes is not just a cache when it comes to opaque types. Once the mapping has been made, we have to remember it for as long as the key may be used. This would mean moving MappedTypes to the Linker class and having to drop the Linker::LinkModules static methods, which are visible from C. * Adding an option to ignore function bodies in the TypeFinder. This would fix the PR by picking the worst result. It would work, but unfortunately we are currently quite dependent on the upfront type merging. I will try to reduce our dependency, but it is not clear that we will be able to get rid of it for now. The only clean solution I could think of is making the Module own the types. This would have other advantages, but it is a much bigger change. I will propose it, but it is nice to have this fixed while that is discussed. With the gold plugin, this patch takes the number of types in the LTO clang binary from 52817 to 49669. llvm-svn: 223215	2014-12-03 07:18:23 +00:00
Duncan P. N. Exon Smith	f880254c40	ADT: Rename argument in emplace_back_impl Rename a functor argument in r223201 from `emplace` to `construct` to reduce confusion. llvm-svn: 223212	2014-12-03 05:53:24 +00:00
Nick Lewycky	e945d25fb0	Revert r222997. The newly added compile-time checks are finding missing origins, testcase is being reduced and a PR will be posted shortly. llvm-svn: 223211	2014-12-03 05:47:00 +00:00
Duncan P. N. Exon Smith	cc86eec0f7	LoopVectorize: Remove unnecessary RAUW Remove an unnecessary `MDNode::replaceAllUsesWith()`. In the preceding line, `TheLoop->setLoopID()` visits all backedges and sets the new loop ID. This sufficiently updates the loop metadata. Metadata RAUW is going away as part of PR21532. llvm-svn: 223210	2014-12-03 05:41:20 +00:00
Matt Arsenault	da91ccdb14	R600/SI: Fix SIFixSGPRCopies for copies to physical registers This shows up when operands required to be passed in VCC are copied to. llvm-svn: 223208	2014-12-03 05:22:39 +00:00
Matt Arsenault	c9e7c7e638	R600/SI: Remove incorrect assertion This can be a COPY to a physical register, such as VCC llvm-svn: 223207	2014-12-03 05:22:38 +00:00
Matt Arsenault	0533e2261e	R600/SI: Remove i1 pseudo VALU ops Select i1 logical ops directly to 64-bit SALU instructions. Vector i1 values are always really in SGPRs, with each bit for each item in the wave. This saves about 4 instructions when and/or/xoring any condition, and also helps write conditions that need to be passed in vcc. This should work correctly now that the SGPR live range fixing pass works. More work is needed to eliminate the VReg_1 pseudo regclass and possibly the entire SILowerI1Copies pass. llvm-svn: 223206	2014-12-03 05:22:35 +00:00
Matt Arsenault	aed413b578	R600/SI: Fix suspicious indexing The loop is over the operands of an instruction, and checks the register with the sub reg index of the dest register. This was probably meant to check the sub reg index of the same operand. llvm-svn: 223205	2014-12-03 05:22:32 +00:00
Matt Arsenault	43aa2fe161	R600/SI: Fix running SILowerI1Copies a second time llvm-svn: 223204	2014-12-03 05:22:30 +00:00
Matt Arsenault	10b6254c18	R600/SI: Fix live range error hidden by SIFoldOperands m0 is treated as a virtual register class with a single register rather than the physical register it really is. This was updating the live range of the used virtual copy of m0 from the first ds_read instruction, and leaving the unused copy unchanged. This resulted in a "Live segment doesn't end at a valid instruction" verifier error because of the erased instructions. Update the live range of the second copy (which should be dead). No test since I'm not sure how to trigger this with SIFoldOperands enabled. llvm-svn: 223203	2014-12-03 05:22:29 +00:00
Duncan P. N. Exon Smith	905e58067d	ADT: Add SmallVector<>::emplace_back(): fixup Add missing `void` return type from `!LLVM_HAS_VARIADIC_TEMPLATES` case in r223201. llvm-svn: 223202	2014-12-03 04:49:16 +00:00
Duncan P. N. Exon Smith	27a4458649	ADT: Add SmallVector<>::emplace_back() llvm-svn: 223201	2014-12-03 04:45:09 +00:00
Tom Stellard	213a8062a7	StructurizeCFG: Use LoopInfo analysis for better loop detection We were assuming that each back-edge in a region represented a unique loop, which is not always the case. We need to use LoopInfo to correctly determine which back-edges are loops. llvm-svn: 223199	2014-12-03 04:28:32 +00:00
Duncan P. N. Exon Smith	93d49e04d2	NVPTX: Delete dead code `MDNode` does not inherit from `User`, and it never has a name. llvm-svn: 223198	2014-12-03 04:13:23 +00:00
Tom Stellard	a8af03e062	R600/SI: Enable inline assembly We just needed to remove the assertion in AMDGPURegisterInfo::getFrameRegister(), which is called when initializing the parser for inline assembly. llvm-svn: 223197	2014-12-03 04:08:00 +00:00
Peter Zotov	82181996ad	[OCaml] [cmake] Disable OCaml bindings if ctypes >=0.3 is not found. llvm-svn: 223195	2014-12-03 03:39:01 +00:00
Matt Arsenault	a51749b87e	R600/SI: Change mubuf offsets to print as decimal This matches SC's behavior. llvm-svn: 223194	2014-12-03 03:12:13 +00:00
Nick Lewycky	be2ab4ddd3	Emit the entry block first and the exit block second, then all the blocks in between afterwards. This is what gcc always does, and some out of tree tools depend on that. llvm-svn: 223193	2014-12-03 02:45:01 +00:00
NAKAMURA Takumi	8d1511b4de	GCRelocateOperands: Try to appease msc17. llvm-svn: 223192	2014-12-03 02:40:24 +00:00
Peter Collingbourne	837799f13b	Prologue support Patch by Ben Gamari! This redefines the `prefix` attribute introduced previously and introduces a `prologue` attribute. There are three primary use cases that these attributes aim to serve: 1. Function prologue sigils 2. Function hot-patching: Enable the user to insert `nop` operations at the beginning of the function which can later be safely replaced with a call to some instrumentation facility 3. Runtime metadata: Allow a compiler to insert data for use by the runtime during execution. GHC is one example of a compiler that needs this functionality for its tables-next-to-code functionality. Previously `prefix` served cases (1) and (2) quite well by allowing the user to introduce arbitrary data at the entrypoint but before the function body. Case (3), however, was poorly handled by this approach as it required that prefix data was valid executable code. Here we redefine the notion of prefix data to instead be data which occurs immediately before the function entrypoint (i.e. the symbol address). Since prefix data now occurs before the function entrypoint, there is no need for the data to be valid code. The previous notion of prefix data now goes under the name "prologue data" to emphasize its duality with the function epilogue. The intention here is to handle cases (1) and (2) with prologue data and case (3) with prefix data. References ---------- This idea arose out of discussions[1] with Reid Kleckner in response to a proposal to introduce the notion of symbol offsets to enable handling of case (3). [1] http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-May/073235.html Test Plan: testsuite Differential Revision: http://reviews.llvm.org/D6454 llvm-svn: 223189	2014-12-03 02:08:38 +00:00
NAKAMURA Takumi	7ce4513859	ExceptionDemo: Let setMCJITMemoryManager() take unique_ptr, since r223183. llvm-svn: 223188	2014-12-03 02:05:51 +00:00
Ahmed Bougacha	ac111e2226	[X86][MC] Intel syntax: accept implicit memory operand sizes larger than 80. The X86AsmParser intel handling was refactored in r216481, making it try each different memory operand size to see which one matches. Operand sizes larger than 80 ("[xyz]mmword ptr") were forgotten, which led to an "invalid operand" error for code such as: movdqa [rax], xmm0 llvm-svn: 223187	2014-12-03 02:03:26 +00:00
Lang Hames	c0a1957308	[MCJIT] Unique-ptrify the RTDyldMemoryManager member of MCJIT. NFC. llvm-svn: 223183	2014-12-03 00:51:19 +00:00
Hal Finkel	2b306926ed	[PowerPC] Fix readcyclecounter to be custom expanded for all 32-bit targets We need to use the custom expansion of readcyclecounter on all 32-bit targets (even those with 64-bit registers). This should fix the ppc64 buildbot. llvm-svn: 223182	2014-12-03 00:19:17 +00:00
Tim Northover	0c91cccef8	AArch64: strengthen Darwin ABI alignment assumptions A global variable without an explicit alignment specified should be assumed to be ABI-aligned according to its type, like on other platforms. This allows us to use better memory operations when accessing it. rdar://18533701 llvm-svn: 223180	2014-12-02 23:53:43 +00:00
Pete Cooper	c33369576e	Use a typed enum instead of 'unsigned char' for packed field. NFC. This makes it easier to debug Twine as the 'Kind' fields now show their enum values in lldb and not escaped characters. llvm-svn: 223178	2014-12-02 23:34:23 +00:00
Tim Northover	85149ea9e5	AArch64: don't be too greedy when folding :lo12: accesses into mem ops. This frequently leads to cases like: ldr xD, [xN, :lo12:var] add xA, xN, :lo12:var ldr xD, [xA, #8] where the ADD would have been needed anyway, and the two distinct addressing modes can prevent the formation of an ldp. Because of how we handle ADRP (aggressively forming an ADRP/ADD pseudo-inst at ISel time), this pattern also results in duplicated ADRP instructions (one on its own to cover the ldr, and one combined with the add). llvm-svn: 223172	2014-12-02 23:13:39 +00:00
Michael Zolotukhin	37717c3cf1	PR21302. Vectorize only bottom-tested loops. rdar://problem/18886083 llvm-svn: 223171	2014-12-02 22:59:06 +00:00
Michael Zolotukhin	ce3e203aab	Apply loop-rotate to several vectorizer tests. Such loops shouldn't be vectorized due to their form. After applying loop-rotate (+simplifycfg) the tests again start to check what they are intended to check. llvm-svn: 223170	2014-12-02 22:59:02 +00:00
Simon Pilgrim	12f78b2c48	[X86][SSE] Keep 4i32 vector insertions in integer domain on SSE4.1 targets 4i32 shuffles for single insertions into zero vectors lowers to X86vzmovl which was using (v)blendps - causing domain switch stalls. This patch fixes this by using (v)pblendw instead. The updated tests on test/CodeGen/X86/sse41.ll still contain a domain stall due to the use of insertps - I'm looking at fixing this in a future patch. Differential Revision: http://reviews.llvm.org/D6458 llvm-svn: 223165	2014-12-02 22:31:23 +00:00
Chris Matthews	e15245ded8	Give lit a --xunit-xml-output option for saving results in xunit format --xunit-xml-output saves test results to disk in JUnit's xml format. This will allow Jenkins to report the details of a lit run. Based on a patch by David Chisnall. llvm-svn: 223163	2014-12-02 22:19:21 +00:00
Hal Finkel	337f550328	[PowerPC] Implement readcyclecounter for PPC32 We've long supported readcyclecounter on PPC64, but it is easier there (the read of the 64-bit time-base register can be accomplished via a single instruction). This now provides an implementation for PPC32 as well. On PPC32, the time-base register is still 64 bits, but can only be read 32 bits at a time via two separate SPRs. The ISA manual explains how to do this properly (it involves re-reading the upper bits and looping if the counter has wrapped while being read). This requires PPC to implement a custom integer splitting legalization for the READCYCLECOUNTER node, turning it into a target-specific SDAG node, which then gets turned into a pseudo-instruction, which is then expanded to the necessary sequence (which has three SPR reads, the comparison and the branch). Thanks to Paul Hargrove for pointing out to me that this was still unimplemented. llvm-svn: 223161	2014-12-02 22:01:00 +00:00
Tom Stellard	004db709b2	R600/SI: Emit amd_kernel_code_t header for AMDGPU environment llvm-svn: 223160	2014-12-02 22:00:07 +00:00
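
The signed range check rewrite described in b61d06dbaf (llvm-svn: 223224) can be checked exhaustively on a small bit width. The sketch below is not the InstCombine code. It is a Python model that assumes 8-bit two's-complement values, and every helper name in it is hypothetical. It also assumes that n must be non-negative when read as a signed value for the rewrite to hold, a precondition the commit summary does not state.

```python
BITS = 8                      # assumed width, small enough to enumerate every x
MASK = (1 << BITS) - 1


def to_signed(v):
    """Interpret a BITS-wide bit pattern as a two's-complement signed value."""
    v &= MASK
    return v - (1 << BITS) if v >> (BITS - 1) else v


def and_fold_holds(n):
    """(icmp sge x, 0) & (icmp slt x, n)  versus  icmp ult x, n, for every x."""
    for x in range(1 << BITS):
        signed_form = to_signed(x) >= 0 and to_signed(x) < to_signed(n)
        unsigned_form = (x & MASK) < (n & MASK)
        if signed_form != unsigned_form:
            return False
    return True


def or_fold_holds(n):
    """(icmp slt x, 0) | (icmp sgt x, n)  versus  icmp ugt x, n, for every x."""
    for x in range(1 << BITS):
        signed_form = to_signed(x) < 0 or to_signed(x) > to_signed(n)
        unsigned_form = (x & MASK) > (n & MASK)
        if signed_form != unsigned_form:
            return False
    return True


if __name__ == "__main__":
    non_negative = range(1 << (BITS - 1))          # bit patterns with the sign bit clear
    print(all(and_fold_holds(n) for n in non_negative))
    print(all(or_fold_holds(n) for n in non_negative))
    print(and_fold_holds(MASK))                    # n == -1 as a signed value
```

In this model both rewrites agree with the signed form for every non-negative n, and stop agreeing once n has its sign bit set, which is why the non-negativity assumption is called out above.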


110289 Commits
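
The PPC32 time-base read described in 337f550328 (llvm-svn: 223161) reads the upper half, then the lower half, then the upper half again, and retries if the upper half changed in between. The sketch below models only that retry loop in Python. It is not the SDAG lowering or the emitted instruction sequence, and `read_time_base`, `FakeTimeBase`, and the two reader callables are hypothetical names introduced for illustration.

```python
def read_time_base(read_upper, read_lower):
    """Combine two 32-bit reads into one consistent 64-bit counter value.

    read_upper and read_lower are assumed to return the current upper and
    lower 32 bits of a free-running 64-bit counter.
    """
    while True:
        hi = read_upper()
        lo = read_lower()
        # Third read: if the upper half moved, the lower half wrapped
        # between the first two reads, so the pair is inconsistent.
        if read_upper() == hi:
            return (hi << 32) | lo


class FakeTimeBase:
    """Simulated counter that advances by one tick on every read."""

    def __init__(self, start):
        self.value = start

    def _tick(self):
        self.value = (self.value + 1) & ((1 << 64) - 1)

    def upper(self):
        v = self.value >> 32
        self._tick()
        return v

    def lower(self):
        v = self.value & 0xFFFFFFFF
        self._tick()
        return v


if __name__ == "__main__":
    # Start one tick before the lower half wraps, so the first attempt
    # sees a stale upper half and has to retry.
    tb = FakeTimeBase(0x1FFFFFFFF)
    print(hex(read_time_base(tb.upper, tb.lower)))
```

Without the re-read, the first attempt in this simulation would pair the old upper half with the already wrapped lower half and return a value roughly 2^32 ticks in the past.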