llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Bill Schmidt	b0bab996e0	[PPC64] Fix PR19893 - improve code generation for local function addresses Rafael opened http://llvm.org/bugs/show_bug.cgi?id=19893 to track non-optimal code generation for forming a function address that is local to the compile unit. The existing code was treating both local and non-local functions identically. This patch fixes the problem by properly identifying local functions and generating the proper addis/addi code. I also noticed that Rafael's earlier changes to correct the surrounding code in PPCISelLowering.cpp were also needed for fast instruction selection in PPCFastISel.cpp, so this patch fixes that code as well. The existing test/CodeGen/PowerPC/func-addr.ll is modified to test the new code generation. I've added a -O0 run line to test the fast-isel code as well. Tested on powerpc64[le]-unknown-linux-gnu with no regressions. llvm-svn: 211056	2014-06-16 21:36:02 +00:00
Eric Christopher	0c4bc0dbe3	Since the DataLayout is always found off of the subtarget go ahead and query the base target machine implementation for it. llvm-svn: 211055	2014-06-16 21:18:27 +00:00
Zachary Turner	49cc968196	Clean up some unnecessary mutex guards. These were being used as unreferenced parameters to enforce that the methods must not be called without holding a mutex, but all of the methods in question were internal, and the methods were only exposed through an interface whose entire purpose was to serialize access to these structures, so expecting the methods to be accessed under a mutex is reasonable enough. Reviewed by: blaikie Differential Revision: http://reviews.llvm.org/D4162 llvm-svn: 211054	2014-06-16 20:54:28 +00:00
Louis Gerbarg	7ea43963d7	Improve comments for r211040 Added comment to clarify why we r211040 choose to bail out of fast isel instead of generating a more complicated relocation, and fix mislabelled register in the comments of the asan test case. llvm-svn: 211052	2014-06-16 20:31:50 +00:00
Hans Wennborg	bde907d28e	Revert "lit: warn when passed invalid pathname" (r210597) It was pointed out that this breaks the "virtual test discovery" mechanism, which allows for narming tests in the test exec root. Reverting until I can figure out how to fix this. llvm-svn: 211048	2014-06-16 20:18:41 +00:00
Tim Northover	7f8ca02cf4	ARM: implement correct atomic operations on v7M ARM v7M has ldrex/strex but not ldrexd/strexd. This means 32-bit operations should work as normal, but 64-bit ones are almost certainly doomed. Patch by Phoebe Buckheister. llvm-svn: 211042	2014-06-16 18:49:36 +00:00
Louis Gerbarg	55f89e91ff	Fix illegal relocations in X86FastISel On x86_86 the lea instruction can only use a 32 bit immediate value. When the code is compiled statically the RIP register is not used, meaning the immediate is all that can be used for the relocation, which is not sufficient in the case of targets more than +/- 2GB away. This patch bails out of fast isel in those cases and reverts to DAG which does the right thing. Test case included. llvm-svn: 211040	2014-06-16 17:35:40 +00:00
Jim Grosbach	2272906641	LowerSwitch: track bounding range for the condition tree. When LowerSwitch transforms a switch instruction into a tree of ifs it is actually performing a binary search into the various case ranges, to see if the current value falls into one cases range of values. So, if we have a program with something like this: switch (a) { case 0: do0(); break; case 1: do1(); break; case 2: do2(); break; default: break; } the code produced is something like this: if (a < 1) { if (a == 0) { do0(); } } else { if (a < 2) { if (a == 1) { do1(); } } else { if (a == 2) { do2(); } } } This code is inefficient because the check (a == 1) to execute do1() is not needed. The reason is that because we already checked that (a >= 1) initially by checking that also (a < 2) we basically already inferred that (a == 1) without the need of an extra basic block spawned to check if actually (a == 1). The patch addresses this problem by keeping track of already checked bounds in the LowerSwitch algorithm, so that when the time arrives to produce a Leaf Block that checks the equality with the case value / range the algorithm can decide if that block is really needed depending on the already checked bounds . For example, the above with "a = 1" would work like this: the bounds start as LB: NONE , UB: NONE as (a < 1) is emitted the bounds for the else path become LB: 1 UB: NONE. This happens because by failing the test (a < 1) we know that the value "a" cannot be smaller than 1 if we enter the else branch. After the emitting the check (a < 2) the bounds in the if branch become LB: 1 UB: 1. This is because by checking that "a" is smaller than 2 then the upper bound becomes 2 - 1 = 1. When it is time to emit the leaf block for "case 1:" we notice that 1 can be squeezed exactly in between the LB and UB, which means that if we arrived to that block there is no need to emit a block that checks if (a == 1). Patch by: Marcello Maggioni <hayarms@gmail.com> llvm-svn: 211038	2014-06-16 16:55:20 +00:00
James Molloy	26c8f2b1cd	Refactor the disabling of Thumb-1 LDM/STM generation Originally I switched the LD/ST optimizer off in TargetMachine as it was previously, but Eric has suggested he'd prefer that it be short-circuited in the pass itself. No functionality change. llvm-svn: 211037	2014-06-16 16:42:53 +00:00
Rafael Espindola	93c342bca4	Fix pr17056. This makes llvm-nm ignore members that are not sufficiently aligned for lib/Object to handle. These archives are invalid. GNU AR is able to handle this, but in general just warns about broken archive members. We should probably start warning too, but for now just make sure llvm-nm exits with an 0. llvm-svn: 211036	2014-06-16 16:41:00 +00:00
Rafael Espindola	910ec52f4e	Convert the Archive API to use ErrorOr. Now that we have c++11, even things like ErrorOr<std::unique_ptr<...>> are easy to use. No intended functionality change. llvm-svn: 211033	2014-06-16 16:08:36 +00:00
Tilmann Scheller	c25b867f23	[AArch64] Remove dead code. Both function declarations lack a callee and an implementation. llvm-svn: 211029	2014-06-16 15:15:41 +00:00
Cameron McInally	1bfa586059	Hook up vector int_ctlz for AVX512. llvm-svn: 211024	2014-06-16 14:12:28 +00:00
Daniel Sanders	4895243b92	[mips][mips64r6] ssnop is deprecated on MIPS32r6/MIPS64r6 Summary: Depends on D4120 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: zoran.jovanovic, vmedic Differential Revision: http://reviews.llvm.org/D4121 llvm-svn: 211021	2014-06-16 13:25:35 +00:00
Daniel Sanders	495b392e19	[mips][mips64r6] cl[oz], and dcl[oz] are re-encoded in MIPS32r6/MIPS64r6 Summary: There is no change to the restrictions, just the result register is stored once in the encoding rather than twice. The rt field is zero in MIPS32r6/MIPS64r6. Depends on D4119 Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4120 llvm-svn: 211019	2014-06-16 13:18:59 +00:00
Daniel Sanders	2a30e4fcab	[mips][mips64r6] ll, sc, lld, and scd are re-encoded on MIPS32r6/MIPS64r6. Summary: The linked-load, store-conditional operations have been re-encoded such that have a 9-bit offset instead of the 16-bit offset they have prior to MIPS32r6/MIPS64r6. While implementing this, I noticed that the atomic load/store pseudos always emit a sign extension using sll and sra. I have improved this to use seb/seh when they are available (MIPS32r2/MIPS64r2 and above). Depends on D4118 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4119 llvm-svn: 211018	2014-06-16 13:13:03 +00:00
Dmitri Gribenko	bb418ed93d	Support/ConvertUTF: restore compatibility with MSVC, which only implements C89 llvm-svn: 211016	2014-06-16 11:22:33 +00:00
Dmitri Gribenko	4b5fc58221	Support/ConvertUTF: implement U+FFFD insertion according to the recommendation given in the Unicode spec That is, replace every maximal subpart of an ill-formed subsequence with one U+FFFD. llvm-svn: 211015	2014-06-16 11:09:46 +00:00
James Molloy	d8293dd333	[AArch64] Fix a fencepost error in lowering for llvm.aarch64.neon.uqshl. Patch by Jiangning Liu! llvm-svn: 211014	2014-06-16 10:39:21 +00:00
Daniel Sanders	679bcf9838	[mips] Merge most of the big/little endian checks in atomic.ll Summary: There is very little difference between the big and little endian cases in test/CodeGen/Mips/atomic.ll. Merge them together using multiple FileCheck prefixes. Depends on D4117 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4118 llvm-svn: 211013	2014-06-16 10:25:17 +00:00
Daniel Sanders	14c4e7277a	[mips][mips64r6] [ls][wd]c2 were re-encoded with 11-bit signed immediates rather than 16-bit in MIPS32r6/MIPS64r6 Summary: The error message for the invalid.s cases isn't very helpful. It happens because there is an instruction with a wider immediate that would have matched if the NotMips32r6 predicate were true. I have some WIP to improve the message but it affects most error messages for removed/re-encoded instructions on MIPS32r6/MIPS64r6 and should therefore be a separate commit. Depens on D4115 Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4117 llvm-svn: 211012	2014-06-16 10:00:45 +00:00
Christian Pirker	219e80de72	ARMEB: Fix trunc store for vector types Reviewed at http://reviews.llvm.org/D4135 llvm-svn: 211010	2014-06-16 09:17:30 +00:00
Jingyue Wu	ae39e54823	Canonicalize addrspacecast ConstExpr between different pointer types As a follow-up to r210375 which canonicalizes addrspacecast instructions, this patch canonicalizes addrspacecast constant expressions. Given clang uses ConstantExpr::getAddrSpaceCast to emit addrspacecast cosntant expressions, this patch is also a step towards having the frontend emit canonicalized addrspacecasts. Piggyback a minor refactor in InstCombineCasts.cpp Update three affected tests in addrspacecast-alias.ll, access-non-generic.ll and constant-fold-gep.ll and added one new test in constant-fold-address-space-pointer.ll llvm-svn: 211004	2014-06-15 21:40:57 +00:00
Matt Arsenault	8d575afe8e	Fix copy paste error llvm-svn: 211003	2014-06-15 21:22:52 +00:00
Matt Arsenault	fafd3cb5a2	R600: Add a rotr testcase I forgot to add llvm-svn: 211002	2014-06-15 21:09:00 +00:00
Matt Arsenault	a88eef222c	R600: Remove a few more things from AMDILISelLowering Try to keep all the setOperationActions for integer ops together. llvm-svn: 211001	2014-06-15 21:08:58 +00:00
Matt Arsenault	1f47d520f5	R600: Fix assert on vector sdiv llvm-svn: 211000	2014-06-15 21:08:54 +00:00
Matt Arsenault	512b09be91	R600: Move / cleanup more leftover AMDIL stuff. llvm-svn: 210998	2014-06-15 20:23:38 +00:00
Matt Arsenault	d4919ac014	R600: Move division custom lowering out of AMDILISelLowering llvm-svn: 210997	2014-06-15 20:08:02 +00:00
Eric Christopher	ac850efcf4	Temporarily revert r210953 in an attempt to bring the ARM buildbots back. llvm-svn: 210996	2014-06-15 19:55:14 +00:00
Matt Arsenault	6f5ac69231	R600: Report that integer division is expensive. Divides by weird constants now emit much better code. llvm-svn: 210995	2014-06-15 19:48:16 +00:00
Matt Arsenault	7c3e24fab1	R600: Remove dead code llvm-svn: 210994	2014-06-15 19:48:13 +00:00
David Blaikie	1043ced2ca	PR20038: DebugInfo missing DIEs for some concrete variables. I haven't nailed this down entirely, but this is about as small of a test case as I can seem to construct and adequately demonstrates the crasher. I'll continue investigating the root cause/fix(es). llvm-svn: 210993	2014-06-15 19:34:26 +00:00
Manuel Klimek	f63c0c9c87	Add specialization of FoldingSetTrait for std::pair. llvm-svn: 210990	2014-06-15 14:42:25 +00:00
Tim Northover	7dd495fd0e	LegalizeDAG: make sure cast is unsigned before using FP_TO_UINT. It's valid to use FP_TO_SINT when asking for a smaller type (e.g. all "unsigned int16" values fit into a "signed int32"), but the reverse isn't true. Unfortunately, I'm not actually aware of any architecture with asymmetric FP_TO_SINT and FP_TO_UINT handling and the logic happens to work in the symmetric case, so I can't actually write a test for this. llvm-svn: 210986	2014-06-15 09:27:20 +00:00
Tim Northover	9eac1de1e4	AArch64: improve handling & modelling of FP_TO_XINT nodes. There's probably no acatual change in behaviour here, just updating the LowerFP_TO_INT function to be more similar to the reverse implementation and updating costs to current CodeGen. llvm-svn: 210985	2014-06-15 09:27:15 +00:00
Tim Northover	0f6e617e90	AArch64: improve vector [su]itofp handling. This somehow got missed in the AArch64 merge, so should fix a performance regression since 3.4. llvm-svn: 210984	2014-06-15 09:27:06 +00:00
NAKAMURA Takumi	a6ff4c4e16	Don't expect tests always crashing. Add "REQUIRES:asserts". llvm-svn: 210983	2014-06-15 01:01:11 +00:00
Artyom Skrobov	a2c0f0b696	Replacing the private implementations of SwapValue with calls to sys::swapByteOrder() llvm-svn: 210980	2014-06-14 13:49:57 +00:00
Artyom Skrobov	8e686bd8fe	Using llvm::sys::swapByteOrder() for the common case of byte-swapping a value in place llvm-svn: 210978	2014-06-14 13:18:07 +00:00
Artyom Skrobov	33ac4d71e5	Adding llvm::sys::swapByteOrder() for the common use-case of byte-swapping a value in place llvm-svn: 210976	2014-06-14 12:52:55 +00:00
Artyom Skrobov	9d70ea6c1e	Renaming SwapByteOrder() to getSwappedBytes() The next commit will add swapByteOrder(), acting in-place llvm-svn: 210973	2014-06-14 11:36:01 +00:00
Matt Arsenault	5f7306c2c6	R600: Add failing testcases. These are reduced from assert in the OpenCV CvtColor8u.BGR5652GRAY test. llvm-svn: 210969	2014-06-14 04:26:09 +00:00
Matt Arsenault	acf5b84870	Fix typo llvm-svn: 210968	2014-06-14 04:26:07 +00:00
Matt Arsenault	b2c8575d08	R600: Fix asserts related to constant initializers This would assert if a constant address space was extern and therefore didn't have an initializer. If the initializer was undef, it would hit the unreachable unhandled initializer case. An extern global should never really occur since we don't have machine linking, but bugpoint likes to remove initializers. llvm-svn: 210967	2014-06-14 04:26:05 +00:00
Matt Arsenault	fd04db6d9e	R600: Use address space enum instead of value llvm-svn: 210966	2014-06-14 04:26:01 +00:00
Nick Lewycky	fd813dfe75	Remove extra whitespace in function declaration. No functionality change. llvm-svn: 210965	2014-06-14 03:48:29 +00:00
David Blaikie	53324d9a53	DebugInfo: Remove some extra handling of abstract variables and instead rely solely on the delayed handling introduced in r210946 Now that we handle finding abstract variables at the end of the module, remove the upfront handling and just ensure the abstract variable is built when necessary. In theory we could have a split implementation, where inlined variables are immediately constructed referencing the abstract definition, and concrete variables are delayed - but let's go with one solution for now unless there's a reason not to. llvm-svn: 210961	2014-06-13 23:52:55 +00:00
Eric Christopher	f650ca8a5b	Remove InstrItineraryData off of the TargetMachine - it's already on the subtarget and just forward the accessor. llvm-svn: 210955	2014-06-13 23:11:13 +00:00
Eric Christopher	395ff9e8de	Move ARMJITInfo off of the TargetMachine and down onto the subtarget. This required untangling a mess of headers that included around. llvm-svn: 210953	2014-06-13 23:04:46 +00:00

1 2 3 4 5 ...

104526 Commits