llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Craig Topper	d480fb4c55	Fix typo in static_assert message. NFC llvm-svn: 300179	2017-04-13 07:31:52 +00:00
Lang Hames	2559d103ff	[ORC] Temporarily disable the RPC Error/Expected unit tests while I investigate bot failures. llvm-svn: 300177	2017-04-13 06:20:45 +00:00
Lang Hames	8ebcd86ba7	[Orc] Fix bool serialization for RawByteChannels. The bool type may be larger than the char type, so assuming we can cast from bool to char and write a byte out to the stream is unsafe. Hopefully this will get RPCUtilsTest.ReturnExpectedFailure passing on the bots. llvm-svn: 300174	2017-04-13 05:23:50 +00:00
Lang Hames	29856c2e3d	[ORC] Remove more extraneous semicolons from r300167, rename the RPC Expected tests to be consistent with the Error tests. llvm-svn: 300173	2017-04-13 05:05:26 +00:00
George Burgess IV	c7cbe3ab30	Remove more lies from the LangRef. Same change as in r300168, but for invoke instead of call. llvm-svn: 300172	2017-04-13 05:00:31 +00:00
Craig Topper	ed420c20f5	[APInt] Reorder fields to avoid a hole in the middle of the class Summary: APInt is currently implemented with an unsigned BitWidth field first and then a uint_64/pointer union. Due to the 64-bit size of the union there is a hole after the bitwidth. Putting the union first allows the class to be packed. Making it 12 bytes instead of 16 bytes. An APSInt goes from 20 bytes to 16 bytes. This shows a 4k reduction on the size of the opt binary on my local x86-64 build. So this enables some other improvement to the code as well. Reviewers: dblaikie, RKSimon, hans, davide Reviewed By: davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D32001 llvm-svn: 300171	2017-04-13 04:59:11 +00:00
Lang Hames	18515cd525	[ORC] Remove extraneous semi-colon added in r300167. llvm-svn: 300170	2017-04-13 04:49:00 +00:00
Craig Topper	8c209cdca5	[APInt] Generalize the implementation of tcIncrement to support adding a full 'word' by introducing tcAddPart. Use this to support tcIncrement, operator++ and operator+=(uint64_t). Do the same for subtract. NFCI. llvm-svn: 300169	2017-04-13 04:36:06 +00:00
George Burgess IV	06a64bc421	Update the LangRef to reflect reality. At the very least, we have CallInst::setIsNoInline() for adding the noinline attribute to callsites, and I'm told alwaysinline seems to work. Thought of adding "not all attributes are guaranteed to work here". If someone thinks that would be better (or has a better way of phrasing that, etc.), happy to add it. llvm-svn: 300168	2017-04-13 04:01:55 +00:00
Lang Hames	8e2bbc3bed	[ORC] Add RPC and serialization support for Errors and Expecteds. This patch allows Error and Expected types to be passed to and returned from RPC functions. Serializers and deserializers for custom error types (types deriving from the ErrorInfo class template) can be registered with the SerializationTraits for a given channel type (see registerStringError in RPCSerialization.h for an example), allowing a given custom type to be sent/received. Unregistered types will be serialized/deserialized as StringErrors using the custom type's log message as the error string. llvm-svn: 300167	2017-04-13 03:51:35 +00:00
Zachary Turner	e08f706908	Remove some unused private fields. llvm-svn: 300163	2017-04-13 02:28:17 +00:00
Craig Topper	cbc8bd7d59	[InstCombine] Add vector version of a test to show missing optimization. llvm-svn: 300161	2017-04-13 01:31:40 +00:00
Peter Collingbourne	1553cd1af6	Support: Add a VCSRevision.h header file. This is a magic header file supported by the build system that provides a single definition, LLVM_REVISION, containing an LLVM revision identifier, if available. This functionality previously lived in the LTO library, but I am moving it out to lib/Support because I want to also start using it in lib/Object to create the IR symbol table. This change also fixes a bug where LLVM_REVISION was never actually being used in lib/LTO because the macro HAS_LLVM_REVISION was never defined (it was misspelled as HAVE_SVN_VERSION_INC in lib/LTO/CMakeLists.txt, and was only being defined in a non-existent file Version.cpp). I also changed the code to use "git rev-parse --git-dir" to locate the .git directory, instead of looking for it in the LLVM source root directory, which makes this compatible with monorepos as well as git worktrees. Differential Revision: https://reviews.llvm.org/D31985 llvm-svn: 300160	2017-04-13 01:26:12 +00:00
Lang Hames	6149003b00	[ORC] Add missing file from r300155. llvm-svn: 300157	2017-04-13 01:06:45 +00:00
Lang Hames	0fe1e38356	[ORC] Use native Errors rather than converted std::error_codes for ORC RPC. llvm-svn: 300155	2017-04-13 01:03:06 +00:00
Reid Kleckner	b5a125b854	[IR] Take func, ret, and arg attrs separately in AttributeList::get This seems like a much more natural API, based on Derek Schuff's comments on r300015. It further hides the implementation detail of AttributeList that function attributes come last and appear at index ~0U, which is easy for the user to screw up. git diff says it saves code as well: 97 insertions(+), 137 deletions(-) This also makes it easier to change the implementation, which I want to do next. llvm-svn: 300153	2017-04-13 00:58:09 +00:00
Craig Topper	82782780bb	[IR] Remove the APIntMoveTy typedef from ConstantRange. Use APInt type directly. This typedef used to be conditional based on whether rvalue references were supported. Looks like it got left behind when we switched to always having rvalue references with c++11. I don't think it provides any value now. llvm-svn: 300146	2017-04-13 00:20:31 +00:00
Richard Smith	91c7d3193d	Work around MSVC rejects-valid bug related to C++11 narrowing conversions. llvm-svn: 300144	2017-04-13 00:14:39 +00:00
Konstantin Zhuravlyov	309240414d	Fix compiler error in Attributes.cpp ``` Compiling Attributes.cpp ... ../../../Attributes.cpp: In member function 'std::__1::pair<unsigned int, llvm::Optional<unsigned int> > llvm::AttributeSet::getAllocSizeArgs() const': ../../../Attributes.cpp:542:69: error: operands to ?: have different types 'std::__1::pair<unsigned int, llvm::Optional<unsigned int> >' and 'std::__1::pair<int, int>' return SetNode ? SetNode->getAllocSizeArgs() : std::make_pair(0, 0); ^ ../../../Attributes.cpp:543:1: error: control reaches end of non-void function [-Werror=return-type] } ^ ``` Differential Revision: https://reviews.llvm.org/D31981 llvm-svn: 300143	2017-04-12 23:57:37 +00:00
Wei Ding	57ada34a3f	AMDGPU : Fix common dominator of two incoming blocks terminates with uniform branch issue. Differential Revision: http://reviews.llvm.org/D31350 llvm-svn: 300142	2017-04-12 23:51:47 +00:00
Richard Smith	e6e29e1f51	Fix some ArgList uses after API change in r300135. llvm-svn: 300139	2017-04-12 23:43:58 +00:00
Zachary Turner	322f929d52	Fix initialization order of class members. llvm-svn: 300137	2017-04-12 23:27:43 +00:00
Richard Smith	b21c01c07b	ArgList: cache index ranges containing arguments with each ID Improve performance of argument list parsing with large numbers of IDs and large numbers of arguments, by tracking a conservative range of indexes within the argument list that might contain an argument with each ID. In the worst case (when the first and last argument with a given ID are at the opposite ends of the argument list), this still results in a linear-time walk of the list, but it helps substantially in the common case where each ID occurs only once, or a few times close together in the list. This gives a ~10x speedup to clang's `test/Driver/response-file.c`, which constructs a very large set of command line arguments and feeds them to the clang driver. Differential Revision: https://reviews.llvm.org/D30130 llvm-svn: 300135	2017-04-12 23:19:51 +00:00
Zachary Turner	89377d6ac2	[llvm-pdbdump] Minor prepatory refactor of Class Def Dumper. In a followup patch I intend to introduce an additional dumping mode which dumps a graphical representation of a class's layout. In preparation for this, the text-based layout printer needs to be split out from the graphical layout printer, and both need to be able to use the same code for printing the intro and outro of a class's definition (e.g. base class list, etc). This patch does so, and in the process introduces a skeleton definition for the graphical printer, while currently making the graphical printer just print nothing. NFC llvm-svn: 300134	2017-04-12 23:18:51 +00:00
Zachary Turner	8a8f84f312	[llvm-pdbdump] More advanced class definition dumping. Previously the dumping of class definitions was very primitive, and it made it hard to do more than the most trivial of output formats when dumping. As such, we would only dump one line for each field, and then dump non-layout items like nested types and enums. With this patch, we do a complete analysis of the object hierarchy including aggregate types, bases, virtual bases, vftable analysis, etc. The only immediately visible effects of this are that a) we can now dump a line for the vfptr where before we would treat that as padding, and b) we now don't treat virtual bases that come at the end of a class as padding since we have a more detailed analysis of the class's storage usage. In subsequent patches, we should be able to use this analysis to display a complete graphical view of a class's layout including recursing arbitrarily deep into an object's base class / aggregate member hierarchy. llvm-svn: 300133	2017-04-12 23:18:21 +00:00
Akira Hatanaka	1b72ae62d8	[libFuzzer] XFAIL fuzzer-oom.test on Darwin. The test fails on Darwin because Fuzzer::DeathCallback (which calls DumpCurrentUnit("crash-")) is called before DumpCurrentUnit("oom-") is called in Fuzzer::RssLimitCallback. DeathCallback is transitively called from __sanitizer_print_memory_profile. This should fix the fuzzer bot that has been failing for a while: http://lab.llvm.org:8080/green/job/libFuzzer/ llvm-svn: 300127	2017-04-12 23:15:10 +00:00
Craig Topper	8d04711bd9	[InstSimplify] Don't try to constant fold AllocaInsts since it won't do anything. Should give a small compile time improvement. llvm-svn: 300125	2017-04-12 22:54:24 +00:00
Reid Kleckner	c764621a52	[IR] Make AttributeSet constructor from AttributeSetNode* explicit llvm-svn: 300119	2017-04-12 22:30:37 +00:00
Craig Topper	7552ec6918	[ValueTracking] Teach GetUnderlyingObject to stop when it reachs an alloca instruction. Previously it tried to call SimplifyInstruction which doesn't know anything about alloca so defers to constant folding which also doesn't do anything with alloca. This results in wasted cycles making calls that won't do anything. Given the frequency with which this function is called this time adds up. llvm-svn: 300118	2017-04-12 22:29:23 +00:00
Reid Kleckner	008567c0bb	[IR] Assert that we never create an empty AttributeListImpl, NFC Delete following conditional that is always true as a result. llvm-svn: 300117	2017-04-12 22:22:01 +00:00
Matt Arsenault	8e264191d7	AMDGPU: Fix invalid copies when copying i1 to phys reg Insert a VReg_1 virtual register so the i1 workaround pass can handle it. llvm-svn: 300113	2017-04-12 21:58:23 +00:00
Stanislav Mekhanoshin	51dba4fa40	[AMDGPU] Generate range metadata for workitem id If workgroup size is known inform llvm about range returned by local id and local size queries. Differential Revision: https://reviews.llvm.org/D31804 llvm-svn: 300102	2017-04-12 20:48:56 +00:00
Piotr Padlewski	cd98794503	Remove readnone from invariant.group.barrier Summary: Readnone attribute would cause CSE of two barriers with the same argument, which is invalid by example: struct Base { virtual int foo() { return 42; } }; struct Derived1 : Base { int foo() override { return 50; } }; struct Derived2 : Base { int foo() override { return 100; } }; void foo() { Base *x = new Base{}; new (x) Derived1{}; int a = std::launder(x)->foo(); new (x) Derived2{}; int b = std::launder(x)->foo(); } Here 2 calls of std::launder will produce @llvm.invariant.group.barrier, which would be merged into one call, causing devirtualization to devirtualize second call into Derived1::foo() instead of Derived2::foo() Reviewers: chandlerc, dberlin, hfinkel Subscribers: llvm-commits, rsmith, amharc Differential Revision: https://reviews.llvm.org/D31531 llvm-svn: 300101	2017-04-12 20:45:12 +00:00
Vassil Vassilev	f54522eae6	Append -w when LLVM_ENABLE_WARNINGS is Off. Reviewed by rnk (D31702)! llvm-svn: 300100	2017-04-12 20:43:11 +00:00
Peter Collingbourne	49cfbdded9	Bitcode: Move version and global value module code parsers to separate functions. NFCI. This will make it easier to teach this code about the string table. Differential Revision: https://reviews.llvm.org/D31828 llvm-svn: 300099	2017-04-12 20:02:09 +00:00
Zachary Turner	fbbe67869c	[Support] Add support for unique_ptr<> to Casting.h. Often you have a unique_ptr<T> where T supports LLVM's casting methods, and you wish to cast it to a unique_ptr<U>. Prior to this patch, this requires doing hacky things like: unique_ptr<U> Casted; if (isa<U>(Orig.get())) Casted.reset(cast<U>(Orig.release())); This is overly verbose, and it would be nice to just be able to use unique_ptr directly with cast and dyn_cast. To this end, this patch updates cast<> to work directly with unique_ptr<T>, so you can now write: auto Casted = cast<U>(std::move(Orig)); Since it's possible for dyn_cast<> to fail, however, we choose to use a slightly different API here, because it's awkward to write if (auto Casted = dyn_cast<U>(std::move(Orig))) {} when Orig may end up not having been moved at all. So the interface for dyn_cast is if (auto Casted = unique_dyn_cast<U>(Orig)) {} Where the inclusion of `unique` in the name of the cast operator re-affirms that regardless of success of or fail of the casting, exactly one of the input value and the return value will contain a non-null result. Differential Revision: https://reviews.llvm.org/D31890 llvm-svn: 300098	2017-04-12 19:59:37 +00:00
Craig Topper	b4b11616b8	[InstCombine] Teach SimplifyMultipleUseDemandedBits to handle And/Or/Xor known bits using the LHS/RHS known bits it already acquired without recursing back into computeKnownBits. This replicates the known bits and constant creation code from the single use case for these instructions and adds it here. The computeKnownBits and constant creation code for other instructions is now in the default case of the opcode switch. llvm-svn: 300094	2017-04-12 19:32:47 +00:00
Craig Topper	14865ed642	[InstCombine] Remove unreachable code for turning an And where all demanded bits on both sides are known to be zero into a constant 0. We already handled a superset check that included the known ones too and folded to a constant that may include ones. But it can also handle the case of no ones. llvm-svn: 300093	2017-04-12 19:08:03 +00:00
Sanjay Patel	013822ac22	[InstCombine] fix wrong undef handling when converting select to shuffle As discussed in: https://bugs.llvm.org/show_bug.cgi?id=32486 ...the canonicalization of vector select to shufflevector does not hold up when undef elements are present in the condition vector. Try to make the undef handling clear in the code and the LangRef. Differential Revision: https://reviews.llvm.org/D31980 llvm-svn: 300092	2017-04-12 18:39:53 +00:00
Craig Topper	9be0fc31bf	[SelectionDAG] Use APInt move assignment to avoid 2 memory allocations and copies when bit width is larger than 64-bits. llvm-svn: 300091	2017-04-12 18:39:27 +00:00
Kyle Butt	19ecb63f39	CodeGen: BlockPlacement: Add comment about DenseMap Safety. The use of a DenseMap in precomputeTriangleChains does not cause non-determinism, even though it is iterated over, as the only thing the iteration does is to insert entries into a new DenseMap, which is not iterated. Comment only change. llvm-svn: 300088	2017-04-12 18:30:32 +00:00
Peter Collingbourne	47c4d3700d	llvm-lto2: Add a dump-symtab subcommand. This allows us to test the symbol table APIs for LTO input files. Differential Revision: https://reviews.llvm.org/D31920 llvm-svn: 300086	2017-04-12 18:27:00 +00:00
Craig Topper	0059ecd4b1	[InstCombine] In SimplifyMultipleUseDemandedBits, use a switch instead of cascaded ifs on opcode. NFC llvm-svn: 300085	2017-04-12 18:25:25 +00:00
Craig Topper	5d10384cce	[InstCombine] Teach SimplifyDemandedInstructionBits that even if we reach an instruction that has multiple uses, if we know all the bits for the demanded bits for this context we can go ahead and create a constant. Currently if we reach an instruction with multiples uses we know we can't do any optimizations to that instruction itself since we only have the demanded bits for one of the users. But if we know all of the bits are zero/one for that one user we can still go ahead and create a constant to give to that user. This might then reduce the instruction to having a single use and allow additional optimizations on the other path. This picks up an additional case that r300075 didn't catch. Differential Revision: https://reviews.llvm.org/D31552 llvm-svn: 300084	2017-04-12 18:17:46 +00:00
Matthias Braun	1c66620e08	MachineScheduler: Skip acyclic latency heuristic for in-order cores The current heuristic is triggered on `InFlightCount > BufferLimit` which isn't really helpful on in-order cores where BufferLimit is zero. Note that we already get latency hiding effects for in order cores by instructions staying in the pending queue on stalls; The additional latency scheduling heuristics only have minimal effects after that while occasionally increasing register pressure too much resulting in extra spills. My motivation here is additional spills/reloads ending up in a loop in 464.h264ref / BlockMotionSearch function resulting in a 4% overal regression on an in order core. rdar://30264380 llvm-svn: 300083	2017-04-12 18:09:05 +00:00
Craig Topper	ed32b47b7b	[InstCombine] Move portion of SimplifyDemandedUseBits that deals with instructions with multiple uses out to a separate method. NFCI llvm-svn: 300082	2017-04-12 18:05:21 +00:00
Renato Golin	4d592d0072	[SystemZ] Fix more target specific tests llvm-svn: 300081	2017-04-12 18:03:09 +00:00
Renato Golin	941c1f67e7	[SystemZ] Fix target specific tests llvm-svn: 300078	2017-04-12 17:14:46 +00:00
Dmitry Preobrazhensky	6e59517a49	[AMDGPU][MC] Added support for several VI-specific opcodes (s_wakeup, etc) Added support for VI: - s_endpgm_saved - s_wakeup - s_rfe_restore_b64 - v_perm_b32 Enabled for VI: - v_mov_fed_b32 - v_mov_fed_b32_e64 See bug 32593: https://bugs.llvm.org//show_bug.cgi?id=32593 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D31931 llvm-svn: 300076	2017-04-12 17:10:07 +00:00
Craig Topper	566511cc61	Teach SimplifyDemandedUseBits that adding or subtractings 0s from every bit below the highest demanded bit can be simplified If we are adding/subtractings 0s below the highest demanded bit we can just use the other operand and remove the operation. My primary motivation is observing that we can call ShrinkDemandedConstant for the add/sub and create a 0 constant, rather than removing the add completely. In the case I saw, we modified the constant on an add instruction to a 0, but the add is not put into the worklist. So we didn't revisit it until the next InstCombine iteration. This caused an IR modification to remove add and a subsequent iteration to be ran. With this change we get bypass the add in the first iteration and prevent the second iteration from changing anything. Differential Revision: https://reviews.llvm.org/D31120 llvm-svn: 300075	2017-04-12 16:49:59 +00:00

... 2 3 4 5 6 ...

147562 Commits