llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Tim Shen	ed9415a1e1	Cleanup some GraphTraits iteration code Use children<> and nodes<> in appropriate places to cleanup the code. Also, as part of the cleanup, change the signature of DominatorTreeBase's Split. It is a protected non-virtual member function called only twice, both from within the class, and the removed passed argument in both cases is 'this'. The reason for the existence of that argument seems to be that back before r43115 Split was a free function, so an argument to get 'this' was needed - but now that is no longer the case. Patch by Yoav Ben-Shalom! Differential Revision: https://reviews.llvm.org/D32118 llvm-svn: 300656	2017-04-19 03:22:50 +00:00
Serge Pavlov	b972759f78	ARM: Use methods to access data stored with frame instructions In r300196 several methods were added to TarfetInstrInfo to access data stored with call frame setup/destroy instructions. This change replaces calls to getOperand with calls to such special methods in ARM target. Differential Revision: https://reviews.llvm.org/D32127 llvm-svn: 300655	2017-04-19 03:12:05 +00:00
Reid Kleckner	1963bafdf3	Remove buggy 'addAttributes(unsigned, AttrBuilder)' overload The 'addAttributes(unsigned, AttrBuilder)' overload delegated to 'get' instead of 'addAttributes'. Since we can implicitly construct an AttrBuilder from an AttributeSet, just standardize on AttrBuilder. llvm-svn: 300651	2017-04-19 01:51:13 +00:00
Kostya Serebryany	cc304f1491	[libFuzzer] update -help: mention -exact_artifact_path in help for -minimize_crash and -cleanse_crash llvm-svn: 300642	2017-04-19 01:22:04 +00:00
Leslie Zhai	2cfa01472a	[AVR] Migrate to new MCAsmInfo CodePointerSize Reviewers: dylanmckay, rengolin, kzhuravl, jroelofs Reviewed By: kzhuravl, jroelofs Subscribers: kzhuravl, llvm-commits Differential Revision: https://reviews.llvm.org/D32154 llvm-svn: 300641	2017-04-19 01:20:43 +00:00
Matthias Braun	47b301a014	ARMFrameLowering: Reserve emergency spill slot for large arguments We need to reserve an emergency spill slot in cases with large argument types that could overflow immediate offsets for FP relative address calculations. rdar://31317893 Differential Revision: https://reviews.llvm.org/D31643 llvm-svn: 300639	2017-04-19 01:16:07 +00:00
Craig Topper	7366743efb	[DataLayout] Removed default value from a variable that isn't used without being overwritten. Make variable an enum instead of an int to avoid a cast later. NFC llvm-svn: 300634	2017-04-19 00:31:38 +00:00
Dean Michael Berris	02c3f48c88	[XRay][tools] Fix yaml matching to be more permissive Account for a potentially empty function name. Follow-up to D32153. llvm-svn: 300631	2017-04-19 00:10:09 +00:00
Xin Tong	dc8dd17743	Allow suppressing host and target info in VersionPrinter Summary: VersionPrinter by default outputs information about the Host CPU and Default target. Printing this information requires linking in a large amount of data, such as supported target triples as C strings, which in turn bloats the binary size. Enable a new CMake option LLVM_VERSION_PRINTER_SHOW_HOST_TARGET_INFO which controls printing of the host and target info. This allows the target triple names to be dead-code stripped. This is a nice win for LLVM clients that wish to minimize their binary size, such as graphics drivers. By default this is ON, so there is no change in the default behavior. Clients who wish to suppress this printing can do so by setting this option to off via CMake. A test app on Linux that uses ParseCommandLineOptions() shows a binary size reduction of 23KB (from 149K to 126K) for a Release build, and 24KB (from 135K to 111K) in a MinSizeRel build. Reviewers: klimek, beanz, bogner, chandlerc, compnerd Reviewed By: compnerd Patch by pammon (Peter Ammon) ! Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30904 llvm-svn: 300630	2017-04-19 00:03:36 +00:00
Dylan McKay	b65c650066	[AVR] Fix the build 'PointerSize' was renamed to 'CodePointerSize'. llvm-svn: 300629	2017-04-18 23:53:10 +00:00
Dean Michael Berris	bb6b9f32df	[XRay][tools] Add option to llvm-xray extract to symbolize functions Summary: This allows us to, if the symbol names are available in the binary, be able to provide the function name in the YAML output. Reviewers: dblaikie, pelikan Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32153 llvm-svn: 300624	2017-04-18 23:23:54 +00:00
Craig Topper	1c6b3b5c27	[ConstantRange] Optimize APInt creation in getSignedMax/getSignedMin. We were creating an APInt at the top of these methods that isn't always returned. For ranges wider than 64-bits this results in an allocation and deallocation when its not used. In getSignedMax we were creating Upper-1 to use in a compare and then creating it again for a return value. The compiler is unable to determine that these can be shared. So help it out and create the Upper-1 in a temporary that can be reused. This provides a little compile time improvement. llvm-svn: 300621	2017-04-18 23:02:39 +00:00
Sanjay Patel	155d5e434f	[x86] add tests for potential andn optimization; NFC llvm-svn: 300617	2017-04-18 22:36:59 +00:00
Reid Kleckner	03188149b5	Fix crash in AttributeList::addAttributes, add test llvm-svn: 300614	2017-04-18 22:10:18 +00:00
Sanjoy Das	038a5db80a	Add a getPointerOperandType() helper to LoadInst and StoreInst; NFC I will use this in a later change. llvm-svn: 300613	2017-04-18 22:00:54 +00:00
Craig Topper	3ff7c5ae67	[MemoryBuiltins] Add isMallocOrCallocLikeFn so BasicAA can check for both at the same time BasicAA wants to know if a function is either a malloc or calloc like function. Currently we have to check both separately. This means both calls check if its an intrinsic, query TLI, check the nobuiltin attribute, scan the AllocationFnData, etc. This patch adds a isMallocOrCallocLikeFn so we can go through all of the checks once per call. This also changes the one other location I saw that called both together. Differential Revision: https://reviews.llvm.org/D32188 llvm-svn: 300608	2017-04-18 21:43:46 +00:00
Davide Italiano	79a2f25422	[LoopReroll] Prefer hasNUses/hasNUses or more as they're cheaper. NFCI. llvm-svn: 300607	2017-04-18 21:42:21 +00:00
Matt Arsenault	1fd25dc5d0	DAG: Make mayBeEmittedAsTailCall parameter const llvm-svn: 300603	2017-04-18 21:16:46 +00:00
Matt Arsenault	8f5e36a0ef	Fix typo llvm-svn: 300597	2017-04-18 20:59:46 +00:00
Matt Arsenault	8611f8573b	AMDGPU: Make MFI fields private llvm-svn: 300596	2017-04-18 20:59:40 +00:00
Craig Topper	ef23c7e15c	[MemoryBuiltins] Use ImmutableCallSite instead of CallSite to remove a const_cast and const correct. NFCI llvm-svn: 300585	2017-04-18 20:17:23 +00:00
Daniel Berlin	3aa04388f3	NewGVN: Fix memory congruence verification. The return true should be a return false. Merge the appropriate if statements so it doesn't happen again. llvm-svn: 300584	2017-04-18 20:15:47 +00:00
Chih-Hung Hsieh	047ba89d96	[X86] Keep EXTRACT_VECTOR_ELT result type as f128 for Android x86_64. Android x86_64 target uses f128 type and stores f128 values in %xmm* registers. SoftenFloatRes_EXTRACT_VECTOR_ELT should not convert result value from f128 to i128. Differential Revision: http://reviews.llvm.org/D32102 llvm-svn: 300583	2017-04-18 20:15:18 +00:00
Craig Topper	dab4df0257	[APInt] Inline the single word case of lshrInPlace similar to what we do for <<=. llvm-svn: 300577	2017-04-18 19:13:27 +00:00
Simon Pilgrim	d9eb4d21ab	[X86][SSE] Add scheduling latency/throughput tests for (most) SSE1 instructions llvm-svn: 300576	2017-04-18 19:04:40 +00:00
Easwaran Raman	7b83777c46	[SLP vectorizer] Allow phi node reordering in tryToVectorizeList. In tryToVectorizeList, under a very limited circumstance (when entered from tryToVectorizePair), the values may be reordered (swapped) and the SLP tree is built with the new order. This extends that to the case when starting from phis in vectorizeChainsInBlock when there are exactly two phis. The textual order of phi nodes shouldn't really matter. Without this change, the loop body in the accompnaying test case is fully vectorized when we swap the orde of the phis but not with this order. While this doesn't solve the phi-ordering problem in a general way (for more than 2 phis), this is simple fix that piggybacks on an existing mechanism and is useful in cases like multiplying two complex numbers. Differential revision: https://reviews.llvm.org/D32065 llvm-svn: 300574	2017-04-18 18:16:57 +00:00
Simon Pilgrim	029fe287df	[X86] Use for-range loop. NFCI. llvm-svn: 300567	2017-04-18 17:18:54 +00:00
Craig Topper	debb291669	[APInt] Use lshrInPlace to replace lshr where possible This patch uses lshrInPlace to replace code where the object that lshr is called on is being overwritten with the result. This adds an lshrInPlace(const APInt &) version as well. Differential Revision: https://reviews.llvm.org/D32155 llvm-svn: 300566	2017-04-18 17:14:21 +00:00
Daniel Berlin	20327c70bd	NewGVN: Don't waste time value numbering unreachable blocks llvm-svn: 300565	2017-04-18 17:06:11 +00:00
Nirav Dave	9e2d41cff4	[DAG] Improve store merge candidate pruning. Remove non-consecutive stores from store merge candidate search as they cannot be merged and will prevent us from finding subsequent mergeable store cases. Reviewers: jyknight, bogner, javed.absar, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32086 llvm-svn: 300561	2017-04-18 15:36:34 +00:00
Nirav Dave	23e3be6e6e	Add base-index-based store merge test llvm-svn: 300559	2017-04-18 15:12:13 +00:00
Zvi Rackover	9a0fbd2ef8	LoopRerollPass: Prefer Value::hasOneUse() over Value::getNumUses(). NFC. getNumUses() can be more expensive as it iterates over all list's elements. llvm-svn: 300558	2017-04-18 14:55:43 +00:00
Gil Rapaport	e43ac15a7c	[LV] Cache block mask values This patch is part of D28975's breakdown. Add caching for block masks similar to the cache already used for edge masks, replacing generation per user with reusing the first generated value which dominates all uses. Differential Revision: https://reviews.llvm.org/D32054 llvm-svn: 300557	2017-04-18 14:43:43 +00:00
Sanjay Patel	2d612df495	[ConstantRange] fix doxygen comment formatting; NFC llvm-svn: 300554	2017-04-18 14:27:24 +00:00
Nikolai Bozhenov	ed492b9592	Make globalaa-retained.ll test catching more cases. Summary: * Add checks for store. That is needed because GlobalsAA is called twice in the current pipeline with different sets of Function passes following it. However, the loads are eliminated using instcombine which happens everywhere. On the other hand, DeadStoreElimination is performed only once so by checking for store we'll be able to catch more cases when GlobalsAA is invalidated unintentionally. * Add empty function above/below the test so that we don't depend on the relative order of instcombine/dead-store-elimination and the pass that invalidates the analysis (inside the same FunctionPassManager). Reviewers: kristof.beyls Reviewed By: kristof.beyls Subscribers: llvm-commits, n.bozhenov Differential Revision: https://reviews.llvm.org/D32015 Patch by Andrei Elovikov <andrei.elovikov@intel.com> llvm-svn: 300553	2017-04-18 13:29:26 +00:00
Nikolai Bozhenov	bd24efc989	[GVNHoist] Mark GlobalsAA as preserved by GVNHoist. Reviewers: sebpop, hiraditya Reviewed By: sebpop Subscribers: n.bozhenov, llvm-commits Differential Revision: https://reviews.llvm.org/D32158 Patch by Andrei Elovikov <andrei.elovikov@intel.com> llvm-svn: 300552	2017-04-18 13:25:49 +00:00
Nirav Dave	38398a2ddd	Add store Merge test. llvm-svn: 300551	2017-04-18 13:25:19 +00:00
Oliver Stannard	76132aaa0b	[ARM] Add hardware build attributes in assembler In the assembler, we should emit build attributes based on the target selected with command-line options. This matches the GNU assembler's behaviour. We only do this for build attributes which describe the hardware that is expected to be available, not the ones that describe ABI compatibility. This is done by moving some of the attribute emission code to ARMTargetStreamer, so that it can be shared between the assembly and code-generation code paths. Since the assembler only creates a MCSubtargetInfo, not an ARMSubtarget, the code had to be changed to check raw features, and not use the convenience functions in ARMSubtarget. If different attributes are later specified using the .eabi_attribute directive, then they will take precedence, as happens when the same .eabi_attribute is specified twice. This must be enabled by an option, because we don't want to do this when parsing inline assembly. The attributes would match the ones emitted at the start of the file, so wouldn't actually change the emitted object file, but the extra directives would be added to every inline assembly block when emitting assembly, which we'd like to avoid. The majority of the changes in the build-attributes.ll test are just re-ordering the directives, because the hardware attributes are now emitted before the ABI ones. However, I did fix one bug which I spotted: Tag_CPU_arch_profile was not being emitted for v6M. Differential revision: https://reviews.llvm.org/D31812 llvm-svn: 300547	2017-04-18 12:52:35 +00:00
Diana Picus	3f9333697f	[ARM] GlobalISel: Add support for G_SUB Support G_SUB throughout the GlobalISel pipeline. It is exactly the same as G_ADD, nothing fancy. llvm-svn: 300546	2017-04-18 12:35:28 +00:00
Andrea Di Biagio	66b10b7920	[SampleProfile] Don't assert when printing the DebugLoc of a branch. NFC. llvm-svn: 300544	2017-04-18 11:27:58 +00:00
Andrea Di Biagio	de6c1f4aaf	[SampleProfile] Skip intrinsic calls when visiting callsites in InlineHotFunctions. Before this patch, we always called method 'findCalleeFunctionSamples()' on intrinsic calls. However, intrinsic calls like llvm.dbg.value() are not viable candidates for obvious reasons. No functional change intended. Differential Revision: https://reviews.llvm.org/D32008 llvm-svn: 300541	2017-04-18 10:08:53 +00:00
Kristof Beyls	1c3a629c41	Revert "[GlobalISel] Support vector-of-pointers in LLT" This reverts r300535 and r300537. The newly added tests in test/CodeGen/AArch64/GlobalISel/arm64-fallback.ll produces slightly different code between LLVM versions being built with different compilers. E.g., dependent on the compiler LLVM is built with, either one of the following can be produced: remark: <unknown>:0:0: unable to legalize instruction: %vreg0<def>(p0) = G_EXTRACT_VECTOR_ELT %vreg1, %vreg2; (in function: vector_of_pointers_extractelement) remark: <unknown>:0:0: unable to legalize instruction: %vreg2<def>(p0) = G_EXTRACT_VECTOR_ELT %vreg1, %vreg0; (in function: vector_of_pointers_extractelement) Non-determinism like this is clearly a bad thing, so reverting this until I can find and fix the root cause of the non-determinism. llvm-svn: 300538	2017-04-18 09:26:36 +00:00
Kristof Beyls	e329f3f7e1	Fix gcc build after r300535. llvm-svn: 300537	2017-04-18 08:47:55 +00:00
Diana Picus	93efa79fb7	[ARM] Check for correct HW div when lowering divmod For subtargets that use the custom lowering for divmod, e.g. gnueabi, we used to check if the subtarget has hardware divide and then lower to a div-mul-sub sequence if true, or to a libcall if false. However, judging by the usage of hasDivide vs hasDivideInARMMode, it seems that hasDivide only refers to Thumb. For instance, in the ARMTargetLowering constructor, the code that specifies whether to use libcalls for (S\|U)DIV looks like this: bool hasDivide = Subtarget->isThumb() ? Subtarget->hasDivide() : Subtarget->hasDivideInARMMode(); In the case of divmod for arm-gnueabi, using only hasDivide() to determine what to do means that instead of lowering to __aeabi_idivmod to get the remainder, we lower to div-mul-sub and then further lower the div to __aeabi_idiv. Even worse, if we have hardware divide in ARM but not in Thumb, we generate a libcall instead of using it (this is not an issue in practice since AFAICT none of the cores that we support have hardware divide in ARM but not Thumb). This patch fixes the code dealing with custom lowering to take into account the mode (Thumb or ARM) when deciding whether or not hardware division is available. Differential Revision: https://reviews.llvm.org/D32005 llvm-svn: 300536	2017-04-18 08:32:27 +00:00
Kristof Beyls	507db3264b	[GlobalISel] Support vector-of-pointers in LLT This fixes PR32471. As comment 10 on that bug report highlights (https://bugs.llvm.org//show_bug.cgi?id=32471#c10), there are quite a few different defendable design tradeoffs that could be made, including not representing pointers at all in LLT. I decided to go for representing vector-of-pointer as a concept in LLT, while keeping the size of the LLT type 64 bits (this is an increase from 48 bits before). My rationale for keeping pointers explicit is that on some targets probably it's very handy to have the distinction between pointer and non-pointer (e.g. 68K has a different register bank for pointers IIRC). If we keep a scalar pointer, it probably is easiest to also have a vector-of-pointers to keep LLT relatively conceptually clean and orthogonal, while we don't have a very strong reason to break that orthogonality. Once we gain more experience on the use of LLT, we can of course reconsider this direction. Rejecting vector-of-pointer types in the IRTranslator is also an option to avoid the crash reported in PR32471, but that is only a very short-term solution; also needs quite a bit of code tweaks in places, and is probably fragile. Therefore I didn't consider this the best option. llvm-svn: 300535	2017-04-18 08:12:45 +00:00
Leslie Zhai	1c03a501b8	test commit llvm-svn: 300532	2017-04-18 07:28:54 +00:00
Craig Topper	044fb2ba9d	[APInt] Cleanup the reverseBits slow case a little. Use lshrInPlace. Use single bit extract and operator\|=(uint64_t) to avoid a few temporary APInts. llvm-svn: 300527	2017-04-18 05:02:21 +00:00
Craig Topper	b0d5814eef	[APInt] Make operator<<= shift in place. Improve the implementation of tcShiftLeft and use it to implement operator<<=. llvm-svn: 300526	2017-04-18 04:39:48 +00:00
Adrian Prantl	f625c157a1	PR32382: Fix emitting complex DWARF expressions. The DWARF specification knows 3 kinds of non-empty simple location descriptions: 1. Register location descriptions - describe a variable in a register - consist of only a DW_OP_reg 2. Memory location descriptions - describe the address of a variable 3. Implicit location descriptions - describe the value of a variable - end with DW_OP_stack_value & friends The existing DwarfExpression code is pretty much ignorant of these restrictions. This used to not matter because we only emitted very short expressions that we happened to get right by accident. This patch makes DwarfExpression aware of the rules defined by the DWARF standard and now chooses the right kind of location description for each expression being emitted. This would have been an NFC commit (for the existing testsuite) if not for the way that clang describes captured block variables. Based on how the previous code in LLVM emitted locations, DW_OP_deref operations that should have come at the end of the expression are put at its beginning. Fixing this means changing the semantics of DIExpression, so this patch bumps the version number of DIExpression and implements a bitcode upgrade. There are two major changes in this patch: I had to fix the semantics of dbg.declare for describing function arguments. After this patch a dbg.declare always takes the address of a variable as the first argument, even if the argument is not an alloca. When lowering a DBG_VALUE, the decision of whether to emit a register location description or a memory location description depends on the MachineLocation — register machine locations may get promoted to memory locations based on their DIExpression. (Future) optimization passes that want to salvage implicit debug location for variables may do so by appending a DW_OP_stack_value. For example: DBG_VALUE, [RBP-8] --> DW_OP_fbreg -8 DBG_VALUE, RAX --> DW_OP_reg0 +0 DBG_VALUE, RAX, DIExpression(DW_OP_deref) --> DW_OP_reg0 +0 All testcases that were modified were regenerated from clang. I also added source-based testcases for each of these to the debuginfo-tests repository over the last week to make sure that no synchronized bugs slip in. The debuginfo-tests compile from source and run the debugger. https://bugs.llvm.org/show_bug.cgi?id=32382 <rdar://problem/31205000> Differential Revision: https://reviews.llvm.org/D31439 llvm-svn: 300522	2017-04-18 01:21:53 +00:00
George Burgess IV	fec4b3628a	Add const to a const method. NFC llvm-svn: 300520	2017-04-18 01:04:05 +00:00

1 2 3 4 5 ...

147734 Commits