llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Mehdi Amini	679ff92d07	[ADT] Zip range adapter This augments the STLExtras toolset with a zip iterator and range adapter. Zip comes in two varieties: `zip`, which will zip to the shortest of the input ranges, and `zip_first`, which limits its `begin() == end()` checks to just the first krange. Patch by: Bryant Wong <github.com/bryant> Differential Revision: https://reviews.llvm.org/D23252 llvm-svn: 284035	2016-10-12 19:43:02 +00:00
Matt Arsenault	fb5c675468	AMDGPU: Initial implementation of VGPR indexing mode This is the most basic handling of the indirect access pseudos using GPR indexing mode. This currently only enables the mode for a single v_mov_b32 and then disables it. This is much more complicated to use than the movrel instructions, so a new optimization pass is probably needed to fold the access into the uses and keep the mode enabled for them. llvm-svn: 284031	2016-10-12 18:49:05 +00:00
Teresa Johnson	de773ccc50	[ThinLTO] Don't link module level assembly when importing Module inline asm was always being linked/concatenated when running the IRLinker. This is correct for full LTO but not when we are importing for ThinLTO, as it can result in multiply defined symbols when the module asm defines a global symbol. In order to test with llvm-lto2, I had to work around PR30396, where a symbol that is defined in module assembly but defined in the LLVM IR appears twice. Added workaround to llvm-lto2 with a FIXME. Fixes PR30610. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25359 llvm-svn: 284030	2016-10-12 18:39:29 +00:00
Sanjoy Das	54e3386b64	[SimplifyCFG] Don't create PHI nodes for constant bundle operands Summary: Constant bundle operands may need to retain their constant-ness for correctness. I'll admit that this is slightly odd, but it looks like SimplifyCFG already does this for things like @llvm.frameaddress and @llvm.stackmap, so I suppose adding one more case is not a big deal. It is possible to add a mechanism to denote bundle operands that need to remain constants, but that's probably too complicated for the time being. Reviewers: jmolloy Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D25502 llvm-svn: 284028	2016-10-12 18:15:33 +00:00
Matt Arsenault	cb0c02c980	AMDGPU: Add instruction definitions for VGPR indexing VI added a second method of indexing into VGPRs besides using v_movrel* llvm-svn: 284027	2016-10-12 18:00:51 +00:00
Zvi Rackover	75900dac59	[X86] Add the v4i32 flavor test-case for pr30371 llvm-svn: 284025	2016-10-12 17:06:30 +00:00
Tom Stellard	52a5870f77	AMDGPU/SI: Change mimg intrinsic signatures This makes more fields overridable and removes redundant bits. Patch by: Changpeng Fang llvm-svn: 284024	2016-10-12 16:35:29 +00:00
Artur Pilipenko	6245b73221	[ValueTracking] An improvement to IR ValueTracking on Non-negative Integers Since this change is known to cause performance degradations in some cases it's commited under a temporary flag which is turned off by default. Patch by Li Huang Differential Revision: https://reviews.llvm.org/D18777 llvm-svn: 284022	2016-10-12 16:18:43 +00:00
Matt Arsenault	f80a9a7346	BranchRelaxation: Unique live ins when creating block llvm-svn: 284018	2016-10-12 15:32:04 +00:00
Nirav Dave	81454c748f	[MC] Fix Error Location for ParseIdentifier Prevent partial parsing of '$' or '@' of invalid identifiers and fixup workaround points. NFC Intended. llvm-svn: 284017	2016-10-12 13:58:07 +00:00
Simon Pilgrim	da31facd2c	[DAGCombiner] Update most ADD combines to support general vector combines Add a number of helper functions to match scalar or vector equivalent constant/splat values to allow most of the combine patterns to be used by vectors. Differential Revision: https://reviews.llvm.org/D25374 llvm-svn: 284015	2016-10-12 13:48:10 +00:00
Konstantin Zhuravlyov	da8ee8e7dc	[DAGCombiner] Do not remove the load of stored values when optimizations are disabled This combiner breaks debug experience and should not be run when optimizations are disabled. For example: int main() { int j = 0; j += 2; if (j == 2) return 0; return 5; } When debugging this code compiled in /O0, it should be valid to break at line "j+=2;" and edit the value of j. It should change the return value of the function. Differential Revision: https://reviews.llvm.org/D19268 llvm-svn: 284014	2016-10-12 13:44:24 +00:00
Chad Rosier	3984bf527a	[CVP] Convert an AShr to a LShr if 1st operand is known to be nonnegative. An arithmetic shift can be safely changed to a logical shift if the first operand is known positive. This allows ComputeKnownBits (and similar analysis) to determine the sign bit of the shifted value in some cases. In turn, this allows InstCombine to canonicalize a signed comparison (a > 0) into an equality check (a != 0). PR30577 Differential Revision: https://reviews.llvm.org/D25119 llvm-svn: 284013	2016-10-12 13:41:38 +00:00
Alexey Bataev	e0af07de49	NFC: The Cost Model specialization, by Andrey Tischenko The current Cost Model implementation is very inaccurate and has to be updated, improved, re-implemented to be able to take into account the concrete CPU models and the concrete targets where this Cost Model is being used. For example, the Latency Cost Model should be differ from Code Size Cost Model, etc. This patch is the first step to launch the developing and implementation of a new Cost Model generation. Differential Revision: https://reviews.llvm.org/D25186 llvm-svn: 284012	2016-10-12 13:24:13 +00:00
Simon Pilgrim	72f99d6988	[InstCombine] Fix constexpr issue in select combining As discussed by Andrea on PR30486, we have an unsafe cast to an Instruction type in the select combine which doesn't take into account that it could be a ConstantExpr instead. Differential Revision: https://reviews.llvm.org/D25466 llvm-svn: 284000	2016-10-12 10:20:15 +00:00
Alex Lorenz	9730f4a915	[Support][CommandLine] Display subcommands in help when there are less than 3 subcommands This commit fixes a bug where the help output doesn't display subcommands when a tool has less than 3 subcommands. This change doesn't include a corresponding unittest as there is no viable way to provide a unittest for it. Differential Revision: https://reviews.llvm.org/D25463 llvm-svn: 283998	2016-10-12 10:04:35 +00:00
George Rimar	0bb5c6ff74	[Support/ELF] - Sort PT_OPENBSD_* added previously. NFC. llvm-svn: 283992	2016-10-12 09:20:28 +00:00
Diana Picus	2f31822cdb	Add AArch64 unit tests Add unit tests for checking a few tricky instruction sizes. Also remove the old tests for the instruction sizes, which were clunky and brittle. Since this is the first set of target-specific unit tests, we need to add some CMake plumbing. In the future, adding unit tests for a given target will be as simple as creating a directory with the same name as the target under unittests/Target. The tests are only run if the target is enabled in LLVM_TARGETS_TO_BUILD. Differential Revision: https://reviews.llvm.org/D24548 llvm-svn: 283990	2016-10-12 09:00:44 +00:00
Chandler Carruth	bfe535b7fd	[LCG] Cleanup various places where comments said `SCC` but meant `RefSCC`. Also improve the comments surrounding the lazy post-order iterator as they had grown stale since the RefSCC/SCC split. I'm sure there are more comments that need updating here, but I saw and fixed these and didn't want to lose them. I've not gotten to doing a really complete audit of every comment yet. llvm-svn: 283987	2016-10-12 08:40:51 +00:00
Chandler Carruth	aa9da3439e	[LCG] Add the necessary functionality to the LazyCallGraph to support inlining. The basic inlining operation makes the following changes to the call graph: 1) Add edges that were previously transitive edges. This is always trivial and this patch gives the LCG helper methods to make this more convenient. 2) Remove the inlined edge. We had existing support for this, but it contained bugs that needed to be fixed. Testing in the same pattern as the inliner exposes these bugs very nicely. 3) Delete a function when it becomes dead because it is internal and all calls have been inlined. The LCG had no support at all for this operation, so this adds that support. Two unittests have been added that exercise this specific mutation pattern to the call graph. They were extremely effective in uncovering bugs. Sadly, a large fraction of the code here is just to implement those unit tests, but I think they're paying for themselves. =] This was split out of a patch that actually uses the routines to implement inlining in the new pass manager in order to isolate (with unit tests) the logic that was entirely within the LCG. Many thanks for the careful review from folks! There will be a few minor follow-up patches based on the comments in the review as well. Differential Revision: https://reviews.llvm.org/D24225 llvm-svn: 283982	2016-10-12 07:59:56 +00:00
Daniel Jasper	1efc6eadb0	Revert "[libFuzzer] refactoring to speed things up, NFC" This reverts commit r283946. This breaks when build with GCC: lib/Fuzzer/FuzzerTracePC.cpp:169:6: error: always_inline function might not be inlinable [-Werror=attributes] lib/Fuzzer/FuzzerTracePC.cpp:169:6: error: inlining failed in call to always_inline 'void fuzzer::TracePC::HandleCmp(void*, T, T) [with T = long unsigned int]': target specific option mismatch lib/Fuzzer/FuzzerTracePC.cpp:198:65: error: called from here llvm-svn: 283979	2016-10-12 07:26:46 +00:00
Quentin Colombet	224e76f6f2	[AArch64][InstructionSelector] Fix unintended test changes in r283973. I screwed up my merge conflict and lost some of the CHECK lines. llvm-svn: 283974	2016-10-12 04:12:44 +00:00
Quentin Colombet	85ea9448d6	[AArch64][InstrustionSelector] Teach the selector about G_BITCAST. llvm-svn: 283973	2016-10-12 03:57:52 +00:00
Quentin Colombet	442f16b444	[AArch64][InstructionSelector] Refactor the handling of copies. Although Copies are not specific to preISel, we still have to assign them a proper register class. However, given they are not constrained to anything we do not have to handle the source register at the copy. It will be properly mapped when reaching the related definition. In the process, the handlong of G_ANYEXT is slightly modified as those end up being selected as copy. The difference is that when register size do not match on both sides, we need to insert SUBREG_TO_REG operation, otherwise the post RA copy expansion will not be happy! llvm-svn: 283972	2016-10-12 03:57:49 +00:00
Quentin Colombet	41c0e46bf3	[AArch64][InstructionSelector] Fix typos in the related mir file. NFC. llvm-svn: 283971	2016-10-12 03:57:46 +00:00
Quentin Colombet	4d726ae788	[AArch64][MachineLegalizer] Mark more bitcasts as legal. Those are copies, we do not have to do any legalization action for them. llvm-svn: 283970	2016-10-12 03:57:43 +00:00
Brian Gesiak	eab338b7d5	[lit] Run unit tests as part of lit test suite Summary: The Python file `utils/lit/lit/ShUtil.py` contains: 1. Logic used by lit itself 2. A set of unit tests for that logic, which can be run by invoking `python utils/lit/lit/ShUtil.py` Move these unit tests to a `tests/unit` subdirectory of lit, and run the tests as part of lit's test suite. This ensures that, should the lit test suite be included in LLVM's own regression test suite, these unit tests will also be run. (Instructions on how to run lit's test suite can be found in `utils/lit/README.txt`.) Reviewers: ddunbar, echristo, delcypher, beanz Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D25411 llvm-svn: 283968	2016-10-12 03:35:04 +00:00
Sebastian Pop	5871a933a8	Memory-SSA cleanup of clobbers interface, NFC This implements the cleanup that Danny asked to commit separately from the previous fix to GVN-hoist in https://reviews.llvm.org/D25476#inline-219818 Tested with ninja check on x86_64-linux. llvm-svn: 283967	2016-10-12 03:08:40 +00:00
Sebastian Pop	fdf5952343	GVN-hoist: fix store past load dependence analysis (PR30216, PR30499) This is a refreshed version of a patch that was reverted: it fixes the problems reported in both PR30216 and PR30499, and contains all the test-cases from both bugs. To hoist stores past loads, we used to search for potential conflicting loads on the hoisting path by following a MemorySSA def-def link from the store to be hoisted to the previous defining memory access, and from there we followed the def-use chains to all the uses that occur on the hoisting path. The problem is that the def-def link may point to a store that does not alias with the store to be hoisted, and so the loads that are walked may not alias with the store to be hoisted, and even as in the testcase of PR30216, the loads that may alias with the store to be hoisted are not visited. The current patch visits all loads on the path from the store to be hoisted to the hoisting position and uses the alias analysis to ask whether the store may alias the load. I was not able to use the MemorySSA functionality to ask for whether load and store are clobbered: I'm not sure which function to call, so I used a call to AA->isNoAlias(). Store past store is still working as before using a MemorySSA query: I added an extra test to pr30216.ll to make sure store past store does not regress. Tested on x86_64-linux with check and a test-suite run. Differential Revision: https://reviews.llvm.org/D25476 llvm-svn: 283965	2016-10-12 02:23:39 +00:00
Tim Shen	97a4f6a6e9	[PPCMIPeephole] Fix splat elimination Summary: In PPCMIPeephole, when we see two splat instructions, we can't simply do the following transformation: B = Splat A C = Splat B => C = Splat A because B may still be used between these two instructions. Instead, we should make the second Splat a PPC::COPY and let later passes decide whether to remove it or not: B = Splat A C = Splat B => B = Splat A C = COPY B Fixes PR30663. Reviewers: echristo, iteratee, kbarton, nemanjai Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D25493 llvm-svn: 283961	2016-10-12 00:48:25 +00:00
Reid Kleckner	a7c7d8e498	Fix the stage2 MSVC 2013 build with less constexpr in RNG llvm-svn: 283954	2016-10-11 23:02:21 +00:00
Michael Kuperstein	0d331811ac	[DAG] Fix crash in build_vector -> vector_shuffle combine Fixes a crash in the build_vector -> vector_shuffle combine when the first vector input is twice as wide as the output, and the second input vector is even wider. llvm-svn: 283953	2016-10-11 22:44:31 +00:00
Tim Northover	dfe83991c2	GlobalISel: support same-size casts on AArch64. Mostly Ahmed's work again, I'm just sprucing things up slightly before committing. llvm-svn: 283952	2016-10-11 22:29:23 +00:00
Vedant Kumar	77ee1e1b0e	[InstrProf] Add support for dead_strip+live_support functionality On Darwin, marking a section as "regular,live_support" means that a symbol in the section should only be kept live if it has a reference to something that is live. Otherwise, the linker is free to dead-strip it. Turn this functionality on for the __llvm_prf_data section. This means that counters and data associated with dead functions will be removed from dead-stripped binaries. This will result in smaller profiles and binaries, and should speed up profile collection. Tested with check-profile, llvm-lit test/tools/llvm-{cov,profdata}, and check-llvm. Differential Revision: https://reviews.llvm.org/D25456 llvm-svn: 283947	2016-10-11 21:48:16 +00:00
Kostya Serebryany	77bcf6126c	[libFuzzer] refactoring to speed things up, NFC llvm-svn: 283946	2016-10-11 21:27:37 +00:00
Reid Kleckner	f071744d81	Re-land "[Thumb] Save/restore high registers in Thumb1 pro/epilogues" Reverts r283938 to reinstate r283867 with a fix. The original change had an ArrayRef referring to a destroyed temporary initializer list. Use plain C arrays instead. llvm-svn: 283942	2016-10-11 21:14:03 +00:00
Kevin Enderby	78b9ef248b	Next set of additional error checks for invalid Mach-O files for the load commands that uses the MachO::linker_option_command type but not used in llvm libObject code but used in llvm tool code. This includes just LC_LINKER_OPTION load command. llvm-svn: 283939	2016-10-11 21:04:39 +00:00
Reid Kleckner	a5f5d77747	Revert "[Thumb] Save/restore high registers in Thumb1 pro/epilogues" This reverts r283867. This appears to be an infinite loop: while (HiRegToSave != AllHighRegs.end() && CopyReg != AllCopyRegs.end()) { if (HiRegsToSave.count(*HiRegToSave)) { ... CopyReg = findNextOrderedReg(++CopyReg, CopyRegs, AllCopyRegs.end()); HiRegToSave = findNextOrderedReg(++HiRegToSave, HiRegsToSave, AllHighRegs.end()); } } llvm-svn: 283938	2016-10-11 20:54:41 +00:00
Tim Northover	f238c4b92c	GlobalISel: support selection of extend operations. Patch mostly by Ahmed Bougaca. llvm-svn: 283937	2016-10-11 20:50:21 +00:00
Tim Northover	573697e8ef	MIRParser: allow types on registers with a RegBank. This fixes some GlobalISel regression tests. llvm-svn: 283936	2016-10-11 20:50:04 +00:00
Jordan Rose	201cd0c255	Re-apply "Disallow ArrayRef assignment from temporaries." This re-applies r283798, disabled in r283803, with the static_assert tests disabled under MSVC. The deleted functions still seem to catch mistakes in MSVC, so it's not a significant loss. Part of rdar://problem/16375365 llvm-svn: 283935	2016-10-11 20:39:16 +00:00
Kyle Butt	abb4b9d9ee	Codegen: Tail-duplicate during placement. The tail duplication pass uses an assumed layout when making duplication decisions. This is fine, but passes up duplication opportunities that may arise when blocks are outlined. Because we want the updated CFG to affect subsequent placement decisions, this change must occur during placement. In order to achieve this goal, TailDuplicationPass is split into a utility class, TailDuplicator, and the pass itself. The pass delegates nearly everything to the TailDuplicator object, except for looping over the blocks in a function. This allows the same code to be used for tail duplication in both places. This change, in concert with outlining optional branches, allows triangle shaped code to perform much better, esepecially when the taken/untaken branches are correlated, as it creates a second spine when the tests are small enough. Issue from previous rollback fixed, and a new test was added for that case as well. Issue was worklist/scheduling/taildup issue in layout. Issue from 2nd rollback fixed, with 2 additional tests. Issue was tail merging/loop info/tail-duplication causing issue with loops that share a header block. Issue with early tail-duplication of blocks that branch to a fallthrough predecessor fixed with test case: tail-dup-branch-to-fallthrough.ll Differential revision: https://reviews.llvm.org/D18226 llvm-svn: 283934	2016-10-11 20:36:43 +00:00
Sanjay Patel	0a908a5fae	[x86] add tests for negate bool llvm-svn: 283930	2016-10-11 20:15:20 +00:00
Reid Kleckner	01e7d754c4	Avoid braced initialization for default member initializers for MSVC 2013 llvm-svn: 283928	2016-10-11 20:02:57 +00:00
Arnold Schwaighofer	7430f50b4d	Silence -Wunused-but-set-variable warning llvm-svn: 283927	2016-10-11 19:49:29 +00:00
Rui Ueyama	35bd62db88	Re-submit r283823: Define DbiStreamBuilder::addDbgStream to add stream. The previous commit was failing because we filled empty slots of the debug stream index with kInvalidStreamIndex. It should've been 0. llvm-svn: 283925	2016-10-11 19:43:12 +00:00
Kostya Serebryany	cb7566ce29	[sanitizer-coverage] use private linkage for coverage guards, delete old commented-out code. llvm-svn: 283924	2016-10-11 19:36:50 +00:00
Rui Ueyama	7d7380c0c5	Fix build error on LP64 platforms. llvm-svn: 283922	2016-10-11 19:28:56 +00:00
Zachary Turner	75f19d6b26	[raw_ostream] Raise some helper functions out of raw_ostream. Low level functionality to format numbers were embedded in the implementation of raw_ostream. I have need to use these through an interface other than the overloaded stream operators, so they need to be raised to a level that they can be used from either raw_ostream operators or other code. llvm-svn: 283921	2016-10-11 19:24:45 +00:00
Konstantin Zhuravlyov	277eacdebe	[AMDGPU] Refactor waitcnt encoding - Refactor bit packing/unpacking - Calculate bit mask given bit shift and bit width - Introduce function for decoding bits of waitcnt - Introduce function for encoding bits of waitcnt - Introduce function for getting waitcnt mask (instead of using bare numbers) - Introduce function fot getting max waitcnt(s) (instead of using bare numbers) Differential Revision: https://reviews.llvm.org/D25298 llvm-svn: 283919	2016-10-11 18:58:22 +00:00

1 2 3 4 5 ...

139306 Commits