llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Chandler Carruth	a780975d83	[x86] Switch to using the variable we extracted this operand into. Spotted this missed refactoring by inspection when reading code, and it doesn't changethe functionality at all. llvm-svn: 214627	2014-08-02 10:29:36 +00:00
Chandler Carruth	9b3e1eb850	[x86] Fix a few typos in my comments spotted in passing. llvm-svn: 214626	2014-08-02 10:29:34 +00:00
Chandler Carruth	6875b68d8c	[x86] Teach the target shuffle mask extraction to recognize unary forms of normally binary shuffle instructions like PUNPCKL and MOVLHPS. This detects cases where a single register is used for both operands making the shuffle behave in a unary way. We detect this and adjust the mask to use the unary form which allows the existing DAG combine for shuffle instructions to actually work at all. As a consequence, this uncovered a number of obvious bugs in the existing DAG combine which are fixed. It also now canonicalizes several shuffles even with the existing lowering. These typically are trying to match the shuffle to the domain of the input where before we only really modeled them with the floating point variants. All of the cases which change to an integer shuffle here have something in the integer domain, so there are no more or fewer domain crosses here AFAICT. Technically, it might be better to go from a GPR directly to the floating point domain, but detecting floating point outputs despite integer inputs is a lot more code and seems unlikely to be worthwhile in practice. If folks are seeing domain-crossing regressions here though, let me know and I can hack something up to fix it. Also as a consequence, a bunch of missed opportunities to form pshufb now can be formed. Notably, splats of i8s now form pshufb. Interestingly, this improves the existing splat lowering too. We go from 3 instructions to 1. Yes, we may tie up a register, but it seems very likely to be worth it, especially if splatting the 0th byte (the common case) as then we can use a zeroed register as the mask. llvm-svn: 214625	2014-08-02 10:27:38 +00:00
Chandler Carruth	746864d542	[x86] Teach my pshufb comment printer to handle VPSHUFB forms as well as PSHUFB forms. This will be important to update some AVX tests when I add PSHUFB combining. llvm-svn: 214624	2014-08-02 10:08:17 +00:00
Chandler Carruth	f8f86e7b92	[SDAG] Refactor the code which deletes nodes in the DAG combiner to do so using a single helper which adds operands back onto the worklist. Several places didn't rigorously do this but a couple already did. Factoring them together and doing it rigorously is important to delete things recursively early on in the combiner and get a chance to see accurate hasOneUse values. While no existing test cases change, an upcoming patch to add DAG combining logic for PSHUFB requires this to work correctly. llvm-svn: 214623	2014-08-02 10:02:07 +00:00
Owen Anderson	6c991acdcf	Fix issues with ISD::FNEG and ISD::FMA SDNodes where they would not be constant-folded during DAGCombine in certain circumstances. Unfortunately, the circumstances required to trigger the issue seem to require a pretty specific interaction of DAGCombines, and I haven't been able to find a testcase that reproduces on X86, ARM, or AArch64. The functionality added here is replicated in essentially every other DAG combine, so it seems pretty obviously correct. llvm-svn: 214622	2014-08-02 08:45:33 +00:00
Justin Bogner	af51eedfd7	CodeGen: Remove commented out code These two lines have been commented out for over 4 years. They aren't helping anyone. llvm-svn: 214615	2014-08-02 06:47:07 +00:00
Akira Hatanaka	5a2758bfe7	[ARM] In dynamic-no-pic mode, ARM's post-RA pseudo expansion was incorrectly expanding pseudo LOAD_STATCK_GUARD using instructions that are normally used in pic mode. This patch fixes the bug. <rdar://problem/17886592> llvm-svn: 214614	2014-08-02 05:40:40 +00:00
Lang Hames	f2bb6bf8f0	[MCJIT] Fix an overly-aggressive check in RuntimeDyldMachOARM. This should fix the MachO_ARM_PIC_relocations.s test failures on some 32-bit testers. llvm-svn: 214613	2014-08-02 03:00:49 +00:00
Matt Arsenault	7ac48e4d02	R600: Cleanup fneg tests llvm-svn: 214612	2014-08-02 02:26:51 +00:00
Michael Gottesman	ebdb0cf646	Add a small utility called bisect that enables commandline bisecting on a counter. This is something that I have found to be very useful in my work and I wanted to contribute it back to the community since several people in the past have asked me for something along these lines. (Jakob, I know this has been a while coming ; )] The way you use this is you create a script that takes in as its first argument a count. The script passes into LLVM the count via a command line flag that disables a pass after LLVM has run after the pass has run for count number of times. Then the script invokes a test of some sort and indicates whether LLVM successfully compiled the test via the scripts exit status. Then you invoke bisect as follows: bisect --start=<start_num> --end=<end_num> ./script.sh "%(count)s" And bisect will continually call ./script.sh with various counts using the exit status to determine success and failure. llvm-svn: 214610	2014-08-02 01:39:08 +00:00
Eric Fiselier	644e75522f	[lit] Add --show-xfail flag to LIT. Summary: This patch add a --show-xfail flag. If this flag is specified then each xfail test will be printed to output. When it is not given xfail tests are ignored. Ignoring xfail tests is the current behavior. This flag is meant to mirror the --show-unsupported flag that was recently added. Reviewers: ddunbar, EricWF Reviewed By: EricWF Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4750 llvm-svn: 214609	2014-08-02 01:29:52 +00:00
Matt Arsenault	ea70093fdd	R600/SI: Fix formatting. Avoid weird line wrapping of BuildMI dest register. llvm-svn: 214608	2014-08-02 01:10:28 +00:00
Chandler Carruth	09bc492376	[x86] Make some questionable tests not spew assembly to stdout, which makes a mess of the lit output when they ultimately fail. The 2012-10-02-DAGCycle test is really frustrating because the only explanation for what it is testing is a rdar link. I would really rather that rdar links (which are not public or part of the open source project) were not committed to the source code. Regardless, the actual problem must be described as the rdar link is completely opaque. The fact that this test didn't check for any particular output further exacerbates the inability of any other developer to debug failures. The mem-promote-integers test has nice comments and seems to be a great test for our lowering... except that we don't actually check that any of the generated code is correct or matches some pattern. We just avoid crashing. It would be great to go back and populate this test with the actual expectations. llvm-svn: 214605	2014-08-02 00:50:10 +00:00
Alexey Samsonov	a08644f560	[ASan] Use metadata to pass source-level information from Clang to ASan. Instead of creating global variables for source locations and global names, just create metadata nodes and strings. They will be transformed into actual globals in the instrumentation pass (if necessary). This approach is more flexible: 1) we don't have to ensure that our custom globals survive all the optimizations 2) if globals are discarded for some reason, we will simply ignore metadata for them and won't have to erase corresponding globals 3) metadata for source locations can be reused for other purposes: e.g. we may attach source location metadata to alloca instructions and provide better descriptions for stack variables in ASan error reports. No functionality change. llvm-svn: 214604	2014-08-02 00:35:50 +00:00
Chandler Carruth	70ba0f5168	[SDAG] Allow the legalizer to delete an illegally typed intermediate introduced during legalization. This pattern is based on other patterns in the legalizer that I changed in the same way. Now, the legalizer eagerly collects its garbage when necessary so that we can survive leaving such nodes around for it. Instead, we add an assert to make sure the node will be correctly handled by that layer. llvm-svn: 214602	2014-08-02 00:24:54 +00:00
Chandler Carruth	d58d700ebe	[SDAG] Let the DAG combiner take care of dead nodes rather than manually deleting them. This already seems to work, as no tests fail without this. llvm-svn: 214601	2014-08-02 00:19:10 +00:00
Tyler Nowicki	0eb1e96567	Add diagnostics to the vectorizer cost model. When the cost model determines vectorization is not possible/profitable these remarks print an analysis of that decision. Note that in selectVectorizationFactor() we can assume that OptForSize and ForceVectorization are mutually exclusive. Reviewed by Arnold Schwaighofer llvm-svn: 214599	2014-08-02 00:14:03 +00:00
NAKAMURA Takumi	2cf0524ac0	BitcodeTests: Fix LINK_COMPONENTS. llvm-svn: 214598	2014-08-02 00:12:54 +00:00
Duncan P. N. Exon Smith	0fe753a70b	verify-uselistorder: Reverse use-lists at every verification Updated `verify-uselistorder` to more than double the number of use-list orders it checks. - Every time it verifies an order, it then reverses the order and verifies again. - It now verifies the initial order, before running any shuffles. Changed the default to `-num-shuffles=1`, since this is already four checks, and after r214584 shuffling is guaranteed to make a new order. This is part of PR5680. llvm-svn: 214596	2014-08-01 23:49:41 +00:00
Duncan P. N. Exon Smith	acd5f9dd8a	verify-uselistorder: Add missing `static` llvm-svn: 214595	2014-08-01 23:31:13 +00:00
Duncan P. N. Exon Smith	1780c7d423	IR: Add Value::reverseUseList() I'm going to use this to improve `verify-uselistorder`. Part of PR5680. llvm-svn: 214594	2014-08-01 23:28:49 +00:00
Peter Collingbourne	ca6ae41998	PartiallyInlineLibCalls: Check sqrt result type before transforming it. Some configure scripts declare this with the wrong prototype, which can lead to an assertion failure. llvm-svn: 214593	2014-08-01 23:21:21 +00:00
Duncan P. N. Exon Smith	3116b2db7f	verify-uselistorder: Move shuffleUseLists() out of lib/IR `shuffleUseLists()` is only used in `verify-uselistorder`, so move it there to avoid bloating other executables. As a drive-by, update some of the header docs. This is part of PR5680. llvm-svn: 214592	2014-08-01 23:03:36 +00:00
Adrian Prantl	187cbf3b4e	Cleanup this test some more. llvm-svn: 214591	2014-08-01 23:01:32 +00:00
Adrian Prantl	0dd010e548	Add the missing target triple to this testcase. llvm-svn: 214590	2014-08-01 23:01:30 +00:00
Adrian Prantl	61248f44e7	Attempt to increase the overall happiness of the MSCV-based buildbots. llvm-svn: 214588	2014-08-01 22:56:10 +00:00
Duncan P. N. Exon Smith	6c93809cf2	verify-uselistorder: Make the verification logic easier to reuse llvm-svn: 214587	2014-08-01 22:52:06 +00:00
Justin Bogner	48a07cae49	InstrProf: Allow multiple functions with the same name This updates the instrumentation based profiling format so that when we have multiple functions with the same name (but different function hashes) we keep all of them instead of rejecting the later ones. There are a number of scenarios where this can come up where it's more useful to keep multiple function profiles: * Name collisions in unrelated libraries that are profiled together. * Multiple "main" functions from multiple tools built against a common library. * Combining profiles from different build configurations (ie, asserts and no-asserts) The profile format now stores the number of counters between the hash and the counts themselves, so that multiple sets of counts can be stored. Since this is backwards incompatible, I've bumped the format version and added some trivial logic to skip this when reading the old format. llvm-svn: 214585	2014-08-01 22:50:07 +00:00
Duncan P. N. Exon Smith	2097a3a4ce	UseListOrder: Guarantee that shuffles change use-list order Change shuffleUseLists() always to change use-list order by rejecting orders that have no changes. This is part of PR5680. llvm-svn: 214584	2014-08-01 22:50:04 +00:00
Duncan P. N. Exon Smith	03001780da	UseListOrder: Fix blockaddress use-list order `parseBitcodeFile()` uses the generic `getLazyBitcodeFile()` function as a helper. Since `parseBitcodeFile()` isn't actually lazy -- it calls `MaterializeAllPermanently()` -- bypass the unnecessary call to `materializeForwardReferencedFunctions()` by extracting out a common helper function. This removes the last of the use-list churn caused by blockaddresses. This highlights that we can't reproduce use-list order of globals and constants when parsing lazily -- but that's necessarily out of scope. When we're parsing lazily, we never have all the functions in memory, so the use-lists of globals (and constants that reference globals) are always incomplete. This is part of PR5680. llvm-svn: 214581	2014-08-01 22:27:19 +00:00
Akira Hatanaka	2cf112b51e	[X86] Simplify X87 stackifier pass. Stop using ST registers for function returns and inline-asm instructions and use FP registers instead. This allows removing a large amount of code in the stackifier pass that was needed to track register liveness and handle copies between ST and FP registers and function calls returning floating point values. It also fixes a bug which manifests when an ST register defined by an inline-asm instruction was live across another inline-asm instruction, as shown in the following sequence of machine instructions: 1. INLINEASM <es:frndint> $0:[regdef], %ST0<imp-def,tied5> 2. INLINEASM <es:fldcw $0> 3. %FP0<def> = COPY %ST0 <rdar://problem/16952634> llvm-svn: 214580	2014-08-01 22:19:41 +00:00
NAKAMURA Takumi	6e3fcdb3f0	llvm/test/CodeGen/Mips/cconv/arguments-varargs.ll: Add explicit -mtriple=(mips\|mipsel)-linux on 4 lines. llvm-svn: 214578	2014-08-01 22:15:38 +00:00
Adrian Prantl	bce66c31aa	Debug info: Infrastructure to support debug locations for fragmented variables (for example, by-value struct arguments passed in registers, or large integer values split across several smaller registers). On the IR level, this adds a new type of complex address operation OpPiece to DIVariable that describes size and offset of a variable fragment. On the DWARF emitter level, all pieces describing the same variable are collected, sorted and emitted as DWARF expressions using the DW_OP_piece and DW_OP_bit_piece operators. http://reviews.llvm.org/D3373 rdar://problem/15928306 What this patch doesn't do / Future work: - This patch only adds the backend machinery to make this work, patches that change SROA and SelectionDAG's type legalizer to actually create such debug info will follow. (http://reviews.llvm.org/D2680) - Making the DIVariable complex expressions into an argument of dbg.value will reduce the memory footprint of the debug metadata. - The sorting/uniquing of pieces should be moved into DebugLocEntry, to facilitate the merging of multi-piece entries. llvm-svn: 214576	2014-08-01 22:11:58 +00:00
Chandler Carruth	697d6af472	[SDAG] MorphNodeTo recursively deletes dead operands of the old fromulation of the node, which isn't really the desired behavior from within the combiner or legalizer, but is necessary within ISel. I've added a hopefully helpful comment and fixed the only two places where this took place. Yet another step toward the combiner and legalizer not needing to use update listeners with virtual calls to manage the worklists behind legalization and combining. llvm-svn: 214574	2014-08-01 22:09:43 +00:00
Tom Stellard	2e31693e97	Revert "R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp" This reverts commit r214566. I did not mean to commit this yet. llvm-svn: 214572	2014-08-01 21:55:50 +00:00
Reid Kleckner	9e05075d55	MS inline asm: Hide symbol to attempt to fix test failure on darwin If the symbol comes from an external DSO, it apparently requires indirection through a register. llvm-svn: 214571	2014-08-01 21:54:37 +00:00
Duncan P. N. Exon Smith	323f635bbe	BitcodeReader: Change mechanics of BlockAddress forward references, NFC Now that we can reliably handle forward references to `BlockAddress` (r214563), change the mechanics to simplify predicting use-list order. Previously, we created dummy `GlobalVariable`s to represent block addresses. After every function was materialized, we'd go through any forward references to its blocks and RAUW them with a proper `BlockAddress` constant. This causes some (potentially a lot of) unnecessary use-list churn, since any constant expression that it's a part of will need to be rematerialized as well. Instead, pre-construct a `BasicBlock` immediately -- without attaching it to its (empty) `Function` -- and use that to construct a `BlockAddress`. This constant will not have to be regenerated. When the function body is parsed, hook this pre-constructed basic block up in the right place using `BasicBlock::insertInto()`. Both before and after this change, the IR is temporarily in an invalid state that gets resolved when `materializeForwardReferencedFunctions()` gets called. This is a prep commit that's part of PR5680, but the only functionality change is the reduction of churn in the constant pool. llvm-svn: 214570	2014-08-01 21:51:52 +00:00
Tom Stellard	db07c33258	R600/SI: Remove leftover debugging code llvm-svn: 214569	2014-08-01 21:51:05 +00:00
Tom Stellard	150fd6c318	R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp SI doesn't use REGISTER_LOAD anymore, but it was still hitting this code path for 8-bit and 16-bit private loads. llvm-svn: 214566	2014-08-01 21:50:47 +00:00
Reid Kleckner	eddef3ffd2	docs: Strongly recommend setting rpath when using a local GCC toolchain Users keep emailing us about the difficulties of getting LD_LIBRARY_PATH into their environment, which should be completely unecessary. Try to strengthen the rpath recommentation by putting in an example cmake invocation. Speaking of which, we might want to make CMake the recommended build system in GettingStarted.html. llvm-svn: 214565	2014-08-01 21:40:53 +00:00
Duncan P. N. Exon Smith	d51a043e5b	IR: Add BasicBlock::insertInto() Although unlinked `BasicBlock`s can be created, there's currently no way to insert them into `Function`s after the fact. In particular, `moveAfter()` and `moveBefore()` require that the basic block is already linked. Extract the logic for initially linking a `BasicBlock` out of the constructor and into a member function that can be used for lazy insertion. - Asserts that the basic block is currently unlinked. - Matches the logic of the constructor. - Changed the constructor to use it since the logic matches. This is needed in a follow-up commit for PR5680. llvm-svn: 214563	2014-08-01 21:22:04 +00:00
Peter Collingbourne	dd57da004b	[dfsan] Correctly handle loads and stores of zero size. llvm-svn: 214561	2014-08-01 21:18:18 +00:00
Eric Christopher	dfc6457da2	Add a non-const subtarget returning function to the target machine so that we can use it to get the old-style JIT out of the subtarget. This code should be removed when the old-style JIT is removed (imminently). llvm-svn: 214560	2014-08-01 21:18:01 +00:00
Duncan P. N. Exon Smith	9829cda628	BitcodeReader: Fix some BlockAddress forward reference corner cases `BlockAddress`es are interesting in that they can reference basic blocks from outside the block's function. Since basic blocks are not global values, this presents particular challenges for lazy parsing. One corner case was found in PR11677 and fixed in r147425. In that case, a global variable references a block address. It's necessary to load the relevant function to resolve the forward reference before doing anything with the module. By inspection, I found (and have fixed here) two other cases: - An instruction from one function references a block address from another function, and only the first function is lazily loaded. I fixed this the same way as PR11677: by eagerly loading the referenced function. - A function whose block address is taken is dematerialized, leaving invalid references to it. I fixed this by refusing to dematerialize functions whose block addresses are taken (if you have to load it, you can't unload it). llvm-svn: 214559	2014-08-01 21:11:34 +00:00
Duncan P. N. Exon Smith	9f6cc29330	Try to fix configure+make after r214556 llvm-svn: 214558	2014-08-01 21:06:59 +00:00
Duncan P. N. Exon Smith	2f470a350f	Rewrite BitReaderTest, NFC Rewrite the single unit test in `BitReaderTest` so that it's easier to add more tests. - Parse from an assembly string rather than using API. - Use more helper functions. - Use a separate context for the module on the other side. Aside from relying on the assembly parser, there's no functionality change intended. llvm-svn: 214556	2014-08-01 21:01:04 +00:00
Reid Kleckner	bfbea18b59	MS inline asm: Use memory constraints for functions instead of registers This is consistent with how we parse them in a standalone .s file, and inline assembly shouldn't differ. This fixes errors about requiring more registers than available in cases like this: void f(); void __declspec(naked) g() { __asm pusha __asm call f __asm popa __asm ret } There are no registers available to pass the address of 'f' into the asm blob. The asm should now directly call 'f'. Tests will land in Clang shortly. llvm-svn: 214550	2014-08-01 20:21:24 +00:00
Justin Bogner	a0f98c4e13	llvm-profdata: Replace redundant tests with more targeted ones llvm-svn: 214548	2014-08-01 19:59:48 +00:00
Chandler Carruth	d0836ba062	[SDAG] Begin simplifying the way in which the legalizer deletes nodes. This lifts the (very few) places the legalizer would delete dead nodes into the outer loop around the legalizer. This is significantly simpler because it doesn't require the legalizer itself to manage the iterator validity, and it doesn't require the legalizer to be a DAG update listener in order to remove things from the legalized set. It also makes the interface much less contrived for the case of the legalizer running inside the last phase of DAG combining. I'm working on centralizing the deletion of nodes during both legalizing and combining as much as possible. My hope is to remove the need for DAG update listeners from the combiner next, which would remove a costly virtual dispatch chain on every deletion. This in turn should allow us to more aggressively delete DAG nodes during combining which will in turn allow us to combine more aggressively by exposing the actual nodes which have single users to the combine phases. llvm-svn: 214546	2014-08-01 19:49:59 +00:00

1 2 3 4 5 ...

106254 Commits