llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Igor Breger	1577d53536	[GlobalISel][X86] extend G_ZEXT support. Summary: Mark G_ZEXT/G_SEXT i1 to i8/i16, i8 to i16 as legal. Support G_ZEXT i1 to i8/i16 instruction selection (C++ code). This patch is required to support G_LOAD/G_STORE i1. Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D35177 llvm-svn: 307526	2017-07-10 09:07:34 +00:00
Kirill Bobyrev	69173e21c2	[docs] NFC: Fix links in the tutorial r274441 introduced Chapter 10 of "Implementing a Language with LLVM" tutorial, which caused all files in the tutorial to start using two digit numbering. But many links were not changed and therefore appear to be broken. This patch addresses described issue. As a result, following command does not produce any output anymore: $ grep -nR '<LangImpl[0-9].html>' ./docs/tutorial/ llvm-svn: 307525	2017-07-10 09:07:23 +00:00
Hiroshi Inoue	0df97f9412	fix formatting; NFC llvm-svn: 307523	2017-07-10 06:32:52 +00:00
Craig Topper	40963ccb4a	[X86] Fix typo in comment. NFC llvm-svn: 307522	2017-07-10 06:09:22 +00:00
Mikael Holmen	d33ff77445	[ArgumentPromotion] Change use of removed argument in llvm.dbg.value to undef Summary: This solves PR33641. When removing a dead argument we must also handle possibly existing calls to llvm.dbg.value that use the removed argument. Now we change the use of the otherwise dead argument to an undef for some other pass to cleanup later. If the calls are left untouched, they will later on cause errors: "function-local metadata used in wrong function" since the ArgumentPromotion rewrites the code by creating a new function with the wanted signature, but the metadata is not recreated so the new function may then erroneously use metadata from the old function. Reviewers: mstorsjo, rnk, arsenm Reviewed By: rnk Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D34874 llvm-svn: 307521	2017-07-10 06:07:24 +00:00
Craig Topper	83f6936cbe	[X86] Remove asserts from getX86CpuIDAndInfo/getX86CpuIDAndInfoEx. Restore past behavior of returning an unsupported indication to the caller instead. These asserts could only occur if we fail to properly detect the compiler, but an assert is not a good way to do that because it doesn't work in release builds. I wonder if we could use #error? llvm-svn: 307520	2017-07-10 06:04:11 +00:00
Chandler Carruth	ba3fe3e64d	[ADT] Fix another "oops" spotted by eddyb and reported in IRC. This test pretty clearly should be calling 'maxnum' here. =] llvm-svn: 307519	2017-07-10 05:41:14 +00:00
David Blaikie	ade81caa2c	llvm-profdata: Reduce memory usage by using Error callback rather than member Reduces llvm-profdata memory usage on a large profile from 7.8GB to 5.1GB. The ProfData API now supports reporting all the errors/warnings rather than only the first, though llvm-profdata ignores everything after the first for now to preserve existing behavior. (if there's a desire for other behavior, happy to implement that - but might be as well left for a separate patch) Reviewers: davidxl Differential Revision: https://reviews.llvm.org/D35149 llvm-svn: 307516	2017-07-10 03:04:59 +00:00
NAKAMURA Takumi	6a1725f368	CGSCCPassManagerTest.cpp: Fix warnings. [-Wunused-variable] llvm-svn: 307511	2017-07-09 23:06:05 +00:00
Davide Italiano	ac5e953a20	[X86] Relax an assertion when legalizing vector types. WidenVSELECTAndMask can fold (and it folds in this case) so we get a BUILD_VECTOR of constants as mask. convertMask() seems to work fine when the input is a vector of constants, and we still need to call it to extend/add elements at the end. but the current code just asserts on anything but a SETCC or AND/OR/XOR of 2xSETCC. This change was discussed briefly with Simon Pilgrim, who also suggests we might consider dropping this assertion in the future. Fixes PR33715. llvm-svn: 307508	2017-07-09 19:22:48 +00:00
Simon Pilgrim	4cbd680770	[X86] Allow GHC calling convention to use YMM and ZMM registers GHC 8.4 will know how to use YMM and ZMM registers for calls. Submitted on behalf of @bgamari (Ben Gamari) Differential Revision: https://reviews.llvm.org/D34854 llvm-svn: 307504	2017-07-09 16:57:10 +00:00
Dylan McKay	d5e39b8d0f	[AVR] Fix test errors due to tied operands not matching Broken due to r307259. llvm-svn: 307503	2017-07-09 16:36:35 +00:00
Simon Pilgrim	78bcdd73d3	Handle ConstantExpr correctly in SelectionDAGBuilder This change fixes a bug in SelectionDAGBuilder::visitInsertValue and SelectionDAGBuilder::visitExtractValue where constant expressions (InsertValueConstantExpr and ExtractValueConstantExpr) would be treated as non-constant instructions (InsertValueInst and ExtractValueInst). This bug resulted in an incorrect memory access, which manifested as an assertion failure in SDValue::SDValue. Fixes PR#33094. Submitted on behalf of @Praetonus (Benoit Vey) Differential Revision: https://reviews.llvm.org/D34538 llvm-svn: 307502	2017-07-09 16:01:04 +00:00
Simon Pilgrim	0dab557813	[X86][AVX512] Regenerate AVX512VL comparison tests. Show poor codegen on KNL targets as mentioned on D35179 llvm-svn: 307500	2017-07-09 15:47:43 +00:00
Chandler Carruth	fda9b703c9	[PM] Fix a nasty bug in the new PM where we failed to properly invalidate analyses when merging SCCs. While I've added a bunch of testing of this, it takes something much more like the inliner to really trigger this as you need to have partially-analyzed SCCs with updates at just the right time. So I've added a direct test for this using the inliner and verifying the domtree. Without the changes here, this test ends up finding a stale dominator tree. However, to handle this properly, we need to invalidate analyses before merging the SCCs. After talking to Philip and Sanjoy about this they convinced me this was the right approach. To do this, we need a callback mechanism when merging SCCs so we can observe the cycle that will be merged before the merge happens. This API update ended up being surprisingly easy. With this commit, the new PM passes the test-suite again. It hadn't since MemorySSA was enabled for EarlyCSE as that also will find this bug very quickly. llvm-svn: 307498	2017-07-09 13:45:11 +00:00
Chandler Carruth	267b806fd9	[PM] Add unittesting of the call graph update logic with complex dependencies between analyses. This uncovers even more issues with the proxies and the splitting apart of SCCs which are fixed in this patch. I discovered this while trying to add more rigorous testing for a change I'm making to the call graph update invalidation logic. llvm-svn: 307497	2017-07-09 13:16:55 +00:00
Chandler Carruth	2651011b9f	[ADT] Fix a test case to use a correct escape for a null byte followed by a valid octal digit. The length argument shows that this was in fact the intent. This was pointed out in IRC, thanks to eddyb! llvm-svn: 307496	2017-07-09 07:37:47 +00:00
Craig Topper	ff8bea79d2	[X86] Remove check for AVX512 support from skylake-avx512 detection in getHostCPUName. Users of getHostCPUName should also use getHostCPUFeatures which will take care of making sure avx512 is disabled if the CPU doesn't support it. This is consistent with what we do for other CPUs. llvm-svn: 307495	2017-07-09 07:26:14 +00:00
Igor Breger	f0a257f801	[GlobalISel][X86] Add legalizer tests for G_LOAD/G_STORE operations. NFC. llvm-svn: 307494	2017-07-09 07:25:57 +00:00
Chandler Carruth	36d470ca31	[PM] Teach PreservedAnalyses to have an `allInSet` static factory function template to simplify building a quick object with a set marked as preserved. llvm-svn: 307493	2017-07-09 07:23:27 +00:00
Craig Topper	8652178bc5	[IR] Add Type::isIntOrIntVectorTy(unsigned) similar to the existing isIntegerTy(unsigned), but also works for vectors. llvm-svn: 307492	2017-07-09 07:04:03 +00:00
Craig Topper	86739c18e2	[IR] Make use of Type::isPtrOrPtrVectorTy/isIntOrIntVectorTy/isFPOrFPVectorTy to shorten code. NFC llvm-svn: 307491	2017-07-09 07:04:00 +00:00
Chandler Carruth	36d9dcb3f0	[ADT] Add a default constructor and a bool conversion to function_ref. The internal representation has a natural way to handle this and it seems nicer than having to wrap this in an optional (with its own separate flag). This also matches how std::function works. llvm-svn: 307490	2017-07-09 06:12:56 +00:00
Igor Breger	7805125c03	[FastISel] fix a fallback diagnostic. Summary: FastISel was marked as failed in case instruction selection succeeded. Reviewers: qcolombet, zvi, rovka, ab Reviewed By: zvi Subscribers: javed.absar, ab, qcolombet, bogner, llvm-commits Differential Revision: https://reviews.llvm.org/D34438 llvm-svn: 307489	2017-07-09 05:55:20 +00:00
Hiroshi Inoue	1d578b7e75	fix trivial typos; NFC sucessor -> successor llvm-svn: 307488	2017-07-09 05:54:44 +00:00
Chandler Carruth	e28f591d48	[PM] Finish implementing and fix a chain of bugs uncovered by testing the invalidation propagation logic from an SCC to a Function. I wrote the infrastructure to test this but didn't actually use it in the unit test where it was designed to be used. =[ My bad. Once I actually added it to the test case I discovered that it also hadn't been properly implemented, so I've implemented it. The logic in the FAM proxy for an SCC pass to propagate invalidation follows the same ideas as the FAM proxy for a Module pass, but the implementation is a bit different to reflect the fact that it is forwarding just for an SCC. However, implementing this correctly uncovered a surprising "bug" (it was conservatively correct but relatively very expensive) in how we handle invalidation when splitting one SCC into multiple SCCs. We did an eager invalidation when in reality we should be deferring invalidation for the current SCC to the CGSCC pass manager and just invalidating the newly constructed SCCs. Otherwise we end up invalidating too much too soon. This was exposed by the inliner test case that I've updated. Now, we invalidate just the split off '(test1_f)' SCC when doing the CG update, and then the inliner finishes and invalidates the '(test1_g, test1_h)' SCC's analyses. The first few attempts at fixing this hit still more bugs, but all of those are covered by existing tests. For example, the inliner should also preserve the FAM proxy to avoid unnecessary invalidation, and this is safe because the CG update routines it uses handle any necessary adjustments to the FAM proxy. Finally, the unittests for the CGSCC pass manager needed a bunch of updates where we weren't correctly preserving the FAM proxy because it hadn't been fully implemented and failing to preserve it didn't matter. Note that this doesn't yet fix the current crasher due to MemSSA finding a stale dominator tree, but without this the fix to that crasher doesn't really make any sense when testing because it relies on the proxy behavior. llvm-svn: 307487	2017-07-09 03:59:31 +00:00
Craig Topper	ddb906e55b	[InstCombine] Speculatively implement a fix for what might be the root cause of PR33721 by making sure that we have integer types before doing select C, -1, 0 -> sext C to int I recently changed m_One and m_AllOnes to use Constant::isOneValue/isAllOnesValue which work on floating point values too. The original implementation looked specifically for ConstantInt scalars and splats. So I'm guessing we are accidentally trying to issue sext/zexts on floating point types now. Hopefully I figure out how to reproduce the failure from the PR soon. llvm-svn: 307486	2017-07-09 03:25:17 +00:00
Simon Pilgrim	7f479d6ac7	[AMDGPU] Fix -Wimplicit-fallthrough warning. NFCI. llvm-svn: 307485	2017-07-08 19:50:03 +00:00
Simon Pilgrim	82a8d24e96	[AArch64] Fix -Wimplicit-fallthrough warnings. NFCI. Add breaks - doesn't affect results as GPR/FPU both check for 32/64 bit sizes, so will still default to GenericOps in the same way. llvm-svn: 307484	2017-07-08 19:28:24 +00:00
Simon Pilgrim	5d9ea7a075	[ARM] Fix -Wimplicit-fallthrough warning. NFCI. llvm-svn: 307480	2017-07-08 18:42:04 +00:00
Yuka Takahashi	c2d79dd55e	[Bash-autocompletion] Auto complete cc1 options if -cc1 is specified Summary: We don't want to autocomplete flags whose Flags class has `NoDriverOption` when argv[1] is not `-cc1`. Another idea for this implementation is to make --autocomplete a cc1 option and handle it in clang Frontend, by porting --autocomplete handler from Driver to Frontend, so that we can handle Driver options and CC1 options in unified manner. Differential Revision: https://reviews.llvm.org/D34770 llvm-svn: 307479	2017-07-08 17:48:59 +00:00
Max Kazantsev	9e73a14fba	Re-enable "[IndVars] Canonicalize comparisons between non-negative values and indvars" The patch was reverted due to a bug. The bug was that if the IV is the 2nd operand of the icmp instruction, then the "Pred" variable gets swapped and differs from the instruction's predicate. In this patch we use the original predicate to do the transformation. Also added a test case that exercises this situation. Differential Revision: https://reviews.llvm.org/D35107 llvm-svn: 307477	2017-07-08 17:17:30 +00:00
Sanjay Patel	f72737b43a	[LoopVectorize] partly revert r307475 Bots are failing because of the additional checks. llvm-svn: 307476	2017-07-08 16:34:46 +00:00
Sanjay Patel	65961c9989	[LoopVectorize] auto-generate complete checks; NFC I'm looking at a cmp transform in InstCombine that would affect these tests, but it's hard to know if it makes things better or worse without seeing the full IR. OTOH, maybe these tests shouldn't be running a bunch of transform passes in the first place? llvm-svn: 307475	2017-07-08 16:10:42 +00:00
Simon Pilgrim	1a783539c1	Fix -Wimplicit-fallthrough warning. NFCI. llvm-svn: 307473	2017-07-08 15:26:26 +00:00
Sanjay Patel	0c17c0063e	[x86] add SBB optimization for SETBE (ule) condition code x86 scalar select-of-constants (Cond ? C1 : C2) combining/lowering is a mess with missing optimizations. We handle some patterns, but miss logical variants. To clean that up, we should convert all select-of-constants to logic/math and enhance the combining for the expected patterns from that. Selecting 0 or -1 needs extra attention to produce the optimal code as shown here. Attempt to verify that all of these IR forms are logically equivalent: http://rise4fun.com/Alive/plxs Earlier steps in this series: rL306040 rL306072 rL307404 (D34652) As acknowledged in the earlier review, there's a possibility that some Intel uarch would prefer to produce an xor to clear the fake register operand with sbb %eax, %eax. This will likely need to be addressed in a separate pass. llvm-svn: 307471	2017-07-08 14:04:48 +00:00
Kamil Rytarowski	0cc0979214	[Solaris] get rid of _RESTRICT_KYWD warning during the build Summary: (re)definition of _RESTRICT_KYWD rightfully causes a warning message during the Solaris build. This hack is not needed if the build compiler is properly configured (e.g. /usr/bin/gcc) so just remove it. Reviewers: ro, mgorny, krytarowski, joerg Reviewed By: joerg Subscribers: quenelle, llvm-commits Patch by Fedor Sergeev (Oracle). Differential Revision: https://reviews.llvm.org/D35054 llvm-svn: 307469	2017-07-08 11:27:56 +00:00
Craig Topper	5289ced7ae	[X86] In getHostCPUName, remove some code that changes some AMD CPU names based on features not being enabled. The CPU name is really just used for scheduler and other microarchitectural optimizations. The feature flags should be determined by getHostCPUFeatures which should always be used with getHostCPUName. Trying to alter CPU name strings to control features just isn't practical. Most of these types of things were removed from Intel CPUs a while ago. This is part of my plan to bring compiler-rt's cpu_model.c file up to date with the equivalent functionality in libgcc. A lot of the code in that file is copied from Host.cpp and we want to keep them reasonably in sync. llvm-svn: 307467	2017-07-08 06:44:36 +00:00
Craig Topper	5e2ff56deb	[X86] Correct the BDVER4 model numbers to include 0x70-0x7f. Wikipedia and some other googling suggest these should also be considered BDVER4. llvm-svn: 307466	2017-07-08 06:44:35 +00:00
Craig Topper	bd57d98faa	[X86] Minor formatting fix. NFC llvm-svn: 307465	2017-07-08 06:44:34 +00:00
Craig Topper	4e4e7645c0	[X86] Use 'unsigned' instead of 'unsigned int' for consistency in the X86 portion of Host.cpp. llvm-svn: 307463	2017-07-08 05:16:14 +00:00
Craig Topper	f7e25b5bcd	[X86] Cleanup some CPUID usage in getAvailableFeatures. We should make sure leaf 1 is available before accessing it. Same with leaf 0x80000001. llvm-svn: 307462	2017-07-08 05:16:13 +00:00
Eric Beckmann	0a571cb380	Revert "Revert "Revert "Revert "Switch external cvtres.exe for llvm's own resource library."""" This reverts commit 147f45ff24456aea59575fa4ac16c8fa554df46a. Revert "Revert "Revert "Revert "Replace trivial use of external rc.exe by writing our own .res file."""" This reverts commit 61a90a67ed54a1f0dfeab457b65abffa129569e4. The patches were initially reverted because they were causing a failure on CrWinClangLLD. Unfortunately, this was done haphazardly and didn't compile, so the revert was reverted again quickly to fix this. Once that was done, the revert of the revert was itself reverted. This allowed me to finally fix the actual bug in r307452. This patch re-enables the code path that had originally been causing the bug, now that it should be fixed. llvm-svn: 307460	2017-07-08 03:06:10 +00:00
Eric Christopher	8fe591d225	Remove a variable that was only used in asserts and had a duplicate copy in something we did use anyhow. llvm-svn: 307457	2017-07-08 01:03:29 +00:00
Eric Beckmann	fdff2a0519	Add name offset flags, for parity with cvtres.exe. Summary: The original cvtres.exe sets the high bit when an identifier offset points to a string. Even though this is not mentioned in the spec, and in fact does not seem to cause errors with most cases, for some reason this causes a failure in Chromium where the new resource file is not verified as a new version. This patch sets this high bit flag, and also adds a test case to check that the output of our library is always identical to original cvtres. Reviewers: zturner, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D35099 llvm-svn: 307452	2017-07-07 23:23:53 +00:00
Craig Topper	ce5664ff66	[InstCombine] Make InstCombine's IRBuilder be passed by reference everywhere Previously the InstCombiner class contained a pointer to an IR builder that had been passed to the constructor. Sometimes this would be passed to helper functions as either a pointer or the pointer would be dereferenced to be passed by reference. This patch makes it a reference everywhere including the InstCombiner class itself so there is no more inconsistency. This is a large, but mechanical patch. I've done very minimal formatting changes on it despite what clang-format wanted to do. llvm-svn: 307451	2017-07-07 23:16:26 +00:00
Lei Huang	eab61d9acf	[PowerPC] NFC : Common up definitions of isIntS16Immediate and update parameter to int16_t llvm-svn: 307442	2017-07-07 21:12:35 +00:00
David Blaikie	5cdf6b0e88	ProfData: Fix some unchecked Errors in unit tests The 'NoError' function was meant to be used as the input to ASSERT/EXPECT_TRUE, but it is easy to forget this (it could be annotated with nodiscard to help this) so many sites that look like they're checked are not (& silently discard the failure). Only one site actually has an Error sneaking out this way and I've replaced that one with a FIXME+consumeError. The rest of the code has been modified to use the EXPECT_THAT_ERROR macros Zach introduced a while back. Between the options available this seems OK/good/something to standardize on - though it's difficult to build a matcher that could handle checking for a specific llvm::Error result, so those remain using the custom ErrorEquals (& the nodiscard added to ensure it is not misused as it was previous to this patch). It could still be generalized a bit further (even not as far as a matcher, but at least support multiple kinds of Error, etc) & added to the general Error utility header. llvm-svn: 307440	2017-07-07 21:02:59 +00:00
Dehao Chen	93a53b9d0d	Increase the import-threshold for critical functions. Summary: For iterative sample-pgo, if a hot call site is inlined in the profiling binary, we should inline it before profile annotation in the backend. Before that, the compile phase first collects all GUIDs that need to be imported and creates virtual "hot" call edge in the summary. However, "hot" is not good enough to guarantee the callsites get inlined. This patch introduces "critical" call edge, and assigns much higher importing threshold for those edges. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: sanjoy, mehdi_amini, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D35096 llvm-svn: 307439	2017-07-07 21:01:00 +00:00
Dehao Chen	a2cdd9ac6d	Add sample PGO support to ThinLTO new pass manager. Summary: For SamplePGO + ThinLTO, because profile annotation is done twice at both PrepareForThinLTO pipeline and backend compiler, the following changes are needed at the PrepareForThinLTO phase to ensure the IR is not changed dramatically. Otherwise the profile annotation will be inaccurate in the backend compiler. * disable hot-caller heuristic * disable loop unrolling * disable indirect call promotion This will unblock the new PM testing for sample PGO (tools/clang/test/CodeGen/pgo-sample-thinlto-summary.c), which will be covered in another cfe patch. Reviewers: chandlerc, tejohnson, davidxl Reviewed By: tejohnson Subscribers: sanjoy, mehdi_amini, Prazek, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D34895 llvm-svn: 307437	2017-07-07 20:53:10 +00:00
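
Commits ddb906e55b and 0c17c0063e both concern selecting between -1 and 0 on a condition (`select C, -1, 0`), which for integer types is the same value as sign-extending the 1-bit condition. The sketch below only illustrates that equivalence in plain C++ for the unsigned "below or equal" (ule) case. It is not the InstCombine or X86 lowering code, and the function names `selectForm`, `sextForm`, and `formsAgree` are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// select C, -1, 0 written as a conditional, with C = (A ule B).
int32_t selectForm(uint32_t A, uint32_t B) { return A <= B ? -1 : 0; }

// The same value computed as a sign extension of the 1-bit condition:
// the comparison yields 0 or 1, and negating it yields 0 or -1.
int32_t sextForm(uint32_t A, uint32_t B) {
  return -static_cast<int32_t>(A <= B);
}

// Hypothetical helper: checks that both forms agree for one input pair.
bool formsAgree(uint32_t A, uint32_t B) {
  return selectForm(A, B) == sextForm(A, B);
}

// Spot-check a few boundary inputs (no-op if NDEBUG is defined).
void checkForms() {
  assert(formsAgree(0u, 0u));
  assert(formsAgree(1u, 0u));
  assert(formsAgree(0u, 0xFFFFFFFFu));
}
```

The identity only makes sense for integer results, which matches the concern in ddb906e55b about restricting the transform to integer types.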

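
Commit 36d9dcb3f0 gives `function_ref` a default constructor and a bool conversion so that an empty reference can be represented without wrapping it in an optional. The following is a minimal sketch of that idea, assuming a simplified non-owning callable reference. The class name `FnRef` and the helper `applyOrDefault` are hypothetical, and this is not the LLVM implementation.

```cpp
#include <cstdint>
#include <type_traits>
#include <utility>

template <typename Fn> class FnRef;

// Non-owning reference to a callable; empty when default-constructed.
template <typename Ret, typename... Params>
class FnRef<Ret(Params...)> {
  Ret (*Callback)(std::intptr_t, Params...) = nullptr;
  std::intptr_t Callable = 0;

  // Trampoline that restores the callable's type and invokes it.
  template <typename C>
  static Ret invoke(std::intptr_t Obj, Params... Ps) {
    return (*reinterpret_cast<C *>(Obj))(std::forward<Params>(Ps)...);
  }

public:
  FnRef() = default; // the "empty" state: a null callback

  // Bind to any callable except FnRef itself (so copies stay copies).
  template <typename C,
            typename = typename std::enable_if<!std::is_same<
                typename std::decay<C>::type, FnRef>::value>::type>
  FnRef(C &&Fn)
      : Callback(invoke<typename std::remove_reference<C>::type>),
        Callable(reinterpret_cast<std::intptr_t>(&Fn)) {}

  Ret operator()(Params... Ps) const {
    return Callback(Callable, std::forward<Params>(Ps)...);
  }

  // True when a callable is bound.
  explicit operator bool() const { return Callback != nullptr; }
};

// Hypothetical caller: applies F when one is bound, else returns X as-is.
int applyOrDefault(FnRef<int(int)> F, int X) { return F ? F(X) : X; }
```

The null callback pointer doubles as the "empty" flag, which is presumably the "natural way" the commit message refers to. As with any non-owning reference, the bound callable must outlive the `FnRef`.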

151443 Commits
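
Commit 2651011b9f fixes a test that needed a null byte followed by a character that is also a valid octal digit. The sketch below shows the general C++ pitfall, under the assumption that the test hit this same escape rule: an octal escape consumes up to three digits, so `"\01"` is a single byte with value 1, not a NUL followed by `'1'`. The helper names `literalLength` and `checkEscapes` are hypothetical and not taken from the LLVM test.

```cpp
#include <cassert>
#include <cstddef>

// Length of a string literal, excluding the implicit terminator.
template <std::size_t N>
constexpr std::size_t literalLength(const char (&)[N]) {
  return N - 1;
}

// "\01" is ONE character: the octal escape absorbs the digit '1'.
static_assert(literalLength("\01") == 1, "octal escape absorbs the digit");

// Two ways to spell NUL followed by the character '1':
static_assert(literalLength("\0001") == 2, "three-digit octal, then '1'");
static_assert(literalLength("\0" "1") == 2, "adjacent literals");

// Run-time check of the actual bytes in the split-literal spelling.
void checkEscapes() {
  const char Split[] = "\0" "1";
  assert(Split[0] == '\0' && Split[1] == '1');
}
```

Passing an explicit length alongside such a literal, as the commit message mentions, is what makes the intended byte count visible to a reader.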