llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 05:52:53 +02:00

Author	SHA1	Message	Date
Tim Northover	68aeddde08	ARM: add patterns for [su]xta[bh] from just a shift. Although the final shifter operand is a rotate, this actually only matters for the half-word extends when the amount == 24. Otherwise folding a shift in is just as good. llvm-svn: 213753	2014-07-23 13:59:07 +00:00
James Molloy	41a2f5b855	Enable partial libcall inlining for all targets by default. This pass attempts to speculatively use a sqrt instruction if one exists on the target, falling back to a libcall if the target instruction returned NaN. This was enabled for MIPS and System-Z, but is well guarded and is good for most targets - GCC does this for (that I've checked) X86, ARM and AArch64. llvm-svn: 213752	2014-07-23 13:33:00 +00:00
Tilmann Scheller	189ad507d0	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STRB instructions. The ARM ARM prohibits STRB instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STRB instructions with unpredictable behavior. llvm-svn: 213750	2014-07-23 13:03:47 +00:00
Daniel Sanders	8566b88442	Added release notes for MIPS. llvm-svn: 213749	2014-07-23 12:59:26 +00:00
Tim Northover	c357579164	AArch64: remove "arm64_be" support in favour of "aarch64_be". There really is no arm64_be: it was a useful fiction to test big-endian support while both backends existed in parallel, but now the only platform that uses the name (iOS) doesn't have a big-endian variant, let alone one called "arm64_be". llvm-svn: 213748	2014-07-23 12:58:11 +00:00
Tilmann Scheller	ea661e77fd	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STR instructions. The ARM ARM prohibits STR instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STR instructions with unpredictable behavior. llvm-svn: 213745	2014-07-23 12:38:17 +00:00
Tim Northover	3e8fd62854	AArch64: remove arm64 triple enumerator. Having both Triple::arm64 and Triple::aarch64 is extremely confusing, and invites bugs where only one is checked. In reality, the only legitimate difference between the two (arm64 usually means iOS) is also present in the OS part of the triple and that's what should be checked. We still parse the "arm64" triple, just canonicalise it to Triple::aarch64, so there aren't any LLVM-side test changes. llvm-svn: 213743	2014-07-23 12:32:47 +00:00
Andrea Di Biagio	e373d3d06d	Revert r211771. It was: "[X86] Improve the selection of SSE3/AVX addsub instructions". This chang fully reverts r211771. That revision added a canonicalization rule which has the potential to causes a combine-cycle in the target-independent canonicalizing DAG combine. The plan is to move the logic that forms target specific addsub nodes as part of the lowering of shuffles. llvm-svn: 213736	2014-07-23 11:20:24 +00:00
Chandler Carruth	d673be21b7	[x86] Clean up a test case to use check labels and spell out the exact instruction sequences with CHECK-NEXT for these test cases. This notably exposes how absolutely horrible the generated code is for several of these test cases, and will make any future updates to the test as our vector instruction selection gets better. llvm-svn: 213732	2014-07-23 09:11:48 +00:00
Tilmann Scheller	92abecadc2	[ARM] Add regression test for the earlyclobber constraint of ARM STRB. The constraint was added in r213369. llvm-svn: 213730	2014-07-23 08:39:50 +00:00
Tilmann Scheller	e0c6a75a6a	[ARM] Add earlyclobber constraint to pre/post-indexed ARM STRH instructions. The post-indexed instructions were missing the constraint, causing unpredictable STRH instructions to be emitted. The earlyclobber constraint on the pre-indexed STR instructions is not strictly necessary, as the instruction selection for pre-indexed STR instructions goes through an additional layer of pseudo instructions which have the constraint defined, however it doesn't hurt to specify the constraint directly on the pre-indexed instructions as well, since at some point someone might create instances of them programmatically and then the constraint is definitely needed. llvm-svn: 213729	2014-07-23 08:12:51 +00:00
Chandler Carruth	908e62868c	[SDAG] Make the DAGCombine worklist not grow endlessly due to duplicate insertions. The old behavior could cause arbitrarily bad memory usage in the DAG combiner if there was heavy traffic of adding nodes already on the worklist to it. This commit switches the DAG combine worklist to work the same way as the instcombine worklist where we null-out removed entries and only add new entries to the worklist. My measurements of codegen time shows slight improvement. The memory utilization is unsurprisingly dominated by other factors (the IR and DAG itself I suspect). This change results in subtle, frustrating churn in the particular order in which DAG combines are applied which causes a number of minor regressions where we fail to match a pattern previously matched by accident. AFAICT, all of these should be using AddToWorklist to directly or should be written in a less brittle way. None of the changes seem drastically bad, and a few of the changes seem distinctly better. A major change required to make this work is to significantly harden the way in which the DAG combiner handle nodes which become dead (zero-uses). Previously, we relied on the ability to "priority-bump" them on the combine worklist to achieve recursive deletion of these nodes and ensure that the frontier of remaining live nodes all were added to the worklist. Instead, I've introduced a routine to just implement that precise logic with no indirection. It is a significantly simpler operation than that of the combiner worklist proper. I suspect this will also fix some other problems with the combiner. I think the x86 changes are really minor and uninteresting, but the avx512 change at least is hiding a "regression" (despite the test case being just noise, not testing some performance invariant) that might be looked into. Not sure if any of the others impact specific "important" code paths, but they didn't look terribly interesting to me, or the changes were really minor. The consensus in review is to fix any regressions that show up after the fact here. Thanks to the other reviewers for checking the output on other architectures. There is a specific regression on ARM that Tim already has a fix prepped to commit. Differential Revision: http://reviews.llvm.org/D4616 llvm-svn: 213727	2014-07-23 07:08:53 +00:00
Nick Lewycky	08dae2c274	We may visit a call that uses an alloca multiple times in callUsesLocalStack, sometimes with IsNocapture true and sometimes with IsNocapture false. We accidentally skipped work we needed to do in the IsNocapture=false case if we were called with IsNocapture=true the first time. Fixes PR20405! llvm-svn: 213726	2014-07-23 06:24:49 +00:00
NAKAMURA Takumi	413d897e7d	Rework to let RuntimeDyld/X86/MachO_x86-64_PIC_relocations.s pass on win32. FIXME: "llvm-rtdyld -verify -check" is still sensitive to path separator. Fix searching StubMap to be tolerant of both '/' and '\\' on Win32. llvm-svn: 213723	2014-07-23 04:32:21 +00:00
NAKAMURA Takumi	4555cd0e60	Suppress a test on win32 for now, llvm/test/ExecutionEngine/RuntimeDyld/X86/MachO_x86-64_PIC_relocations.s. FIXME: Fix searching StubMap with '/' and '\\' on Win32. llvm-svn: 213721	2014-07-23 04:05:58 +00:00
NAKAMURA Takumi	a6795992fa	RuntimeDyld/X86/MachO_x86-64_PIC_relocations.s: Use %/T here, or sed(1) would be confused with dos path. llvm-svn: 213720	2014-07-23 04:05:46 +00:00
NAKAMURA Takumi	8e44b58021	Trailing whitespace. llvm-svn: 213711	2014-07-23 00:42:52 +00:00
NAKAMURA Takumi	d63be9bfd5	RuntimeDyldMachOAArch64.h: Fix a warning. [-Wunused-variable] llvm-svn: 213710	2014-07-23 00:17:44 +00:00
Lang Hames	0a8073389f	[MCJIT] Make stub_addr functionality in RuntimeDyldChecker work in release mode. There's no reason to restrict this particular piece of RuntimeDyldChecker functionality to +Asserts builds. This should fix failures in MachO_x86-64_PIC_relocations.s on release bots. llvm-svn: 213708	2014-07-22 23:50:51 +00:00
Lang Hames	8cf25fbd15	[MCJIT] Teach RuntimeDyldChecker to handle underscores at the start of symbols. RuntimeDyldChecker had been testing isalpha(Expr[0]) to recognise symbol tokens, and throwing unrecognized token errors when it hit symbols with leading underscores. This fixes that. llvm-svn: 213706	2014-07-22 23:17:21 +00:00
Juergen Ributzka	00d4bb61a8	XFAIL the test on MIPS Not sure how to debug this one without a MIPS machine. Any takers? llvm-svn: 213705	2014-07-22 23:15:01 +00:00
Juergen Ributzka	c2d9ee45f3	[FastIsel][AArch64] Add support for the FastLowerCall and FastLowerIntrinsicCall target-hooks. This commit modifies the existing call lowering functions to be used as the FastLowerCall and FastLowerIntrinsicCall target-hooks instead. This enables patchpoint intrinsic lowering for AArch64. This fixes <rdar://problem/17733076> llvm-svn: 213704	2014-07-22 23:14:58 +00:00
Juergen Ributzka	49aab6445a	[AArch64] Use CHECK-LABEL in ARM64 ABI unit tests. llvm-svn: 213703	2014-07-22 23:14:54 +00:00
Lang Hames	b38fc8cc0c	[MCJIT] Improve stub_addr file-not-found diagnostic to help track down a buildbot failure. llvm-svn: 213701	2014-07-22 23:07:52 +00:00
Lang Hames	59246bdf0c	[MCJIT] Refactor and add stub inspection to the RuntimeDyldChecker framework. This patch introduces a 'stub_addr' builtin that can be used to find the address of the stub for a given (<file>, <section>, <symbol>) tuple. This address can be used both to verify the contents of stubs (by loading from the returned address) and to verify references to stubs (by comparing against the returned address). Example (1) - Verifying stub contents: Load 8 bytes (assuming a 64-bit target) from the stub for 'x' in the __text section of f.o, and compare that value against the addres of 'x'. # rtdyld-check: *{8}(stub_addr(f.o, __text, x) = x Example (2) - Verifying references to stubs: Decode the immediate of the instruction at label 'l', and verify that it's equal to the offset from the next instruction's PC to the stub for 'y' in the __text section of f.o (i.e. it's the correct PC-rel difference). # rtdyld-check: decode_operand(l, 4) = stub_addr(f.o, __text, y) - next_pc(l) l: movq y@GOTPCREL(%rip), %rax Since stub inspection requires cooperation with RuntimeDyldImpl this patch pimpl-ifies RuntimeDyldChecker. Its implementation is moved in to a new class, RuntimeDyldCheckerImpl, that has access to the definition of RuntimeDyldImpl. llvm-svn: 213698	2014-07-22 22:47:39 +00:00
Juergen Ributzka	09c7bc6096	Appease the buildbots. llvm-svn: 213694	2014-07-22 22:02:19 +00:00
Juergen Ributzka	b056ac8df3	[RuntimeDyld][MachO][AArch64] Add a helper function for encoding addends in instructions. Factor out the addend encoding into a helper function and simplify the processRelocationRef. Also add a few simple rtdyld tests. More tests to come once GOTs can be tested too. Related to <rdar://problem/17768539> llvm-svn: 213689	2014-07-22 21:42:55 +00:00
Juergen Ributzka	c81b44ac9b	[RuntimeDyld][MachO][AArch64] Implement the decodeAddend method. This adds the required functionality to decode the immediate encoded in an instruction that is referenced in a relocation entry. llvm-svn: 213688	2014-07-22 21:42:51 +00:00
Juergen Ributzka	4141db1926	[RuntimeDyld][MachO][AArch64] Add assertion to check for duplicate addend definition. In MachO for AArch64 it is possible to have an explicit addend defined by the ARM64_RELOC_ADDEND relocation or having an addend encoded within the instruction. Only one of them are allowed per relocation. llvm-svn: 213687	2014-07-22 21:42:49 +00:00
Juergen Ributzka	256ed910fb	[RuntimeDyld] Change the return type of decodeAddend to match the storage type. llvm-svn: 213686	2014-07-22 21:42:46 +00:00
Suyog Sarda	959fecbe70	This patch implements optimization as mentioned in PR19753: Optimize comparisons with "ashr/lshr exact" of a constanst. It handles the errors which were seen in PR19958 where wrong code was being emitted due to earlier patch. Added code for lshr as well as non-exact right shifts. It implements : (icmp eq/ne (ashr/lshr const2, A), const1)" -> (icmp eq/ne A, Log2(const2/const1)) -> (icmp eq/ne A, Log2(const2) - Log2(const1)) Differential Revision: http://reviews.llvm.org/D4068 llvm-svn: 213678	2014-07-22 19:19:36 +00:00
Suyog Sarda	2092947078	Added InstCombine transform for pattern "(A & B) ^ (A ^ B) -> (A \| B)" Patch idea by Ankit Jain ! Differential Revision: http://reviews.llvm.org/D4618 llvm-svn: 213677	2014-07-22 18:30:54 +00:00
Suyog Sarda	65dba610e3	Added InstCombine Transform for patterns: "((~A & B) \| A) -> (A \| B)" and "((A & B) \| ~A) -> (~A \| B)" Original Patch credit to Ankit Jain !! Differential Revision: http://reviews.llvm.org/D4591 llvm-svn: 213676	2014-07-22 18:09:41 +00:00
Dan Liew	f94690ae86	Revert "Treat warnings in Sphinx as errors. The reasons for doing this are..." This reverts commit r213661. Reverting at the request of Sean Silva. llvm-svn: 213675	2014-07-22 18:09:17 +00:00
Dan Liew	44ce1f4a26	Add LLVM_TOOLS_BINARY_DIR variable to LLVMConfig.cmake so clients of LLVM using CMake can easily find the tools directory. LLVM_BUILD_TOOLS_BINARY_DIR was removed because it is now superfluous. llvm-svn: 213674	2014-07-22 17:48:51 +00:00
Alexey Samsonov	c6b197bed0	[ASan] Fix comments about __sanitizer_cov function llvm-svn: 213673	2014-07-22 17:46:09 +00:00
Hal Finkel	3c4b506191	Make use of the align parameter attribute for all pointer arguments We previously supported the align attribute on all (pointer) parameters, but we only used it for byval parameters. However, it is completely consistent at the IR level to treat 'align n' on all pointer parameters as an alignment assumption on the pointer, and now we wll. Specifically, this causes computeKnownBits to use the align attribute on all pointer parameters, not just byval parameters. I've also added an explicit parameter attribute test for this to test/Bitcode/attributes.ll. And I've updated the LangRef to document the align parameter attribute (as it turns out, it was not documented at all previously, although the byval documentation mentioned that it could be used). There are (at least) two benefits to doing this: - It allows enhancing alignment based on the pointer alignment after inlining callees. - It allows simplification of pointer arithmetic. llvm-svn: 213670	2014-07-22 16:58:55 +00:00
Tim Northover	832d000766	X86: drop relocations on __eh_frame sections globally. Without this, we produce non-extern relocations when targeting older OS X versions that ld64 can't cope with in the particular context of __eh_frame sections (who'd want generic relocation-processing anyway?). This means that an updated linker (ld64 from Xcode 3.2.6 or later) may be needed when targeting such platforms with a modern version of LLVM, but this is probably the case anyway and a reasonable requirement. PR20212, rdar://problem/17544795 llvm-svn: 213665	2014-07-22 15:47:09 +00:00
Dan Liew	5c83480dd0	Export LLVM_ENABLE_RTTI and LLVM_ENABLE_EH in LLVMConfig.cmake so clients of LLVM know if RTTI and/or EH were enabled in the build of LLVM they are trying to link against. llvm-svn: 213664	2014-07-22 15:41:33 +00:00
Dan Liew	59dfdd0290	Added LLVM_ENABLE_RTTI and LLVM_ENABLE_EH options that allow RTTI and EH to globally be controlled. Individual targets (e.g. ExceptionDemo) can still override this by using LLVM_REQUIRE_RTTI and LLVM_REQUIRE_EH if they need to be compiled with RTTI or exception handling respectively. llvm-svn: 213663	2014-07-22 15:41:18 +00:00
Suyog Sarda	7289a7b99e	This patch implements transform for pattern "(A \| B) ^ (~A) -> (A \| ~B)". Patch Credit to Ankit Jain !! Differential Revision: http://reviews.llvm.org/D4588 llvm-svn: 213662	2014-07-22 15:37:39 +00:00
Dan Liew	fdf2d511df	Treat warnings in Sphinx as errors. The reasons for doing this are... - When CMake builds the documentation with sphinx-build it treats warnings as errors. We should be consistent with what we do in CMake. - Having warnings treated as errors will hopefully encourage developers to write documentation correctly. llvm-svn: 213661	2014-07-22 15:07:35 +00:00
Dan Liew	9a81a597c9	Fix Sphinx warning. llvm-svn: 213660	2014-07-22 14:59:38 +00:00
Peter Zotov	f79b84b24e	[OCaml] Don't truncate constants over 32 bits in Llvm.const_int. llvm-svn: 213655	2014-07-22 13:55:20 +00:00
Sasa Stankovic	6c6f1ac7c2	[mips] Fix two patterns that select i32's (for MIPS32r6) / i64's (for MIPS64r6) from setne comparison with an i32. The patterns that are fixed: * (select (i32 (setne i32, immZExt16)), i32, i32) (for MIPS32r6) * (select (i32 (setne i32, immZExt16)), i64, i64) (for MIPS64r6) llvm-svn: 213653	2014-07-22 13:36:02 +00:00
Elena Demikhovsky	50a62c2883	AVX-512: Fixed intrinsic of VSQRTPS/PD instructions. I set number and types of parameters according to GCC intrinsics. llvm-svn: 213640	2014-07-22 11:07:31 +00:00
Sanjay Patel	24f9331065	fixed typo in comment llvm-svn: 213614	2014-07-22 04:57:06 +00:00
Chandler Carruth	e861e0b7f6	[SDAG] Refactor the code for inserting a newly allocated SDNode into the DAG into a helper function. This adds a trip through the (very minimal) verification logic in a bunch of places that were missing it, but shouldn't have any other impact outside of refactoring. I'm hoping to use this to do more clever things when DAG nodes are inserted into the graph. llvm-svn: 213612	2014-07-22 04:07:55 +00:00
Chandler Carruth	c75e5f8575	[SDAG] Remove a giant pile of asserts that may have helped track down a bug in 2010 when they were added but are adding no value today. In fact, they are utter lies. NodeAllocator is used to allocate almost all of these node types. I don't know what we were trying to assert here, and the docs don't give any answer. Until we once again stumble upon a bug needing help, let's clear the path for improvements. llvm-svn: 213610	2014-07-22 04:03:22 +00:00
Bill Wendling	dc721ccb03	Add openmp to the list of tagged things. llvm-svn: 213608	2014-07-22 03:17:30 +00:00

... 5 6 7 8 9 ...

106106 Commits