llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Eli Bendersky	074c9e1b36	Cleanup formatting, comments and naming. llvm-svn: 169762	2012-12-10 20:13:43 +00:00
Akira Hatanaka	c10e48ba6a	[mips] Set HWEncoding field of registers. Use delete function getMipsRegisterNumbering and use MCRegisterInfo::getEncodingValue instead. llvm-svn: 169760	2012-12-10 20:04:40 +00:00
Eric Christopher	c67794597d	Use the somewhat semantic term "split dwarf" it more matches what's going on and makes a lot of the terminology in comments make more sense. llvm-svn: 169758	2012-12-10 19:51:21 +00:00
Eric Christopher	2bf7bdcd23	Delete the FissionCU. llvm-svn: 169757	2012-12-10 19:51:18 +00:00
Eric Christopher	67243c354a	Reorder fission variables. llvm-svn: 169756	2012-12-10 19:51:13 +00:00
Bill Wendling	bb1f8f293a	Don't use a red zone for code coverage if the user specified `-mno-red-zone'. The `-mno-red-zone' flag wasn't being propagated to the functions that code coverage generates. This allowed some of them to use the red zone when that wasn't allowed. <rdar://problem/12843084> llvm-svn: 169754	2012-12-10 19:46:49 +00:00
Nadav Rotem	196fc7cc8c	Add support for reverse induction variables. For example: while (i--) sum+=A[i]; llvm-svn: 169752	2012-12-10 19:25:06 +00:00
Eli Bendersky	9c8e9c6edd	This patch adds statistics for other non-DWARF fragments emitted by the assembler. This is useful in order to know how the numbers add up, since in particular the Align fragments account for a non-trivial portion of the emitted fragments (especially on -O0 which sets relax-all). llvm-svn: 169747	2012-12-10 18:59:39 +00:00
Hal Finkel	3b65689ab9	Use GetUnderlyingObjects in misched misched used GetUnderlyingObject in order to break false load/store dependencies, and the -enable-aa-sched-mi feature similarly relied on GetUnderlyingObject in order to ensure it is safe to use the aliasing analysis. Unfortunately, GetUnderlyingObject does not recurse through phi nodes, and so (especially due to LSR) all of these mechanisms failed for induction-variable-dependent loads and stores inside loops. This change replaces uses of GetUnderlyingObject with GetUnderlyingObjects (which will recurse through phi and select instructions) in misched. Andy reviewed, tested and simplified this patch; Thanks! llvm-svn: 169744	2012-12-10 18:49:16 +00:00
Chandler Carruth	7e4aad1c1f	Revert "Make '-mtune=x86_64' assume fast unaligned memory accesses." Accidental commit... git svn betrayed me. Sorry for the noise. llvm-svn: 169741	2012-12-10 18:23:52 +00:00
Chandler Carruth	a64587b996	Make '-mtune=x86_64' assume fast unaligned memory accesses. Summary: Not all chips targeted by x86_64 have this feature, but a dramatically increasing number do. Specifying a chip-specific tuning parameter will continue to turn the feature on or off as appropriate for that particular chip, but the generic flag should try to achieve the best performance on the most widely available hardware. Today, the number of chips with fast UA access dwarfs those without in the x86-64 space. Note that this also brings LLVM's code generation for this '-march' flag more in line with that of modern GCCs. CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D195 llvm-svn: 169740	2012-12-10 18:22:42 +00:00
Chandler Carruth	d5a4e512ca	Fix a typo in my previous commit -- bloomfield is 0x1A not 0x2A. Thanks to the PaX folks for noticing in review! We need some tests here, any sugestions welcome... llvm-svn: 169739	2012-12-10 18:22:40 +00:00
Chandler Carruth	3e5f2c328d	Address a FIXME and update the fast unaligned memory feature for newer Intel chips. The model number rules were determined by inspecting Intel's documentation for their newer chip model numbers. My understanding is that all of the newer Intel chips have fast unaligned memory access, but if anyone is concerned about a particular chip, just shout. No tests updated; it's not clear we have dedicated tests for the chips' various features, but if anyone would like tests (or can point me at some existing ones), I'm happy to oblige. llvm-svn: 169730	2012-12-10 09:18:44 +00:00
Chandler Carruth	4686de879c	Add a new visitor for walking the uses of a pointer value. This visitor provides infrastructure for recursively traversing the use-graph of a pointer-producing instruction like an alloca or a malloc. It maintains a worklist of uses to visit, so it can handle very deep recursions. It automatically looks through instructions which simply translate one pointer to another (bitcasts and GEPs). It tracks the offset relative to the original pointer as long as that offset remains constant and exposes it during the visit as an APInt offset. Finally, it performs conservative escape analysis. However, currently it has some limitations that should be addressed going forward: 1) It doesn't handle vectors of pointers. 2) It doesn't provide a cheaper visitor when the constant offset tracking isn't needed. 3) It doesn't support non-instruction pointer values. The current functionality is exactly what is required to implement the SROA pointer-use visitors in terms of this one, rather than in terms of their own ad-hoc base visitor, which was always very poorly specified. SROA has been converted to use this, and the code there deleted which this utility now provides. Technically speaking, using this new visitor allows SROA to handle a few more cases than it previously did. It is now more aggressive in ignoring chains of instructions which look like they would defeat SROA, but in fact do not because they never result in a read or write of memory. While this is "neat", it shouldn't be interesting for real programs as any such chains should have been removed by others passes long before we get to SROA. As a consequence, I've not added any tests for these features -- it shouldn't be part of SROA's contract to perform such heroics. The goal is to extend the functionality of this visitor going forward, and re-use it from passes like ASan that can benefit from doing a detailed walk of the uses of a pointer. Thanks to Ben Kramer for the code review rounds and lots of help reviewing and debugging this patch. llvm-svn: 169728	2012-12-10 08:28:39 +00:00
Craig Topper	0f4945c76d	Teach DAG combine to handle vector add/sub with vectors of all 0s. llvm-svn: 169727	2012-12-10 08:12:29 +00:00
NAKAMURA Takumi	10a9cdfc27	[CMake] Update dependencies to intrinsics_gen corresponding to r169711. llvm-svn: 169724	2012-12-10 05:27:15 +00:00
Chandler Carruth	c9b6bd9712	Fix PR14548: SROA was crashing on a mixture of i1 and i8 loads and stores. When SROA was evaluating a mixture of i1 and i8 loads and stores, in just a particular case, it would tickle a latent bug where we compared bits to bytes rather than bits to bits. As a consequence of the latent bug, we would allow integers through which were not byte-size multiples, a situation the later rewriting code was never intended to handle. In release builds this could trigger all manner of oddities, but the reported issue in PR14548 was forming invalid bitcast instructions. The only downside of this fix is that it makes it more clear that SROA in its current form is not capable of handling mixed i1 and i8 loads and stores. Sometimes with the previous code this would work by luck, but usually it would crash, so I'm not terribly worried. I'll watch the LNT numbers just to be sure. llvm-svn: 169719	2012-12-10 00:54:45 +00:00
Michael Ilseman	2f7539fd12	Reorganize FastMathFlags to be a wrapper around unsigned, and streamline some interfaces. llvm-svn: 169712	2012-12-09 21:12:04 +00:00
Paul Redmond	e43761293d	LoopVectorize: support vectorizing intrinsic calls - added function to VectorTargetTransformInfo to query cost of intrinsics - vectorize trivially vectorizable intrinsic calls such as sin, cos, log, etc. Reviewed by: Nadav llvm-svn: 169711	2012-12-09 20:42:17 +00:00
Michael Ilseman	92f7651045	Have the bitcode reader/writer just use FPMathOperator's fast math enum directly llvm-svn: 169710	2012-12-09 20:23:16 +00:00
Paul Redmond	b778deb83a	test commit. llvm-svn: 169709	2012-12-09 19:46:31 +00:00
Jakub Staszak	3375cd11b4	Use m_OneUse pattern instead of hasOneUse() method. No functionality change. llvm-svn: 169703	2012-12-09 16:06:44 +00:00
Jakub Staszak	30bae3f07e	Remove trailing spaces. llvm-svn: 169701	2012-12-09 15:37:46 +00:00
Chandler Carruth	1e72559a05	Switch SROA to pop Uses off the back of its visitors' queues. This will more closely match the behavior of the new PtrUseVisitor that I am adding. Hopefully this will not change the actual behavior in any way, but by making the processing order more similar help in debugging. llvm-svn: 169697	2012-12-09 11:56:01 +00:00
Craig Topper	c03f63d739	Remove extra blank line. llvm-svn: 169692	2012-12-09 08:20:52 +00:00
Shuxin Yang	7221b14d96	- Re-enable population count loop idiom recognization - fix a bug which cause sigfault. - add two testing cases which was causing crash llvm-svn: 169687	2012-12-09 03:12:46 +00:00
Craig Topper	a6f44fb06b	Teach DAG combine to handle vector logical operations with vectors of all 1s or all 0s. These cases can show up when vectors are split for legalizing. Fix some tests that were dependent on these cases not being combined. llvm-svn: 169684	2012-12-08 22:49:19 +00:00
Chandler Carruth	329a5c1e03	Revert the patches adding a popcount loop idiom recognition pass. There are still bugs in this pass, as well as other issues that are being worked on, but the bugs are crashers that occur pretty easily in the wild. Test cases have been sent to the original commit's review thread. This reverts the commits: r169671: Fix a logic error. r169604: Move the popcnt tests to an X86 subdirectory. r168931: Initial commit adding the pass. llvm-svn: 169683	2012-12-08 22:18:29 +00:00
Benjamin Kramer	c1a1be1aa4	Simplify code. Sort includes. No functionality change. llvm-svn: 169676	2012-12-08 10:45:24 +00:00
Shuxin Yang	d80db0a201	Fix an inadvertent typo error. llvm-svn: 169671	2012-12-08 05:00:59 +00:00
Chandler Carruth	762d7408db	Fix a use-after-free bug found by ASan. You can't assign a temporary std::string to a StringRef. Moreover, the method being called accepts a Twine to simplify these patterns. Fixes this ASan failure: ==6312== ERROR: AddressSanitizer: heap-use-after-free on address 0x7fd558b1af58 at pc 0xcb7529 bp 0x7fffff572080 sp 0x7fffff572078 READ of size 1 at 0x7fd558b1af58 thread T0 #0 0xcb7528 .../llvm/include/llvm/ADT/StringRef.h:192 llvm::StringRef::operator[]() #1 0x1d53c0a .../llvm/include/llvm/ADT/StringExtras.h:128 llvm::HashString() #2 0x1d53878 .../llvm/lib/Support/StringMap.cpp:64 llvm::StringMapImpl::LookupBucketFor() #3 0x1b6872f .../llvm/include/llvm/ADT/StringMap.h:352 llvm::StringMap<>::GetOrCreateValue<>() #4 0x1b61836 .../llvm/lib/MC/MCContext.cpp:109 llvm::MCContext::GetOrCreateSymbol() #5 0xe9fd47 .../llvm/lib/Target/ARM/MCTargetDesc/ARMELFStreamer.cpp:154 (anonymous namespace)::ARMELFStreamer::EmitMappingSymbol() #6 0xea01dd .../llvm/lib/Target/ARM/MCTargetDesc/ARMELFStreamer.cpp:133 (anonymous namespace)::ARMELFStreamer::EmitDataMappingSymbol() #7 0xe9f78b .../llvm/lib/Target/ARM/MCTargetDesc/ARMELFStreamer.cpp:91 (anonymous namespace)::ARMELFStreamer::EmitBytes() #8 0x1b15d82 .../llvm/lib/MC/MCStreamer.cpp:89 llvm::MCStreamer::EmitIntValue() #9 0xcc0f9b .../llvm/lib/Target/ARM/ARMAsmPrinter.cpp:713 llvm::ARMAsmPrinter::emitAttributes() #10 0xcc0d44 .../llvm/lib/Target/ARM/ARMAsmPrinter.cpp:632 llvm::ARMAsmPrinter::EmitStartOfAsmFile() #11 0x14692ad .../llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp:162 llvm::AsmPrinter::doInitialization() #12 0x1bc4677 .../llvm/lib/VMCore/PassManager.cpp:1561 llvm::FPPassManager::doInitialization() #13 0x1bc4990 .../llvm/lib/VMCore/PassManager.cpp:1595 llvm::MPPassManager::runOnModule() #14 0x1bc55e5 .../llvm/lib/VMCore/PassManager.cpp:1705 llvm::PassManagerImpl::run() #15 0x1bc5878 .../llvm/lib/VMCore/PassManager.cpp:1740 llvm::PassManager::run() #16 0xc3954d .../llvm/tools/llc/llc.cpp:378 compileModule() #17 0xc38001 .../llvm/tools/llc/llc.cpp:194 main #18 0x7fd557d6a11c __libc_start_main 0x7fd558b1af58 is located 24 bytes inside of 29-byte region [0x7fd558b1af40,0x7fd558b1af5d) freed by thread T0 here: #0 0xc337da .../llvm/projects/compiler-rt/lib/asan/asan_new_delete.cc:56 operator delete() #1 0x1ee9cef .../libstdc++-v3/include/bits/basic_string.h:535 std::string::~string() #2 0xea01dd .../llvm/lib/Target/ARM/MCTargetDesc/ARMELFStreamer.cpp:133 (anonymous namespace)::ARMELFStreamer::EmitDataMappingSymbol() #3 0xe9f78b .../llvm/lib/Target/ARM/MCTargetDesc/ARMELFStreamer.cpp:91 (anonymous namespace)::ARMELFStreamer::EmitBytes() #4 0x1b15d82 .../llvm/lib/MC/MCStreamer.cpp:89 llvm::MCStreamer::EmitIntValue() #5 0xcc0f9b .../llvm/lib/Target/ARM/ARMAsmPrinter.cpp:713 llvm::ARMAsmPrinter::emitAttributes() #6 0xcc0d44 .../llvm/lib/Target/ARM/ARMAsmPrinter.cpp:632 llvm::ARMAsmPrinter::EmitStartOfAsmFile() #7 0x14692ad .../llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp:162 llvm::AsmPrinter::doInitialization() #8 0x1bc4677 .../llvm/lib/VMCore/PassManager.cpp:1561 llvm::FPPassManager::doInitialization() #9 0x1bc4990 .../llvm/lib/VMCore/PassManager.cpp:1595 llvm::MPPassManager::runOnModule() #10 0x1bc55e5 .../llvm/lib/VMCore/PassManager.cpp:1705 llvm::PassManagerImpl::run() #11 0x1bc5878 .../llvm/lib/VMCore/PassManager.cpp:1740 llvm::PassManager::run() #12 0xc3954d .../llvm/tools/llc/llc.cpp:378 compileModule() #13 0xc38001 .../llvm/tools/llc/llc.cpp:194 main #14 0x7fd557d6a11c __libc_start_main llvm-svn: 169668	2012-12-08 03:10:14 +00:00
Jim Grosbach	0855006f1e	Add C API for specifying CPU to the disassembler. It was a nasty oversight that we didn't include this when we added this API in the first place. Blech. rdar://12839439 llvm-svn: 169653	2012-12-07 23:53:27 +00:00
Bill Wendling	3f153ce37b	s/AttrListPtr/AttributeSet/g to better label what this class is going to be in the near future. llvm-svn: 169651	2012-12-07 23:16:57 +00:00
Eli Bendersky	05445f1b5a	Make the contents of encoded sections SmallVector<char, N> instead of SmallString. This makes it possible to use the length-erased SmallVectorImpl in the interface without imposing buffer size. Thus, the size of MCInstFragment is back down since a preallocated 8-byte contents buffer is enough. It would be generally a good idea to rid all the fragments of SmallString as contents, because a vector just makes more sense. llvm-svn: 169644	2012-12-07 22:06:56 +00:00
Nadav Rotem	d3d6e7cfb2	When we use the BLEND instruction that uses the MSB as a mask, we can remove the VSRI instruction before it since it does not affect the MSB. Thanks Craig Topper for suggesting this. llvm-svn: 169638	2012-12-07 21:43:11 +00:00
Matthew Curtis	596e698b79	In hexagon convertToHardwareLoop, don't deref end() iterator In particular, check if MachineBasicBlock::iterator is end() before using it to call getDebugLoc(); See also this thread on llvm-commits: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20121112/155914.html llvm-svn: 169634	2012-12-07 21:03:15 +00:00
Eli Bendersky	3fad54c5f3	Refactor MCInstFragment and MCDataFragment to adhere to a common interface, which removes code duplication and prepares the ground for future additions. Full discussion: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20121203/158233.html llvm-svn: 169626	2012-12-07 19:13:57 +00:00
Nadav Rotem	fc6e3e3272	X86: Prefer using VPSHUFD over VPERMIL because it has better throughput. llvm-svn: 169624	2012-12-07 19:01:13 +00:00
Eli Bendersky	58ccdb5e69	Add separate statistics for Data and Inst fragments emitted during relaxation. Also fixes a test that was overly-sensitive to the exact order of statistics emitted. llvm-svn: 169619	2012-12-07 17:59:21 +00:00
Eli Bendersky	2bf8413fe8	Some common functionality from WinCOFFStreamer::EmitAssignment can be now delegated to MCObjectStreamer. llvm-svn: 169617	2012-12-07 17:55:28 +00:00
Eli Bendersky	8c2b2d7ccb	Lift EmitAssignment into MCObjectStreamer which gets rid of at least three duplicate implementations in format-specific streamers. llvm-svn: 169613	2012-12-07 17:42:41 +00:00
Tim Northover	2dbbd221f7	Added Mapping Symbols for ARM ELF Before this patch, when you objdump an LLVM-compiled file, objdump tried to decode data-in-code sections as if they were code. This patch adds the missing Mapping Symbols, as defined by "ELF for the ARM Architecture" (ARM IHI 0044D). Patch based on work by Greg Fitzgerald. llvm-svn: 169609	2012-12-07 16:50:23 +00:00
Logan Chien	4bcb7e5304	Split MCELFStreamer into a header file. llvm-svn: 169603	2012-12-07 15:50:40 +00:00
Evgeniy Stepanov	00819e335a	[msan] Remove readonly/readnone attributes from all called functions. MSan uses a TLS slot to pass shadow for function arguments and return values. This makes all instrumented functions not readonly, and at the same time requires that all callees of an instrumented function that may be MSan-instrumented do not have readonly attribute (otherwise some of the instrumentation may be optimized out). llvm-svn: 169591	2012-12-07 09:08:32 +00:00
Jakob Stoklund Olesen	5da1402f7a	Use the new MIBundleBuilder class in the Mips target. This is the preferred way of creating bundled machine instructions. llvm-svn: 169585	2012-12-07 04:23:40 +00:00
Jakob Stoklund Olesen	1386769d53	Add higher-level API for dealing with bundled MachineInstrs. This is still a work in progress. The purpose is to make bundling and unbundling operations explicit, and to catch errors where bundles are broken or created inadvertently. The old IsInsideBundle flag is replaced by two MI flags: BundledPred which has the same meaning as IsInsideBundle, and BundledSucc which is set on instructions that are bundled with a successor. Having two flags provdes redundancy to detect when a bundle is inadvertently torn by a splice() or insert(), and it makes it possible to write bundle iterators that don't need to peek at adjacent instructions. The new flags can't be manipulated directly (once setIsInsideBundle is gone). Instead there are MI functions to make and break bundle bonds. The setIsInsideBundle function will be removed in a future commit. It should be replaced by bundleWithPred(). llvm-svn: 169583	2012-12-07 04:23:29 +00:00
Akira Hatanaka	bb4c3cdc37	[mips] Delete nodes and instructions for dynamic alloca that are no longer in use. llvm-svn: 169580	2012-12-07 03:10:18 +00:00
Akira Hatanaka	57161323ed	[mips] Shorten predicate name. llvm-svn: 169579	2012-12-07 03:06:09 +00:00
Akira Hatanaka	43a7c61b16	[mips] Delete unused sub-target features. llvm-svn: 169578	2012-12-07 03:04:05 +00:00
Akira Hatanaka	9894b24617	[mips] Remove unnecessary predicates. llvm-svn: 169577	2012-12-07 03:01:24 +00:00
Chandler Carruth	9290708acb	Add support to ValueTracking for determining that a pointer is non-null by virtue of inbounds GEPs that preclude a null pointer. This is a very common pattern in the code generated by std::vector and other standard library routines which use allocators that test for null pervasively. This is one step closer to teaching Clang+LLVM to be able to produce an empty function for: void f() { std::vector<int> v; v.push_back(1); v.push_back(2); v.push_back(3); v.push_back(4); } Which is related to getting them to completely fold SmallVector push_back sequences into constants when inlining and other optimizations make that a possibility. llvm-svn: 169573	2012-12-07 02:08:58 +00:00
Matt Beaumont-Gay	dd419ee85c	Add a 'using' declaration to suppress GCC's -Woverloaded-virtual while we decide what pattern we want to follow in the future. llvm-svn: 169561	2012-12-06 23:15:36 +00:00
Pedro Artigas	8e61717b97	fixed valgrind issues of prior commit, this change applies r169456 changes back to the tree with fixes. on darwin no valgrind issues exist in the tests that used to fail. original change description: change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169553	2012-12-06 22:12:44 +00:00
Jakub Staszak	4c754ba4e3	Remove unused field. llvm-svn: 169551	2012-12-06 22:08:59 +00:00
Jakub Staszak	b3e92a1e9c	Remove trailing spaces. llvm-svn: 169550	2012-12-06 21:57:16 +00:00
Evan Cheng	4cdc6c4eef	Replace r169459 with something safer. Rather than having computeMaskedBits to understand target implementation of any_extend / extload, just generate zero_extend in place of any_extend for liveouts when the target knows the zero_extend will be implicit (e.g. ARM ldrb / ldrh) or folded (e.g. x86 movz). rdar://12771555 llvm-svn: 169536	2012-12-06 19:13:27 +00:00
Jakub Staszak	6a77423b5f	Remove unneeded function, since PR8156 was fixed over a year ago. llvm-svn: 169534	2012-12-06 19:05:46 +00:00
Jakub Staszak	b18925a384	Simplify code. llvm-svn: 169521	2012-12-06 18:22:59 +00:00
Nadav Rotem	5b43aa0b29	Fix a bug in the code that merges consecutive stores. Previously we did not check if loads that happen in between stores alias with the first store in the chain, only with the second store onwards. llvm-svn: 169516	2012-12-06 17:34:13 +00:00
NAKAMURA Takumi	8847f19308	MemorySanitizer.cpp: Suppress a warning. [-Wunused-variable] llvm-svn: 169504	2012-12-06 13:38:00 +00:00
Evgeniy Stepanov	1ad21b4153	[msan] Fix a typo in a comment. llvm-svn: 169491	2012-12-06 11:58:59 +00:00
Evgeniy Stepanov	97d6933f46	[msan] Do not store origin for clean values. Instead of unconditionally storing origin with every application store, only do this when the shadow of the stored value is != 0. This change also delays instrumentation of stores until after the walk over function's instructions, because adding new basic blocks confuses InstVisitor. We only keep 1 origin value per 4 bytes of application memory. This change fixes the bug when a store of a single clean byte wiped the origin for the whole 4-byte area. Since stores of uninitialized values are relatively uncommon, this change improves performance of track-origins mode by 5% median and by up to 47% on specs. llvm-svn: 169490	2012-12-06 11:41:03 +00:00
Bill Wendling	9cb80ecabb	s/getLowerBoundDefault/getDefaultLowerBound/ for consistency. Also put the more natural check first in the if-then statement. llvm-svn: 169486	2012-12-06 07:55:19 +00:00
Bill Wendling	979b24c6ec	Handle non-default array bounds. Some languages, e.g. Ada and Pascal, allow you to specify that the array bounds are different from the default (1 in these cases). If we have a lower bound that's non-default, then we emit the lower bound. We also calculate the correct upper bound in those cases. llvm-svn: 169484	2012-12-06 07:38:10 +00:00
Craig Topper	e907c5dd53	Remove intrinsic specific instructions for (V)MOVQUmr with patterns pointing to the normal instructions. llvm-svn: 169482	2012-12-06 07:31:16 +00:00
Craig Topper	32786c4c50	Mark MOVDQ(A/U)rm as ReMaterializable. Mark all MOVDQ(A/U) instructions as neverHasSideEffects. llvm-svn: 169477	2012-12-06 06:49:16 +00:00
NAKAMURA Takumi	79cb136d31	Revert r169456, "change MCContext to work on the doInitialization/doFinalization model" It broke many builders. llvm-svn: 169462	2012-12-06 02:00:13 +00:00
Chad Rosier	c7e4217273	[arm fast-isel] Make the fast-isel implementation of memcpy respect alignment. rdar://12821569 llvm-svn: 169460	2012-12-06 01:34:31 +00:00
Evan Cheng	c1db873871	Let targets provide hooks that compute known zero and ones for any_extend and extload's. If they are implemented as zero-extend, or implicitly zero-extend, then this can enable more demanded bits optimizations. e.g. define void @foo(i16* %ptr, i32 %a) nounwind { entry: %tmp1 = icmp ult i32 %a, 100 br i1 %tmp1, label %bb1, label %bb2 bb1: %tmp2 = load i16* %ptr, align 2 br label %bb2 bb2: %tmp3 = phi i16 [ 0, %entry ], [ %tmp2, %bb1 ] %cmp = icmp ult i16 %tmp3, 24 br i1 %cmp, label %bb3, label %exit bb3: call void @bar() nounwind br label %exit exit: ret void } This compiles to the followings before: push {lr} mov r2, #0 cmp r1, #99 bhi LBB0_2 @ BB#1: @ %bb1 ldrh r2, [r0] LBB0_2: @ %bb2 uxth r0, r2 cmp r0, #23 bhi LBB0_4 @ BB#3: @ %bb3 bl _bar LBB0_4: @ %exit pop {lr} bx lr The uxth is not needed since ldrh implicitly zero-extend the high bits. With this change it's eliminated. rdar://12771555 llvm-svn: 169459	2012-12-06 01:28:01 +00:00
Pedro Artigas	e62d631d01	change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169456	2012-12-06 00:50:55 +00:00
Bill Wendling	7119fab4de	Set the 'MadeChange' variable if we are deleting blocks. llvm-svn: 169455	2012-12-06 00:30:20 +00:00
Michael Ilseman	bd2183badf	Have CannotBeNegativeZero() be aware of the nsz fast-math flag llvm-svn: 169452	2012-12-06 00:07:09 +00:00
Andrew Trick	1a2b4ddd59	RegPressureTracker::dump(): Remove unnecessary argument. llvm-svn: 169443	2012-12-05 23:05:22 +00:00
Eli Bendersky	7b68fe1bd8	Change std::vector to SmallVector<4> and remove some unused methods. This is more consistent with other vectors in this code. In addition, I ran some tests compiling a large program and >96% of fragments have 4 or less fixups, so SmallVector<4> is a good optimization. llvm-svn: 169433	2012-12-05 22:11:02 +00:00
Jyotsna Verma	9995a62d05	Define new-value store instructions with base+immediate addressing mode using multiclass. llvm-svn: 169432	2012-12-05 22:02:56 +00:00
Bill Wendling	826cff4dbc	Fix name. The array is unboundED. llvm-svn: 169428	2012-12-05 21:43:30 +00:00
Andrew Trick	69c3ea61f2	RegisterPressureTracker: fix findUseBetween to handle DebugValue llvm-svn: 169427	2012-12-05 21:37:50 +00:00
Andrew Trick	4360a22194	RegisterPressureTracker: unify virtual registers and physical regunits. Now that live register units are tracked individually, the code can be simplified. llvm-svn: 169426	2012-12-05 21:37:47 +00:00
Andrew Trick	67c084b14c	RegisterPresssureTracker: Track live physical register by unit. This is much simpler to reason about, more efficient, and fixes some corner cases involving implicit super-register defs. Fixed rdar://12797931. llvm-svn: 169425	2012-12-05 21:37:42 +00:00
Nadav Rotem	6dac3b0c66	Cost Model: change the default cost of control flow instructions (br / ret / ...) to zero. llvm-svn: 169423	2012-12-05 21:21:26 +00:00
David Sehr	f67cd34524	Correct ARM NOP encoding The encoding of NOP in ARMAsmBackend.cpp is missing a trailing zero, which causes the emission of a coprocessor instruction rather than "mov r0, r0" as indicated in the comment. The test also checks for the wrong encoding. http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20121203/157919.html llvm-svn: 169420	2012-12-05 21:01:27 +00:00
Justin Holewinski	22f8d09057	[NVPTX] Fix crash with unnamed struct arguments Patch by Eric Holk llvm-svn: 169418	2012-12-05 20:50:28 +00:00
Jyotsna Verma	c8d145f6f6	Use multiclass to define store instructions with base+immediate offset addressing mode and immediate stored value. llvm-svn: 169408	2012-12-05 19:32:03 +00:00
Bob Wilson	29106bfeff	Adjust JIT target triple on OS X to match the current architecture. For OS X builds, we generate one version of config.h but then build for multiple architectures. This means that the LLVM_HOSTTRIPLE setting may have the wrong architecture. Adjust it dynamically to match the current architecture. <rdar://problem/12715470> llvm-svn: 169405	2012-12-05 19:09:13 +00:00
Matthew Curtis	67b48517cb	Fix misplaced closing brace. llvm-svn: 169404	2012-12-05 19:00:34 +00:00
Benjamin Kramer	e66030212a	Try to unbreak the build on hosts that don't transitively pull in a definition for int64_t. Also use the portable (ugly) format string macros, for MSVC compatibility. llvm-svn: 169396	2012-12-05 18:31:11 +00:00
Jakob Stoklund Olesen	75ccdbb1dc	Remove unused MachineInstr constructors. A MachineInstr can only ever be constructed by CreateMachineInstr() and CloneMachineInstr(), and those factories don't use the removed constructors. llvm-svn: 169395	2012-12-05 18:27:39 +00:00
Kevin Enderby	94941df94f	Added a option to the disassembler to print immediates as hex. This is for the lldb team so most of but not all of the values are to be printed as hex with this option. Some small values like the scale in an X86 address were requested to printed in decimal without the leading 0x. There may be some tweaks need to places that may still be in decimal that they want in hex. Specially for arm. I made my best guess. Any tweaks from here should be simple. I also did the best I know now with help from the C++ gurus creating the cleanest formatImm() utility function and containing the changes. But if someone has a better idea to make something cleaner I'm all ears and game for changing the implementation. rdar://8109283 llvm-svn: 169393	2012-12-05 18:13:19 +00:00
Pedro Artigas	cd715362f7	- Added calls to doInitialization/doFinalization to immutable passes - fixed ordering of calls to doFinalization to be the reverse of the pass run order due to potential dependencies - fixed machine module info to operate in the doInitialization/doFinalization model, also fixes some FIXMEs reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169391	2012-12-05 17:12:22 +00:00
Evgeniy Stepanov	21dc8412a5	[msan] Instrument bswap intrinsic. llvm-svn: 169383	2012-12-05 14:39:55 +00:00
Evgeniy Stepanov	06e754c093	[msan] Initialize callbacks in runOnFunction as opposed to doInitialization. This mirrors the change in ASan & TSan done in r168864. llvm-svn: 169378	2012-12-05 13:14:33 +00:00
Evgeniy Stepanov	597a835dbc	[msan] Change linkage type of __msan_track_origins. LinkOnceODRLinkage globals may be removed in GlobalOpt if not used in the current module. llvm-svn: 169377	2012-12-05 12:49:41 +00:00
Elena Demikhovsky	fa1ab36215	Simplified BLEND pattern matching for shuffles. Generate VPBLENDD for AVX2 and VPBLENDW for v16i16 type on AVX2. llvm-svn: 169366	2012-12-05 09:24:57 +00:00
Andrew Trick	7da96db06b	Added RegisterPressureTracker::dump() for debugging. llvm-svn: 169359	2012-12-05 06:47:08 +00:00
Michael J. Spencer	c0bd8f95ce	Copy clang/Driver/<Option parsing stuff> to llvm. llvm-svn: 169344	2012-12-05 00:29:32 +00:00
Evan Cheng	f87900c4b1	Add x86 isel lowering logic to form bit test with inverted condition. e.g. x ^ -1. Patch by David Majnemer. rdar://12755626 llvm-svn: 169339	2012-12-05 00:10:38 +00:00
Matt Beaumont-Gay	fd804eb89d	Appease GCC's -Wparentheses. (TIL that Clang's -Wparentheses ignores 'x \|\| y && "foo"' on purpose. Neat.) llvm-svn: 169337	2012-12-04 23:54:02 +00:00
Bill Wendling	e6735aeca0	Split up the ParseOptionalAttrs method into three different methods for each class of attributes. This makes it much easier to check for errors and to reuse the code. llvm-svn: 169336	2012-12-04 23:40:58 +00:00
Nadav Rotem	4990f6a07b	LoopVectorizer: Increase the number of pointers that can be tested at runtime. If we cant prove statically that the pointers are disjoint then we add the runtime check. llvm-svn: 169334	2012-12-04 23:25:24 +00:00
Nadav Rotem	c58f94b665	Enable if-conversion during vectorization. llvm-svn: 169331	2012-12-04 22:59:52 +00:00
Evan Cheng	cae5a79f6b	ARM custom lower ctpop for vector types. Patch by Pete Couperus. llvm-svn: 169325	2012-12-04 22:41:50 +00:00
Nadav Rotem	452993ad1a	Fix a bug in vectorization of if-converted reduction variables. If the reduction variable is not used outside the loop then we ran into an endless loop. This change checks if we found the original PHI. llvm-svn: 169324	2012-12-04 22:40:22 +00:00
Jakob Stoklund Olesen	2db1b3c4ac	Speed up the AllocationOrder class a bit. Allow the central functions to be inlined, and use the argumentless isHint() function when possible. llvm-svn: 169319	2012-12-04 22:25:16 +00:00
Shuxin Yang	c390be6a5d	For rdar://12329730, last piece. This change attempts to simplify (X^Y) -> X or Y in the user's context if we know that only bits from X or Y are demanded. A minimized case is provided bellow. This change will simplify "t>>16" into "var1 >>16". ============================================================= unsigned foo (unsigned val1, unsigned val2) { unsigned t = val1 ^ 1234; return (t >> 16) \| t; // NOTE: t is used more than once. } ============================================================= Note that if the "t" were used only once, the expression would be finally optimized as well. However, with with this change, the optimization will take place earlier. Reviewed by Nadav, Thanks a lot! llvm-svn: 169317	2012-12-04 22:15:32 +00:00
David Blaikie	b56f347902	Comment change made in r169304 as requested by Eric Christopher. llvm-svn: 169315	2012-12-04 22:02:33 +00:00
Jyotsna Verma	52d66f1d1e	Define store instructions with base+register offset addressing mode using multiclass. llvm-svn: 169314	2012-12-04 21:58:25 +00:00
Bill Wendling	32e4544198	Use the 'count' attribute to calculate the upper bound of an array. The count attribute is more accurate with regards to the size of an array. It also obviates the upper bound attribute in the subrange. We can also better handle an unbound array by setting the count to -1 instead of the lower bound to 1 and upper bound to 0. llvm-svn: 169312	2012-12-04 21:34:03 +00:00
David Blaikie	567718b686	Reapply r160148 (reverted in r163570) fixing spurious breakpoints in modern GDB This reapplies the fix for PR13303 now with more justification. Based on my execution of the GDB 7.5 test suite this results in: expected passes: 16101 -> 20890 (+30%) unexpected failures: 4826 -> 637 (-77%) There are 23 checks that used to pass and now fail. They are all in gdb.reverse. Investigating a few looks like they were accidentally passing due to extra breakpoints being set by this bug. They're generally due to the difference in end location between gcc and clang, the test suite is trying to set breakpoints on the closing '}' that clang doesn't associate with any instructions. llvm-svn: 169304	2012-12-04 21:05:36 +00:00
Eli Bendersky	e13c0c6d00	Make NaCl naming consistent. The triple OSType is called NaCl and is represented textually as NativeClient. Also added a link to the native client project for readers unfamiliar with it. A Clang patch will follow shortly. llvm-svn: 169291	2012-12-04 18:37:26 +00:00
Nadav Rotem	4f22c83996	Add support for reduction variables when IF-conversion is enabled. llvm-svn: 169288	2012-12-04 18:17:33 +00:00
Jyotsna Verma	ec3f676ca1	Add patterns to define 'combine', 'tstbit', 'ct0/cl0' (count trailing/leading zeros) instructions. llvm-svn: 169287	2012-12-04 18:05:01 +00:00
Jyotsna Verma	0dd9064b5a	Add constant extender support to ALU32 instructions for V2. llvm-svn: 169284	2012-12-04 17:12:00 +00:00
Bill Schmidt	9d8cdcda41	This patch introduces initial-exec model support for thread-local storage on 64-bit PowerPC ELF. The patch includes code to handle external assembly and MC output with the integrated assembler. It intentionally does not support the "old" JIT. For the initial-exec TLS model, the ABI requires the following to calculate the address of external thread-local variable x: Code sequence Relocation Symbol ld 9,x@got@tprel(2) R_PPC64_GOT_TPREL16_DS x add 9,9,x@tls R_PPC64_TLS x The register 9 is arbitrary here. The linker will replace x@got@tprel with the offset relative to the thread pointer to the generated GOT entry for symbol x. It will replace x@tls with the thread-pointer register (13). The two test cases verify correct assembly output and relocation output as just described. PowerPC-specific selection node variants are added for the two instructions above: LD_GOT_TPREL and ADD_TLS. These are inserted when an initial-exec global variable is encountered by PPCTargetLowering::LowerGlobalTLSAddress(), and later lowered to machine instructions LDgotTPREL and ADD8TLS. LDgotTPREL is a pseudo that uses the same LDrs support added for medium code model's LDtocL, with a different relocation type. The rest of the processing is straightforward. llvm-svn: 169281	2012-12-04 16:18:08 +00:00
Chandler Carruth	a98c778194	Sort includes for all of the .h files under the 'lib' tree. These were missed in the first pass because the script didn't yet handle include guards. Note that the script is now able to handle all of these headers without manual edits. =] llvm-svn: 169224	2012-12-04 07:12:27 +00:00
Nadav Rotem	aa136f70ba	Give scalar if-converted blocks half the score because they are not always executed due to CF. llvm-svn: 169223	2012-12-04 07:11:52 +00:00
Chandler Carruth	10bce90160	Add a comment about the requirement that the Windows.h header be last. This comment has the dual effect of blocking reorderings with the sort_include script. llvm-svn: 169221	2012-12-04 07:04:57 +00:00
Bill Wendling	8ade948576	Add a 'count' field to the DWARF subrange. The count field is necessary because there isn't a difference between the 'lo' and 'hi' attributes for a one-element array and a zero-element array. When the count is '0', we know that this is a zero-element array. When it's >=1, then it's a normal constant sized array. When it's -1, then the array is unbounded. llvm-svn: 169218	2012-12-04 06:20:49 +00:00
Nadav Rotem	43d200ded1	Add the last part that is needed for vectorization of if-converted code. Added the code that actually performs the if-conversion during vectorization. We can now vectorize this code: for (int i=0; i<n; ++i) { unsigned k = 0; if (a[i] > b[i]) <------ IF inside the loop. k = k * 5 + 3; a[i] = k; <---- K is a phi node that becomes vector-select. } llvm-svn: 169217	2012-12-04 06:15:11 +00:00
Kostya Serebryany	caa1c534df	[asan] add experimental -asan-realign-stack option (true by default, which does not change the current behavior) llvm-svn: 169216	2012-12-04 06:14:01 +00:00
Matt Beaumont-Gay	3e68d7d342	Add 'using' declarations to suppress -Woverloaded-virtual warnings. llvm-svn: 169214	2012-12-04 05:41:27 +00:00
Jyotsna Verma	2878599f9d	Move all operand definitions into HexagonOperands.td llvm-svn: 169213	2012-12-04 05:00:31 +00:00
Jyotsna Verma	5743b854a0	Move generic Hexagon subtarget information into Hexagon.td llvm-svn: 169212	2012-12-04 04:29:16 +00:00
Shuxin Yang	ac685f44b0	rdar://12329730 (2nd part, revised) The type of shirt-right (logical or arithemetic) should remain unchanged when transforming "X << C1 >> C2" into "X << (C1-C2)" llvm-svn: 169209	2012-12-04 03:28:32 +00:00
Alexey Samsonov	84fd1cd1a4	ASan: add initial support for handling llvm.lifetime intrinsics in ASan - emit calls into runtime library that poison memory for local variables when their lifetime is over and unpoison memory when their lifetime begins. llvm-svn: 169200	2012-12-04 01:34:23 +00:00
Jakub Staszak	cd1920fdd8	Simplify code. No functionality change. llvm-svn: 169198	2012-12-04 01:00:52 +00:00
Manman Ren	40ba054405	Stack Alignment: when creating stack objects in MachineFrameInfo, make sure the alignment is clamped to TargetFrameLowering.getStackAlignment if the target does not support stack realignment or the option "realign-stack" is off. This will cause miscompile if the address is treated as aligned and add is replaced with or in DAGCombine. Added a bool StackRealignable to TargetFrameLowering to check whether stack realignment is implemented for the target. Also added a bool RealignOption to MachineFrameInfo to check whether the option "realign-stack" is on. rdar://12713765 llvm-svn: 169197	2012-12-04 00:52:33 +00:00
Jakub Staszak	ad1ddb8aa6	Use dyn_cast instead of isa and cast. No functionality change. llvm-svn: 169196	2012-12-04 00:50:06 +00:00
NAKAMURA Takumi	3ba9b62972	LoopVectorize.cpp: Suppress a warning. [-Wunused-variable] llvm-svn: 169195	2012-12-04 00:49:34 +00:00
NAKAMURA Takumi	350924d8cc	Fix whitespace. llvm-svn: 169194	2012-12-04 00:49:28 +00:00
Jakob Stoklund Olesen	9cd01b82ea	Remove the old TRI::ResolveRegAllocHint() and getRawAllocationOrder() hooks. These functions have been replaced by TRI::getRegAllocationHints() which provides the same capabilities. llvm-svn: 169192	2012-12-04 00:46:13 +00:00
Jakob Stoklund Olesen	305fb6fe6c	Remove VirtRegMap::getRegAllocPref(). Now that there can be multiple hint registers from targets, it doesn't make sense to have a function that returns 'the' preferred register. llvm-svn: 169190	2012-12-04 00:35:59 +00:00
Jakob Stoklund Olesen	d099763751	Use MRI::getSimpleHint() instead of getRegAllocPref() in remaining cases. Targets can provide multiple hints now, so getRegAllocPref() doesn't make sense any longer because it only returns one preferred register. Replace it with getSimpleHint() in the remaining heuristics. This function only llvm-svn: 169188	2012-12-04 00:30:22 +00:00
Manman Ren	4c21a13abd	Stack Alignment: move functions from header file MachineFrameInfo.h. No functional change for this commit. The follow-up patch will add more stuff to these functions. rdar://12713765 llvm-svn: 169186	2012-12-04 00:26:44 +00:00
NAKAMURA Takumi	5d8058a563	RuntimeDyld: Fix up r169178. MSVC doesn't like "or". llvm-svn: 169183	2012-12-04 00:08:14 +00:00
Shuxin Yang	f6948fd368	rdar://12329730 (2nd part) This change tries to simmplify E1 = " X >> C1 << C2" into : - E2 = "X << (C2 - C1)" if C2 > C1, or - E2 = "X >> (C1 - C2)" if C1 > C2, or - E2 = X if C1 == C2. Reviewed by Nadav. Thanks! llvm-svn: 169182	2012-12-04 00:04:54 +00:00
Jakob Stoklund Olesen	753c5da13c	Add VirtRegMap::hasKnownPreference(). Virtual registers with a known preferred register are prioritized by RAGreedy. This function makes the condition explicit without depending on getRegAllocPref(). llvm-svn: 169179	2012-12-03 23:23:50 +00:00
Akira Hatanaka	893e507de7	Runtime dynamic linker for MCJIT should support MIPS BigEndian architecture. This small change adds support for that. It will make all MCJIT tests pass in make-check on BigEndian platforms. Patch by Petar Jovanovic. llvm-svn: 169178	2012-12-03 23:12:19 +00:00
Akira Hatanaka	fc23893bc5	Classic JIT is still being supported by MIPS, along with MCJIT. This change adds endian-awareness to MipsJITInfo and emitWordLE in MipsCodeEmitter has become emitWord now to support both endianness. Patch by Petar Jovanovic. llvm-svn: 169177	2012-12-03 23:11:12 +00:00
Michael Ilseman	6ff2a88905	Minor tweaking to SmallVector static size. llvm-svn: 169176	2012-12-03 22:57:47 +00:00
Nadav Rotem	7a8cea8699	minor renaming, documentation and cleanups. llvm-svn: 169175	2012-12-03 22:57:09 +00:00
Akira Hatanaka	dfdb5c7406	Functions in MipsCodeEmitter.cpp that expand unaligned loads/stores are dead code. Removing it. Patch by Petar Jovanovic. llvm-svn: 169174	2012-12-03 22:51:22 +00:00
Jakob Stoklund Olesen	cfbf55a3fb	Use the new getRegAllocationHints() hook from AllocationOrder. This simplifies the hinting code quite a bit while making the targets easier to write at the same time. llvm-svn: 169173	2012-12-03 22:51:04 +00:00
Nadav Rotem	bf466188c6	constify the cost API llvm-svn: 169172	2012-12-03 22:47:12 +00:00
Nadav Rotem	6da9592bb1	IF-conversion: teach the cost-model how to grade if-converted loops. llvm-svn: 169171	2012-12-03 22:46:31 +00:00
Jakob Stoklund Olesen	86b7be3eac	Implement ARMBaseRegisterInfo::getRegAllocationHints(). This provides the same functionality as getRawAllocationOrder() for the even/odd hints, but without the many constant register arrays. llvm-svn: 169169	2012-12-03 22:35:35 +00:00
Jyotsna Verma	e17db34a1c	Define store instructions with base+immediate offset addressing mode using multiclass. llvm-svn: 169168	2012-12-03 22:26:28 +00:00
Michael J. Spencer	8d8b62ec5d	[Support] Make FileOutputBuffer work on Windows. llvm-svn: 169167	2012-12-03 22:09:52 +00:00
Michael J. Spencer	925e2a7203	[Support][FileSystem] Fix open mode in resize_file on Windows. llvm-svn: 169166	2012-12-03 22:09:31 +00:00
Michael J. Spencer	34411981cd	Revert the header sort on this file. "Windows.h" includes <Windows.h> which defines a bunch of stuff it shouldn't (even with all the restriction macros). We have no control over this file, so make it's scope as small as possible. llvm-svn: 169165	2012-12-03 22:07:00 +00:00
Pedro Artigas	80c84a9de4	moves doInitialization and doFinalization to the Pass class and removes some unreachable code in MachineModuleInfo reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169164	2012-12-03 21:56:57 +00:00

1 2 3 4 5 ...

57938 Commits