llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 16:33:37 +01:00

Author	SHA1	Message	Date
Benjamin Kramer	786f7671ab	Switch the select to branch transformation on by default. The primitive conservative heuristic seems to give a slight overall improvement while not regressing stuff. Make it available to wider testing. If you notice any speed regressions (or significant code size regressions) let me know! llvm-svn: 156258	2012-05-06 14:25:16 +00:00
Jakub Staszak	b3bddb41cb	Remove trailing spaces. llvm-svn: 156257	2012-05-06 13:52:31 +00:00
NAKAMURA Takumi	9ae88f2f80	Unix/Process.inc: Give more useful random seed to srand. Workaround for PR12743. llvm-svn: 156252	2012-05-06 08:24:24 +00:00
NAKAMURA Takumi	a7d133afee	Support/Process: Move llvm::sys::Process::GetRandomNumber() from Process.cpp to Unix/Process.inc. FIXME: GetRandomNumber() is not implemented in Win32. llvm-svn: 156251	2012-05-06 08:24:18 +00:00
Chris Lattner	cf8284517f	reapply my patch, with a fix for an off-by-one error. Turned out to be a lot of work for a drive-by fix :) llvm-svn: 156246	2012-05-05 22:17:32 +00:00
Chris Lattner	206bf447c0	revert my patches, which are causing problems. llvm-svn: 156245	2012-05-05 22:11:04 +00:00
Chris Lattner	372a67f8d9	add missing header <shame> llvm-svn: 156244	2012-05-05 22:04:11 +00:00
Chris Lattner	4c8c651c04	refactor some code to expose column numbers more and make diagnostic printing slightly more efficient. llvm-svn: 156243	2012-05-05 21:39:51 +00:00
Jim Grosbach	f7461026c2	Nuke a few dead remnants of the CBE. llvm-svn: 156241	2012-05-05 17:45:12 +00:00
Daniel Dunbar	af0500eb5d	[Support] Add missing include. llvm-svn: 156240	2012-05-05 16:49:11 +00:00
Daniel Dunbar	d7d85c4a85	[Support] Fix up comments. llvm-svn: 156239	2012-05-05 16:39:22 +00:00
Daniel Dunbar	6fbad750f3	[Support] Rewrite sys::fs::unique_file to not be stupid with /dev/urandom. - Just use sys::Process::GetRandomNumber instead of having two poor implementations. - This is ~70 times (!) faster on my OS X machine. llvm-svn: 156238	2012-05-05 16:36:24 +00:00
Daniel Dunbar	457eab2ad7	[Support] Add sys::Process::GetRandomNumber(). - Primitive API, but we rarely have need for random numbers. llvm-svn: 156237	2012-05-05 16:36:20 +00:00
Daniel Dunbar	c69f7c9234	[build] Add build check for ::arc4random(). llvm-svn: 156236	2012-05-05 16:36:16 +00:00
Benjamin Kramer	97de009760	Update all outdated autoconf files in the sample project. We might just use symlinks here, but I'm afraid of possible portability issues. llvm-svn: 156235	2012-05-05 15:02:39 +00:00
Benjamin Kramer	0463564612	CodeGenPrepare: Add a transform to turn selects into branches in some cases. This came up when a change in block placement formed a cmov and slowed down a hot loop by 50%: ucomisd (%rdi), %xmm0 cmovbel %edx, %esi cmov is a really bad choice in this context because it doesn't get branch prediction. If we emit it as a branch, an out-of-order CPU can do a better job (if the branch is predicted right) and avoid waiting for the slow load+compare instruction to finish. Of course it won't help if the branch is unpredictable, but those are really rare in practice. This patch uses a dumb conservative heuristic, it turns all cmovs that have one use and a direct memory operand into branches. cmovs usually save some code size, so we disable the transform in -Os mode. In-Order architectures are unlikely to benefit as well, those are included in the "predictableSelectIsExpensive" flag. It would be better to reuse branch probability info here, but BPI doesn't support select instructions currently. It would make sense to use the same heuristics as the if-converter pass, which does the opposite direction of this transform. Test suite shows a small improvement here and there on corei7-level machines, but the actual results depend a lot on the used microarchitecture. The transformation is currently disabled by default and available by passing the -enable-cgp-select2branch flag to the code generator. Thanks to Chandler for the initial test case to him and Evan Cheng for providing me with comments and test-suite numbers that were more stable than mine :) llvm-svn: 156234	2012-05-05 12:49:22 +00:00
Benjamin Kramer	7a9528b540	Add a new target hook "predictableSelectIsExpensive". This will be used to determine whether it's profitable to turn a select into a branch when the branch is likely to be predicted. Currently enabled for everything but Atom on X86 and Cortex-A9 devices on ARM. I'm not entirely happy with the name of this flag, suggestions welcome ;) llvm-svn: 156233	2012-05-05 12:49:14 +00:00
Benjamin Kramer	39afd32d88	NVPTX: Initialize the UseF32FTZ flag. llvm-svn: 156232	2012-05-05 11:22:02 +00:00
Stepan Dyatkovskiy	469935e0ae	Small fix in InstCombineCasts.cpp. Restored "alloca + bitcast" reducing for case when alloca's size is calculated within the "add/sub/... nsw". Also added fix to 2011-06-13-nsw-alloca.ll test. llvm-svn: 156231	2012-05-05 07:09:40 +00:00
Eric Christopher	d0a426dbe4	Typo. llvm-svn: 156226	2012-05-05 01:16:06 +00:00
Jakob Stoklund Olesen	fb40ced513	Order register classes by spill size first, members last. This is still a topological ordering such that every register class gets a smaller enum value than its sub-classes. Placing the smaller spill sizes first makes a difference for the super-register class bit masks. When looking for a super-register class, we usually want the smallest possible kind of super-register. That is now available as the first bit set in the bit mask. llvm-svn: 156222	2012-05-04 23:12:22 +00:00
Jakob Stoklund Olesen	90ad9e9f13	Make sure findRepresentativeClass picks the widest super-register. We want the representative register class to contain the largest super-registers available. This makes the function less sensitive to the register class numbering. llvm-svn: 156220	2012-05-04 22:53:28 +00:00
Jakob Stoklund Olesen	c169683227	Remove extra comma in debug output. llvm-svn: 156219	2012-05-04 22:53:26 +00:00
David Blaikie	4f57670dab	Fix warnings in release build. This fixes a couple of Clang warnings in release builds of LLVM: * Missing return in ISelLowering * Unused variable in NVPTXutil.cpp llvm-svn: 156216	2012-05-04 22:34:16 +00:00
Kevin Enderby	7bc52bcfad	Tweak to the fix in r156212, as with the change in removing the shift the SignExtend32<22>(Val<<1) also needs to change to SignExtend32<21>(Val) . llvm-svn: 156213	2012-05-04 22:09:52 +00:00
Kevin Enderby	8c41cffe0b	Fix a bug in the ARM disassembler for wide branch conditional instructions where the symbolic operand's displacement was incorrectly shifted left by 1. rdar://11387046 llvm-svn: 156212	2012-05-04 22:02:27 +00:00
Chandler Carruth	856e83e1c1	Fix a Clang warning in the new NVPTX backend: In file included from ../lib/Target/NVPTX/VectorElementize.cpp:53: ../lib/Target/NVPTX/NVPTX.h:44:3: warning: default label in switch which covers all enumeration values [-Wcovered-switch-default] default: assert(0 && "Unknown condition code"); ^ 1 warning generated. The prevailing pattern in LLVM is to not use a default label, and instead to use llvm_unreachable to denote that the switch in fact covers all return paths from the function. llvm-svn: 156209	2012-05-04 21:35:49 +00:00
Chandler Carruth	51819a2bcf	Teach the code extractor how to extract a sequence of blocks from RegionInfo's RegionNode. This mirrors the logic for automating the extraction from a Loop. llvm-svn: 156208	2012-05-04 21:33:30 +00:00
Chandler Carruth	4478e73ac5	Rename the Region::block_iterator to Region::block_node_iterator, and add a new Region::block_iterator which actually iterates over the basic blocks of the region. The old iterator, now call 'block_node_iterator' iterates over RegionNodes which contain a single basic block. This works well with the GraphTraits-based iterator design, however most users actually want an iterator over the BasicBlocks inside these RegionNodes. Now the 'block_iterator' is a wrapper which exposes exactly this interface. Internally it uses the block_node_iterator to walk all nodes which are single basic blocks, but transparently unwraps the basic block to make user code simpler. While this patch is a bit of a wash, most of the updates are to internal users, not external users of the RegionInfo. I have an accompanying patch to Polly that is a strict simplification of every user of this interface, and I'm working on a pass that also wants the same simplified interface. This patch alone should have no functional impact. llvm-svn: 156202	2012-05-04 20:55:23 +00:00
Justin Holewinski	4ca961430f	This patch adds a new NVPTX back-end to LLVM which supports code generation for NVIDIA PTX 3.0. This back-end will (eventually) replace the current PTX back-end, while maintaining compatibility with it. The new target machines are: nvptx (old ptx32) => 32-bit PTX nvptx64 (old ptx64) => 64-bit PTX The sources are based on the internal NVIDIA NVPTX back-end, and contain more functionality than the current PTX back-end currently provides. NV_CONTRIB llvm-svn: 156196	2012-05-04 20:18:50 +00:00
Sebastian Pop	2b868d474e	Added missing CMN case in Thumb2SizeReduction pass so that LLVM emits 16-bits encoding of CMN instructions. llvm-svn: 156195	2012-05-04 19:53:56 +00:00
Preston Gurd	8de39bd4f6	Adds Intel Atom scheduling latencies to X86InstrSystem.td. llvm-svn: 156194	2012-05-04 19:26:37 +00:00
Matt Beaumont-Gay	c6b2d69140	Pacify GCC's -Wreturn-type llvm-svn: 156189	2012-05-04 18:34:27 +00:00
Chandler Carruth	8cdf727fc7	Factor the computation of input and output sets into a public interface of the CodeExtractor utility. This allows speculatively computing input and output sets to measure the likely size impact of the code extraction. These sets cannot be reused sadly -- we mutate the function prior to forming the final sets used by the actual extraction. The interface has been revamped slightly to make it easier to use correctly by making the interface const and sinking the computation of the number of exit blocks into the full extraction function and away from the rest of this logic which just computed two output parameters. llvm-svn: 156168	2012-05-04 11:20:27 +00:00
Chandler Carruth	b6a7c286f3	Rather than trying to gracefully handle input sequences with repeated blocks, assert that this doesn't happen. We don't want to bother trying to support this call pattern as it isn't necessary. llvm-svn: 156167	2012-05-04 11:17:06 +00:00
Chandler Carruth	a74f417e2e	Fix a goof with my previous commit by completely returning when we detect an in-eligible block rather than just breaking out of the loop. llvm-svn: 156166	2012-05-04 11:14:19 +00:00
Chandler Carruth	8ce878e46c	Hoist a safety assert from the extraction method into the construction of the extractor itself. llvm-svn: 156164	2012-05-04 10:26:45 +00:00
Chandler Carruth	67c334679c	Move the CodeExtractor utility to a dedicated header file / source file, and expose it as a utility class rather than as free function wrappers. The simple free-function interface works well for the bugpoint-specific pass's uses of code extraction, but in an upcoming patch for more advanced code extraction, they simply don't expose a rich enough interface. I need to expose various stages of the process of doing the code extraction and query information to decide whether or not to actually complete the extraction or give up. Rather than build up a new predicate model and pass that into these functions, just take the class that was actually implementing the functions and lift it up into a proper interface that can be used to perform code extraction. The interface is cleaned up and re-documented to work better in a header. It also is now setup to accept the blocks to be extracted in the constructor rather than in a method. In passing this essentially reverts my previous commit here exposing a block-level query for eligibility of extraction. That is no longer necessary with the more rich interface as clients can query the extraction object for eligibility directly. This will reduce the number of walks of the input basic block sequence by quite a bit which is useful if this enters the normal optimization pipeline. llvm-svn: 156163	2012-05-04 10:18:49 +00:00
Hans Wennborg	b3c41d012d	Make ARM and Mips use TargetMachine::getTLSModel() This moves the logic for selecting a TLS model to a single place, instead of the previous three (ARM, Mips, and X86 which already uses this function). llvm-svn: 156162	2012-05-04 09:40:39 +00:00
Craig Topper	88bf1f4404	Fix some loops to match coding standards. No functional change intended. llvm-svn: 156159	2012-05-04 06:39:13 +00:00
Craig Topper	3845ea5b9e	Fix up some spacing. No functional change. llvm-svn: 156158	2012-05-04 06:18:33 +00:00
Craig Topper	71aab70d71	Simplify broadcast lowering code. No functional change intended. llvm-svn: 156157	2012-05-04 05:49:51 +00:00
Craig Topper	6881f1067c	Allow v16i16 and v32i8 shuffles to be rewritten as narrower shuffles. llvm-svn: 156156	2012-05-04 04:44:49 +00:00
Bill Wendling	8661cdc0f4	Add 'landingpad' instructions to the list of instructions to ignore. Also combine the code in the 'assert' statement. llvm-svn: 156155	2012-05-04 04:22:32 +00:00
Craig Topper	f7516089b7	Simplify shuffle narrowing code a bit. No functional change intended. llvm-svn: 156154	2012-05-04 04:08:44 +00:00
Jakob Stoklund Olesen	7bdae32bfd	Remove the SubRegClasses field from RegisterClass descriptions. This information in now computed by TableGen. llvm-svn: 156152	2012-05-04 03:30:34 +00:00
Jakob Stoklund Olesen	2c4618568d	Remove TargetRegisterClass::SuperRegClasses. This manually enumerated list of super-register classes has been superceeded by the automatically computed super-register class masks available through SuperRegClassIterator. llvm-svn: 156151	2012-05-04 03:30:28 +00:00
Rafael Espindola	43ad1ac2ef	Pass -fcolor-diagnostics when it is supported. This makes a difference when using cmake+ninja, since ninja buffers the compiler output. llvm-svn: 156150	2012-05-04 03:23:36 +00:00
Jakob Stoklund Olesen	8fbea83a95	Use SuperRegClassIterator for findRepresentativeClass(). The masks returned by SuperRegClassIterator are computed automatically by TableGen. This is better than depending on the manually specified SuperRegClasses. llvm-svn: 156147	2012-05-04 02:19:22 +00:00
Jakob Stoklund Olesen	aeb874991d	Initialize SparcInstrInfo before SparcTargetLowering. The TargetLowering construction needs to use a valid TargetRegisterInfo instance. llvm-svn: 156146	2012-05-04 02:16:39 +00:00

1 2 3 4 5 ...

82114 Commits