llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	63489eb1bd	Fix bug in TRI::getCommonSuperRegClass(). Test cases for this code are coming. It is not used for anything yet. llvm-svn: 156327	2012-05-07 21:59:31 +00:00
Owen Anderson	8adb0322ce	Teach DAG combine to fold x-x to 0.0 when unsafe FP math is enabled. llvm-svn: 156324	2012-05-07 20:51:25 +00:00
Owen Anderson	1e7a4f0f91	Teach reassociate to commute FMul's and FAdd's in order to canonicalize the order of their operands across instructions. This allows for greater CSE opportunities. llvm-svn: 156323	2012-05-07 20:47:23 +00:00
Preston Gurd	cd8273f842	Make IntelJITEvents and OProfileJIT as optional libraries and add optional library support to the llvm-build tool: - Add new command line parameter to llvm-build: “--enable-optional-libraries” - Add handing of new llvm-build library type “OptionalLibrary” - Update Cmake and automake build systems to pass correct flags to llvm-build based on configuration Patch by Dan Malea! llvm-svn: 156319	2012-05-07 19:38:40 +00:00
Jordy Rose	7d6fc98ccf	Constify (trivially) ImmutableSet::iterator::getVisitState(). This was probably intended all along. llvm-svn: 156318	2012-05-07 19:24:40 +00:00
Jakob Stoklund Olesen	d92280013d	Add TRI::getCommonSuperRegClass(). This function is a generalization of getMatchingSuperRegClass() to the symmetric case where both sides are using a sub-register index. It will find a super-register class and sub-register indexes that make this diagram commute: PreA SuperRC ----------> RCA \| \| \| \| PreB \| \| SubA \| \| \| \| V V RCB ----------> SubRC SubB This can be used to coalesce copies like: %vreg1:sub16 = COPY %vreg2:sub16; GR64:%vreg1, GR32: %vreg2 llvm-svn: 156317	2012-05-07 19:14:58 +00:00
Chad Rosier	3e284d8bd6	Fix a regression from r147481. This combine should only happen if there is a single use. rdar://11360370 llvm-svn: 156316	2012-05-07 18:47:44 +00:00
Matt Beaumont-Gay	ee7706f1a3	Don't assume size_t is unsigned long long. Fixes a -Woverflow warning from gcc when building for 32-bit platforms. llvm-svn: 156313	2012-05-07 18:12:42 +00:00
Manman Ren	6fde9f74b4	X86: optimization for -(x != 0) This patch will optimize -(x != 0) on X86 FROM cmpl $0x01,%edi sbbl %eax,%eax notl %eax TO negl %edi sbbl %eax %eax In order to generate negl, I added patterns in Target/X86/X86InstrCompiler.td: def : Pat<(X86sub_flag 0, GR32:$src), (NEG32r GR32:$src)>; rdar: 10961709 llvm-svn: 156312	2012-05-07 18:06:23 +00:00
Eric Christopher	c2cd5bdf83	Add support for the 'x' constraint. Patch by Jack Carter. llvm-svn: 156295	2012-05-07 06:25:19 +00:00
Eric Christopher	87e8163c57	Add support for the 'l' constraint. Patch by Jack Carter. llvm-svn: 156294	2012-05-07 06:25:15 +00:00
Eric Christopher	af8eabbbd8	Add support for the 'c' constraint. Patch by Jack Carter. llvm-svn: 156293	2012-05-07 06:25:10 +00:00
Eric Christopher	0f1a0afa75	Add support for the 'P' constraint. Patch by Jack Carter. llvm-svn: 156292	2012-05-07 06:25:02 +00:00
John McCall	98163678d4	Fix trivial typo in llvm_move. llvm-svn: 156288	2012-05-07 06:00:23 +00:00
Craig Topper	02644ca6b7	Fix some issues in the f16c instructions. llvm-svn: 156287	2012-05-07 06:00:15 +00:00
Eric Christopher	a6552ba637	Add support for the 'O' constraint. Patch by Jack Carter. llvm-svn: 156285	2012-05-07 05:46:48 +00:00
Eric Christopher	5e1efebf09	Add support for the 'N' inline asm constraint. Patch by Jack Carter. llvm-svn: 156284	2012-05-07 05:46:43 +00:00
Eric Christopher	e5a46b70b3	Add support for the 'L' inline asm constraint. Patch by Jack Carter. llvm-svn: 156283	2012-05-07 05:46:37 +00:00
Eric Christopher	267aa256cb	Add support for the inline asm constraint 'K'. llvm-svn: 156282	2012-05-07 05:46:29 +00:00
Craig Topper	c6d0bc2afc	Add SSE4A MOVNTSS/MOVNTSD instructions. llvm-svn: 156281	2012-05-07 05:36:19 +00:00
Eric Christopher	bf784be9ae	Support the 'J' constraint. Patch by Jack Carter. llvm-svn: 156280	2012-05-07 03:13:42 +00:00
Eric Christopher	929ba63dcf	Add support for the 'I' inline asm constraint. Also add tests from the previous 2 patches. Patch by Jack Carter. llvm-svn: 156279	2012-05-07 03:13:32 +00:00
Eric Christopher	0c140afa87	Allow 64 bit integer values in gpu registers if arch and abi are 64 bit. Patch by Jack Carter. llvm-svn: 156278	2012-05-07 03:13:22 +00:00
Eric Christopher	6397520b96	When using inline asm constraints representing non-floating point general registers allow 8 and 16-bit elements. Patch by Jack Carter. llvm-svn: 156277	2012-05-07 03:13:16 +00:00
Jim Grosbach	970a443ab5	Tidy up. Whitespace. llvm-svn: 156276	2012-05-07 02:25:53 +00:00
Craig Topper	4246b08208	Use MVT instead of EVT as the argument to all the shuffle decode functions. Simplify some of the decode functions. llvm-svn: 156268	2012-05-06 19:46:21 +00:00
Craig Topper	b3b4c9476d	Add VPERMQ/VPERMPD to the list of target specific shuffles that can be looked through for DAG combine purposes. llvm-svn: 156266	2012-05-06 18:54:26 +00:00
Craig Topper	b95ee6cfc1	Add shuffle decode support for VPERMQ/VPERMPD. llvm-svn: 156265	2012-05-06 18:44:02 +00:00
Jim Grosbach	d48551961d	TableGen: AsmMatcher diagnostic when missing instruction mnemonic. Previously, if an instruction definition was missing the mnemonic, the next line would just assert(). Issue a real diagnostic instead. llvm-svn: 156263	2012-05-06 17:33:14 +00:00
Chris Lattner	d4a526b3cc	make SourceMgr tolerate empty SMLoc()'s better. llvm-svn: 156260	2012-05-06 16:20:49 +00:00
Benjamin Kramer	786f7671ab	Switch the select to branch transformation on by default. The primitive conservative heuristic seems to give a slight overall improvement while not regressing stuff. Make it available to wider testing. If you notice any speed regressions (or significant code size regressions) let me know! llvm-svn: 156258	2012-05-06 14:25:16 +00:00
Jakub Staszak	b3bddb41cb	Remove trailing spaces. llvm-svn: 156257	2012-05-06 13:52:31 +00:00
NAKAMURA Takumi	9ae88f2f80	Unix/Process.inc: Give more useful random seed to srand. Workaround for PR12743. llvm-svn: 156252	2012-05-06 08:24:24 +00:00
NAKAMURA Takumi	a7d133afee	Support/Process: Move llvm::sys::Process::GetRandomNumber() from Process.cpp to Unix/Process.inc. FIXME: GetRandomNumber() is not implemented in Win32. llvm-svn: 156251	2012-05-06 08:24:18 +00:00
Chris Lattner	cf8284517f	reapply my patch, with a fix for an off-by-one error. Turned out to be a lot of work for a drive-by fix :) llvm-svn: 156246	2012-05-05 22:17:32 +00:00
Chris Lattner	206bf447c0	revert my patches, which are causing problems. llvm-svn: 156245	2012-05-05 22:11:04 +00:00
Chris Lattner	372a67f8d9	add missing header <shame> llvm-svn: 156244	2012-05-05 22:04:11 +00:00
Chris Lattner	4c8c651c04	refactor some code to expose column numbers more and make diagnostic printing slightly more efficient. llvm-svn: 156243	2012-05-05 21:39:51 +00:00
Jim Grosbach	f7461026c2	Nuke a few dead remnants of the CBE. llvm-svn: 156241	2012-05-05 17:45:12 +00:00
Daniel Dunbar	af0500eb5d	[Support] Add missing include. llvm-svn: 156240	2012-05-05 16:49:11 +00:00
Daniel Dunbar	d7d85c4a85	[Support] Fix up comments. llvm-svn: 156239	2012-05-05 16:39:22 +00:00
Daniel Dunbar	6fbad750f3	[Support] Rewrite sys::fs::unique_file to not be stupid with /dev/urandom. - Just use sys::Process::GetRandomNumber instead of having two poor implementations. - This is ~70 times (!) faster on my OS X machine. llvm-svn: 156238	2012-05-05 16:36:24 +00:00
Daniel Dunbar	457eab2ad7	[Support] Add sys::Process::GetRandomNumber(). - Primitive API, but we rarely have need for random numbers. llvm-svn: 156237	2012-05-05 16:36:20 +00:00
Daniel Dunbar	c69f7c9234	[build] Add build check for ::arc4random(). llvm-svn: 156236	2012-05-05 16:36:16 +00:00
Benjamin Kramer	97de009760	Update all outdated autoconf files in the sample project. We might just use symlinks here, but I'm afraid of possible portability issues. llvm-svn: 156235	2012-05-05 15:02:39 +00:00
Benjamin Kramer	0463564612	CodeGenPrepare: Add a transform to turn selects into branches in some cases. This came up when a change in block placement formed a cmov and slowed down a hot loop by 50%: ucomisd (%rdi), %xmm0 cmovbel %edx, %esi cmov is a really bad choice in this context because it doesn't get branch prediction. If we emit it as a branch, an out-of-order CPU can do a better job (if the branch is predicted right) and avoid waiting for the slow load+compare instruction to finish. Of course it won't help if the branch is unpredictable, but those are really rare in practice. This patch uses a dumb conservative heuristic, it turns all cmovs that have one use and a direct memory operand into branches. cmovs usually save some code size, so we disable the transform in -Os mode. In-Order architectures are unlikely to benefit as well, those are included in the "predictableSelectIsExpensive" flag. It would be better to reuse branch probability info here, but BPI doesn't support select instructions currently. It would make sense to use the same heuristics as the if-converter pass, which does the opposite direction of this transform. Test suite shows a small improvement here and there on corei7-level machines, but the actual results depend a lot on the used microarchitecture. The transformation is currently disabled by default and available by passing the -enable-cgp-select2branch flag to the code generator. Thanks to Chandler for the initial test case to him and Evan Cheng for providing me with comments and test-suite numbers that were more stable than mine :) llvm-svn: 156234	2012-05-05 12:49:22 +00:00
Benjamin Kramer	7a9528b540	Add a new target hook "predictableSelectIsExpensive". This will be used to determine whether it's profitable to turn a select into a branch when the branch is likely to be predicted. Currently enabled for everything but Atom on X86 and Cortex-A9 devices on ARM. I'm not entirely happy with the name of this flag, suggestions welcome ;) llvm-svn: 156233	2012-05-05 12:49:14 +00:00
Benjamin Kramer	39afd32d88	NVPTX: Initialize the UseF32FTZ flag. llvm-svn: 156232	2012-05-05 11:22:02 +00:00
Stepan Dyatkovskiy	469935e0ae	Small fix in InstCombineCasts.cpp. Restored "alloca + bitcast" reducing for case when alloca's size is calculated within the "add/sub/... nsw". Also added fix to 2011-06-13-nsw-alloca.ll test. llvm-svn: 156231	2012-05-05 07:09:40 +00:00
Eric Christopher	d0a426dbe4	Typo. llvm-svn: 156226	2012-05-05 01:16:06 +00:00

1 2 3 4 5 ...

82144 Commits