llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-31 07:52:55 +01:00

Author	SHA1	Message	Date
Arnold Schwaighofer	abd363c1bc	LoopVectorizer: Pass OperandValueKind information to the cost model Pass down the fact that an operand is going to be a vector of constants. This should bring the performance of MultiSource/Benchmarks/PAQ8p/paq8p on x86 back. It had degraded to scalar performance due to my pervious shift cost change that made all shifts expensive on x86. radar://13576547 llvm-svn: 178809	2013-04-04 23:26:27 +00:00
Arnold Schwaighofer	52871434dd	X86 cost model: Differentiate cost for vector shifts of constants SSE2 has efficient support for shifts by a scalar. My previous change of making shifts expensive did not take this into account marking all shifts as expensive. This would prevent vectorization from happening where it is actually beneficial. With this change we differentiate between shifts of constants and other shifts. radar://13576547 llvm-svn: 178808	2013-04-04 23:26:24 +00:00
Arnold Schwaighofer	861251004b	CostModel: Add parameter to instruction cost to further classify operand values On certain architectures we can support efficient vectorized version of instructions if the operand value is uniform (splat) or a constant scalar. An example of this is a vector shift on x86. We can efficiently support for (i = 0 ; i < ; i += 4) w[0:3] = v[0:3] << <2, 2, 2, 2> but not for (i = 0; i < ; i += 4) w[0:3] = v[0:3] << x[0:3] This patch adds a parameter to getArithmeticInstrCost to further qualify operand values as uniform or uniform constant. Targets can then choose to return a different cost for instructions with such operand values. A follow-up commit will test this feature on x86. radar://13576547 llvm-svn: 178807	2013-04-04 23:26:21 +00:00
Manman Ren	d4f1e1e1df	Debug Info: revert 178722 for now. There is a difference for FORM_ref_addr between DWARF 2 and DWARF 3+. Since Eric is against guarding DWARF 2 ref_addr with DarwinGDBCompat, we are still in discussion on how to handle this. The correct solution is to update our header to say version 4 instead of version 2 and update tool chains as well. rdar://problem/13559431 llvm-svn: 178806	2013-04-04 23:13:11 +00:00
Adrian Prantl	679308d57e	typo llvm-svn: 178804	2013-04-04 22:56:49 +00:00
Hal Finkel	02fd9b0859	Rename the current PPC BCL definition to BCLalways BCL is normally a conditional branch-and-link instruction, but has an unconditional form (which is used in the SjLj code, for example). To make clear that this BCL instruction definition is specifically the special unconditional form (which does not meaningfully take a condition-register input), rename it to BCLalways. No functionality change intended. llvm-svn: 178803	2013-04-04 22:55:54 +00:00
Hal Finkel	1dc78e666b	PPC: Improve code generation for mixed-precision reciprocal sqrt The DAGCombine logic that recognized a/sqrt(b) and transformed it into a multiplication by the reciprocal sqrt did not handle cases where the sqrt and the division were separated by an fpext or fptrunc. llvm-svn: 178801	2013-04-04 22:44:12 +00:00
Jyotsna Verma	c3ebace56c	Hexagon: Expand br_cc. It fixes following tests for Hexagon: CodeGen/Generic/2003-07-29-BadConstSbyte.ll CodeGen/Generic/2005-10-21-longlonggtu.ll CodeGen/Generic/2009-04-28-i128-cmp-crash.ll CodeGen/Generic/MachineBranchProb.ll CodeGen/Generic/builtin-expect.ll CodeGen/Generic/pr12507.ll llvm-svn: 178794	2013-04-04 21:18:26 +00:00
Benjamin Kramer	d4c69ec04b	Reassociate: Avoid iterator invalidation. OpndPtrs stored pointers into the Opnd vector that became invalid when the vector grows. Store indices instead. Sadly I only have a large testcase that only triggers under valgrind, so I didn't include it. llvm-svn: 178793	2013-04-04 21:15:42 +00:00
Jyotsna Verma	8017d2c8b6	Disable 2010-10-01-crash.ll for Hexagon as the Hexagon frontend will never produce a byval parameter with size < 8 bytes. llvm-svn: 178792	2013-04-04 21:05:46 +00:00
Rafael Espindola	5e796656b1	Add back parsing of header charactestics. It had been dropped during the switch to yaml::IO. Also add a test going from yaml2obj to llvm-readobj. It can be extended as we add more fields/formats to yaml2obj. llvm-svn: 178786	2013-04-04 20:30:52 +00:00
Richard Osborne	25a2bd3084	[XCore] Add bru instruction. llvm-svn: 178783	2013-04-04 20:05:35 +00:00
Richard Osborne	2eabe25672	[XCore] The RRegs register class is a superset of GRRegs. At the time when the XCore backend was added there were some issues with with overlapping register classes but these all seem to be fixed now. Describing the register classes correctly allow us to get rid of a codegen only instruction (LDAWSP_lru6_RRegs) and it means we can disassemble ru6 instructions that use registers above r11. llvm-svn: 178782	2013-04-04 19:57:46 +00:00
Eli Bendersky	91eef1d3c8	Missing word llvm-svn: 178774	2013-04-04 18:29:19 +00:00
Jakob Stoklund Olesen	a53fa8d450	Avoid high-latency false CPSR dependencies even for tMOVSi. The Thumb2SizeReduction pass avoids false CPSR dependencies, except it still aggressively creates tMOVi8 instructions because they are so common. Avoid creating false CPSR dependencies even for tMOVi8 instructions when the the CPSR flags are known to have high latency. This allows integer computation to overlap floating point computations. Also process blocks in a reverse post-order and propagate high-latency flags to successors. <rdar://problem/13468102> llvm-svn: 178773	2013-04-04 18:25:36 +00:00
Eli Bendersky	13fd057bb8	Formatting llvm-svn: 178771	2013-04-04 18:03:41 +00:00
Evan Cheng	dd1796b8b4	Revert r178713 llvm-svn: 178769	2013-04-04 17:40:53 +00:00
Stepan Dyatkovskiy	0562afa331	New-password-test commit. llvm-svn: 178765	2013-04-04 16:11:18 +00:00
Vincent Lejeune	3a22d07044	R600: Use a mask for offsets when encoding instructions llvm-svn: 178763	2013-04-04 14:00:09 +00:00
Vincent Lejeune	d5f0b3821e	R600: Fix wrong address when substituting ENDIF llvm-svn: 178762	2013-04-04 14:00:03 +00:00
Vincent Lejeune	a680946842	R600: Take export into account when computing cf address llvm-svn: 178761	2013-04-04 13:59:59 +00:00
Alexey Samsonov	9e33512a49	Propagate path to ASan/MSan symbolizer into test environment to produce useful reports on errors. llvm-svn: 178749	2013-04-04 07:41:00 +00:00
Nadav Rotem	289f297421	Document the return value of SmallSet insert. llvm-svn: 178742	2013-04-04 04:54:21 +00:00
Jakob Stoklund Olesen	1969a96fcd	Add SPARC v9 support for select on 64-bit compares. This requires v9 cmov instructions using the %xcc flags instead of the %icc flags. Still missing: - Select floats on %xcc flags. - Select i64 on %fcc flags. llvm-svn: 178737	2013-04-04 03:08:00 +00:00
Rafael Espindola	f0937372e8	Explicitly add -Wl,--export-all-symbols on mingw/cygwin. Looks like cmake on windows is not expanding ENABLE_EXPORTS to -Wl,--export-all-symbols on mingw or cygwin, so add this back. llvm-svn: 178730	2013-04-04 01:19:55 +00:00
Rafael Espindola	bf8dcf15ed	Don't export symbols in every binary on linux. On freebsd this makes sure that symbols are exported on the binaries that need them. The net result is that we should get symbols in the binaries that need them on every platform. On linux x86-64 this reduces the size of the bin directory from 262MB to 250MB. Patch by Stephen Checkoway. llvm-svn: 178725	2013-04-04 01:01:32 +00:00
Manman Ren	2dafd4d1df	Debug Info: according to DWARF 2, FORM_ref_addr the same size as an address on the target system. It was hard-coded to 4 bytes before. I can't get llvm to generate a ref_addr on a reasonably sized testing case. rdar://problem/13559431 llvm-svn: 178722	2013-04-04 00:22:54 +00:00
Michael Gottesman	d8686ebbd6	Refactored out the helper method FindPredecessorAutoreleaseWithSafePath from ObjCARCOpt::OptimizeReturns. Now ObjCARCOpt::OptimizeReturns is easy to read and reason about. llvm-svn: 178715	2013-04-03 23:39:14 +00:00
Michael Gottesman	2560f9cf28	Refactored out the helper function FindPredecessorRetainWithSafePath from ObjCARCOpt::OptimizeReturns. llvm-svn: 178714	2013-04-03 23:16:05 +00:00
Evan Cheng	9170d95869	Make it possible to include llvm-c without including C++ headers. Patch by Filip Pizlo. llvm-svn: 178713	2013-04-03 23:12:39 +00:00
Michael Gottesman	f7fe76689b	Small cleanups. Cleaned up trailing whitespace and added extra slashes in front of a function level comment so that it follow the convention of having 3 slashes. llvm-svn: 178712	2013-04-03 23:07:45 +00:00
Michael Gottesman	964b7a9c7b	Refactored out a part of ObjCARCOpt::OptimizeReturns into its own method HasSafePathToPredecessorCall. llvm-svn: 178710	2013-04-03 23:04:28 +00:00
Michael Gottesman	aadcbf8008	Removed an old comment. llvm-svn: 178709	2013-04-03 23:04:24 +00:00
Michael Gottesman	99dccd50bc	Clean up arc annotations by moving the top/bottom BB annotations into conditional macros that no-op in Release mode instead of #ifdef sections of the code. This is to follow the example of the DEBUG macro. llvm-svn: 178705	2013-04-03 22:41:59 +00:00
Arnold Schwaighofer	329430aeac	X86 cost model: Vector shifts are expensive in most cases The default logic does not correctly identify costs of casts because they are marked as custom on x86. For some cases, where the shift amount is a scalar we would be able to generate better code. Unfortunately, when this is the case the value (the splat) will get hoisted out of the loop, thereby making it invisible to ISel. radar://13130673 radar://13537826 llvm-svn: 178703	2013-04-03 21:46:05 +00:00
Rafael Espindola	af01832c73	Implement the "mips endian" for r_info. Normally r_info is just a 32 of 64 bit number matching the endian of the rest of the file. Unfortunately, mips 64 bit little endian is special: The top 32 bits are a little endian number and the following 32 are a big endian one. llvm-svn: 178694	2013-04-03 21:02:51 +00:00
Richard Osborne	8c4177b262	[XCore] Check disassembly of the st8 instruction. llvm-svn: 178689	2013-04-03 20:07:11 +00:00
Richard Osborne	a7413cf3e7	[XCore] Update disassembler test to improve coverage of the instructions. Previously some instructions were unintentionally covered twice and others were not covered at all. llvm-svn: 178688	2013-04-03 20:07:06 +00:00
Eric Christopher	df46cef31b	Implements low-level object file format specific output for COFF and ELF with support for: - File headers - Section headers + data - Relocations - Symbols - Unwind data (only COFF/Win64) The output format follows a few rules: - Values are almost always output one per line (as elf-dump/coff-dump already do). - Many values are translated to something readable (like enum names), with the raw value in parentheses. - Hex numbers are output in uppercase, prefixed with "0x". - Flags are sorted alphabetically. - Lists and groups are always delimited. Example output: ---------- snip ---------- Sections [ Section { Index: 1 Name: .text (5) Type: SHT_PROGBITS (0x1) Flags [ (0x6) SHF_ALLOC (0x2) SHF_EXECINSTR (0x4) ] Address: 0x0 Offset: 0x40 Size: 33 Link: 0 Info: 0 AddressAlignment: 16 EntrySize: 0 Relocations [ 0x6 R_386_32 .rodata.str1.1 0x0 0xB R_386_PC32 puts 0x0 0x12 R_386_32 .rodata.str1.1 0x0 0x17 R_386_PC32 puts 0x0 ] SectionData ( 0000: 83EC04C7 04240000 0000E8FC FFFFFFC7 \|.....$..........\| 0010: 04240600 0000E8FC FFFFFF31 C083C404 \|.$.........1....\| 0020: C3 \|.\| ) } ] ---------- snip ---------- Relocations and symbols can be output standalone or together with the section header as displayed in the example. This feature set supports all tests in test/MC/COFF and test/MC/ELF (and I suspect all additional tests using elf-dump), making elf-dump and coff-dump deprecated. Patch by Nico Rieck! llvm-svn: 178679	2013-04-03 18:31:38 +00:00
Eric Christopher	4dc4bfd311	Don't disassemble symbols with an unknown address or size. Patch by Nico Rieck! llvm-svn: 178678	2013-04-03 18:31:23 +00:00
Eric Christopher	99a330354d	Implement sectionContainsSymbol for ELF. Patch by Nico Rieck! llvm-svn: 178677	2013-04-03 18:31:19 +00:00
Eric Christopher	8cfce53956	When dumping clear the arm/thumb flag for now. Patch by Nico Rieck! llvm-svn: 178676	2013-04-03 18:31:12 +00:00
Vincent Lejeune	6a4ef74f44	R600: Fix last ALU of a clause being emitted in a separate clause llvm-svn: 178675	2013-04-03 18:24:47 +00:00
Aaron Ballman	3d040e9fc0	Ensuring that both bits are set, and not just a combination of one or the other. llvm-svn: 178674	2013-04-03 18:00:22 +00:00
Hal Finkel	3e38cb94ec	Cleanup PPC reciprocal-estimate functionality Incorporating review feedback from Bill Schmidt on r178617. No functionality change intended. llvm-svn: 178672	2013-04-03 17:44:56 +00:00
Vincent Lejeune	9bc67cfa08	R600: Factorize maximum alu per clause in a single location llvm-svn: 178667	2013-04-03 16:49:34 +00:00
Aaron Ballman	1e35169e8e	Testing for Visual Studio 2010 SP1 or greater before calling the _xgetbv intrinsic. This also fixes a minor code formatting issue. llvm-svn: 178666	2013-04-03 16:28:24 +00:00
Vincent Lejeune	bab4692335	R600: Simplify data structure and add DEBUG to R600ControlFlowFinalizer llvm-svn: 178665	2013-04-03 16:24:09 +00:00
Vincent Lejeune	6b257b347d	R600: Consider KILLGT as an ALU instruction Mesa does not override llvm behavior wrt KILLGT anymore so llvm has to handle KILLGT on its own. llvm-svn: 178664	2013-04-03 16:24:04 +00:00
Eli Bendersky	70bff7a437	Measure time that IR parsing took as part of the -time-passes measurement. llvm-svn: 178662	2013-04-03 15:33:45 +00:00

1 2 3 4 5 ...

90843 Commits