llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Eric Christopher	2da5548487	Update function name and add some helpful comments. llvm-svn: 198979	2014-01-11 00:23:16 +00:00
Eric Christopher	8d9cd2a9e2	Fix odd whitespace. llvm-svn: 198978	2014-01-11 00:23:11 +00:00
Diego Novillo	f47aa4d47f	Extend and simplify the sample profile input file. 1- Use the line_iterator class to read profile files. 2- Allow comments in profile file. Lines starting with '#' are completely ignored while reading the profile. 3- Add parsing support for discriminators and indirect call samples. Our external profiler can emit more profile information that we are currently not handling. This patch does not add new functionality to support this information, but it allows profile files to provide it. I will add actual support later on (for at least one of these features, I need support for DWARF discriminators in Clang). A sample line may contain the following additional information: Discriminator. This is used if the sampled program was compiled with DWARF discriminator support (http://wiki.dwarfstd.org/index.php?title=Path_Discriminators). This is currently only emitted by GCC and we just ignore it. Potential call targets and samples. If present, this line contains a call instruction. This models both direct and indirect calls. Each called target is listed together with the number of samples. For example, 130: 7 foo:3 bar:2 baz:7 The above means that at relative line offset 130 there is a call instruction that calls one of foo(), bar() and baz(). With baz() being the relatively more frequent call target. Differential Revision: http://llvm-reviews.chandlerc.com/D2355 4- Simplify format of profile input file. This implements earlier suggestions to simplify the format of the sample profile file. The symbol table is not necessary and function profiles do not need to know the number of samples in advance. Differential Revision: http://llvm-reviews.chandlerc.com/D2419 llvm-svn: 198973	2014-01-10 23:23:51 +00:00
Diego Novillo	9e8454b3fe	Propagation of profile samples through the CFG. This adds a propagation heuristic to convert instruction samples into branch weights. It implements a similar heuristic to the one implemented by Dehao Chen on GCC. The propagation proceeds in 3 phases: 1- Assignment of block weights. All the basic blocks in the function are initially assigned the same weight as their most frequently executed instruction. 2- Creation of equivalence classes. Since samples may be missing from blocks, we can fill in the gaps by setting the weights of all the blocks in the same equivalence class to the same weight. To compute the concept of equivalence, we use dominance and loop information. Two blocks B1 and B2 are in the same equivalence class if B1 dominates B2, B2 post-dominates B1 and both are in the same loop. 3- Propagation of block weights into edges. This uses a simple propagation heuristic. The following rules are applied to every block B in the CFG: - If B has a single predecessor/successor, then the weight of that edge is the weight of the block. - If all the edges are known except one, and the weight of the block is already known, the weight of the unknown edge will be the weight of the block minus the sum of all the known edges. If the sum of all the known edges is larger than B's weight, we set the unknown edge weight to zero. - If there is a self-referential edge, and the weight of the block is known, the weight for that edge is set to the weight of the block minus the weight of the other incoming edges to that block (if known). Since this propagation is not guaranteed to finalize for every CFG, we only allow it to proceed for a limited number of iterations (controlled by -sample-profile-max-propagate-iterations). It currently uses the same GCC default of 100. Before propagation starts, the pass builds (for each block) a list of unique predecessors and successors. This is necessary to handle identical edges in multiway branches. Since we visit all blocks and all edges of the CFG, it is cleaner to build these lists once at the start of the pass. Finally, the patch fixes the computation of relative line locations. The profiler emits lines relative to the function header. To discover it, we traverse the compilation unit looking for the subprogram corresponding to the function. The line number of that subprogram is the line where the function begins. That becomes line zero for all the relative locations. llvm-svn: 198972	2014-01-10 23:23:46 +00:00
Tom Roeder	1954800ba6	Space formatting fix for r198966. llvm-svn: 198971	2014-01-10 23:17:39 +00:00
Roman Divacky	e429f2c937	Constant propagate MachineInstrClassName. llvm-svn: 198969	2014-01-10 22:59:49 +00:00
Tom Roeder	6ed65beaae	Fixing build break: should be in the if statement, not outside. llvm-svn: 198966	2014-01-10 22:55:25 +00:00
Tom Roeder	9b68196561	Restore the library dependency of LLVMgold on LTO; this was removed recently but is needed for LLVMgold to load in ld. llvm-svn: 198965	2014-01-10 22:48:35 +00:00
Rafael Espindola	4746cd5531	Add a note about the old asm printer being removed. llvm-svn: 198960	2014-01-10 22:06:26 +00:00
Rafael Espindola	728814cedc	All backends use MC now. llvm-svn: 198959	2014-01-10 21:49:27 +00:00
Rafael Espindola	13382e2639	Use the simpler version of sys::fs::remove when possible. llvm-svn: 198958	2014-01-10 21:40:29 +00:00
Rafael Espindola	7c8a2f4a58	Remove remove_all. A compiler has no need for recursively deleting a directory. llvm-svn: 198955	2014-01-10 20:36:42 +00:00
Duncan P. N. Exon Smith	3b0b19af25	LTO: whitespace changes llvm-svn: 198954	2014-01-10 20:24:35 +00:00
Arnold Schwaighofer	702d83d3d8	LoopVectorizer: Handle strided memory accesses by versioning for (i = 0; i < N; ++i) A[i * Stride1] += B[i * Stride2]; We take loops like this and check that the symbolic strides 'Stride1/2' are one and drop to the scalar loop if they are not. This is currently disabled by default and hidden behind the flag 'enable-mem-access-versioning'. radar://13075509 llvm-svn: 198950	2014-01-10 18:20:32 +00:00
Arnold Schwaighofer	bd8b4df02b	SCEVRewriter: Optionally interpret constants in value map as SCEVConstant An upcoming loop vectorizer commit will want to replace a SCEVUnknown(Value*) by a SCEVConstant. This commit modifies the SCEVParameterRewriter to support this. The SCEVParameterRewriter constructor can optionally specify to follow this behavior. llvm-svn: 198949	2014-01-10 18:20:29 +00:00
Artyom Skrobov	cbb9547cdc	Amending test/MC/ARM/thumb2-mclass.s to match its apparent original purpose (to test the ARMv6M/ARMv7M commonality), and creating a new test case for the differences between ARMv6M and ARMv7M llvm-svn: 198946	2014-01-10 16:49:49 +00:00
Artyom Skrobov	759f6384e9	Must not produce Tag_CPU_arch_profile for pre-ARMv7 cores (e.g. cortex-m0) llvm-svn: 198945	2014-01-10 16:42:55 +00:00
Saleem Abdulrasool	f544263238	ARM: fix regression caused by r198914 The disassembler would no longer be able to disambiguate between the two variants (explicit immediate #0 vs implicit, omitted #0) for the ldrt, strt, ldrbt, strbt mnemonics as both versions indicated the disassembler routine. llvm-svn: 198944	2014-01-10 16:22:47 +00:00
Kristof Beyls	c9499d899d	Silence unused variable warning for non-asserting builds that was introduced in r198937. llvm-svn: 198941	2014-01-10 14:20:45 +00:00
Rafael Espindola	4ef724a8d2	Use 'w' instead of 'c' to represent the win32 mangling. This change was requested to avoid confusion if we ever support non windows coff systems. llvm-svn: 198938	2014-01-10 13:42:12 +00:00
Kristof Beyls	082ab7548c	Make sure -use-init-array has intended effect on all AArch64 ELF targets, not just linux. llvm-svn: 198937	2014-01-10 13:41:49 +00:00
NAKAMURA Takumi	e848ddb6c2	Whitespace. llvm-svn: 198934	2014-01-10 11:12:01 +00:00
NAKAMURA Takumi	dbf5da4276	Sink add_llvm_library(gtest_main) to UnitTestMain/CMakeLists.txt. llvm-svn: 198933	2014-01-10 11:02:26 +00:00
NAKAMURA Takumi	7508c87ff4	llvm/test/ExecutionEngine/MCJIT/load-object-a.ll: Remove "REQUIRES:shell". This doesn't depend on shell's behavior. llvm-svn: 198931	2014-01-10 10:38:52 +00:00
NAKAMURA Takumi	8f51412a92	llvm/test/ExecutionEngine/MCJIT/lit.local.cfg: Add "AMD64" in the host_arch list. FIXME: We should not take CMake's ${CMAKE_SYSTEM_PROCESSOR}... llvm-svn: 198930	2014-01-10 10:38:46 +00:00
NAKAMURA Takumi	7f9f4d1b7d	lli: Tweak CacheName not to contain DOS driveletter. llvm-svn: 198929	2014-01-10 10:38:40 +00:00
NAKAMURA Takumi	f770ac39bf	lli: LLIObjectCache: Use llvm::sys::path to get dirname. llvm-svn: 198928	2014-01-10 10:38:34 +00:00
NAKAMURA Takumi	da4e382ace	Whitespace. llvm-svn: 198927	2014-01-10 10:38:28 +00:00
NAKAMURA Takumi	168c7d5d01	llvm/test/ExecutionEngine/MCJIT/load-object-a.ll: Fix not to use %t.cachedir/%p. %p is like X:\foo\bar. llvm-svn: 198926	2014-01-10 10:38:23 +00:00
Kostya Serebryany	25ca0cb80c	reapply r198858: Disable LeakSanitizer in TableGen binaries, see PR18325; this time LeakSanitizerIsTurnedOffForTheCurrentProcess is used instead of __lsan_is_turned_off llvm-svn: 198922	2014-01-10 08:05:42 +00:00
Saleem Abdulrasool	4af9bf355f	ARM IAS: support #:{lower,upper}16: for GNU compatibility The GNU assembler supports prefixing the expression with a '#' to indicate that the value that is being moved is in fact a constant. This improves the compatibility of the integrated assembler's parser for this. llvm-svn: 198916	2014-01-10 04:38:40 +00:00
Saleem Abdulrasool	54cac13cc3	ARM IAS: support GNU extension for ldrd, strd The GNU assembler has an extension that allows for the elision of the paired register (dt2) for the LDRD and STRD mnemonics. Add support for this in the assembly parser. Canonicalise the usage during the instruction parsing from the specified version. llvm-svn: 198915	2014-01-10 04:38:35 +00:00
Saleem Abdulrasool	b7a097a617	ARM IAS: support implicit immediate 0s for {LD,ST}R{B,}T The ARM ARM indicates the mnemonics as follows: ldrbt{<c>}{<q>} <Rt>, [<Rn>], {, #+/-<imm>} ldrt{<c>}{<q>} <Rt>, [<Rn>] {, #+/-<imm>} strbt{<c>}{<q>} <Rt>, [<Rn>] {, #<imm>} strt{<c>}{<q>} <Rt>, [<Rn>] {, #+/-<imm>} This improves the parser to deal with the implicit immediate 0 for the mnemonics as per the specification. Thanks to Joerg Sonnenberger for the tests! llvm-svn: 198914	2014-01-10 04:38:31 +00:00
Venkatraman Govindaraju	b0841deda2	[Sparc] Emit retl/ret instead of jmp instruction. It improves the readability of the assembly generated. llvm-svn: 198910	2014-01-10 02:55:27 +00:00
Venkatraman Govindaraju	c4cec3f2a4	[Sparc] Add support for parsing jmpl instruction and make indirect call and jmp instructions as aliases to jmpl. llvm-svn: 198909	2014-01-10 01:48:17 +00:00
David Blaikie	7af36c3a6d	Revert "Revert r198851, "Prototype of skeleton type units for fission"" This reverts commit r198865 which reverts r198851. ASan identified a use-of-uninitialized of the DwarfTypeUnit::Ty variable in skeleton type units. llvm-svn: 198908	2014-01-10 01:38:41 +00:00
Kevin Enderby	5c3b5e0470	Fix a bug with the ARM thumb2 CBZ and CBNZ instructions that branch to the next instruction. This cannot be encoded but can be turned into a NOP. rdar://15062072 llvm-svn: 198904	2014-01-10 00:43:32 +00:00
Chandler Carruth	f17fdf0afe	Update the developer policy to more clearly spell out the steps for contributors to submit patches to the LLVM project. Thanks to Danny, Chris, Alp, and others for reviewing. llvm-svn: 198901	2014-01-10 00:08:34 +00:00
Justin Bogner	551f5d2c7f	Bitcode: Fix a typo in an assert llvm-svn: 198894	2014-01-09 22:02:05 +00:00
Venkatraman Govindaraju	ae0e1515ef	[Sparc] Multiclass for loads/stores. No functionality change intended. llvm-svn: 198893	2014-01-09 21:49:18 +00:00
Evan Cheng	8ece7aa601	Clean up an inconsistency in v7s feature default. llvm-svn: 198889	2014-01-09 20:24:00 +00:00
Rafael Espindola	8d5e2752b6	Add a unit test for the copy constructor. I would not normally add tests like these, but the copy constructor is not used at all in our codebase with c++11, so having this test might prevent breaking the c++03 build again. llvm-svn: 198886	2014-01-09 19:47:39 +00:00
Alp Toker	671c9fe7b0	Revert "Disable LeakSanitizer in TableGen binaries, see PR18325" To declare or define reserved identifiers is undefined behaviour in standard C++. This needs to be addressed in compiler-rt before it can be used in LLVM. See the list discussion for details. This reverts commit r198858. llvm-svn: 198884	2014-01-09 19:40:55 +00:00
Nadav Rotem	0ee224c122	Re-remove dead code. This reverts r198854. llvm-svn: 198879	2014-01-09 19:22:07 +00:00
Rafael Espindola	f8bce2c4fa	Update example to be more idiomatic. llvm-svn: 198872	2014-01-09 14:40:43 +00:00
NAKAMURA Takumi	d87c5b8748	Revert r198851, "Prototype of skeleton type units for fission" It caused undefined behavior. DwarfTypeUnit::Ty might not be initialized properly, I guess. llvm-svn: 198865	2014-01-09 13:08:00 +00:00
Stepan Dyatkovskiy	3ed8a09218	Fixed old typo in ScalarEvolution, that caused wrong SCEVs zext operation. Detailed description is here: http://llvm.org/bugs/show_bug.cgi?id=18000#c16 For participation in bugfix process special thanks to David Wiberg. llvm-svn: 198863	2014-01-09 12:26:12 +00:00
Richard Sandiford	9a2e030d71	[SystemZ] Fix RNSBG bug introduced by r197802 The zext handling added in r197802 wasn't right for RNSBG. This patch restricts it to ROSBG, RXSBG and RISBG. (The tests for RISBG were added in r197802 since RISBG was the motivating example.) llvm-svn: 198862	2014-01-09 11:28:53 +00:00
Richard Sandiford	62f3dcdad8	Handle masked rotate amounts At the moment we expect rotates to have the form: (or (shl X, Y), (shr X, Z)) where Y == bitsize(X) - Z or Z == bitsize(X) - Y. This form means that the (or ...) is undefined for Y == 0 or Z == 0. This undefinedness can be avoided by using Y == (C * bitsize(X) - Z) & (bitsize(X) - 1) or Z == (C * bitsize(X) - Y) & (bitsize(X) - 1) for any integer C (including 0, the most natural choice). llvm-svn: 198861	2014-01-09 10:56:42 +00:00
Richard Sandiford	e9b9f0fa27	Match the InstCombine form of rotates by X+C InstCombine converts (sub 32, (add X, C)) into (sub 32-C, X), so a rotate left of a 32-bit Y by X+C could appear as either: (or (shl Y, (add X, C)), (shr Y, (sub 32, (add X, C)))) without InstCombine or: (or (shl Y, (add X, C)), (shr Y, (sub 32-C, X))) with it. We already matched the first form. This patch handles the second too. llvm-svn: 198860	2014-01-09 10:49:40 +00:00
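
The masked rotate amounts described in r198861 can be illustrated with a small sketch. This is not the LLVM matcher itself; it only evaluates the (or (shl X, Y), (shr X, Z)) pattern on 32-bit values with Z == (C * bitsize(X) - Y) & (bitsize(X) - 1), taking C = 0. The name `rotl32` and the constants are hypothetical, and because Python integers are unbounded, the result is masked back to 32 bits by hand.

```python
BITS = 32                  # bitsize(X) in the commit message
MASK = (1 << BITS) - 1     # keeps values within 32 bits


def rotl32(x, y):
    """Rotate the 32-bit value x left by y using masked shift amounts.

    Hypothetical helper: it mirrors the pattern
    (or (shl X, Y), (shr X, Z)) with Z == (0 * BITS - Y) & (BITS - 1),
    so neither shift amount ever reaches BITS, even when y == 0.
    """
    x &= MASK
    y &= BITS - 1                  # Y is reduced to the range [0, 31]
    z = (-y) & (BITS - 1)          # Z == (C * BITS - Y) & (BITS - 1), C = 0
    return ((x << y) | (x >> z)) & MASK
```

With the unmasked form Z == bitsize(X) - Y, a rotate by zero would need a shift by 32, which is the undefined case the commit message mentions. With the mask, Y == 0 gives Z == 0 and the expression reduces to X | X.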

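The sample line from r198973, `130: 7 foo:3 bar:2 baz:7`, can be read with a short parser. The following is a rough sketch, not the LLVM reader: the function name `parse_sample_line`, the returned dictionary layout, and the `offset.discriminator` spelling of the optional discriminator are all assumptions made for illustration. Only the '#' comment rule and the example line come from the commit message.

```python
import re

# Assumed layout: OFFSET[.DISCRIMINATOR]: SAMPLES [TARGET:COUNT ...]
# The ".DISCRIMINATOR" spelling is an assumption, not taken from the log.
_LINE_RE = re.compile(r"^(\d+)(?:\.(\d+))?:\s+(\d+)((?:\s+\S+:\d+)*)\s*$")


def parse_sample_line(line):
    """Parse one sample line; return None for '#' comment lines."""
    stripped = line.strip()
    if stripped.startswith("#"):
        return None                      # comment lines are ignored
    match = _LINE_RE.match(stripped)
    if match is None:
        raise ValueError("malformed sample line: %r" % line)
    offset, discriminator, samples, rest = match.groups()
    targets = {}
    for item in rest.split():
        name, count = item.rsplit(":", 1)   # split on the last ':' only
        targets[name] = int(count)
    return {
        "offset": int(offset),
        "discriminator": None if discriminator is None else int(discriminator),
        "samples": int(samples),
        "targets": targets,
    }
```

For the example line, the parsed call targets are foo, bar and baz with 3, 2 and 7 samples, so `max(targets, key=targets.get)` picks baz, matching the commit's remark that baz() is the most frequent call target.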
99032 Commits