llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 22:42:46 +02:00

Author	SHA1	Message	Date
JF Bastien	f82b3e73a6	x86: Emit LAHF/SAHF instead of PUSHF/POPF NaCl's sandbox doesn't allow PUSHF/POPF out of security concerns (privileged emulators have forgotten to mask system bits in the past, and EFLAGS's DF bit is a constant source of hilarity). Commit r220529 fixed PR20376 by saving cmpxchg's flags result using EFLAGS; this commit now generates LAHF/SAHF instead, for all of x86 (not just NaCl) because it leads to an overall performance gain over PUSHF/POPF. As with the previous patch this code generation is pretty bad because it occurs very late, after register allocation, and in many cases it rematerializes flags which were already available (e.g. already in a register through SETE). Fortunately it's somewhat rare that this code needs to fire. I did [[ https://github.com/jfbastien/benchmark-x86-flags | a bit of benchmarking ]], the results on an Intel Haswell E5-2690 CPU at 2.9GHz are: | Time per call (ms) | Runtime (ms) | Benchmark | | 0.000012514 | 6257 | sete.i386 | | 0.000012810 | 6405 | sete.i386-fast | | 0.000010456 | 5228 | sete.x86-64 | | 0.000010496 | 5248 | sete.x86-64-fast | | 0.000012906 | 6453 | lahf-sahf.i386 | | 0.000013236 | 6618 | lahf-sahf.i386-fast | | 0.000010580 | 5290 | lahf-sahf.x86-64 | | 0.000010304 | 5152 | lahf-sahf.x86-64-fast | | 0.000028056 | 14028 | pushf-popf.i386 | | 0.000027160 | 13580 | pushf-popf.i386-fast | | 0.000023810 | 11905 | pushf-popf.x86-64 | | 0.000026468 | 13234 | pushf-popf.x86-64-fast | Clearly `PUSHF`/`POPF` are suboptimal. It doesn't really seem to be worth teaching LLVM about individual flags, at least not for this purpose. Reviewers: rnk, jvoung, t.p.northover Subscribers: llvm-commits Differential revision: http://reviews.llvm.org/D6629 llvm-svn: 244503	2015-08-10 20:59:36 +00:00
Rafael Espindola	6e87ed90fd	Use higher level functions in llvm-objdump. This matches the rest of llvm-objdump better and isolates it from upcoming changes to ELFFile. llvm-svn: 244500	2015-08-10 20:50:40 +00:00
Sanjay Patel	bea667f5ae	fix minsize detection: minsize attribute implies optimizing for size llvm-svn: 244499	2015-08-10 20:45:44 +00:00
Sanjay Patel	d7a321e834	[x86, SSE] add missing tests for load folding with partial register update The minsize case is wrong; that will be fixed in the next commit. llvm-svn: 244498	2015-08-10 20:34:34 +00:00
Rafael Espindola	e12302b7c2	Delete getDotSymtabSec. Another step in avoiding iterating over all sections in the ELFFile constructor. llvm-svn: 244496	2015-08-10 20:25:04 +00:00
Simon Pilgrim	65266a8e22	[InstCombine] Move SSE2/AVX2 arithmetic vector shift folding to instcombiner As discussed in D11760, this patch moves the (V)PSRA(WD) arithmetic shift-by-constant folding to InstCombine to match the logical shift implementations. Differential Revision: http://reviews.llvm.org/D11886 llvm-svn: 244495	2015-08-10 20:21:15 +00:00
Tyler Nowicki	430003c6a7	Removed unused and incorrectly implemented classof() on Optimization Remark base class. llvm-svn: 244494	2015-08-10 20:13:32 +00:00
Colin LeMahieu	822cee11a6	[TableGen] NFC improving comments about what the tokenized identifiers will contain. llvm-svn: 244493	2015-08-10 19:58:06 +00:00
Jonathan Roelofs	135252e238	Fix a few more cases of 'CHECK[^:]*$'. NFCI llvm-svn: 244491	2015-08-10 19:56:39 +00:00
Tyler Nowicki	6edbef9016	Late evaluation of the fast-math vectorization requirement. This patch moves the verification of fast-math to just before vectorization is done. This way we can tell clang to append the command line options that would allow floating-point commutativity. Specifically, those are enabling fast-math or specifying a loop hint. llvm-svn: 244489	2015-08-10 19:51:46 +00:00
Jonathan Roelofs	87c34551dd	Fix another case of 'CHECK[^:]*$'. NFCI llvm-svn: 244486	2015-08-10 19:22:55 +00:00
Tyler Nowicki	455075e570	Modify diagnostic messages to clearly indicate why interleaving wasn't done. Sometimes interleaving is not beneficial, as determined by the cost-model, and sometimes it is disabled by a loop hint (by the user). This patch modifies the diagnostic messages to make it clear why interleaving wasn't done. llvm-svn: 244485	2015-08-10 19:14:16 +00:00
James Y Knight	2a6af41342	[Sparc] Implement i64 load/store support for 32-bit sparc. The LDD/STD instructions can load/store a 64bit quantity from/to memory to/from a consecutive even/odd pair of (32-bit) registers. They are part of SparcV8, and also present in SparcV9. (Although deprecated there, as you can store 64bits in one register). As recommended on llvmdev in the thread "How to enable use of 64bit load/store for 32bit architecture" from Apr 2015, I've modeled the 64-bit load/store operations as working on a v2i32 type, rather than making i64 a legal type, but with few legal operations. The latter does not (currently) work, as there is much code in llvm which assumes that if i64 is legal, operations like "add" will actually work on it. The same assumption does not hold for v2i32 -- for vector types, it is workable to support only load/store, and expand everything else. This patch: - Adds a new register class, IntPair, for even/odd pairs of registers. - Modifies the list of reserved registers, the stack spilling code, and register copying code to support the IntPair register class. - Adds support in AsmParser. (note that in asm text, you write the name of the first register of the pair only. So the parser has to morph the single register into the equivalent paired register). - Adds the new instructions themselves (LDD/STD/LDDA/STDA). - Hooks up the instructions and registers as a vector type v2i32. Adds custom legalizer to transform i64 load/stores into v2i32 load/stores and bitcasts, so that the new instructions can actually be generated, and marks all operations other than load/store on v2i32 as needing to be expanded. - Copies the unfortunate SelectInlineAsm hack from ARMISelDAGToDAG. This hack undoes the transformation of i64 operands into two arbitrarily-allocated separate i32 registers in SelectionDAGBuilder, and instead passes them in a single IntPair. (Arbitrarily allocated registers are not useful, asm code expects to be receiving a pair, which can be passed to ldd/std.) Also adds a bunch of test cases covering all the bugs I've added along the way. Differential Revision: http://reviews.llvm.org/D8713 llvm-svn: 244484	2015-08-10 19:11:39 +00:00
Rafael Espindola	2f3cc0d8e8	rename toELFShdrIter to getSection and move it closer to getSymbol. NFC. llvm-svn: 244483	2015-08-10 19:10:37 +00:00
Rafael Espindola	8c3629f049	toELFSymIter and getSymbol are now the same thing. Merge them. llvm-svn: 244482	2015-08-10 19:07:56 +00:00
Jonathan Roelofs	3ac128b6b7	Fix a bunch of trivial cases of 'CHECK[^:]*$' in the tests. NFCI I looked into adding a warning / error for this to FileCheck, but there doesn't seem to be a good way to avoid it triggering on the instances of it in RUN lines. llvm-svn: 244481	2015-08-10 19:01:27 +00:00
Rafael Espindola	29d199aa4d	Use continue to reduce indentation. NFC. llvm-svn: 244480	2015-08-10 18:57:42 +00:00
Chad Rosier	e027a2d1cc	[AArch64] Convert a conditional check that will always be true to an assert. NFC. llvm-svn: 244479	2015-08-10 18:42:45 +00:00
Yaron Keren	61f6d7c22d	Recommit r244470+ r244471 together, the bot failed between them. llvm-svn: 244476	2015-08-10 18:27:51 +00:00
Igor Laevsky	dc6b4a78a4	[IndVarSimplify] Make cost estimation in RewriteLoopExitValues smarter Differential Revision: http://reviews.llvm.org/D11687 llvm-svn: 244474	2015-08-10 18:23:58 +00:00
Yaron Keren	88665eff12	Revert r244470 and 244471 while looking into it. llvm-svn: 244472	2015-08-10 18:14:56 +00:00
Yaron Keren	47dad377e0	Second part of r244470 (source file was unsaved in editor). llvm-svn: 244471	2015-08-10 18:06:01 +00:00
Yaron Keren	0093118b2a	Really implement David Blaikie's suggestion in full, separating variable initialization from its usage in the push_back, making collapse of the two statements unlikely even without a comment. llvm-svn: 244470	2015-08-10 18:03:35 +00:00
Mark Heffernan	ba9e336c90	Add new llvm.loop.unroll.enable metadata. This change adds the unroll metadata "llvm.loop.unroll.enable" which directs the optimizer to unroll a loop fully if the trip count is known at compile time, and unroll partially if the trip count is not known at compile time. This differs from "llvm.loop.unroll.full" which explicitly does not unroll a loop if the trip count is not known at compile time. The "llvm.loop.unroll.enable" is intended to be added for loops annotated with "#pragma unroll". llvm-svn: 244466	2015-08-10 17:28:08 +00:00
Chad Rosier	216eb1ed4b	Typo. Move comment closer to relevant code. NFC. llvm-svn: 244465	2015-08-10 17:17:19 +00:00
Sanjay Patel	d654d315bb	fix minsize detection: minsize attribute implies optimizing for size llvm-svn: 244464	2015-08-10 17:15:17 +00:00
Sanjay Patel	5fcdfefe10	fix minsize detection: minsize attribute implies optimizing for size llvm-svn: 244463	2015-08-10 17:00:44 +00:00
Yaron Keren	ea9a1cc024	Fully apply David Blaikie's suggestion and add comment explaining why. llvm-svn: 244461	2015-08-10 16:53:30 +00:00
Sanjay Patel	c0b2c539ba	fix minsize detection: minsize attribute implies optimizing for size llvm-svn: 244460	2015-08-10 16:47:47 +00:00
Sanjay Patel	c41be0722a	fix minsize detection: minsize attribute implies optimizing for size llvm-svn: 244458	2015-08-10 16:43:20 +00:00
Yaron Keren	b598ba7c7c	Add missing include guard to FuzzerInternal.h, NFC. llvm-svn: 244457	2015-08-10 16:37:40 +00:00
Yaron Keren	620d03e422	Modify r244405 to clearer code, per David Blaikie's suggestion. llvm-svn: 244455	2015-08-10 16:15:51 +00:00
Aaron Ballman	04b689be68	Silence a sign mismatch warning; NFC. llvm-svn: 244452	2015-08-10 15:22:39 +00:00
Silviu Baranga	fccd898d4b	[TTI] Add a hook for specifying per-target defaults for Interleaved Accesses Summary: This adds a hook to TTI which enables us to selectively turn on by default interleaved access vectorization for targets on which we have performed the required benchmarking. Reviewers: rengolin Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D11901 llvm-svn: 244449	2015-08-10 14:50:54 +00:00
Fraser Cormack	c174b92fa2	Prevent the scalarizer from caching incorrect entries The scalarizer can cache incorrect entries when walking up a chain of insertelement instructions. This occurs when it encounters more than one instruction that it is not actively searching for, as it unconditionally caches every element it finds. The fix is to only cache the first element that it isn't searching for so we don't overwrite correct entries. Reviewers: hfinkel Differential Revision: http://reviews.llvm.org/D11559 llvm-svn: 244448	2015-08-10 14:48:47 +00:00
Rafael Espindola	16999b6002	elf2yaml: Use existing section walk to find the symbol table. NFC. llvm-svn: 244447	2015-08-10 14:27:50 +00:00
Michael Kruse	106510316b	[RegionInfo] Fix typo llvm-svn: 244445	2015-08-10 13:26:09 +00:00
Michael Kruse	8153cc8a10	[RegionInfo] Add debug-time region viewer functions Summary: Analogously to Function::viewCFG(), RegionInfo::view() and RegionInfo::viewOnly() are meant to be called in debugging sessions. They open a viewer to show how RegionInfo currently understands the region hierarchy. The functions viewRegion(Function) and viewRegionOnly(Function) invoke a fresh region analysis of the function in contrast to viewRegion(RegionInfo) and viewRegionOnly(RegionInfo) which show the current analysis result. Reviewers: grosser Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11875 llvm-svn: 244444	2015-08-10 13:21:59 +00:00
Michael Kruse	3373ce735a	[RegionInfo] Use RegionInfo* instead of RegionInfoPass* as graph type This allows printing region graphs when only the RegionInfo (e.g. Region::getRegionInfo()), but no RegionInfoPass object is available. Specifically, we will use this to print RegionInfo graphs in the debugger. Differential Revision: http://reviews.llvm.org/D11874 Reviewed-by: grosser llvm-svn: 244442	2015-08-10 12:57:23 +00:00
Michael Kruse	f4a0009c80	[RegionInfo] Update old-style comments Authorized-by: grosser llvm-svn: 244441	2015-08-10 12:40:41 +00:00
Michael Kruse	13ae2127ab	[RegionInfo] More descriptive error messages in verifier llvm-svn: 244440	2015-08-10 12:28:52 +00:00
Robert Lougher	ac5d349432	Trace copies when checking for rematerializability in spill weight calculation PR24139 contains an analysis of poor register allocation. One of the findings was that when calculating the spill weight, a rematerializable interval once split is no longer rematerializable. This is because the isRematerializable check in CalcSpillWeights.cpp does not follow the copies introduced by live range splitting (after splitting, the live interval register definition is a copy which is not rematerializable). Reviewers: qcolombet Differential Revision: http://reviews.llvm.org/D11686 llvm-svn: 244439	2015-08-10 11:59:44 +00:00
Marina Yatsina	b999526f3d	Test commit to verify commit access llvm-svn: 244438	2015-08-10 11:33:10 +00:00
Yaron Keren	8d83d7181d	Rangify for loop, NFC. llvm-svn: 244434	2015-08-10 07:04:29 +00:00
NAKAMURA Takumi	1b224b5ec5	Reformat headers in ADT and Support partially. Note, I didn't reformat entirely, but partially where I touched in previous commits. llvm-svn: 244432	2015-08-10 04:22:36 +00:00
NAKAMURA Takumi	c1197d9021	Whitespace. llvm-svn: 244431	2015-08-10 04:22:09 +00:00
NAKAMURA Takumi	b93b06ef3e	Reformat linebreaks. llvm-svn: 244430	2015-08-10 04:21:43 +00:00
NAKAMURA Takumi	06ac411c34	llvm/include/llvm/Support/Memory.h: Fix comment header. llvm-svn: 244429	2015-08-10 04:21:19 +00:00
Craig Topper	53690ef92d	[TableGen] Make StringInit constructor take a StringRef instead of const std::string&. NFC. llvm-svn: 244426	2015-08-09 22:03:04 +00:00
Saleem Abdulrasool	e810d83543	X86: remove a dead store (NFC) The SP was always unconditionally assigned to later, but initialised early. This delays the initialisation, and avoids the dead store. Identified by clang static analysis. No functional change intended. llvm-svn: 244423	2015-08-09 20:39:09 +00:00

120554 Commits