llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Yuchen Wu	dda2c44c38	llvm-cov: Changed extension from .llcov to .gcov. llvm-svn: 196530	2013-12-05 20:45:36 +00:00
Matt Arsenault	f4228e2204	Revert part of GCC warning fix to fix debug build. The typedef is used inside the DEBUG(), and apparently can't be moved inside of it. llvm-svn: 196528	2013-12-05 20:02:18 +00:00
Matt Arsenault	57be52c610	Fix minor GCC warnings. Unused typedefs and unused variables. llvm-svn: 196526	2013-12-05 19:37:36 +00:00
Michael Gottesman	f2e7969964	Change std::deque => std::vector. No functionality change. There is no reason to use std::deque here over std::vector. Thus given the performance differences inbetween the two it makes sense to change deque to vector. llvm-svn: 196524	2013-12-05 18:42:12 +00:00
Yunzhong Gao	f3138b60c3	Document that dllexported symbols are preserved by optimization passes. llvm-svn: 196523	2013-12-05 18:37:54 +00:00
Rafael Espindola	2ad993fb14	Fix non-deterministic behavior. We use CSEBlocks to initialize a worklist: SmallVector<BasicBlock *, 8> CSEWorkList(CSEBlocks.begin(), CSEBlocks.end()); so it must have a deterministic order. llvm-svn: 196520	2013-12-05 18:28:01 +00:00
Eric Christopher	e62b076d5b	Rename DwarfUnits to DwarfFile to help avoid some naming confusion. llvm-svn: 196519	2013-12-05 18:06:10 +00:00
Andrew Trick	7eb7f7648b	MI-Sched: Model "reserved" processor resources. This allows a target to use MI-Sched as an in-order scheduler that will model strict resource conflicts without defining a processor itinerary. Instead, the target can now use the new per-operand machine model and define in-order resources with BufferSize=0. For example, this would allow restricting the type of operations that can be formed into a dispatch group. (Normally NumMicroOps is sufficient to enforce dispatch groups). If the intent is to model latency in in-order pipeline, as opposed to resource conflicts, then a resource with BufferSize=1 should be defined instead. This feature is only casually tested as there are no in-tree targets using it yet. However, Hal will be experimenting with POWER7. llvm-svn: 196517	2013-12-05 17:56:02 +00:00
Andrew Trick	192311ab9a	MI-Sched: handle latency of in-order operations with the new machine model. The per-operand machine model allows the target to define "unbuffered" processor resources. This change is a quick, cheap way to model stalls caused by the latency of operations that use such resources. This only applies when the processor's micro-op buffer size is non-zero (Out-of-Order). We can't precisely model in-order stalls during out-of-order execution, but this is an easy and effective heuristic. It benefits cortex-a9 scheduling when using the new machine model, which is not yet on by default. MI-Sched for armv7 was evaluated on Swift (and only not enabled because of a performance bug related to predication). However, we never evaluated Cortex-A9 performance on MI-Sched in its current form. This change adds MI-Sched functionality to reach performance goals on A9. The only remaining change is to allow MI-Sched to run as a PostRA pass. I evaluated performance using a set of options to estimate the performance impact once MI sched is default on armv7: -mcpu=cortex-a9 -disable-post-ra -misched-bench -scheditins=false For a simple saxpy loop I see a 1.7x speedup. Here are the llvm-testsuite results: (min run time over 2 runs, filtering tiny changes) Speedups: \| Benchmarks/BenchmarkGame/recursive \| 52.39% \| \| Benchmarks/VersaBench/beamformer \| 20.80% \| \| Benchmarks/Misc/pi \| 19.97% \| \| Benchmarks/Misc/mandel-2 \| 19.95% \| \| SPEC/CFP2000/188.ammp \| 18.72% \| \| Benchmarks/McCat/08-main/main \| 18.58% \| \| Benchmarks/Misc-C++/Large/sphereflake \| 18.46% \| \| Benchmarks/Olden/power \| 17.11% \| \| Benchmarks/Misc-C++/mandel-text \| 16.47% \| \| Benchmarks/Misc/oourafft \| 15.94% \| \| Benchmarks/Misc/flops-7 \| 14.99% \| \| Benchmarks/FreeBench/distray \| 14.26% \| \| SPEC/CFP2006/470.lbm \| 14.00% \| \| mediabench/mpeg2/mpeg2dec/mpeg2decode \| 12.28% \| \| Benchmarks/SmallPT/smallpt \| 10.36% \| \| Benchmarks/Misc-C++/Large/ray \| 8.97% \| \| Benchmarks/Misc/fp-convert \| 8.75% \| \| Benchmarks/Olden/perimeter \| 7.10% \| \| Benchmarks/Bullet/bullet \| 7.03% \| \| Benchmarks/Misc/mandel \| 6.75% \| \| Benchmarks/Olden/voronoi \| 6.26% \| \| Benchmarks/Misc/flops-8 \| 5.77% \| \| Benchmarks/Misc/matmul_f64_4x4 \| 5.19% \| \| Benchmarks/MiBench/security-rijndael \| 5.15% \| \| Benchmarks/Misc/flops-6 \| 5.10% \| \| Benchmarks/Olden/tsp \| 4.46% \| \| Benchmarks/MiBench/consumer-lame \| 4.28% \| \| Benchmarks/Misc/flops-5 \| 4.27% \| \| Benchmarks/mafft/pairlocalalign \| 4.19% \| \| Benchmarks/Misc/himenobmtxpa \| 4.07% \| \| Benchmarks/Misc/lowercase \| 4.06% \| \| SPEC/CFP2006/433.milc \| 3.99% \| \| Benchmarks/tramp3d-v4 \| 3.79% \| \| Benchmarks/FreeBench/pifft \| 3.66% \| \| Benchmarks/Ptrdist/ks \| 3.21% \| \| Benchmarks/Adobe-C++/loop_unroll \| 3.12% \| \| SPEC/CINT2000/175.vpr \| 3.12% \| \| Benchmarks/nbench \| 2.98% \| \| SPEC/CFP2000/183.equake \| 2.91% \| \| Benchmarks/Misc/perlin \| 2.85% \| \| Benchmarks/Misc/flops-1 \| 2.82% \| \| Benchmarks/Misc-C++-EH/spirit \| 2.80% \| \| Benchmarks/Misc/flops-2 \| 2.77% \| \| Benchmarks/NPB-serial/is \| 2.42% \| \| Benchmarks/ASC_Sequoia/CrystalMk \| 2.33% \| \| Benchmarks/BenchmarkGame/n-body \| 2.28% \| \| Benchmarks/SciMark2-C/scimark2 \| 2.27% \| \| Benchmarks/Olden/bh \| 2.03% \| \| skidmarks10/skidmarks \| 1.81% \| \| Benchmarks/Misc/flops \| 1.72% \| Slowdowns: \| Benchmarks/llubenchmark/llu \| -14.14% \| \| Benchmarks/Polybench/stencils/seidel-2d \| -5.67% \| \| Benchmarks/Adobe-C++/functionobjects \| -5.25% \| \| Benchmarks/Misc-C++/oopack_v1p8 \| -5.00% \| \| Benchmarks/Shootout/hash \| -2.35% \| \| Benchmarks/Prolangs-C++/ocean \| -2.01% \| \| Benchmarks/Polybench/medley/floyd-warshall \| -1.98% \| \| Polybench/linear-algebra/kernels/3mm \| -1.95% \| \| Benchmarks/McCat/09-vor/vor \| -1.68% \| llvm-svn: 196516	2013-12-05 17:55:58 +00:00
Andrew Trick	24a9064bbd	Machine model comments. Explain a ProcessorUnit's BufferSize. llvm-svn: 196515	2013-12-05 17:55:53 +00:00
Andrew Trick	9518dde658	Fix the A9 machine model. VTRN writes two registers. llvm-svn: 196514	2013-12-05 17:55:49 +00:00
Andrew Trick	6bd4b82476	comment typo and reformat llvm-svn: 196513	2013-12-05 17:55:47 +00:00
Rafael Espindola	eb989f9afc	Add a default constructor to get deterministic behavior. Should fix the msan and valgrind bots. llvm-svn: 196509	2013-12-05 16:21:17 +00:00
Arnold Schwaighofer	120880c780	SLPVectorizer: An in-tree vectorized entry cannot also be a scalar external use We were creating external uses for scalar values in MustGather entries that also had a ScalarToTreeEntry (they also are present in a vectorized tuple). This meant we would keep a value 'alive' as a scalar and vectorized causing havoc. This is not necessary because when we create a MustGather vector we explicitly create external uses entries for the insertelement instructions of the MustGather vector elements. Fixes PR18129. radar://15582184 llvm-svn: 196508	2013-12-05 15:14:40 +00:00
Kostya Serebryany	eb57b3e248	[tsan] fix PR18146: sometimes a variable written into vptr could have an integer type (after other optimizations) llvm-svn: 196507	2013-12-05 15:03:02 +00:00
Justin Holewinski	925169cb4e	[NVPTX] Fix off-by-one error when creating the VT list for an SDNode llvm-svn: 196503	2013-12-05 12:58:00 +00:00
Alexey Samsonov	66f3fd41ae	Add forgotten header guards llvm-svn: 196500	2013-12-05 12:52:32 +00:00
Matheus Almeida	b651cddc0c	[mips] Small code generation improvement for conditional operator (select) in case the operands are constants and its difference is \|1\|. It should be possible in those cases to rematerialize the result using MIPS's slt and similar instructions. The small update to some of the tests in cmov.ll, sel1c.ll and sel2c.ll was needed otherwise the optimization implemented in this patch would have been triggered (difference between the operands was 1) and that would have changed the semantic of the tests. llvm-svn: 196498	2013-12-05 12:07:05 +00:00
Matheus Almeida	adbbd704d1	[mips] Add some comments related to the optimization performed in performSELECTCombine. The structure of the code was slightly modified so that the next patch is easier to read/review. No functional changes. llvm-svn: 196496	2013-12-05 11:56:56 +00:00
Matheus Almeida	f0fc3cf095	[mips][msa] Fix issue with immediate fields of LD/ST instructions not being correctly encoded/decoded. In more detail, immediate fields of LD/ST instructions should be divided/multiplied by the size of the data format before encoding and after decoding, respectively. llvm-svn: 196494	2013-12-05 11:06:22 +00:00
Tim Northover	d04bb11dd7	ARM: fix yet another stack-folding bug We were trying to fold the stack adjustment into the wrong instruction in the situation where the entire basic-block was epilogue code. Really, it can only ever be valid to do the folding precisely where the "add sp, ..." would be placed so there's no need for a separate iterator to track that. Should fix PR18136. llvm-svn: 196493	2013-12-05 11:02:02 +00:00
David Blaikie	042cd582a0	DwarfDebug/DwarfUnit: Push abbreviation structures down into DwarfUnits to reduce duplication llvm-svn: 196479	2013-12-05 07:43:55 +00:00
Matt Arsenault	414264b933	Use isIntrinsic() instead of checking for "llvm." llvm-svn: 196473	2013-12-05 06:05:43 +00:00
Rafael Espindola	2b4db8d379	Remove the isImplicitlyPrivate argument of getNameWithPrefix. getSymbolWithGlobalValueBase use is to create a name of a new symbol based on the name of an existing GV. Assert that and then remove the last call to pass true to isImplicitlyPrivate. This gives the mangler API a 1:1 mapping from GV to names, which is what we need to drop the mangler dependency on the target (and use an extended datalayout instead). llvm-svn: 196472	2013-12-05 05:53:12 +00:00
Alp Toker	e845f8af67	Correct word hyphenations This patch tries to avoid unrelated changes other than fixing a few hyphen-related ambiguities and contractions in nearby lines. llvm-svn: 196471	2013-12-05 05:44:44 +00:00
Rafael Espindola	b4226966a9	Hide the stub created for MO_ExternalSymbol too. given declare void @llvm.memset.p0i8.i32(i8* nocapture, i8, i32, i32, i1) declare void @foo() define void @bar() { call void @foo() call void @llvm.memset.p0i8.i32(i8* null, i8 0, i32 188, i32 1, i1 false) ret void } We used to produce L_foo$stub: .indirect_symbol _foo .ascii "\364\364\364\364\364" _memset$stub: .indirect_symbol _memset .ascii "\364\364\364\364\364" We not produce a private stub for memset too. Stubs are not needed with recent linkers, but we still produce them for darwin8. Thanks to David Fang for confirming that gcc used to do this too. llvm-svn: 196468	2013-12-05 05:19:12 +00:00
Matt Arsenault	6f14dd54b4	R600/SI: Add comments for number of used registers. llvm-svn: 196467	2013-12-05 05:15:35 +00:00
Rafael Espindola	6963247d3a	Try harder to get a consistent floating point results. This just extends the existing hack. It should be enough to get a reproducible bootstrap on 32 bits. I will open a bug to track getting a real fix for this. llvm-svn: 196462	2013-12-05 04:14:33 +00:00
NAKAMURA Takumi	44a125b7f6	Move llvm/test/MC/ELF/thumb-st_other.s to test/MC/ARM. llvm-svn: 196457	2013-12-05 02:21:44 +00:00
Jiangning Liu	7825595e77	For AArch64, add missing register cost calculation for big value types like v4i64 and v8i64. llvm-svn: 196456	2013-12-05 02:12:01 +00:00
Cameron McInally	00a0d8b6f3	Add FileCheck statements for r196435. llvm-svn: 196449	2013-12-05 01:20:36 +00:00
Reid Kleckner	27cfa30e39	Compiler.h: Disable initializer list usage with clang-cl Most people are using MSVC 2012, which lacks the <initializer_list> header. MSVC 2013 shipped with that header, but it has not yet been tested. If clang works with the 2013 header, then we can enable this by checking the value of _MSC_VER. llvm-svn: 196448	2013-12-05 01:03:23 +00:00
Will Dietz	dd5418f361	Export symbols in tools that support loading plugins. llvm-svn: 196447	2013-12-05 01:01:58 +00:00
David Blaikie	5e586ea3ed	DwarfDebug: Avoid unnecessary abbreviation lookup when emitting DIEs DIEs already contain references directly to their DIEAbbrev, use that instead of looking it up based on index. llvm-svn: 196446	2013-12-05 01:01:41 +00:00
David Blaikie	a2745869cd	DwarfDebug: Remove trivial function wrapper llvm-svn: 196445	2013-12-05 01:01:37 +00:00
Eric Christopher	fe3790d105	Make these two tests resilient in the face of compile unit size changes. llvm-svn: 196444	2013-12-05 01:00:12 +00:00
Eric Christopher	b81841285f	80-column. llvm-svn: 196442	2013-12-05 00:36:21 +00:00
Eric Christopher	2d6d0fc3f2	Remove special handling for DW_AT_ranges support by constructing the values with the correct behavior. llvm-svn: 196441	2013-12-05 00:36:17 +00:00
Logan Chien	558333e1e1	[mc] Fix ELF st_other flag. ELF_Other_Weakref and ELF_Other_ThumbFunc seems to be LLVM internal ELF symbol flags. These should not be emitted to object file. This commit defines ELF_STO_Shift for the target-defined flags for st_other, and increase the value of ELF_Other_Shift to 16. llvm-svn: 196440	2013-12-05 00:34:11 +00:00
Michael Ilseman	fb9a99d2cf	Use present fast-math flags when applicable in CreateBinOp We were previously not adding fast-math flags through CreateBinOp() when it happened to be making a floating point binary operator. This patch updates it to do so similarly to directly calling CreateF*(). llvm-svn: 196438	2013-12-05 00:32:09 +00:00
Eric Christopher	d051658b5c	Fix comment. llvm-svn: 196437	2013-12-05 00:13:15 +00:00
Cameron McInally	675f9245aa	Add AVX512 patterns for v16i32 broadcast and v2i64 zero extend load. Patch by Aleksey Bader. llvm-svn: 196435	2013-12-05 00:11:25 +00:00
Eric Christopher	4670039c5b	Fix typo. llvm-svn: 196434	2013-12-04 23:55:09 +00:00
David Blaikie	561838d222	DwarfUnit: Correct comment by generalizing over all units, not just compilation units. Code review feedback on r196394 by Paul Robinson. llvm-svn: 196433	2013-12-04 23:39:02 +00:00
Kevin Enderby	218f72b95b	Fix a bug in darwin's 32-bit X86 handling of evaluating fixups. Where it would use a scattered relocation entry but falls back to a normal relocation entry because the FixupOffset is more than 24-bits. The bug is in the X86MachObjectWriter::RecordScatteredRelocation() where it changes reference parameter FixedValue but then returns false to indicate it did not create a scattered relocation entry. The fix is simply to save the original value of the parameter FixedValue at the start of the method and restore it if we are returning false in that case. rdar://15526046 llvm-svn: 196432	2013-12-04 23:36:24 +00:00
Eric Christopher	f46aa7d453	Update comment. llvm-svn: 196431	2013-12-04 23:24:38 +00:00
Eric Christopher	a054e191ad	Update comment. llvm-svn: 196430	2013-12-04 23:24:28 +00:00
Eric Christopher	435de44e9d	Remove incorrect comment and pointless cast. llvm-svn: 196427	2013-12-04 23:05:21 +00:00
Eric Christopher	f8ead1f600	const on its own line is confusing. llvm-svn: 196426	2013-12-04 22:54:45 +00:00
David Peixotto	b6710ff7c7	Add support for parsing ARM symbol variants on ELF targets ARM symbol variants are written with parens instead of @ like this: .word __GLOBAL_I_a(target1) This commit adds support for parsing these symbol variants in expressions. We introduce a new flag to MCAsmInfo that indicates the parser should use parens to parse the symbol variant. The expression parser is modified to look for symbol variants using parens instead of @ when the corresponding MCAsmInfo flag is true. The MCAsmInfo parens flag is enabled only for ARM on ELF. By adding this flag to MCAsmInfo, we are able to get rid of redundant ARM-specific symbol variants and use the generic variants instead (e.g. VK_GOT instead of VK_ARM_GOT). We use the new UseParensForSymbolVariant attribute in MCAsmInfo to correctly print the symbol variants for arm. To achive this we need to keep a handle to the MCAsmInfo in the MCSymbolRefExpr class that we can check when printing the symbol variant. Updated Tests: Changed case of symbol variant to match the generic kind. test/CodeGen/ARM/tls-models.ll test/CodeGen/ARM/tls1.ll test/CodeGen/ARM/tls2.ll test/CodeGen/Thumb2/tls1.ll test/CodeGen/Thumb2/tls2.ll PR18080 llvm-svn: 196424	2013-12-04 22:43:20 +00:00

1 2 3 4 5 ...

98158 Commits