llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Sanjoy Das	99ca015392	[OperandBundles] Treat "deopt" operand bundles specially Teach LLVM to optimize more precisely in the presence of "deopt" operand bundles. "deopt" operand bundles imply that the call they're attached to is at least `readonly` (i.e. they don't imply clobber semantics), and they don't capture their bundle operands. llvm-svn: 254118	2015-11-26 01:16:05 +00:00
Tom Stellard	eb7e999b29	AMDGPU: Add llvm.amdgcn.dispatch.ptr intrinsic Summary: This returns a pointer to the dispatch packet, which can be used to load information about the kernel dispatch. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14898 llvm-svn: 254116	2015-11-26 00:43:29 +00:00
Xinliang David Li	df875576b5	Fix a typo introduced in previous patches llvm-svn: 254112	2015-11-26 00:02:23 +00:00
Xinliang David Li	739b63a55d	[PGO] Implement ValueProfiling Closure interfaces for runtime value profile data This is one of the many steps to commonize value profiling support between profile runtime and compiler/llvm tools. After this change, profiler runtime now can share the same C APIs to do VP serialization/deserialization with LLVM host tools (and produces value data in identical format between indexed and raw profile). It is not yet enabled in profiler runtime. Also added a unit test case to test runtime profile data serialization/deserialization interfaces implemented using common closure code. llvm-svn: 254110	2015-11-25 23:31:18 +00:00
Artyom Skrobov	3803dae0a6	Expose isXxxConstant() functions from SelectionDAGNodes.h (NFC) Summary: Many target lowerings copy-paste the code to test SDValues for known constants. This code can instead be shared in SelectionDAG.cpp, and reused in the targets. Reviewers: MatzeB, andreadb, tstellarAMD Subscribers: arsenm, jyknight, llvm-commits Differential Revision: http://reviews.llvm.org/D14945 llvm-svn: 254085	2015-11-25 19:41:11 +00:00
Eric Christopher	5f84aed4f6	Fix some places where we were assuming that memory type had been legalized to a simple type when lowering a truncating store of a vector type. In this case for an EVT we'll return Expand as we should in all of the cases anyhow. The testcase triggered at the one in VectorLegalizer::LegalizeOp, inspection found the rest. llvm-svn: 254061	2015-11-25 09:11:53 +00:00
Xinliang David Li	29597bc958	[PGO] Convert InstrProfRecord based serialization methods to use common C methods 1. Convert serialization methods using InstrProfRecord as source into C (impl) interfaces using Closure. 2. Reimplement InstrProfRecord serialization method to use new C interface as dummy wrapper. Now it is ready to implement wrapper for runtime value profile data. (The new code needs better source location -- but not changed in this patch to minimize diffs.) llvm-svn: 254057	2015-11-25 06:23:38 +00:00
Xinliang David Li	960920a165	[PGO] convert a subset of C++ interfaces into C (for sharing) (NFC) llvm-svn: 254056	2015-11-25 04:29:24 +00:00
Xinliang David Li	03ec0b37d7	Add missing documentation. (NFC) llvm-svn: 254051	2015-11-25 01:13:44 +00:00
Sanjoy Das	dcc5bddb02	[OperandBundles] Extract duplicated code into a helper function, NFC llvm-svn: 254047	2015-11-25 00:42:24 +00:00
Sanjoy Das	d16b4e5c5e	[InstCombine] Don't drop operand bundles Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14857 llvm-svn: 254046	2015-11-25 00:42:19 +00:00
Rong Xu	c4f897c441	[PGO] Revert revision r254021,r254028,r254035 Revert the above revision due to multiple issues. llvm-svn: 254040	2015-11-24 23:49:08 +00:00
Xinliang David Li	ddeee8f963	[PGO] Add mapper callback to interfaces retrieving value data for site (NFC) This allows cleaner implementation and merging retrieving/mapping in one pass. llvm-svn: 254038	2015-11-24 23:36:52 +00:00
Rong Xu	025bf7be0c	[PGO] MST based PGO instrumentation infrastructure This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees minimum number of CFG edges getting instrumented. An additional optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 254021	2015-11-24 21:31:25 +00:00
Cong Hou	c0bb26286b	[X86] Fix several issues related to X86's psadbw instruction. This patch fixes the following issues: 1. Fix the return type of X86psadbw: it should not be the same type of inputs. For vNi8 inputs the output should be vMi64, where M = N/8. 2. Fix the return type of int_x86_avx512_psad_bw_512 accordingly. 3. Fix the definition of PSADBW, VPSADBW, and VPSADBWY accordingly. 4. Adjust the return type when building a DAG node of X86ISD::PSADBW type. 5. Update related tests. Differential revision: http://reviews.llvm.org/D14897 llvm-svn: 254010	2015-11-24 19:51:26 +00:00
Xinliang David Li	9766247b4e	[PGO] Introduce value profile data closure type. The closure is designed to abstract away two types of value profile data: - InstrProfRecord which is the primary data structure used to represent profile data in host tools (reader, writer, and profile-use) - value profile runtime data structure suitable to be used by C runtime library. Both sources of data need to serialize to disk/memory-buffer in common format: ValueProfData. The abstraction allows compiler-rt's raw profiler writer to share the same code with indexed profile writer. llvm-svn: 254008	2015-11-24 19:21:15 +00:00
Xinliang David Li	620aee58f0	[PGO] Small interface change to be profile rt ready Convert two C++ static member functions to be C APIs. This is one of the many steps to get ready to share VP writer code with profiler runtime. llvm-svn: 253999	2015-11-24 18:15:46 +00:00
Xinliang David Li	57937fbcb6	Minor refactor to make VP writing more efficient llvm-svn: 253994	2015-11-24 17:03:24 +00:00
Krzysztof Parzyszek	ce2383b240	Add vector types for intrinsics Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 253992	2015-11-24 16:28:14 +00:00
Krzysztof Parzyszek	450319e8a0	Add new vector types for 512-, 1024- and 2048-bit vectors Those types are needed to implement instructions for Hexagon Vector Extensions (HVX): 16x32, 16x64, 32x16, 32x32, 32x64, 64x8, 64x16, 64x32, 128x8, 128x16, 256x8, 512x1, and 1024x1. llvm-svn: 253978	2015-11-24 13:07:35 +00:00
Cong Hou	5747eb82f8	Let SelectionDAG start to use probability-based interface to add successors. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes. 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights. 3. Use new interfaces in all other passes. 4. Remove old interfaces. This is the second patch above. In this patch SelectionDAG starts to use probability-based interfaces in MBB to add successors but other MC passes are still using weight-based interfaces. Therefore, we need to maintain correct weight list in MBB even when probability-based interfaces are used. This is done by updating weight list in probability-based interfaces by treating the numerator of probabilities as weights. This change affects many test cases that check successor weight values. I will update those test cases once this patch looks good to you. Differential revision: http://reviews.llvm.org/D14361 llvm-svn: 253965	2015-11-24 08:51:23 +00:00
Mehdi Amini	2fe02188ef	Add a FunctionImporter helper to perform summary-based cross-module function importing Summary: This is a helper to perform cross-module import for ThinLTO. Right now it is importing naively every possible called functions. Reviewers: tejohnson Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14914 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253954	2015-11-24 06:07:49 +00:00
Mehdi Amini	53aa625845	Add findFunctionInfoList() accessor to FunctionInfoIndex. Summary: This allows to query for a function in the map without creating an entry, allowing to use a const FunctionInfoIndex. Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14912 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253953	2015-11-24 06:07:42 +00:00
Davide Italiano	44f68f6357	[DIE] Make DIE.h NDEBUG conditional-free. Switch dump()/print() method definitions to LLVM_DUMP_METHOD instead. llvm-svn: 253945	2015-11-24 02:21:43 +00:00
Xinliang David Li	baa2f77b42	Use make_unique [NFC] llvm-svn: 253942	2015-11-24 00:32:00 +00:00
Xinliang David Li	02a0716447	Remove trailing space in comments llvm-svn: 253941	2015-11-24 00:31:41 +00:00
Krzysztof Parzyszek	af76cac3cc	Revert r253923. Per Eric's request. llvm-svn: 253928	2015-11-23 22:19:57 +00:00
Krzysztof Parzyszek	6c363eee43	Add new vector types for 512-, 1024- and 2048-bit vectors Those types are needed to implement instructions for Hexagon Vector Extensions (HVX): 16x32, 16x64, 32x16, 32x32, 32x64, 64x8, 64x16, 64x32, 128x8, 128x16, 256x8, 512x1, and 1024x1. llvm-svn: 253923	2015-11-23 22:00:17 +00:00
Nathan Slingerland	a06290a805	[Support] Add optional argument to SaturatingAdd() and SaturatingMultiply() to indicate that overflow occurred Summary: Adds the ability for callers to detect when saturation occurred on the result of saturating addition/multiplication. Reviewers: davidxl, silvas, rsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14931 llvm-svn: 253921	2015-11-23 21:54:22 +00:00
Xinliang David Li	0f29a15199	[PGO] Add --text option for llvm-profdata show|merge commands The new option is similar to the SampleProfile dump option. - dump raw/indexed format into text profile format - merge the profile and output into text profile format. Note that Value Profiling data text format is not yet designed. That functionality will be added later. Differential Revision: http://reviews.llvm.org/D14894 llvm-svn: 253913	2015-11-23 20:47:38 +00:00
Teresa Johnson	c0ecb3ad3f	[ThinLTO] Deduplicate function index loading into shared helper (NFC) Add a shared helper routine to read the function index from a file and create/return the function index object. Use it in llvm-link and llvm-lto. llvm-svn: 253903	2015-11-23 19:19:11 +00:00
Xinliang David Li	0b39dbc2f8	[PGO] Introduce alignment macro for instr-prof control data(NFC) llvm-svn: 253893	2015-11-23 18:02:59 +00:00
Xinliang David Li	a6bd292bad	Fix comment not allowed in C90 llvm-svn: 253880	2015-11-23 17:05:45 +00:00
Nathan Slingerland	3f093e190d	[Support] Fix SaturatingMultiply<T>() to be correct (and fast), Re-enable Unit Tests Summary: This change fixes the SaturatingMultiply<T>() function template to not cause undefined behavior with T=uint16_t. Thanks to Richard Smith's contribution, it also no longer requires an integer division. Patch by Richard Smith. Reviewers: silvas, davidxl Subscribers: rsmith, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14845 llvm-svn: 253870	2015-11-23 15:33:43 +00:00
Xinliang David Li	4878e2f9f9	Move two Value Profiler data structs to InstrProfData.inc (NFC) llvm-svn: 253848	2015-11-23 05:29:51 +00:00
Xinliang David Li	076c9c59a2	[PGO] Fix remaining bugs in ProfData template file (when used by compiler-rt) 1. move const qualifier out of raw header field type as runtime use of the header needs to initialize the fields 2. use C style casting for integer types. llvm-svn: 253844	2015-11-23 03:49:07 +00:00
Mehdi Amini	b100a96489	Add const qualifier for FunctionInfoIndex in ModuleLinker and linkInModule() (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253840	2015-11-23 01:59:16 +00:00
Mehdi Amini	60d59439ad	Add const qualifier on FunctionInfoIndex::hasExportedFunctions() (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253839	2015-11-23 01:59:12 +00:00
Benjamin Kramer	ddaab51a02	[SCEV] Simplify code. NFC. llvm-svn: 253825	2015-11-22 17:27:27 +00:00
Krzysztof Parzyszek	56c6ad68a8	Revert r253810. The builds should be fine now. llvm-svn: 253822	2015-11-22 16:13:51 +00:00
Krzysztof Parzyszek	73d3ee06e8	Avoid dependency between TableGen and CodeGen Duplicate a few common definitions between DFAPacketizer.cpp and DFAPacketizerEmitter.cpp to avoid including files from CodeGen in TableGen. llvm-svn: 253820	2015-11-22 15:20:19 +00:00
Xinliang David Li	0a6ec9cd2c	[PGO] move names of runtime sections definitions to InstrProfData.inc In profile runtime implementation for Darwin, Linux and FreeBSD, the names of sections holding profile control/counter/naming data need to be known by the runtime in order to locate the start/end of the data. Moving the name definitions to the common file to specify the connection. llvm-svn: 253814	2015-11-22 05:42:31 +00:00
NAKAMURA Takumi	c91f2dd88f	Temporary fix broken build.ninja after r253790. FIXME: This can be reverted several hours later. r253790 introduced cyclic deps around llvm-tblgen and it was affecting after reverting. ninja: error: dependency cycle: include/llvm/IR/Attributes.inc -> include/llvm/IR/Attributes.inc.tmp -> bin/llvm-tblgen -> utils/TableGen/CMakeFiles/obj.llvm-tblgen.dir/DFAPacketizerEmitter.cpp.o -> include/llvm/IR/Attributes.inc It may be a ninja's bug. FYI, renaming DFAPacketizerEmitter.cpp would be useless. llvm-svn: 253810	2015-11-22 02:32:49 +00:00
Xinliang David Li	fc1d455417	[PGO] move raw magic and version def to InstrProfData.inc These are shared definitions too. (NFC) llvm-svn: 253809	2015-11-22 02:05:50 +00:00
Xinliang David Li	dde2bc76c3	[PGO] InstrProf Template file documentation change Add more complete description of the content and structure of the template file. Made the comment in C style to be shared by C runtime. Also enhance the file structure so that it can be included as standalone header for common definitions. llvm-svn: 253807	2015-11-22 01:51:31 +00:00
Xinliang David Li	241cfd4800	[PGO] Move Value Profile Kind to InstrProfData.inc ValueProfKind value affects runtime data structure and definition is shared between compiler-rt and LLVM. llvm-svn: 253806	2015-11-22 01:39:07 +00:00
Xinliang David Li	b823f0597b	[PGO] Define value profiling updater API signature in InstrProfData.inc (NFC) llvm-svn: 253805	2015-11-22 00:22:07 +00:00
Rafael Espindola	9cb8841b77	Have a single way for creating unique value names. We had two code paths. One would create names like "foo.1" and the other names like "foo1". For globals it is important to use "foo.1" to help C++ name demangling. For locals there is no strong reason to go one way or the other so I kept the most common mangling (foo1). llvm-svn: 253804	2015-11-22 00:16:24 +00:00
Xinliang David Li	64fc4b9d72	[PGO] Move Raw Header def into template file InstrProfData.inc To enable code sharing with compiler-rt (NFC) llvm-svn: 253803	2015-11-22 00:06:39 +00:00
Teresa Johnson	2b4369dd03	[ThinLTO] Handle bitcode without function summary sections gracefully Summary: Several fixes to the handling of bitcode files without function summary sections so that they are skipped during ThinLTO processing in llvm-lto and the gold plugin when appropriate instead of aborting. 1 Don't assert when trying to add a FunctionInfo that doesn't have a summary attached. 2 Skip FunctionInfo structures that don't have attached function summary sections when trying to create the combined function summary. 3 In both llvm-lto and gold-plugin, check whether a bitcode file has a function summary section before trying to parse the index, and skip the bitcode file if it does not. 4 Fix hasFunctionSummaryInMemBuffer in BitcodeReader, which had a bug where we returned too early while looking for the summary section. Also added llvm-lto and gold-plugin based tests for cases where we don't have function summaries in the bitcode file. I verified that either the first couple fixes described above are enough to avoid the crashes, or fixes 1,3,4. But have combined them all here for added robustness. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14903 llvm-svn: 253796	2015-11-21 21:55:48 +00:00
Simon Pilgrim	37096d919a	[MachineInstrBuilder] Support for adding a ConstantPoolIndex MO with an additional offset. MachineInstrBuilder::addDisp can already add an immediate or global address MO with an adjusted offset, this patch adds support for constant pool indices as well. All remaining MO types still assert - there are a number of other types that could support adjusted offsets but I have no test cases at this time. Required to fix a regression in D13988 found by Mikael Holmén during stress testing (test case attached). Differential Revision: http://reviews.llvm.org/D14867 llvm-svn: 253795	2015-11-21 21:42:26 +00:00
Krzysztof Parzyszek	3a2a5e0f60	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource | combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. Reapply the previous patch, this time without circular dependencies. llvm-svn: 253793	2015-11-21 20:00:45 +00:00
Krzysztof Parzyszek	daec852689	Revert r253790: it breaks all builds for some reason. llvm-svn: 253791	2015-11-21 17:38:33 +00:00
Krzysztof Parzyszek	e1cf64ffc3	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource | combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. llvm-svn: 253790	2015-11-21 17:23:52 +00:00
Rong Xu	c3cbbad5f2	Add some constantness to GetSuccessorNumber(). llvm-svn: 253733	2015-11-20 23:02:06 +00:00
Nathan Slingerland	aae398c96d	[llvm-profdata] Add merge() to InstrProfRecord Summary: This change refactors two aspects of InstrProfRecord: 1) Add a merge() method to InstrProfRecord (previously InstrProfWriter combineInstrProfRecords()) in order to better encapsulate this functionality and to make the InstrProfRecord and SampleRecord APIs more consistent. 2) Make InstrProfRecord mergeValueProfData() a private method since it is only ever called internally by merge(). Reviewers: dnovillo, bogner, davidxl Subscribers: silvas, vsk, llvm-commits Differential Revision: http://reviews.llvm.org/D14786 llvm-svn: 253695	2015-11-20 19:12:43 +00:00
Artyom Skrobov	418223d583	Avoid duplicate entry for cortex-a7 in the TargetParser (NFC) Reviewers: t.p.northover, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14757 llvm-svn: 253676	2015-11-20 16:46:14 +00:00
Artyom Skrobov	433c2c5f72	Handle ARMv6-J as an alias, instead of fake architecture Summary: This follows D14577 to treat ARMv6-J as an alias for ARMv6, instead of an architecture in its own right. The functional change is that the default CPU when targeting ARMv6-J changes from arm1136j-s to arm1136jf-s, which is currently used as the default CPU for ARMv6; both are, in fact, ARMv6-J CPUs. The J-bit (Jazelle support) is irrelevant to LLVM, and it doesn't affect code generation, attributes, optimizations, or anything else, apart from selecting the default CPU. Reviewers: rengolin, logan, compnerd Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14755 llvm-svn: 253675	2015-11-20 16:46:09 +00:00
Teresa Johnson	45c27ecc06	[ThinLTO] Add MODULE_CODE_METADATA_VALUES record Summary: This is split out from the ThinLTO metadata mapping patch http://reviews.llvm.org/D14752. To avoid needing to parse the module level metadata during function importing, a new module-level record is added which holds the number of module-level metadata values. This is required because metadata value ids are assigned implicitly during parsing, and the function-level metadata ids start after the module-level metadata ids. I made a change to this version of the code compared to D14752 in order to add more consistent and thorough assertion checking of the new record value. We now unconditionally use the record value to initialize the MDValueList size, and handle it the same in parseMetadata for all module level metadata cases (lazy loading or not). Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14825 llvm-svn: 253668	2015-11-20 14:51:27 +00:00
Daniel Sanders	d8933d6afc	Revert the revert 253497 and 253539 - These commits aren't the cause of the clang-cmake-mips failures. Sorry for the noise. llvm-svn: 253662	2015-11-20 13:13:53 +00:00
Daniel Sanders	be30394dc7	Revert 253497 and 253539 to try to fix clang-cmake-mips buildbot. It caused link errors of the form: InstrProfiling.c:(.text.__llvm_profile_instrument_target+0x1c0): undefined reference to `__sync_fetch_and_add_8' We had a network outage at the time of the commit so the first build to show a problem is http://lab.llvm.org:8011/builders/clang-cmake-mips/builds/10827 llvm-svn: 253656	2015-11-20 10:07:11 +00:00
Tobias Edler von Koch	0ee12e019e	[LTO] Add option to emit assembly from LTOCodeGenerator This adds a new API, LTOCodeGenerator::setFileType, to choose the output file format for LTO CodeGen. A corresponding change to use this new API from llvm-lto and a test case is coming in a separate commit. Differential Revision: http://reviews.llvm.org/D14554 llvm-svn: 253622	2015-11-19 23:59:24 +00:00
Arch D. Robison	395b71970b	Cleanup some -Wundef warnings in include/llvm/Support/MathExtras.h Fix avoids gratuitous warnings from gcc for "_MSC_VER" not being defined. Differential Revision: http://reviews.llvm.org/D14598 Patch by Tony Kelman <tony@kelman.net> llvm-svn: 253614	2015-11-19 22:37:26 +00:00
Hans Wennborg	1ead7346cd	X86: More efficient legalization of wide integer compares In particular, this makes the code for 64-bit compares on 32-bit targets much more efficient. Example: define i32 @test_slt(i64 %a, i64 %b) { entry: %cmp = icmp slt i64 %a, %b br i1 %cmp, label %bb1, label %bb2 bb1: ret i32 1 bb2: ret i32 2 } Before this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax setae %al cmpl 16(%esp), %ecx setge %cl je .LBB2_2 movb %cl, %al .LBB2_2: testb %al, %al jne .LBB2_4 movl $1, %eax retl .LBB2_4: movl $2, %eax retl After this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax sbbl 16(%esp), %ecx jge .LBB1_2 movl $1, %eax retl .LBB1_2: movl $2, %eax retl Differential Revision: http://reviews.llvm.org/D14496 llvm-svn: 253572	2015-11-19 16:35:08 +00:00
Diego Novillo	0b8eea8df5	SamplePGO - Sort samples by source location when emitting as text. When dumping function samples or writing them out as text format, it helps if the samples are emitted sorted by source location. The sorting of the maps is a bit slow, so we only do it on demand. llvm-svn: 253568	2015-11-19 15:33:08 +00:00
Igor Breger	0a68600909	AVX512: Implemented encoding, intrinsics and DAG lowering for VMOVDDUP instructions. Differential Revision: http://reviews.llvm.org/D14702 llvm-svn: 253548	2015-11-19 08:26:56 +00:00
Pete Cooper	b753649d63	Revert "Change memcpy/memset/memmove to have dest and source alignments." This reverts commit r253511. This likely broke the bots in http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/builds/20202 http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/3787 llvm-svn: 253543	2015-11-19 05:56:52 +00:00
Mehdi Amini	b5fccc4f2e	Do not require a Context to extract the FunctionIndex from Bitcode (NFC) The LLVMContext was only used for Diagnostic. Pass a DiagnosticHandler instead. Differential Revision: http://reviews.llvm.org/D14794 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253540	2015-11-19 05:52:29 +00:00
Reid Kleckner	709b4c44da	Initialize PersistentId for HandleSDNode, as these will never be inserted into the DAG llvm-svn: 253524	2015-11-19 00:05:09 +00:00
Xinliang David Li	d80e9e19b2	Minor cleanups (from review feedback) 1. remove unneeded header inclusion 2. use reinterpret_cast instead of C style 3. other format change llvm-svn: 253515	2015-11-18 22:42:27 +00:00
Pete Cooper	aca4c5cdc6	Change memcpy/memset/memmove to have dest and source alignments. Note, this was reviewed (and more details are in) http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html These intrinsics currently have an explicit alignment argument which is required to be a constant integer. It represents the alignment of the source and dest, and so must be the minimum of those. This change allows source and dest to each have their own alignments by using the alignment attribute on their arguments. The alignment argument itself is removed. There are a few places in the code for which the code needs to be checked by an expert as to whether using only src/dest alignment is safe. For those places, they currently take the minimum of src/dest alignments which matches the current behaviour. For example, code which used to read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 500, i32 8, i1 false) will now read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 8 %dest, i8* align 8 %src, i32 500, i1 false) For out of tree owners, I was able to strip alignment from calls using sed by replacing: (call.*llvm\.memset.*)i32\ [0-9]*\,\ i1 false\) with: $1i1 false) and similarly for memmove and memcpy. I then added back in alignment to test cases which needed it. A similar commit will be made to clang which actually has many differences in alignment as now IRBuilder can generate different source/dest alignments on calls. In IRBuilder itself, a new argument was added. Instead of calling: CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, /* isVolatile */ false) you now call CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, SrcAlign, /* isVolatile */ false) There is a temporary class (IntegerAlignment) which takes the source alignment and rejects implicit conversion from bool. This is to prevent isVolatile here from passing its default parameter to the source alignment. Note, changes in future can now be made to codegen. I didn't change anything here, but this change should enable better memcpy code sequences. Reviewed by Hal Finkel. llvm-svn: 253511	2015-11-18 22:17:24 +00:00
Nathan Slingerland	7f6dd7b9db	[llvm-profdata] Add SaturatingAdd/SaturatingMultiply Helper Functions (2nd try) Summary: This change adds MathExtras helper functions for handling unsigned, saturating addition and multiplication. It also updates the instrumentation and sample profile merge implementations to use them. Reviewers: dnovillo, bogner, davidxl Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14720 llvm-svn: 253497	2015-11-18 20:40:41 +00:00
Sanjoy Das	0579a5794f	[OperandBundles] Address review on r253446; NFC Post-commit review by David Blaikie, thanks David! llvm-svn: 253494	2015-11-18 19:44:59 +00:00
Betul Buyukkurt	b3b3ea9a07	[PGO] Value profiling support This change introduces an instrumentation intrinsic instruction for value profiling purposes, the lowering of the instrumentation intrinsic and raw reader updates. The raw profile data files for llvm-profdata testing are updated. llvm-svn: 253484	2015-11-18 18:14:55 +00:00
Bradley Smith	dfed7faa7a	[ARM] Add +feature names to TargetParser extensions table llvm-svn: 253470	2015-11-18 16:32:12 +00:00
Manuel Klimek	5e6bc701b0	Fix bug where WinCOFFObjectWriter would assume starting from an empty output. Starting on an input stream that is not at offset 0 would trigger the assert in WinCOFFObjectWriter.cpp:1065: assert(getStream().tell() <= (*i)->Header.PointerToRawData && "Section::PointerToRawData is insane!"); llvm-svn: 253464	2015-11-18 15:24:17 +00:00
Fraser Cormack	8db5b22a00	Fix typo in comment. NFC. llvm-svn: 253462	2015-11-18 15:02:59 +00:00
Asaf Badouh	e49f73285d	[X86][AVX512CD] add mask broadcast intrinsics Differential Revision: http://reviews.llvm.org/D14573 llvm-svn: 253450	2015-11-18 09:42:45 +00:00
Sanjoy Das	3bc2ba29a2	[OperandBundles] Tighten OperandBundleDef's interface; NFC llvm-svn: 253446	2015-11-18 08:30:07 +00:00
Rafael Espindola	123f2ae05e	Default SetVector to use a DenseSet. We used to have an odd difference among MapVector and SetVector. The map used a DenseMap, but the set used a SmallSet, which in turn uses a std::set. I have changed SetVector to use a DenseSet. If you were depending on the old behaviour you can pass an explicit set type or use SmallSetVector. The common cases for needing to do it are: * Optimizing for small sets. * Sets for types not supported by DenseSet. llvm-svn: 253439	2015-11-18 06:52:18 +00:00
Sanjoy Das	f5a4d357df	Teach the inliner to track deoptimization state Summary: This change teaches LLVM's inliner to track and suitably adjust deoptimization state (tracked via deoptimization operand bundles) as it inlines through call sites. The operation is described in more detail in the LangRef changes. Reviewers: reames, majnemer, chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14552 llvm-svn: 253438	2015-11-18 06:23:38 +00:00
Rafael Espindola	2c21fe4650	Stop producing .data.rel sections. If a section is rw, it is irrelevant if the dynamic linker will write to it or not. It looks like llvm implemented this because gcc was doing it. It looks like gcc implemented this in the hope that it would put all the relocated items close together and speed up the dynamic linker. There are two problems with this: * It doesn't work. Both bfd and gold will map .data.rel to .data and concatenate the input sections in the order they are seen. * If we want a feature like that, it can be implemented directly in the linker since it knows where the dynamic relocations are. llvm-svn: 253436	2015-11-18 06:02:15 +00:00
Reid Kleckner	279a366058	Attempt to fix uninitialized SDAG persistent ids detected by MSan llvm-svn: 253422	2015-11-18 01:21:06 +00:00
Cong Hou	bef97bc828	Let += and -= operators in BranchProbability have saturation behaviors. This commit is for a later patch that depends on it. The sum of two branch probabilities can be greater than 1 due to rounding. It is safer to saturate the results of sum and subtraction. llvm-svn: 253421	2015-11-18 01:20:37 +00:00
Cong Hou	d07ce5a36e	Modify the interface BranchProbability::normalizeProbabilities to let it accept a pair of iterators. NFC. llvm-svn: 253417	2015-11-18 01:03:19 +00:00
Nathan Slingerland	07e999d3a8	Revert "[llvm-profdata] Add SaturatingAdd/SaturatingMultiply Helper Functions" Not ready for merge. llvm-svn: 253415	2015-11-18 00:55:15 +00:00
Nathan Slingerland	4ee48b13f5	[llvm-profdata] Add SaturatingAdd/SaturatingMultiply Helper Functions Summary: This change adds MathExtras helper functions for handling unsigned, saturating addition and multiplication. It also updates the instrumentation and sample profile merge implementations to use them. No functional changes. Reviewers: dnovillo, bogner, davidxl Subscribers: davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D14720 llvm-svn: 253412	2015-11-18 00:52:43 +00:00
David Blaikie	133cb3aa66	Generalize ownership/passing semantics to allow dsymutil to own abbreviations via unique_ptr While still allowing CodeGen/AsmPrinter in llvm to own them using a bump ptr allocator. (might be nice to replace the pointers there with something that at least automatically calls their dtors, if that's necessary/useful, rather than having it done explicitly (I think a typed BumpPtrAllocator already does this, or maybe a unique_ptr with a custom deleter, etc)) llvm-svn: 253409	2015-11-18 00:34:10 +00:00
David Blaikie	7535079532	Fix read-of-uninitialized introduced in r253277 exposed on some buildbots Verified that this was at least /an/ issue, if not the only one, by initializing NumBuckets to 1 (previously it was uninitialized, so if this change made a difference, which it did (causing a bunch of tests to crash) it demonstrates use-of-uninitialized memory). Initializing then removes the crashes. Thanks Reid for the debugging assistance llvm-svn: 253395	2015-11-17 23:26:06 +00:00
Xinliang David Li	71e8ce0ba8	[PGO] Move value profile data definitions out of IndexedInstrProf Move the data structure definitions out of the namespace. The defs will be shared by raw format. [NFC] llvm-svn: 253394	2015-11-17 23:00:40 +00:00
Reid Kleckner	00daa6cd20	[WinEH] Move WinEHFuncInfo from MachineModuleInfo to MachineFunction Summary: Now that there is a one-to-one mapping from MachineFunction to WinEHFuncInfo, we don't need to use a DenseMap to select the right WinEHFuncInfo for the current funclet. The main challenge here is that X86WinEHStatePass is an IR pass that doesn't have access to the MachineFunction. I gave it its own WinEHFuncInfo object that it uses to calculate state numbers, which it then throws away. As long as nobody creates or removes EH pads between this pass and SDAG construction, we will get the same state numbers. The other thing X86WinEHStatePass does is to mark the EH registration node. Instead of communicating which alloca was the registration through WinEHFuncInfo, I added the llvm.x86.seh.ehregnode intrinsic. This intrinsic generates no code and simply marks the alloca in use. Reviewers: JCTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14668 llvm-svn: 253378	2015-11-17 21:10:25 +00:00
David Blaikie	60b032961b	dwarfdump: Reference the appropriate line table segment when dumping dwp files Also improves .dwo type unit dumping which didn't handle this either. llvm-svn: 253377	2015-11-17 21:08:05 +00:00
Yunzhong Gao	b3793fa262	Switch lto codegen to using diagnostic handlers. This patch removes the std::string& argument from a number of C++ LTO API calls and instead makes them use the installed diagnostic handler. This would also improve consistency of diagnostic handling infrastructure: if an LTO client used lto_codegen_set_diagnostic_handler() to install a custom error handler, we do not want some error messages to go through the custom error handler, and some other error messages to go into sLastErrorString. llvm-svn: 253367	2015-11-17 19:48:12 +00:00
Diego Novillo	ffc4999903	SamplePGO - Move debug/dump function bodies out of header files. NFC. No point polluting the header declarations with debugging code. llvm-svn: 253361	2015-11-17 19:04:46 +00:00
David Blaikie	6d88ef926e	StringRef-ify some Option APIs Patch by Eugene Kosov! Differential Revision: http://reviews.llvm.org/D14711 llvm-svn: 253360	2015-11-17 19:00:52 +00:00
Oliver Stannard	e538054e6d	[Assembler] Make fatal assembler errors non-fatal Currently, if the assembler encounters an error after parsing (such as an out-of-range fixup), it reports this as a fatal error, and so stops after the first error. However, for most of these there is an obvious way to recover after emitting the error, such as emitting the fixup with a value of zero. This means that we can report on all of the errors in a file, not just the first one. MCContext::reportError records the fact that an error was encountered, so we won't actually emit an object file with the incorrect contents. Differential Revision: http://reviews.llvm.org/D14717 llvm-svn: 253328	2015-11-17 10:00:43 +00:00
Oliver Stannard	90a74252e6	[Assembler] Allow non-fatal errors after parsing This adds reportError to MCContext, which can be used as an alternative to reportFatalError when the assembler wants to try to continue processing the rest of the file after the error is reported, so that all of the errors in a file can be reported. It records the fact that an error was encountered, so we can avoid emitting an object file if any errors occurred. This patch doesn't add any uses of this function (a later patch will convert most uses of reportFatalError to use it), but there is a small functional change: we use the SourceManager to print the error message, even if we have a null SMLoc. This means that we get a SourceManager-style message, with the file and line information shown as <unknown>, rather than the "LLVM ERROR" style used by report_fatal_error. llvm-svn: 253327	2015-11-17 09:58:07 +00:00
Jay Foad	bdde8e00f5	Fix typos in comments. llvm-svn: 253324	2015-11-17 08:54:53 +00:00
David Majnemer	7fe7d83fab	[AliasAnalysis] CatchPad and CatchRet can modify escaped memory CatchPad and CatchRet behave a lot like function calls: they can potentially modify any memory which has been escaped. llvm-svn: 253323	2015-11-17 08:15:14 +00:00
Rafael Espindola	b080f6e9fc	Add MemoryBufferRef(MemoryBuffer&) constructor. patch by Jonathan Anderson! llvm-svn: 253311	2015-11-17 05:11:44 +00:00

24931 Commits