llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Klaus Kretzschmar	90b39019b6	[Support] - Check nullptr after allocation with malloc in MallocAllocator - Differential Revision: http://reviews.llvm.org/D34753 llvm-svn: 322944	2018-01-19 14:17:53 +00:00
Florian Hahn	0dd4ced3cd	[ValueLattice] Use getters instead of direct accesses (NFC). Reviewers: reames, davide, anna Reviewed By: reames, davide Differential Revision: https://reviews.llvm.org/D42270 llvm-svn: 322933	2018-01-19 10:32:48 +00:00
Matthias Braun	c4d207b4b3	Split MachineLICM into EarlyMachineLICM and MachineLICM; NFC This avoids playing games with pseudo pass IDs and avoids using an unreliable MRI::isSSA() check to determine whether register allocation has happened. Note that this renames: - MachineLICMID -> EarlyMachineLICM - PostRAMachineLICMID -> MachineLICMID to be consistent with the EarlyTailDuplicate/TailDuplicate naming. llvm-svn: 322927	2018-01-19 06:46:10 +00:00
Matthias Braun	b211ae273d	Split TailDuplicatePass into pre- and post-RA variant; NFC Split TailDuplicatePass into EarlyTailDuplicate and TailDuplicate. This avoids playing games with fake pass IDs and using MRI::isSSA() to determine pre-/post-RA state. llvm-svn: 322926	2018-01-19 06:08:17 +00:00
Matthias Braun	e45ebab2b3	AArch64: Fix emergency spillslot being out of reach for large callframes Re-commit of r322200: The testcase shouldn't hit machineverifiers anymore with r322917 in place. Large callframes (calls with several hundreds or thousands or parameters) could lead to situations in which the emergency spillslot is out of range to be addressed relative to the stack pointer. This commit forces the use of a frame pointer in the presence of large callframes. This commit does several things: - Compute max callframe size at the end of instruction selection. - Add mirFileLoaded target callback. Use it to compute the max callframe size after loading a .mir file when the size wasn't specified in the file. - Let TargetFrameLowering::hasFP() return true if there exists a callframe > 255 bytes. - Always place the emergency spillslot close to FP if we have a frame pointer. - Note that `useFPForScavengingIndex()` would previously return false when a base pointer was available leading to the emergency spillslot getting allocated late (that's the whole effect of this callback). Which made no sense to me so I took this case out: Even though the emergency spillslot is technically not referenced by FP in this case we still want it allocated early. Differential Revision: https://reviews.llvm.org/D40876 llvm-svn: 322919	2018-01-19 03:16:36 +00:00
Lang Hames	a44f59218b	[ORC] Revert r322913 while I investigate an ASan failure. llvm-svn: 322914	2018-01-19 01:40:26 +00:00
Lang Hames	6784638299	[ORC] Redesign the JITSymbolResolver interface to support bulk queries. Bulk queries reduce IPC/RPC overhead for cross-process JITing and expose opportunities for parallel compilation. The two new query methods are lookupFlags, which finds the flags for each of a set of symbols; and lookup, which finds the address and flags for each of a set of symbols. (See doxygen comments for more details.) The existing JITSymbolResolver class is renamed LegacyJITSymbolResolver, and modified to extend the new JITSymbolResolver class using the following scheme: - lookupFlags is implemented by calling findSymbolInLogicalDylib for each of the symbols, then returning the result of calling getFlags() on each of these symbols. (Importantly: lookupFlags does NOT call getAddress on the returned symbols, so lookupFlags will never trigger materialization, and lookupFlags will never call findSymbol, so only symbols that are part of the logical dylib will return results.) - lookup is implemented by calling findSymbolInLogicalDylib for each symbol and falling back to findSymbol if findSymbolInLogicalDylib returns a null result. Assuming a symbol is found its getAddress method is called to materialize it and the result (if getAddress succeeds) is stored in the result map, or the error (if getAddress fails) is returned immediately from lookup. If any symbol is not found then lookup returns immediately with an error. This change will break any out-of-tree derivatives of JITSymbolResolver. This can be fixed by updating those classes to derive from LegacyJITSymbolResolver instead. llvm-svn: 322913	2018-01-19 01:12:40 +00:00
Craig Topper	1c2f80aef6	[X86] Add intrinsic support for the RDPID instruction This adds a new instrinsic to support the rdpid instruction. The implementation is a bit weird because the intrinsic is defined as always returning 32-bits, but the assembler support thinks the instruction produces a 64-bit register in 64-bit mode. But really it zeros the upper 32 bits. So I had to add separate patterns where 64-bit mode uses an extract_subreg. Differential Revision: https://reviews.llvm.org/D42205 llvm-svn: 322910	2018-01-18 23:52:31 +00:00
Reid Kleckner	6c728206f3	[CodeView] Sink complex inline functions to .cpp file, NFC I'm cleaning up this code before I attempt to fix a line table bug. llvm-svn: 322904	2018-01-18 22:55:14 +00:00
Eric Christopher	2cb6bd8f84	Typo fix SIBABRT -> SIGABRT. Based on a patch by Henry Wong! llvm-svn: 322902	2018-01-18 21:45:51 +00:00
Zachary Turner	af5c5622dd	Speed up iteration of CodeView record streams. There's some abstraction overhead in the underlying mechanisms that were being used, and it was leading to an abundance of small but not-free copies being made. This showed up on a profile. Eliminating this and going back to a low-level byte-based implementation speeds up lld with /DEBUG between 10 and 15%. Differential Revision: https://reviews.llvm.org/D42148 llvm-svn: 322871	2018-01-18 18:35:01 +00:00
Francis Visoiu Mistrih	a5ce5aa864	[CodeGen][NFC] Rename IsVerbose to IsStandalone in Machine*::print Committed r322867 too soon. Differential Revision: https://reviews.llvm.org/D42239 llvm-svn: 322868	2018-01-18 18:05:15 +00:00
Francis Visoiu Mistrih	0281a4fd10	[CodeGen] Print RegClasses on MI in verbose mode r322086 removed the trailing information describing reg classes for each register. This patch adds printing reg classes next to every register when individual operands/instructions/basic blocks are printed. In the case of dumping MIR or printing a full function, by default don't print it. Differential Revision: https://reviews.llvm.org/D42239 llvm-svn: 322867	2018-01-18 17:59:06 +00:00
Benjamin Kramer	716dfd78d8	[ADT] Just give up on GCC, I can't fix this. While the memmove workaround fixed it for GCC 6.3. GCC 4.8 and GCC 7.1 are still broken. I have no clue what's going on, just blacklist GCC for now. Needless to say this code is ubsan, asan and msan-clean. llvm-svn: 322862	2018-01-18 16:23:40 +00:00
Benjamin Kramer	fbb4905b3a	[ADT] Add a workaround for GCC miscompiling the trivially copyable Optional I've seen random crashes with GCC 4.8, GCC 6.3 and GCC 7.3, triggered by my Optional change. All of them affect a different set of targets. This change fixes the instance of the problem I'm seeing on my local machine, let's hope it's good enough for the other instances too. llvm-svn: 322859	2018-01-18 15:47:59 +00:00
Klaus Kretzschmar	5779bce977	test commit llvm-svn: 322844	2018-01-18 12:58:50 +00:00
Benjamin Kramer	88a16a89f7	[ADT] Split optional to only include copy mechanics and dtor for non-trivial types. This makes uses of Optional more transparent to the compiler (and clang-tidy) and generates slightly smaller code. This is a re-land of r317019, which had issues with GCC 4.8 back then. Those issues don't reproduce anymore, but I'll watch the buildbots closely in case anything goes wrong. llvm-svn: 322838	2018-01-18 11:26:24 +00:00
Clement Courbet	b50de8f5ac	Revert "Add a value_type to ArrayRef." clang OOMs on arm. This reverts commit a272b2f2ef63f7f602c9ef4d9e10dc4eb9f00aa1. llvm-svn: 322818	2018-01-18 07:26:34 +00:00
Rafael Espindola	bfea7bc2f8	Make GlobalValues with non-default visibilility dso_local. This is similar to r322317, but for visibility. It is not as neat because we have to special case extern_weak. The idea is the same as the previous change, make the transition to explicit dso_local easier for the frontends. With this they only have to add dso_local to symbols where we need some external information to decide if it is dso_local (like it being part of an ELF executable). llvm-svn: 322806	2018-01-18 02:08:23 +00:00
Justin Bogner	e1037058df	GlobalISel: Make MachineCSE runnable in the middle of the GlobalISel Right now, it is not possible to run MachineCSE in the middle of the GlobalISel pipeline. Being able to run generic optimizations between the core passes of GlobalISel was one of the goals of the new ISel framework. This is the first attempt to do it. The problem is that MachineCSE pass assumes all register operands have a register class, which, in GlobalISel context, won't be true until after the InstructionSelect pass. The reason for this behaviour is that before replacing one virtual register with another, MachineCSE pass (and most of the other optimization machine passes) must check if the virtual registers' constraints have a (sufficiently large) intersection, and constrain the resulting register appropriately if such intersection exists. GlobalISel extends the representation of such constraints from just a register class to a triple (low-level type, register bank, register class). This commit adds MachineRegisterInfo::constrainRegAttrs method that extends MachineRegisterInfo::constrainRegClass to such a triple. The idea is that going forward we should use: - RegisterBankInfo::constrainGenericRegister within GlobalISel's InstructionSelect pass - MachineRegisterInfo::constrainRegClass within SelectionDAG ISel - MachineRegisterInfo::constrainRegAttrs everywhere else regardless the target and instruction selector it uses. Patch by Roman Tereshin. Thanks! llvm-svn: 322805	2018-01-18 02:06:56 +00:00
Volkan Keles	4b7dec5ac6	Add a TargetOption to enable/disable GlobalISel Summary: This patch adds a new target option in order to control GlobalISel. This will allow the users to enable/disable GlobalISel prior to the backend by calling `TargetMachine::setGlobalISel(bool Enable)`. No test case as there is already a test to check GlobalISel command line options. See: CodeGen/AArch64/GlobalISel/gisel-commandline-option.ll. Reviewers: qcolombet, aemerson, ab, dsanders Reviewed By: qcolombet Subscribers: rovka, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42137 llvm-svn: 322773	2018-01-17 22:34:21 +00:00
Benjamin Kramer	69fbcfdf9d	Add support for emitting libcalls for x86_fp80 -> fp128 and vice-versa compiler_rt doesn't provide them (yet), but libgcc does. PR34076. llvm-svn: 322772	2018-01-17 22:29:16 +00:00
Easwaran Raman	a01ba52e92	Add a ProfileCount class to represent entry counts. Summary: The class wraps a uint64_t and an enum to represent the type of profile count (real and synthetic) with some helper methods. Reviewers: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41883 llvm-svn: 322771	2018-01-17 22:24:23 +00:00
Zaara Syeda	abe7cb3cc8	Revert [PowerPC] This reverts commit rL322721 Failing build bots. Revert the commit now. llvm-svn: 322748	2018-01-17 20:00:15 +00:00
Aditya Nandakumar	b99ffe01f9	[GISel] Make constrainSelectedInstRegOperands() available to the legalizer. NFC https://reviews.llvm.org/D42149 llvm-svn: 322743	2018-01-17 19:31:33 +00:00
Sam Clegg	b1a2c7d2eb	[WebAssembly] Remove debug names from symbol table Get rid of DEBUG_FUNCTION_NAME symbols. When we actually debug data, maybe we'll want somewhere to put it... but having a symbol that just stores the name of another symbol seems odd. It means you have multiple Symbols with the same name, one containing the actual function and another containing the name! Store the names in a vector on the WasmObjectFile when reading them in. Also stash them on the WasmFunctions themselves. The names are //not// "symbol names" or aliases or anything, they're just the name that a debugger should show against the function body itself. NB. The WasmObjectFile stores them so that they can be exported in the YAML losslessly, and hence the tests can be precise. Enforce that the CODE section has been read in before reading the "names" section. Requires minor adjustment to some tests. Patch by Nicholas Wilson! Differential Revision: https://reviews.llvm.org/D42075 llvm-svn: 322741	2018-01-17 19:28:43 +00:00
Zaara Syeda	8aa2772618	[PowerPC] Add handling for ColdCC calling convention and a pass to mark candidates with coldcc attribute. This patch adds support for the coldcc calling convention for Power. This changes the set of non-volatile registers. It includes a pass to stress test the implementation by marking all static directly called functions with the coldcc attribute through the option -enable-coldcc-stress-test. It also includes an option, -ppc-enable-coldcc, to add the coldcc attribute to functions which are cold at all call sites based on BlockFrequencyInfo when the containing function does not call any non cold functions. Differential Revision: https://reviews.llvm.org/D38413 llvm-svn: 322721	2018-01-17 18:22:55 +00:00
Daniil Fukalov	1683768871	[AMDGPU] add LDS f32 intrinsics added llvm.amdgcn.atomic.{add\|min\|max}.f32 intrinsics to allow generate ds_{add\|min\|max}[_rtn]_f32 instructions needed for OpenCL float atomics in LDS Reviewed by: arsenm Differential Revision: https://reviews.llvm.org/D37985 llvm-svn: 322656	2018-01-17 14:05:05 +00:00
Ivan A. Kosarev	0fdfdf2dc8	[Transforms] Support making mutable versions of new-format TBAA access tags Differential Revision: https://reviews.llvm.org/D41565 llvm-svn: 322650	2018-01-17 13:29:54 +00:00
George Burgess IV	b312be79c8	[Support] Return an enum instead of an unsigned; NFC. We seem to be (logically) returning ArchExtKinds here in all cases, so the return type should reflect that. The static_cast is necessary because `A.ID` is actually an `unsigned`, presumably since we use `decltype(A)` to represent extended attributes for both ARM and AArch64, which use distinct `ArchExtKinds`. We can't trivially make the same change for ARM, because one of the values it returns is the bitwise-or of two `ARM::ArchExtKind`s. llvm-svn: 322613	2018-01-17 03:12:06 +00:00
Aaron Smith	a4bf47e131	Fix pretty printing the unspecified param of a variadic function Summary: - Fix a bug in PrettyBuiltinDumper that returns "void" as the name for an unspecified builtin type. Since the unspecified param of a variadic function is considered a builtin of unspecified type in PDBs, we set "..." for its name. - Provide a method to determine if a PDBSymbolFunc is variadic in PrettyFunctionDumper since PDBSymbolFunc::getArgument() doesn't return the last unspecified-type param. - Add a pretty-func-dumper.test to test pretty dumping of variadic functions. Reviewers: zturner, llvm-commits Reviewed By: zturner Differential Revision: https://reviews.llvm.org/D41801 llvm-svn: 322608	2018-01-17 01:22:03 +00:00
Lang Hames	45bbce283d	[ExecutionEngine] Rename JITSymbol::isStrongDefinition to isStrong. For symmetry with isWeak, isCommon. llvm-svn: 322594	2018-01-16 20:39:51 +00:00
Volkan Keles	eea46f246c	[GlobalISel][TableGen] Add support for SDNodeXForm Summary: This patch adds CustomRenderer which renders the matched operands to the specified instruction. Targets can enable the matching of SDNodeXForm by adding a definition that inherits from GICustomOperandRenderer and GISDNodeXFormEquiv as follows. def gi_imm8 : GICustomOperandRenderer<"renderImm8”>, GISDNodeXFormEquiv<imm8_xform>; Custom renderer functions should be of the form: void render(MachineInstrBuilder &MIB, const MachineInstr &I); Reviewers: dsanders, ab, rovka Reviewed By: dsanders Subscribers: kristof.beyls, javed.absar, llvm-commits, mgrang, qcolombet Differential Revision: https://reviews.llvm.org/D42012 llvm-svn: 322582	2018-01-16 18:44:05 +00:00
Jonas Devlieghere	54c6b81933	[DebugInfo] Unify dumping of address ranges Summary: This patch unifies the printing of address ranges as [0x0, 0x1). rdar://34822059 Reviewers: aprantl, dblaikie Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D42056 llvm-svn: 322543	2018-01-16 11:17:57 +00:00
Francis Visoiu Mistrih	b898f1165e	[CodeGen][NFC] Correct case for printSubRegIdx llvm-svn: 322541	2018-01-16 10:53:11 +00:00
Clement Courbet	504d87cee2	Add a value_type to ArrayRef. Summary: Not sure this needs a review or not. Erring on the safe side. Reviewers: dblaikie Differential Revision: https://reviews.llvm.org/D41666 llvm-svn: 322538	2018-01-16 09:11:20 +00:00
Sam McCall	148ec2b051	Allow function_ref(nullptr) like std::function, since it's nullable already llvm-svn: 322487	2018-01-15 14:43:04 +00:00
Pavel Labath	fe0ecce9d4	[Support] Remove MemoryBuffer::getNewMemBuffer all callers have been switched the the Writable version (which does not require const_casting to be useful). llvm-svn: 322475	2018-01-15 11:03:30 +00:00
Amara Emerson	e611130658	[GlobalISel][Legalizer] Convert some typedefs to using. NFC. llvm-svn: 322466	2018-01-15 00:44:20 +00:00
Rui Ueyama	18bbb959e4	Allow unaligned access to ELF file data structures. The ELF specification says that all ELF data structures are aligned to their natural alignments both in memory and file. That means when we access mmap'ed ELF files, we could assume that all data structures are aligned properly. However, in reality, we assume that the data structures are aligned only to two bytes because .a files only guarantee that their member files are aligned to two bytes in archive files. So the data access is already unaligned. This patch relaxes the alignment requirement even more, so that we accept unaligned access to all ELF data structures. This patch in particular makes lld bug-compatible with icc. Intel C compiler doesn't seem to care about data alignment and generates unaligned relocation sections (https://bugs.llvm.org/show_bug.cgi?id=35854). I also saw another instance of compatibility issues with our internal tool which creates unaligned section headers. Because GNU linkers are not picky about alignment, looks like it is not uncommon that ELF-generating tools create unaligned files. There is a performance penalty with this patch on host machines on which unaligned access is expensive. x86 and AArch64 are fine. ARMv6 is a problem, but I don't think using ARMv6 machines as hosts is common, so I believe it's not a real problem. Differential Revision: https://reviews.llvm.org/D41978 llvm-svn: 322407	2018-01-12 22:09:19 +00:00
Daniel Neilson	b58a2de2b4	[NFC] Change MemIntrinsicInst::setAlignment() to take an unsigned instead of a Constant Summary: In preparation for https://reviews.llvm.org/D41675 this NFC changes this prototype of MemIntrinsicInst::setAlignment() to accept an unsigned instead of a Constant. llvm-svn: 322403	2018-01-12 21:33:37 +00:00
Brian M. Rzycki	504eb62dfb	[JumpThreading] Preservation of DT and LVI across the pass Summary: See D37528 for a previous (non-deferred) version of this patch and its description. Preserves dominance in a deferred manner using a new class DeferredDominance. This reduces the performance impact of updating the DominatorTree at every edge insertion and deletion. A user may call DDT->flush() within JumpThreading for an up-to-date DT. This patch currently has one flush() at the end of runImpl() to ensure DT is preserved across the pass. LVI is also preserved to help subsequent passes such as CorrelatedValuePropagation. LVI is simpler to maintain and is done immediately (not deferred). The code to perform the preversation was minimally altered and simply marked as preserved for the PassManager to be informed. This extends the analysis available to JumpThreading for future enhancements such as threading across loop headers. Reviewers: dberlin, kuhar, sebpop Reviewed By: kuhar, sebpop Subscribers: mgorny, dmgreen, kuba, rnk, rsmith, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40146 llvm-svn: 322401	2018-01-12 21:06:48 +00:00
Rui Ueyama	18f10e166c	Remove ELFDataTypeTypedefHelper class. Differential Revision: https://reviews.llvm.org/D41973 llvm-svn: 322395	2018-01-12 19:59:43 +00:00
Sam Clegg	3c7181ca32	MC: Remove redundant `SetUsed` arguments in MCSymbol methods We can probably take this a step further since the only user of the isUsed flag is AsmParser it should probably be doing this explicitly. For now this is a step in the right direction though. Differential Revision: https://reviews.llvm.org/D41971 llvm-svn: 322386	2018-01-12 18:05:40 +00:00
Rafael Espindola	2ce48efc54	Allow dso_local on ifunc. It was never fully disallowed. We were rejecting it in the asm parser, but not in the verifier. Currently TargetMachine::shouldAssumeDSOLocal returns true for hidden ifuncs. I considered changing it and moving the check from the asm parser to the verifier. The reason for deciding to allow it instead is that all linkers handle a direct reference just fine. They use the plt address as the address of the function. In fact doing that means that clang doesn't have the same bug as gcc: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83782. This patch then removes the check from the asm parser and updates the bitcode reader and writer. llvm-svn: 322378	2018-01-12 17:03:43 +00:00
Rui Ueyama	45248cfb84	Instead of ELFFile<ELFT>::Type, use ELFT::Type. NFC. llvm-svn: 322346	2018-01-12 02:28:31 +00:00
Rui Ueyama	555bcff50e	Fix typo. llvm-svn: 322344	2018-01-12 02:15:41 +00:00
Rui Ueyama	03fe6a20b4	Use ELF{32,64}{LE,BE} instead of ELFType<{little,big}, {true,false}>. NFC. llvm-svn: 322342	2018-01-12 01:40:32 +00:00
Lang Hames	0857f0fe49	[ORC] Add a stub ExecutionSession and VModuleKey type. ExecutionSession will represent a running JIT program. VModuleKey is a unique key assigned to each module added as part of an ExecutionSession. The Layer concept will be updated in future to require a VModuleKey when a module is added. llvm-svn: 322336	2018-01-12 00:22:05 +00:00
Matthias Braun	63dd47f323	PeepholeOptimizer: Fix for vregs without defs The PeepholeOptimizer would fail for vregs without a definition. If this was caused by an undef operand abort to keep the code simple (so we don't need to add logic everywhere to replicate the undef flag). Differential Revision: https://reviews.llvm.org/D40763 llvm-svn: 322319	2018-01-11 22:30:43 +00:00
Rafael Espindola	3457994310	Make internal/private GVs implicitly dso_local. While updating clang tests for having clang set dso_local I noticed that: - There are a lot of tests to update. - Many of the updates are redundant. They are redundant because a GV is "obviously dso_local". This patch starts formalizing that a bit by requiring that internal and private GVs be dso_local too. Since they all are, we don't have to print dso_local to the textual representation, making it a bit more compact and easier to read. llvm-svn: 322317	2018-01-11 22:15:05 +00:00
Sam Clegg	f07600c58f	[WebAssemlby] MC: Don't write COMDAT symbols as global imports This was causing undefined references at link time in lld. Differential Revision: https://reviews.llvm.org/D41959 llvm-svn: 322309	2018-01-11 20:35:17 +00:00
Aaron Ballman	b6d4429479	Use size_t to represent the size of a StringMapEntry length and alignment rather than unsigned. Patch by Matt Davis. llvm-svn: 322305	2018-01-11 18:47:15 +00:00
Daniel Neilson	8093180999	[NFC] Abstract out source argument index in MemTransferInst. Summary: References to the source operand within class MemTransferInst are currently by a constant 1. Abstract this out into a named constant. llvm-svn: 322289	2018-01-11 16:28:32 +00:00
Igor Laevsky	f728d8a69f	[FuzzMutate] Avoid using swifterror as a source operand Differential Revision: https://reviews.llvm.org/D41107 llvm-svn: 322280	2018-01-11 14:43:05 +00:00
Dmitry Venikov	ad27a6a17a	[InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x) Summary: This patch enables folding sin(x) / cos(x) -> tan(x), cos(x) / sin(x) -> 1 / tan(x) under -ffast-math flag Reviewers: hfinkel, spatel Reviewed By: spatel Subscribers: andrew.w.kaylor, efriedma, scanon, llvm-commits Differential Revision: https://reviews.llvm.org/D41286 llvm-svn: 322255	2018-01-11 06:33:00 +00:00
Wolfgang Pieb	4698ad4a47	[DWARF][NFC] Overload AsmPrinter::emitDwarfStringOffsets() to take a DwarfStringPoolEntry record. Differential Revision: https://reviews.llvm.org/D41920 llvm-svn: 322250	2018-01-11 02:35:00 +00:00
Matt Morehouse	3c0bcb4da0	SmallVector: fix use-after-poison MSAN error in destructor Summary: Addresses issue: https://bugs.llvm.org/show_bug.cgi?id=34595 The topmost class, `SmallVector`, has internal storage for some elements; `N - 1` elements' bytes worth of space. Meanwhile a base class `SmallVectorTemplateCommon` has room for one element as well, totaling `N` elements' worth of space. The space for the N elements is contiguous and straddles `SmallVectorTemplateCommon` and `SmallVector`. A class "between" those two owning the storage, `SmallVectorImpl`, in its destructor, calls the destructor for elements contained in the vector, if any. It uses `destroy_range(begin, end)` and deletes all items in sequence, starting from the end. By the time the destructor for `SmallVectorImpl` is running, though, the memory for elements `[1, N)` is already poisoned, due to `SmallVector`'s destructor having done its thing already. So if the element type `T` has a nontrivial destructor that accesses any members of the `T` instance being destroyed, we'll run into a user-after-poison bug. This patch moves the destruction loop into `SmallVector`'s destructor, so any memory being accessed while dtors are running is not yet poisoned. Confirmed this broke before (and now works with this patch) with these compiler flags: -fsanitize=memory -fsanitize-memory-use-after-dtor -fsanitize-memory-track-origins and with the cmake flag `-DLLVM_USE_SANITIZER='MemoryWithOrigins;Undefined'` as well as `MSAN_OPTIONS=poison_in_dtor=1`. Patch By: elsteveogrande Reviewers: eugenis, morehouse, dblaikie Reviewed By: eugenis, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41916 llvm-svn: 322241	2018-01-10 23:53:11 +00:00
Matthias Braun	0e3f9cfb8b	Revert "AArch64: Fix emergency spillslot being out of reach for large callframes" Revert for now as the testcase is hitting a pre-existing verifier error that manifest as a failure when expensive checks are enabled (or -verify-machineinstrs) is used. This reverts commit r322200. llvm-svn: 322231	2018-01-10 22:36:28 +00:00
Matthias Braun	f103029d0d	LiveRangeEdit: Inline markDeadRemat() into only user; NFC This function was only called from a single place in which we didn't even need the `if (DeadRemats)` check. llvm-svn: 322230	2018-01-10 22:36:26 +00:00
Matthias Braun	798a2b9e65	LiveRangeEdit: Simplify code; NFC Simplify the code slightly: Instead of creating empty subranges in one case and immediately removing them, do not create them in the first place. llvm-svn: 322226	2018-01-10 21:41:02 +00:00
Craig Topper	8d3d87a0cc	[SelectionDAG][X86] Explicitly store the scale in the gather/scatter ISD nodes Currently we infer the scale at isel time by analyzing whether the base is a constant 0 or not. If it is we assume scale is 1, else we take it from the element size of the pass thru or stored value. This seems a little weird and I think it makes more sense to make it explicit in the DAG rather than doing tricky things in the backend. Most of this patch is just making sure we copy the scale around everywhere. Differential Revision: https://reviews.llvm.org/D40055 llvm-svn: 322210	2018-01-10 19:16:05 +00:00
Matthias Braun	63ad6b7f05	AArch64: Fix emergency spillslot being out of reach for large callframes Large callframes (calls with several hundreds or thousands or parameters) could lead to situations in which the emergency spillslot is out of range to be addressed relative to the stack pointer. This commit forces the use of a frame pointer in the presence of large callframes. This commit does several things: - Compute max callframe size at the end of instruction selection. - Add mirFileLoaded target callback. Use it to compute the max callframe size after loading a .mir file when the size wasn't specified in the file. - Let TargetFrameLowering::hasFP() return true if there exists a callframe > 255 bytes. - Always place the emergency spillslot close to FP if we have a frame pointer. - Note that `useFPForScavengingIndex()` would previously return false when a base pointer was available leading to the emergency spillslot getting allocated late (that's the whole effect of this callback). Which made no sense to me so I took this case out: Even though the emergency spillslot is technically not referenced by FP in this case we still want it allocated early. Differential Revision: https://reviews.llvm.org/D40876 llvm-svn: 322200	2018-01-10 18:16:24 +00:00
Simon Pilgrim	fff6ed4958	Fix -Wdocumentation warning by removing empty @brief. NFCI llvm-svn: 322186	2018-01-10 13:52:30 +00:00
Sander de Smalen	dee11fd6e7	[TableGen][AsmMatcherEmitter] Generate assembler checks for tied operands Summary: This extends TableGen's AsmMatcherEmitter with code that generates a table with tied-operand constraints. The constraints are checked when parsing the instruction. If an operand is not equal to its tied operand, the assembler will give an error. Patch [2/3] in a series to add operand constraint checks for SVE's predicated ADD/SUB. Reviewers: olista01, rengolin, mcrosier, fhahn, craig.topper, evandro, echristo Reviewed By: fhahn Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D41446 llvm-svn: 322166	2018-01-10 10:10:56 +00:00
Lang Hames	8e3463b662	[ORC] Incorporate Dave Blaikie's feedback on r319839. - Turn some member functions into free functions. - Avoid a redundant map lookup - Simplify a loop index llvm-svn: 322159	2018-01-10 04:01:44 +00:00
Sam Clegg	06d24e0d9c	[llvm-readobj] Consistent use of ScopedPrinter There were a few places where outs() was being used directly rather than the ScopedPrinter object. Differential Revision: https://reviews.llvm.org/D41370 llvm-svn: 322141	2018-01-10 00:14:19 +00:00
Lang Hames	b0f9fb8cbd	[ORC] Re-apply r321838 again with a workaround for a bug present in the libcxx version being used on some of the green dragon builders (plus a clang-format). Workaround: AsynchronousSymbolQuery and VSO want to work with JITEvaluatedSymbols anyway, so just use them (instead of JITSymbol, which happens to tickle the bug). The libcxx bug being worked around was fixed in r276003, and there are plans to update the offending builders. llvm-svn: 322140	2018-01-10 00:09:38 +00:00
Adrian McCarthy	de2d0196a2	Reland "Emit Function IDs table for Control Flow Guard" Adds option /guard:cf to clang-cl and -cfguard to cc1 to emit function IDs of functions that have their address taken into a section named .gfids$y for compatibility with Microsoft's Control Flow Guard feature. The original patch didn't have the lit.local.cfg file that restricts the new test to x86, thus the new test was failing on the non-x86 bots. Differential Revision: https://reviews.llvm.org/D40531 The reverts r322008, which was a revert of r322005. This reverts commit a05b89f9aca70597dc79fe97bc49b50b51f525ba. llvm-svn: 322136	2018-01-09 23:49:30 +00:00
Sam Clegg	9c22504bad	[WebAssembly] Add COMDAT support This adds COMDAT support to the Wasm object-file format. Spec: https://github.com/WebAssembly/tool-conventions/pull/31 Corresponding LLD change: https://bugs.llvm.org/show_bug.cgi?id=35533, and D40845 Patch by Nicholas Wilson Differential Revision: https://reviews.llvm.org/D40844 llvm-svn: 322135	2018-01-09 23:43:14 +00:00
Paul Robinson	bfc554dace	[DWARFv5] MC support for MD5 file checksums Extend .file directive syntax to allow specifying an MD5 checksum for the source file. Emit the checksums in DWARF v5 line tables. llvm-svn: 322134	2018-01-09 23:31:48 +00:00
Rafael Espindola	a45d438e5a	Use a MCExpr for the size of MCFillFragment. This allows the size to be found during ralaxation. This fixes pr35858. llvm-svn: 322131	2018-01-09 22:48:37 +00:00
David Blaikie	1862f3b288	ADT: Add a range-based version of std::copy llvm-svn: 322127	2018-01-09 22:13:56 +00:00
Chris Bieneman	152dec707a	[IPSCCP] Remove calls without side effects Summary: When performing constant propagation for call instructions we have historically replaced all uses of the return from a call, but not removed the call itself. This is required for correctness if the calls have side effects, however the compiler should be able to safely remove calls that don't have side effects. This allows the compiler to completely fold away calls to functions that have no side effects if the inputs are constant and the output can be determined at compile time. Reviewers: davide, sanjoy, bruno, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38856 llvm-svn: 322125	2018-01-09 21:58:46 +00:00
Sam Clegg	9ebc8a13dc	[WebAssembly] Explicitly specify function/global index space in YAML These indexes are useful because they are not always zero based and functions and globals are referenced elsewhere by their index. This matches what we already do for the type index space. Differential Revision: https://reviews.llvm.org/D41877 llvm-svn: 322121	2018-01-09 21:38:53 +00:00
Rui Ueyama	2eec2bea38	[COFF] Process /EXPORT option in fastpath Patch by Takuto Ikuta. This patch reduces lld link time of chromium's blink_core.dll in component build. Total size of input argument in .directives become nearly 300MB in the build and almost all its content are /EXPORT. To reduce time of parsing too many /EXPORT option in the build, I introduce fastpath for /EXPORT in ArgParser::parseDirectives. On my desktop machine, 4 times stats of the link time are like below. Improved around 20%. This patch TotalSeconds : 8.6217627 TotalSeconds : 8.5402175 TotalSeconds : 8.6855853 TotalSeconds : 8.3624441 Ave : 8.5525024 master TotalSeconds : 10.9975031 TotalSeconds : 11.3409428 TotalSeconds : 10.6332897 TotalSeconds : 10.7650687 Ave : 10.934201075 llvm-svn: 322117	2018-01-09 20:36:42 +00:00
Rafael Espindola	07ecf146a7	Don't duplicate names in comments. NFC. llvm-svn: 322113	2018-01-09 20:02:35 +00:00
Rafael Espindola	8c62496ec5	Inline a emitFill variant that is only used once. NFC. llvm-svn: 322111	2018-01-09 19:50:29 +00:00
Easwaran Raman	f04207e3b2	Add a pass to generate synthetic function entry counts. Summary: This pass synthesizes function entry counts by traversing the callgraph and using the relative block frequencies of the callsites. The intended use of these counts is in inlining to determine hot/cold callsites in the absence of profile information. The pass is split into two files with the code that propagates the counts in a callgraph in a Utils file. I plan to add support for propagation in the thinlto link phase and the propagation code will be shared and hence this split. I did not add support to the old PM since hot callsite determination in inlining is not possible in old PM (although we could use hot callee heuristic with synthetic counts in the old PM it is not worth the effort tuning it) Reviewers: davidxl, silvas Subscribers: mgorny, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D41604 llvm-svn: 322110	2018-01-09 19:39:35 +00:00
Rafael Espindola	0f0fe4383d	Make one of the emitFill methods non virtual. NFC. This is just preparatory work to fix PR35858. llvm-svn: 322108	2018-01-09 19:29:33 +00:00
Craig Topper	b38fcb7fd8	[lli] Make lli support -mcpu=native for CPU autodetection llc, opt, and clang can all autodetect the CPU and supported features. lli cannot as far as I could tell. This patch uses the getCPUStr() and introduces a new getCPUFeatureList() and uses those in lli in place of MCPU and MAttrs. Ideally, we would merge getCPUFeatureList and getCPUFeatureStr, but opt and llc need a string and lli wanted a list. Maybe we should just return the SubtargetFeature object and let the caller decide what it needs? Differential Revision: https://reviews.llvm.org/D41833 llvm-svn: 322100	2018-01-09 18:14:18 +00:00
Pavel Labath	198e4bdc0a	[Support] Add WritableMemoryBuffer::getNewMemBuffer Summary: The idea is that it would replace (non-Writable)MemoryBuffer::getNewMemBuffer, which is quite useless unless you const_cast its contents to write to it (which all (both) callers of this function were doing). This patch also fixes one of the usages in COFFWriter. After fixing the other usage in clang, I plan to delete the old function. Reviewers: dblaikie, Bigcheese Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41540 llvm-svn: 322094	2018-01-09 17:26:06 +00:00
Sanjay Patel	e515369ba1	[SelectionDAG] lower math intrinsics to finite version of libcalls when possible (PR35672) Ingredients in this patch: 1. Add HANDLE_LIBCALL defs for finite mathlib functions that correspond to LLVM intrinsics. 2. Plumbing to send TargetLibraryInfo down to SelectionDAGLegalize. 3. Relaxed math and library checking in SelectionDAGLegalize::ConvertNodeToLibcall() to choose finite libcalls. There was a bug about determining the availability of the finite calls that should be fixed with: rL322010 Not in this patch: This doesn't resolve the question/bug of clang creating the intrinsic IR in the first place. There's likely follow-up work needed to support the long double variants better. There's room for improvement to reduce the code duplication. Create finite calls that don't originate from a corresponding intrinsic or DAG node? Differential Revision: https://reviews.llvm.org/D41338 llvm-svn: 322087	2018-01-09 15:41:00 +00:00
Craig Topper	63aae39c34	[X86] Remove llvm.x86.avx512.cvt2mask. intrinsics and autoupgrade to (icmp slt X, 0) I had to drop fast-isel-abort from a test because we can't fast isel some of the mask stuff. When we used intrinsics we implicitly fell back to SelectionDAG for the intrinsic call without triggering the abort error. But with native IR that doesn't happen the same way. llvm-svn: 322050	2018-01-09 00:50:47 +00:00
Jessica Paquette	3075bb4222	[MachineOutliner] AArch64: Handle instrs that use SP and will never need fixups This commit does two things. Firstly, it adds a collection of flags which can be passed along to the target to encode information about the MBB that an instruction lives in to the outliner. Second, it adds some of those flags to the AArch64 outliner in order to add more stack instructions to the list of legal instructions that are handled by the outliner. The two flags added check if - There are calls in the MachineBasicBlock containing the instruction - The link register is available in the entire block If the link register is available and there are no calls, then a stack instruction can always be outlined without fixups, regardless of what it is, since in this case, the outliner will never modify the stack to create a call or outlined frame. The motivation for doing this was checking which instructions are most often missed by the outliner. Instructions like, say %sp<def> = ADDXri %sp, 32, 0; flags: FrameDestroy are very common, but cannot be outlined in the case that the outliner might modify the stack. This commit allows us to outline instructions like this. llvm-svn: 322048	2018-01-09 00:26:18 +00:00
Craig Topper	ff763aa60e	[X86] Remove GCCBuiltin from int_x86_avx512_cvtb2mask_128 and similar intrinsics. I'm going to convert these to 'icmp slt X, zeroinitializer' in clang's CGBuiltin.cpp, but the GCCBuiltin names need to be removed to do that. llvm-svn: 322037	2018-01-08 22:37:49 +00:00
Justin Bogner	c95467b366	AlwaysInliner: Alow setting InsertLifetime in the new-style pass llvm-svn: 322033	2018-01-08 22:07:42 +00:00
Justin Bogner	e9bc465e2c	ArgPromotion: Allow setting MaxElements in the new-style pass llvm-svn: 322025	2018-01-08 21:13:35 +00:00
Simon Pilgrim	005dbb9707	Fixed spelling mistake. NFCI. llvm-svn: 322009	2018-01-08 17:16:59 +00:00
Adrian McCarthy	77150ad708	Revert "Emit Function IDs table for Control Flow Guard" The new test fails on the Hexagon bot. Reverting while I investigate. This reverts https://reviews.llvm.org/rL322005 This reverts commit b7e0026b4385180c378edc658ec91a39566f2942. llvm-svn: 322008	2018-01-08 17:12:01 +00:00
Adrian McCarthy	1914213a11	Emit Function IDs table for Control Flow Guard Adds option /guard:cf to clang-cl and -cfguard to cc1 to emit function IDs of functions that have their address taken into a section named .gfids$y for compatibility with Microsoft's Control Flow Guard feature. Differential Revision: https://reviews.llvm.org/D40531 llvm-svn: 322005	2018-01-08 16:33:42 +00:00
Nirav Dave	d9a55e3d2b	[DAG] Teach BaseIndexOffset to correctly handle with indexed operations BaseIndexOffset address analysis incorrectly ignores offsets folded into indexed memory operations causing potential errors in alias analysis of pre-indexed operations. Reviewers: efriedma, RKSimon, hfinkel, jyknight Subscribers: hiraditya, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D41701 llvm-svn: 322003	2018-01-08 16:21:35 +00:00
Alexey Bataev	d73719cbbe	[SLP] Fix PR35777: Incorrect handling of aggregate values. Summary: Fixes the bug with incorrect handling of InsertValue\|InsertElement instrucions in SLP vectorizer. Currently, we may use incorrect ExtractElement instructions as the operands of the original InsertValue\|InsertElement instructions. Reviewers: mkuper, hfinkel, RKSimon, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41767 llvm-svn: 321994	2018-01-08 14:43:06 +00:00
Lang Hames	70682b175b	[ORC] Remove AsynchronousSymbolQuery while I debug an issue on one of the builders. llvm-svn: 321941	2018-01-06 20:14:22 +00:00
Lang Hames	bf29e2d4a9	[ORC] Fix the counter type on SymbolStringPool entries. Hopefully this will fix the build failure in http://lab.llvm.org:8011/builders/llvm-mips-linux/builds/3417 llvm-svn: 321926	2018-01-06 05:19:06 +00:00
Vedant Kumar	740b9e13ef	[Utils] Simplify salvageDebugInfo, NFCI Having a single call to findDbgUsers() allows salvageDebugInfo() to return earlier. Differential Revision: https://reviews.llvm.org/D41787 llvm-svn: 321915	2018-01-05 23:27:02 +00:00
Lang Hames	fe7624b772	[ORC] Re-apply just the AsynchronousSymbolLookup class from r321838 while I investigate builder / test failures. llvm-svn: 321910	2018-01-05 22:50:43 +00:00
Bjorn Pettersson	7bfe1ca83a	[DebugInfo] Align comments in debug_loc section Summary: This commit updates the BufferByteStreamer, used by DebugLocStream to buffer bytes/comments to put in the debug_loc section, to make sure that the Buffer and Comments vectors are synced. Previously, when an SLEB128 or ULEB128 was emitted together with a comment, the vectors could be out-of-sync if the LEB encoding added several entries to the Buffer vectors, while we only added a single entry to the Comments vector. The goal with this is to get the comments in the debug_loc section in the .s file correctly aligned. Example (using ARM as target): Instead of .byte 144 @ sub-register DW_OP_regx .byte 128 @ 256 .byte 2 @ DW_OP_piece .byte 147 @ 8 .byte 8 @ sub-register DW_OP_regx .byte 144 @ 257 .byte 129 @ DW_OP_piece .byte 2 @ 8 .byte 147 @ .byte 8 @ we now get .byte 144 @ sub-register DW_OP_regx .byte 128 @ 256 .byte 2 @ .byte 147 @ DW_OP_piece .byte 8 @ 8 .byte 144 @ sub-register DW_OP_regx .byte 129 @ 257 .byte 2 @ .byte 147 @ DW_OP_piece .byte 8 @ 8 Reviewers: JDevlieghere, rnk, aprantl Reviewed By: aprantl Subscribers: davide, Ka-Ka, uabelho, aemerson, javed.absar, kristof.beyls, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D41763 llvm-svn: 321907	2018-01-05 22:20:30 +00:00
Reid Kleckner	20c2d2b3b9	Fix -Wsign-compare warnings on Windows These arise because enums are 'int' by default. llvm-svn: 321887	2018-01-05 19:53:51 +00:00
Zachary Turner	8f4cf9cdb7	[MSF] Fix FPM interval calcluation We have some code to try to determine how many pieces an MSF Free Page Map is split into, and this code had an off by one error which would cause the calculation to be incorrect when there were exactly 4096*k + 1 blocks in an MSF file. Original investigation and patch outline by Colden Cullen. Differential Revision: https://reviews.llvm.org/D41742 llvm-svn: 321880	2018-01-05 18:12:14 +00:00

1 2 3 4 5 ...

33390 Commits