llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 06:22:56 +02:00

Author	SHA1	Message	Date
Jingyue Wu	537669aad8	[SLSR] handle candidate form &B[i * S] Summary: This patch enhances SLSR to handle another candidate form &B[i * S]. If we found two candidates S1: X = &B[i * S] S2: Y = &B[i' * S] and S1 dominates S2, we can replace S2 with Y = &X[(i' - i) * S] Test Plan: slsr-gep.ll X86/no-slsr.ll: verify that we do not run SLSR on GEPs that already fit into an addressing mode Reviewers: eliben, atrick, meheff, hfinkel Reviewed By: hfinkel Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D7459 llvm-svn: 233286	2015-03-26 16:49:24 +00:00
Aaron Ballman	b816aeb026	Sometimes report_fatal_error is called when there is not a handler function used to fail gracefully. In that case, RunInterruptHandlers is called, which attempts to enter a critical section object. Ensure that the critical section is properly initialized so that this code functions properly, and tools like clang-tidy do not crash in Debug builds. llvm-svn: 233282	2015-03-26 16:24:38 +00:00
Toma Tabacu	6480ac9680	[mips] Move the setATReg definition inside the MipsAssemblerOptions class. NFC. Summary: This groups all of the MipsAssemblerOptions functionality together, making it more reader-friendly. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8445 llvm-svn: 233271	2015-03-26 13:08:55 +00:00
Andrea Di Biagio	e2ef12f536	[X86][FastIsel] Teach how to select vector load instructions. This patch teaches fast-isel how to select 128-bit vector load instructions. Added test CodeGen/X86/fast-isel-vecload.ll Differential Revision: http://reviews.llvm.org/D8605 llvm-svn: 233270	2015-03-26 11:29:02 +00:00
Duncan P. N. Exon Smith	d904d63dcd	Revert "Linker: Drop function pointers for overridden subprograms" This reverts commit r233164 and its testcase follow-ups in r233165, r233207, r233214, and r233221. It apparently unleashed an LTO bootstrap failure, at least on Darwin: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_build/3376/ I'm reproducing now. llvm-svn: 233254	2015-03-26 05:27:45 +00:00
Quentin Colombet	f4be6ae977	[RegisterCoalescer] Add a rule to consider more profitable copies first when those are in the same basic block. The previous approach was the topological order of the basic block. By default this rule is disabled. Related to PR22768. llvm-svn: 233241	2015-03-26 01:01:48 +00:00
Eric Christopher	dfc6a65e21	Add computeFSAdditions to the function based subtarget creation for PPC due to some unfortunate default setting via TargetMachine creation. I've added a FIXME on how this can be unraveled in the backend and a test to make sure we successfully legalize 64-bit things if we say we're 64-bits. llvm-svn: 233239	2015-03-26 00:50:23 +00:00
Nico Weber	de9ddb83e2	Fix typo in comment. llvm-svn: 233226	2015-03-25 22:34:16 +00:00
Sanjoy Das	40f3beb387	[ValueTracking] Fix PR23011. Summary: `ComputeNumSignBits` returns incorrect results for `srem` instructions. This change fixes the issue and adds a test case. Reviewers: nadav, nicholas, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8600 llvm-svn: 233225	2015-03-25 22:33:53 +00:00
Simon Pilgrim	2751d2eb79	[DAGCombiner] Add support for TRUNCATE + FP_EXTEND vector constant folding This patch adds supports for the vector constant folding of TRUNCATE and FP_EXTEND instructions and tidies up the SINT_TO_FP and UINT_TO_FP instructions to match. It also moves the vector constant folding for the FNEG and FABS instructions to use the DAG.getNode() functionality like the other unary instructions. Differential Revision: http://reviews.llvm.org/D8593 llvm-svn: 233224	2015-03-25 22:30:31 +00:00
Andrew Kaylor	8d68022307	Fix remaining MSVC warning llvm-svn: 233220	2015-03-25 21:33:24 +00:00
Matthias Braun	72b82f2894	RegisterCoalescer: Fix implicit def handling in register coalescer If liveranges induced by an IMPLICIT_DEF get completely covered by a proper liverange the IMPLICIT_DEF instructions and its corresponding definitions have to be removed from the live ranges. This has to happen in the subregister live ranges as well (I didn't see this case earlier because in most programs only some subregisters are covered and the IMPLCIT_DEF won't get removed). No testcase, I spent hours trying to create one for one of the public targets, but ultimately failed because I couldn't manage to properly control the placement of COPY and IMPLICIT_DEF instructions from an .ll file. llvm-svn: 233217	2015-03-25 21:18:24 +00:00
Matthias Braun	98bf0fa094	MachineVerifier: slightly simplify code that is only called with vregs llvm-svn: 233216	2015-03-25 21:18:22 +00:00
Krzysztof Parzyszek	5a3d37974f	Revert r233206 llvm-svn: 233213	2015-03-25 20:21:16 +00:00
Reid Kleckner	a894d59f4a	WinEH: Create an unwind help alloca for __CxxFrameHandler3 xdata tables We don't have any logic to emit those tables yet, so the sdag lowering of this intrinsic is just a stub. We can see the intrinsic in the prepared IR, though. llvm-svn: 233209	2015-03-25 20:10:36 +00:00
Krzysztof Parzyszek	9173594bab	[Hexagon] Keep the bare getSubtargetImpl for now llvm-svn: 233206	2015-03-25 19:51:52 +00:00
Kit Barton	d0dd6e5750	Add Hardware Transactional Memory (HTM) Support This patch adds Hardware Transaction Memory (HTM) support supported by ISA 2.07 (POWER8). The intrinsic support is based on GCC one [1], but currently only the 'PowerPC HTM Low Level Built-in Function' are implemented. The HTM instructions follows the RC ones and the transaction initiation result is set on RC0 (with exception of tcheck). Currently approach is to create a register copy from CR0 to GPR and comapring. Although this is suboptimal, since the branch could be taken directly by comparing the CR0 value, it generates code correctly on both test and branch and just return value. A possible future optimization could be elimitate the MFCR instruction to branch directly. The HTM usage requires a recently newer kernel with PPC HTM enabled. Tested on powerpc64 and powerpc64le. This is send along a clang patch to enabled the builtins and option switch. [1] https://gcc.gnu.org/onlinedocs/gcc/PowerPC-Hardware-Transactional-Memory-Built-in-Functions.html Phabricator Review: http://reviews.llvm.org/D8247 llvm-svn: 233204	2015-03-25 19:36:23 +00:00
Rafael Espindola	9448b64d9c	clang-format bits of code to make another patch readable. llvm-svn: 233203	2015-03-25 19:24:39 +00:00
Peter Collingbourne	4fbf146280	DebugInfo: Permit DW_TAG_structure_type, DW_TAG_member, DW_TAG_typedef tags with empty file names. Some languages, such as Go, have pre-defined structure types (e.g. "string" is essentially a pointer/length pair) or pre-defined "typedef" types (e.g. "error" is essentially a typedef for a specific interface type). Such types do not have associated source location, so a Go frontend would be correct not to associate a file name with such types. This change relaxes the DIType verifier to permit unlocated types with these tags. Differential Revision: http://reviews.llvm.org/D8588 llvm-svn: 233200	2015-03-25 17:44:49 +00:00
Sanjay Patel	57d34fb183	[X86, AVX] improve insertion into zero element of 256-bit vector This patch allows AVX blend instructions to handle insertion into the low element of a 256-bit vector for the appropriate data types. For f32, instead of: vblendps $1, %xmm1, %xmm0, %xmm1 ## xmm1 = xmm1[0],xmm0[1,2,3] vblendps $15, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0,1,2,3],ymm0[4,5,6,7] we get: vblendps $1, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0],ymm0[1,2,3,4,5,6,7] For f64, instead of: vmovsd %xmm1, %xmm0, %xmm1 ## xmm1 = xmm1[0],xmm0[1] vblendpd $3, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0,1],ymm0[2,3] we get: vblendpd $1, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0],ymm0[1,2,3] For the hardware-neglected integer data types, I left a TODO comment in the code and added regression tests for a follow-on patch. Differential Revision: http://reviews.llvm.org/D8609 llvm-svn: 233199	2015-03-25 17:36:01 +00:00
Benjamin Kramer	3deba1d2df	[APInt] Add an isSplat helper and use it in some places. To complement getSplat. This is more general than the binary decomposition method as it also handles non-pow2 splat sizes. llvm-svn: 233195	2015-03-25 16:49:59 +00:00
Benjamin Kramer	2ed53cd9d5	[Hexagon] Pattern match a CTZ loop into a call to countTrailingZeros. No functional change intended. llvm-svn: 233192	2015-03-25 15:36:57 +00:00
Benjamin Kramer	6c5bd093e6	[ARM] Rewrite .save/.vsave emission with bit math Hopefully makes it a bit easier to understand what's going on. No functional change intended. llvm-svn: 233191	2015-03-25 15:27:58 +00:00
Rafael Espindola	bfef1fc433	Fix fixup evaluation when deciding what to relocate with. The previous logic was to first try without relocations at all and failing that stop on the first defined symbol. That was inefficient and incorrect in the case part of the expression could be simplified and another part could not (see included test). We now stop the evaluation when we get to a variable whose value can change (i.e. is weak). llvm-svn: 233187	2015-03-25 13:16:53 +00:00
Andrea Di Biagio	713ffecfc7	[optnone] Skip pass Float2Int on optnone functions. Added test Float2Int/float2int-optnone.ll to verify that pass Float2Int is not run on optnone functions. llvm-svn: 233183	2015-03-25 12:22:37 +00:00
James Molloy	17b6105997	Reapply r233062: "float2int": Add a new pass to demote from float to int where possible. Now with a fix for PR23008 and extra regression test. llvm-svn: 233175	2015-03-25 10:03:42 +00:00
Craig Topper	eb82fb3203	[X86] Remove GetCpuIDAndInfo, GetCpuIDAndInfoEx and DetectFamilyModel functions from X86 MC layer. They haven't been used since CPU autodetection was removed from X86Subtarget.cpp. llvm-svn: 233170	2015-03-25 04:16:50 +00:00
Lang Hames	0568b3b823	[Orc] Refactor JITCompileCallbackManagerBase and CompileOnDemandLayer to support target-independent callback management. This is a prerequisite for adding orc-based lazy-jitting to lli. llvm-svn: 233166	2015-03-25 02:45:50 +00:00
Duncan P. N. Exon Smith	c1b7b4cc37	Linker: Drop function pointers for overridden subprograms Instead of dropping subprograms that have been overridden, just set their function pointers to `nullptr`. This is a minor adjustment to the stop-gap fix for PR21910 committed in r224487, and fixes the crasher from PR22792. The problem that r224487 put a band-aid on: how do we find the canonical subprogram for a `Function`? Since the backend currently relies on `DebugInfoFinder` (which does a naive in-order traversal of compile units and picks the first subprogram) for this, r224487 tried dropping non-canonical subprograms. Dropping subprograms fails because the backend also builds up a map from subprogram to compile unit (`DwarfDebug::SPMap`) based on the subprogram lists. A missing subprogram causes segfaults later when an inlined reference (such as in this testcase) is created. Instead, just drop the `Function` pointer to `nullptr`, which nicely mirrors what happens when an already-inlined `Function` is optimized out. We can't really be sure that it's the same definition anyway, as the testcase demonstrates. This still isn't completely satisfactory. Two flaws at least that I can think of: - I still haven't found a straightforward way to make this symmetric in the IR. (Interestingly, the DWARF output is already symmetric, and I've tested for that to be sure we don't regress.) - Using `DebugInfoFinder` to find the canonical subprogram for a function is kind of crazy. We should just attach metadata to the function, like this: define weak i32 @foo(i32, i32) !dbg !MDSubprogram(...) { llvm-svn: 233164	2015-03-25 02:26:32 +00:00
Rafael Espindola	249b878ed5	Fix warning on non-assert build. llvm-svn: 233158	2015-03-25 00:45:41 +00:00
Rafael Espindola	b167831202	Produce an error instead of asserting on invalid .sleb128/.uleb128. llvm-svn: 233155	2015-03-25 00:25:37 +00:00
Paul Robinson	ef36d53059	'optnone' should not disable DAG combiner. Reverts the code change from r221168 and the relevant test. It was a mistake to disable the combiner, and based on the ultimate definition of 'optnone' we shouldn't have considered the test case as failing in the first place. llvm-svn: 233153	2015-03-25 00:10:24 +00:00
Philip Reames	63d3545e07	!invariant.load semantics with potentially clobbering calls A load from an invariant location is assumed to not alias any otherwise potentially aliasing stores. Our implementation only applied this rule to store instructions themselves whereas they it should apply for any memory accessing instruction. This results in both FRE and PRE becoming more effective at eliminating invariant loads. Note that as a follow on change I will likely move this into AliasAnalysis itself. That's where the TBAA constant flag is handled and the semantics are essentially the same. I'd like to separate the semantic change from the refactoring and thus have extended the hack that's already in MemoryDependenceAnalysis for this change. Differential Revision: http://reviews.llvm.org/D8591 llvm-svn: 233140	2015-03-24 23:54:54 +00:00
Rafael Espindola	0f74449354	Don't be over eager in evaluating a subtraction with a weak symbol. In a subtraction of the form A - B, if B is weak, there is no way to represent that on ELF since all relocations add the value of a symbol. llvm-svn: 233139	2015-03-24 23:48:44 +00:00
Reid Kleckner	b3c593a951	X86: Fix frameescape when not using an FP We can't use TargetFrameLowering::getFrameIndexOffset directly, because Win64 really wants the offset from the stack pointer at the end of the prologue. Instead, use X86FrameLowering::getFrameIndexOffsetFromSP(), which is a pretty close approximiation of that. It fails to handle cases with interestingly large stack alignments, which is pretty uncommon on Win64 and is TODO. llvm-svn: 233137	2015-03-24 23:46:01 +00:00
Andrew Kaylor	614de5f815	Disabling warnings for MSVC build to enable /W4 use. Differential Revision: http://reviews.llvm.org/D8572 llvm-svn: 233133	2015-03-24 23:37:10 +00:00
David Blaikie	27c176e356	Opaque Pointer Types: GEP API migrations to specify the gep type explicitly The changes to InstCombine (& SCEV) do seem a bit silly - it doesn't make anything obviously better to have the caller access the pointers element type (the thing I'm trying to remove) than the GEP itself, but it's a helpful migration step. This will allow me to more obviously lock down GEP (& Load, etc) API usage, then fix all the code that accesses pointer element types except the places that need to be removed (most of the InstCombines) anyway - at which point I'll need to just remove all that code because it won't be meaningful anymore (there will be no pointer types, so no bitcasts to combine) SCEV looks like it'll need some restructuring - we'll have to do a bit more work for GEP canonicalization, since it'll depend on how it's used if we can even manage to canonicalize it to a non-ugly GEP. I guess we can do some fun stuff like voting (do 2 out of 3 load from the GEP with a certain type that gives a pretty GEP? Does every typed use of the GEP use either a specific type or a generic type (i8*, etc)?) llvm-svn: 233131	2015-03-24 23:34:31 +00:00
Sanjay Patel	1a17022bda	optimize the AVX2 (integer) version of vperm2 into a shuffle ...because this is what happens when an instruction set puts its underwear on after its pants. This is an extension of r232852, r233100, and 233110: http://llvm.org/viewvc/llvm-project?view=revision&revision=232852 http://llvm.org/viewvc/llvm-project?view=revision&revision=233100 http://llvm.org/viewvc/llvm-project?view=revision&revision=233110 llvm-svn: 233127	2015-03-24 22:39:29 +00:00
David Blaikie	0dc1532dd9	Opaque Pointer Types: GEP API migrations to specify the gep type explicitly The changes to InstCombine do seem a bit silly - it doesn't make anything obviously better to have the caller access the pointers element type (the thing I'm trying to remove) than the GEP itself, but it's a helpful migration step. This will allow me to more obviously lock down GEP (& Load, etc) API usage, then fix all the code that accesses pointer element types except the places that need to be removed (most of the InstCombines) anyway - at which point I'll need to just remove all that code because it won't be meaningful anymore (there will be no pointer types, so no bitcasts to combine) llvm-svn: 233126	2015-03-24 22:38:16 +00:00
Philip Reames	d57b6c2219	Merge empty landing pads in SimplifyCFG This patch tries to merge duplicate landing pads when they branch to a common shared target. Given IR that looks like this: lpad1: %exn = landingpad {i8, i32} personality i32 (...) @__gxx_personality_v0 cleanup br label %shared_resume lpad2: %exn2 = landingpad {i8, i32} personality i32 (...) @__gxx_personality_v0 cleanup br label %shared_resume shared_resume: call void @fn() ret void } We can rewrite the users of both landing pad blocks to use one of them. This will generally allow the shared_resume block to be merged with the common landing pad as well. Without this change, tail duplication would likely kick in - creating N (2 in this case) copies of the shared_resume basic block. Differential Revision: http://reviews.llvm.org/D8297 llvm-svn: 233125	2015-03-24 22:28:45 +00:00
David Blaikie	8b46a1bcb2	Revert "Remove an InstCombine that seems to have become redundant." Assertion fires in compiler-rt. Guess it does fire.. This reverts commit r233116. llvm-svn: 233121	2015-03-24 21:50:35 +00:00
Rafael Espindola	4d759582ac	Reset the CFA offset at the start of every FDE. This fixes PR21515. llvm-svn: 233120	2015-03-24 21:47:31 +00:00
Peter Collingbourne	c55866b5f0	AArch64: use a different means to determine whether to byte swap relocations. This code depended on a bug in the FindAssociatedSection function that would cause it to return the wrong result for certain absolute expressions. Instead, use EvaluateAsRelocatable. llvm-svn: 233119	2015-03-24 21:47:03 +00:00
David Blaikie	0914ef9c6b	Remove an InstCombine that seems to have become redundant. Assert that this doesn't fire - I'll remove all of this later, but just leaving it in for a while in case this is firing & we just don't have test coverage. llvm-svn: 233116	2015-03-24 21:31:31 +00:00
Sanjay Patel	c077161b46	[X86, AVX] instcombine vperm2 intrinsics with zero inputs into shuffles This is the IR optimizer follow-on patch for D8563: the x86 backend patch that converts this kind of shuffle back into a vperm2. This is also a continuation of the transform that started in D8486. In that patch, Andrea suggested that we could convert vperm2 intrinsics that use zero masks into a single shuffle. This is an implementation of that suggestion. Differential Revision: http://reviews.llvm.org/D8567 llvm-svn: 233110	2015-03-24 20:36:42 +00:00
Hans Wennborg	fbd19841f0	Revert r233062 ""float2int": Add a new pass to demote from float to int where possible." This caused PR23008, compiles failing with: "Use still stuck around after Def is destroyed: %.sroa.speculated" Also reverting follow-up r233064. llvm-svn: 233105	2015-03-24 20:07:08 +00:00
Sanjoy Das	5e0d47a08c	[IRCE] Fix how IRCE checks for no-sign-overflow. IRCE requires the induction variables it handles to not sign-overflow. The current scheme of checking if sext({X,+,S}) == {sext(X),+,sext(S)} fails when SCEV simplifies sext(X) too. After this change we //also// check no-signed-wrap by looking at the flags set on the SCEVAddRecExpr. llvm-svn: 233102	2015-03-24 19:29:22 +00:00
Sanjoy Das	867451d1ec	[IRCE] Fix a regression introduced in r232444. IRCE should not try to eliminate range checks that check an induction variable against a loop-varying length. llvm-svn: 233101	2015-03-24 19:29:18 +00:00
Sanjay Patel	b1b1054a09	[X86, AVX] recognize shufflevector with zero input as a vperm2 (PR22984) vperm2x128 instructions have the special ability (aka free hardware capability) to shuffle zero values into a vector. This patch recognizes that type of shuffle and generates the appropriate control byte. https://llvm.org/bugs/show_bug.cgi?id=22984 Differential Revision: http://reviews.llvm.org/D8563 llvm-svn: 233100	2015-03-24 19:19:07 +00:00
Duncan P. N. Exon Smith	f3cc216d43	Verifier: Start recursing into !dbg attachments The main verifier already recurses through the other entry points, so we might as well descend here too. This temporarily duplicates some work already done in `verifyDebugInfo()`, but eventually I'll be removing the other side. llvm-svn: 233095	2015-03-24 17:32:19 +00:00

1 2 3 4 5 ...

78219 Commits